30 open-source projects similar to anomalyco/models.dev, ranked by how many features they have in common. Compare stars, activity and what each one does to find the best Models.dev alternative.
This project is a command-line utility designed to monitor and analyze token consumption and financial expenditure for AI coding assistants. By parsing local session logs directly on the user's machine, it provides a privacy-focused way to track development activity without transmitting sensitive data to external servers. The tool distinguishes itself through its ability to aggregate disparate log formats from multiple coding assistants into a unified, schema-agnostic representation. It features a decoupled pricing engine that allows users to apply custom model-specific cost multipliers, over
ruby_llm is an LLM integration framework and AI agent orchestrator designed to connect applications to multiple large language model providers through a unified interface. It serves as a toolkit for building autonomous assistants with custom personas, managing structured output via JSON schemas, and implementing vector embedding engines for semantic search. The project distinguishes itself as an observability suite and multimodal toolkit. It provides specialized capabilities for tracking token usage, calculating model costs, and tracing workflows via OpenTelemetry, while supporting the proces
OmniRoute is a unified LLM API gateway that connects multiple AI providers to a single endpoint. Its primary purpose is to simplify the integration of various AI models into tools and agents by translating different provider formats into a standardized API. The project distinguishes itself through a multi-strategy request routing system that optimizes for cost, speed, and availability, including automatic model fallbacks and a circuit-breaker resilience model to isolate provider failures. It employs a local-first security posture, using AES-256-GCM encryption to store API keys and conversatio
This project is a community-driven repository that serves as a directory for artificial intelligence providers offering free usage tiers and trial credits for large language model inference. It functions as a resource for developers to discover and integrate external AI services into applications while minimizing initial infrastructure costs. The repository provides structured metadata that enables developers to track request constraints, token limits, and rate requirements across multiple providers. By utilizing standardized data structures and declarative configuration, it assists in managi
llmware is a Python framework for AI agent orchestration and model management, designed to coordinate multi-model workflows and autonomous agents. It provides a unified model catalog and standardized interface to execute specialized language models for complex research, analysis, and structured data generation. The project distinguishes itself through its heavy emphasis on local execution and quantized inference, allowing models to run on private infrastructure using CPU, GPU, and NPU acceleration via runtimes like ONNX and OpenVino. It features a specialized ability to translate natural lang
LLMLingua is a prompt compression tool that reduces token count in prompts before they are sent to a large language model, cutting API costs and latency while preserving task performance. It operates as an extractive pipeline using a BERT-level Transformer encoder to classify each token for removal based on full bidirectional context from the prompt, retaining only key information and discarding non-essential tokens. The tool is trained through a knowledge distillation process, where a compact compression model learns from an extractive dataset derived from a large language model's output to
Paseo is an LLM coding agent orchestrator and multi-agent workflow manager designed to coordinate multiple AI agents across isolated git worktrees. It provides a unified control interface for managing these agents and their associated environments to execute complex programming tasks. The system distinguishes itself through a remote agent daemon that enables secure access to local coding agents via encrypted relays. It employs a git worktree environment manager to isolate parallel tasks into dedicated directories and branch-based server URLs, preventing file collisions and network port confli
ClawWork is a suite of tools designed to monitor agent finances, provide isolated execution environments, simulate economic behaviors, and benchmark performance. It functions as an autonomous agent sandbox where AI agents can run code and generate professional business deliverables. The project focuses on the financial sustainability of AI assistants through an economic simulation environment. This includes tools for tracking token expenditures and income generation, as well as simulations that analyze the trade-offs between immediate earnings and long-term skill acquisition. The system incl
OpenFGA is a fine-grained authorization server and policy decision point that implements relationship-based access control. It serves as a centralized authorization service for evaluating access requests and managing relationship tuples across distributed microservices and multi-tenant environments. The engine combines relationship graphs with attribute-based access control, using the Common Expression Language to evaluate dynamic runtime attributes and conditional access rules. It handles complex hierarchies and nested permissions by traversing chains of associations and parent-child links t
AgentOps is an observability platform and developer toolkit for monitoring the execution, performance, and reliability of autonomous agents powered by large language models. It serves as a system for tracking AI agent behavior, debugging complex workflows, and benchmarking model performance. The platform is distinguished by its ability to visualize multi-agent workflows through execution path graphing and session replays. It provides specific tools for calculating financial spend across various language model providers and supports a self-hosted observability stack for users who require full
GPTCache is a semantic caching layer and response optimizer for large language models. It functions as pluggable middleware for orchestration frameworks, utilizing vector database caching to store and retrieve model responses based on the semantic similarity of prompts rather than exact text matches. The system uses embeddings to determine cache hits by comparing the distance between new queries and stored vectors. It employs a hybrid storage model that persists original prompts in relational databases while maintaining high-dimensional embeddings in vector stores. The project covers a broad
GenericAgent is an LLM agent framework and autonomous system controller designed to manage local systems, web browsers, and hardware interfaces through action and observation loops. It functions as a tool orchestrator that routes model calls to local executors, enabling the automation of complex tasks on a host machine. The project is distinguished by its self-evolving AI agent capabilities, which convert successful execution paths into reusable procedural scripts and skill trees to reduce future reasoning overhead. It employs a context optimization engine that utilizes layered memory hierarc
ChatALL is a multi-model chat client and productivity tool designed to evaluate the quality of answers from different large language models. It provides a unified interface for interacting with various AI chatbots across different service providers from a single window, allowing users to send a single prompt to multiple models simultaneously. The application enables side-by-side response comparison through a dynamic columnar layout and concurrent querying. It functions as a local chat history manager, using a privacy-focused storage system to keep prompt records and conversation history saved
OpenPlayground is a web-based comparison playground and multi-provider client used to test and evaluate outputs from multiple large language models and local inference engines side-by-side. It serves as a local testing environment for routing prompts to various external APIs and on-device models through a single interface. The project enables concurrent request dispatching, allowing a single prompt to be sent to multiple models simultaneously for comparative analysis. It includes a parameter tuning interface for refining model behavior via generation settings and provides a system for detecti
ollama-python is a Python client for interacting with large language models. It provides an interface for sending prompts to receive text and chat completions, as well as a dedicated client for generating numerical vector embeddings from text. The project includes a wrapper that emulates the OpenAI API, allowing applications built for that standard to interact with local models. It also provides a non-blocking asynchronous client for executing concurrent requests. The library covers the full model lifecycle, including the ability to pull, create, list, and delete models within a local enviro
DataHub is a metadata management platform designed to unify technical, operational, and business context across diverse data ecosystems. By utilizing a graph-based metadata model and an event-driven ingestion architecture, it creates a centralized source of truth that maps complex data relationships, lineage, and ownership. This foundational framework enables organizations to maintain a synchronized view of their data landscape, supporting both human-led discovery and automated data operations. The platform distinguishes itself through its focus on grounding artificial intelligence and autono
Awesome Coding Plan is a community-driven knowledge repository that provides a comparative analysis of subscription-based coding environments and artificial intelligence development tools. It functions as a tracker for developer tool costs, aggregating data on pricing structures, usage quotas, and token limits to assist in the selection of cloud-based coding services. The project utilizes a standardized framework to evaluate the performance and economic efficiency of various language models. By organizing technical metrics into a unified format, it allows for the objective assessment of proce
Lowdb is a lightweight local data persistence library that manages application state as a JSON document. It functions as a database utility that stores information directly in local files or browser storage, removing the requirement for a dedicated database server or complex infrastructure. By treating stored data as standard objects and arrays, it provides a straightforward interface for managing application information. The library distinguishes itself through a modular adapter-based architecture that decouples data manipulation from the underlying storage medium. This design allows for the
Grav is a flat-file content management system that eliminates the need for a traditional database by storing site content and configuration in human-readable Markdown and YAML files. Built as a modular PHP web framework, it uses a hierarchical page routing system where the physical directory structure directly determines the site's URL paths. The platform is distinguished by its event-driven plugin architecture and a command-line interface that prioritizes system administration, deployment, and maintenance tasks. It utilizes a blueprint-driven system to generate administrative forms from stru
This project is a developer onboarding tool and GitHub issue discovery portal. It serves as a curated directory and contribution guide designed to match new programmers with beginner-friendly open source tasks based on their technical skills and experience level. The platform operates as a static project directory, using a serverless catalog of repositories stored in JSON files. It provides the ability to filter tasks by programming language and difficulty, enabling users to identify approachable starting points in active software projects. The system includes capabilities for repository cur
Caveman is a set of tools and configurations designed for large language model token optimization. It focuses on reducing the amount of data processed during AI interactions to lower costs and maximize the available context window. The project implements a fragmented communication style that replaces full grammatical sentences with concise technical keywords. This approach extends to AI context optimization by condensing memory files and tool descriptions, and includes a specialized configuration for generating terse, one-line code reviews and short conventional commit messages. The system i
This project is a curated database and repository of standardized infrared signal sequences designed for hardware remote emulation. It provides a comprehensive library of infrared codes used to operate a wide variety of consumer electronics, household appliances, and hardware devices. The collection organizes infrared commands by manufacturer and device category to facilitate the remote operation of audio-visual equipment, climate control systems, lighting, and photography hardware. It also includes specialized signals for gaming consoles, monitors, and videoconferencing equipment. The syste
Manifest is a language model provider unification system that standardizes access to multiple AI backends through a single interface. It functions as a centralized management layer for integrating various cloud-based and local model providers to simplify how applications request completions. The system provides intelligent model routing and high availability infrastructure by directing queries based on complexity and automatically triggering model fallbacks when a primary provider fails. It distinguishes itself through multi-tenant AI management, organizing agents into isolated groups with de
This project is a curated ebook directory and digital library designed for self-directed learning. It functions as a version-controlled knowledge base that provides a collection of downloadable academic textbooks and educational resources across multiple disciplines. The library is built as a markdown-based system where book metadata and download links are stored in flat files. These files are managed through a git-driven content pipeline and rendered as a static site resource index for fast access and simple hosting. The system utilizes a directory-based categorization model to organize mat
KeePassX is a cross-platform password vault application and database manager that stores usernames and passwords in an encrypted file. It utilizes the KeePass database format to secure records using a master password or an optional key file. The application includes an automatic form filler that populates login fields in external applications and a secure password generator for creating high-entropy randomized passwords. The system provides tools for organizing credentials through custom metadata and grouping, alongside utilities for searching records and importing or exporting data.
This project is a product support database and software version metadata registry that tracks release and discontinuation timelines for software and consumer electronics hardware. It provides a programmatic software lifecycle API for retrieving end-of-life and support dates via JSON. The project maps software products to standardized identifiers, including CPE, Repology, and PURL, to ensure cross-ecosystem identification. It also includes a runtime compatibility mapper that matches software versions to their minimum required compatible runtimes and dependencies. The system transforms lifecyc
ChatHub is a browser-based AI workspace and chatbot aggregator that provides a unified interface for interacting with multiple large language models. It functions as a multi-model AI client, allowing users to send a single prompt to several chatbots simultaneously and compare their responses side-by-side. The project distinguishes itself by acting as a cross-model response comparator that aggregates various web-based AI interfaces into a single view. It includes an AI prompt manager for storing and organizing reusable prompts to be used across different model sessions. The system covers a br
NeDB is a JavaScript embedded NoSQL document store designed for Node.js and the browser. It functions as an in-memory data store with the option to persist documents to a local file system, ensuring data survives application restarts. The project utilizes a MongoDB-compatible API to perform data operations, allowing it to serve as a lightweight document indexing system and a persistent file database without requiring a separate database server. Capabilities include querying, inserting, updating, and deleting documents, as well as the ability to create indexes on specific fields to accelerate
This project is an LLM API proxy gateway and compatibility layer designed to route, translate, and proxy requests between model clients and various large language model providers. It functions as a multi-provider router that maps model requests to alternative backends based on configurable tiers and capabilities. The gateway acts as a translation layer that converts API request and response formats between different providers, such as OpenAI or Gemini, to ensure compatibility. It includes a tool-use proxy to handle the execution and processing of model tool definitions and function calls, and