30 open-source projects similar to diegosouzapw/omniroute, ranked by how many features they have in common. Compare stars, activity and what each one does to find the best OmniRoute alternative.
MIRIX is an AI agent state orchestrator and long-term memory system designed to provide persistent context for large language models. It functions as a multi-modal AI memory pipeline that processes text, voice, and screen captures into structured knowledge stores, including a dedicated screen activity knowledge base. The project distinguishes itself by integrating a multi-modal observation pipeline that monitors desktop activity in real-time to build a searchable history of user actions. It utilizes a multi-tiered memory hierarchy—separating episodic, semantic, procedural, and core stores—and
InsForge is a backend-as-a-service platform that provides an integrated suite of tools for managing relational databases, identity provision, object storage, and serverless compute. It functions as an open-source identity provider and a PostgreSQL database manager featuring integrated vector storage and row-level security. The platform serves as an LLM orchestration gateway, offering a unified endpoint to route requests across various AI providers through an OpenAI-compatible interface. It enables AI-driven application generation and connects AI agents to backend resources using a standardize
ClawRouter is an AI model router and API gateway designed to classify query complexity and assign prompts to the most efficient model tier. It operates as a multi-model AI proxy that orchestrates traffic between various large language models and AI media generators through a unified interface. The project distinguishes itself by integrating a non-custodial micropayment processor using the x402 protocol. This allows for per-request API access and USDC settlement on Base and Solana chains, replacing static API keys with wallet-based authentication and real-time budget enforcement. The system c
Kilocode is an autonomous engineering platform designed to orchestrate AI agents for complex software development tasks. It functions as a comprehensive system for automating coding, testing, and repository management by integrating directly with your codebase and terminal. The platform provides a unified gateway for model orchestration, allowing for the management of agentic workflows, event-driven automation, and persistent session state across distributed development environments. The platform distinguishes itself through its federated task management and policy-based access control, which
Mastra is an orchestration framework designed for building, deploying, and managing autonomous AI agents and multi-agent systems. It provides a comprehensive suite of primitives for creating resilient AI applications, including durable workflow orchestration, event-driven agent loops, and semantic memory management. By integrating these core components, the platform enables developers to build complex, multi-step processes that can reason about goals and execute tasks without manual intervention. The framework distinguishes itself through its focus on observability and secure, isolated execut
FastMCP is a Python framework designed for building servers that expose functions, resources, and prompts to AI models using the Model Context Protocol. It simplifies the development process by automatically deriving tool metadata, input schemas, and documentation directly from Python function signatures and type hints. The framework provides a unified container for managing these components, allowing developers to build modular applications that integrate seamlessly with AI assistants. The project distinguishes itself through its support for interactive, server-defined user interface compone
Wenda is a self-hosted infrastructure and gateway platform for deploying language models within internal networks to ensure data privacy and security. It functions as a centralized hub and API gateway that unifies communication between various offline model runners and online service providers through a single interface. The platform includes a workflow orchestrator that uses custom scripts and API calls to automate complex conversation flows and model settings. It also incorporates a retrieval system that augments model responses with external knowledge retrieved from vector databases and se
This project is an LLM API proxy gateway and compatibility layer designed to route, translate, and proxy requests between model clients and various large language model providers. It functions as a multi-provider router that maps model requests to alternative backends based on configurable tiers and capabilities. The gateway acts as a translation layer that converts API request and response formats between different providers, such as OpenAI or Gemini, to ensure compatibility. It includes a tool-use proxy to handle the execution and processing of model tool definitions and function calls, and
AIClient-2-API is an AI gateway and proxy server that translates diverse large language model interfaces into a single standardized API format. It functions as an OpenAI API compatible proxy and multi-provider orchestrator, allowing a single client to interact with multiple different model backends through a unified interface. The project distinguishes itself by acting as a load balancer that distributes requests across multiple provider accounts using health checks and polling to bypass quota limits. It includes a TLS fingerprint emulator to simulate browser characteristics and prevent API a
Plano is an AI agent orchestrator and LLM gateway proxy that unifies access to multiple AI providers through a single interoperable interface. It functions as a model routing engine that decouples applications from specific vendors using semantic aliases, allowing traffic to be shifted between providers without modifying application code. The system distinguishes itself with intent-based agent routing, which directs prompts to specialized agents based on semantic analysis. It features an interceptor-based filter chain system that acts as guardrail middleware to enforce safety policies, rewrit
This project is a multi-provider AI gateway and proxy server that intercepts and routes requests between AI clients and various large language model providers. It functions as an API protocol translator and model router, mapping incoming requests to specific upstream providers or local runners to provide a unified interface for multiple models. The system distinguishes itself by bridging chat platforms and command line interfaces, converting messages from chat services into managed command line sessions. It further optimizes traffic by executing certain web search and fetch requests locally a
Helicone is an AI gateway and observability platform designed to intercept, manage, and monitor interactions with large language models. By acting as a reverse-proxy, it provides a centralized layer for routing requests across multiple AI providers, allowing developers to maintain consistent application logic while gaining deep visibility into model performance, usage, and costs. The platform distinguishes itself through a robust suite of traffic management and prompt engineering tools. It enables policy-driven control, including automatic failover between providers, rate limiting, and edge-b
CoAI is an enterprise-grade, self-hostable AI gateway platform that unifies access to over 200 AI models from more than 35 providers through a single OpenAI-compatible API endpoint. It functions as a multi-tenant gateway, routing requests across providers with load balancing, automatic failover, and priority-based routing, while exposing standard OpenAI API endpoints for chat, image generation, model listing, and billing to enable seamless integration with existing tools and clients. The platform distinguishes itself through a comprehensive set of operational capabilities built around the gat
BAML is a prompt engineering framework and LLM client generator that defines AI prompts as type-safe functions. It serves as a structured data extraction tool and workflow orchestrator, transforming unstructured model responses into strongly typed objects using a custom schema language and alignment algorithms. The project distinguishes itself by using a compiler to generate language-specific boilerplate code for API communication and output parsing. It features a dedicated environment for designing complex prompt templates with conditional logic and reusable snippets, and employs genetic alg
Mods is a terminal-based AI client that sends prompts to large language models and streams responses back to the command line. It functions as a multi-provider AI gateway, routing queries to OpenAI, Cohere, Groq, Gemini, and local endpoints, and includes a conversation history manager that saves, caches, branches, and resumes text-based interactions. The tool also operates as a Model Context Protocol client, connecting to external MCP servers via stdio, SSE, or HTTP to extend model capabilities with specialized tools and data. The project distinguishes itself through a config-driven provider
Koog is an LLM agent framework used to build autonomous entities that execute tool-based workflows. It utilizes a graph-based workflow engine to define agent behaviors and decision paths as a directed graph of nodes and edges. The framework distinguishes itself through a model provider orchestrator that enables dynamic switching, load balancing, and automatic fallbacks between different AI backends. It implements the Model Context Protocol to connect agents to remote tool servers and features a RAG memory system using vector embeddings to maintain long-term conversation context. The project
MemOS is an open-source persistent memory layer for AI agents and large language models, providing a self-hosted server that stores and retrieves structured memory across sessions. It enables AI systems to recall user preferences, history, and context without retraining, using a graph-based API and a web management interface for viewing, editing, and organizing memory items, skills, traces, and knowledge bases. The system distinguishes itself through a portable memory interchange protocol that allows memory to be transferred between different AI models, devices, and applications, along with a
This project is an artificial intelligence API gateway that centralizes connections to multiple model providers into a single, standardized interface. By acting as a proxy, it translates diverse provider protocols into a format compatible with existing clients, allowing developers to integrate various language models without managing provider-specific software development kits. The gateway distinguishes itself through a robust traffic management layer that includes intelligent request routing, weighted load balancing, and automated failover mechanisms to ensure service availability. It incorp
Axonhub is an AI gateway and multi-model API proxy that provides a unified interface for routing requests to multiple large language model providers. It functions as a load balancer and translation layer, converting a standardized API format into provider-specific payloads to enable communication with various AI models without provider-specific code. The system manages traffic through rule-based routing and automatic failover to maintain high availability. It differentiates its operations by providing a provider-agnostic interface that decouples client requests from specific model backends us
RouteLLM is a routing framework and traffic manager designed to direct prompts between high-capability and low-cost large language models. It functions as an API gateway that mimics the OpenAI specification to route requests across different model providers. The system optimizes operational costs by splitting traffic between model tiers based on predicted win rates and prompt complexity. It includes a calibration tool to analyze sample queries and determine the optimal cost-quality tradeoff for traffic distribution. The framework provides a tool for measuring the accuracy and cost efficiency
gpt4free-ts is a TypeScript-based LLM API proxy and gateway that provides a unified interface for accessing large language models without paid subscriptions or official API keys. It functions as a containerized AI bridge that routes requests to various free third-party providers to retrieve chat completions. The project acts as an OpenAI API wrapper, translating requests and responses into the standard OpenAI chat completions format to ensure compatibility with existing AI tools. It utilizes a provider-based routing system to distribute request loads across available endpoints. The gateway s
TensorZero is an inference gateway and experimentation framework designed to manage the lifecycle of large language models in production environments. It functions as a central proxy that routes requests across multiple artificial intelligence providers while providing the infrastructure necessary to monitor performance, track costs, and ensure service reliability. The platform distinguishes itself by integrating a comprehensive evaluation engine and an observability pipeline directly into the request flow. It enables developers to conduct controlled experiments and A/B tests to compare diffe
Omi is an open-source wearable AI platform that captures audio and screen data to provide real-time conversational assistance and memory. It integrates a wearable hardware development kit with a vector memory database and large language model capabilities to create a persistent digital record of user interactions. The platform is distinguished by its BLE audio streaming pipeline, which transmits raw audio from wearable hardware for real-time transcription and speaker identification. It utilizes a plugin-based agent tool framework that allows AI assistants to autonomously invoke custom functio
KeePassX is a cross-platform password vault application and database manager that stores usernames and passwords in an encrypted file. It utilizes the KeePass database format to secure records using a master password or an optional key file. The application includes an automatic form filler that populates login fields in external applications and a secure password generator for creating high-entropy randomized passwords. The system provides tools for organizing credentials through custom metadata and grouping, alongside utilities for searching records and importing or exporting data.
This project is a terminal-based command line interface client and agent orchestrator for interacting with multiple large language model providers. It functions as an OpenAI API client and a local API gateway that exposes chat completions and embeddings through an HTTP server. The system distinguishes itself by providing a retrieval-augmented generation tool for indexing local files and URLs into a vector database to provide custom document context. It allows for the creation of specialized AI agents that combine custom system prompts with tool calling and external function execution. The to
TaskWeaver is an LLM agent framework that interprets natural language requests and executes them as Python code, SQL queries, or shell commands. It functions as a conversational code interpreter that maintains stateful data structures across turns, generating executable code from user prompts within a session-based environment. The system is designed as a self-hosted AI agent platform that can be deployed in Docker, managing sessions and providing a web UI for data analytics and automation tasks. The framework distinguishes itself through a role-based multi-agent architecture that divides the
Quotio is a local LLM API proxy gateway and credential manager that intercepts and routes requests from command-line tools and integrated development environments to various AI model providers. It serves as a centralized authentication hub, managing API keys and service accounts to provide a unified interface for external AI agents. The project distinguishes itself through a routing engine that implements priority-chain and round-robin load balancing to distribute workloads across multiple accounts. It features automated API key failover, which redirects requests to backup authentication keys
This is a mobile object database and NoSQL local data store that replaces relational tables with a schema-based model. It functions as a reactive data store, using live object observations and change notifications to trigger automatic user interface refreshes. The system provides built-in mobile cloud data synchronization to keep local datasets consistent with a remote server across multiple devices. It also includes security features for encrypted local storage, protecting sensitive on-disk data using at-rest encryption keys and fine-grained access control. Broad capabilities include object
This is an open-source platform for creating, hosting, and interacting with persistent AI characters that maintain personality and memory across conversations. The system orchestrates the full lifecycle of an AI companion by combining character definitions, conversation history, memory retrieval, model abstraction, and external communication channels into a unified runtime pipeline. The platform enables users to define detailed character personalities through structured configuration files that shape conversational behavior, and supports multi-turn dialogue through a memory system that stores