Tools for distributing network traffic across multiple service instances to ensure high availability.
Distinguishing note: Focuses on infrastructure-level traffic distribution for AI model deployments.
DevOps & Infrastructure · Traffic Load Balancers: 1 GitHub repository.
LiteLLM is a unified gateway and proxy server designed to centralize access to over one hundred language model providers. It provides a standardized API interface that abstracts vendor-specific schemas, allowing developers to interact with diverse models through a single, consistent format. By acting as a central traffic management layer, it enables organizations to route, secure, and govern model interactions across multiple deployments. The platform distinguishes itself through its policy-driven architecture, which uses configuration-based routing to manage traffic distribution, load balanc
Distributes traffic across multiple model deployments using routing rules and automatic fallbacks for high availability.
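To make "routing rules and automatic fallbacks" concrete, here is a minimal, library-free sketch of the general idea: requests rotate across deployments, and a failed deployment is skipped in favor of the next one. This is not LiteLLM's implementation or API. The names `make_router`, `AllDeploymentsFailed`, `healthy`, `broken`, and `backup` are hypothetical, and each deployment is modeled as a plain Python callable rather than a real model endpoint.

```python
import itertools
from typing import Callable, List, Sequence


class AllDeploymentsFailed(Exception):
    """Raised when every deployment fails for a single request (hypothetical)."""


def make_router(deployments: Sequence[Callable[[str], str]]) -> Callable[[str], str]:
    """Build a round-robin router that falls back to the next deployment on error."""
    if not deployments:
        raise ValueError("at least one deployment is required")
    counter = itertools.count()  # advances once per request to rotate the start point

    def route(prompt: str) -> str:
        start = next(counter) % len(deployments)
        errors: List[Exception] = []
        # Try each deployment at most once, beginning at the rotating start index.
        for offset in range(len(deployments)):
            deployment = deployments[(start + offset) % len(deployments)]
            try:
                return deployment(prompt)
            except Exception as exc:  # any failure triggers fallback in this sketch
                errors.append(exc)
        raise AllDeploymentsFailed(errors)

    return route


# Stand-in deployments (assumptions for illustration only).
def healthy(prompt: str) -> str:
    return f"primary:{prompt}"


def broken(prompt: str) -> str:
    raise RuntimeError("deployment unavailable")


def backup(prompt: str) -> str:
    return f"backup:{prompt}"


route = make_router([healthy, broken, backup])
results = [route("hi") for _ in range(3)]
print(results)
```

In this sketch the second request starts at the failing deployment and is transparently served by the next one, which is the availability property the description refers to. A production gateway would typically add health tracking, retries with backoff, and weighted or latency-aware selection.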