1 repo
Utilities for handling real-time, chunked responses from AI models.
Distinguishing note: Focuses on streaming protocol handling, distinct from standard request-response.
Explore 1 awesome GitHub repository matching artificial intelligence & ml · Response Streamers. Refine with filters or upvote what's useful.
LiteLLM is a unified gateway and proxy server designed to centralize access to over one hundred language model providers. It provides a standardized API interface that abstracts vendor-specific schemas, allowing developers to interact with diverse models through a single, consistent format. By acting as a central traffic management layer, it enables organizations to route, secure, and govern model interactions across multiple deployments. The platform distinguishes itself through its policy-driven architecture, which uses configuration-based routing to manage traffic distribution, load balanc
Enables real-time response chunking during completion requests for efficient text generation.