2 repos
Network services that expose model inference capabilities through standardized web APIs to support automated application workflows.
Explore 2 awesome GitHub repositories matching artificial intelligence & ml · Inference API Servers. Refine with filters or upvote what's useful.
ComfyUI is a node-based generative AI orchestration engine designed for constructing, testing, and executing complex image and video synthesis pipelines. By utilizing a directed acyclic graph execution model, the platform allows users to build reproducible workflows through modular, interconnected processing blocks wit
Serves visual, node-based generative pipelines as programmable API endpoints for integration into external software.
Llama.cpp is an inference engine designed for the local execution of text-based and multimodal language models on consumer hardware. It provides a core environment for running models that process both text and image inputs, utilizing hardware-accelerated backends to optimize performance across diverse CPU and GPU archi
Exposes inference capabilities via a lightweight HTTP server that supports standard chat completion and embedding endpoints.