2 repos

Awesome GitHub RepositoriesInference API Servers

Network services that expose model inference capabilities through standardized web APIs to support automated application workflows.

Explore 2 awesome GitHub repositories matching artificial intelligence & ml · Inference API Servers. Refine with filters or upvote what's useful.

We'll search the best matching repositories with AI.

Comfy-Org/ComfyUI
Comfy-Org/ComfyUI
103,654GitHubView on GitHub
ComfyUI is a node-based generative AI orchestration engine designed for constructing, testing, and executing complex image and video synthesis pipelines. By utilizing a directed acyclic graph execution model, the platform allows users to build reproducible workflows through modular, interconnected processing blocks wit
Serves visual, node-based generative pipelines as programmable API endpoints for integration into external software.
Pythonaicomfycomfyui
ggml-org/llama.cpp
ggml-org/llama.cpp
95,400GitHubView on GitHub
Llama.cpp is an inference engine designed for the local execution of text-based and multimodal language models on consumer hardware. It provides a core environment for running models that process both text and image inputs, utilizing hardware-accelerated backends to optimize performance across diverse CPU and GPU archi
Exposes inference capabilities via a lightweight HTTP server that supports standard chat completion and embedding endpoints.
C++ggml

Explore sub-tags

Workflow-Driven Inference ServersInference servers that execute visual, node-based generative pipelines as programmable API endpoints.