awesome-repositories.com
© 2026 Bringes Technology SRL·VAT RO45896025·hello@bringes.io
MCPSitemapPrivacyTerms
Inference API Servers · Awesome GitHub Repositories

2 repos

Awesome GitHub RepositoriesInference API Servers

Network services that expose model inference capabilities through standardized web APIs to support automated application workflows.

Explore 2 awesome GitHub repositories matching artificial intelligence & ml · Inference API Servers. Refine with filters or upvote what's useful.

  1. Home
  2. Artificial Intelligence & ML
  3. Machine Learning
  4. Infrastructure
  5. Deployment & Serving
  6. Inference Servers and Runtimes
  7. Inference API Servers

Awesome Inference API Servers GitHub Repositories

Describe the repository you're looking for…
We'll search the best matching repositories with AI.
  • Comfy-Org/ComfyUI

    Comfy-Org/ComfyUI

    103,654GitHubView on GitHub↗

    ComfyUI is a node-based generative AI orchestration engine designed for constructing, testing, and executing complex image and video synthesis pipelines. By utilizing a directed acyclic graph execution model, the platform allows users to build reproducible workflows through modular, interconnected processing blocks wit

    Serves visual, node-based generative pipelines as programmable API endpoints for integration into external software.

    Pythonaicomfycomfyui
  • ggml-org/llama.cpp

    ggml-org/llama.cpp

    95,400GitHubView on GitHub↗

    Llama.cpp is an inference engine designed for the local execution of text-based and multimodal language models on consumer hardware. It provides a core environment for running models that process both text and image inputs, utilizing hardware-accelerated backends to optimize performance across diverse CPU and GPU archi

    Exposes inference capabilities via a lightweight HTTP server that supports standard chat completion and embedding endpoints.

    C++ggml

Explore sub-tags

  • Workflow-Driven Inference ServersInference servers that execute visual, node-based generative pipelines as programmable API endpoints.