awesome-repositories.com
© 2026 Bringes Technology SRL·VAT RO45896025·hello@bringes.io
MCPSitemapPrivacyTerms
Inference Backends · Awesome GitHub Repositories

2 repos

Awesome GitHub RepositoriesInference Backends

Hardware-agnostic layers for executing models across diverse computing environments.

Distinguishing note: Focuses on the abstraction layer for hardware compatibility rather than the model runtime itself.

Explore 2 awesome GitHub repositories matching artificial intelligence & ml · Inference Backends. Refine with filters or upvote what's useful.

  1. Home
  2. Artificial Intelligence & ML
  3. Inference Backends

Awesome Inference Backends GitHub Repositories

Describe the repository you're looking for…
Find the best repos with AI.We'll search the best matching repositories with AI.
  • zai-org/ChatGLM-6B

    zai-org/ChatGLM-6B

    41,232View on GitHub↗

    ChatGLM-6B is a generative AI inference engine designed for local execution of transformer-based language models. It provides a comprehensive runtime environment that allows users to load and run pre-trained neural network weights directly on their own hardware, ensuring data privacy and independence from external cloud services. The project distinguishes itself through a hardware-agnostic execution backend that supports deployment across diverse environments, including standard processors, Apple Silicon, and multi-GPU configurations. It incorporates advanced optimization techniques such as w

    Provides a hardware-agnostic layer to enable model execution across diverse computing environments.

    Python
    41,232View on GitHub↗
  • janhq/jan

    janhq/jan

    40,489View on GitHub↗

    Jan is a desktop application that functions as a local artificial intelligence model runtime and an open-standard API server. It enables the execution of large language models directly on local hardware, ensuring that data remains private and accessible offline while providing a unified interface for managing model weights and inference runtimes. The platform distinguishes itself by offering a modular inference backend that allows users to swap execution engines based on hardware compatibility and performance needs. It acts as a cross-platform orchestrator, providing the ability to switch bet

    Swaps between different underlying model execution engines to balance performance and hardware compatibility.

    TypeScriptchatgptgptllamacpp
    40,489View on GitHub↗