awesome-repositories.com
© 2026 Bringes Technology SRL·VAT RO45896025·hello@bringes.io
MCPSitemapPrivacyTerms
Inference Runtimes · Awesome GitHub Repositories

3 repos

Awesome GitHub RepositoriesInference Runtimes

Explore 3 awesome GitHub repositories matching artificial intelligence & ml · Inference Runtimes. Refine with filters or upvote what's useful.

  1. Home
  2. Artificial Intelligence & ML
  3. Machine Learning
  4. Frameworks
  5. Inference Runtimes

Awesome Inference Runtimes GitHub Repositories

Describe the repository you're looking for…
We'll search the best matching repositories with AI.
  • nomic-ai/gpt4all

    nomic-ai/gpt4all

    77,146GitHubView on GitHub↗

    GPT4All is a cross-platform runtime environment designed to execute large language models directly on local consumer hardware. By leveraging an optimized C++ inference backend, it enables private, offline AI interactions without requiring an internet connection or external cloud services. The project provides a compreh

    Transforms local data into searchable vector collections to provide context-aware, private knowledge retrieval for language models.

    C++ai-chatllm-inference
  • meta-llama/llama

    meta-llama/llama

    59,157GitHubView on GitHub↗

    Llama is a computational framework and runtime environment designed for executing transformer-based neural networks locally. It functions as a generative AI inference engine, enabling the processing of input sequences through pre-trained model weights to produce text completions and structured data outputs directly on

    Optimizes the loading and execution of transformer-based neural networks on standard computing hardware.

    Python
  • zylon-ai/private-gpt

    zylon-ai/private-gpt

    57,116GitHubView on GitHub↗

    This project is a privacy-first backend service designed to facilitate retrieval-augmented generation by processing local documents into searchable vector representations. It provides a modular architecture that allows users to ingest diverse file formats, manage document metadata, and perform semantic searches to prov

    Transforms local data into searchable collections to enable context-aware responses from both local and cloud-based models.

    Python

Explore sub-tags

  • Large Language Model RuntimesExecution environments specifically optimized for loading and running transformer-based neural networks on standard hardware.
  • Retrieval Augmented Generation EnginesBackends that process local data into searchable collections to provide context-aware responses for AI models.