awesome-repositories.com
© 2026 Bringes Technology SRL·VAT RO45896025·hello@bringes.io
Local Inference and Deployment · Awesome GitHub Repositories

4 repos

Infrastructure and engines for executing models directly on local hardware or edge devices, prioritizing privacy and performance.

Explore 4 awesome GitHub repositories matching Artificial Intelligence & ML · Local Inference and Deployment.

Awesome Local Inference and Deployment GitHub Repositories

  • vllm-project/vllm

    70,745 · GitHub

    vLLM is a high-throughput inference engine designed for the efficient serving and execution of large language models. It functions as a production-ready distributed model server, providing standard API protocols for online serving while also supporting offline batch processing. The system is built to maximize token gen

    Python · amd · blackwell · cuda
  • meta-llama/llama

    59,157 · GitHub

    Llama is a computational framework and runtime environment designed for executing transformer-based neural networks locally. It functions as a generative AI inference engine, enabling the processing of input sequences through pre-trained model weights to produce text completions and structured data outputs directly on

    Python
  • zylon-ai/private-gpt

    57,116 · GitHub

    This project is a privacy-first backend service designed to facilitate retrieval-augmented generation by processing local documents into searchable vector representations. It provides a modular architecture that allows users to ingest diverse file formats, manage document metadata, and perform semantic searches to prov

    Python
  • ultralytics/ultralytics

    53,426 · GitHub

    Ultralytics is a comprehensive computer vision framework designed for training, validating, and deploying deep learning models across a wide range of visual recognition tasks. It provides a unified interface for core operations including object detection, instance segmentation, pose estimation, and image classification

    Python · cli · computer-vision · deep-learning

Explore sub-tags

  • Edge AI Model Deployment: Technologies that optimize and deploy machine learning models to run efficiently on local hardware and edge devices.
  • Local Inference Engines: Software frameworks that enable the execution of generative artificial intelligence models directly on local computing hardware.
  • Privacy-First AI Backends: Infrastructure that ensures data privacy and security by processing AI requests locally rather than on external servers.