awesome-repositories.com
© 2026 Bringes Technology SRL·VAT RO45896025·hello@bringes.io
Engines, Runtimes & Servers · Awesome GitHub Repositories


Explore 8 awesome GitHub repositories matching Artificial Intelligence & ML · Engines, Runtimes & Servers. Refine with filters or upvote what's useful.


Awesome Engines, Runtimes & Servers GitHub Repositories

  • hacksider/Deep-Live-Cam

    79,568 · GitHub

    Deep-Live-Cam is a generative video transformation tool designed for real-time facial manipulation and cinematic enhancement. It functions as a local-first AI runtime, performing all media processing directly on the user's hardware to ensure complete data privacy without external network dependencies. By utilizing a hi

    Executes deep learning models directly on hardware-specific providers to minimize latency.

    Python · ai · ai-deep-fake · ai-face
  • vllm-project/vllm

    70,745 · GitHub

    vLLM is a high-throughput inference engine designed for the efficient serving and execution of large language models. It functions as a production-ready distributed model server, providing standard API protocols for online serving while also supporting offline batch processing. The system is built to maximize token gen

    Executes custom model architectures using highly optimized native implementations and support for various data formats.

    Python · amd · blackwell · cuda
  • hiyouga/LlamaFactory

    67,386 · GitHub

    LlamaFactory is a unified framework for fine-tuning and adapting large language models. It provides a comprehensive platform that standardizes training workflows across diverse machine learning architectures, allowing users to execute both full-tuning and parameter-efficient methods through a single interface. The pro

    Exposes trained models via standardized network protocols to facilitate scalable and reliable prediction services.

    Python · agent · ai · deepseek
  • meta-llama/llama

    59,157 · GitHub

    Llama is a computational framework and runtime environment designed for executing transformer-based neural networks locally. It functions as a generative AI inference engine, enabling the processing of input sequences through pre-trained model weights to produce text completions and structured data outputs directly on

    Maintains context within a sliding window buffer to process inference tasks independently without persistent server state.

    Python
  • ultralytics/yolov5

    56,830 · GitHub

    YOLOv5 is a comprehensive computer vision framework designed for end-to-end deep learning, specializing in real-time object detection, image classification, and instance segmentation. It provides a unified toolkit that manages the entire lifecycle of a model, from initial dataset configuration and hyperparameter tuning

    Deploys exported models to web browsers using specialized formats for real-time client-side detection.

    Python · coreml · deep-learning · ios
  • karpathy/nanoGPT

    53,461 · GitHub

    nanoGPT is a lightweight engine for training and fine-tuning transformer-based language models from scratch. It provides a minimalist codebase designed for educational exploration and rapid experimentation with neural network architectures, utilizing self-attention and feed-forward layers to process sequences and predi

    Exposes a command-line interface for sampling text sequences with adjustable generation settings.

    Python
  • facebookresearch/segment-anything

    53,431 · GitHub

    This project provides a deep learning architecture designed to identify and isolate distinct objects within images by generating precise pixel-level masks. It functions as a browser-based inference engine, enabling the execution of complex machine learning models directly within web environments without requiring serve

    Enables the execution of sophisticated deep learning models directly within the browser environment using hardware-accelerated runtimes.

    Jupyter Notebook
  • unslothai/unsloth

    52,461 · GitHub

    Unsloth is a high-performance training and inference platform designed to optimize the lifecycle of large language and multimodal models. It provides a comprehensive engine for fine-tuning, executing, and managing models locally, with a focus on reducing memory consumption and increasing compute speed on consumer-grade

    Exposes loaded models via command-line API endpoints with built-in authentication for scalable inference services.

    Python · agent · deepseek · deepseek-r1

Explore sub-tags

  • Browser-based Inference Engines: Tools and libraries that enable the execution of pre-trained machine learning models directly within web browsers.
  • Custom Model Architectures: Specialized model structures designed for unique inference requirements that deviate from standard off-the-shelf architectures.
  • Custom Model Execution Engines: Systems that support the execution of user-defined or custom model architectures via optimized backends.
  • GPU-Accelerated: Executes deep learning models on hardware-specific providers to minimize latency.
  • Inference Execution Models (1 sub-tag): Architectural approaches for managing inference tasks, including the use of sliding windows to maintain context during execution.
  • Model Inference Servers: Dedicated server applications that host machine learning models to provide scalable, network-accessible inference services.
  • Text Generation Runtimes: Command-line tools and engines for sampling sequences from neural networks with configurable generation parameters.
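
To make the "configurable generation parameters" mentioned under Text Generation Runtimes more concrete, here is a minimal Python sketch of temperature and top-k sampling over a list of raw scores. It is not taken from any repository listed on this page; the function name sample_next_token and its parameters are hypothetical and chosen only for illustration.

```python
import math
import random


def sample_next_token(logits, temperature=1.0, top_k=None, rng=random):
    """Pick one token index from raw scores (logits).

    Hypothetical helper for illustration; not the API of any listed project.
    """
    if temperature <= 0:
        raise ValueError("temperature must be positive")
    if top_k is not None and top_k < 1:
        raise ValueError("top_k must be at least 1")

    # Rank token indices by score, highest first, and keep the top_k best.
    ranked = sorted(range(len(logits)), key=lambda i: logits[i], reverse=True)
    if top_k is not None:
        ranked = ranked[:top_k]

    # Temperature-scaled softmax weights over the surviving candidates.
    scaled = [logits[i] / temperature for i in ranked]
    peak = max(scaled)  # subtract the max for numerical stability
    weights = [math.exp(s - peak) for s in scaled]

    # random.choices normalises the weights internally.
    return rng.choices(ranked, weights=weights, k=1)[0]
```

In this sketch a lower temperature concentrates probability on the highest-scoring tokens, and top_k=1 reduces sampling to a greedy pick of the single best token.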