awesome-repositories.com
© 2026 Bringes Technology SRL·VAT RO45896025·hello@bringes.io
MCPSitemapPrivacyTerms
Model Inference · Awesome GitHub Repositories

4 repos

Awesome GitHub RepositoriesModel Inference

Frameworks and utilities for loading models, generating predictions from input data, and processing or configuring inference results.

Explore 4 awesome GitHub repositories matching artificial intelligence & ml · Model Inference. Refine with filters or upvote what's useful.

  1. Home
  2. Artificial Intelligence & ML
  3. Model Lifecycle Management
  4. Model Inference and Serving
  5. Model Inference

Awesome Model Inference GitHub Repositories

Describe the repository you're looking for…
We'll search the best matching repositories with AI.
  • vllm-project/vllm

    vllm-project/vllm

    70,745GitHubView on GitHub↗

    vLLM is a high-throughput inference engine designed for the efficient serving and execution of large language models. It functions as a production-ready distributed model server, providing standard API protocols for online serving while also supporting offline batch processing. The system is built to maximize token gen

    Pythonamdblackwellcuda
  • ultralytics/yolov5

    ultralytics/yolov5

    56,830GitHubView on GitHub↗

    YOLOv5 is a comprehensive computer vision framework designed for end-to-end deep learning, specializing in real-time object detection, image classification, and instance segmentation. It provides a unified toolkit that manages the entire lifecycle of a model, from initial dataset configuration and hyperparameter tuning

    Pythoncoremldeep-learningios
  • karpathy/nanoGPT

    karpathy/nanoGPT

    53,461GitHubView on GitHub↗

    nanoGPT is a lightweight engine for training and fine-tuning transformer-based language models from scratch. It provides a minimalist codebase designed for educational exploration and rapid experimentation with neural network architectures, utilizing self-attention and feed-forward layers to process sequences and predi

    Python
  • ultralytics/ultralytics

    ultralytics/ultralytics

    53,426GitHubView on GitHub↗

    Ultralytics is a comprehensive computer vision framework designed for training, validating, and deploying deep learning models across a wide range of visual recognition tasks. It provides a unified interface for core operations including object detection, instance segmentation, pose estimation, and image classification

    Pythonclicomputer-visiondeep-learning

Explore sub-tags

  • Inference Configuration ParametersSettings that control the sensitivity and output behavior of model inference.
  • Inference GeneratorsInterfaces for producing text outputs from models using configurable sampling parameters.
  • Inference Result ProcessorsStructures and utilities for parsing, filtering, and manipulating raw model output data.
  • Model Loading Utilities
Mechanisms for importing and initializing custom-trained model weights into an inference engine.
  • Offline Inference EnginesSystems designed for batch-processed, non-real-time model execution and text generation.