awesome-repositories.com
© 2026 Bringes Technology SRL·VAT RO45896025·hello@bringes.io
MCPSitemapPrivacyTerms
Inference & Deployment · Awesome GitHub Repositories

4 repos

Awesome GitHub RepositoriesInference & Deployment

Explore 4 awesome GitHub repositories matching artificial intelligence & ml · Inference & Deployment. Refine with filters or upvote what's useful.

  1. Home
  2. Artificial Intelligence & ML
  3. Machine Learning
  4. Infrastructure
  5. Model Optimization
  6. Inference & Deployment

Awesome Inference & Deployment GitHub Repositories

Describe the repository you're looking for…
We'll search the best matching repositories with AI.
  • tensorflow/tensorflow

    tensorflow/tensorflow

    193,864GitHubView on GitHub↗

    TensorFlow is a comprehensive machine learning framework designed for the construction, training, and deployment of complex mathematical models. It utilizes a graph-based execution model that represents operations as directed acyclic graphs, enabling automatic differentiation and efficient parallel processing. The syst

    Refines models for production execution to improve performance and reduce resource consumption on target hardware.

    C++deep-learningdeep-neural-networksdistributed
  • vllm-project/vllm

    vllm-project/vllm

    70,745GitHubView on GitHub↗

    vLLM is a high-throughput inference engine designed for the efficient serving and execution of large language models. It functions as a production-ready distributed model server, providing standard API protocols for online serving while also supporting offline batch processing. The system is built to maximize token gen

    Supports configurable, high-performance attention backends that automatically detect and optimize computation for specific hardware accelerators.

    Pythonamdblackwellcuda
  • ultralytics/ultralytics

    ultralytics/ultralytics

    53,426GitHubView on GitHub↗

    Ultralytics is a comprehensive computer vision framework designed for training, validating, and deploying deep learning models across a wide range of visual recognition tasks. It provides a unified interface for core operations including object detection, instance segmentation, pose estimation, and image classification

    Exports and optimizes models for high-performance execution across cloud and edge hardware environments.

    Pythonclicomputer-visiondeep-learning
  • facebookresearch/segment-anything

    facebookresearch/segment-anything

    53,431GitHubView on GitHub↗

    This project provides a deep learning architecture designed to identify and isolate distinct objects within images by generating precise pixel-level masks. It functions as a browser-based inference engine, enabling the execution of complex machine learning models directly within web environments without requiring serve

    Optimizes and compresses deep learning models to minimize resource consumption during browser-based deployment.

    Jupyter Notebook

Explore sub-tags

  • Attention BackendsOptimized computational backends specifically designed to accelerate the attention mechanisms used in transformer models.
  • Deployment OptimizationsMethods for refining models for production execution to improve performance and reduce resource consumption on target hardware.
  • Edge and MobileReduces model size and computational requirements through quantization and compression.
  • Model Deployment Toolkits
Toolkits that streamline the packaging, configuration, and deployment of machine learning models into production environments.
  • Web Model OptimizersCompressing and converting models for efficient browser deployment.