awesome-repositories.com
© 2026 Bringes Technology SRL·VAT RO45896025·hello@bringes.io
MCPSitemapPrivacyTerms
Model Optimization Toolkits · Awesome GitHub Repositories

3 repos

Awesome GitHub RepositoriesModel Optimization Toolkits

Utilities for compressing, quantizing, and tuning models to improve performance and reduce resource consumption on target hardware.

Explore 3 awesome GitHub repositories matching artificial intelligence & ml · Model Optimization Toolkits. Refine with filters or upvote what's useful.

  1. Home
  2. Artificial Intelligence & ML
  3. Machine Learning Infrastructure
  4. Model Optimization Toolkits

Awesome Model Optimization Toolkits GitHub Repositories

Describe the repository you're looking for…
We'll search the best matching repositories with AI.
  • vllm-project/vllm

    vllm-project/vllm

    70,745GitHubView on GitHub↗

    vLLM is a high-throughput inference engine designed for the efficient serving and execution of large language models. It functions as a production-ready distributed model server, providing standard API protocols for online serving while also supporting offline batch processing. The system is built to maximize token gen

    Pythonamdblackwellcuda
  • ultralytics/ultralytics

    ultralytics/ultralytics

    53,426GitHubView on GitHub↗

    Ultralytics is a comprehensive computer vision framework designed for training, validating, and deploying deep learning models across a wide range of visual recognition tasks. It provides a unified interface for core operations including object detection, instance segmentation, pose estimation, and image classification

    Pythonclicomputer-visiondeep-learning
  • unslothai/unsloth

    unslothai/unsloth

    52,461GitHubView on GitHub↗

    Unsloth is a high-performance training and inference platform designed to optimize the lifecycle of large language and multimodal models. It provides a comprehensive engine for fine-tuning, executing, and managing models locally, with a focus on reducing memory consumption and increasing compute speed on consumer-grade

    Pythonagentdeepseekdeepseek-r1

Explore sub-tags

  • Attention BackendsOptimized computational backends specifically designed to accelerate the attention mechanisms used in transformer models.
  • Model Deployment ToolkitsToolkits that streamline the packaging, configuration, and deployment of machine learning models into production environments.
  • Model Quantization FrameworksFrameworks that reduce model size and computational requirements by converting high-precision weights into lower-precision formats.
  • Training Backend Optimizers
Optimization algorithms and software layers that improve the speed and efficiency of the model training process.