awesome-repositories.com
© 2026 Bringes Technology SRL·VAT RO45896025·hello@bringes.io
MCPSitemapPrivacyTerms
Model Optimization · Awesome GitHub Repositories

10 repos

Awesome GitHub RepositoriesModel Optimization

Techniques and utilities designed to improve model performance, reduce resource consumption, and refine parameters for specific deployment environments.

Explore 10 awesome GitHub repositories matching artificial intelligence & ml · Model Optimization. Refine with filters or upvote what's useful.

  1. Home
  2. Artificial Intelligence & ML
  3. Model Lifecycle Management
  4. Model Optimization

Awesome Model Optimization GitHub Repositories

Describe the repository you're looking for…
We'll search the best matching repositories with AI.
  • tensorflow/tensorflow

    tensorflow/tensorflow

    193,864GitHubView on GitHub↗

    TensorFlow is a comprehensive machine learning framework designed for the construction, training, and deployment of complex mathematical models. It utilizes a graph-based execution model that represents operations as directed acyclic graphs, enabling automatic differentiation and efficient parallel processing. The syst

    C++deep-learningdeep-neural-networksdistributed
  • huggingface/transformers

    huggingface/transformers

    156,730GitHubView on GitHub↗

    Transformers is a comprehensive library for machine learning that provides a unified interface for training, fine-tuning, and deploying transformer-based models. It supports a wide range of tasks, including text classification, language modeling, question answering, and sequence-to-sequence translation, while offering

    Pythonaudiodeep-learningdeepseek
  • mlabonne/llm-course

    mlabonne/llm-course

    75,340GitHubView on GitHub↗

    This project is a comprehensive educational curriculum and engineering handbook focused on the lifecycle of large language models. It serves as a structured knowledge base for machine learning practitioners, covering the fundamental mathematical and architectural principles of transformer-based sequence modeling, as we

    courselarge-language-modelsllm
  • vllm-project/vllm

    vllm-project/vllm

    70,745GitHubView on GitHub↗

    vLLM is a high-throughput inference engine designed for the efficient serving and execution of large language models. It functions as a production-ready distributed model server, providing standard API protocols for online serving while also supporting offline batch processing. The system is built to maximize token gen

    Pythonamdblackwellcuda
  • hiyouga/LlamaFactory

    hiyouga/LlamaFactory

    67,386GitHubView on GitHub↗

    LlamaFactory is a unified framework for fine-tuning and adapting large language models. It provides a comprehensive platform that standardizes training workflows across diverse machine learning architectures, allowing users to execute both full-tuning and parameter-efficient methods through a single interface. The pro

    Pythonagentaideepseek
  • keras-team/keras

    keras-team/keras

    63,858GitHubView on GitHub↗

    Keras is a high-level deep learning framework designed for constructing and training neural networks through the composition of modular, functional layers. It serves as a comprehensive modeling toolkit that provides standardized procedures for defining, evaluating, and deploying complex architectures. By utilizing a di

    Pythondata-sciencedeep-learningjax
  • ultralytics/yolov5

    ultralytics/yolov5

    56,830GitHubView on GitHub↗

    YOLOv5 is a comprehensive computer vision framework designed for end-to-end deep learning, specializing in real-time object detection, image classification, and instance segmentation. It provides a unified toolkit that manages the entire lifecycle of a model, from initial dataset configuration and hyperparameter tuning

    Pythoncoremldeep-learningios
  • facebookresearch/segment-anything

    facebookresearch/segment-anything

    53,431GitHubView on GitHub↗

    This project provides a deep learning architecture designed to identify and isolate distinct objects within images by generating precise pixel-level masks. It functions as a browser-based inference engine, enabling the execution of complex machine learning models directly within web environments without requiring serve

    Jupyter Notebook
  • ultralytics/ultralytics

    ultralytics/ultralytics

    53,426GitHubView on GitHub↗

    Ultralytics is a comprehensive computer vision framework designed for training, validating, and deploying deep learning models across a wide range of visual recognition tasks. It provides a unified interface for core operations including object detection, instance segmentation, pose estimation, and image classification

    Pythonclicomputer-visiondeep-learning
  • unslothai/unsloth

    unslothai/unsloth

    52,461GitHubView on GitHub↗

    Unsloth is a high-performance training and inference platform designed to optimize the lifecycle of large language and multimodal models. It provides a comprehensive engine for fine-tuning, executing, and managing models locally, with a focus on reducing memory consumption and increasing compute speed on consumer-grade

    Pythonagentdeepseekdeepseek-r1

Explore sub-tags

  • Deployment OptimizationsMethods for refining models for production execution to improve performance and reduce resource consumption on target hardware.
  • Edge and Mobile Model OptimizationReduces model size and computational requirements through quantization and compression.
  • Hyperparameter OptimizationAutomated methods for searching and selecting the best configuration parameters for a model.
  • Model EnsemblingMethods that combine predictions from multiple models to improve overall accuracy and robustness.
  • Model Performance OptimizationMethods to enhance model speed and accuracy through techniques like quantization and hardware acceleration.
  • Model PruningThe process of removing redundant parameters from a neural network to reduce model size and computational requirements.
  • Model QuantizationTechniques and tools for reducing the memory footprint and computational requirements of neural networks to improve inference performance.
  • Parameter-Efficient Fine-Tuning1 sub-tagMethods for adapting models by updating a subset of parameters.
  • Performance ProfilersTools for measuring execution speed, memory usage, and accuracy metrics of models.
  • Quantization MethodsTechniques for reducing the precision of model weights to decrease memory usage and accelerate inference.
  • Quantization Plugin InterfacesExtensible interfaces that allow developers to register custom quantization methods.
  • Quantized AdaptersLow-precision weight updates for efficient fine-tuning.
  • Web Model OptimizersCompressing and converting models for efficient browser deployment.