awesome-repositories.com
© 2026 Bringes Technology SRL·VAT RO45896025·hello@bringes.io
MCPSitemapPrivacyTerms
Inference Optimizations · Awesome GitHub Repositories

1 repo

Awesome GitHub RepositoriesInference Optimizations

Techniques and mechanisms designed to reduce latency and increase throughput during the model inference phase.

Explore 1 awesome GitHub repository matching artificial intelligence & ml · Inference Optimizations. Refine with filters or upvote what's useful.

  1. Home
  2. Artificial Intelligence & ML
  3. Artificial Intelligence & Machine Learning
  4. Model Optimization Tools
  5. Inference Optimizations

Awesome Inference Optimizations GitHub Repositories

Describe the repository you're looking for…
We'll search the best matching repositories with AI.
  • huggingface/transformers

    huggingface/transformers

    156,730GitHubView on GitHub↗

    Transformers is a comprehensive library for machine learning that provides a unified interface for training, fine-tuning, and deploying transformer-based models. It supports a wide range of tasks, including text classification, language modeling, question answering, and sequence-to-sequence translation, while offering

    Pythonaudiodeep-learningdeepseek

Explore sub-tags

  • Batched Inference MechanismsMechanisms for processing multiple inputs simultaneously in a single forward pass.
  • Large Model OptimizationsTechniques like quantization and device mapping for large models.
  • Prompt Lookup DecodingDecoding optimization using n-gram matching from input prompts.