awesome-repositories.com
© 2026 Bringes Technology SRL·VAT RO45896025·hello@bringes.io
Quantization Strategies · Awesome GitHub Repositories

1 repo

Techniques for reducing the numerical precision of model weights and activations to optimize inference speed and memory usage.
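
As a rough illustration of what reducing numerical precision means, here is a minimal sketch of symmetric per-tensor int8 quantization in plain Python. It is not taken from any repository listed here; the function names `quantize_int8` and `dequantize_int8` and the sample weights are hypothetical, and real libraries typically quantize per channel or per block on tensors rather than on Python lists.

```python
def quantize_int8(weights):
    """Map floats to integers in [-127, 127] using one shared scale (assumed scheme)."""
    max_abs = max(abs(w) for w in weights)
    # Fall back to a scale of 1.0 so an all-zero input does not divide by zero.
    scale = max_abs / 127.0 if max_abs > 0 else 1.0
    quantized = [max(-127, min(127, round(w / scale))) for w in weights]
    return quantized, scale


def dequantize_int8(quantized, scale):
    """Recover approximate float values from the int8 codes."""
    return [v * scale for v in quantized]


# Hypothetical sample weights, for illustration only.
weights = [0.5, -1.0, 0.25, 0.0]
quantized, scale = quantize_int8(weights)
restored = dequantize_int8(quantized, scale)

for w, q, r in zip(weights, quantized, restored):
    print(f"{w:+.4f} -> {q:+4d} -> {r:+.4f}")
```

Each weight is stored in 8 bits instead of 32, at the cost of a rounding error of at most about half the scale per value.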

1 GitHub repository is listed under Artificial Intelligence & ML · Quantization Strategies.

Awesome Quantization Strategies GitHub Repositories

  • meta-llama/llama

    59,157 · GitHub

    Llama is a computational framework and runtime environment designed for executing transformer-based neural networks locally. It functions as a generative AI inference engine, enabling the processing of input sequences through pre-trained model weights to produce text completions and structured data outputs directly on

    Reduces numerical precision in model weights to lower memory footprint and accelerate inference on local devices.

    Python