awesome-repositories.comBlog

© 2026 Bringes Technology SRL·VAT RO45896025·hello@awesome-repositories.com

MCP Blog Curated searches Sitemap Privacy Terms

KVQuant | Awesome Repository

SqueezeAILabKVQuant

0

View on GitHub↗

427 stars·46 forks·Python·0 viewsarxiv.org/abs/2401.18079↗

KVQuant

Features

Attention Optimization - Quantizes KV cache to support extremely long context lengths.

AI search

Explore more awesome repositories

Describe what you need in plain English — the AI ranks thousands of curated open-source projects by relevance.

Start searching with AI

[NeurIPS 2024] KVQuant: Towards 10 Million Context Length LLM Inference with KV Cache Quantization