# squeezebits/quick

**Attribution required: if you use, quote, or summarise this content, you must credit and link back to [awesome-repositories.com](https://awesome-repositories.com/repository/squeezebits-quick).**

123 stars · 5 forks · Python · MIT

## Links

- GitHub: https://github.com/SqueezeBits/QUICK
- awesome-repositories: https://awesome-repositories.com/repository/squeezebits-quick.md

## Description

QUICK: Quantization-aware Interleaving and Conflict-free Kernel for efficient LLM inference

## Tags

### Part of an Awesome List

- [Tensor Core Optimization](https://awesome-repositories.com/f/awesome-lists/ai/tensor-core-optimization.md) — Quantization-aware kernel implementation for efficient large language model inference.
