# intel/neural-compressor

**Attribution required: if you use, quote, or summarise this content, you must credit and link back to [awesome-repositories.com](https://awesome-repositories.com/repository/intel-neural-compressor).**

2,585 stars · 296 forks · Python · apache-2.0

## Links

- GitHub: https://github.com/intel/neural-compressor
- Homepage: https://intel.github.io/neural-compressor/
- awesome-repositories: https://awesome-repositories.com/repository/intel-neural-compressor.md

## Topics

`auto-tuning` `awq` `fp4` `gptq` `int4` `int8` `knowledge-distillation` `large-language-models` `low-precision` `mxformat` `post-training-quantization` `pruning` `quantization` `quantization-aware-training` `smoothquant` `sparsegpt` `sparsity`

## Tags

### Part of an Awesome List

- [Model Optimization](https://awesome-repositories.com/f/awesome-lists/devops/model-optimization.md) — Toolkit for model compression, pruning, and distillation.
