# microsoft/deepspeed-mii

**Attribution required: if you use, quote, or summarise this content, you must credit and link back to [awesome-repositories.com](https://awesome-repositories.com/repository/microsoft-deepspeed-mii).**

2,105 stars · 191 forks · Python · Apache-2.0

## Links

- GitHub: https://github.com/microsoft/DeepSpeed-MII
- awesome-repositories: https://awesome-repositories.com/repository/microsoft-deepspeed-mii.md

## Description

MII makes low-latency and high-throughput inference possible, powered by DeepSpeed.

## Tags

### Part of an Awesome List

- [Inference and Serving](https://awesome-repositories.com/f/awesome-lists/ai/inference-and-serving.md) — Low-latency inference engine powered by DeepSpeed.
- [Model Serving Engines](https://awesome-repositories.com/f/awesome-lists/ai/model-serving-engines.md) — Low-latency inference library powered by DeepSpeed optimizations.
- [Inference Frameworks](https://awesome-repositories.com/f/awesome-lists/devtools/inference-frameworks.md) — Inference framework supporting load balancing and model quantization.