←Backmicrosoft/DeepSpeed-MII0Copy as MarkdownView on GitHub↗2,105 stars·191 forks·Python·Apache-2.0·0 viewsDeepSpeed MIIFeaturesInference and Serving - Low-latency inference engine powered by DeepSpeed.Model Serving Engines - Low-latency inference library powered by DeepSpeed optimizations.Inference Frameworks - Inference framework supporting load balancing and model quantization.