What are the best open-source alternatives to DeepSpeed?

30 open-source projects similar to microsoft/deepspeed, ranked by shared features. Top picks: nvidia/megatron-lm, zhaochenyang20/awesome-ml-sys-tutorial, sgl-project/sglang, hpcaitech/colossalai, horovod/horovod, facebookresearch/metaseq, deepspeedai/deepspeed, eleutherai/gpt-neox, openaccess-ai-collective/axolotl, xai-org/grok-1.

Is nvidia/megatron-lm a good alternative to DeepSpeed?

Megatron-LM is a distributed transformer training library and large language model training framework designed to scale models across thousands of GPUs. It functions as a GPU-optimized deep learning toolkit and a scaling engine for mixture-of-experts architectures, enabling the training of models w…

Is zhaochenyang20/awesome-ml-sys-tutorial a good alternative to DeepSpeed?

This project provides a comprehensive technical guide and framework for engineering large-scale machine learning systems. It covers the full lifecycle of model development, focusing on the infrastructure and computational principles required to build, train, and serve generative AI models across di…

Is sgl-project/sglang a good alternative to DeepSpeed?

Sglang is a high-performance inference engine and serving system designed for large language and multimodal models. It provides a programmable interface for orchestrating complex generation workflows, enabling developers to coordinate multi-turn dialogues, tool invocations, and reasoning chains thr…

Is hpcaitech/colossalai a good alternative to DeepSpeed?

ColossalAI is a distributed deep learning framework designed for training and deploying massive artificial intelligence models across clusters of hardware accelerators. It functions as a parallel computing engine that partitions model workloads and data across multiple processors to maximize memory…

Is horovod/horovod a good alternative to DeepSpeed?

Horovod is a distributed deep learning framework and gradient synchronizer designed to scale model training across multiple GPUs and compute nodes. It functions as a distributed training orchestrator and an elastic training engine, utilizing an MPI collective communication library to synchronize we…

Is facebookresearch/metaseq a good alternative to DeepSpeed?

Metaseq is a transformer sequence modeling toolkit designed for training, fine-tuning, and deploying sequence-to-sequence models using open pre-trained weights. It provides a comprehensive framework for large language model training, including dedicated tools for sequence dataset processing and a s…

Is deepspeedai/deepspeed a good alternative to DeepSpeed?

DeepSpeed is a high-performance library designed to scale deep learning model training and inference across massive clusters of GPUs and compute nodes. It provides a comprehensive suite of tools for distributed training, enabling the execution of models that exceed the memory capacity of single dev…

Is eleutherai/gpt-neox a good alternative to DeepSpeed?

gpt-neox is a distributed training system and framework for building large-scale autoregressive language models. It implements the transformer architecture and provides a toolkit for training models with billions of parameters by distributing weights across compute clusters. The framework distingu…

Is openaccess-ai-collective/axolotl a good alternative to DeepSpeed?

Axolotl is a distributed training orchestrator and fine-tuning framework for large language models, multimodal systems, and quantized models. It provides a structured environment for specializing pre-trained models through full parameter updates or low-rank adaptation, as well as aligning model out…

Is xai-org/grok-1 a good alternative to DeepSpeed?

Grok-1 is an open-weights large language model implementation featuring a sparse mixture-of-experts architecture. It is designed for high-performance text generation and natural language processing by activating only a subset of specialized expert layers per token. The model utilizes 8-bit weight…

Back to microsoft/deepspeed

Open-source alternatives to DeepSpeed

30 open-source projects similar to microsoft/deepspeed, ranked by how many features they have in common. Compare stars, activity and what each one does to find the best DeepSpeed alternative.

nvidia/megatron-lm
NVIDIA/Megatron-LM
16,731View on GitHub
Megatron-LM is a distributed transformer training library and large language model training framework designed to scale models across thousands of GPUs. It functions as a GPU-optimized deep learning toolkit and a scaling engine for mixture-of-experts architectures, enabling the training of models with hundreds of billions of parameters. The project implements multi-dimensional model parallelism, combining tensor, pipeline, data, expert, and context-based workload distribution. It specifically optimizes mixture-of-experts architectures through integrated memory and communication improvements t
Python
View on GitHub16,731
zhaochenyang20/awesome-ml-sys-tutorial
zhaochenyang20/Awesome-ML-SYS-Tutorial
5,371View on GitHub
This project provides a comprehensive technical guide and framework for engineering large-scale machine learning systems. It covers the full lifecycle of model development, focusing on the infrastructure and computational principles required to build, train, and serve generative AI models across distributed GPU clusters. The repository distinguishes itself by offering deep-dive tutorials and implementation strategies for complex system challenges. It emphasizes high-performance architectural primitives, such as collective communication orchestration, distributed tensor sharding, and static gr
Python
View on GitHub5,371
sgl-project/sglang
sgl-project/sglang
29,079View on GitHub
Sglang is a high-performance inference engine and serving system designed for large language and multimodal models. It provides a programmable interface for orchestrating complex generation workflows, enabling developers to coordinate multi-turn dialogues, tool invocations, and reasoning chains through a domain-specific language. The platform is built to support production-scale deployments, offering an OpenAI-compatible API that allows for integration with existing application ecosystems. The system distinguishes itself through a disaggregated architecture that separates compute-intensive pr
Pythonattentionblackwellcuda
View on GitHub29,079

Open-source alternatives to DeepSpeed

NVIDIA/Megatron-LM

zhaochenyang20/Awesome-ML-SYS-Tutorial

sgl-project/sglang

hpcaitech/ColossalAI

horovod/horovod

facebookresearch/metaseq

deepspeedai/DeepSpeed

EleutherAI/gpt-neox

OpenAccess-AI-Collective/axolotl

xai-org/grok-1

huggingface/peft

dusty-nv/jetson-inference

InternLM/xtuner

microsoft/DeepSpeedExamples

axolotl-ai-cloud/axolotl

OpenBMB/MiniCPM

huggingface/accelerate

timdettmers/bitsandbytes

hiyouga/LLaMA-Factory

unslothai/unsloth

deepspeedai/DeepSpeedExamples

Infrasys-AI/AISystem

NVIDIA-NeMo/NeMo

microsoft/Swin-Transformer

pytorch/examples

mosaicml/llm-foundry

ludwig-ai/ludwig

PaddlePaddle/ERNIE

FMInference/FlexGen

Infrasys-AI/AIInfra