What are the best open-source alternatives to DeepSpeed?

30 open-source projects similar to deepspeedai/deepspeed, ranked by shared features. Top picks: zhaochenyang20/awesome-ml-sys-tutorial, microsoft/deepspeed, huggingface/peft, axolotl-ai-cloud/axolotl, sgl-project/sglang, deepspeedai/deepspeedexamples, nvidia/megatron-lm, microsoft/deepspeedexamples, mosaicml/composer, pytorch/torchtune.

Is zhaochenyang20/awesome-ml-sys-tutorial a good alternative to DeepSpeed?

This project provides a comprehensive technical guide and framework for engineering large-scale machine learning systems. It covers the full lifecycle of model development, focusing on the infrastructure and computational principles required to build, train, and serve generative AI models across di…

Is microsoft/deepspeed a good alternative to DeepSpeed?

DeepSpeed is a distributed deep learning optimization library and framework designed for the training and inference of massive AI models. It serves as a model parallelism orchestrator and a toolkit for scaling large language models across multiple GPUs and compute nodes. The project distinguishes…

Is huggingface/peft a good alternative to DeepSpeed?

This library provides a framework for parameter-efficient fine-tuning, enabling the adaptation of large pretrained models by training only a small subset of parameters. It functions as a distributed model training system and optimization toolkit, designed to reduce the computational and memory requ…

Is axolotl-ai-cloud/axolotl a good alternative to DeepSpeed?

Axolotl is a configuration-driven framework designed for the fine-tuning, evaluation, and quantization of large language models. It functions as a comprehensive orchestrator for distributed training, enabling users to manage complex workflows across multi-node and multi-GPU environments. By utilizi…

Is sgl-project/sglang a good alternative to DeepSpeed?

Sglang is a high-performance inference engine and serving system designed for large language and multimodal models. It provides a programmable interface for orchestrating complex generation workflows, enabling developers to coordinate multi-turn dialogues, tool invocations, and reasoning chains thr…

Is deepspeedai/deepspeedexamples a good alternative to DeepSpeed?

DeepSpeedExamples is a collection of reference implementations and scripts for training, fine-tuning, and executing inference on large-scale AI models using DeepSpeed optimization. It provides a distributed model training guide and practical workflows for adapting large language models through memo…

Is nvidia/megatron-lm a good alternative to DeepSpeed?

Megatron-LM is a distributed transformer training library and large language model training framework designed to scale models across thousands of GPUs. It functions as a GPU-optimized deep learning toolkit and a scaling engine for mixture-of-experts architectures, enabling the training of models w…

Is microsoft/deepspeedexamples a good alternative to DeepSpeed?

DeepSpeedExamples is a collection of reference implementations for training and deploying large scale AI models using the DeepSpeed optimization library. It provides Python code examples for training massive models across multiple GPUs through distributed optimization techniques. The repository in…

Is mosaicml/composer a good alternative to DeepSpeed?

Composer is a PyTorch distributed training framework designed for scaling large-scale models across multi-node GPU clusters. It functions as a large language model trainer, a distributed model optimizer, and a training lifecycle manager. The project differentiates itself as a deep learning regular…

Is pytorch/torchtune a good alternative to DeepSpeed?

Torchtune is a PyTorch-native library for fine-tuning, aligning, and quantizing large language models. It provides a configurable training pipeline orchestrated through YAML recipes, with CLI overrides and component swapping, distributed training via FSDP2, memory optimizations, and parameter-effic…

Back to deepspeedai/deepspeed

Open-source alternatives to DeepSpeed

30 open-source projects similar to deepspeedai/deepspeed, ranked by how many features they have in common. Compare stars, activity and what each one does to find the best DeepSpeed alternative.

zhaochenyang20/awesome-ml-sys-tutorial
zhaochenyang20/Awesome-ML-SYS-Tutorial
5,371View on GitHub
This project provides a comprehensive technical guide and framework for engineering large-scale machine learning systems. It covers the full lifecycle of model development, focusing on the infrastructure and computational principles required to build, train, and serve generative AI models across distributed GPU clusters. The repository distinguishes itself by offering deep-dive tutorials and implementation strategies for complex system challenges. It emphasizes high-performance architectural primitives, such as collective communication orchestration, distributed tensor sharding, and static gr
Python
View on GitHub5,371
microsoft/deepspeed
microsoft/DeepSpeed
42,533View on GitHub
DeepSpeed is a distributed deep learning optimization library and framework designed for the training and inference of massive AI models. It serves as a model parallelism orchestrator and a toolkit for scaling large language models across multiple GPUs and compute nodes. The project distinguishes itself through 3D parallelism orchestration, which combines data, pipeline, and tensor parallelism. It utilizes ZeRO-based memory partitioning to eliminate redundant storage and employs CPU-offload memory management to move weights and optimizer states to system RAM. Additionally, it provides special
Python
View on GitHub42,533
huggingface/peft
huggingface/peft
21,274View on GitHub
This library provides a framework for parameter-efficient fine-tuning, enabling the adaptation of large pretrained models by training only a small subset of parameters. It functions as a distributed model training system and optimization toolkit, designed to reduce the computational and memory requirements typically associated with full model fine-tuning. The project distinguishes itself through a suite of methods for modular adapter composition, including low-rank matrix decomposition and activation-based scaling. It supports the integration of multiple task-specific adapter modules, allowin
Pythonadapterdiffusionfine-tuning
View on GitHub21,274

Open-source alternatives to DeepSpeed

zhaochenyang20/Awesome-ML-SYS-Tutorial

microsoft/DeepSpeed

huggingface/peft

axolotl-ai-cloud/axolotl

sgl-project/sglang

deepspeedai/DeepSpeedExamples

NVIDIA/Megatron-LM

microsoft/DeepSpeedExamples

mosaicml/composer

pytorch/torchtune

dmlc/xgboost

facebookresearch/fairseq

OpenRLHF/OpenRLHF

microsoft/unilm

huggingface/accelerate

hpcaitech/ColossalAI

apache/mxnet

tensorflow/nmt

PaddlePaddle/Paddle

PaddlePaddle/PaddleDetection

microsoft/Swin-Transformer

horovod/horovod

d2l-ai/d2l-en

zihangdai/xlnet

dusty-nv/jetson-inference

karpathy/llm.c

facebookresearch/flashlight

pytorch/torchtitan

Dao-AILab/flash-attention

huggingface/pytorch-image-models