What are the best open-source alternatives to DeepSpeedExamples?

30 open-source projects similar to deepspeedai/DeepSpeedExamples, ranked by shared features. Top picks: microsoft/DeepSpeedExamples, EleutherAI/gpt-neox, artidoro/qlora, OpenBMB/MiniCPM, facebookresearch/fairseq, microsoft/ai-edu, ymcui/Chinese-LLaMA-Alpaca, dusty-nv/jetson-inference, InternLM/xtuner, microsoft/DeepSpeed.

Is microsoft/deepspeedexamples a good alternative to DeepSpeedExamples?

DeepSpeedExamples is a collection of reference implementations for training and deploying large-scale AI models using the DeepSpeed optimization library. It provides Python code examples for training massive models across multiple GPUs through distributed optimization techniques. The repository in…

Is eleutherai/gpt-neox a good alternative to DeepSpeedExamples?

gpt-neox is a distributed training system and framework for building large-scale autoregressive language models. It implements the transformer architecture and provides a toolkit for training models with billions of parameters by distributing weights across compute clusters. The framework distingu…

Is artidoro/qlora a good alternative to DeepSpeedExamples?

This project is a quantized fine-tuning framework for large language models. It implements a low-rank adaptation library and a four-bit quantizer to reduce the GPU memory requirements needed to train large models. The framework utilizes four-bit quantization and low-rank adapters to enable model t…

Is openbmb/minicpm a good alternative to DeepSpeedExamples?

MiniCPM is a collection of small language models designed for local, on-device deployment in resource-constrained environments. The project focuses on running dense Transformer models on consumer hardware, including GPUs, CPUs, and Apple Silicon, without requiring custom code forks. The project di…

Is facebookresearch/fairseq a good alternative to DeepSpeedExamples?

Fairseq is a PyTorch toolkit for sequence-to-sequence modeling, specializing in neural machine translation, automatic speech recognition, and large-scale language model training. It provides a framework for processing and aligning diverse data sources, including text, audio, and video, to support t…

Is microsoft/ai-edu a good alternative to DeepSpeedExamples?

ai-edu is a comprehensive AI education curriculum and machine learning courseware collection. It provides theoretical tutorials, deep learning lab exercises, and project blueprints designed to teach artificial intelligence fundamentals through a combination of study and practical implementation. T…

Is ymcui/chinese-llama-alpaca a good alternative to DeepSpeedExamples?

This project is a comprehensive toolkit for adapting large language models to the Chinese language, providing a specialized framework for fine-tuning, inference, and local deployment. It serves as a coordinated suite for language-specific adaptation, including tools for expanding tokenizers and imp…

Is dusty-nv/jetson-inference a good alternative to DeepSpeedExamples?

jetson-inference is a set of libraries and tools for executing optimized deep learning models on embedded GPU hardware. Its primary purpose is to enable real-time computer vision and AI inference at the edge with low latency and high throughput. The project distinguishes itself through high-perfor…

Is internlm/xtuner a good alternative to DeepSpeedExamples?

xtuner is a comprehensive training engine for large language models, offering a toolkit for pre-training, supervised fine-tuning, and the optimization of vision-language multimodal models. It serves as a distributed training accelerator and a specialized framework for scaling Mixture-of-Experts mod…

Is microsoft/deepspeed a good alternative to DeepSpeedExamples?

DeepSpeed is a distributed deep learning optimization library and framework designed for the training and inference of massive AI models. It serves as a model parallelism orchestrator and a toolkit for scaling large language models across multiple GPUs and compute nodes. The project distinguishes…


Open-source alternatives to DeepSpeedExamples

30 open-source projects similar to deepspeedai/deepspeedexamples, ranked by how many features they have in common. Compare stars, activity and what each one does to find the best DeepSpeedExamples alternative.

microsoft/DeepSpeedExamples
Stars: 6,822
Language: Python
DeepSpeedExamples is a collection of reference implementations for training and deploying large-scale AI models using the DeepSpeed optimization library. It provides Python code examples for training massive models across multiple GPUs through distributed optimization techniques. The repository includes optimized patterns for deploying and running large language model predictions in production environments. It also serves as a guide for model compression to reduce memory footprints and as a source for performance benchmarks to measure execution speed and resource utilization. The project cov…

EleutherAI/gpt-neox
Stars: 7,392
Language: Python
Topics: deepspeed-library, gpt-3, language-model
gpt-neox is a distributed training system and framework for building large-scale autoregressive language models. It implements the transformer architecture and provides a toolkit for training models with billions of parameters by distributing weights across compute clusters. The framework distinguishes itself through extensive support for distributed model parallelism, including pipeline and sequence parallelism, to overcome single-device memory limits. It further supports sparse model architectures using a mixture of experts system with Sinkhorn-based routing. The project covers a broad ran…

artidoro/qlora
Stars: 10,929
Language: Jupyter Notebook
This project is a quantized fine-tuning framework for large language models. It implements a low-rank adaptation library and a four-bit quantizer to reduce the GPU memory requirements needed to train large models. The framework utilizes four-bit quantization and low-rank adapters to enable model training on consumer-grade hardware. It further reduces the memory footprint through double quantization and a paged optimizer that offloads states to system RAM. The system supports distributed training across multiple GPUs to handle larger parameter scales and includes utilities for custom dataset…

Open-source alternatives to DeepSpeedExamples

microsoft/DeepSpeedExamples

EleutherAI/gpt-neox

artidoro/qlora

OpenBMB/MiniCPM

facebookresearch/fairseq

microsoft/ai-edu

ymcui/Chinese-LLaMA-Alpaca

dusty-nv/jetson-inference

InternLM/xtuner

microsoft/DeepSpeed

Infrasys-AI/AISystem

microsoft/Swin-Transformer

dmlc/dgl

deepspeedai/DeepSpeed

mosaicml/llm-foundry

ml-explore/mlx-examples

huggingface/smollm

PaddlePaddle/PaddleNLP

pytorch/fairseq

apple/corenet

facebookresearch/mae

Microsoft/CNTK

huggingface/accelerate

OpenRLHF/OpenRLHF

NVIDIA/Megatron-LM

open-mmlab/mmagic

facebookresearch/metaseq

modelscope/modelscope

modelscope/swift

zyds/transformers-code