1 repo

Awesome GitHub RepositoriesDistributed Memory Optimizers

Techniques for partitioning model states across distributed hardware to enable training of large-scale models.

Distinguishing note: Focuses on memory footprint reduction through state partitioning, distinct from communication optimization.

Explore 1 awesome GitHub repository matching artificial intelligence & ml · Distributed Memory Optimizers. Refine with filters or upvote what's useful.

Find the best repos with AI.We'll search the best matching repositories with AI.

deepspeedai/DeepSpeed
deepspeedai/DeepSpeed
41,638View on GitHub
DeepSpeed is a high-performance library designed to scale deep learning model training and inference across massive clusters of GPUs and compute nodes. It provides a comprehensive suite of tools for distributed training, enabling the execution of models that exceed the memory capacity of single devices through advanced parameter partitioning, pipeline-based model parallelism, and memory-efficient state offloading. The framework distinguishes itself through specialized communication-efficient optimizers and hardware-aware acceleration techniques. By utilizing gradient compression, quantization
The framework partitions model states across available devices to reduce memory consumption and enable the training of massive models on distributed hardware.
Pythonbillion-parameterscompressiondata-parallelism
41,638View on GitHub