2 repos
Techniques and frameworks for training machine learning models across multiple computing nodes or parallel processing units.
Explore 2 awesome GitHub repositories matching artificial intelligence & ml · Distributed Learning. Refine with filters or upvote what's useful.
TensorFlow is a comprehensive machine learning framework designed for the construction, training, and deployment of complex mathematical models. It utilizes a graph-based execution model that represents operations as directed acyclic graphs, enabling automatic differentiation and efficient parallel processing. The syst
Coordinates decentralized training across network nodes to preserve data privacy while aggregating model updates.
PyTorch is a machine learning framework centered on a GPU-ready tensor library that supports multi-dimensional array operations across both CPU and accelerator hardware. It provides a foundational infrastructure for mathematical computation and dynamic neural network construction, utilizing a tape-based automatic diffe
Shards model parameters, gradients, and optimizer states across processes to enable memory-efficient distributed training.