Why is jbranchaud/til a recommended Reductions GitHub Repositories repository?

Produces a sequence of all intermediate values generated during a collection reduction.

Why is nvidia/nccl a recommended Reductions GitHub Repositories repository?

NCCL performs a reduction across multiple sources and copies the resulting value to destinations in a single operation.

2 Repos

Awesome GitHub RepositoriesReductions

Reduces a sequence to a single value by applying a combining function from either the left or the right side.

Distinct from Sequence Transformations: Distinct from Sequence Transformations: focuses on folding/reducing operations that produce a single value, not mapping transformations.

Explore 2 awesome GitHub repositories matching data & databases · Reductions. Refine with filters or upvote what's useful.

Finde die besten Repos mit KI.Wir suchen mit KI nach den am besten passenden Repositories.

jbranchaud/til
jbranchaud/til
14,116Auf GitHub ansehen
This project is a markdown knowledge base used to maintain a curated collection of concise technical notes and write-ups across various programming languages and tools. It serves as a searchable personal reference library for documenting technical discoveries and software development patterns. The system implements a learning in public workflow, transforming markdown-based content storage into a static site. It utilizes directory-based routing to map folder structures to URL paths and employs schema-driven type generation to ensure data consistency across the knowledge base. The codebase cov
Produces a sequence of all intermediate values generated during a collection reduction.
Vim Script
Auf GitHub ansehen14,116
nvidia/nccl
NVIDIA/nccl
4,816Auf GitHub ansehen
NCCL ist eine Hochleistungs-Kommunikationsbibliothek und ein Framework für verteiltes GPU-Computing, das für die Ausführung kollektiver und Punkt-zu-Punkt-Datenaustausche über mehrere GPUs in Einzel- oder Multi-Node-Systemen entwickelt wurde. Es dient als RDMA-GPU-Transportschicht und Speicher-Orchestrator, der die hochbandbreitige Synchronisation von Daten und Modellgradienten für verteiltes GPU-Training und Inference erleichtert. Die Bibliothek zeichnet sich durch ihre Fähigkeit aus, Kommunikationsprimitive direkt aus GPU-Kernels auszuführen, wodurch die Host-CPU aus dem kritischen Pfad entfernt wird. Sie nutzt topologiebewusste Pfadauswahl zur Optimierung der Datenbewegung und verwendet RDMA-basierten Netzwerktransport, einschließlich InfiniBand und NVLink, um Zero-Copy-Speicherzugriffe zwischen Geräten über verschiedene physische Knoten hinweg zu ermöglichen. Das Projekt deckt eine breite Palette an kollektiven Kommunikationsmustern ab, darunter Reduktionen, Broadcasts, Gathers und All-to-All-Austausche, neben Punkt-zu-Punkt-Remote-Speicherzugriffen. Es bietet umfassendes Communicator-Management für die Initialisierung, Partitionierung und Größenanpassung von GPU-Gruppen sowie spezialisiertes Speichermanagement für das Registrieren von Buffern und das Koordinieren von gemeinsam genutztem Gerätespeicher. Das System enthält eine Suite von Monitoring- und Observability-Tools für Health-Tracking, diagnostisches Logging und Echtzeit-Ereignisüberwachung sowie Integrationsschnittstellen für Machine-Learning-Frameworks, CUDA-Graphs, MPI und Python.
NCCL performs a reduction across multiple sources and copies the resulting value to destinations in a single operation.
C++
Auf GitHub ansehen4,816

Awesome Reductions GitHub Repositories

jbranchaud/til

NVIDIA/nccl

Unter-Tags erkunden