What are the best open-source alternatives to Learn CUDA Programming?

30 open-source projects similar to packtpublishing/learn-cuda-programming, ranked by shared features. Top picks: nvidia/cuda-samples, srush/gpu-puzzles, tensorflow/rust, infrasys-ai/aisystem, xtensor-stack/xtensor, triton-lang/triton, dask/dask, jax-ml/jax, exaloop/codon, cpp-taskflow/cpp-taskflow.

Is nvidia/cuda-samples a good alternative to Learn CUDA Programming?

This repository is a collection of reference implementations and programming examples for the CUDA Toolkit. It serves as a GPGPU implementation guide and a parallel computing reference, providing code for using graphics hardware to perform general-purpose calculations and high-performance parallel…

Is srush/gpu-puzzles a good alternative to Learn CUDA Programming?

GPU-Puzzles is an interactive learning environment and tutorial designed for mastering CUDA GPU kernel development. It serves as an educational tool and lab where users solve coding puzzles to understand how to map high-level logic to low-level GPU hardware instructions. The platform focuses on te…

Is tensorflow/rust a good alternative to Learn CUDA Programming?

This project provides Rust bindings for the TensorFlow C API, serving as a tensor computation interface and machine learning library. It enables the construction and execution of machine learning models and neural networks by bridging a systems language to high-performance backends. The framework…

Is infrasys-ai/aisystem a good alternative to Learn CUDA Programming?

AISystem is a comprehensive AI full-stack infrastructure project covering the entire pipeline from AI chip architecture to high-level training frameworks. It encompasses the development of AI compiler frameworks, inference engines, and distributed training orchestrators designed to coordinate workl…

Is xtensor-stack/xtensor a good alternative to Learn CUDA Programming?

xtensor is a C++ multidimensional array library for numerical computing that provides N-dimensional containers with an interface mirroring the NumPy API. It utilizes a lazy evaluation expression engine to defer numerical computations until assignment, which minimizes memory allocations and intermed…

Is triton-lang/triton a good alternative to Learn CUDA Programming?

Triton is a parallel computing framework and high-level programming language designed for writing custom compute kernels. It functions as a deep learning compiler, translating complex mathematical operations into high-throughput instructions that maximize hardware utilization and memory efficiency…

Is dask/dask a good alternative to Learn CUDA Programming?

Dask is a parallel computing framework and distributed task scheduler designed to scale Python data science workflows from single machines to large clusters. It functions as a cluster resource manager that orchestrates computational logic by representing tasks and their dependencies as directed acy…

Is jax-ml/jax a good alternative to Learn CUDA Programming?

This project is a high-performance numerical computing library designed for large-scale scientific and machine learning workloads. It functions as an automatic differentiation framework and a just-in-time compilation engine, transforming high-level Python code into optimized machine instructions. B…

Is exaloop/codon a good alternative to Learn CUDA Programming?

Codon is an LLVM-based Python compiler and statically typed implementation that translates source code into optimized machine instructions. It functions as a high-performance numerical backend and a GPU computing framework designed to remove runtime overhead. The project implements a compiled alte…

Is cpp-taskflow/cpp-taskflow a good alternative to Learn CUDA Programming?

Cpp-taskflow is a C++ task-parallelism framework and task graph scheduler designed to manage and execute complex dependency graphs of parallel tasks across CPU and GPU hardware. It provides a parallel algorithm library for high-performance implementations of reductions, sorts, pipelines, and iterat…

Back to packtpublishing/learn-cuda-programming

Open-source alternatives to Learn CUDA Programming

30 open-source projects similar to packtpublishing/learn-cuda-programming, ranked by how many features they have in common. Compare stars, activity and what each one does to find the best Learn CUDA Programming alternative.

nvidia/cuda-samples
NVIDIA/cuda-samples
9,319View on GitHub
This repository is a collection of reference implementations and programming examples for the CUDA Toolkit. It serves as a GPGPU implementation guide and a parallel computing reference, providing code for using graphics hardware to perform general-purpose calculations and high-performance parallel processing. The project provides specific samples for GPU kernel development and resource management. These include demonstrations of multi-GPU communication, peer-to-peer memory access, and system hardware inspection to coordinate distributed GPU resources. The codebase covers a wide range of capa
C++cudacuda-driver-apicuda-kernels
View on GitHub9,319
srush/gpu-puzzles
srush/GPU-Puzzles
12,242View on GitHub
GPU-Puzzles is an interactive learning environment and tutorial designed for mastering CUDA GPU kernel development. It serves as an educational tool and lab where users solve coding puzzles to understand how to map high-level logic to low-level GPU hardware instructions. The platform focuses on teaching parallel computing concepts and GPU architecture. Users practice developing parallel algorithms and managing GPU memory through a series of hands-on challenges. The environment utilizes a bridge between Python and CUDA to execute kernels and provide real-time feedback by validating outputs ag
Jupyter Notebookcudamachine-learningpuzzles
View on GitHub12,242
tensorflow/rust
tensorflow/rust
5,480View on GitHub
This project provides Rust bindings for the TensorFlow C API, serving as a tensor computation interface and machine learning library. It enables the construction and execution of machine learning models and neural networks by bridging a systems language to high-performance backends. The framework supports GPU-accelerated computing to increase the speed of model training and inference by offloading mathematical operations to graphics processing units. It offers both graph-based computation for defining static network architectures and an eager execution mode for immediate operation calls durin
Rust
View on GitHub5,480

Open-source alternatives to Learn CUDA Programming

NVIDIA/cuda-samples

srush/GPU-Puzzles

tensorflow/rust

Infrasys-AI/AISystem

xtensor-stack/xtensor

triton-lang/triton

dask/dask

jax-ml/jax

exaloop/codon

cpp-taskflow/cpp-taskflow

braydie/HowToBeAProgrammer

tile-ai/tilelang

ispc/ispc

HigherOrderCO/HVM2

h2oai/h2o-3

arrayfire/arrayfire

Jounce/Surge

boostorg/boost

google-deepmind/mctx

Microsoft/napajs

taskflow/taskflow

numba/numba

FFmpeg/asm-lessons

gpujs/gpu.js

HigherOrderCO/Bend

mistralai/mistral-inference

databricks/scala-style-guide

JuanitoFatas/fast-ruby

gorgonia/gorgonia

chainer/chainer