15 रिपॉजिटरी
Systems for allocating and updating static or dynamic GPU memory buffers.
Distinct from Data Buffering: Distinct from Data Buffering: focuses on GPU-specific memory allocation for graphics data rather than general flow control buffering.
Explore 15 awesome GitHub repositories matching data & databases · GPU Buffer Allocators. Refine with filters or upvote what's useful.
This project is a cross-platform graphics and compute framework that provides a unified, hardware-agnostic abstraction layer for rendering and parallel processing. It enables developers to build high-performance applications that execute consistently across diverse operating systems and hardware backends, including Vulkan, Metal, and DirectX. By mapping high-level graphics commands to native APIs, it serves as a portable foundation for both real-time 3D rendering and general-purpose GPU computing. The framework distinguishes itself through a robust architecture that supports both native deskt
Copies processed data from graphics memory back to system memory by mapping buffers for reading after rendering or compute operations complete.
bgfx is a cross-platform, graphics rendering abstraction layer designed for high-performance applications. It provides a unified interface that maps high-level rendering commands to native graphics APIs, allowing developers to maintain a single codebase that executes consistently across diverse operating systems and hardware architectures. The library distinguishes itself through a multi-threaded command submission model that decouples rendering logic from the main application thread, effectively minimizing CPU bottlenecks. It utilizes a backend-agnostic command buffer and a deferred resource
Allocates and updates vertex, index, and indirect draw buffers for the graphics processor.
This project serves as a comprehensive educational framework and reference library for mastering high-performance graphics programming and parallel compute resource management. It provides a collection of practical implementations designed to demonstrate the explicit control required by the Vulkan API, covering the fundamental mechanics of modern graphics pipelines and cross-platform hardware interaction. The repository distinguishes itself by focusing on the low-level architectural requirements of modern GPU development, including manual memory allocation, multi-threaded command recording, a
Consolidates multiple uniform blocks into a single buffer using dynamic offsets to reduce descriptor set overhead.
Sokol is a C hardware abstraction layer and cross-platform graphics library designed for managing windowing, input, and audio across different operating systems. It functions as a GPU resource manager and multimedia application framework, providing a unified API for rendering 2D and 3D graphics across WebGL, Metal, Direct3D, and OpenGL. The project is distinguished by its single-header implementation, which simplifies integration and portability. It utilizes a stateless render pass definition and a one-update-per-frame model to synchronize CPU data to GPU memory and manage resource lifecycles
Copies data from system memory to GPU buffers and images using a controlled one-update-per-frame model.
meshoptimizer is a 3D mesh optimization library designed to compress geometry and animation data, reduce memory usage, and improve rendering performance. It functions as a geometry simplification tool, a mesh compression codec, and a utility for preparing data for mesh shaders and raytracing acceleration. The library provides specialized toolsets for partitioning meshes into clusters and meshlets to optimize hardware-accelerated rendering pipelines. It also includes helpers for generating opacity micromaps and spatial clusters to increase the speed of hardware ray-triangle intersection tests.
Creates a special index buffer using provoking vertices to encode primitive IDs for efficient fragment shader access.
Magit is a complete Git interface that runs inside Emacs, providing a full-featured porcelain for version control operations without leaving the editor. It renders repository state as structured, collapsible sections within Emacs buffers, and manages Git command execution through a transactional process model with automatic buffer refresh and error handling. The interface exposes all configuration through Emacs' standard customization system and uses a transient command framework for context-sensitive menu-driven Git operations. What distinguishes Magit is its granular control over every stag
Automatically updates file-visiting buffers in Emacs when the underlying file changes on disk.
This project is a learning guide and collection of study notes designed to teach Node.js backend development. It provides a comprehensive core API reference and practical demonstrations for implementing server-side logic, network programming, and system APIs. The guide specifically covers advanced technical domains including process management for scaling applications via clusters and child processes, as well as network programming for building TCP, UDP, and HTTP services. It also includes detailed instructional material on security implementation, focusing on cryptographic hashing and encryp
Provides a utility to populate a buffer with a repeating value for initialization.
Isaac Lab is an open-source framework for training robot policies in physically simulated environments, supporting both single-agent and multi-agent reinforcement learning. It is built on an Omniverse-PhysX simulation backend that models rigid bodies, articulated systems, deformable objects, and sensors, and provides a task-based environment configuration system where each training environment is defined as a modular class specifying observation spaces, action spaces, reward functions, and termination conditions. The framework distinguishes itself through an RL-library abstraction layer that
Dynamically resizes physics simulation buffers to prevent overflow errors when collision counts exceed initial capacity.
regl is a declarative WebGL library that manages graphics state and GPU resources through functional commands instead of manual binding and state tracking. It provides a command-based drawing abstraction where shaders, attributes, and render state are encapsulated into reusable, compiled functions that can be executed efficiently. What sets regl apart is its scoped state inheritance system, which allows nested drawing commands to inherit and override render state from parent scopes for organized rendering. The library automatically recovers from GPU context loss by restoring buffer and textur
Modifies vertex buffer content each frame for animation and performs in-place subdata updates.
gfx is a hardware-agnostic graphics API abstraction that translates a unified set of graphics and compute commands into native instructions for multiple GPU drivers. It provides a common interface for cross-platform rendering and general-purpose GPU compute programming. The project features an intermediate-representation shader translation system that converts source code and SPIR-V into target-specific languages. It employs a data-driven reference test framework to verify that graphics output remains consistent across different hardware platforms. Capabilities include parallel command buffe
Provides utilities to update contents of GPU-resident buffers and textures without manual staging buffer management.
This project provides a comprehensive technical guide and framework for engineering large-scale machine learning systems. It covers the full lifecycle of model development, focusing on the infrastructure and computational principles required to build, train, and serve generative AI models across distributed GPU clusters. The repository distinguishes itself by offering deep-dive tutorials and implementation strategies for complex system challenges. It emphasizes high-performance architectural primitives, such as collective communication orchestration, distributed tensor sharding, and static gr
Uses pre-allocated memory buffers with stable addresses to pass data into static execution graphs.
Orillusion ब्राउज़र में उच्च-निष्ठा दृश्यों और विज़ुअल प्रभावों के लिए डिज़ाइन किया गया एक WebGPU 3D रेंडरिंग इंजन है। यह समानांतर गणितीय संचालन के लिए एक GPU कंप्यूट फ्रेमवर्क और यथार्थवादी सामग्री और सतहों के लिए एक भौतिक-आधारित रेंडरिंग ग्राफिक्स पाइपलाइन के रूप में कार्य करता है। सिस्टम में कंकाल एनिमेशन को चलाने और वर्टेक्स स्थितियों को इंटरपोलेट करने के लिए एक वेब-आधारित 3D एनिमेशन टूलकिट भी शामिल है। इंजन दृश्य लॉजिक के लिए एक एंटिटी घटक सिस्टम और एक मैक्रो-आधारित शेडर जनरेशन सिस्टम के उपयोग से प्रतिष्ठित है जो कई शेडर वेरिएंट बनाता है। यह CPU और GPU के बीच डेटा एक्सचेंज ओवरहेड को कम करने के लिए मेमोरी-मैप्ड बफ़र प्रबंधन के माध्यम से प्रदर्शन को अनुकूलित करता है। यह प्रोजेक्ट क्लस्टर्ड फॉरवर्ड रेंडरिंग, प्रोब-आधारित ग्लोबल इल्यूमिनेशन, और वॉल्यूमेट्रिक गॉड रेज़ और डेप्थ ऑफ़ फील्ड जैसे प्रभावों के साथ एक पोस्ट-प्रोसेसिंग पाइपलाइन सहित ग्राफिक्स क्षमताओं की एक विस्तृत श्रृंखला को कवर करता है। यह कणों और तरल पदार्थों के लिए GPU-त्वरित भौतिकी, टक्कर का पता लगाने, और हार्डवेयर-त्वरित यूजर इंटरफेस को और एकीकृत करता है।
Manages the allocation and association of uniform and storage buffers for GPU compute operations.
TileLang is a Python-embedded domain-specific language compiler that JIT-compiles and autotunes GPU kernels. It uses a tile-based DSL, automatic software pipelining, and parallel autotuning to generate optimized GPU kernels at runtime. It supports tensor core operations with Pythonic syntax, automatic memory management, and thread mapping. The compiler searches over tile sizes, thread counts, and scheduling policies, compiling and benchmarking candidates in parallel to find the fastest kernel. It also caches compiled binaries and tuning results to disk for reuse across sessions. TileLang inc
TVM's feature to allocate a single-element buffer in local or variable memory, optionally with an initial value.
FlashInfer is a library of high-performance GPU kernels purpose-built for accelerating large language model inference. It provides optimized implementations for attention operations (including flash attention, page attention, multi-head latent attention, and cascade attention) using paged key-value caches, fused kernel composition, and just-in-time compilation. The library also includes specialized kernels for mixture-of-experts layers, block-scaled low-precision quantization (FP8, FP4), and distributed collective communication. What distinguishes FlashInfer is its fused all-reduce communicat
Initializes device buffers for distributed synchronization protocols in multi-GPU inference.
TypeGPU is a tool for type-safe WebGPU development that enables writing shaders in TypeScript. It translates high-level TypeScript function definitions and structures into WebGPU Shading Language source code to automate shader generation and validate logic using a type system. The project provides a mechanism for cross-library GPU interoperability by sharing typed buffers without copying data to system memory. It also integrates the Model Context Protocol to allow AI agents to inspect generated shader code and diagnose runtime errors. The system manages WebGPU resource mapping through typed
Enforces typed read/write operations by calculating memory sizes and binary padding for GPU buffers.