Libraries that leverage native processor instructions and hardware-specific features to maximize computational throughput.
Distinguishing note: Focuses on using native instructions for packed integer arithmetic rather than on general-purpose systems programming.
1 GitHub repository in Operating Systems & Systems Programming · Hardware-Specific Accelerators.
BitNet is a quantized inference engine designed to execute highly compressed language models by performing arithmetic on low-precision, bit-level weight data. It functions as a model optimization toolkit and a high-performance kernel library, enabling the execution of large language models on consumer hardware by reducing memory footprints and increasing processing speeds. The project distinguishes itself through hardware-specific kernel optimizations that leverage native processor instructions to accelerate matrix multiplication. By utilizing packed integer arithmetic and memory-aligned weig
Perform efficient integer arithmetic on packed weights by using native hardware dot-product instructions to increase computational density on supported graphics processing units.
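To make the idea of a packed dot product concrete, here is a minimal Python sketch that emulates what a 4-way 8-bit dot-product-and-accumulate instruction computes (CUDA's `__dp4a` intrinsic is one example of this kind of instruction). The listing does not say which instruction or packing layout the project uses, so the helper names (`pack_int8x4`, `dot4_accumulate`, `packed_dot`) and the four-lanes-per-word layout are assumptions made for illustration only.

```python
import struct


def pack_int8x4(values):
    """Pack four signed 8-bit integers into one 32-bit word (little-endian)."""
    if len(values) != 4:
        raise ValueError("expected exactly four values")
    # struct raises struct.error if a value is outside -128..127.
    return struct.unpack("<I", struct.pack("<4b", *values))[0]


def unpack_int8x4(word):
    """Inverse of pack_int8x4: split a 32-bit word into four signed bytes."""
    return list(struct.unpack("<4b", struct.pack("<I", word)))


def dot4_accumulate(a_word, b_word, acc=0):
    """Emulate a 4-lane int8 dot product with accumulation.

    Returns acc + sum(a[i] * b[i] for i in 0..3). Real hardware accumulates
    in a 32-bit register; Python integers do not overflow, so wraparound is
    not modelled here.
    """
    a = unpack_int8x4(a_word)
    b = unpack_int8x4(b_word)
    return acc + sum(x * y for x, y in zip(a, b))


def packed_dot(weights, activations):
    """Dot product of two equal-length int8 sequences, four lanes per step."""
    if len(weights) != len(activations) or len(weights) % 4 != 0:
        raise ValueError("inputs must have equal length, a multiple of four")
    acc = 0
    for i in range(0, len(weights), 4):
        w_word = pack_int8x4(weights[i:i + 4])
        x_word = pack_int8x4(activations[i:i + 4])
        acc = dot4_accumulate(w_word, x_word, acc)
    return acc


if __name__ == "__main__":
    # Hypothetical low-precision weights (values in {-1, 0, 1}) and int8 activations.
    weights = [1, -1, 0, 1, -1, 0, 1, 1]
    activations = [10, 20, 30, 40, 50, 60, 70, 80]
    print(packed_dot(weights, activations))
```

On hardware, each `dot4_accumulate` step corresponds to a single instruction that processes four multiply-adds at once, which is where the gain in computational density comes from. The emulation above only shows the arithmetic, not the speedup.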