Support for offloading model inference to the Apple Neural Engine using hardware-optimized model formats.
Distinguishing note: Specifically targets the Neural Engine on Apple silicon, as distinct from hardware acceleration for other vendors' chips.
Category: Artificial Intelligence & ML · Apple Hardware Acceleration (1 GitHub repository).
Whisper.cpp is a high-performance, local-first speech recognition engine designed to run large-scale machine learning models on consumer hardware. It functions as a portable library that converts audio into text, supporting both static file transcription and real-time stream processing. By utilizing a lightweight inference engine and weight quantization, the project minimizes memory and compute overhead, allowing for efficient execution without reliance on external cloud APIs or internet connectivity. The project distinguishes itself through a hardware-agnostic compute abstraction that offloa
On Apple hardware, the project offloads model inference to the Neural Engine, using specialized hardware-optimized model formats to speed up speech recognition.