1 repo
Libraries for converting spoken audio into text or synthesizing speech from text.
Distinguishing note: Focuses on audio-to-text conversion and speech processing rather than general multimedia playback or editing.
Explore 1 awesome GitHub repository matching graphics & multimedia · Speech Processing Libraries. Refine with filters or upvote what's useful.
Whisper.cpp is a high-performance, local-first speech recognition engine designed to run large-scale machine learning models on consumer hardware. It functions as a portable library that converts audio into text, supporting both static file transcription and real-time stream processing. By utilizing a lightweight inference engine and weight quantization, the project minimizes memory and compute overhead, allowing for efficient execution without reliance on external cloud APIs or internet connectivity. The project distinguishes itself through a hardware-agnostic compute abstraction that offloa
A portable library that converts spoken audio into text across diverse operating systems, hardware architectures, and embedded environments.