1 repo
Tools and algorithms for identifying and segmenting audio recordings based on individual speaker identities.
Distinguishing note: This category focuses specifically on the task of speaker diarization within audio processing, distinct from general speech-to-text or broader audio analysis.
Explore 1 awesome GitHub repository matching artificial intelligence & ml · Speaker Diarization.
Whisper.cpp is a high-performance, local-first speech recognition engine designed to run large-scale machine learning models on consumer hardware. It functions as a portable library that converts audio into text, supporting both static file transcription and real-time stream processing. By utilizing a lightweight inference engine and weight quantization, the project minimizes memory and compute overhead, allowing for efficient execution without reliance on external cloud APIs or internet connectivity. The project distinguishes itself through a hardware-agnostic compute abstraction that offloa
Identifies and labels distinct speakers within audio recordings to organize transcripts by individual participants.