What are the best open-source alternatives to Buzz?

30 open-source projects similar to chidiwilliams/buzz, ranked by shared features. Top picks: pipecat-ai/pipecat, cjpais/handy, livekit/livekit, k2-fsa/sherpa-onnx, argmaxinc/whisperkit, jamiepine/voicebox, ggml-org/whisper.cpp, m-bain/whisperx, thewh1teagle/vibe, beingpax/voiceink.

Is pipecat-ai/pipecat a good alternative to Buzz?

Pipecat is a framework and software development kit for building real-time multimodal AI agents and speech-to-speech systems. It utilizes a frame-based data pipeline to route audio, video, and text through a modular sequence of processors, enabling the orchestration of low-latency conversational AI…

Is cjpais/handy a good alternative to Buzz?

Handy is a local speech-to-text automation tool designed to convert spoken audio into text and inject it directly into active desktop applications. By running machine learning models entirely on the host hardware, it provides a private, offline-first environment for dictation and command execution.…

Is livekit/livekit a good alternative to Buzz?

LiveKit is a comprehensive framework for building and orchestrating real-time, multimodal AI agents that interact with users through voice, video, and text. It provides a centralized, event-driven architecture to manage the entire lifecycle of automated participants, from initialization and session…

Is k2-fsa/sherpa-onnx a good alternative to Buzz?

Sherpa-ONNX is an ONNX-based speech processing toolkit that provides a local speech recognition engine, an on-device voice synthesis tool, and a speaker identification framework. It is designed as a cross-platform speech API that enables speech-to-text, text-to-speech, and speaker verification task…

Is argmaxinc/whisperkit a good alternative to Buzz?

argmaxinc/whisperkit is an open-source alternative to Buzz.

Is jamiepine/voicebox a good alternative to Buzz?

Voicebox is a local speech processing system that provides text-to-speech generation, speech-to-text transcription, and voice cloning. It utilizes local machine learning inference and GPU acceleration to process audio and text data without relying on external API calls. The project features a voic…

Is ggml-org/whisper.cpp a good alternative to Buzz?

Whisper.cpp is a high-performance, local-first speech recognition engine designed to run large-scale machine learning models on consumer hardware. It functions as a portable library that converts audio into text, supporting both static file transcription and real-time stream processing. By utilizin…

Is m-bain/whisperx a good alternative to Buzz?

WhisperX is an automated speech recognition toolkit designed to convert spoken audio into text while maintaining precise synchronization with the original media. It functions as an integrated pipeline that combines transcription, phoneme-based alignment, and speaker diarization to produce structure…

Is thewh1teagle/vibe a good alternative to Buzz?

Vibe is a cross-platform transcription tool that converts spoken audio into text by running Whisper neural models directly on your device, with no cloud dependency. It can transcribe audio from files, microphones, system output, and network streams, and supports both batch processing of multiple fi…

Is beingpax/voiceink a good alternative to Buzz?

VoiceInk is a system-wide speech-to-text dictation tool that converts spoken audio into text using local or cloud AI models. It functions as a local AI transcription engine and a context-aware voice assistant, allowing users to insert transcribed text directly into any active application on the ope…

Back to chidiwilliams/buzz

Open-source alternatives to Buzz

30 open-source projects similar to chidiwilliams/buzz, ranked by how many features they have in common. Compare stars, activity and what each one does to find the best Buzz alternative.

pipecat-ai/pipecat
pipecat-ai/pipecat
12,846View on GitHub
Pipecat is a framework and software development kit for building real-time multimodal AI agents and speech-to-speech systems. It utilizes a frame-based data pipeline to route audio, video, and text through a modular sequence of processors, enabling the orchestration of low-latency conversational AI. The project is distinguished by its ability to coordinate complex multimodal services, including speech-to-text, language models, and text-to-speech, within a single pipeline. It features semantic voice activity detection for natural turn-taking, state-machine conversation flows for dialogue manag
Pythonaichatbot-frameworkchatbots
View on GitHub12,846
cjpais/handy
cjpais/Handy
15,515View on GitHub
Handy is a local speech-to-text automation tool designed to convert spoken audio into text and inject it directly into active desktop applications. By running machine learning models entirely on the host hardware, it provides a private, offline-first environment for dictation and command execution. The system functions as a background service that manages microphone input, transcription state, and text output, enabling hands-free typing across various software environments. The project distinguishes itself through a modular pipeline that integrates local language models for post-transcription
Rustaccessibilitycross-platformspeech-to-text
View on GitHub15,515
livekit/livekit
livekit/livekit
19,358View on GitHub
LiveKit is a comprehensive framework for building and orchestrating real-time, multimodal AI agents that interact with users through voice, video, and text. It provides a centralized, event-driven architecture to manage the entire lifecycle of automated participants, from initialization and session state management to graceful shutdown. By utilizing a selective forwarding unit, the platform efficiently routes media streams between participants and agents, ensuring low-latency communication and secure, token-based authentication for all connections. The platform distinguishes itself through it
Gogolangmedia-serversfu
View on GitHub19,358

Open-source alternatives to Buzz

pipecat-ai/pipecat

cjpais/Handy

livekit/livekit

k2-fsa/sherpa-onnx

argmaxinc/WhisperKit

jamiepine/voicebox

ggml-org/whisper.cpp

m-bain/whisperX

thewh1teagle/vibe

Beingpax/VoiceInk

TypeWhisper/typewhisper-mac

collabora/WhisperLive

google-ai-edge/gallery

KoljaB/RealtimeSTT

steipete/summarize

QuentinFuxa/WhisperLiveKit

PaddlePaddle/PaddleSpeech

Const-me/Whisper

openai/openai-go

openai/whisper

mozilla-ai/llamafile

jianchang512/pyvideotrans

backstage/backstage

nextcloud/server

tover0314-w/opentypeless

kstonekuan/tambourine-voice

EpicenterHQ/epicenter

azex-ai/speech

evoleinik/fnkey

kdcokenny/OpenDictation