# chidiwilliams/buzz

**Attribution required: if you use, quote, or summarise this content, you must credit and link back to [awesome-repositories.com](https://awesome-repositories.com/repository/chidiwilliams-buzz).**

17,903 stars · 1,313 forks · Python · mit

## Links

- GitHub: https://github.com/chidiwilliams/buzz
- Homepage: https://chidiwilliams.github.io/buzz
- awesome-repositories: https://awesome-repositories.com/repository/chidiwilliams-buzz.md

## Topics

`whisper`

## Description

Buzz is a desktop application that provides a local speech-to-text engine for transcribing and translating audio and video files. By leveraging local machine inference, the software ensures data privacy and offline performance, removing the need for cloud connectivity during media processing.

The application distinguishes itself through a modular plugin architecture that allows for the integration of custom functionality, such as content summarization and automated text formatting, without modifying the core codebase. It also features a speaker diarization pipeline that identifies and labels individual voices within recordings to improve the readability and organization of generated transcripts.

The system supports automated media processing by monitoring specific directories for new files, enabling users to trigger transcription or translation workflows as soon as assets are detected. Users can export results into various standard formats, including plain text and subtitle files, while utilizing hardware acceleration to increase processing speeds for large media files.

## Tags

### Artificial Intelligence & ML

- [Speech-to-Text Engines](https://awesome-repositories.com/f/artificial-intelligence-ml/speech-to-text-engines.md) — Provides a local speech-to-text engine that leverages hardware acceleration and speaker diarization.
- [Audio Transcription](https://awesome-repositories.com/f/artificial-intelligence-ml/audio-transcription.md) — Converts audio and video files into written text locally to ensure complete data privacy.
- [Local AI Inference](https://awesome-repositories.com/f/artificial-intelligence-ml/machine-learning/infrastructure/model-deployment-and-serving/local-and-on-device-inference/local-ai-inference.md) — Executes machine learning models locally to ensure data privacy and offline performance.
- [Multilingual Speech Translation](https://awesome-repositories.com/f/artificial-intelligence-ml/machine-learning/speech-processing/multilingual-speech-translation.md) — Converts spoken language from media into different languages using local machine inference.
- [Speech-to-Text Translation](https://awesome-repositories.com/f/artificial-intelligence-ml/machine-learning/speech-processing/speech-datasets/english/speech-to-text-translation.md) — Translates spoken language from media files into different languages using local processing. ([source](https://chidiwilliams.github.io/buzz/))
- [Speaker Diarization](https://awesome-repositories.com/f/artificial-intelligence-ml/speaker-diarization.md) — Analyzes audio to distinguish and label individual speakers within transcripts.

### Development Tools & Productivity

- [Speech-to-Text Utilities](https://awesome-repositories.com/f/development-tools-productivity/developer-utilities-libraries/workflow-productivity-enhancers/developer-productivity-utilities/accessibility-and-productivity-tools/speech-to-text-utilities.md) — Provides a local speech-to-text engine for transcribing audio and video files offline. ([source](https://cdn.jsdelivr.net/gh/chidiwilliams/buzz@main/README.md))

### Graphics & Multimedia

- [Transcription Tools](https://awesome-repositories.com/f/graphics-multimedia/media-processing-analysis/media-manipulation/media-processing-workflows/audio-analysis-synthesis/transcription-tools.md) — Converts audio and video files into text using local machine processing to ensure privacy and offline performance.
- [Media Automation](https://awesome-repositories.com/f/graphics-multimedia/media-processing-analysis/media-manipulation/media-processing/video-analysis-processing/media-automation.md) — Automates transcription and translation tasks by monitoring directories for new media assets. ([source](https://cdn.jsdelivr.net/gh/chidiwilliams/buzz@main/README.md))

### Software Engineering & Architecture

- [Modular Plugin Architectures](https://awesome-repositories.com/f/software-engineering-architecture/modular-plugin-architectures.md) — Provides a modular plugin architecture that allows for the integration of custom functionality like summarization and formatting without modifying the core codebase.
- [Plugin Architectures](https://awesome-repositories.com/f/software-engineering-architecture/plugin-architectures.md) — Enables extending core functionality through modular plugins for tasks like summarization. ([source](https://cdn.jsdelivr.net/gh/chidiwilliams/buzz@main/README.md))

### Content Management & Publishing

- [Subtitle Management Systems](https://awesome-repositories.com/f/content-management-publishing/media-management/subtitle-management-systems.md) — Exports transcripts into standard subtitle and web video track formats.

### Operating Systems & Systems Programming

- [Hardware Acceleration](https://awesome-repositories.com/f/operating-systems-systems-programming/hardware-interfacing-drivers/hardware-acceleration.md) — Offloads intensive transcription tasks to local graphics hardware to increase processing speed.
