# w-okada/voice-changer

**Attribution required: if you use, quote, or summarise this content, you must credit and link back to [awesome-repositories.com](https://awesome-repositories.com/repository/w-okada-voice-changer).**

19,729 stars · 2,245 forks · Python · other

## Links

- GitHub: https://github.com/w-okada/voice-changer
- awesome-repositories: https://awesome-repositories.com/repository/w-okada-voice-changer.md

## Description

This software is a real-time voice changer that uses machine learning inference to convert live microphone input into a target voice. It is an AI audio processing tool for changing vocal identity during live calls and broadcasts.

The application stands out by running neural network models directly in the browser. It uses web-based compute acceleration and dedicated audio threads to keep latency low, so users can switch between voice profiles while audio streams are processed in real time.
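
The block-by-block processing and profile switching described above can be illustrated with a minimal sketch. This is not the project's code: `BlockConverter`, `blocks`, `block_latency_ms`, and the two stand-in profiles are hypothetical names, and simple gain functions take the place of real model inference. It assumes audio arrives as fixed-size blocks of float samples and that a profile switch takes effect at the next block boundary.

```python
from typing import Callable, Dict, Iterator, List

Block = List[float]
Profile = Callable[[Block], Block]


def blocks(samples: List[float], block_size: int) -> Iterator[Block]:
    """Yield consecutive fixed-size blocks; a short final block is zero-padded."""
    for start in range(0, len(samples), block_size):
        block = samples[start:start + block_size]
        yield block + [0.0] * (block_size - len(block))


def block_latency_ms(block_size: int, sample_rate: int) -> float:
    """Minimum buffering delay added by collecting one full block."""
    return 1000.0 * block_size / sample_rate


class BlockConverter:
    """Applies the active profile to each block (hypothetical helper)."""

    def __init__(self, profiles: Dict[str, Profile], active: str) -> None:
        if active not in profiles:
            raise KeyError(active)
        self.profiles = profiles
        self.active = active

    def switch(self, name: str) -> None:
        """Select another profile; it applies from the next processed block."""
        if name not in self.profiles:
            raise KeyError(name)
        self.active = name

    def process(self, block: Block) -> Block:
        return self.profiles[self.active](block)


# Stand-ins for model inference (assumptions, not real voice models).
def passthrough(block: Block) -> Block:
    return list(block)


def half_gain(block: Block) -> Block:
    return [0.5 * s for s in block]


converter = BlockConverter(
    {"passthrough": passthrough, "half_gain": half_gain},
    active="passthrough",
)
signal = [1.0] * 10
output: List[float] = []
for index, block in enumerate(blocks(signal, block_size=4)):
    if index == 1:
        # Switch profiles mid-stream, between two blocks.
        converter.switch("half_gain")
    output.extend(converter.process(block))
```

Smaller blocks reduce the buffering delay reported by `block_latency_ms` but leave less time to finish inference for each block, which is the trade-off any real-time converter of this kind has to balance.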

The system integrates with external communication platforms by injecting the processed media stream directly into their audio pipeline. It supports audio engineering tasks such as applying complex signal transformations for virtual content creation and live vocal modification.

## Tags

### Artificial Intelligence & ML

- [Real-Time Voice Cloning](https://awesome-repositories.com/f/artificial-intelligence-ml/generative-ai-resources/speech-synthesis/real-time-voice-cloning.md) — Transforms live audio input into target voices using machine learning models for instantaneous vocal modification.
- [Real-Time Voice Transformation](https://awesome-repositories.com/f/artificial-intelligence-ml/generative-ai-resources/speech-synthesis/voice-cloning-tools/real-time-voice-transformation.md) — Transforms live microphone input into target vocal characteristics in real time using machine learning. ([source](https://github.com/w-okada/voice-changer/tree/master/docs/))
- [Audio Processing](https://awesome-repositories.com/f/artificial-intelligence-ml/audio-processing.md) — Provides a utility for applying deep learning inference to microphone streams for low-latency voice conversion.
- [Neural Conversion Models](https://awesome-repositories.com/f/artificial-intelligence-ml/generative-ai-resources/speech-synthesis/real-time-voice-cloning/neural-conversion-models.md) — Uses neural network inference to transform live vocal input into target speaker characteristics in real time.
- [Inference Engines](https://awesome-repositories.com/f/artificial-intelligence-ml/machine-learning/infrastructure/model-inference-serving/inference-engines.md) — Executes pre-trained neural network models on live audio buffers to perform real-time signal transformation.

### Graphics & Multimedia

- [Media Stream Injection](https://awesome-repositories.com/f/graphics-multimedia/media-processing-analysis/media-manipulation/media-processing/streaming-network-frameworks/media-stream-processing/media-stream-injection.md) — Injects processed audio directly into communication platforms by replacing standard microphone input streams.
- [Audio Worklets](https://awesome-repositories.com/f/graphics-multimedia/audio-music/audio-processing/audio-worklets.md) — Implements high-performance audio processing using dedicated browser threads to ensure smooth, real-time output.
- [Live Vocal Engineering](https://awesome-repositories.com/f/graphics-multimedia/audio-music/audio-streaming-engines/live-vocal-engineering.md) — Provides real-time vocal processing and engineering for live broadcasts and online audiences.
- [Audio Processing](https://awesome-repositories.com/f/graphics-multimedia/audio-music/audio-processing.md) — Captures and processes raw audio buffers through a low-latency pipeline for immediate vocal modification.
- [Virtual Persona Creation](https://awesome-repositories.com/f/graphics-multimedia/media-production-suites/content-creation-tools/virtual-persona-creation.md) — Enhances live broadcasts by altering vocal identity to match specific characters or creative personas.

### Programming Languages & Runtimes

- [WebAssembly](https://awesome-repositories.com/f/programming-languages-runtimes/runtime-execution-environments/webassembly.md) — Leverages compiled binary modules to execute performance-critical signal processing at near-native speeds within the browser.