# rany2/edge-tts

**Attribution required: if you use, quote, or summarise this content, you must credit and link back to [awesome-repositories.com](https://awesome-repositories.com/repository/rany2-edge-tts).**

10,041 stars · 959 forks · Python · other

## Links

- GitHub: https://github.com/rany2/edge-tts
- Homepage: https://pypi.org/project/edge-tts/
- awesome-repositories: https://awesome-repositories.com/repository/rany2-edge-tts.md

## Topics

`speech-synthesis` `text-to-speech` `tts`

## Description

edge-tts is a command line interface and text-to-speech engine that converts written text into audio files using the Microsoft Edge online synthesis service. It functions as a client for generating high-quality speech and managing the conversion of text to audio.

The project provides utilities for generating synchronized SRT subtitle files by tracking word and sentence boundaries during synthesis. It also includes a voice profile discovery system to browse a catalog of available synthetic voices based on gender and personality traits.

Users can customize vocal characteristics by adjusting the pitch, rate, and volume of the output. The system supports real-time audio streaming for immediate playback without requiring local file storage, allowing for the simultaneous streaming of synthesized speech and synchronized text captions.

## Tags

### Artificial Intelligence & ML

- [Text-to-Speech](https://awesome-repositories.com/f/artificial-intelligence-ml/text-to-speech.md) — Converts written text into high-quality audio files using the Microsoft Edge online synthesis service.
- [Speech Synthesis Gateways](https://awesome-repositories.com/f/artificial-intelligence-ml/speech-to-text-integrations/unified-speech-pipelines/speech-synthesis-gateways.md) — Offloads text-to-speech conversion to a remote cloud engine via a synthesis gateway.
- [Voice Profile Managers](https://awesome-repositories.com/f/artificial-intelligence-ml/agentic-systems-frameworks/conversational-voice-interaction/voice-agents/voice-profile-managers.md) — Provides a system for querying catalogs and selecting specific vocal identity profiles.
- [Vocal Characteristic Adjustments](https://awesome-repositories.com/f/artificial-intelligence-ml/machine-learning/speech-processing/voice-synthesis/vocal-characteristic-adjustments.md) — Allows customization of pitch, rate, and volume to create specific vocal characteristics.
- [Voice Discovery Interfaces](https://awesome-repositories.com/f/artificial-intelligence-ml/machine-learning/speech-processing/voice-synthesis/voice-discovery-interfaces.md) — Provides a discovery interface to browse a catalog of synthetic voices based on gender and traits.
- [Speech Parameter Configuration](https://awesome-repositories.com/f/artificial-intelligence-ml/text-to-speech/speech-parameter-configuration.md) — Allows adjustment of voice profiles, playback rates, volume, and pitch. ([source](https://github.com/rany2/edge-tts#readme))

### Graphics & Multimedia

- [Speech Synthesis & TTS](https://awesome-repositories.com/f/graphics-multimedia/audio-music/speech-synthesis-tts.md) — Provides a command line interface for performing text-to-speech synthesis via Microsoft Edge.
- [Audio Playback Engines](https://awesome-repositories.com/f/graphics-multimedia/audio-music/audio-streaming-engines/audio-playback-engines.md) — Streams synthesized audio for immediate playback without requiring the audio to be saved to disk. ([source](https://github.com/rany2/edge-tts/blob/master/README.md))
- [Synthesis Playback Clients](https://awesome-repositories.com/f/graphics-multimedia/audio-music/audio-streaming-engines/synthesis-playback-clients.md) — Functions as a client for playing synthesized speech in real time without local file storage.
- [Instant Streaming Playback](https://awesome-repositories.com/f/graphics-multimedia/media-processing-analysis/audio-processing-systems/audio-processing/text-to-speech-engines/text-to-speech-engines/instant-streaming-playback.md) — Streams synthesized audio directly to a media player for instant listening without disk storage.

### Networking & Communication

- [Websocket Connection Managers](https://awesome-repositories.com/f/networking-communication/connection-management/websocket-connection-managers.md) — Maintains persistent duplex WebSocket connections for sending text and receiving binary audio streams.

### Part of an Awesome List

- [Vocal Tone Customization](https://awesome-repositories.com/f/awesome-lists/media/text-to-speech/vocal-tone-customization.md) — Provides tools to modify speaker characteristics and tone to change synthesized audio quality. ([source](https://github.com/rany2/edge-tts/blob/master/README.md))

### Content Management & Publishing

- [Timestamped Subtitle Generators](https://awesome-repositories.com/f/content-management-publishing/media-management/subtitle-management-systems/timestamped-subtitle-generators.md) — Generates synchronized SRT files by parsing timing metadata for word and sentence boundaries.

### Operating Systems & Systems Programming

- [Real-Time Audio Streaming Buffers](https://awesome-repositories.com/f/operating-systems-systems-programming/kernel-core-internals/process-and-memory-management/memory-management/buffer-and-cache-management/binary-buffer-managers/trace-buffer-managers/audio-buffers/real-time-audio-streaming-buffers.md) — Processes raw audio data packets in real time via streaming buffers for immediate playback.
