edge-tts is a command line interface and text-to-speech engine that converts written text into audio files using the Microsoft Edge online synthesis service. It functions as a client for generating high-quality speech and managing the conversion of text to audio.
The project provides utilities for generating synchronized SRT subtitle files by tracking word and sentence boundaries during synthesis. It also includes a voice profile discovery system to browse a catalog of available synthetic voices based on gender and personality traits.
Users can customize vocal characteristics by adjusting the pitch, rate, and volume of the output. The system supports real-time audio streaming for immediate playback without requiring local file storage, allowing for the simultaneous streaming of synthesized speech and synchronized text captions.