1 repo
Development environments and libraries that provide infrastructure for building complex neural-based audio processing pipelines.
Explore 1 awesome GitHub repository matching graphics & multimedia · Audio Processing Frameworks. Refine with filters or upvote what's useful.
GPT-SoVITS is a text-to-speech synthesis engine and voice cloning toolkit designed for generating natural-sounding human speech. It functions as a neural audio processing pipeline that maps input text to high-fidelity audio waveforms, utilizing conditional variational autoencoders and flow-based decoders to ensure expr