3 repos

Audio Processing — Graphics & Multimedia

We curate 3 GitHub repositories matching graphics & multimedia · Audio Processing. Refine with filters or upvote what's useful.

We'll search the best matching repositories with AI.

binary-husky/gpt_academic
binary-husky/gpt_academic
70,112GitHubView on GitHub
This project provides a self-hosted, web-based interface designed to integrate large language models into academic and research workflows. It functions as a modular platform for document analysis, literature processing, and data handling, allowing users to maintain full control over their data and model connectivity th
Pythonacademicchatglm-6bchatgpt
CorentinJ/Real-Time-Voice-Cloning
CorentinJ/Real-Time-Voice-Cloning
59,355GitHubView on GitHub
This project is a neural text-to-speech engine and voice cloning toolkit designed to generate synthetic speech that mimics the vocal characteristics of a target speaker. It functions as a real-time audio synthesizer, utilizing a deep learning pipeline to convert written text into high-fidelity speech output with minima
Pythondeep-learningpythonpytorch
RVC-Boss/GPT-SoVITS
RVC-Boss/GPT-SoVITS
55,111GitHubView on GitHub
GPT-SoVITS is a text-to-speech synthesis engine and voice cloning toolkit designed for generating natural-sounding human speech. It functions as a neural audio processing pipeline that maps input text to high-fidelity audio waveforms, utilizing conditional variational autoencoders and flow-based decoders to ensure expr
Pythontext-to-speechttsvits

3 repos

We curate 3 GitHub repositories matching graphics & multimedia · Audio Processing. Refine with filters or upvote what's useful.

We'll search the best matching repositories with AI.

binary-husky/gpt_academic
binary-husky/gpt_academic
70,112GitHubView on GitHub
This project provides a self-hosted, web-based interface designed to integrate large language models into academic and research workflows. It functions as a modular platform for document analysis, literature processing, and data handling, allowing users to maintain full control over their data and model connectivity th
Pythonacademicchatglm-6bchatgpt
CorentinJ/Real-Time-Voice-Cloning
CorentinJ/Real-Time-Voice-Cloning
59,355GitHubView on GitHub
This project is a neural text-to-speech engine and voice cloning toolkit designed to generate synthetic speech that mimics the vocal characteristics of a target speaker. It functions as a real-time audio synthesizer, utilizing a deep learning pipeline to convert written text into high-fidelity speech output with minima
Pythondeep-learningpythonpytorch
RVC-Boss/GPT-SoVITS
RVC-Boss/GPT-SoVITS
55,111GitHubView on GitHub
GPT-SoVITS is a text-to-speech synthesis engine and voice cloning toolkit designed for generating natural-sounding human speech. It functions as a neural audio processing pipeline that maps input text to high-fidelity audio waveforms, utilizing conditional variational autoencoders and flow-based decoders to ensure expr
Pythontext-to-speechttsvits