6 repos

Awesome GitHub Repositories · Speech Processing

Tools and libraries for converting, analyzing, and interpreting human speech through computational methods.

6 GitHub repositories matching Artificial Intelligence & ML · Speech Processing.

Significant-Gravitas/AutoGPT
181,891 stars on GitHub
AutoGPT is an orchestration platform designed for building, managing, and deploying autonomous agents. It provides a visual canvas-based environment where users can assemble agents by connecting modular blocks that represent actions, data flows, and conditional logic. The platform supports the entire agent lifecycle, i
Python · ai · artificial-intelligence · autonomous-agents
huggingface/transformers
156,730 stars on GitHub
Transformers is a comprehensive library for machine learning that provides a unified interface for training, fine-tuning, and deploying transformer-based models. It supports a wide range of tasks, including text classification, language modeling, question answering, and sequence-to-sequence translation, while offering
Python · audio · deep-learning · deepseek
pytorch/pytorch
97,601 stars on GitHub
PyTorch is a machine learning framework centered on a GPU-ready tensor library that supports multi-dimensional array operations across both CPU and accelerator hardware. It provides a foundational infrastructure for mathematical computation and dynamic neural network construction, utilizing a tape-based automatic diffe
Python · autograd · deep-learning · gpu
openai/whisper
94,839 stars on GitHub
This project is a speech recognition and translation engine that uses a sequence-to-sequence transformer architecture to convert audio into text. It is built upon a weakly supervised learning framework, which leverages large-scale, weakly labeled audio-transcript data to create generalized speech representations capabl
Python
CorentinJ/Real-Time-Voice-Cloning
59,355 stars on GitHub
This project is a neural text-to-speech engine and voice cloning toolkit designed to generate synthetic speech that mimics the vocal characteristics of a target speaker. It functions as a real-time audio synthesizer, utilizing a deep learning pipeline to convert written text into high-fidelity speech output with minima
Python · deep-learning · python · pytorch
RVC-Boss/GPT-SoVITS
55,111 stars on GitHub
GPT-SoVITS is a text-to-speech synthesis engine and voice cloning toolkit designed for generating natural-sounding human speech. It functions as a neural audio processing pipeline that maps input text to high-fidelity audio waveforms, utilizing conditional variational autoencoders and flow-based decoders to ensure expr
Python · text-to-speech · tts · vits

