What are the best open-source alternatives to NeMo?

30 open-source projects similar to nvidia/nemo, ranked by shared features. Top picks: facebookresearch/fairseq, nvidia/megatron-lm, pipecat-ai/pipecat, nvidia-nemo/nemo, sanchit-gandhi/whisper-jax, eleutherai/gpt-neox, speechbrain/speechbrain, espnet/espnet, aigc-audio/audiogpt, microsoft/deepspeed.

Is facebookresearch/fairseq a good alternative to NeMo?

Fairseq is a PyTorch toolkit for sequence-to-sequence modeling, specializing in neural machine translation, automatic speech recognition, and large-scale language model training. It provides a framework for processing and aligning diverse data sources, including text, audio, and video, to support t…

Is nvidia/megatron-lm a good alternative to NeMo?

Megatron-LM is a distributed transformer training library and large language model training framework designed to scale models across thousands of GPUs. It functions as a GPU-optimized deep learning toolkit and a scaling engine for mixture-of-experts architectures, enabling the training of models w…

Is pipecat-ai/pipecat a good alternative to NeMo?

Pipecat is a framework and software development kit for building real-time multimodal AI agents and speech-to-speech systems. It utilizes a frame-based data pipeline to route audio, video, and text through a modular sequence of processors, enabling the orchestration of low-latency conversational AI…

Is nvidia-nemo/nemo a good alternative to NeMo?

NeMo is a comprehensive framework designed for the development, training, and deployment of large-scale conversational and generative artificial intelligence models. It provides an integrated platform for building multimodal systems, encompassing speech processing, language modeling, and reinforcem…

Is sanchit-gandhi/whisper-jax a good alternative to NeMo?

whisper-jax is a high-performance implementation of the Whisper automatic speech recognition model rewritten using the JAX framework. It is designed for accelerated inference and uses XLA compilation to optimize model execution on hardware accelerators. The project focuses on TPU optimized transcr…

Is eleutherai/gpt-neox a good alternative to NeMo?

gpt-neox is a distributed training system and framework for building large-scale autoregressive language models. It implements the transformer architecture and provides a toolkit for training models with billions of parameters by distributing weights across compute clusters. The framework distingu…

Is speechbrain/speechbrain a good alternative to NeMo?

SpeechBrain is an all-in-one deep learning toolkit designed for speech and audio processing. Built as a modular library, it provides a structured environment for developing, training, and deploying neural network models across a wide range of tasks, including automatic speech recognition, speaker i…

Is espnet/espnet a good alternative to NeMo?

ESPnet is a comprehensive speech processing toolkit and PyTorch-based trainer designed for building end-to-end speech recognition, synthesis, and translation models. It provides a structured framework for developing automatic speech recognition systems using transducer and encoder-decoder architect…

Is aigc-audio/audiogpt a good alternative to NeMo?

AudioGPT is an LLM-driven audio framework and processing suite that uses large language models to orchestrate neural audio pipelines. It functions as a multimodal audio generator and processing system, integrating a collection of pretrained models to handle speech synthesis, sound generation, and a…

Is microsoft/deepspeed a good alternative to NeMo?

DeepSpeed is a distributed deep learning optimization library and framework designed for the training and inference of massive AI models. It serves as a model parallelism orchestrator and a toolkit for scaling large language models across multiple GPUs and compute nodes. The project distinguishes…

Back to nvidia/nemo

Open-source alternatives to NeMo

30 open-source projects similar to nvidia/nemo, ranked by how many features they have in common. Compare stars, activity and what each one does to find the best NeMo alternative.

facebookresearch/fairseq
facebookresearch/fairseq
32,228View on GitHub
Fairseq is a PyTorch toolkit for sequence-to-sequence modeling, specializing in neural machine translation, automatic speech recognition, and large-scale language model training. It provides a framework for processing and aligning diverse data sources, including text, audio, and video, to support tasks such as speech-to-text conversion and multimodal sequence learning. The project is distinguished by its distributed training capabilities, which utilize parameter sharding, mixed-precision training, and CPU offloading to handle models that exceed single-device memory. It also includes specializ
Python
View on GitHub32,228
nvidia/megatron-lm
NVIDIA/Megatron-LM
16,731View on GitHub
Megatron-LM is a distributed transformer training library and large language model training framework designed to scale models across thousands of GPUs. It functions as a GPU-optimized deep learning toolkit and a scaling engine for mixture-of-experts architectures, enabling the training of models with hundreds of billions of parameters. The project implements multi-dimensional model parallelism, combining tensor, pipeline, data, expert, and context-based workload distribution. It specifically optimizes mixture-of-experts architectures through integrated memory and communication improvements t
Python
View on GitHub16,731
pipecat-ai/pipecat
pipecat-ai/pipecat
12,846View on GitHub
Pipecat is a framework and software development kit for building real-time multimodal AI agents and speech-to-speech systems. It utilizes a frame-based data pipeline to route audio, video, and text through a modular sequence of processors, enabling the orchestration of low-latency conversational AI. The project is distinguished by its ability to coordinate complex multimodal services, including speech-to-text, language models, and text-to-speech, within a single pipeline. It features semantic voice activity detection for natural turn-taking, state-machine conversation flows for dialogue manag
Pythonaichatbot-frameworkchatbots
View on GitHub12,846

Open-source alternatives to NeMo

facebookresearch/fairseq

NVIDIA/Megatron-LM

pipecat-ai/pipecat

NVIDIA-NeMo/NeMo

sanchit-gandhi/whisper-jax

EleutherAI/gpt-neox

speechbrain/speechbrain

espnet/espnet

AIGC-Audio/AudioGPT

microsoft/DeepSpeed

dusty-nv/jetson-inference

openai/whisper

vocodedev/vocode-core

livekit/agents

PaddlePaddle/PaddleSpeech

TEN-framework/ten-framework

elevenlabs/elevenlabs-python

NVIDIA/Isaac-GR00T

google-gemini/cookbook

livekit/livekit

facebookresearch/wav2letter

facebookresearch/metaseq

mastra-ai/mastra

mistralai/mistral-src

facebookresearch/mmf

mosaicml/llm-foundry

NVIDIA/FasterTransformer

pytorch/torchtitan

pytorch/fairseq

google/uis-rnn