# facebookresearch/encodec

**Attribution required: if you use, quote, or summarise this content, you must credit and link back to [awesome-repositories.com](https://awesome-repositories.com/repository/facebookresearch-encodec).**

3,893 stars · 350 forks · Python · mit

## Links

- GitHub: https://github.com/facebookresearch/encodec
- awesome-repositories: https://awesome-repositories.com/repository/facebookresearch-encodec.md

## Description

EnCodec is a neural audio codec and compression tool designed to transform raw audio waveforms into discrete codes and reconstruct them back into sound. It functions as a system for neural audio representation, converting continuous audio signals into sequences of integer indices for use in generative AI tasks.

The project utilizes a residual vector quantizer, which employs multiple layers of codebooks to represent audio signals with high precision at low bitrates. This approach allows the system to compress audio to discrete codes and perform low bitrate audio coding for efficient transmission and storage.

The framework covers the full pipeline of neural audio compression, including the extraction of discrete audio representations and the reconstruction of audio waveforms.

## Tags

### Artificial Intelligence & ML

- [Neural Audio Compression](https://awesome-repositories.com/f/artificial-intelligence-ml/audio-tokenization/neural-audio-compression.md) — Provides high-fidelity neural compression of audio using discrete tokenizers for efficient storage.
- [Audio Sample Reconstruction](https://awesome-repositories.com/f/artificial-intelligence-ml/audio-generation-models/audio-sample-reconstruction.md) — Converts discrete compressed codes back into playable audio signals without distortion.
- [Waveform Decoders](https://awesome-repositories.com/f/artificial-intelligence-ml/audio-tokenization/waveform-decoders.md) — Converts discrete neural tokens back into high-fidelity audio waveforms. ([source](https://cdn.jsdelivr.net/gh/facebookresearch/encodec@main/README.md))
- [Discrete Audio Representations](https://awesome-repositories.com/f/artificial-intelligence-ml/discrete-audio-representations.md) — Transforms continuous audio signals into sequences of integer indices for efficient storage and generative AI tasks.
- [Audio Codebook Tokenization](https://awesome-repositories.com/f/artificial-intelligence-ml/discretized-visual-representations/audio-codebook-tokenization.md) — Maps continuous neural embeddings to a finite vocabulary to enable discrete audio tokenization.
- [Residual Vector Quantizers](https://awesome-repositories.com/f/artificial-intelligence-ml/vector-quantization/residual-vector-quantizers.md) — Implements a residual vector quantizer with multiple codebook layers to achieve high-precision audio compression at low bitrates.
- [One-Dimensional Convolutions](https://awesome-repositories.com/f/artificial-intelligence-ml/convolutional-neural-networks/one-dimensional-convolutions.md) — Implements one-dimensional convolutional filters to extract hierarchical temporal features from raw audio samples.
- [Quantized Audio Encoder-Decoders](https://awesome-repositories.com/f/artificial-intelligence-ml/encoder-decoder-architectures/quantized-audio-encoder-decoders.md) — Uses a mirrored encoder-decoder structure with a quantization bottleneck for audio reconstruction.
- [Latent Space Encoders](https://awesome-repositories.com/f/artificial-intelligence-ml/generative-ai-resources/diffusion-visual-models/generative-ai-models/latent-space-generative-models/latent-space-projections/latent-space-encoders.md) — Constrains audio information into a low-dimensional discrete sequence to extract the most salient features.
- [Convolutional Architectures](https://awesome-repositories.com/f/artificial-intelligence-ml/machine-learning/speech-processing/sequence-to-sequence-tasks/convolutional-architectures.md) — Implements a symmetric convolutional architecture to downsample and upsample raw audio waveforms.
- [Vector-Quantized VAEs](https://awesome-repositories.com/f/artificial-intelligence-ml/model-training/variational-autoencoders/vector-quantized-vaes.md) — Utilizes a vector-quantized variational autoencoder to map audio to a discrete latent codebook.

### Data & Databases

- [Residual](https://awesome-repositories.com/f/data-databases/vector-quantization/residual.md) — Employs additive codebooks to achieve high precision audio representation at low bitrates.

### Graphics & Multimedia

- [Discrete Token Extraction](https://awesome-repositories.com/f/graphics-multimedia/media-processing-analysis/media-manipulation/media-processing-workflows/audio-analysis-synthesis/audio-feature-extraction/discrete-token-extraction.md) — Transforms audio waveforms into sequences of discrete codes for machine learning tasks. ([source](https://cdn.jsdelivr.net/gh/facebookresearch/encodec@main/README.md))
- [Audio Feature Extraction](https://awesome-repositories.com/f/graphics-multimedia/media-processing-analysis/media-manipulation/media-processing-workflows/audio-analysis-synthesis/audio-feature-extraction.md) — Uses stacked convolutional filters to extract temporal features across multiple time scales from raw audio.

### Networking & Communication

- [Low Bitrate Audio Coding](https://awesome-repositories.com/f/networking-communication/low-bitrate-audio-coding.md) — Encodes audio at extremely low bandwidths for efficient transmission and storage.
