Command-line utilities and software libraries for automatically adjusting audio files to consistent volume levels.
cmus is a terminal-based audio player that runs entirely within a terminal window, using the ncurses library to render its interface without any graphical toolkit. It is designed to play local music files and internet radio streams, with support for CUE sheet metadata that splits single audio files into logical tracks with correct timing, and gapless playback that eliminates audible pauses between consecutive tracks. The player distinguishes itself through a Vi-inspired incremental search interface that locates tracks and commands in real time as you type, and a tab-based workspace model that organizes the interface into independent views for the library, playlist, and file browser. It also provides a Unix socket IPC interface for remote control from external programs, and can execute external commands with selected tracks as arguments. Volume normalization is handled through ReplayGain metadata, adjusting playback levels per track or album for consistent listening. Additional capabilities include a play queue for arranging upcoming tracks, playback state persistence that remembers and restores the last position on restart, and scrobbling support for reporting played tracks to Last.fm or Libre.fm. The interface supports keyboard shortcut customization, tab-completion command execution, and dynamic playlist filtering that refines the visible track list in real time.
This is a terminal-based music player rather than a utility for modifying audio files, as its loudness normalization is limited to real-time playback adjustments via ReplayGain metadata.
Ultimate Vocal Remover is a desktop application designed for AI-driven audio source separation. It utilizes deep learning models to isolate vocals, drums, and other individual instruments from mixed audio files, providing a utility for professional production and creative editing workflows. The software distinguishes itself by leveraging GPU-accelerated tensor computation to perform complex signal processing tasks, significantly reducing the time required for high-fidelity audio extraction. It incorporates a modular plugin architecture that integrates external utilities to support a wide range of audio file formats, ensuring compatibility across diverse media libraries. Beyond core separation capabilities, the toolkit includes features for modifying audio pitch and tempo to meet specific project requirements. It also supports automated batch processing, allowing users to queue multiple files for sequential handling without manual intervention. The application is distributed as a desktop utility with documentation available for installation and configuration.
This is an AI-driven audio source separation tool designed for isolating instruments and vocals, rather than a utility for normalizing the perceived loudness of audio files.
FFmpeg is a cross-platform multimedia framework designed for the recording, conversion, and streaming of audio and video content. It functions as a comprehensive toolkit that provides both a command-line utility for direct media manipulation and a collection of low-level libraries for integration into custom applications. At its core, the project utilizes a packet-based stream engine and a format-agnostic abstraction layer to handle diverse media standards, containers, and network protocols. The framework distinguishes itself through a modular, graph-based filter execution model that allows for complex, non-linear transformations of audio and video frames. It supports high-performance processing by offloading intensive encoding and decoding tasks to dedicated hardware and utilizing threaded parallel processing to maximize throughput across multiple processor cores. This architecture enables users to construct intricate pipelines for tasks ranging from simple format conversion to advanced real-time media filtering and analysis. Beyond core transcoding, the project covers a broad functional surface including live streaming, hardware device capture, and secure network transport. It provides extensive capabilities for metadata management, subtitle processing, and stream synchronization, alongside diagnostic tools for inspecting media integrity and performance. The system is highly extensible, allowing for the dynamic integration of external codecs and third-party libraries to support specialized media requirements.
FFmpeg is a powerful multimedia framework that includes the loudnorm filter for EBU R128-compliant loudness normalization, making it a highly capable, albeit command-line-driven, tool for batch-processing and adjusting audio libraries.