30 open-source projects similar to slhck/ffmpeg-normalize, ranked by how many features they have in common. Compare stars, activity and what each one does to find the best Ffmpeg Normalize alternative.
FFmpeg Batch is a graphical desktop application designed to automate complex audio and video processing tasks. It functions as a workflow manager that allows users to execute repetitive multimedia operations in bulk, providing a centralized interface for managing encoding jobs, stream composition, and metadata across large collections of files. The application distinguishes itself by offering event-driven directory watching, which triggers automated workflows for incoming media without manual intervention. It utilizes configuration-based task templating to ensure consistent encoding parameter
EasyEffects is a real-time audio processor and system-wide effects manager designed for PipeWire audio streams. It functions as a comprehensive suite for applying filters, equalizers, and limiters to both input and output audio across the entire system. The project distinguishes itself through its use of deep learning for neural network noise suppression and voice isolation, as well as its ability to simulate physical acoustic environments using impulse-response convolution. It includes a sophisticated preset management system that allows users to associate specific audio configurations with
This project is a multi-room music server and library aggregator that centralizes local audio files and various streaming services into a single unified database. It functions as a media controller for DLNA and UPnP devices, enabling the management of music, podcasts, and audiobooks across networked speakers. The system is distinguished by its ability to synchronize audio playback across multiple speakers and brands simultaneously. It features a digital signal processor for loudness normalization and frequency correction, alongside an acoustic fingerprinting engine used for metadata enrichmen
VERT is a media conversion platform designed to transform images, audio, video, and documents into various formats. It functions as a batch file processor that allows users to apply consistent conversion settings and custom naming patterns to multiple assets simultaneously, bundling the final outputs into compressed archives for streamlined organization. The system distinguishes itself through a distributed architecture that routes heavy media transcoding tasks across local hardware or remote server infrastructure. This approach optimizes performance by balancing computational workloads, allo
Pyvideotrans is an automated video localization platform designed to transcribe, translate, and dub media content for international distribution. It functions as an end-to-end workflow that combines speech recognition, text translation, and synthetic voice generation to process video files into localized versions. The system distinguishes itself by offering a choice between local model inference for privacy and integration with third-party cloud services via user-provided credentials. This architecture allows users to maintain control over their billing and data security while utilizing modul
Waifu2x-Extension-GUI is a desktop application designed for high-fidelity media restoration and enhancement. It functions as a graphical interface that orchestrates specialized deep learning engines to upscale, denoise, and interpolate images and videos, improving visual clarity and motion smoothness. The software distinguishes itself through its ability to manage complex, automated media processing pipelines. Users can chain multiple tasks—such as format conversion, scene detection, and frame rate interpolation—into sequential workflows that execute without manual intervention. It provides g
PixivUtil2 is a digital content scraper and media downloader designed for the automated archiving of artwork and videos from Pixiv. It functions as a batch image archiver that retrieves posts based on member IDs, tags, or bookmarked collections. The software uses browser session cookies and user agents to authenticate accounts, enabling the retrieval of restricted or age-restricted material. It differentiates itself through the ability to synchronize bookmarked content and the use of external media tools to re-encode animated images and videos into different formats. The system manages large
Facefusion is a modular framework designed for automated image and video manipulation, specializing in tasks such as face swapping, enhancement, and restoration. It functions as a computer vision processing pipeline that chains independent machine learning modules to perform complex transformations, including facial animation, age modification, and lip synchronization. The system is built to handle both real-time interactive feeds and large-scale batch processing tasks. The platform distinguishes itself through a highly extensible architecture that supports custom processing modules and inter
PeerTube is a decentralized, open-source video hosting platform that enables users to operate independent, interoperable servers. By utilizing the ActivityPub protocol, it connects these servers into a global, federated network where users can follow channels, discover content, and interact across different instances. The platform is designed to function as a self-hosted video content management system, providing a community-driven alternative to centralized media services. What distinguishes PeerTube is its hybrid approach to content delivery and infrastructure management. It integrates peer
This project is a RESTful media extraction service that provides a programmatic interface for downloading video and image content from social media platforms. It functions as a scraper that parses shared URLs and user profile identifiers to isolate direct media streams and associated metadata from platform-specific data structures. The service distinguishes itself through its ability to emulate cryptographic signatures and security tokens required to authenticate requests against protected backend services. By simulating headless browser behavior and managing cookies and headers, the system b
This project is a command-line media downloader designed for the systematic retrieval and organization of digital content from diverse online platforms. It functions as an extensible extraction engine that utilizes a declarative format-selection pipeline to automate the identification, merging, and downloading of specific audio and video streams based on user-defined criteria. The system distinguishes itself through a modular architecture that supports custom plugins and site-specific scripts, allowing for the bypass of platform restrictions and the handling of complex authentication challeng
resumable.js is a JavaScript library for managing large file uploads using the HTML5 File API. It functions as a chunked data transmitter and resumable upload manager, dividing files into smaller segments to ensure reliable delivery to a remote server. The library is distinguished by its ability to recover upload progress after network interruptions or browser restarts. It achieves this through resumable data transfer and server-side chunk verification, which checks for existing segments on the server to avoid redundant data transmission. The system manages concurrent file uploads and reques
AzuraCast is a self-hosted web radio management suite and dashboard designed for internet radio broadcasting. It functions as an automated playback manager and broadcasting system, integrating an audio orchestrator for transcoding and cross-fading with a streaming server to distribute audio via mount points. The platform enables multi-tenant station management, allowing a single installation to host multiple independent radio entities. It distinguishes itself by combining an Auto-DJ broadcast system with the ability to manage live DJ accounts and coordinate real-time broadcasts. The system c
This project is an AI watermark removal tool designed to restore pixels hidden by logos and masks in images and videos. It utilizes reverse alpha blending to eliminate watermarks and recover original media content. The tool operates as a local media processor, executing all restoration algorithms on the client device or within the browser to prevent files from being uploaded to remote servers. It further functions as a browser media interceptor via extensions and userscripts to automatically replace watermarked previews and downloads with cleaned versions. Capabilities include bulk media pro
Crab is a memory-safe systems programming language and ahead-of-time compiler designed for high-performance software. It utilizes a type system based on strict ownership and borrowing rules to prevent memory access errors at compile time. The toolchain includes an automated documentation generator that produces technical reference manuals and API guides by parsing structured source code annotations. It also features a dependency management tool to resolve external library graphs and ensure consistent project builds. The project covers software build automation, static type checking, and mult
YoutubeDownloader is a desktop application designed to retrieve and archive video and audio content from online platforms. It enables users to download media files directly to local storage, providing options to select specific quality levels and file formats to suit local playback requirements. The application distinguishes itself through its ability to access restricted or private content by utilizing personal account credentials. By managing session authentication, it allows for the retrieval of media that is not accessible to the general public. Furthermore, it incorporates automated work
Ultimate Vocal Remover is a desktop application designed for AI-driven audio source separation. It utilizes deep learning models to isolate vocals, drums, and other individual instruments from mixed audio files, providing a utility for professional production and creative editing workflows. The software distinguishes itself by leveraging GPU-accelerated tensor computation to perform complex signal processing tasks, significantly reducing the time required for high-fidelity audio extraction. It incorporates a modular plugin architecture that integrates external utilities to support a wide rang
This repository is a technical documentation site and a collection of guides and references for implementing networking, security, and cloud infrastructure services. It functions as a static-site generated portal and a headless content platform, separating source files from the presentation layer to enable flexible rendering. The project utilizes markdown-based documentation stored in a version-controlled Git repository. It provides specialized technical content including an AI platform documentation for building agents and managing inference, a cloud infrastructure guide for DNS and CDN conf
This project is an AI singing voice conversion system and vocal processor used for training generative voice models and converting vocal recordings or live input into a target voice. It functions as a VITS model trainer and a real-time voice changer that transforms vocal timbre and pitch to change the identity of a singer. The system provides a graphical management dashboard for controlling training hyperparameters and voice conversion presets. It supports low-latency audio streaming for live microphone input and employs pitch estimation to ensure precise matching between source and target vo
GPAC is an open-source multimedia framework built around a pluggable filter graph pipeline, where modular processing units called filters connect into a directed graph to handle media workflows. At its core, the framework centers all media packaging and manipulation on the ISO Base Media File Format (ISOBMFF), with specialized tools for reading, writing, fragmenting, and encrypting MP4 and related containers. It also provides a declarative scene graph composition system for describing interactive multimedia scenes using MPEG-4 BIFS, X3D, SVG, or VRML syntax, alongside a hardware-accelerated re
pyinfra is an agentless infrastructure automation framework that turns declarative Python code into idempotent shell commands to manage servers, containers, and local machines over SSH without requiring any pre-installed software on target hosts. It operates by comparing the desired state of a system against its current state, using a dry-run simulation mode to preview changes and a fact-based conditional execution engine to gather host attributes at runtime and control which operations run. The tool compiles Python operations into optimized shell commands and executes them in parallel across
ncmdump is a proprietary audio cache converter and binary stream decryption utility. It decrypts raw audio data from specialized music cache files and transforms them into standard audio formats. The project functions both as a standalone tool and a cross-language conversion library. It exposes its internal decryption and conversion logic as a dynamic library, allowing the capabilities to be embedded into external applications written in different programming languages. The utility provides batch media processing through recursive directory traversal, enabling the identification and conversi
Recordly is a screen recording and video editing suite designed for creating product demonstrations. It combines screen and audio capture software with a dedicated demo video editor and tools for merging webcam overlays and exporting projects as MP4 files or looping GIFs. The platform features a specialized cursor animation engine that applies smoothing, motion blur, and click animations to the rendered mouse movements. It also provides customizable webcam bubbles and a system for placing recordings inside styled containers with custom wallpapers and drop shadows. The editing workflow center
downkyicore is a cross-platform desktop application designed for the automated batch downloading of Bilibili videos, including support for high-resolution 8K and HDR content. It functions as a batch media downloader that manages multiple files and collections simultaneously through an automated queue system. The project integrates FFmpeg for media processing to separate audio and video streams and remove watermarks. It uses specialized mechanisms to bypass access restrictions and rate limits by employing random identifiers and mobile referrers during the parsing process. The system includes
SmarterStreaming is a cross-platform SDK for hardware-accelerated audio and video capture, encoding, and streaming. It provides a complete media pipeline for low-latency RTMP and RTSP streaming, and includes an embedded lightweight RTSP server that can serve live feeds directly from the source device to local network clients without requiring a separate server. The SDK also integrates with GB28181 surveillance platforms, enabling compliant device registration and streaming for standardized video monitoring systems. The project distinguishes itself through a set of integrated capabilities for
MusicBrainz Picard is a metadata tagger and audio tag editor that identifies and tags audio files using the MusicBrainz community music database. It functions as a plugin-extensible tagging framework and a scriptable file organizer capable of reading and writing tags across various audio formats including MP3, FLAC, and WAV. The project is distinguished by its acoustic fingerprint identifier, which matches unknown music files to known recordings via sonic fingerprints. It features a custom scripting language for automating metadata transformations and organizing files into structured folder h
This command-line utility provides a specialized interface for managing and manipulating audiobook files. It functions as a processing engine designed to handle audio merging, splitting, and format standardization, allowing users to organize large collections through automated workflows. The tool distinguishes itself by integrating silence detection to automatically identify logical break points for chapter creation and segmentation. It supports parallel task execution to maximize throughput during batch operations and utilizes metadata injection to ensure that chapter markers, cover art, and
MoeKoeMusic is a cross-platform desktop music player that functions as a client for NetEase Cloud Music. It provides music streaming with synchronized lyrics display, theme customization, and a plugin system for extending functionality. The application is built on Electron to run on Windows, macOS, and Linux. It features real-time WebSocket streaming of playback state and synchronized lyrics, a plugin system that loads external modules, and dynamic theme customization through CSS variables. Authentication with NetEase Cloud Music accounts supports login via QR code, phone, or password, along