30 open-source projects similar to ffmpeg/ffmpeg, ranked by how many features they have in common. Compare stars, activity and what each one does to find the best FFmpeg alternative.
GPAC is an open-source multimedia framework built around a pluggable filter graph pipeline, where modular processing units called filters connect into a directed graph to handle media workflows. At its core, the framework centers all media packaging and manipulation on the ISO Base Media File Format (ISOBMFF), with specialized tools for reading, writing, fragmenting, and encrypting MP4 and related containers. It also provides a declarative scene graph composition system for describing interactive multimedia scenes using MPEG-4 BIFS, X3D, SVG, or VRML syntax, alongside a hardware-accelerated re
HandBrake is an open-source media converter and video transcoding application designed to process digital video and audio files. It functions as a desktop utility that converts media from nearly any format into widely supported codecs, facilitating video format conversion and the optimization of files for specific playback requirements. The software serves as a tool for digital media archiving, allowing users to compress and preserve high-quality video into manageable formats. It also functions as a DVD and Blu-ray ripper, enabling the extraction and conversion of content from physical optica
This project is a cross-platform implementation of the WebRTC standard, providing a comprehensive library for building real-time audio, video, and data communication applications. It functions as a peer-to-peer networking framework and media processing engine, enabling direct, low-latency connections between devices without relying on central servers. By strictly adhering to official protocol specifications, the library ensures interoperability with browsers and other native communication software across mobile, desktop, and server environments. The engine distinguishes itself through a modul
This project is a community-curated directory of open-source software designed for deployment in private server environments and home labs. It serves as a comprehensive resource for discovering independent, self-hosted alternatives to mainstream cloud services, enabling users to maintain full data ownership and control over their digital infrastructure. The directory is structured through a hierarchical taxonomy that organizes a vast collection of applications into logical categories, ranging from media management and data analytics to private communication and team productivity tools. It dis
The NGINX RTMP module is a server-side extension that functions as a live video streaming engine. It enables the ingestion, processing, and distribution of real-time audio and video feeds, supporting both RTMP and HLS protocols to facilitate media delivery to multiple clients. The module distinguishes itself by integrating directly into the host server event loop, allowing for high-concurrency network input and output without blocking the main thread. It provides a toolkit for managing media streams through event-driven callbacks, which can trigger external process invocations for custom tran
Shotcut is a professional-grade, cross-platform non-linear video editor built on the MLT multimedia framework. It provides a comprehensive suite for post-production, supporting multi-track timeline editing, high-fidelity color processing, and complex visual effects. The application is designed to handle diverse audio and video formats natively, ensuring high-resolution and HDR workflows are managed within a unified environment. The software distinguishes itself through a modular architecture that emphasizes performance and precision. It utilizes a GPU-accelerated rendering pipeline and proxy-
Cat-catch is a browser-based media utility designed to detect, capture, and manage web-based video and audio resources. It functions as a comprehensive sniffing and download management system, enabling users to identify hidden or protected media assets directly from active web pages. The tool specializes in reconstructing fragmented streaming protocols, such as DASH and M3U8, into complete files while providing options for real-time stream recording and playback control. The project distinguishes itself through its deep integration with local system environments and external automation tools.
ZLMediaKit is a high-concurrency streaming engine designed to function as a protocol gateway and media server. It provides a native development kit for embedding streaming and transcoding capabilities directly into custom applications, supporting the ingestion, distribution, and recording of live video feeds across various network protocols. The platform distinguishes itself through a high-performance architecture that bridges legacy surveillance protocols with modern web-compatible streaming standards. It utilizes a thread-per-core execution model and zero-copy buffer management to maintain
VLC is a cross-platform multimedia player and framework designed to decode and render virtually any audio or video format, network stream, or physical disc without requiring external codecs. It functions as both a standalone application and a portable library, providing a modular architecture that allows developers to integrate playback, filtering, and streaming capabilities into third-party software. The project distinguishes itself through a highly modular plugin-based engine that supports real-time media processing, including format transcoding and the application of audio and video filter
PeerTube is a decentralized, open-source video hosting platform that enables users to operate independent, interoperable servers. By utilizing the ActivityPub protocol, it connects these servers into a global, federated network where users can follow channels, discover content, and interact across different instances. The platform is designed to function as a self-hosted video content management system, providing a community-driven alternative to centralized media services. What distinguishes PeerTube is its hybrid approach to content delivery and infrastructure management. It integrates peer
ffmpeg-python is a Python wrapper that translates programmatic method calls into command-line arguments for executing FFmpeg media processing tasks. It functions as a multimedia transcoding interface and a media stream capture tool, allowing for the recording of live audio and video from hardware devices and network sources. The library features a fluent interface for constructing complex directed graphs of audio and video filters through method chaining. It also includes an FFprobe metadata extractor that retrieves structured technical properties from media files and returns them as Python d
This project is a professional live video production suite designed for capturing, encoding, and broadcasting high-quality media. At its core, it features a real-time media processing engine that utilizes hardware acceleration to composite multiple audio and video sources with minimal latency. The application provides a centralized studio interface for managing complex scene transitions, layering visual sources through a hierarchical scene-graph engine, and streaming content to multiple platforms simultaneously. The software is built on a cross-platform abstraction layer that ensures consiste
Pipecat is a framework and software development kit for building real-time multimodal AI agents and speech-to-speech systems. It utilizes a frame-based data pipeline to route audio, video, and text through a modular sequence of processors, enabling the orchestration of low-latency conversational AI. The project is distinguished by its ability to coordinate complex multimodal services, including speech-to-text, language models, and text-to-speech, within a single pipeline. It features semantic voice activity detection for natural turn-taking, state-machine conversation flows for dialogue manag
This project is an open-source multimedia player for Windows designed for high-performance audio and video playback. It functions as a DirectShow-based media renderer that utilizes hardware-accelerated graphics APIs to perform color space conversion and high-quality scaling directly on the display adapter. The application distinguishes itself through granular control over playback dynamics and visual output. Users can manipulate video orientation through rotation, flipping, and zooming, while also leveraging support for high dynamic range rendering. The player supports automated playback sequ
Spotify-downloader is a command-line utility designed to archive music from Spotify by matching track URLs to external video sources. It functions as a high-fidelity downloader that retrieves audio content and saves it as local files, ensuring optimal sound quality by selecting the highest available bitrate from the source media. The tool distinguishes itself through its ability to maintain local music collections by mirroring remote playlist states. It performs local-remote synchronization to determine which tracks require downloading or removal, while utilizing a modular architecture to dec
NAudio is a .NET audio library that provides playback, recording, format conversion, and signal processing capabilities for Windows applications. It serves as an audio format converter between common formats like WAV, MP3, AAC, and AIFF using ACM or Media Foundation codecs, and functions as an audio playback and recording API through Windows audio APIs including WaveOut, DirectSound, ASIO, and WASAPI. The library includes an audio stream processor that can mix, trim, loop, fade, resample, and adjust pitch or channel count of audio streams with a floating-point engine. It also provides an audi
DistroAV is a live broadcast signal router and network video routing tool designed to transmit audio and video sources over a network. It serves as a system for routing and transmitting media using the Network Device Interface protocol. The project enables the integration of remote network streams into local production scenes as standard input sources. It supports broadcasting program output for remote monitoring and the transmission of individual sources or scenes to network destinations. The software includes capabilities for codec-based media decoding, manual IP-based source discovery, an
This project is a framework for developing multimodal AI agents that function as programmable participants in real-time communication rooms. It enables the construction of agents that can see, hear, and speak by integrating speech-to-text, large language models, and text-to-speech pipelines to facilitate low-latency, natural conversations. The system is distinguished by its advanced orchestration of real-time media and conversational flow, including support for full-duplex speech, preemptive response generation, and sophisticated interruption management. It further differentiates itself throu
The Android NDK samples provide a comprehensive collection of code examples demonstrating how to integrate C and C++ native code into Android applications. This repository serves as a practical guide for developers utilizing the Android Native Development Kit to implement performance-critical application components that require direct hardware access and low-level system interaction. The project highlights the use of the Java Native Interface to bridge managed code with native modules, enabling cross-language function calls and efficient data exchange. It demonstrates how to manage native act
VERT is a media conversion platform designed to transform images, audio, video, and documents into various formats. It functions as a batch file processor that allows users to apply consistent conversion settings and custom naming patterns to multiple assets simultaneously, bundling the final outputs into compressed archives for streamlined organization. The system distinguishes itself through a distributed architecture that routes heavy media transcoding tasks across local hardware or remote server infrastructure. This approach optimizes performance by balancing computational workloads, allo
go2rtc is a media streaming server that functions as a protocol-agnostic gateway for video and audio feeds. It ingests media from diverse sources and redistributes them across multiple streaming standards, enabling compatibility between proprietary camera hardware and web-based playback clients. The system utilizes a centralized configuration schema to manage stream routing and lifecycle orchestration based on client demand. The platform distinguishes itself through its focus on low-latency delivery, utilizing peer-to-peer connections to facilitate sub-second playback directly within web brow
This project is a C-based media engineering guide and multimedia processing framework designed for managing codecs, frames, and packets within the FFmpeg and Libav ecosystem. It provides technical documentation and implementation patterns for transcoding, remuxing, and resizing video and audio data. The project includes a containerized development environment that encapsulates required media libraries and toolchains within a virtual image to ensure consistent build environments. The framework covers a range of multimedia engineering workflows, including adaptive bitrate streaming, media cont
Subtitle Edit is a desktop application designed for the creation, synchronization, and adjustment of text-based subtitle files. It provides a graphical interface for managing subtitle workflows, allowing users to modify content and formatting to ensure accurate display during video playback. The application distinguishes itself through a specialized synchronization workflow that utilizes visual waveform displays to align subtitle timestamps with audio and video cues. It supports a wide range of industry-standard file formats, enabling users to convert subtitle data to ensure compatibility acr
YoutubeDownloader is a desktop application designed to retrieve and archive video and audio content from online platforms. It enables users to download media files directly to local storage, providing options to select specific quality levels and file formats to suit local playback requirements. The application distinguishes itself through its ability to access restricted or private content by utilizing personal account credentials. By managing session authentication, it allows for the retrieval of media that is not accessible to the general public. Furthermore, it incorporates automated work
Jellyfin Web is the browser-based frontend for the Jellyfin media server, providing the user interface for browsing, playing, and administering a self-hosted media collection. It functions as a cross-platform media client that works across desktop and mobile browsers, offering a dashboard for server configuration, user management, and plugin administration. The web client serves as the primary interface for organizing and streaming personal media libraries, including movies, TV shows, music, and photos. The web interface supports a range of media management capabilities, including library org
This is a cross-platform media processing library that reads, writes, encodes, and decodes media in both browser and server environments. It supports common container formats including ISOBMFF, Matroska, Ogg, MPEG-TS, and HLS, and handles codec operations through a combination of WebCodecs API and WebAssembly-based encoders. Media is processed in streaming pipelines that maintain constant memory usage and automatically apply backpressure from output speed to all upstream components. The library distinguishes itself through a plugin-based codec registration system that allows extending support
Mediago is a multi-platform video downloader and extraction utility that supports content retrieval from over a thousand different social and video sharing platforms. It functions as an AI-powered download automator that uses natural language commands to trigger and manage video download workflows. The project provides specialized capabilities for sniffing and capturing m3u8 and HLS video streams via a browser extension that detects resources on active web pages. It can be deployed as a containerized headless server, allowing users to manage remote download tasks across a network through a we
MoviePy is a Python video editing library and automated video processor designed for programmatically cutting, concatenating, and manipulating video and audio files. It serves as a non-linear video editor and an interface for FFmpeg to handle the reading, writing, and conversion of diverse media formats and codecs. The library enables automated video composition through the layering of multiple video and audio streams using transparency and coordinate-based positioning. It supports dynamic content generation by inserting text overlays and performing custom video frame processing where raw fra
MusicBrainz Picard is a metadata tagger and audio tag editor that identifies and tags audio files using the MusicBrainz community music database. It functions as a plugin-extensible tagging framework and a scriptable file organizer capable of reading and writing tags across various audio formats including MP3, FLAC, and WAV. The project is distinguished by its acoustic fingerprint identifier, which matches unknown music files to known recordings via sonic fingerprints. It features a custom scripting language for automating metadata transformations and organizing files into structured folder h
Restreamer is a self-hosted video broadcast platform and RTMP streaming server. It functions as a live media processing gateway and a multi-destination stream relay, providing a web-based management interface to configure video codecs, hardware acceleration, and stream routing. The system enables multi-platform video streaming by duplicating a single live video source and forwarding it to various third-party broadcast services and external servers simultaneously. It also supports direct-to-website broadcasting, allowing users to host live content for private or public audiences via customizab