30 open-source projects similar to kkroening/ffmpeg-python, ranked by how many features they have in common. Compare stars, activity and what each one does to find the best Ffmpeg Python alternative.
PHP-FFmpeg is an object-oriented wrapper for executing FFmpeg binary commands within PHP applications. It serves as a multimedia processing library and toolkit for transcoding, clipping, merging, and filtering audio and video files through a standardized programmatic interface. The project provides specialized drivers for video manipulation, audio editing, and media metadata extraction. These drivers allow for the application of visual filters, the modification of audio sample rates, and the probing of multimedia files to retrieve technical specifications and validate file integrity. The lib
node-fluent-ffmpeg is a Node.js wrapper for FFmpeg that provides a fluent interface for executing media commands and processing files. It functions as a process manager that handles the lifecycle of external FFmpeg binaries, enabling programmatic media transcoding, video thumbnail generation, and metadata extraction via ffprobe. The library distinguishes itself through a command builder that translates JavaScript method calls into command-line arguments. It features event-driven progress monitoring to track processed frames and throughput, as well as the ability to route processed media data
GPAC is an open-source multimedia framework built around a pluggable filter graph pipeline, where modular processing units called filters connect into a directed graph to handle media workflows. At its core, the framework centers all media packaging and manipulation on the ISO Base Media File Format (ISOBMFF), with specialized tools for reading, writing, fragmenting, and encrypting MP4 and related containers. It also provides a declarative scene graph composition system for describing interactive multimedia scenes using MPEG-4 BIFS, X3D, SVG, or VRML syntax, alongside a hardware-accelerated re
RxFFmpeg is an Android multimedia framework and media transcoder based on FFmpeg. It provides a set of tools for video and audio editing, transcoding, and processing on Android devices. The framework integrates a video player component for rendering local files and network streams with zoom and rotation support. It also includes specialized libraries for Android video editing, such as cropping and splicing, and Android audio processing for mixing tracks and modifying voice pitch. The project covers broad media manipulation capabilities, including the conversion of images to video, the extrac
MoviePy is a Python video editing library and automated video processor designed for programmatically cutting, concatenating, and manipulating video and audio files. It serves as a non-linear video editor and an interface for FFmpeg to handle the reading, writing, and conversion of diverse media formats and codecs. The library enables automated video composition through the layering of multiple video and audio streams using transparency and coordinate-based positioning. It supports dynamic content generation by inserting text overlays and performing custom video frame processing where raw fra
QtAV is a cross-platform media engine and multimedia framework that combines FFmpeg decoding with the Qt framework for audio and video rendering. It functions as a hardware-accelerated video player, an OpenGL video renderer, and a multimedia stream transcoder. The project distinguishes itself through a hardware-abstraction decoding layer that utilizes GPU interfaces such as VA-API and VideoToolbox to decode high-resolution video. It employs a zero-copy memory transfer path to move decoded video data directly to graphics APIs, reducing CPU overhead and enabling high-performance YUV rendering.
This project is a C-based media engineering guide and multimedia processing framework designed for managing codecs, frames, and packets within the FFmpeg and Libav ecosystem. It provides technical documentation and implementation patterns for transcoding, remuxing, and resizing video and audio data. The project includes a containerized development environment that encapsulates required media libraries and toolchains within a virtual image to ensure consistent build environments. The framework covers a range of multimedia engineering workflows, including adaptive bitrate streaming, media cont
FFmpeg is a cross-platform multimedia framework designed for the recording, conversion, and streaming of audio and video content. It functions as a comprehensive toolkit that provides both a command-line utility for direct media manipulation and a collection of low-level libraries for integration into custom applications. At its core, the project utilizes a packet-based stream engine and a format-agnostic abstraction layer to handle diverse media standards, containers, and network protocols. The framework distinguishes itself through a modular, graph-based filter execution model that allows f
Editly is a headless, programmatic video engine and automated assembler. It functions as a declarative video editor that generates MP4 and GIF exports from structured data or code, removing the need for a manual graphical user interface. The system is distinguished by its ability to integrate GLSL fragment shaders as visual layers within a programmatic timeline. It uses a configuration-based model to define clips, layers, and audio tracks, allowing for reproducible video assembly and the generation of custom programmatic graphics. The engine covers a broad range of media production capabilit
This is a cross-platform media processing library that reads, writes, encodes, and decodes media in both browser and server environments. It supports common container formats including ISOBMFF, Matroska, Ogg, MPEG-TS, and HLS, and handles codec operations through a combination of WebCodecs API and WebAssembly-based encoders. Media is processed in streaming pipelines that maintain constant memory usage and automatically apply backpressure from output speed to all upstream components. The library distinguishes itself through a plugin-based codec registration system that allows extending support
This project is an educational suite and technical guide designed for mastering video codecs and signal processing. It provides a structured curriculum through an engineering course, interactive labs, and tutorials focused on the fundamental principles of video compression and digital signal processing. The resource includes a technical guide for analyzing specific codecs like AV1, VP9, and H.265. It distinguishes itself by providing a containerized media lab, which ensures a consistent development environment for experimenting with video technology tools and notebooks. The project covers a
mobile-ffmpeg provides a framework for mobile multimedia processing, allowing Android, iOS, and tvOS applications to execute video and audio transcoding and filtering tasks. It enables the integration of pre-compiled binaries into mobile projects to avoid manual cross-compilation for different hardware architectures. The project supports cross-platform compilation and the integration of third-party codecs and external libraries. It provides a mechanism to deploy universal binaries across multiple operating systems and different CPU architectures. The toolset covers media metadata analysis, r
FFmpeg Batch is a graphical desktop application designed to automate complex audio and video processing tasks. It functions as a workflow manager that allows users to execute repetitive multimedia operations in bulk, providing a centralized interface for managing encoding jobs, stream composition, and metadata across large collections of files. The application distinguishes itself by offering event-driven directory watching, which triggers automated workflows for incoming media without manual intervention. It utilizes configuration-based task templating to ensure consistent encoding parameter
m3u8-downloader is a tool for extracting, downloading, and merging video segments from m3u8 playlist files into a single file. It functions as an HLS stream extractor and downloader that parses playlists to identify and retrieve individual video fragments. The project features a dedicated AES-128 stream decryptor to ensure encrypted media is playable and utilizes an MP4 transcoder to convert fragmented stream segments into a standard container for broad media player compatibility. It employs cross-origin script injection to bypass browser security restrictions and extract video source links d
Tdarr is a distributed video processing and media library automation tool. It functions as a server-node architecture that manages the scanning, analysis, and normalization of audio and video files based on custom rules. The system distributes heavy compute workloads, such as transcoding and health checks, across multiple remote nodes to optimize hardware utilization. It uses a plugin-based pipeline to execute sequences of filters and transformations, automating media conversion via FFmpeg and HandBrake to standardize file formats and containers. The project covers media library health audit
LAVFilters is an open-source media filter pack consisting of splitters and decoders designed for the DirectShow framework. It functions as a set of software components that convert compressed media data into raw formats for hardware or software rendering. The collection includes a media splitter to separate combined audio and video container formats into individual streams, as well as a specialized Blu-ray disc player filter used to identify and extract movie titles and playlists from disc structures. The project provides capabilities for audio and video stream decoding, media stream demuxin
This command-line utility provides a specialized interface for managing and manipulating audiobook files. It functions as a processing engine designed to handle audio merging, splitting, and format standardization, allowing users to organize large collections through automated workflows. The tool distinguishes itself by integrating silence detection to automatically identify logical break points for chapter creation and segmentation. It supports parallel task execution to maximize throughput during batch operations and utilizes metadata injection to ensure that chapter markers, cover art, and
react-native-image-picker is a cross-platform mobile media picker used to select images and videos from a system gallery or capture new media via the device camera. It serves as a native device camera interface and a mobile gallery file picker for accessing local storage. The library includes a media metadata extractor to retrieve technical details from selected files, such as image dimensions, file size, video duration, and EXIF data. The project provides a bridge between JavaScript and native mobile operating systems to trigger system-provided gallery and camera interfaces. It handles the
Pydub is a Python audio manipulation library and digital audio processor used for editing, slicing, and converting audio files and segments. It serves as a programmatic wrapper for FFmpeg to import and export a wide variety of audio formats. The library functions as an audio signal generator capable of creating synthetic waveforms, such as sine waves and white noise. It also provides tools for digital signal processing, including the application of filters, fades, crossfades, and gain adjustments to sound signals. Its broader capabilities cover programmatic audio editing through concatenatio
Gifify is a tool for converting video files into optimized animated GIFs. It functions as a video to GIF converter and optimization utility that extracts specific clips from video files and burns text or subtitle overlays directly into the frames. The project differentiates itself through specialized GIF optimization, using lossy compression, color count limiting, and custom color palette generation to reduce file sizes. It also provides precise control over the output by allowing users to adjust playback speed, reverse playback direction, and resize dimensions. The software covers a broad s
VDO.Ninja is a low-latency peer-to-peer media routing service and video streaming platform designed to integrate remote audio and video feeds into professional production workflows. It functions as a WebRTC broadcast integration tool and studio controller, allowing for the direct transmission of high-definition media between publishers and viewers with minimal delay. The platform distinguishes itself through extensive protocol bridging, converting between WebRTC, WHIP, WHEP, SRT, and RTMP to ensure compatibility across diverse network environments and professional studio software. It includes
Auto-editor is a command-line automated video editor that uses FFmpeg to remove silence and inactive footage from video files. It functions as a processing suite with specialized cut generators that identify segments to trim based on loudness thresholds, motion analysis, and speech-to-text transcription. The tool distinguishes itself by offering a flexible post-production workflow, allowing users to export automated cut timelines as XML or JSON files for use in professional non-linear editing software. Beyond simple deletion, it can perform dynamic playback adjustments, such as increasing the
This project is an RTMP media streaming SDK and a real-time communication framework designed for pushing and playing audio and video streams. It provides tools for interactive broadcasting, low-latency voice and video calls, and a cross-platform media player compatible with Windows, iOS, and Android. The toolkit enables interactive live broadcasting with support for multi-host interactions and the ability to push streams to distribution servers via CDN. It includes a cloud recording manager for capturing live sessions and saving them as files to cloud storage, along with a system for composit
This repository provides a collection of reference implementations and practical demonstrations for using WebRTC to establish real-time audio, video, and data communication. It contains code samples for negotiating peer-to-peer connections, managing media streams, and utilizing low-latency data channels. The project demonstrates the capture of audio and video from hardware devices, as well as the redirection of canvas element content into media streams. It includes examples of transferring arbitrary text and binary data between peers and managing the negotiation of direct connections. The sa
Pipecat is a framework and software development kit for building real-time multimodal AI agents and speech-to-speech systems. It utilizes a frame-based data pipeline to route audio, video, and text through a modular sequence of processors, enabling the orchestration of low-latency conversational AI. The project is distinguished by its ability to coordinate complex multimodal services, including speech-to-text, language models, and text-to-speech, within a single pipeline. It features semantic voice activity detection for natural turn-taking, state-machine conversation flows for dialogue manag
The NGINX RTMP module is a server-side extension that functions as a live video streaming engine. It enables the ingestion, processing, and distribution of real-time audio and video feeds, supporting both RTMP and HLS protocols to facilitate media delivery to multiple clients. The module distinguishes itself by integrating directly into the host server event loop, allowing for high-concurrency network input and output without blocking the main thread. It provides a toolkit for managing media streams through event-driven callbacks, which can trigger external process invocations for custom tran
open source、high performance、industrial rtsp streaming server,a lot of optimization on streaming relay,KeyFrame cache,RESTful,and web management,also EasyDarwin support distributed load balancing,a simple streaming media cloud platform architecture.
Streamlink is a command line video stream extractor that retrieves direct stream URLs from online services for use in external media players. It functions as a local media stream pipe, redirecting raw video data from web services into local files or players via standard input or HTTP. The project includes a headless browser stream scraper to intercept network requests and extract media data from script-heavy websites, alongside a dedicated processor for HLS and DASH segmented media streams. The tool utilizes a modular video plugin framework, allowing support for new streaming platforms to be
Simpl is an HTML, CSS, and JavaScript example library that provides a collection of minimal functional demonstrations of core web technologies and native browser APIs. It serves as a reference implementation and pattern gallery for frontend development, offering practical examples of how to implement common web features. The project showcases a wide range of browser capabilities, including real-time web communication via WebSockets and WebRTC, responsive web design techniques for adaptive layouts, and the implementation of offline workflows using service workers. It also provides demonstratio
VitamioBundle is an Android media player library and playback framework designed for decoding and rendering high-performance audio and video content. It provides a hardware-accelerated video decoder to reduce CPU load and battery consumption on mobile devices. The project includes an adaptive bitrate streaming client that adjusts video quality in real-time based on network conditions to prevent playback interruptions. It also features a video rendering engine capable of configuring pixel formats and chroma output to optimize visual quality for mobile displays. The library covers broad multim