30 open-source projects similar to php-ffmpeg/php-ffmpeg, ranked by how many features they have in common. Compare stars, activity and what each one does to find the best PHP FFmpeg alternative.
ffmpeg-python is a Python wrapper that translates programmatic method calls into command-line arguments for executing FFmpeg media processing tasks. It functions as a multimedia transcoding interface and a media stream capture tool, allowing for the recording of live audio and video from hardware devices and network sources. The library features a fluent interface for constructing complex directed graphs of audio and video filters through method chaining. It also includes an FFprobe metadata extractor that retrieves structured technical properties from media files and returns them as Python d
RxFFmpeg is an Android multimedia framework and media transcoder based on FFmpeg. It provides a set of tools for video and audio editing, transcoding, and processing on Android devices. The framework integrates a video player component for rendering local files and network streams with zoom and rotation support. It also includes specialized libraries for Android video editing, such as cropping and splicing, and Android audio processing for mixing tracks and modifying voice pitch. The project covers broad media manipulation capabilities, including the conversion of images to video, the extrac
node-fluent-ffmpeg is a Node.js wrapper for FFmpeg that provides a fluent interface for executing media commands and processing files. It functions as a process manager that handles the lifecycle of external FFmpeg binaries, enabling programmatic media transcoding, video thumbnail generation, and metadata extraction via ffprobe. The library distinguishes itself through a command builder that translates JavaScript method calls into command-line arguments. It features event-driven progress monitoring to track processed frames and throughput, as well as the ability to route processed media data
QtAV is a cross-platform media engine and multimedia framework that combines FFmpeg decoding with the Qt framework for audio and video rendering. It functions as a hardware-accelerated video player, an OpenGL video renderer, and a multimedia stream transcoder. The project distinguishes itself through a hardware-abstraction decoding layer that utilizes GPU interfaces such as VA-API and VideoToolbox to decode high-resolution video. It employs a zero-copy memory transfer path to move decoded video data directly to graphics APIs, reducing CPU overhead and enabling high-performance YUV rendering.
This project is a C-based media engineering guide and multimedia processing framework designed for managing codecs, frames, and packets within the FFmpeg and Libav ecosystem. It provides technical documentation and implementation patterns for transcoding, remuxing, and resizing video and audio data. The project includes a containerized development environment that encapsulates required media libraries and toolchains within a virtual image to ensure consistent build environments. The framework covers a range of multimedia engineering workflows, including adaptive bitrate streaming, media cont
MoviePy is a Python video editing library and automated video processor designed for programmatically cutting, concatenating, and manipulating video and audio files. It serves as a non-linear video editor and an interface for FFmpeg to handle the reading, writing, and conversion of diverse media formats and codecs. The library enables automated video composition through the layering of multiple video and audio streams using transparency and coordinate-based positioning. It supports dynamic content generation by inserting text overlays and performing custom video frame processing where raw fra
FFmpeg Batch is a graphical desktop application designed to automate complex audio and video processing tasks. It functions as a workflow manager that allows users to execute repetitive multimedia operations in bulk, providing a centralized interface for managing encoding jobs, stream composition, and metadata across large collections of files. The application distinguishes itself by offering event-driven directory watching, which triggers automated workflows for incoming media without manual intervention. It utilizes configuration-based task templating to ensure consistent encoding parameter
Jumpcutter is an audio-based video cutter and automatic editor designed to eliminate dead air from video files. It functions as a utility that condenses footage by detecting and removing silent sections based on audio track analysis. The tool utilizes FFmpeg to automatically identify quiet gaps and strip them from recordings. This process focuses on removing silent video sections to create faster-paced content without the need for manual editing. The system operates by calculating decibel levels against a defined volume threshold to generate a list of timestamps for audible segments. These s
GPAC is an open-source multimedia framework built around a pluggable filter graph pipeline, where modular processing units called filters connect into a directed graph to handle media workflows. At its core, the framework centers all media packaging and manipulation on the ISO Base Media File Format (ISOBMFF), with specialized tools for reading, writing, fragmenting, and encrypting MP4 and related containers. It also provides a declarative scene graph composition system for describing interactive multimedia scenes using MPEG-4 BIFS, X3D, SVG, or VRML syntax, alongside a hardware-accelerated re
This project is an educational suite and technical guide designed for mastering video codecs and signal processing. It provides a structured curriculum through an engineering course, interactive labs, and tutorials focused on the fundamental principles of video compression and digital signal processing. The resource includes a technical guide for analyzing specific codecs like AV1, VP9, and H.265. It distinguishes itself by providing a containerized media lab, which ensures a consistent development environment for experimenting with video technology tools and notebooks. The project covers a
FFmpeg is a cross-platform multimedia framework designed for the recording, conversion, and streaming of audio and video content. It functions as a comprehensive toolkit that provides both a command-line utility for direct media manipulation and a collection of low-level libraries for integration into custom applications. At its core, the project utilizes a packet-based stream engine and a format-agnostic abstraction layer to handle diverse media standards, containers, and network protocols. The framework distinguishes itself through a modular, graph-based filter execution model that allows f
m3u8-downloader is a tool for extracting, downloading, and merging video segments from m3u8 playlist files into a single file. It functions as an HLS stream extractor and downloader that parses playlists to identify and retrieve individual video fragments. The project features a dedicated AES-128 stream decryptor to ensure encrypted media is playable and utilizes an MP4 transcoder to convert fragmented stream segments into a standard container for broad media player compatibility. It employs cross-origin script injection to bypass browser security restrictions and extract video source links d
Tdarr is a distributed video processing and media library automation tool. It functions as a server-node architecture that manages the scanning, analysis, and normalization of audio and video files based on custom rules. The system distributes heavy compute workloads, such as transcoding and health checks, across multiple remote nodes to optimize hardware utilization. It uses a plugin-based pipeline to execute sequences of filters and transformations, automating media conversion via FFmpeg and HandBrake to standardize file formats and containers. The project covers media library health audit
mobile-ffmpeg provides a framework for mobile multimedia processing, allowing Android, iOS, and tvOS applications to execute video and audio transcoding and filtering tasks. It enables the integration of pre-compiled binaries into mobile projects to avoid manual cross-compilation for different hardware architectures. The project supports cross-platform compilation and the integration of third-party codecs and external libraries. It provides a mechanism to deploy universal binaries across multiple operating systems and different CPU architectures. The toolset covers media metadata analysis, r
Editly is a headless, programmatic video engine and automated assembler. It functions as a declarative video editor that generates MP4 and GIF exports from structured data or code, removing the need for a manual graphical user interface. The system is distinguished by its ability to integrate GLSL fragment shaders as visual layers within a programmatic timeline. It uses a configuration-based model to define clips, layers, and audio tracks, allowing for reproducible video assembly and the generation of custom programmatic graphics. The engine covers a broad range of media production capabilit
Auto-editor is a command-line automated video editor that uses FFmpeg to remove silence and inactive footage from video files. It functions as a processing suite with specialized cut generators that identify segments to trim based on loudness thresholds, motion analysis, and speech-to-text transcription. The tool distinguishes itself by offering a flexible post-production workflow, allowing users to export automated cut timelines as XML or JSON files for use in professional non-linear editing software. Beyond simple deletion, it can perform dynamic playback adjustments, such as increasing the
This command-line utility provides a specialized interface for managing and manipulating audiobook files. It functions as a processing engine designed to handle audio merging, splitting, and format standardization, allowing users to organize large collections through automated workflows. The tool distinguishes itself by integrating silence detection to automatically identify logical break points for chapter creation and segmentation. It supports parallel task execution to maximize throughput during batch operations and utilizes metadata injection to ensure that chapter markers, cover art, and
LosslessCut is a desktop application designed for the precise editing of video and audio files without re-encoding the underlying media streams. By performing stream copying and container remuxing, the software allows users to cut, merge, and rearrange media segments while maintaining the original bit-perfect quality of the source content. The application distinguishes itself by utilizing a stream-copying data pipeline that transfers raw media packets directly from source to destination, significantly reducing processing time compared to traditional transcoding workflows. It also functions as
OpenCut is an open-source, non-linear video editor designed for professional-grade media production. It functions as a standalone desktop multimedia suite that provides a comprehensive workspace for importing, arranging, trimming, and manipulating video and audio clips on a multi-track timeline. The application distinguishes itself through a non-destructive editing workflow, where all modifications are stored as metadata references to preserve the integrity of original source files. It leverages a cross-platform graphics abstraction to maintain consistent visual output across different hardwa
Autocut is a text-based video editor and automatic speech recognition tool. It allows users to cut and merge video clips by modifying a text transcript instead of using a traditional timeline. The system operates as an FFmpeg video processor and subtitle manipulation utility. It converts spoken audio into text and compacts subtitle files into simplified formats, enabling the removal of unwanted video segments by deleting corresponding sentences from a transcription file. The project covers automated video transcription, non-linear video cutting, and subtitle file management. It supports hard
Shotcut is a professional-grade, cross-platform non-linear video editor built on the MLT multimedia framework. It provides a comprehensive suite for post-production, supporting multi-track timeline editing, high-fidelity color processing, and complex visual effects. The application is designed to handle diverse audio and video formats natively, ensuring high-resolution and HDR workflows are managed within a unified environment. The software distinguishes itself through a modular architecture that emphasizes performance and precision. It utilizes a GPU-accelerated rendering pipeline and proxy-
Gifify is a tool for converting video files into optimized animated GIFs. It functions as a video to GIF converter and optimization utility that extracts specific clips from video files and burns text or subtitle overlays directly into the frames. The project differentiates itself through specialized GIF optimization, using lossy compression, color count limiting, and custom color palette generation to reduce file sizes. It also provides precise control over the output by allowing users to adjust playback speed, reverse playback direction, and resize dimensions. The software covers a broad s
This is a cross-platform media processing library that reads, writes, encodes, and decodes media in both browser and server environments. It supports common container formats including ISOBMFF, Matroska, Ogg, MPEG-TS, and HLS, and handles codec operations through a combination of WebCodecs API and WebAssembly-based encoders. Media is processed in streaming pipelines that maintain constant memory usage and automatically apply backpressure from output speed to all upstream components. The library distinguishes itself through a plugin-based codec registration system that allows extending support
react-native-image-picker is a cross-platform mobile media picker used to select images and videos from a system gallery or capture new media via the device camera. It serves as a native device camera interface and a mobile gallery file picker for accessing local storage. The library includes a media metadata extractor to retrieve technical details from selected files, such as image dimensions, file size, video duration, and EXIF data. The project provides a bridge between JavaScript and native mobile operating systems to trigger system-provided gallery and camera interfaces. It handles the
The Android NDK samples provide a comprehensive collection of code examples demonstrating how to integrate C and C++ native code into Android applications. This repository serves as a practical guide for developers utilizing the Android Native Development Kit to implement performance-critical application components that require direct hardware access and low-level system interaction. The project highlights the use of the Java Native Interface to bridge managed code with native modules, enabling cross-language function calls and efficient data exchange. It demonstrates how to manage native act
VLC is a cross-platform multimedia player and framework designed to decode and render virtually any audio or video format, network stream, or physical disc without requiring external codecs. It functions as both a standalone application and a portable library, providing a modular architecture that allows developers to integrate playback, filtering, and streaming capabilities into third-party software. The project distinguishes itself through a highly modular plugin-based engine that supports real-time media processing, including format transcoding and the application of audio and video filter
mmaction2 is a PyTorch video understanding toolbox designed for training and evaluating deep learning models. It serves as a framework for action recognition, temporal localization, and spatio-temporal action detection, providing specialized tools for both pixel-based video analysis and skeleton-based action recognition. The project distinguishes itself through a modular architecture featuring registry-based component discovery and hierarchical, config-driven model assembly. It supports multi-modal feature fusion, integrating RGB frames, optical flow, and audio, and includes capabilities for
This project is an RTMP media streaming SDK and a real-time communication framework designed for pushing and playing audio and video streams. It provides tools for interactive broadcasting, low-latency voice and video calls, and a cross-platform media player compatible with Windows, iOS, and Android. The toolkit enables interactive live broadcasting with support for multi-host interactions and the ability to push streams to distribution servers via CDN. It includes a cloud recording manager for capturing live sessions and saving them as files to cloud storage, along with a system for composit
Simpl is an HTML, CSS, and JavaScript example library that provides a collection of minimal functional demonstrations of core web technologies and native browser APIs. It serves as a reference implementation and pattern gallery for frontend development, offering practical examples of how to implement common web features. The project showcases a wide range of browser capabilities, including real-time web communication via WebSockets and WebRTC, responsive web design techniques for adaptive layouts, and the implementation of offline workflows using service workers. It also provides demonstratio
This project is a community-driven directory of software resources, libraries, and tools designed to support iOS application development. It serves as a centralized reference point for developers, organizing a vast ecosystem of third-party components into a searchable, structured index to facilitate discovery and project integration. The repository distinguishes itself through its collaborative curation model, which aggregates disparate utilities into a single, maintainable catalog. By leveraging a flat-file documentation structure, it provides a clear overview of the tools available for nati