30 open-source projects similar to fluent-ffmpeg/node-fluent-ffmpeg, ranked by how many features they have in common. Compare stars, activity, and what each one does to find the best node-fluent-ffmpeg alternative.
ffmpeg-python is a Python wrapper that translates programmatic method calls into command-line arguments for executing FFmpeg media processing tasks. It functions as a multimedia transcoding interface and a media stream capture tool, allowing for the recording of live audio and video from hardware devices and network sources. The library features a fluent interface for constructing complex directed graphs of audio and video filters through method chaining. It also includes an FFprobe metadata extractor that retrieves structured technical properties from media files and returns them as Python d
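The idea described in the ffmpeg-python entry, turning chained method calls into FFmpeg command-line arguments, can be sketched in a few lines. This is an illustrative sketch only and not ffmpeg-python's actual API: the `FFmpegCommand` class, its method names, and the file names are hypothetical. It relies only on the standard `ffmpeg` options `-i` (input file) and `-vf` (video filter chain).

```python
from dataclasses import dataclass, field
from typing import List


@dataclass
class FFmpegCommand:
    """Hypothetical fluent builder for illustration; NOT the ffmpeg-python API."""

    inputs: List[str] = field(default_factory=list)
    filters: List[str] = field(default_factory=list)
    output_path: str = ""

    def input(self, path: str) -> "FFmpegCommand":
        self.inputs.append(path)
        return self  # returning self is what makes method chaining possible

    def video_filter(self, name: str, *params: object) -> "FFmpegCommand":
        # FFmpeg writes a filter as name=arg1:arg2 (or just name with no args).
        spec = name if not params else f"{name}={':'.join(map(str, params))}"
        self.filters.append(spec)
        return self

    def output(self, path: str) -> "FFmpegCommand":
        self.output_path = path
        return self

    def args(self) -> List[str]:
        """Compile the recorded calls into an argument list."""
        argv = ["ffmpeg"]
        for path in self.inputs:
            argv += ["-i", path]
        if self.filters:
            # Filters in one chain are separated by commas.
            argv += ["-vf", ",".join(self.filters)]
        argv.append(self.output_path)
        return argv


# Placeholder file names; nothing is executed here, only the arguments are built.
cmd = (
    FFmpegCommand()
    .input("in.mp4")
    .video_filter("scale", 640, -1)
    .video_filter("hflip")
    .output("out.mp4")
)
print(cmd.args())
```

A list built this way could be handed to `subprocess.run(cmd.args(), check=True)`, which passes each argument to the process directly rather than through a shell.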
PHP-FFmpeg is an object-oriented wrapper for executing FFmpeg binary commands within PHP applications. It serves as a multimedia processing library and toolkit for transcoding, clipping, merging, and filtering audio and video files through a standardized programmatic interface. The project provides specialized drivers for video manipulation, audio editing, and media metadata extraction. These drivers allow for the application of visual filters, the modification of audio sample rates, and the probing of multimedia files to retrieve technical specifications and validate file integrity. The lib
FFmpeg is a cross-platform multimedia framework designed for the recording, conversion, and streaming of audio and video content. It functions as a comprehensive toolkit that provides both a command-line utility for direct media manipulation and a collection of low-level libraries for integration into custom applications. At its core, the project utilizes a packet-based stream engine and a format-agnostic abstraction layer to handle diverse media standards, containers, and network protocols. The framework distinguishes itself through a modular, graph-based filter execution model that allows f
MoviePy is a Python video editing library and automated video processor designed for programmatically cutting, concatenating, and manipulating video and audio files. It serves as a non-linear video editor and an interface for FFmpeg to handle the reading, writing, and conversion of diverse media formats and codecs. The library enables automated video composition through the layering of multiple video and audio streams using transparency and coordinate-based positioning. It supports dynamic content generation by inserting text overlays and performing custom video frame processing where raw fra
RxFFmpeg is an Android multimedia framework and media transcoder based on FFmpeg. It provides a set of tools for video and audio editing, transcoding, and processing on Android devices. The framework integrates a video player component for rendering local files and network streams with zoom and rotation support. It also includes specialized libraries for Android video editing, such as cropping and splicing, and Android audio processing for mixing tracks and modifying voice pitch. The project covers broad media manipulation capabilities, including the conversion of images to video, the extrac
QtAV is a cross-platform media engine and multimedia framework that combines FFmpeg decoding with the Qt framework for audio and video rendering. It functions as a hardware-accelerated video player, an OpenGL video renderer, and a multimedia stream transcoder. The project distinguishes itself through a hardware-abstraction decoding layer that utilizes GPU interfaces such as VA-API and VideoToolbox to decode high-resolution video. It employs a zero-copy memory transfer path to move decoded video data directly to graphics APIs, reducing CPU overhead and enabling high-performance YUV rendering.
FFmpeg Batch is a graphical desktop application designed to automate complex audio and video processing tasks. It functions as a workflow manager that allows users to execute repetitive multimedia operations in bulk, providing a centralized interface for managing encoding jobs, stream composition, and metadata across large collections of files. The application distinguishes itself by offering event-driven directory watching, which triggers automated workflows for incoming media without manual intervention. It utilizes configuration-based task templating to ensure consistent encoding parameter
This project is a C-based media engineering guide and multimedia processing framework designed for managing codecs, frames, and packets within the FFmpeg and Libav ecosystem. It provides technical documentation and implementation patterns for transcoding, remuxing, and resizing video and audio data. The project includes a containerized development environment that encapsulates required media libraries and toolchains within a virtual image to ensure consistent build environments. The framework covers a range of multimedia engineering workflows, including adaptive bitrate streaming, media cont
mobile-ffmpeg provides a framework for mobile multimedia processing, allowing Android, iOS, and tvOS applications to execute video and audio transcoding and filtering tasks. It enables the integration of pre-compiled binaries into mobile projects to avoid manual cross-compilation for different hardware architectures. The project supports cross-platform compilation and the integration of third-party codecs and external libraries. It provides a mechanism to deploy universal binaries across multiple operating systems and different CPU architectures. The toolset covers media metadata analysis, r
Execa is a promise-based process execution library that serves as a wrapper for the Node.js child process module. It functions as a shell command runner and subprocess management tool, simplifying the execution of external commands and binaries. The library distinguishes itself through automatic argument escaping to prevent shell injection and the use of abort signals for graceful process termination. It also provides an inter-process communication wrapper for exchanging structured JSON data and messages between parent and child processes. Its capabilities cover a broad range of process I/O
Pydub is a Python audio manipulation library and digital audio processor used for editing, slicing, and converting audio files and segments. It serves as a programmatic wrapper for FFmpeg to import and export a wide variety of audio formats. The library functions as an audio signal generator capable of creating synthetic waveforms, such as sine waves and white noise. It also provides tools for digital signal processing, including the application of filters, fades, crossfades, and gain adjustments to sound signals. Its broader capabilities cover programmatic audio editing through concatenatio
Nodemon is a command-line utility designed to automate development workflows by monitoring file system changes and managing the lifecycle of long-running processes. It functions as a process monitor that tracks directory trees for modifications, automatically terminating and restarting applications to ensure a continuous feedback loop during the coding process. The tool distinguishes itself through its language-agnostic design, which decouples monitoring logic from the target application's runtime. While it provides default support for Node.js, it can execute arbitrary command-line instructio
Jumpcutter is an audio-based video cutter and automatic editor that removes dead air from video files. It analyzes the audio track to detect silent sections and uses FFmpeg to strip them from the recording, producing faster-paced footage without manual editing. It compares decibel levels against a defined volume threshold to generate a list of timestamps for the audible segments. These s
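The threshold step described in the Jumpcutter entry can be sketched as follows. This is a minimal illustration and not Jumpcutter's actual code: the function names are hypothetical, and it assumes the audio has already been reduced to one linear RMS amplitude per fixed-length window (with 1.0 meaning full scale).

```python
import math
from typing import List, Optional, Sequence, Tuple


def to_dbfs(rms: float, floor_db: float = -120.0) -> float:
    """Convert a linear RMS amplitude (1.0 = full scale) to dBFS."""
    return 20.0 * math.log10(rms) if rms > 0 else floor_db


def audible_segments(
    rms_per_window: Sequence[float],
    window_seconds: float,
    threshold_db: float,
) -> List[Tuple[float, float]]:
    """Return (start, end) timestamps of runs of windows at or above the threshold."""
    segments: List[Tuple[float, float]] = []
    start: Optional[int] = None  # index of the first window in the current loud run
    for i, rms in enumerate(rms_per_window):
        loud = to_dbfs(rms) >= threshold_db
        if loud and start is None:
            start = i
        elif not loud and start is not None:
            segments.append((start * window_seconds, i * window_seconds))
            start = None
    if start is not None:
        # The recording ended while still in a loud run.
        segments.append((start * window_seconds, len(rms_per_window) * window_seconds))
    return segments


# Made-up sample levels: windows 1, 2 and 5 are above -30 dBFS.
levels = [0.001, 0.2, 0.3, 0.001, 0.001, 0.5]
print(audible_segments(levels, window_seconds=0.5, threshold_db=-30.0))
# → [(0.5, 1.5), (2.5, 3.0)]
```

Each resulting pair could then be used as the start and end of a clip to keep, with everything between the pairs treated as silence.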
This is a cross-platform media processing library that reads, writes, encodes, and decodes media in both browser and server environments. It supports common container formats including ISOBMFF, Matroska, Ogg, MPEG-TS, and HLS, and handles codec operations through a combination of WebCodecs API and WebAssembly-based encoders. Media is processed in streaming pipelines that maintain constant memory usage and automatically apply backpressure from output speed to all upstream components. The library distinguishes itself through a plugin-based codec registration system that allows extending support
react-native-image-picker is a cross-platform mobile media picker used to select images and videos from a system gallery or capture new media via the device camera. It serves as a native device camera interface and a mobile gallery file picker for accessing local storage. The library includes a media metadata extractor to retrieve technical details from selected files, such as image dimensions, file size, video duration, and EXIF data. The project provides a bridge between JavaScript and native mobile operating systems to trigger system-provided gallery and camera interfaces. It handles the
Janus is a WebRTC media gateway that routes real-time audio, video, and data between web browsers and server-side application logic. It functions as a central media relay that manages session negotiation and encryption for multiple browser endpoints. The project utilizes a modular plugin architecture that decouples the core server from specific media logic, allowing developers to implement custom modules for media processing, event handling, and transport protocols. This design enables the server to act as a protocol translation bridge, converting WebRTC streams into legacy formats such as SI
GPAC is an open-source multimedia framework built around a pluggable filter graph pipeline, where modular processing units called filters connect into a directed graph to handle media workflows. At its core, the framework centers all media packaging and manipulation on the ISO Base Media File Format (ISOBMFF), with specialized tools for reading, writing, fragmenting, and encrypting MP4 and related containers. It also provides a declarative scene graph composition system for describing interactive multimedia scenes using MPEG-4 BIFS, X3D, SVG, or VRML syntax, alongside a hardware-accelerated re
This project is an educational suite and technical guide designed for mastering video codecs and signal processing. It provides a structured curriculum through an engineering course, interactive labs, and tutorials focused on the fundamental principles of video compression and digital signal processing. The resource includes a technical guide for analyzing specific codecs like AV1, VP9, and H.265. It distinguishes itself by providing a containerized media lab, which ensures a consistent development environment for experimenting with video technology tools and notebooks. The project covers a
ffmpeg-kit is a cross-platform SDK that wraps FFmpeg and FFprobe into native libraries for Android, iOS, macOS, Linux, and tvOS, enabling applications to execute media processing commands through platform-specific APIs. It provides a concurrent command executor that runs multiple FFmpeg operations simultaneously and collects results independently via thread-safe interfaces. The project includes a build system that compiles FFmpeg native libraries from source with configurable codec and library options for each target platform, and offers eight precompiled binary packages with different sets o
Media-kit is a cross-platform media framework designed for Dart and Flutter applications. It provides a unified interface for building custom audio and video players, enabling developers to implement consistent media playback capabilities across mobile and desktop operating systems. The framework distinguishes itself by utilizing a hardware-accelerated media pipeline that offloads decoding and rendering tasks to dedicated device resources. By mapping video frames directly onto hardware-accelerated textures and employing a unified decoding pipeline, it maintains high-performance output while p
Kooha is a screen recorder for Linux desktops that utilizes the Wayland protocol and XDG Portals for secure recording. It functions as a hardware-accelerated screen capture tool that offloads video compression to the GPU to reduce CPU load and power consumption. The application integrates the PipeWire framework to capture system and microphone audio streams and leverages FFmpeg for muxing video streams and exporting various codecs and containers. Its user interface is a native Linux application built with the GTK toolkit. The software covers screen recording and capture of entire displays, s
Restreamer is a self-hosted video broadcast platform and RTMP streaming server. It functions as a live media processing gateway and a multi-destination stream relay, providing a web-based management interface to configure video codecs, hardware acceleration, and stream routing. The system enables multi-platform video streaming by duplicating a single live video source and forwarding it to various third-party broadcast services and external servers simultaneously. It also supports direct-to-website broadcasting, allowing users to host live content for private or public audiences via customizab
Autocut is a text-based video editor and automatic speech recognition tool. It allows users to cut and merge video clips by modifying a text transcript instead of using a traditional timeline. The system operates as an FFmpeg video processor and subtitle manipulation utility. It converts spoken audio into text and compacts subtitle files into simplified formats, enabling the removal of unwanted video segments by deleting corresponding sentences from a transcription file. The project covers automated video transcription, non-linear video cutting, and subtitle file management. It supports hard
This project is an AI video post-production suite that uses large language models and programmatic tools to automate editing, transcription, and subtitle generation. It functions as an AI editing agent that translates natural language instructions into shell commands, providing a programmatic interface for manipulating media via FFmpeg. The toolkit includes a motion graphics engine that generates technical animations and visual overlays through code-driven rendering and mathematical definitions. It distinguishes itself by combining an AI-powered transcriber for word-level timestamps with an a
LAVFilters is an open-source media filter pack consisting of splitters and decoders designed for the DirectShow framework. It functions as a set of software components that convert compressed media data into raw formats for hardware or software rendering. The collection includes a media splitter to separate combined audio and video container formats into individual streams, as well as a specialized Blu-ray disc player filter used to identify and extract movie titles and playlists from disc structures. The project provides capabilities for audio and video stream decoding, media stream demuxin
This project is an open-source video production suite and non-linear video editor. It provides a multi-track timeline for cutting, splicing, and arranging video and audio clips with frame-level precision, serving as a comprehensive workspace for video post-production. The suite includes specialized tools for keyframe animation, allowing for the creation of 2D and 3D visual effects and motion graphics. It also features a multi-track audio mixer for blending sound sources and adjusting levels to accompany visual content. Capability areas cover a full post-production workflow, including color c
ExifTool is a command line application and batch metadata processor used for reading, writing, and editing EXIF, IPTC, XMP, and other metadata in image and video files. It functions as a file metadata editor and image metadata extractor capable of modifying embedded internal properties across a wide variety of supported file formats. The tool specializes in extracting camera maker notes and industry standard descriptors from digital media. It supports metadata migration workflows, allowing embedded information to be copied from a source file and applied to a destination file. The utility cov
ytDownloader is a video downloader and media extraction tool that uses the yt-dlp engine to retrieve video and audio files from various social media and video sharing platforms. It functions as a utility for capturing full media files, specific segments or ranges of tracks, and entire video playlists. The project includes a hardware-accelerated video compressor to reduce file sizes while maintaining visual quality. It also features a subtitle downloader capable of retrieving both text captions and embedded subtitle tracks for accessibility and translation. The system handles broad media task
Air is a live reload tool and build automation utility for Go applications. It functions as a hot reloading orchestrator that monitors source files for changes to automatically recompile and restart application binaries. The tool distinguishes itself by offering a managed build and execution pipeline, allowing for custom build commands, environment variable injection, and platform-specific build overrides based on the host operating system. It further synchronizes the development cycle by using a proxy to signal web browsers to reload automatically when static files or source code are modifie