30 open-source projects similar to arthenica/ffmpeg-kit, ranked by how many features they have in common. Compare stars, activity and what each one does to find the best Ffmpeg Kit alternative.
This project is an AI video post-production suite that uses large language models and programmatic tools to automate editing, transcription, and subtitle generation. It functions as an AI editing agent that translates natural language instructions into shell commands, providing a programmatic interface for manipulating media via FFmpeg. The toolkit includes a motion graphics engine that generates technical animations and visual overlays through code-driven rendering and mathematical definitions. It distinguishes itself by combining an AI-powered transcriber for word-level timestamps with an a
Pixez-flutter is a cross-platform mobile application designed for browsing, searching, and downloading digital artwork from external platforms. It provides a comprehensive interface for managing remote media collections, including support for viewing illustrations, manga, and animated content. The application handles user authentication and session persistence to ensure a consistent experience across different devices. The project distinguishes itself through a sophisticated network management layer that enables users to bypass regional restrictions and network limitations. By utilizing custo
LibGDX is a Java-based framework designed for cross-platform game development, enabling the creation and deployment of 2D and 3D games across desktop, mobile, and web environments from a single codebase. It functions as a comprehensive library that abstracts hardware-accelerated graphics, audio, input, and file system access, providing a unified interface for developers to manage game logic and application lifecycles. The framework distinguishes itself through a high-performance architecture that prioritizes efficiency and native interoperability. It utilizes a batch-oriented graphics pipelin
Pydub is a Python audio manipulation library and digital audio processor used for editing, slicing, and converting audio files and segments. It serves as a programmatic wrapper for FFmpeg to import and export a wide variety of audio formats. The library functions as an audio signal generator capable of creating synthetic waveforms, such as sine waves and white noise. It also provides tools for digital signal processing, including the application of filters, fades, crossfades, and gain adjustments to sound signals. Its broader capabilities cover programmatic audio editing through concatenatio
node-fluent-ffmpeg is a Node.js wrapper for FFmpeg that provides a fluent interface for executing media commands and processing files. It functions as a process manager that handles the lifecycle of external FFmpeg binaries, enabling programmatic media transcoding, video thumbnail generation, and metadata extraction via ffprobe. The library distinguishes itself through a command builder that translates JavaScript method calls into command-line arguments. It features event-driven progress monitoring to track processed frames and throughput, as well as the ability to route processed media data
ffmpeg-python is a Python wrapper that translates programmatic method calls into command-line arguments for executing FFmpeg media processing tasks. It functions as a multimedia transcoding interface and a media stream capture tool, allowing for the recording of live audio and video from hardware devices and network sources. The library features a fluent interface for constructing complex directed graphs of audio and video filters through method chaining. It also includes an FFprobe metadata extractor that retrieves structured technical properties from media files and returns them as Python d
Jumpcutter is an audio-based video cutter and automatic editor designed to eliminate dead air from video files. It functions as a utility that condenses footage by detecting and removing silent sections based on audio track analysis. The tool utilizes FFmpeg to automatically identify quiet gaps and strip them from recordings. This process focuses on removing silent video sections to create faster-paced content without the need for manual editing. The system operates by calculating decibel levels against a defined volume threshold to generate a list of timestamps for audible segments. These s
MoviePy is a Python video editing library and automated video processor designed for programmatically cutting, concatenating, and manipulating video and audio files. It serves as a non-linear video editor and an interface for FFmpeg to handle the reading, writing, and conversion of diverse media formats and codecs. The library enables automated video composition through the layering of multiple video and audio streams using transparency and coordinate-based positioning. It supports dynamic content generation by inserting text overlays and performing custom video frame processing where raw fra
Restreamer is a self-hosted video broadcast platform and RTMP streaming server. It functions as a live media processing gateway and a multi-destination stream relay, providing a web-based management interface to configure video codecs, hardware acceleration, and stream routing. The system enables multi-platform video streaming by duplicating a single live video source and forwarding it to various third-party broadcast services and external servers simultaneously. It also supports direct-to-website broadcasting, allowing users to host live content for private or public audiences via customizab
Kooha is a screen recorder for Linux desktops that utilizes the Wayland protocol and XDG Portals for secure recording. It functions as a hardware-accelerated screen capture tool that offloads video compression to the GPU to reduce CPU load and power consumption. The application integrates the PipeWire framework to capture system and microphone audio streams and leverages FFmpeg for muxing video streams and exporting various codecs and containers. Its user interface is a native Linux application built with the GTK toolkit. The software covers screen recording and capture of entire displays, s
This project is an open-source video production suite and non-linear video editor. It provides a multi-track timeline for cutting, splicing, and arranging video and audio clips with frame-level precision, serving as a comprehensive workspace for video post-production. The suite includes specialized tools for keyframe animation, allowing for the creation of 2D and 3D visual effects and motion graphics. It also features a multi-track audio mixer for blending sound sources and adjusting levels to accompany visual content. Capability areas cover a full post-production workflow, including color c
Autocut is a text-based video editor and automatic speech recognition tool. It allows users to cut and merge video clips by modifying a text transcript instead of using a traditional timeline. The system operates as an FFmpeg video processor and subtitle manipulation utility. It converts spoken audio into text and compacts subtitle files into simplified formats, enabling the removal of unwanted video segments by deleting corresponding sentences from a transcription file. The project covers automated video transcription, non-linear video cutting, and subtitle file management. It supports hard
Remotion is a programmatic video framework that enables the creation of video content using component-based logic and standard web technologies. By leveraging a declarative animation engine, it allows developers to structure visual content as a hierarchy of reusable components, ensuring that animations and state updates remain consistent through deterministic frame execution. The framework distinguishes itself by utilizing a headless browser renderer that captures visual output frame-by-frame to generate high-quality video files. This architecture supports a cloud-native media pipeline, allow
LAVFilters is an open-source media filter pack consisting of splitters and decoders designed for the DirectShow framework. It functions as a set of software components that convert compressed media data into raw formats for hardware or software rendering. The collection includes a media splitter to separate combined audio and video container formats into individual streams, as well as a specialized Blu-ray disc player filter used to identify and extract movie titles and playlists from disc structures. The project provides capabilities for audio and video stream decoding, media stream demuxin
FFmpeg is a cross-platform multimedia framework designed for the recording, conversion, and streaming of audio and video content. It functions as a comprehensive toolkit that provides both a command-line utility for direct media manipulation and a collection of low-level libraries for integration into custom applications. At its core, the project utilizes a packet-based stream engine and a format-agnostic abstraction layer to handle diverse media standards, containers, and network protocols. The framework distinguishes itself through a modular, graph-based filter execution model that allows f
Ijkplayer is a cross-platform media playback engine designed to provide consistent audio and video rendering across mobile devices. Built upon established open-source multimedia frameworks, it functions as a unified engine that leverages hardware-accelerated decoding to process diverse media formats. The project distinguishes itself by providing a comprehensive toolchain for compiling and configuring low-level media source code into native binary libraries. This allows developers to integrate high-performance playback directly into mobile applications, utilizing a pluggable output architectur
This project is a high-performance, terminal-based media player designed for efficient audio and video playback. It utilizes a modular decoding core to handle a wide range of multimedia formats while offloading frame processing to platform-specific hardware-accelerated rendering pipelines to minimize CPU overhead. Beyond its standalone utility, the software functions as an embeddable multimedia engine, providing a native library interface that allows external applications to integrate its advanced decoding and rendering capabilities directly into their own interfaces. The player is distinguis
This is a tool for downloading videos, images, and audio from the Douyin social media platform using shareable URLs or profile links. It can download individual posts, entire user profiles including all posts and liked content, collections, and music tracks, with options for watermark-free and high-quality output. The tool also supports live stream recording, comment collection, and keyword-based content search with JSONL export. The project distinguishes itself through an integrated REST API server that accepts download and transcription requests, tracks job status, and exposes health check
downkyicore is a cross-platform desktop application designed for the automated batch downloading of Bilibili videos, including support for high-resolution 8K and HDR content. It functions as a batch media downloader that manages multiple files and collections simultaneously through an automated queue system. The project integrates FFmpeg for media processing to separate audio and video streams and remove watermarks. It uses specialized mechanisms to bypass access restrictions and rate limits by employing random identifiers and mobile referrers during the parsing process. The system includes
JiaoZiVideoPlayer is an Android media player framework used to build mobile video interfaces. It serves as a unified interface layer that abstracts multiple backend playback engines to ensure consistent video delivery on mobile devices. The project integrates ExoPlayer and ijkPlayer, utilizing FFmpeg for broad codec support and efficient media decoding. This cross-backend approach allows the user interface to remain decoupled from specific playback engines, enabling the use of different decoding technologies based on hardware or software requirements. The framework provides a modular playbac
This project is a collection of official plugin packages and a native integration library designed to provide a consistent interface for accessing hardware and software functionality across different mobile and desktop platforms. It serves as a native platform bridge, enabling cross-platform applications to invoke native code and manage operating system dependencies. The project utilizes a federated plugin architecture, splitting plugins into common interfaces and separate platform implementations to allow for independent development and extension. It further supports native integration throu
This project is a professional live video production suite designed for capturing, encoding, and broadcasting high-quality media. At its core, it features a real-time media processing engine that utilizes hardware acceleration to composite multiple audio and video sources with minimal latency. The application provides a centralized studio interface for managing complex scene transitions, layering visual sources through a hierarchical scene-graph engine, and streaming content to multiple platforms simultaneously. The software is built on a cross-platform abstraction layer that ensures consiste
OpenCut is an open-source, non-linear video editor designed for professional-grade media production. It functions as a standalone desktop multimedia suite that provides a comprehensive workspace for importing, arranging, trimming, and manipulating video and audio clips on a multi-track timeline. The application distinguishes itself through a non-destructive editing workflow, where all modifications are stored as metadata references to preserve the integrity of original source files. It leverages a cross-platform graphics abstraction to maintain consistent visual output across different hardwa
This project is a customizable media player designed to provide a consistent interface for video and audio content across all modern web browsers and mobile devices. It functions as a unified abstraction layer, standardizing playback behavior and control interfaces for both native media elements and third-party streaming service embeds through a predictable, declarative API. The library distinguishes itself by wrapping native media elements with custom HTML structures, ensuring a uniform look and feel regardless of the underlying browser implementation. Developers can manage playback state, m
Shizuku is a framework that enables standard mobile applications to interact with restricted system-level interfaces and services. By acting as a bridge between the user space and protected system functions, it allows applications to perform privileged operations that are typically inaccessible due to standard operating system sandbox limitations. The project functions by routing requests through a persistent background service, which facilitates communication with internal system services and remote interfaces. This architecture allows for the execution of system-level tasks and the manageme
A collection of reference implementations and code samples for integrating Android camera hardware and software APIs. The project provides demonstrations for using both the Jetpack CameraX library and the low-level Camera2 API to implement photo and video capture features. The repository includes specialized implementations for high-performance recording, such as high-frame-rate slow motion and high-dynamic-range video. It also features examples of machine learning vision, demonstrating how to analyze live camera frames for object detection and QR code scanning. The project covers broad imag
Aurora IMUI is a cross-platform chat UI component library that provides native interface elements for building instant messaging screens on Android, iOS, and React Native. It wraps platform-native UI components behind a unified interface, delegating rendering to each platform's native runtime while maintaining consistent behavior and appearance across devices. The library separates chat data into a platform-agnostic model layer that drives view updates through reactive bindings to native list components. It manages the multi-modal input bar as a finite state machine that switches between text
This is a Kotlin Multiplatform UI framework for building consistent user interfaces across desktop, web, and mobile platforms using a single codebase. It functions as a declarative framework that enables the creation of shared UI components that run across multiple target environments. The project includes a cross-platform desktop toolkit for managing native window features, system notifications, and hardware acceleration. For web targets, it provides a declarative interface library and a rendering engine that executes via WebAssembly to achieve high-performance execution in browsers. The fr
uni-app is a cross-platform framework for building mobile and web applications from a single Vue.js codebase. It provides a multi-platform app compiler and a cross-platform UI library to ensure consistent behavior and visual presentation across different operating systems. The project features a native API bridge and a universal plugin system that wrap native SDKs into a unified interface, allowing developers to access device hardware and operating system functions. It utilizes a specialized rendering engine and native performance compilation to bypass standard document overhead and achieve h