# Video Subtitle Editors and Tools

> Search results for `tool to add and edit subtitles for video files` on awesome-repositories.com. 115 total matches; showing the first 50.

Explore on the web: https://awesome-repositories.com/q/tool-to-add-and-edit-subtitles-for-video-files

**Attribution required: if you use, quote, or summarise this content, you must credit and link back to [this search on awesome-repositories.com](https://awesome-repositories.com/q/tool-to-add-and-edit-subtitles-for-video-files).**

## Results

- [capsoftware/cap](https://awesome-repositories.com/repository/capsoftware-cap.md) (17,026 ⭐) — Cap is a self-hosted screen recording and video collaboration platform designed for teams to replace synchronous meetings with asynchronous video updates. It provides a comprehensive suite for capturing high-resolution desktop activity, including system audio, microphone input, and camera overlays, which are then processed through an integrated post-production workflow.

The platform distinguishes itself by offering full data sovereignty through containerized deployment and object storage abstractions, allowing users to host their media assets on private infrastructure or S3-compatible buckets. Beyond simple recording, it features keyframe-based video compositing, automated AI-powered transcription, and visual branding tools that enable creators to polish and annotate their content before sharing.

The system facilitates team engagement through a centralized workspace where viewers can provide feedback via timestamped comments, reactions, and playback analytics. It also includes programmatic interfaces for embedding videos into external applications, managing media assets, and automating distribution workflows.

The project is distributed as a containerized application, enabling deployment on private servers to maintain complete control over data storage and access permissions.
- [chocobozzz/peertube](https://awesome-repositories.com/repository/chocobozzz-peertube.md) (14,520 ⭐) — PeerTube is a decentralized, open-source video hosting platform that enables users to operate independent, interoperable servers. By utilizing the ActivityPub protocol, it connects these servers into a global, federated network where users can follow channels, discover content, and interact across different instances. The platform is designed to function as a self-hosted video content management system, providing a community-driven alternative to centralized media services.

What distinguishes PeerTube is its hybrid approach to content delivery and infrastructure management. It integrates peer-to-peer distribution via WebTorrent to reduce server bandwidth consumption, while simultaneously supporting remote object storage to decouple media assets from local disk capacity. To maintain performance under high load, the platform delegates resource-intensive tasks like video transcoding and transcription to external worker instances, ensuring the primary server remains responsive.

The platform offers a comprehensive suite of tools for content management, including live streaming, automated moderation, and granular access controls. Its extensibility is supported by a hook-based plugin architecture, allowing administrators to inject custom logic, modify interface elements, or integrate third-party services. Additionally, the system provides a robust command-line interface and a standardized REST API, enabling programmatic control over administrative tasks, bulk content processing, and platform maintenance.

The software is packaged for containerized deployment, simplifying infrastructure management and ensuring consistent execution across various hosting environments.
- [files-community/files](https://awesome-repositories.com/repository/files-community-files.md) (44,008 ⭐) — Files is a graphical file manager designed to replace the default operating system explorer with a unified, highly configurable environment. It functions as an extensible storage aggregator, normalizing local, cloud, and remote network storage into a single, consistent interface. By hooking into the system shell, the application intercepts navigation requests to provide a seamless, integrated experience for managing diverse file systems.

The application distinguishes itself through a dual-pane productivity environment that facilitates efficient cross-directory operations and drag-and-drop workflows. Users can control the interface through a searchable command palette and extensive keyboard shortcut customization, reducing reliance on traditional menu hierarchies. Furthermore, it features a metadata-based tagging system that decouples file organization from physical directory structures, allowing for flexible categorization and retrieval.

Beyond core navigation, the platform supports a modular plugin architecture and integrated version control, enabling users to manage code repositories and extend functionality directly within the browser. The environment is highly personalized, offering a declarative configuration schema for managing visual themes, folder styling, and behavioral preferences. Users can also perform context-aware global searches and manage complex directory layouts through a tabbed interface.
- [3b1b/videos](https://awesome-repositories.com/repository/3b1b-videos.md) (10,314 ⭐) — This project is a programmatic animation engine designed to create mathematical visualizations through executable scripts. It functions as a mathematical visualization tool that renders parametric curves, equations, and coordinate systems to translate abstract concepts into high-resolution video.

The system features an interactive scene renderer that allows for the execution of code snippets and real-time manipulation of scene states before final rendering. It includes an automated animation workflow that manages rendering checkpoints, scene playback, and video sequencing directly from a text editor.

The engine covers a broad capability surface including coordinate-based vector rendering, programmatic scene definition, and dynamic object relationship linking. It provides tools for animation sequence rendering and video organization to produce final high-resolution output.

The project uses a Python-based API to map mathematical expressions to renderable objects.
- [browser-use/video-use](https://awesome-repositories.com/repository/browser-use-video-use.md) (9,743 ⭐) — This project is an AI video post-production suite that uses large language models and programmatic tools to automate editing, transcription, and subtitle generation. It functions as an AI editing agent that translates natural language instructions into shell commands, providing a programmatic interface for manipulating media via FFmpeg.

The toolkit includes a motion graphics engine that generates technical animations and visual overlays through code-driven rendering and mathematical definitions. It distinguishes itself by combining an AI-powered transcriber for word-level timestamps with an automated system for removing filler words, false starts, and dead space.

The system covers a broad range of post-production capabilities, including audio-based video cutting, cinematic color grading through filter chains, and the integration of synthetic AI voiceovers. It also provides observability tools such as timeline visualization through composite filmstrips and waveforms, as well as self-evaluation loops to validate rendered output for visual jumps or audio pops.

Session data and editing history are persisted in text files to maintain project continuity across different execution contexts.
- [getgrav/grav](https://awesome-repositories.com/repository/getgrav-grav.md) (15,395 ⭐) — Grav is a flat-file content management system that eliminates the need for a traditional database by storing site content and configuration in human-readable Markdown and YAML files. Built as a modular PHP web framework, it uses a hierarchical page routing system where the physical directory structure directly determines the site's URL paths.

The platform is distinguished by its event-driven plugin architecture and a command-line interface that prioritizes system administration, deployment, and maintenance tasks. It utilizes a blueprint-driven system to generate administrative forms from structured data schemas, allowing for complex content management without requiring custom code. A secure, sandboxed templating engine handles the rendering of content into HTML, supporting template inheritance and custom filters.

The system provides a comprehensive suite of capabilities, including advanced media processing, multi-language support, and granular access control. It features robust automation tools for scheduling background tasks, managing site backups, and synchronizing content via version control. Developers can extend the core functionality through a modular plugin system, which allows for deep integration with external services and custom logic injection throughout the application lifecycle.

The project is designed for flexible deployment, supporting containerized environments and standard web server configurations. It includes extensive documentation and CLI tools to facilitate local development, package management, and automated system updates.
- [yaofanguk/video-subtitle-extractor](https://awesome-repositories.com/repository/yaofanguk-video-subtitle-extractor.md) (8,432 ⭐) — This project is an optical character recognition tool designed to extract hardcoded subtitles from video frames and convert them into synchronized subtitle files. It functions as a text processor that transforms embedded visual text into a written format to improve video accessibility and translation.

The system uses graphics processing units to increase the speed and accuracy of text recognition. It includes a subtitle cleaning tool that applies custom mapping configurations to filter out watermarks, channel logos, and duplicate lines from the extracted text.

The tool supports batch processing for multiple video files that share identical resolutions and text region settings. It utilizes region-based extraction to isolate subtitles from background noise and synchronizes recognized text strings with specific video timestamps.
- [ffmpeg/ffmpeg](https://awesome-repositories.com/repository/ffmpeg-ffmpeg.md) (61,176 ⭐) — FFmpeg is a cross-platform multimedia framework designed for the recording, conversion, and streaming of audio and video content. It functions as a comprehensive toolkit that provides both a command-line utility for direct media manipulation and a collection of low-level libraries for integration into custom applications. At its core, the project utilizes a packet-based stream engine and a format-agnostic abstraction layer to handle diverse media standards, containers, and network protocols.

The framework distinguishes itself through a modular, graph-based filter execution model that allows for complex, non-linear transformations of audio and video frames. It supports high-performance processing by offloading intensive encoding and decoding tasks to dedicated hardware and utilizing threaded parallel processing to maximize throughput across multiple processor cores. This architecture enables users to construct intricate pipelines for tasks ranging from simple format conversion to advanced real-time media filtering and analysis.

Beyond core transcoding, the project covers a broad functional surface including live streaming, hardware device capture, and secure network transport. It provides extensive capabilities for metadata management, subtitle processing, and stream synchronization, alongside diagnostic tools for inspecting media integrity and performance. The system is highly extensible, allowing for the dynamic integration of external codecs and third-party libraries to support specialized media requirements.
- [yaofanguk/video-subtitle-remover](https://awesome-repositories.com/repository/yaofanguk-video-subtitle-remover.md) (11,493 ⭐) — This project is a local AI inpainting tool designed to erase hard-coded subtitles and watermarks from videos and images. It functions as a content-aware media restorer that uses deep learning to reconstruct missing pixels and preserve the original resolution of the source files.

The software is distinguished by its local execution model, running inference on host hardware to process media without relying on external cloud APIs. It employs content-aware model selection, allowing the use of different generative algorithms based on media types, such as animation or live action, to optimize visual blending.

The system supports both automatic global text removal and targeted removal using coordinate-based area masking to isolate specific visual regions. It includes a queued batch processing pipeline for iteratively applying these removal algorithms across multiple files.
- [formbricks/formbricks](https://awesome-repositories.com/repository/formbricks-formbricks.md) (12,391 ⭐) — Formbricks is an open-source survey and feedback platform designed to help teams capture and analyze user insights through targeted, in-app, and website-based interactions. It functions as a comprehensive customer experience analytics system that allows organizations to maintain full control over their data, user attributes, and survey workflows.

The platform distinguishes itself through its event-driven architecture, which enables precise behavioral targeting by triggering surveys based on specific user actions or application events. It supports deep integration with external ecosystems by automatically synchronizing response data to CRMs, databases, and communication tools, while providing programmatic interfaces for managing resources and automating feedback loops.

Beyond core collection, the system includes advanced logic for conditional branching, scoring, and personalized routing to create adaptive survey experiences. It offers extensive customization options, including white-labeling, CSS overrides, and multi-channel distribution across web, mobile, and email environments.

The platform is built for self-hosting, supporting containerized deployments with built-in multi-tenant data isolation and enterprise-grade security features like single sign-on and role-based access control.
- [remotion-dev/remotion](https://awesome-repositories.com/repository/remotion-dev-remotion.md) (50,931 ⭐) — Remotion is a programmatic video framework that enables the creation of video content using component-based logic and standard web technologies. By leveraging a declarative animation engine, it allows developers to structure visual content as a hierarchy of reusable components, ensuring that animations and state updates remain consistent through deterministic frame execution.

The framework distinguishes itself by utilizing a headless browser renderer that captures visual output frame-by-frame to generate high-quality video files. This architecture supports a cloud-native media pipeline, allowing for scalable, parallelized rendering on serverless infrastructure. Developers can interact with their compositions in real time through a browser-based studio environment, which provides tools for debugging, parameter manipulation, and visual testing before final production.

Beyond its core rendering capabilities, the project includes a comprehensive suite of tools for managing media assets, including audio, captions, and vector animations. It supports complex visual effects through physics-based motion primitives, property interpolation, and integration with various graphics libraries. The system is designed for automated, high-volume production workflows, offering command-line interfaces and server-side APIs to handle the entire lifecycle of media generation and deployment.
- [browser-use/browser-use](https://awesome-repositories.com/repository/browser-use-browser-use.md) (100,229 ⭐) — Browser-use is a framework for building autonomous agents that navigate, interact with, and extract data from web interfaces using natural language instructions. By acting as an orchestration layer between large language models and browser automation protocols, it enables the execution of complex, multi-step workflows without relying on brittle selectors. The system functions as a headless browser controller, providing a programmatic interface to manage browser instances and execute granular interactions.

The project distinguishes itself through its ability to translate high-level intent into specific browser primitives, supported by a serialization process that converts complex web page structures into simplified text for model processing. It includes robust support for stateful session persistence, allowing agents to maintain authenticated environments across long-running tasks. Furthermore, the framework facilitates remote browser orchestration, enabling the scaling of automation routines in cloud environments with integrated support for stealth configurations and proxy management.

Beyond its core agent capabilities, the platform provides extensive tooling for structured data extraction and workflow integration. It supports a variety of model configurations and allows for the definition of custom tools to extend interaction logic. The project documentation includes quickstart guides for command-line execution and examples for integrating browser automation into broader software ecosystems.
- [kristjanbjarni/osd-subtitles](https://awesome-repositories.com/repository/kristjanbjarni-osd-subtitles.md) (0 ⭐) — The OSD Subtitles utility combines OpenTX/Blackbox log files with [Betaflight] OSD elements and produces a subtitle file with OSD overlay similiar to the the Betaflight OSD. You can then play your flight video with subtitle OSD elements in a video player like [VLC] or import them into Youtube or…
- [burningion/video-editing-mcp](https://awesome-repositories.com/repository/burningion-video-editing-mcp.md) (272 ⭐) — MCP Interface for Video Jungle
- [aider-ai/aider](https://awesome-repositories.com/repository/aider-ai-aider.md) (46,305 ⭐) — Aider is a command-line interface tool that enables large language models to directly edit, refactor, and manage source code within a local repository. It functions as an AI-powered coding assistant that integrates into the developer workflow, allowing users to apply code changes through natural language prompts while maintaining repository context and version control.

The tool distinguishes itself through a specialized diff-based patching engine that parses model-generated search-and-replace blocks to modify specific file segments without rewriting entire files. It features a provider-agnostic model abstraction that supports a wide range of cloud-based and local language models, enabling users to switch between them to optimize for performance, cost, and reasoning capabilities. To ensure high-quality results, it employs a repository context engine that analyzes codebase structure and dependencies, dynamically managing the active chat window to provide relevant information within token limits.

Beyond basic editing, the project automates the development lifecycle by integrating directly with version control systems to handle commit attribution and history management. It supports multi-stage planning through an architect mode that separates high-level design from low-level implementation, and it can automatically trigger test suites and linting commands to verify code modifications. The system is highly configurable, offering hierarchical settings management and a programmatic interface for scripting complex coding tasks.
- [sampotts/plyr](https://awesome-repositories.com/repository/sampotts-plyr.md) (29,862 ⭐) — This project is a customizable media player designed to provide a consistent interface for video and audio content across all modern web browsers and mobile devices. It functions as a unified abstraction layer, standardizing playback behavior and control interfaces for both native media elements and third-party streaming service embeds through a predictable, declarative API.

The library distinguishes itself by wrapping native media elements with custom HTML structures, ensuring a uniform look and feel regardless of the underlying browser implementation. Developers can manage playback state, monitor events, and configure settings through a centralized interface, while also utilizing advanced navigation tools like visual seek previews and keyboard shortcuts to enhance the user experience for long-form content.

The platform supports a wide range of functional requirements, including accessible media consumption through integrated captioning and screen reader support, as well as extensive visual customization via CSS variables. It handles the complexities of cross-browser compatibility and media lifecycle management, allowing for the integration of custom logic and analytics throughout the playback session.
- [microsoft/windows-universal-samples](https://awesome-repositories.com/repository/microsoft-windows-universal-samples.md) (9,696 ⭐) — This repository is a comprehensive collection of reference implementations and sample libraries for the Universal Windows Platform. It provides practical examples of how to use Windows Runtime APIs to build cross-device applications, including detailed guidance on XAML-based declarative user interfaces and DirectX-integrated rendering.

The project distinguishes itself by providing a wide array of hardware integration suites, covering low-level communication with USB, Serial, I2C, SPI, and GPIO peripherals. It includes specialized implementations for mixed reality holographic rendering, advanced digital inking, and computer vision tasks such as real-time face tracking and barcode scanning.

The codebase covers a broad surface of system capabilities, including adaptive media streaming, biometric authentication, and background task management. It also demonstrates the use of linguistic services for text analysis, globalization tools for regional formatting, and persistent storage strategies for application data.

The repository serves as a practical implementation guide for the Windows SDK, providing a library of samples for building responsive interfaces and integrating system-level services.
- [pajswigger/add-request-to-macro](https://awesome-repositories.com/repository/pajswigger-add-request-to-macro.md) (7 ⭐) — Burp extension to add a request to a macro
- [avelino/awesome-go](https://awesome-repositories.com/repository/avelino-awesome-go.md) (175,576 ⭐) — This project serves as a comprehensive language ecosystem index, functioning as a centralized, community-curated directory for the Go programming language. It organizes a vast landscape of software components, libraries, and development tools into a structured, navigable hierarchy, enabling developers to efficiently discover resources tailored to specific functional domains.

The repository distinguishes itself through a decentralized contribution model, where community-driven updates ensure the index remains current with the rapidly evolving software landscape. Beyond simple resource listing, it acts as a technical knowledge repository, aggregating professional literature, style guides, and best practices to support developer onboarding and professional growth across the entire software development lifecycle.

The directory covers a broad capability surface, including essential utilities for distributed systems engineering, application security, data processing, and development productivity. It provides access to specialized tools for database management, web framework integration, testing, and build automation, alongside educational materials that help developers master language-specific architectural patterns.

The project is maintained as a static resource aggregation, providing a holistic view of external links and documentation to orient developers within the Go ecosystem.
- [huanshere/videolingo](https://awesome-repositories.com/repository/huanshere-videolingo.md) (17,498 ⭐) — VideoLingo is an automated video localization suite designed to transcribe, translate, and dub video content. It functions as a translation pipeline that utilizes large language models to convert spoken audio into precise text segments and translate them into multiple languages.

The system differentiates itself through a multi-step translation refinement process and a specialized natural language processing utility that segments text into single-line captions meeting broadcast standards. It also integrates synthetic voiceover generation to replace or augment original audio tracks.

The project covers a broad range of media processing capabilities, including automated video acquisition from external platforms, word-level timestamp alignment for subtitles, and a task sequencing system to monitor and control the localization pipeline.
- [expo/expo](https://awesome-repositories.com/repository/expo-expo.md) (50,111 ⭐) — Expo is a universal mobile framework designed to build native iOS and Android applications from a single codebase using web-standard technologies. It provides a comprehensive development environment that includes a unified runtime for testing, cloud-based infrastructure for compiling and signing native binaries, and automated tools for managing the entire mobile release lifecycle, including app store submission.

The framework distinguishes itself through a plugin-based native configuration engine that programmatically modifies project files, allowing developers to integrate native modules without manual intervention. It also features a file-based routing system that maps directory structures directly to navigation paths, and an over-the-air update service that enables the deployment of JavaScript and asset changes directly to user devices, bypassing traditional app store review cycles.

Beyond these core capabilities, the platform offers a wide range of integrated services for managing project metadata, environment variables, and persistent data storage. It includes a robust set of UI components and utilities for handling hardware-level features such as camera access, geolocation, audio and video playback, and push notifications. Developers can also leverage managed cloud services to orchestrate custom build profiles and automate CI/CD workflows.

The project is managed via a command-line interface that facilitates project setup, native module integration, and the generation of custom development builds. Documentation and tooling are provided to support both standalone applications and the integration of Expo into existing native projects.
- [quahac/burp-add-to-sitemap-plusplus](https://awesome-repositories.com/repository/quahac-burp-add-to-sitemap-plusplus.md) (3 ⭐) — Add to sitemap++ is a BURP extension that can read URLs from files or clipboard and add the discovered information on the site map of the selected host(s).
- [google-gemini/gemini-cli](https://awesome-repositories.com/repository/google-gemini-gemini-cli.md) (105,341 ⭐) — This project provides a command-line interface for managing autonomous agent workflows, task orchestration, and system-level automation. It includes a comprehensive framework for defining agent skills, managing persistent memory, and delegating tasks to specialized subagents. Users can configure complex planning modes, execute shell commands with safety constraints, and integrate external tools through standardized protocols.

The platform supports non-interactive execution via a headless mode and provides an event-driven hook framework for custom lifecycle automation. It features centralized configuration for model routing, system prompts, and cost management, alongside a modular extension system for adding custom commands and capabilities. The interface also includes diagnostic tools, file system management utilities, and repository-level automation for maintenance tasks.
- [dokploy/dokploy](https://awesome-repositories.com/repository/dokploy-dokploy.md) (34,901 ⭐) — Dokploy is a self-hosted platform-as-a-service designed to simplify the deployment and management of containerized applications and databases. It provides a centralized control plane that decouples administrative management from application workloads, allowing users to oversee infrastructure across multiple server nodes through a unified web interface or a command-line tool.

The platform distinguishes itself through an extensive library of pre-configured application templates, enabling the rapid deployment of databases, identity providers, and various productivity or development tools. It supports complex orchestration by allowing users to define multi-container services using standard configuration files, which can be managed through automated build pipelines, Git integration, and real-time performance monitoring.

Beyond core deployment, the system includes robust infrastructure management capabilities such as automated backups to external object storage, horizontal and vertical scaling, and granular access control. It also provides secure configuration management, including environment variable synchronization, HTTPS certificate handling, and zero-downtime deployment strategies to ensure application stability and security.

The platform is designed for ease of use, offering an interactive API documentation interface and instructional resources to guide users through installation and configuration. It supports a wide range of modern web frameworks and runtimes, providing a flexible environment for hosting and maintaining services on private server hardware.
- [jamesm0rr1s/add-and-track-custom-issues](https://awesome-repositories.com/repository/jamesm0rr1s-add-and-track-custom-issues.md) (4 ⭐) — Add & Track Custom Issues is a Burp Suite extension that allows users to add and track manual findings that the automated scanner was unable to identify.
- [elevenlabs/elevenlabs-python](https://awesome-repositories.com/repository/elevenlabs-elevenlabs-python.md) (2,873 ⭐) — This Python SDK provides a comprehensive toolkit for synthetic audio generation, voice cloning, and the development of conversational AI agents. It enables the creation of lifelike spoken audio from text, the replication of human voices through custom cloning, and the deployment of real-time voice agents capable of interacting with external large language models.

The library distinguishes itself through deep integration of conversational AI capabilities, including the design of agent personas and the execution of real-time actions via APIs. It supports professional-grade audio production through a variety of specialized tools for multilingual dubbing, studio-quality music generation, and high-fidelity sound effects.

The SDK covers a broad surface of speech and media processing, including real-time audio streaming via WebSockets, speech-to-text transcription with speaker diarization, and the synchronization of audio with visual elements. It also provides utilities for monitoring generation costs and managing agent security through response guardrails and access controls.
- [pytube/pytube](https://awesome-repositories.com/repository/pytube-pytube.md) (13,135 ⭐) — Pytube is a Python library and command line interface for downloading videos, playlists, and captions from YouTube. It functions as both a programmatic tool for metadata extraction and a standalone media downloader.

The project is designed using only the Python standard library to avoid external package dependencies. It utilizes regular expression-based HTML parsing to extract stream URLs and asset details directly from the platform.

The library supports retrieving video metadata and thumbnails, as well as extracting caption tracks. It provides capabilities for downloading entire playlists and reconstructing DASH streams when progressive downloads are unavailable.
- [analyticsinmotion/add-chatgpt-to-microsoft-word](https://awesome-repositories.com/repository/analyticsinmotion-add-chatgpt-to-microsoft-word.md) (105 ⭐) — How to add ChatGPT's Text Completion to Microsoft Word
- [backstage/backstage](https://awesome-repositories.com/repository/backstage-backstage.md) (33,679 ⭐) — Backstage is an open-source framework for building internal developer portals. It provides a centralized, metadata-driven software catalog that tracks ownership, dependencies, and lifecycle status for all technical assets by harvesting configuration files directly from version control systems. The platform is built on a plugin-based modular architecture, allowing teams to extend core functionality through isolated, independently deployable modules that integrate into a unified frontend and backend ecosystem.

The project distinguishes itself through its focus on developer productivity and standardized workflows. It includes a template-driven scaffolding engine that automates the creation of new software projects, ensuring consistent architecture and best practices across teams. The platform also features granular, policy-based access control and secure proxy routing, which manage authentication and protect sensitive internal resources while aggregating infrastructure tools and documentation into a single, searchable interface.

Beyond its core catalog and scaffolding capabilities, the platform supports a wide range of operational needs, including infrastructure monitoring, technical documentation management, and automated notification delivery. It provides standardized patterns for custom plugin development, testing, and interface composition, enabling organizations to tailor the portal to their specific requirements. The system is designed to be extensible, with support for AI integration, usage analytics, and interface localization to accommodate diverse organizational needs.
- [clappr/clappr](https://awesome-repositories.com/repository/clappr-clappr.md) (7,473 ⭐) — Clappr is an HTML5 web media player and plugin-based framework used to render video and audio across different web platforms. It functions as a playback engine supporting adaptive bitrate streaming through HLS and DASH protocols to ensure smooth media delivery.

The project is distinguished by a modular architecture that allows developers to extend player functionality and add support for new media formats via a plugin system. It also includes a dedicated controller for managing digital rights management license requests to secure premium content.

The player covers a broad range of capabilities, including custom user interface composition with DOM overlays, media analytics integration for tracking engagement and buffering performance, and comprehensive playback management for audio and video tracks. It provides tools for configuring player appearance, managing closed captions, and handling DVR interactions.
- [mifi/editly](https://awesome-repositories.com/repository/mifi-editly.md) (5,435 ⭐) — Slick, declarative command line video editing & API
- [carguo/gsyvideoplayer](https://awesome-repositories.com/repository/carguo-gsyvideoplayer.md) (21,440 ⭐) — GSYVideoPlayer is a versatile video player framework for Android that acts as a multi-engine playback wrapper. It provides a unified playback interface to control different video playback kernels, such as ExoPlayer, IJKplayer, and MediaPlayer, ensuring compatibility across diverse mobile hardware.

The framework distinguishes itself through a flexible architecture that allows for core component swapping, enabling the replacement of playback engines, managers, or rendering layers. It includes a dedicated video caching manager for local storage and a customizable media UI component that supports specialized features like bullet chat and external subtitle management.

The project covers a wide range of capabilities, including adaptive bitrate streaming, graphic surface effects for filters and watermarks, and advanced playback controls such as picture-in-picture mode and screen orientation management. It also provides optimizations for playback within scrollable lists and integration for Jetpack Compose.

The framework includes tools for network traffic management, such as custom request headers and proxy configuration, alongside utilities for video frame capture and advertisement management.
- [bigskysoftware/htmx](https://awesome-repositories.com/repository/bigskysoftware-htmx.md) (48,210 ⭐) — HTMX is a hypermedia-driven frontend library that enables the creation of dynamic, asynchronous web applications by extending standard HTML attributes. It functions as a declarative engine that intercepts browser events to trigger network requests, allowing developers to update specific regions of the document with server-rendered HTML fragments. By shifting the logic of UI composition to the server, it minimizes the need for complex client-side state management and imperative JavaScript.

The library distinguishes itself through a progressive enhancement workflow that ensures web interfaces remain functional as standard HTML while upgrading to interactive experiences when JavaScript is enabled. It provides sophisticated control over content updates, including the ability to patch multiple disconnected page regions in a single response and synchronize DOM changes with browser-native CSS transitions. Furthermore, it supports real-time interactivity by establishing persistent, bi-directional communication channels through declarative WebSocket and server-sent event configurations.

Beyond its core engine, the framework includes a comprehensive suite of utilities for managing browser history, coordinating concurrent network requests, and handling form validation. It offers flexible configuration for response processing and allows developers to extend component behavior through event-driven hooks. The project is designed for ease of integration, requiring only a script inclusion to begin transforming standard links and forms into asynchronous, hypermedia-driven interactions.
- [caridy/plex-subtitles-normalizer](https://awesome-repositories.com/repository/caridy-plex-subtitles-normalizer.md) (14 ⭐) — CLI tool to fix subtitles needed by Plex Media Center
- [jianchang512/pyvideotrans](https://awesome-repositories.com/repository/jianchang512-pyvideotrans.md) (17,991 ⭐) — Pyvideotrans is an automated video localization platform designed to transcribe, translate, and dub media content for international distribution. It functions as an end-to-end workflow that combines speech recognition, text translation, and synthetic voice generation to process video files into localized versions.

The system distinguishes itself by offering a choice between local model inference for privacy and integration with third-party cloud services via user-provided credentials. This architecture allows users to maintain control over their billing and data security while utilizing modular pipelines to orchestrate complex tasks like voice cloning and subtitle synchronization.

The software supports large-scale operations through a command-line interface that manages batch task queuing and automated media processing. It utilizes multimedia frameworks to handle audio extraction and video remuxing, including options for lossless export to preserve visual quality. The toolset covers the entire localization lifecycle, from generating timestamped subtitles with speaker identification to producing synthetic voiceovers with adjustable speech parameters.
- [jamesm0rr1s/burpsuite-add-and-track-custom-issues](https://awesome-repositories.com/repository/jamesm0rr1s-burpsuite-add-and-track-custom-issues.md) (4 ⭐) — Add & Track Custom Issues is a Burp Suite extension that allows users to add and track manual findings that the automated scanner was unable to identify.
- [calcom/cal.com](https://awesome-repositories.com/repository/calcom-cal-com.md) (45,760 ⭐) — Cal.com is a comprehensive scheduling infrastructure platform designed to manage availability, booking workflows, and calendar synchronization across multiple users and external services. It provides a backend service for automated appointment scheduling, enabling the creation, confirmation, and management of booking lifecycles through a centralized state machine. The platform also offers embeddable user interface components that allow developers to integrate interactive booking experiences directly into third-party websites.

What distinguishes the platform is its extensible app ecosystem and intelligent automation capabilities. Developers can build custom integrations using a modular plugin architecture, while an AI-driven interface allows for complex scheduling operations and configuration updates via natural language commands. The system includes a sophisticated event routing engine that automatically assigns meetings to hosts based on availability, round-robin rules, and organizational hierarchy, supported by real-time webhook orchestration to keep external systems synchronized.

The platform covers a broad capability surface including CRM data synchronization, granular role-based access control, and secure OAuth-based integration management. It supports advanced booking configurations, such as prefilling form data and monitoring state changes, alongside specialized tools for Salesforce connectivity, including assignment traceability and fuzzy account matching. Users can also leverage local or remote server hosting options to maintain control over their infrastructure and security configurations.
- [tejasq/add-gitignore](https://awesome-repositories.com/repository/tejasq-add-gitignore.md) (722 ⭐) — An interactive CLI tool that adds a .gitignore to your projects.
- [chidiwilliams/buzz](https://awesome-repositories.com/repository/chidiwilliams-buzz.md) (17,903 ⭐) — Buzz is a desktop application that provides a local speech-to-text engine for transcribing and translating audio and video files. By leveraging local machine inference, the software ensures data privacy and offline performance, removing the need for cloud connectivity during media processing.

The application distinguishes itself through a modular plugin architecture that allows for the integration of custom functionality, such as content summarization and automated text formatting, without modifying the core codebase. It also features a speaker diarization pipeline that identifies and labels individual voices within recordings to improve the readability and organization of generated transcripts.

The system supports automated media processing by monitoring specific directories for new files, enabling users to trigger transcription or translation workflows as soon as assets are detected. Users can export results into various standard formats, including plain text and subtitle files, while utilizing hardware acceleration to increase processing speeds for large media files.
- [rubonnek/subtitles-importer](https://awesome-repositories.com/repository/rubonnek-subtitles-importer.md) (12 ⭐) — A comprehensive subtitles importer plugin for the Godot Game Engine.
- [clsid2/mpc-hc](https://awesome-repositories.com/repository/clsid2-mpc-hc.md) (14,378 ⭐) — This project is an open-source multimedia player for Windows designed for high-performance audio and video playback. It functions as a DirectShow-based media renderer that utilizes hardware-accelerated graphics APIs to perform color space conversion and high-quality scaling directly on the display adapter.

The application distinguishes itself through granular control over playback dynamics and visual output. Users can manipulate video orientation through rotation, flipping, and zooming, while also leveraging support for high dynamic range rendering. The player supports automated playback sequences, subtitle track management, and the ability to stream or download media directly from web URLs.

The software architecture relies on modular codec integration and external binary loading to support a wide range of media formats without requiring core application changes. It provides extensive configuration options, including customizable interface layouts, input shortcuts, and registry-based state persistence. The project also includes automated tools for managing build environments and dependencies to ensure consistent compilation from source code.
- [clickhouse/clickhouse](https://awesome-repositories.com/repository/clickhouse-clickhouse.md) (48,229 ⭐) — ClickHouse is a high-performance, columnar analytical database designed for real-time query execution and large-scale data aggregation. It functions as a distributed data warehouse capable of processing petabytes of information, while also providing an embedded engine that integrates directly into applications for native query capabilities without external dependencies. The system is built to handle high-throughput ingestion and complex analytical workloads, delivering millisecond-level latency for interactive dashboards and operational monitoring.

The platform distinguishes itself through advanced storage and execution techniques, including vectorized query processing and a merge tree storage engine that maintains performance during massive insertions. It features adaptive subcolumn mapping for semi-structured data and supports native vector search for machine learning and generative AI applications. To facilitate efficient data movement, the engine utilizes zero-copy shared memory buffers, minimizing overhead when interacting with external analytical tools or processing diverse file formats like Parquet, JSON, and Arrow.

Beyond its core storage and processing capabilities, the project provides a comprehensive suite of tools for observability, security, and data integration. It includes built-in support for natural language querying, automated workflow orchestration for AI agents, and extensive diagnostic features for query plan inspection. The platform also offers robust cloud infrastructure management, including support for private networking, compliant deployment strategies, and integrated billing consolidation.
- [duplicati/duplicati](https://awesome-repositories.com/repository/duplicati-duplicati.md) (14,283 ⭐) — Duplicati is a self-hosted backup server designed to perform encrypted, incremental, and compressed backups to a wide range of local, network, and cloud-based storage providers. It functions as a background service that automates recurring data protection tasks, ensuring that only changed data blocks are stored to maximize efficiency and minimize bandwidth usage.

The project distinguishes itself through a centralized management console that allows for the orchestration of multiple distributed backup agents from a single web-based dashboard. It supports multi-tenant management, enabling the organization of users and resources into hierarchical structures for delegated access and data isolation. Furthermore, it provides robust security features, including AES-256 encryption for data at rest, support for OIDC and SAML2 authentication, and provider-level immutability protections to prevent unauthorized modification of backup archives.

Beyond its core backup capabilities, the system includes comprehensive tools for data lifecycle management, such as automated retention policies, versioning, and integrity verification. It offers flexible configuration through both a graphical interface and a command-line utility, supporting automation scripting and dry-run simulations to verify workflows before execution. The software also handles complex environments by managing locked files and providing metadata indexing to ensure rapid restoration even if the primary configuration database is unavailable.

Duplicati is available through various installation formats, including native system packages, portable archives, and containerized deployments, allowing it to run in diverse operating environments.
- [shleecs/copy-and-paste-networks-for-deep-video-inpainting](https://awesome-repositories.com/repository/shleecs-copy-and-paste-networks-for-deep-video-inpainting.md) (0 ⭐) — Official pytorch implementation for "Copy-and-Paste Networks for Deep Video Inpainting" (ICCV 2019) V.1.0
- [mifi/lossless-cut](https://awesome-repositories.com/repository/mifi-lossless-cut.md) (41,364 ⭐) — LosslessCut is a desktop application designed for the precise editing of video and audio files without re-encoding the underlying media streams. By performing stream copying and container remuxing, the software allows users to cut, merge, and rearrange media segments while maintaining the original bit-perfect quality of the source content.

The application distinguishes itself by utilizing a stream-copying data pipeline that transfers raw media packets directly from source to destination, significantly reducing processing time compared to traditional transcoding workflows. It also functions as a media container remuxing tool, enabling users to repackage streams into different file formats or structures without altering the data itself.

Beyond basic trimming, the tool provides capabilities for high-resolution frame extraction and comprehensive metadata management. Users can capture still images from specific timestamps or scene transitions and import or export timing data and chapter markers to synchronize editing projects with external professional tools. The application is distributed as a cross-platform desktop shell that provides direct access to local file systems for media processing.
- [anshulbshah/blurred-image-to-video](https://awesome-repositories.com/repository/anshulbshah-blurred-image-to-video.md) (15 ⭐) — Converting Motion blurred image to video
- [frost819/bv](https://awesome-repositories.com/repository/frost819-bv.md) (2,804 ⭐) — bv is an Android TV video streaming client and specialized video player designed for television hardware. It features a remote-control user interface and a high-resolution display overlay to facilitate navigation and content consumption on big-screen displays.

The project focuses on television-specific optimizations, including a customizable comment system with adjustable transparency and scroll speed to ensure readability. It maps short-press and long-press hardware signals to specific engagement actions, such as tipping or saving content, to simplify interaction with creator profiles.

The application covers broad capabilities in media playback and content discovery, including precise seek navigation with thumbnail previews, state-persistent playback tracking for session resumption, and metadata-driven grid layouts for browsing video categories. It also includes tools for managing application preferences and visual display density.
- [open-mmlab/mmcv](https://awesome-repositories.com/repository/open-mmlab-mmcv.md) (6,446 ⭐) — mmcv is a foundation library for computer vision based on PyTorch. It provides a comprehensive system for constructing convolutional neural networks, a toolkit for image and video preprocessing, and a collection of high-performance deep learning vision operators.

The project is distinguished by its hardware-accelerated kernels for complex operations such as deformable convolutions and region pooling. It features a configuration-driven framework that allows for the dynamic instantiation of network layers and the registration of custom modules without modifying code.

The library covers a broad range of capability areas, including 3D point cloud processing, sparse tensor manipulations, and comprehensive visual data augmentation workflows. It also provides utilities for color space conversion, geometric and photometric transformations, and video handling, alongside tools for model complexity analysis and inference operation fusion.
- [mhergon/avplayerviewcontroller-subtitles](https://awesome-repositories.com/repository/mhergon-avplayerviewcontroller-subtitles.md) (281 ⭐) — Easy way to show SRT files on AVPlayerViewController
- [gitbookio/gitbook](https://awesome-repositories.com/repository/gitbookio-gitbook.md) (28,902 ⭐) — Gitbook is a documentation-as-code platform designed for centralized technical knowledge management. It functions as a knowledge management system that synchronizes documentation files directly with version control repositories, allowing teams to maintain content alongside their source code.

The platform distinguishes itself through an integrated artificial intelligence layer that provides context-aware search assistance and automated content suggestions. By utilizing block-based content modeling, it enables the construction of structured, modular documentation that can be compiled into static sites or deployed as secure, branded portals.

The system includes comprehensive tools for enterprise-grade publishing, including role-based access control, content localization, and custom domain configuration. It also incorporates observability features that analyze search queries to identify information gaps and improve the overall quality of technical documentation.
