# browser-use/video-use

**Attribution required: if you use, quote, or summarise this content, you must credit and link back to [awesome-repositories.com](https://awesome-repositories.com/repository/browser-use-video-use).**

9,743 stars · 1,404 forks · Python · MIT

## Links

- GitHub: https://github.com/browser-use/video-use
- awesome-repositories: https://awesome-repositories.com/repository/browser-use-video-use.md

## Description

This project is an AI video post-production suite that uses large language models and programmatic tools to automate editing, transcription, and subtitle generation. It functions as an AI editing agent that translates natural language instructions into shell commands, providing a programmatic interface for manipulating media via FFmpeg.

The toolkit includes a motion graphics engine that generates technical animations and visual overlays through code-driven rendering and mathematical definitions. It distinguishes itself by combining an AI-powered transcriber for word-level timestamps with an automated system for removing filler words, false starts, and dead space.

The system covers a broad range of post-production capabilities, including audio-based video cutting, cinematic color grading through filter chains, and the integration of synthetic AI voiceovers. It also provides observability tools such as timeline visualization through composite filmstrips and waveforms, as well as self-evaluation loops to validate rendered output for visual jumps or audio pops.

Session data and editing history are persisted in text files to maintain project continuity across different execution contexts.

## Tags

### Artificial Intelligence & ML

- [Video Editing Agents](https://awesome-repositories.com/f/artificial-intelligence-ml/agentic-llm-frameworks/video-editing-agents.md) — Implements an AI agent that translates natural language instructions into actionable shell commands for automated video editing.
- [Audio Transcription](https://awesome-repositories.com/f/artificial-intelligence-ml/audio-transcription.md) — Converts spoken audio into text transcripts with word-level timestamps and speaker identification. ([source](https://github.com/browser-use/video-use/blob/main/README.md))
- [Automated Video Transcribers](https://awesome-repositories.com/f/artificial-intelligence-ml/speech-transcription/automated-video-transcribers.md) — Converts audio to word-level timestamps for precise cutting and automated subtitle burning.

### Graphics & Multimedia

- [AI Video Editing Automation](https://awesome-repositories.com/f/graphics-multimedia/ai-video-editing-automation.md) — Uses natural language and coding agents to automate cutting, color grading, and assembling video projects.
- [Animation & Motion Graphics](https://awesome-repositories.com/f/graphics-multimedia/animation-motion.md) — Produces programmatic animations and overlays for diagrams, UI mockups, and kinetic typography. ([source](https://github.com/browser-use/video-use/blob/main/SKILL.md))
- [Programmatic Motion Graphics](https://awesome-repositories.com/f/graphics-multimedia/animation-motion/programmatic-motion-graphics.md) — Ships a motion graphics engine that generates technical animations and visual overlays through code-driven rendering.
- [Animation Sequence Renderers](https://awesome-repositories.com/f/graphics-multimedia/animation-sequence-renderers.md) — Processes programmatic motion code into sequenced frames for final video files.
- [Transcription-Driven Slicing](https://awesome-repositories.com/f/graphics-multimedia/audio-slicing/transcription-driven-slicing.md) — Determines precise cut points by mapping word-level transcriptions and silence gaps to video frame boundaries.
- [FFmpeg Wrappers](https://awesome-repositories.com/f/graphics-multimedia/ffmpeg-wrappers.md) — Provides a programmatic interface for manipulating media using FFmpeg filters and shell-based processing.
- [Filler Audio Removal](https://awesome-repositories.com/f/graphics-multimedia/filler-audio-removal.md) — Identifies and cuts out filler words, false starts, and dead space to tighten video pacing. ([source](https://github.com/browser-use/video-use/blob/main/README.md))
- [Text-Driven Video Editing](https://awesome-repositories.com/f/graphics-multimedia/media-production-suites/graphics-media-assets/creative-and-cultural-media/generative-media-tools/text-driven-video-editing.md) — Translates natural language instructions into shell commands to automate video cutting and color grading. ([source](https://github.com/browser-use/video-use/blob/main/pyproject.toml))
- [Programmatic Animations](https://awesome-repositories.com/f/graphics-multimedia/programmatic-animations.md) — Creates programmatic animations and visual effects using mathematical definitions and code-driven rendering. ([source](https://github.com/browser-use/video-use/blob/main/pyproject.toml))
- [Audio-Driven Cutting](https://awesome-repositories.com/f/graphics-multimedia/video-production/video-editing/audio-driven-cutting.md) — Identifies optimal cut points based on word boundaries and silence gaps to maintain natural pacing. ([source](https://github.com/browser-use/video-use/blob/main/SKILL.md))
- [Color Grading](https://awesome-repositories.com/f/graphics-multimedia/color-grading.md) — Applies cinematic visual styles by passing video segments through sequential image adjustment filters.
- [Final Episode Assembly](https://awesome-repositories.com/f/graphics-multimedia/final-episode-assembly.md) — Processes extracted segments and adds overlays and subtitles to produce a final cohesive video export. ([source](https://github.com/browser-use/video-use/blob/main/SKILL.md))
- [Technical Animation Generation](https://awesome-repositories.com/f/graphics-multimedia/image-editing-processing/image-processing/image-sequence-processors/animation-frame-sequencers/generative-animation-sequences/technical-animation-generation.md) — Produces mathematical and technical animated videos from text prompts by generating code and rendering scenes. ([source](https://github.com/browser-use/video-use/tree/main/skills/manim-video))
- [Text-to-Speech Engines](https://awesome-repositories.com/f/graphics-multimedia/media-processing-analysis/audio-processing-systems/audio-processing/text-to-speech-engines/text-to-speech-engines.md) — Creates audio narrations for video content by integrating with external text-to-speech services. ([source](https://github.com/browser-use/video-use/blob/main/.env.example))
- [Analysis-Driven Editing Guides](https://awesome-repositories.com/f/graphics-multimedia/media-processing-analysis/media-manipulation/media-processing/video-analysis-processing/video-metadata-extraction/analysis-driven-editing-guides.md) — Processes audio transcripts to generate visual composite filmstrips that guide editing decisions without processing raw frames. ([source](https://github.com/browser-use/video-use#readme))
- [Post-Production Environments](https://awesome-repositories.com/f/graphics-multimedia/media-production-suites/media-management-production/production-control-systems/professional-production-control/post-production-environments.md) — Manages the end-to-end process of removing filler audio, applying color grades, and exporting final renders.
- [Timeline Visualization Tools](https://awesome-repositories.com/f/graphics-multimedia/timeline-visualization-tools.md) — Generates low-resolution visual and waveform previews to guide editing decisions.
- [Video Timeline Visualizers](https://awesome-repositories.com/f/graphics-multimedia/video-timeline-visualizers.md) — Generates filmstrip and waveform images to assist in making precise editing and cutting decisions. ([source](https://github.com/browser-use/video-use/blob/main/SKILL.md))
- [Visual Overlays](https://awesome-repositories.com/f/graphics-multimedia/visual-overlays.md) — Provides the ability to generate visual overlays using specialized animation tools to enhance video content. ([source](https://github.com/browser-use/video-use/blob/main/README.md))

### Part of an Awesome List

- [Natural Language Automation](https://awesome-repositories.com/f/awesome-lists/productivity/task-automation/natural-language-automation.md) — Translates natural language instructions into shell commands to automate video processing tasks.
- [Text-to-Speech](https://awesome-repositories.com/f/awesome-lists/ai/text-to-speech.md) — Adds synthetic speech and narrated audio tracks to video content using text-to-speech services.

### Content Management & Publishing

- [Automated Subtitle Generators](https://awesome-repositories.com/f/content-management-publishing/media-management/subtitle-management-systems/timestamped-subtitle-generators/automated-subtitle-generators.md) — Creates timed text overlays with configurable chunking and positioning using an automated transcription and embedding workflow. ([source](https://github.com/browser-use/video-use/blob/main/SKILL.md))
- [Burned-In Subtitle Rendering](https://awesome-repositories.com/f/content-management-publishing/media-management/subtitle-management-systems/timestamped-subtitle-generators/burned-in-subtitle-rendering.md) — Overlays customizable text chunks directly onto the video based on transcribed audio timestamps. ([source](https://github.com/browser-use/video-use/blob/main/README.md))

### Repository Format

- [AI Video Post-Production Suites](https://awesome-repositories.com/f/repository-format/open-source-desktop-applications/ai-video-post-production-suites.md) — Provides a toolkit for automating transcription, filler word removal, and subtitle generation through large language models.

### Testing & Quality Assurance

- [Render Quality Validation](https://awesome-repositories.com/f/testing-quality-assurance/render-quality-validation.md) — Executes a self-evaluation loop that checks cut boundaries for visual jumps or audio pops before finalizing. ([source](https://github.com/browser-use/video-use#readme))
