# mli/autocut

**Attribution required: if you use, quote, or summarise this content, you must credit and link back to [awesome-repositories.com](https://awesome-repositories.com/repository/mli-autocut).**

7,579 stars · 793 forks · Python · apache-2.0

## Links

- GitHub: https://github.com/mli/autocut
- awesome-repositories: https://awesome-repositories.com/repository/mli-autocut.md

## Description

Autocut is a text-based video editor and automatic speech recognition tool. It allows users to cut and merge video clips by modifying a text transcript instead of using a traditional timeline.

The system operates as an FFmpeg video processor and subtitle manipulation utility. It converts spoken audio into text and compacts subtitle files into simplified formats, enabling the removal of unwanted video segments by deleting corresponding sentences from a transcription file.

The project covers automated video transcription, non-linear video cutting, and subtitle file management. It supports hardware acceleration to increase the processing speed of transcription and video manipulation tasks.

## Tags

### Graphics & Multimedia

- [Transcript-Based Editing](https://awesome-repositories.com/f/graphics-multimedia/non-linear-video-editing/transcript-based-editing.md) — Enables non-linear video cutting by mapping text timestamps to video frames within a transcription file.
- [FFmpeg Wrappers](https://awesome-repositories.com/f/graphics-multimedia/ffmpeg-wrappers.md) — Provides a processing engine that leverages FFmpeg for the precise cutting and merging of video segments.
- [Automated Processors](https://awesome-repositories.com/f/graphics-multimedia/media-processing-analysis/media-manipulation/media-processing/video-analysis-processing/video-file-processors/automated-processors.md) — Uses an automated FFmpeg-based pipeline to execute precise cuts and concatenations based on timestamped text.
- [Transcript-Driven Editors](https://awesome-repositories.com/f/graphics-multimedia/media-production-suites/graphics-media-assets/creative-and-cultural-media/generative-media-tools/text-driven-video-editing/transcript-driven-editors.md) — Provides a text-based video editor that enables cutting and merging by editing a transcript instead of a timeline.
- [Non-Linear Video Editing](https://awesome-repositories.com/f/graphics-multimedia/non-linear-video-editing.md) — Allows removing unwanted video segments through the modification of a transcription file instead of manual scrubbing.
- [Text-Based Cutting](https://awesome-repositories.com/f/graphics-multimedia/video-production/video-editing/text-based-cutting.md) — Implements a system for cutting and merging video clips by deleting or rearranging sentences in a text transcript.
- [Text-Based Trimming](https://awesome-repositories.com/f/graphics-multimedia/video-production/video-editing/text-based-trimming.md) — Allows the removal of unwanted video segments by deleting corresponding sentences from a transcription or subtitle file. ([source](https://github.com/mli/autocut/blob/main/README.md))
- [Video Assembly](https://awesome-repositories.com/f/graphics-multimedia/image-editing-processing/image-processing/image-sequence-processors/video-assembly.md) — Assembles final video clips by combining segments based on a modified text transcript. ([source](https://github.com/mli/autocut/blob/main/tea.yaml))
- [Video Editing](https://awesome-repositories.com/f/graphics-multimedia/video-production/video-editing.md) — Combines multiple edited video segments into a single output file based on a selection list. ([source](https://github.com/mli/autocut#readme))

### Artificial Intelligence & ML

- [Automatic Speech Recognition](https://awesome-repositories.com/f/artificial-intelligence-ml/automatic-speech-recognition.md) — Functions as an automatic speech recognition system that converts video audio into editable text files.
- [Speech to Text Transcription](https://awesome-repositories.com/f/artificial-intelligence-ml/speech-to-text-transcription.md) — Converts spoken audio from video files into text formats using configurable speech-to-text models. ([source](https://github.com/mli/autocut#readme))
- [Automated Video Transcribers](https://awesome-repositories.com/f/artificial-intelligence-ml/speech-transcription/automated-video-transcribers.md) — Automates the conversion of spoken audio from video files into time-synced text transcripts.
- [GPU Acceleration](https://awesome-repositories.com/f/artificial-intelligence-ml/gpu-acceleration.md) — Implements hardware acceleration via GPU to speed up the deep learning models used for audio transcription.

### Data & Databases

- [Transcript-to-Timestamp Mapping](https://awesome-repositories.com/f/data-databases/pointer-based-navigation/offset-based-addressing/timestamp-based-offset-lookups/transcript-to-timestamp-mapping.md) — Generates precise video edit points by mapping text indices from a transcript to specific timecodes.

### Part of an Awesome List

- [Lossless Concatenations](https://awesome-repositories.com/f/awesome-lists/media/video-playback/segmented/lossless-concatenations.md) — Ships the ability to merge pre-cut video portions losslessly without re-encoding the entire file.

### Content Management & Publishing

- [Subtitle Processing](https://awesome-repositories.com/f/content-management-publishing/media-management/subtitle-management-systems/subtitle-synchronization/subtitle-processing.md) — Provides utilities for cleaning and compacting subtitle files to make manual text editing more efficient.
- [Subtitle Format Converters](https://awesome-repositories.com/f/content-management-publishing/media-management/subtitle-management-systems/subtitle-synchronization/subtitle-processing/subtitle-format-converters.md) — Provides a utility for converting standard subtitle files into compact versions for easier manual cleaning.