# yaofanguk/video-subtitle-extractor

**Attribution required: if you use, quote, or summarise this content, you must credit and link back to [awesome-repositories.com](https://awesome-repositories.com/repository/yaofanguk-video-subtitle-extractor).**

8,432 stars · 862 forks · Python · apache-2.0

## Links

- GitHub: https://github.com/YaoFANGUK/video-subtitle-extractor
- awesome-repositories: https://awesome-repositories.com/repository/yaofanguk-video-subtitle-extractor.md

## Topics

`deep-learning` `extract` `hardsub` `ocr` `ripper` `srt` `subrip` `subtitles`

## Description

This project is an optical character recognition tool designed to extract hardcoded subtitles from video frames and convert them into synchronized subtitle files. It functions as a text processor that transforms embedded visual text into a written format to improve video accessibility and translation.

The system uses graphics processing units to increase the speed and accuracy of text recognition. It includes a subtitle cleaning tool that applies custom mapping configurations to filter out watermarks, channel logos, and duplicate lines from the extracted text.

The tool supports batch processing for multiple video files that share identical resolutions and text region settings. It utilizes region-based extraction to isolate subtitles from background noise and synchronizes recognized text strings with specific video timestamps.

## Tags

### Artificial Intelligence & ML

- [Optical Character Recognition](https://awesome-repositories.com/f/artificial-intelligence-ml/optical-character-recognition.md) — Uses optical character recognition to convert visual text found within video frames into digital strings.
- [GPU-Accelerated Inference](https://awesome-repositories.com/f/artificial-intelligence-ml/gpu-accelerated-inference.md) — Offloads text recognition workloads to graphics hardware to reduce processing time and increase throughput.
- [GPU Acceleration](https://awesome-repositories.com/f/artificial-intelligence-ml/gpu-acceleration.md) — Uses GPU acceleration to increase the speed and accuracy of the text recognition process. ([source](https://github.com/YaoFANGUK/video-subtitle-extractor/blob/main/README_en.md))
- [OCR Acceleration](https://awesome-repositories.com/f/artificial-intelligence-ml/gpu-acceleration/ocr-acceleration.md) — Utilizes graphics processing units to significantly increase the throughput and accuracy of subtitle text recognition.
- [GPU-Accelerated Text Processing](https://awesome-repositories.com/f/artificial-intelligence-ml/natural-language-processing/text-tokenization/gpu-accelerated-text-processing.md) — Implements GPU-accelerated processing to reduce total text recognition time. ([source](https://github.com/YaoFANGUK/video-subtitle-extractor#readme))
- [Automated Video Subtitling](https://awesome-repositories.com/f/artificial-intelligence-ml/speech-transcription/automated-video-subtitling.md) — Automates the pipeline of extracting and aligning subtitles from multiple video files. ([source](https://github.com/YaoFANGUK/video-subtitle-extractor#readme))

### Graphics & Multimedia

- [Hardcoded Subtitle Extractors](https://awesome-repositories.com/f/graphics-multimedia/hardcoded-subtitle-extractors.md) — Extracts embedded text from video frames and saves it as synchronized written subtitle files.
- [Temporal Subtitle Synchronization](https://awesome-repositories.com/f/graphics-multimedia/temporal-subtitle-synchronization.md) — Maps recognized text strings to specific video timestamps to generate synchronized subtitle files.
- [Batch Video Processing](https://awesome-repositories.com/f/graphics-multimedia/batch-video-processing.md) — Enables subtitles to be extracted from multiple video files simultaneously when resolution and text regions are identical.
- [Video File Processors](https://awesome-repositories.com/f/graphics-multimedia/media-processing-analysis/media-manipulation/media-processing/video-analysis-processing/video-file-processors.md) — Provides capabilities to process multiple video files for text extraction based on shared resolution and region settings.

### Content Management & Publishing

- [Video Transcript Extraction](https://awesome-repositories.com/f/content-management-publishing/content-processing-transformation/content-extraction-engines/video-transcript-extraction.md) — Extracts subtitles and timing data from video frames to create written transcripts for accessibility. ([source](https://github.com/YaoFANGUK/video-subtitle-extractor/blob/main/.condarc))
- [Subtitle Processing](https://awesome-repositories.com/f/content-management-publishing/media-management/subtitle-management-systems/subtitle-synchronization/subtitle-processing.md) — Implements a full pipeline for extracting, synchronizing, and processing hardcoded subtitles from video content. ([source](https://github.com/YaoFANGUK/video-subtitle-extractor/blob/main/README_en.md))
- [Hardcoded Subtitle Extraction](https://awesome-repositories.com/f/content-management-publishing/media-management/subtitle-management-systems/subtitle-synchronization/subtitle-processing/extraction-tools/hardcoded-subtitle-extraction.md) — Converts text found within video frames into synchronized files using local optical character recognition. ([source](https://github.com/YaoFANGUK/video-subtitle-extractor#readme))
- [Hardcoded Subtitle Extraction](https://awesome-repositories.com/f/content-management-publishing/media-management/subtitle-management-systems/subtitle-synchronization/subtitle-processing/hardcoded-subtitle-extraction.md) — Converts text embedded directly into video frames into editable subtitle files using local optical character recognition.
- [Automated Subtitle Cleaning](https://awesome-repositories.com/f/content-management-publishing/media-management/subtitle-management-systems/subtitle-synchronization/subtitle-processing/automated-subtitle-cleaning.md) — Removes watermarks, logos, and recurring noise from extracted text using custom mapping configurations.
- [Video Accessibility Tools](https://awesome-repositories.com/f/content-management-publishing/video-accessibility-tools.md) — Creates written transcripts and synchronized subtitles for videos to improve viewer accessibility.

### Data & Databases

- [Coordinate-Based Extraction](https://awesome-repositories.com/f/data-databases/text-processing-utilities/text-extraction/coordinate-based-extraction.md) — Isolates subtitles from background noise by focusing analysis on specific coordinate areas within video frames.
- [Text Cleaning Pipelines](https://awesome-repositories.com/f/data-databases/text-normalization/text-cleaning-pipelines.md) — Standardizes extracted text by removing duplicates and visual noise through a cleaning pipeline. ([source](https://github.com/YaoFANGUK/video-subtitle-extractor/blob/main/README.md))
- [Subtitle Noise Reduction](https://awesome-repositories.com/f/data-databases/text-normalization/text-cleaning-pipelines/subtitle-noise-reduction.md) — Removes watermarks, channel logos, and duplicate lines from extracted text using custom mapping configurations.

### Development Tools & Productivity

- [Watermark and Logo Filtering](https://awesome-repositories.com/f/development-tools-productivity/line-based-text-filters/watermark-and-logo-filtering.md) — Filters out watermarks, channel logos, and duplicate lines using a custom mapping configuration.

### User Interface & Experience

- [Watermark Removal](https://awesome-repositories.com/f/user-interface-experience/content-rendering-components/image-overlays/media-watermarking-tools/watermark-removal.md) — Removes visual watermarks and channel logos from the final text output. ([source](https://github.com/YaoFANGUK/video-subtitle-extractor#readme))
