# mvdctop/movie_data_capture

**Attribution required: if you use, quote, or summarise this content, you must credit and link back to [awesome-repositories.com](https://awesome-repositories.com/repository/mvdctop-movie-data-capture).**

7,405 stars · 1,381 forks · Python · GPL-3.0

## Links

- GitHub: https://github.com/mvdctop/Movie_Data_Capture
- awesome-repositories: https://awesome-repositories.com/repository/mvdctop-movie-data-capture.md

## Topics

`emby` `jellyfin` `kodi` `movie` `nas` `plex` `python`

## Description

Movie Data Capture is a media library organizer and movie metadata scraper designed to automatically categorize and name files in a local media collection. It functions as an automated content tagger that identifies movie files and applies descriptive tags by extracting film details from web databases.

The system utilizes an HTTP web scraper to fetch information from external APIs and remote HTML content. It employs a filename pattern parser to extract movie titles and release years from local files using regular expressions, which are then used to automate search queries.

The tool maps scraped metadata to folders on a local file system and persists movie details and organization mappings using a JSON data store. These capabilities support home media server management by ensuring local titles are matched with correct descriptions and technical details.

## Tags

### Content Management & Publishing

- [Movie and Show Metadata](https://awesome-repositories.com/f/content-management-publishing/metadata-tagging/bookmark-metadata-managers/automated-metadata-fetching/media-metadata-fetching/movie-and-show-metadata.md) — Retrieves detailed cinematic information such as titles and plot summaries from external databases. ([source](https://github.com/mvdctop/movie_data_capture#readme))
- [Content Tagging Systems](https://awesome-repositories.com/f/content-management-publishing/content-tagging-systems.md) — Assigns descriptive tags to movie files by matching scraped attributes against naming conventions.
- [Library Organization Automation](https://awesome-repositories.com/f/content-management-publishing/media-management/media-servers/library-organization-automation.md) — Automatically organizes movie files using naming and folder structures based on scraped metadata.

### Web Development

- [Media Metadata Scrapers](https://awesome-repositories.com/f/web-development/web-scrapers/media-metadata-scrapers.md) — Provides a specialized web scraper to extract movie metadata for organizing local media libraries.

### Part of an Awesome List

- [Filename-to-Tag Parsing](https://awesome-repositories.com/f/awesome-lists/devtools/tagging/tag-storage-method-configurators/sidecar-file-tagging/filename-to-tag-parsing.md) — Extracts metadata from filenames to automatically populate descriptive tags for movie files.
- [Movie Information](https://awesome-repositories.com/f/awesome-lists/media/movie-information.md) — Extracts detailed film information from web databases to build a structured digital movie catalog.
- [Automated Taggers](https://awesome-repositories.com/f/awesome-lists/media/movie-information/automated-taggers.md) — Identifies movie files and applies descriptive tags by matching filenames against external data sources.
- [JSON Data Stores](https://awesome-repositories.com/f/awesome-lists/data/json-and-data-parsing/json-data-stores.md) — Stores all movie details and organization mappings in structured JSON files for easy editing.

### Business & Productivity Software

- [Local File Organizers](https://awesome-repositories.com/f/business-productivity-software/local-file-organizers.md) — Maps scraped digital metadata to physical directory structures to organize the local media library.

### Data & Databases

- [Web Scrapers](https://awesome-repositories.com/f/data-databases/web-scrapers.md) — Fetches and parses live internet content from external APIs to collect movie data.
- [JSON-Based Persistence](https://awesome-repositories.com/f/data-databases/data-engineering-infrastructure/data-persistence-storage/data-storage/file-based-storage/local-file-storage/json-based-persistence.md) — Persists movie details and organization mappings using structured JSON files on the local filesystem.
- [Flat-File Storage](https://awesome-repositories.com/f/data-databases/flat-file-storage.md) — Uses a flat-file storage architecture to manage metadata without the overhead of a full database.

### Development Tools & Productivity

- [Web Scraping](https://awesome-repositories.com/f/development-tools-productivity/web-scraping.md) — Implements a web scraper to retrieve movie metadata from external APIs and HTML content.

### Graphics & Multimedia

- [Local Media Library Management](https://awesome-repositories.com/f/graphics-multimedia/local-media-library-management.md) — Coordinates the organization and naming of movie files on a local filesystem using scraped metadata.

### Operating Systems & Systems Programming

- [Filename Parsers](https://awesome-repositories.com/f/operating-systems-systems-programming/system-administration-maintenance/file-system-management/file-systems/filename-templates/filename-parsers.md) — Extracts movie titles and release years from filenames using custom pattern parsing.

### Software Engineering & Architecture

- [Regular Expression-Based Parsing](https://awesome-repositories.com/f/software-engineering-architecture/regular-expression-based-parsing.md) — Employs regular-expression based parsing to extract structured movie data from raw file paths.
- [Media Filename Parsers](https://awesome-repositories.com/f/software-engineering-architecture/syntax-parsing-engines/regex-based-parsers/media-filename-parsers.md) — Uses regular expression engines specifically designed to categorize media files based on filename structures.

### System Administration & Monitoring

- [Home Media Utility Setup](https://awesome-repositories.com/f/system-administration-monitoring/home-media-utility-setup.md) — Provides utility setup for maintaining a curated collection of movie files on home network hardware.
