# Photo and Document Metadata Extractors

> Search results for `pull metadata and geolocation from photos and documents` on awesome-repositories.com. 115 total matches; showing the first 50.

Explore on the web: https://awesome-repositories.com/q/pull-metadata-and-geolocation-from-photos-and-documents

**Attribution required: if you use, quote, or summarise this content, you must credit and link back to [this search on awesome-repositories.com](https://awesome-repositories.com/q/pull-metadata-and-geolocation-from-photos-and-documents).**

## Results

- [gam-team/gam](https://awesome-repositories.com/repository/gam-team-gam.md) (4,206 ⭐) — GAM is a command-line tool for administering Google Workspace and Cloud Identity. It translates command-line arguments into structured API calls, enabling administrators to manage users, groups, organizational units, and domain settings across a Google Workspace environment. The tool handles authentication through OAuth2 flows, service accounts, and workload identity federation, and supports multi-tenant configurations for managing multiple domains or cloud projects from a single installation.

GAM distinguishes itself through its batch processing and automation capabilities. It can process la
- [duplicati/duplicati](https://awesome-repositories.com/repository/duplicati-duplicati.md) (14,283 ⭐) — Duplicati is a self-hosted backup server designed to perform encrypted, incremental, and compressed backups to a wide range of local, network, and cloud-based storage providers. It functions as a background service that automates recurring data protection tasks, ensuring that only changed data blocks are stored to maximize efficiency and minimize bandwidth usage.

The project distinguishes itself through a centralized management console that allows for the orchestration of multiple distributed backup agents from a single web-based dashboard. It supports multi-tenant management, enabling the or
- [icloud-photos-downloader/icloud_photos_downloader](https://awesome-repositories.com/repository/icloud-photos-downloader-icloud-photos-downloader.md) (12,046 ⭐) — This tool is a command-line utility designed to synchronize and archive media from cloud storage to local directories. It functions as an automated backup service that maintains a local mirror of remote photo libraries, ensuring that local storage remains current with remote changes through periodic monitoring and incremental updates.

The project distinguishes itself through its support for persistent, containerized background execution, which allows for continuous, automated management of media collections. It provides robust multi-account isolation, enabling users to manage multiple indepen
- [expo/expo](https://awesome-repositories.com/repository/expo-expo.md) (50,111 ⭐) — Expo is a universal mobile framework designed to build native iOS and Android applications from a single codebase using web-standard technologies. It provides a comprehensive development environment that includes a unified runtime for testing, cloud-based infrastructure for compiling and signing native binaries, and automated tools for managing the entire mobile release lifecycle, including app store submission.

The framework distinguishes itself through a plugin-based native configuration engine that programmatically modifies project files, allowing developers to integrate native modules wit
- [awesome-selfhosted/awesome-selfhosted](https://awesome-repositories.com/repository/awesome-selfhosted-awesome-selfhosted.md) (299,516 ⭐) — This project is a community-curated directory of open-source software designed for deployment in private server environments and home labs. It serves as a comprehensive resource for discovering independent, self-hosted alternatives to mainstream cloud services, enabling users to maintain full data ownership and control over their digital infrastructure.

The directory is structured through a hierarchical taxonomy that organizes a vast collection of applications into logical categories, ranging from media management and data analytics to private communication and team productivity tools. It dis
- [hexojs/hexo](https://awesome-repositories.com/repository/hexojs-hexo.md) (41,768 ⭐) — Hexo is a command-line static site generator designed for content-driven blogging and website creation. It functions as a structured framework that transforms plain text files and markdown into production-ready static websites, utilizing a template-based rendering engine to separate site content from visual presentation.

The project is distinguished by its event-driven build pipeline, which manages the entire site lifecycle through a series of hooks for file processing, asset generation, and deployment. Developers can extend the system’s core capabilities through a modular plugin architecture
- [colinfay/geoloc](https://awesome-repositories.com/repository/colinfay-geoloc.md) (47 ⭐) — The goal of geoloc is to provide access to the geolocation API of modern web browsers, in order to get the location of the user in a Shiny app.
- [avaloniaui/avalonia](https://awesome-repositories.com/repository/avaloniaui-avalonia.md) (30,986 ⭐) — Avalonia is a cross-platform desktop framework that enables the creation of native-feeling applications for Windows, macOS, and Linux from a single codebase. It functions as a declarative UI toolkit, allowing developers to define complex visual hierarchies and interface structures using a markup-based syntax that maps directly to underlying object properties. By utilizing the Model-View-ViewModel architectural pattern, the framework facilitates a clean separation between application logic and user interface layout, which simplifies unit testing and component maintenance.

The framework disting
- [cockroachdb/cockroach](https://awesome-repositories.com/repository/cockroachdb-cockroach.md) (32,207 ⭐) — Cockroach is a distributed SQL database designed to scale horizontally across multiple nodes while maintaining strict ACID compliance and global data consistency. It functions as a relational database engine that automatically partitions data into ranges, rebalancing them across a cluster to accommodate growing storage and throughput requirements. By utilizing a distributed consensus protocol, the system ensures that all nodes agree on the order of operations, providing fault tolerance and continuous availability even in the event of hardware failures.

The system distinguishes itself through
- [ands/lightmapper](https://awesome-repositories.com/repository/ands-lightmapper.md) (1,535 ⭐) — lightmapper.h is a C/C++ single-file library for lightmap baking by using your existing OpenGL renderer.
- [obgnail/typora_plugin](https://awesome-repositories.com/repository/obgnail-typora-plugin.md) (3,837 ⭐) — This project is a JavaScript-based plugin framework for the Typora markdown editor. It provides a development platform to integrate custom functional logic, UI elements, and keyboard shortcuts into the editor environment.

The framework distinguishes itself by offering a JSON-RPC remote interface, allowing external programs to control the editor and trigger plugin functions. It also includes a dedicated publishing system to export and upload markdown documents to external blogging platforms via API requests.

The project covers a wide range of capabilities, including editor workflow automation
- [sinelaw/fresh](https://awesome-repositories.com/repository/sinelaw-fresh.md) (5,914 ⭐) — Fresh is a terminal-based code editor built in Rust that provides a full-featured editing experience with familiar keybindings, mouse support, and multiple cursors. It runs entirely in the terminal while offering capabilities typically found in graphical editors, including split panes, a command palette, and integrated language server protocol support for real-time diagnostics, completions, go-to-definition, and code actions across multiple languages.

The editor distinguishes itself through a plugin system that runs sandboxed TypeScript plugins in a QuickJS runtime, with an asynchronous bridge
- [ente-io/ente](https://awesome-repositories.com/repository/ente-io-ente.md) (27,281 ⭐) — Ente is a privacy-focused platform for end-to-end encrypted storage and two-factor authentication management. It functions as a zero-knowledge identity provider, ensuring that all cryptographic operations, key derivation, and data encryption occur locally on the user's device. By maintaining this architecture, the service provider remains unable to access or decrypt any stored personal information or authentication credentials.

The platform distinguishes itself through a combination of on-device intelligence and resilient data distribution. It utilizes a local machine learning engine to perfo
- [ands/seamoptimizer](https://awesome-repositories.com/repository/ands-seamoptimizer.md) (243 ⭐) — A C/C++ single-file library that minimizes the hard transition errors of disjoint edges in lightmaps. It is based on an idea presented by Michał Iwanicki in the talk Lighting Technology of "The Last Of Us". A least squares solver is used to find a minimal error solution to the problem of sampling…
- [colinhacks/zod](https://awesome-repositories.com/repository/colinhacks-zod.md) (43,036 ⭐) — Zod is a TypeScript-first schema declaration and validation library designed to ensure end-to-end data integrity. It functions as a runtime type guard, allowing developers to define complex data structures through a declarative, chainable syntax. By using these schema definitions, the library automatically derives static TypeScript types, eliminating the need for manual type duplication and ensuring that runtime data matches expected application contracts.

The library distinguishes itself through functional schema composition, which enables the creation of hierarchical structures by nesting a
- [vote-and-verify/vote-and-verify](https://awesome-repositories.com/repository/vote-and-verify-vote-and-verify.md) (19 ⭐) — A Vote-and-Verify Strategy for Fast Spatial Verification in Image Retrieval
- [heyputer/puter](https://awesome-repositories.com/repository/heyputer-puter.md) (42,318 ⭐) — Puter is a browser-based desktop environment and cloud-native development platform that provides a virtualized graphical workspace. It enables developers to build and deploy full-stack web applications by integrating cloud storage, authentication, and serverless backend logic directly into the browser, eliminating the need for traditional server infrastructure.

The platform distinguishes itself through a unified cloud storage layer and a distributed network runtime that facilitates peer-to-peer communication and cross-origin resource fetching. It features a sophisticated cross-window orchestr
- [wei/pull](https://awesome-repositories.com/repository/wei-pull.md) (7,154 ⭐) — Pull is an automated pull request manager designed to synchronize GitHub repository forks with their upstream sources. It acts as an orchestrator that creates and manages pull requests to integrate remote changes and keep development forks aligned with original project sources.

The tool is configuration-driven, allowing users to define specific git integration strategies such as merging, squashing, rebasing, or hard resetting. It integrates with GitHub Actions to manage branch protection and the assignment of reviewers, assignees, and custom labels.

Synchronization can be triggered through p
- [awesomedata/awesome-public-datasets](https://awesome-repositories.com/repository/awesomedata-awesome-public-datasets.md) (75,979 ⭐) — This project is a community-maintained, open-access directory of high-quality public datasets. It serves as a centralized reference point for researchers, developers, and data scientists to locate reliable information sources across a wide spectrum of industries and scientific fields. By providing a structured index, the repository facilitates the discovery of data necessary for exploratory analysis, machine learning model training, and the development of data-intensive applications.

The directory distinguishes itself through a lightweight, platform-agnostic approach to resource indexing that
- [staabm/annotate-pull-request-from-checkstyle](https://awesome-repositories.com/repository/staabm-annotate-pull-request-from-checkstyle.md) (201 ⭐) — cs2pr - Annotate a GitHub Pull Request based on a Checkstyle XML-report within your GitHub Action
- [hammerspoon/hammerspoon](https://awesome-repositories.com/repository/hammerspoon-hammerspoon.md) (14,497 ⭐) — Hammerspoon is a programmable automation engine for macOS that enables deep system-level control through a Lua scripting environment. By bridging high-level scripts with native Objective-C APIs, it allows users to interact with the operating system's accessibility tree, intercept hardware input streams, and manage the lifecycle of running applications.

The project distinguishes itself through an event-driven architecture that registers asynchronous hooks for system notifications and hardware events. This allows for real-time automation, such as remapping keyboard and mouse inputs, managing wi
- [gofr-dev/gofr](https://awesome-repositories.com/repository/gofr-dev-gofr.md) (21,321 ⭐) — Gofr is a comprehensive framework for building production-ready microservices in Go. It provides a unified toolkit for developing RESTful APIs and gRPC services, offering built-in support for observability, database management, and distributed system communication.

The framework distinguishes itself through its focus on developer productivity and system resilience. It automates common backend tasks such as CRUD handler generation, schema-driven code creation, and database migration orchestration, while preventing race conditions in clustered environments. To maintain stability, it includes in
- [fastapi/fastapi](https://awesome-repositories.com/repository/fastapi-fastapi.md) (99,260 ⭐) — FastAPI is a web framework for building APIs with Python. It leverages standard language type hints to provide automatic data validation, request parsing, and interactive API documentation generation. The framework supports asynchronous request handling and manages execution contexts to prevent blocking the main event loop.

The project includes a dependency injection system that allows for the resolution and injection of reusable components into request handlers. This system supports request-scoped caching, lifecycle management, and integration with security mechanisms like OAuth2 and JSON We
- [loup-v/geolocation](https://awesome-repositories.com/repository/loup-v-geolocation.md) (230 ⭐) — Flutter geolocation plugin for Android and iOS.
- [baseflow/flutter-geolocator](https://awesome-repositories.com/repository/baseflow-flutter-geolocator.md) (1,329 ⭐) — Android and iOS Geolocation plugin for Flutter
- [iawia002/annie](https://awesome-repositories.com/repository/iawia002-annie.md) (31,414 ⭐) — Annie is a command-line video downloader and web video extraction library written in Go. It functions as a concurrent media downloader designed to fetch video files and playlists from websites via URLs.

The tool distinguishes itself through a proxy-aware network layer that supports SOCKS5 and HTTP proxies to bypass regional content restrictions. It also incorporates session cookie integration and referrer spoofing to facilitate the download of authenticated or age-gated content.

The project provides capabilities for bulk media acquisition, including batch downloading from text files and extr
- [datahub-project/datahub](https://awesome-repositories.com/repository/datahub-project-datahub.md) (12,141 ⭐) — DataHub is a metadata management platform designed to unify technical, operational, and business context across diverse data ecosystems. By utilizing a graph-based metadata model and an event-driven ingestion architecture, it creates a centralized source of truth that maps complex data relationships, lineage, and ownership. This foundational framework enables organizations to maintain a synchronized view of their data landscape, supporting both human-led discovery and automated data operations.

The platform distinguishes itself through its focus on grounding artificial intelligence and autono
- [soimort/you-get](https://awesome-repositories.com/repository/soimort-you-get.md) (56,839 ⭐) — This project is a command-line utility designed to fetch video, audio, and image content from a wide range of web platforms. It functions by parsing page metadata and utilizing modular, site-specific scripts to extract direct media stream URLs from complex web structures, enabling the local archiving of digital media for offline use.

The tool distinguishes itself through its ability to handle authenticated content, allowing users to inject browser-stored session cookies to access restricted or private media. It also supports real-time media streaming by piping remote content directly into ext
- [code-and-comment/code-and-comment](https://awesome-repositories.com/repository/code-and-comment-code-and-comment.md) (17 ⭐) — PWA to add comments to a GitHub file.
- [espocrm/espocrm](https://awesome-repositories.com/repository/espocrm-espocrm.md) (2,799 ⭐) — EspoCRM is an open-source customer relationship management platform and SQL-based business application. It serves as a centralized web interface for tracking leads, opportunities, and contacts, providing a sales pipeline manager and a customizable business logic engine.

The platform is distinguished by its ability to function as a custom business application builder, allowing for the creation of tailored entities and automated workflows. It integrates marketing automation tools for campaign coordination and a structured customer support ticketing system for case management.

The system covers
- [appsecco/breaking-and-pwning-apps-and-servers-aws-azure-training](https://awesome-repositories.com/repository/appsecco-breaking-and-pwning-apps-and-servers-aws-azure-training.md) (952 ⭐) — Course content, lab setup instructions and documentation of our very popular Breaking and Pwning Apps and Servers on AWS and Azure hands-on training!
- [tmont/audio-metadata](https://awesome-repositories.com/repository/tmont-audio-metadata.md) (60 ⭐) — This is a tinyish (2.1K gzipped) library to extract metadata from audio files. Specifically, it can extract ID3v1, ID3v2 and Vorbis comments (i.e. metadata in OGG containers).
- [janeczku/calibre-web](https://awesome-repositories.com/repository/janeczku-calibre-web.md) (17,500 ⭐) — Calibre-web is a self-hosted web application that provides a browser-based interface for browsing, managing, and reading digital book collections stored in a library database. It functions as a comprehensive library management system, allowing users to organize large collections, edit metadata, and perform automated content updates through a centralized administrative dashboard.

The platform distinguishes itself by integrating directly with external infrastructure to extend the capabilities of a standard digital library. It supports remote storage mapping to host files on cloud providers, uti
- [grpc/grpc](https://awesome-repositories.com/repository/grpc-grpc.md) (44,891 ⭐) — gRPC is a language-agnostic remote procedure call framework designed for high-performance communication between distributed services. It utilizes a structured interface definition language to generate consistent client stubs and server skeletons, enabling applications to invoke methods on remote servers as if they were local objects. By leveraging the HTTP/2 transport layer, the framework supports efficient binary serialization and multiplexed data exchange across diverse programming environments.

The framework distinguishes itself through its support for flexible communication patterns, incl
- [conductor-oss/conductor](https://awesome-repositories.com/repository/conductor-oss-conductor.md) (31,962 ⭐) — Conductor is a durable workflow engine designed to orchestrate complex, long-running business processes and autonomous agent loops. It functions as a stateful execution platform that persists the entire history of a process, ensuring that workflows remain reliable and recoverable across infrastructure failures, system restarts, and transient network errors. By managing task lifecycles, worker polling, and state transitions, it provides a centralized coordination layer for distributed systems.

The platform distinguishes itself through its specialized support for AI agent orchestration, allowin
- [photoprism/photoprism](https://awesome-repositories.com/repository/photoprism-photoprism.md) (39,816 ⭐) — PhotoPrism is a self-hosted digital asset management platform designed to organize, classify, and manage large collections of photos and videos on personal infrastructure. It functions as a private alternative to cloud-based services, ensuring that all media remains under the user's control. The platform utilizes neural-network-based media analysis to automatically detect objects, faces, and locations, providing a comprehensive, AI-powered approach to library organization.

The project distinguishes itself through its containerized architecture, which simplifies deployment and lifecycle manage
- [cyrus-and/prof](https://awesome-repositories.com/repository/cyrus-and-prof.md) (176 ⭐) — Self-contained C/C++ profiler library for Linux.
- [raymond-sci/and](https://awesome-repositories.com/repository/raymond-sci-and.md) (0 ⭐)
- [google/comprehensive-rust](https://awesome-repositories.com/repository/google-comprehensive-rust.md) (33,049 ⭐) — Comprehensive Rust is a structured educational curriculum designed to teach the Rust programming language, focusing on its core principles of memory safety, performance, and type correctness. The project provides a comprehensive learning path for software engineers, covering the language's ownership model, borrow checking, and compile-time validation mechanisms that eliminate common memory-related errors without the need for a garbage collector.

The curriculum distinguishes itself by offering specialized modules that demonstrate how to apply these safety guarantees in diverse, high-performanc
- [decaporg/decap-cms](https://awesome-repositories.com/repository/decaporg-decap-cms.md) (18,943 ⭐) — Decap CMS is a headless, Git-based content management system designed to provide a visual editing interface for static site workflows. By decoupling the administrative dashboard from the frontend, it allows users to manage content stored directly in version control repositories as structured data. The system maps visual form inputs to repository files, enabling non-technical contributors to update content without requiring direct code changes.

The platform distinguishes itself through its Git-centric automation, which handles content lifecycles by creating commits, branches, and pull requests
- [googlechrome/lighthouse](https://awesome-repositories.com/repository/googlechrome-lighthouse.md) (30,355 ⭐) — Lighthouse is an automated diagnostic tool that evaluates web pages against industry standards for performance, accessibility, and search engine optimization. It functions as a programmatic analysis engine and a command-line utility, allowing developers to integrate comprehensive web quality checks directly into continuous integration pipelines and local development workflows.

The project distinguishes itself through a modular architecture that utilizes artifact-based data collection to ensure consistent analysis across different environments. It supports a headless execution mode for automat
- [schacon/git-pulls](https://awesome-repositories.com/repository/schacon-git-pulls.md) (294 ⭐) — git-pulls
- [budnix/ball-and-wall](https://awesome-repositories.com/repository/budnix-ball-and-wall.md) (49 ⭐) — Ball And Wall - Arkanoid style game
- [meeb/tubesync](https://awesome-repositories.com/repository/meeb-tubesync.md) (2,625 ⭐) — TubeSync is a containerized media management tool and self-hosted archiver designed to automate the downloading and organization of video content from online sources, such as YouTube channels and playlists, into a local library for offline access. It functions as a download manager and metadata generator, utilizing a web interface to manage video subscriptions and synchronization settings.

The system features a rule-based content filter that evaluates video metadata against user-defined conditions to determine which items enter the download queue. To handle restricted or age-gated content, it
- [alubj/lock-and-key](https://awesome-repositories.com/repository/alubj-lock-and-key.md) (10 ⭐) — String and file encryption in GameMaker
- [linkedin/datahub](https://awesome-repositories.com/repository/linkedin-datahub.md) (12,106 ⭐) — DataHub is a metadata management system and data catalog platform designed to provide a centralized directory for discovering, managing, and documenting datasets across a diverse data stack. It serves as a comprehensive framework for metadata management, incorporating a data governance framework to classify sensitive information and assign ownership for organizational accountability.

The platform distinguishes itself through AI-enabled data discovery, which connects large language models to a metadata graph to allow for natural language search and exploration of data assets. It also provides
- [deepset-ai/haystack](https://awesome-repositories.com/repository/deepset-ai-haystack.md) (24,253 ⭐) — Haystack is an orchestration framework designed for building complex search and generative AI pipelines. It functions as an agentic workflow engine, enabling the construction of automated sequences that allow AI agents to perform multi-step reasoning and data analysis.

The framework utilizes a modular, component-based architecture that connects processing steps into directed acyclic graphs. By employing a provider-agnostic integration layer, it decouples core logic from specific external AI services and vector databases, allowing for the flexible exchange of underlying technologies. This desi
- [getpaseo/paseo](https://awesome-repositories.com/repository/getpaseo-paseo.md) (9,118 ⭐) — Paseo is an LLM coding agent orchestrator and multi-agent workflow manager designed to coordinate multiple AI agents across isolated git worktrees. It provides a unified control interface for managing these agents and their associated environments to execute complex programming tasks.

The system distinguishes itself through a remote agent daemon that enables secure access to local coding agents via encrypted relays. It employs a git worktree environment manager to isolate parallel tasks into dedicated directories and branch-based server URLs, preventing file collisions and network port confli
- [popcorntime/popcorntime](https://awesome-repositories.com/repository/popcorntime-popcorntime.md) (10,495 ⭐) — Popcorn Time is a cross-platform application that functions as a BitTorrent streaming client, enabling the playback of movies and television shows through sequential data downloading. It serves as a comprehensive media discovery platform and metadata API, providing tools to search, browse, and retrieve detailed information about entertainment content.

The platform distinguishes itself through a robust streaming data provider service that maps regional streaming availability and resolves encrypted tokens into secure redirects for external media platforms. It supports global audiences by locali
- [aaronlidman/osm-and-geojson](https://awesome-repositories.com/repository/aaronlidman-osm-and-geojson.md) (91 ⭐) — Converts between OSM XML and GeoJSON

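The last entry above describes converting between OSM XML and GeoJSON. As a rough illustration of one direction of that conversion, here is a minimal Python sketch using only the standard library. It is not the code of aaronlidman/osm-and-geojson; the function name `osm_nodes_to_geojson` and the `SAMPLE` document are hypothetical, and the sketch makes two simplifying assumptions: only tagged `<node>` elements are converted, and ways and relations are ignored.

```python
import json
import xml.etree.ElementTree as ET


def osm_nodes_to_geojson(osm_xml: str) -> dict:
    """Convert tagged OSM <node> elements into GeoJSON Point features.

    Hypothetical helper for illustration; ways and relations are skipped.
    """
    root = ET.fromstring(osm_xml)
    features = []
    for node in root.iter("node"):
        # Collect the node's <tag k="..." v="..."/> children as properties.
        tags = {t.get("k"): t.get("v") for t in node.findall("tag")}
        if not tags:
            # Assumption: untagged nodes are only way vertices, so skip them.
            continue
        features.append({
            "type": "Feature",
            "id": f"node/{node.get('id')}",
            "geometry": {
                "type": "Point",
                # GeoJSON orders coordinates as [longitude, latitude].
                "coordinates": [float(node.get("lon")), float(node.get("lat"))],
            },
            "properties": tags,
        })
    return {"type": "FeatureCollection", "features": features}


# Made-up sample input: one tagged node and one untagged node.
SAMPLE = """<osm version="0.6">
  <node id="1" lat="52.5" lon="13.4"><tag k="name" v="Example"/></node>
  <node id="2" lat="52.6" lon="13.5"/>
</osm>"""

if __name__ == "__main__":
    print(json.dumps(osm_nodes_to_geojson(SAMPLE), indent=2))
```

The detail most worth noting is the coordinate order: OSM stores `lat` and `lon` as separate attributes, while GeoJSON expects longitude first.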