1 repo
Comprehensive systems for automated and scalable document data extraction and structuring.
Distinguishing note: Provides a full platform for document workflows rather than single-purpose extraction or conversion tools.
Explore 1 awesome GitHub repository matching data & databases · Document Processing Platforms. Refine with filters or upvote what's useful.
Marker is a comprehensive document processing platform designed to automate the conversion, extraction, and structuring of data from complex files. It functions as an orchestration engine that chains modular processing steps into versioned, reusable pipelines, allowing organizations to standardize document handling and automate repetitive business tasks at scale. The platform distinguishes itself through its support for secure, private infrastructure deployment, enabling users to run containerized services within their own environments to maintain strict data privacy. It features specialized
A comprehensive service for converting, extracting, and structuring data from complex files through automated and scalable workflows.