awesome-repositories.com
© 2026 Bringes Technology SRL·VAT RO45896025·hello@bringes.io
MCPPrivacyTerms

1 repo

Awesome GitHub RepositoriesAutomated Data Extraction

Tools that convert scanned or digital documents into structured data formats for large-scale analysis.

Explore 1 awesome GitHub repository matching content management & publishing · Automated Data Extraction. Refine with filters or upvote what's useful.

Awesome Automated Data Extraction GitHub Repositories

Describe the repository you're looking for…
We'll search the best matching repositories with AI.
  • opendatalab/MinerU

    opendatalab/MinerU

    54,523GitHubView on GitHub↗

    MinerU is a document parsing pipeline designed to transform unstructured files into machine-readable, structured data. It utilizes deep learning models to perform layout analysis, identifying document regions and extracting complex content such as mathematical expressions. By combining these neural network inferences w

    Pythonai4sciencedocument-analysisextract-data