awesome-repositories.com
Automated Data Extraction · Awesome GitHub Repositories

1 repo

Tools that convert scanned or digital documents into structured data formats for large-scale analysis.

Awesome Automated Data Extraction GitHub Repositories

  • opendatalab/MinerU

    54,523 · GitHub

    MinerU is a document parsing pipeline designed to transform unstructured files into machine-readable, structured data. It utilizes deep learning models to perform layout analysis, identifying document regions and extracting complex content such as mathematical expressions. By combining these neural network inferences w

    Python · ai4science · document-analysis · extract-data