awesome-repositories.com
© 2026 Bringes Technology SRL·VAT RO45896025·hello@bringes.io
MCPSitemapPrivacyTerms
Document Layout Analysis · Awesome GitHub Repositories

3 repos

Awesome GitHub RepositoriesDocument Layout Analysis

Tools for parsing and extracting structural information from complex documents to identify text, tables, and layout hierarchies.

Explore 3 awesome GitHub repositories matching artificial intelligence & ml · Document Layout Analysis. Refine with filters or upvote what's useful.

  1. Home
  2. Artificial Intelligence & ML
  3. Document Analysis
  4. Document Layout Analysis

Awesome Document Layout Analysis GitHub Repositories

Describe the repository you're looking for…
We'll search the best matching repositories with AI.
  • tesseract-ocr/tesseract

    tesseract-ocr/tesseract

    72,460GitHubView on GitHub↗

    Tesseract is a neural network-based optical character recognition engine designed to convert scanned images and digital documents into machine-readable, searchable text. It functions as both a command-line utility for automating large-scale digitization workflows and a cross-platform library that can be embedded into d

    C++hacktoberfestlstmmachine-learning
  • opendatalab/MinerU

    opendatalab/MinerU

    54,523GitHubView on GitHub↗

    MinerU is a document parsing pipeline designed to transform unstructured files into machine-readable, structured data. It utilizes deep learning models to perform layout analysis, identifying document regions and extracting complex content such as mathematical expressions. By combining these neural network inferences w

    Pythonai4sciencedocument-analysisextract-data
  • docling-project/docling

    docling-project/docling

    53,584GitHubView on GitHub↗

    Docling is a modular framework designed for document parsing, layout analysis, and structured data extraction. It transforms unstructured files and web content into a unified, hierarchical data model that preserves the spatial and semantic relationships between text, tables, images, and layout elements. By normalizing

    Pythonaiconvertdocument-parser

Explore sub-tags

  • Page Segmentation ModesConfiguration settings for defining how document layouts are parsed into blocks, lines, or characters.