awesome-repositories.com
© 2026 Bringes Technology SRL·VAT RO45896025·hello@bringes.io
MCPSitemapPrivacyTerms
Document Segmentation · Awesome GitHub Repositories

1 repo

Awesome GitHub RepositoriesDocument Segmentation

Tools for dividing documents into logical sections based on content schemas.

Distinguishing note: Focuses on structural segmentation rather than general text splitting.

Explore 1 awesome GitHub repository matching content management & publishing · Document Segmentation. Refine with filters or upvote what's useful.

  1. Home
  2. Content Management & Publishing
  3. Document Segmentation

Awesome Document Segmentation GitHub Repositories

Describe the repository you're looking for…
Find the best repos with AI.We'll search the best matching repositories with AI.
  • datalab-to/marker

    datalab-to/marker

    31,757View on GitHub↗

    Marker is a comprehensive document processing platform designed to automate the conversion, extraction, and structuring of data from complex files. It functions as an orchestration engine that chains modular processing steps into versioned, reusable pipelines, allowing organizations to standardize document handling and automate repetitive business tasks at scale. The platform distinguishes itself through its support for secure, private infrastructure deployment, enabling users to run containerized services within their own environments to maintain strict data privacy. It features specialized

    Divides long or batch documents into logical sections by defining a schema that identifies specific parts.

    Python
    31,757View on GitHub↗