awesome-repositories.com
© 2026 Bringes Technology SRL·VAT RO45896025·hello@bringes.io
MCPSitemapPrivacyTerms
Multimodal Layout Analysis · Awesome GitHub Repositories

1 repo

Awesome GitHub RepositoriesMultimodal Layout Analysis

Techniques for interpreting visual document structures and embedded image content using multimodal models.

Explore 1 awesome GitHub repository matching artificial intelligence & ml · Multimodal Layout Analysis. Refine with filters or upvote what's useful.

  1. Home
  2. Artificial Intelligence & ML
  3. Machine Learning
  4. Document and Data Intelligence
  5. Multimodal Layout Analysis

Awesome Multimodal Layout Analysis GitHub Repositories

Describe the repository you're looking for…
We'll search the best matching repositories with AI.
  • microsoft/markitdown

    microsoft/markitdown

    87,305GitHubView on GitHub↗

    This project is an AI-powered document processing engine designed to transform diverse file formats into structured Markdown. By leveraging multimodal language models, it performs complex layout analysis and semantic text extraction, allowing for the conversion of both unstructured files and scanned images into machine

    Employs multimodal language models to interpret visual document structures and perform semantic character recognition on embedded image content.

    Pythonautogenautogen-extensionlangchain