awesome-repositories.com
© 2026 Bringes Technology SRL·VAT RO45896025·hello@bringes.io
MCPSitemapPrivacyTerms
Multimodal Data Extraction Pipelines · Awesome GitHub Repositories

1 repo

Awesome GitHub RepositoriesMultimodal Data Extraction Pipelines

Workflows that orchestrate layout analysis and semantic generation to convert heterogeneous documents into structured machine-readable formats.

Explore 1 awesome GitHub repository matching data & databases · Multimodal Data Extraction Pipelines. Refine with filters or upvote what's useful.

  1. Home
  2. Data & Databases
  3. Data Processing Pipelines
  4. Multimodal Data Extraction Pipelines

Awesome Multimodal Data Extraction Pipelines GitHub Repositories

Describe the repository you're looking for…
We'll search the best matching repositories with AI.
  • microsoft/markitdown

    microsoft/markitdown

    87,305GitHubView on GitHub↗

    This project is an AI-powered document processing engine designed to transform diverse file formats into structured Markdown. By leveraging multimodal language models, it performs complex layout analysis and semantic text extraction, allowing for the conversion of both unstructured files and scanned images into machine

    Pythonautogenautogen-extensionlangchain