awesome-repositories.com
© 2026 Bringes Technology SRL·VAT RO45896025·hello@bringes.io
MCPSitemapPrivacyTerms
Document and Data Intelligence · Awesome GitHub Repositories

2 repos

Awesome GitHub RepositoriesDocument and Data Intelligence

AI-driven systems for parsing, extracting, and structuring information from unstructured documents or text.

Explore 2 awesome GitHub repositories matching artificial intelligence & ml · Document and Data Intelligence. Refine with filters or upvote what's useful.

  1. Home
  2. Artificial Intelligence & ML
  3. Artificial Intelligence
  4. Document and Data Intelligence

Awesome Document and Data Intelligence GitHub Repositories

Describe the repository you're looking for…
We'll search the best matching repositories with AI.
  • microsoft/markitdown

    microsoft/markitdown

    87,305GitHubView on GitHub↗

    This project is an AI-powered document processing engine designed to transform diverse file formats into structured Markdown. By leveraging multimodal language models, it performs complex layout analysis and semantic text extraction, allowing for the conversion of both unstructured files and scanned images into machine

    Pythonautogenautogen-extensionlangchain
  • infiniflow/ragflow

    infiniflow/ragflow

    73,425GitHubView on GitHub↗

    This project is a comprehensive retrieval-augmented generation platform designed for building, managing, and deploying knowledge-based AI applications. It provides a unified environment for organizing datasets, configuring conversational chat assistants, and developing autonomous agents that execute multi-step reasonin

    Pythonagentagenticagentic-ai

Explore sub-tags

  • AI-Powered Data ExtractionTools that automatically parse and extract structured data from unstructured documents like invoices, forms, and reports.
  • Document Intelligence ServicesCloud-based services that analyze, classify, and summarize large volumes of complex document-based information.
  • Semantic Parsing ToolsTools that extract and interpret structured data, such as text and tables, from complex document formats.