2 repos
Automated mechanisms for uploading and transforming diverse file formats into structured text for processing pipelines.
Explore 2 awesome GitHub repositories matching data & databases · Automated Document Ingestion. Refine with filters or upvote what's useful.
This project is an AI-powered document processing engine designed to transform diverse file formats into structured Markdown. By leveraging multimodal language models, it performs complex layout analysis and semantic text extraction, allowing for the conversion of both unstructured files and scanned images into machine
This project is a comprehensive retrieval-augmented generation platform designed for building, managing, and deploying knowledge-based AI applications. It provides a unified environment for organizing datasets, configuring conversational chat assistants, and developing autonomous agents that execute multi-step reasonin