1 repo
Modular systems that use specialized, pluggable parsers to transform diverse binary and text formats into structured data.
Explore 1 awesome GitHub repository matching content management & publishing · Plugin-Based Document Parsers.
This project is an AI-powered document processing engine designed to transform diverse file formats into structured Markdown. By leveraging multimodal language models, it performs complex layout analysis and semantic text extraction, allowing for the conversion of both unstructured files and scanned images into machine