1 repo
Tools for converting, parsing, and preparing document content for ingestion into machine learning models.
Distinguishing note: Focuses on text extraction for AI pipelines rather than general-purpose document management or OCR.
Dify is an open-source platform for building, orchestrating, and deploying generative AI applications and autonomous agents. It provides a visual development environment that allows users to design complex, multi-step logic chains and conversational flows, which can then be published as APIs, web interfaces, or embedded widgets. The platform acts as a centralized infrastructure layer, managing model connections, prompt templates, and knowledge retrieval to support scalable AI-powered services. What distinguishes the platform is its focus on stateful application design and workflow orchestrati
The platform converts various file formats into plain text to make their content readable and processable by language models for analysis, summarization, or information retrieval.
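As a rough illustration of the conversion step described above, the following is a minimal sketch that normalizes a few text-based formats into plain text before handing them to a language model. It is not Dify's actual extractor: the function name `to_plain_text`, the helper class `_TextCollector`, and the choice of supported extensions are all assumptions made for this example, and it uses only the Python standard library.

```python
import csv
import io
import json
from html.parser import HTMLParser


class _TextCollector(HTMLParser):
    """Hypothetical helper: collects visible text, skipping <script>/<style> bodies."""

    def __init__(self):
        super().__init__()
        self._skip_depth = 0
        self.parts = []

    def handle_starttag(self, tag, attrs):
        if tag in ("script", "style"):
            self._skip_depth += 1

    def handle_endtag(self, tag):
        if tag in ("script", "style") and self._skip_depth > 0:
            self._skip_depth -= 1

    def handle_data(self, data):
        # Keep only non-empty text that is outside script/style elements.
        if self._skip_depth == 0 and data.strip():
            self.parts.append(data.strip())


def to_plain_text(filename, raw):
    """Convert already-decoded file content to plain text based on its extension.

    Hypothetical function for illustration only; a real pipeline would also
    handle binary formats (PDF, DOCX, ...) through dedicated parsers.
    """
    ext = filename.rsplit(".", 1)[-1].lower() if "." in filename else ""
    if ext in ("html", "htm"):
        collector = _TextCollector()
        collector.feed(raw)
        collector.close()
        return "\n".join(collector.parts)
    if ext == "csv":
        rows = csv.reader(io.StringIO(raw))
        return "\n".join(" | ".join(cell.strip() for cell in row) for row in rows)
    if ext == "json":
        # Pretty-print so nested structure stays readable as text.
        return json.dumps(json.loads(raw), indent=2, ensure_ascii=False)
    # Fallback: treat anything else as text and trim surrounding whitespace.
    return raw.strip()


if __name__ == "__main__":
    print(to_plain_text("page.html", "<p>Hello</p><script>x=1</script><p>World</p>"))
    print(to_plain_text("team.csv", "name, role\nAda, engineer\n"))
```

Dispatching on the file extension keeps the sketch short; a production system would more likely detect the content type from the file itself and delegate each format to a specialized parser.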