1 repo
Tools that parse and interpret information from multiple media types, including text and images, to construct knowledge bases.
Distinguishing note: Focuses on the extraction and parsing phase of multimodal data, distinct from general-purpose AI models.
Explore 1 GitHub repository matching Artificial Intelligence & ML · Multimodal Data Extractors.
LightRAG is a graph-based retrieval framework designed to build retrieval-augmented generation pipelines. It structures unstructured text into knowledge graphs, enabling multi-hop reasoning and complex query synthesis across large document collections. By integrating dense vector embeddings with structured knowledge graphs, the system facilitates both similarity-based and relationship-aware information retrieval. The framework distinguishes itself through a dual-level retrieval strategy that combines low-level keyword matching with high-level semantic graph traversal to capture both specific
Parses text and visual data from diverse document formats to build searchable knowledge bases.
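The dual-level retrieval idea described for LightRAG can be illustrated with a small, self-contained sketch. This is not LightRAG's API or implementation: the triples, the function names (`low_level_match`, `high_level_expand`, `dual_level_retrieve`), and the `hops` parameter are all hypothetical, and plain substring matching stands in for the embedding-based similarity a real system would use. It only shows how keyword-matched seed entities might be widened by graph traversal.

```python
from collections import defaultdict

# Hypothetical toy knowledge graph as (subject, relation, object) triples.
TRIPLES = [
    ("LightRAG", "builds", "knowledge graph"),
    ("knowledge graph", "enables", "multi-hop reasoning"),
    ("LightRAG", "uses", "vector embeddings"),
    ("vector embeddings", "support", "similarity search"),
]


def build_adjacency(triples):
    """Build an undirected adjacency map from the triples."""
    adj = defaultdict(set)
    for subj, _, obj in triples:
        adj[subj].add(obj)
        adj[obj].add(subj)
    return adj


def low_level_match(query, triples):
    """Low level: return entities whose name occurs in the query text."""
    text = query.lower()
    entities = {e for subj, _, obj in triples for e in (subj, obj)}
    return {e for e in entities if e.lower() in text}


def high_level_expand(seeds, adj, hops=1):
    """High level: widen the seed set by breadth-first graph traversal."""
    seen = set(seeds)
    frontier = set(seeds)
    for _ in range(hops):
        frontier = {n for node in frontier for n in adj[node]} - seen
        seen |= frontier
    return seen


def dual_level_retrieve(query, triples, hops=1):
    """Combine both levels and return triples inside the expanded set."""
    adj = build_adjacency(triples)
    seeds = low_level_match(query, triples)
    related = high_level_expand(seeds, adj, hops)
    return [t for t in triples if t[0] in related and t[2] in related]


if __name__ == "__main__":
    for triple in dual_level_retrieve("How does LightRAG work?", TRIPLES, hops=2):
        print(triple)
```

Raising `hops` trades precision for coverage: one hop returns only facts directly attached to the matched entity, while two hops also pulls in facts that are one relationship further away, which is the kind of multi-hop context the description refers to.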