1 repo
Modular systems for managing data ingestion, transformation, and querying workflows.
Distinguishing note: Focuses on the orchestration of data pipelines for knowledge-based applications.
Explore 1 awesome GitHub repository matching software engineering & architecture · Data Orchestration Frameworks. Refine with filters or upvote what's useful.
GraphRAG is a data processing pipeline and retrieval engine designed to transform unstructured text into interconnected knowledge graphs. By utilizing language models to extract entities and relationships, it builds structured representations of information that enable context-aware retrieval for downstream applications. The system distinguishes itself through hierarchical graph clustering and large-scale data synthesis, which organize massive document corpora into multi-level structures. This approach allows for both vector-based semantic searches and graph-based traversals, providing a comp
Manages the ingestion, transformation, and querying of information to maintain high-quality knowledge structures.