1 repo
Frameworks for executing modular transformation steps to process and structure data.
Distinguishing note: Focuses on the orchestration of data transformation pipelines for knowledge graph creation.
Explore 1 awesome GitHub repository matching software engineering & architecture · Data Orchestration Pipelines. Refine with filters or upvote what's useful.
GraphRAG is a data processing pipeline and retrieval engine designed to transform unstructured text into interconnected knowledge graphs. By utilizing language models to extract entities and relationships, it builds structured representations of information that enable context-aware retrieval for downstream applications. The system distinguishes itself through hierarchical graph clustering and large-scale data synthesis, which organize massive document corpora into multi-level structures. This approach allows for both vector-based semantic searches and graph-based traversals, providing a comp
Executes modular transformation steps in a defined sequence to convert unstructured corpora into a queryable knowledge graph format.