3 repository-uri
Systems designed to connect and unify data from diverse external sources for unified access.
Distinguishing note: Focuses on the architectural capability of federated data connectivity rather than specific ETL tools.
Explore 3 awesome GitHub repositories matching data & databases · Data Integration Frameworks. Refine with filters or upvote what's useful.
MindsDB is an AI-native database engine that treats machine learning models and autonomous agents as virtual tables. By mapping external data sources, predictive models, and third-party services directly into the database schema, it enables users to perform inference, data retrieval, and complex orchestration using standard SQL syntax. The platform distinguishes itself through an autonomous agent orchestrator that executes iterative reasoning loops, allowing agents to plan data access and synthesize natural language responses from connected knowledge bases. It functions as a federated data ga
MindsDB is built around three fundamental capabilities that form the foundation of MindsDB, enabling seamless integration, organization, and utilization of data. Connect data from hundreds of data sources that integrate
Perplexica is an AI-powered search engine that synthesizes real-time web results into coherent, cited summaries. By utilizing large language models and retrieval augmentation, the platform gathers information from the live internet to provide accurate answers to complex user queries, ensuring that all generated content includes verifiable source citations. The project functions as a search orchestration platform that aggregates data from multiple sources and exposes these capabilities through standard endpoints. This allows for automated data integration, enabling external software to retriev
Connects search and summarization capabilities to external software workflows to enable intelligent data processing.
DataHub is a metadata management platform designed to unify technical, operational, and business context across diverse data ecosystems. By utilizing a graph-based metadata model and an event-driven ingestion architecture, it creates a centralized source of truth that maps complex data relationships, lineage, and ownership. This foundational framework enables organizations to maintain a synchronized view of their data landscape, supporting both human-led discovery and automated data operations. The platform distinguishes itself through its focus on grounding artificial intelligence and autono
Connects to diverse data platforms using a high-throughput framework to provide unified visibility and governance across large-scale environments.