2 repos
Tools that aggregate data or search results from multiple independent storage locations into a unified interface.
Distinguishing note: Focuses on the aggregation of search results across distributed nodes rather than local file indexing.
Explore 2 GitHub repositories matching data & databases · Distributed Query Engines.
ClickHouse is a high-performance, columnar analytical database designed for real-time query execution and large-scale data aggregation. It functions as a distributed data warehouse capable of processing petabytes of information, while also providing an embedded engine that integrates directly into applications for native query capabilities without external dependencies. The system is built to handle high-throughput ingestion and complex analytical workloads, delivering millisecond-level latency for interactive dashboards and operational monitoring. The platform distinguishes itself through ad
Coordinates parallel execution across multiple nodes by splitting query tasks and aggregating partial results into a final response.
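The split-and-merge pattern described above can be sketched in a few lines. This is only an illustration of the general scatter-gather idea, not ClickHouse's actual implementation: the in-memory `SHARDS` list and the `partial_avg`, `merge`, and `distributed_avg` helpers are hypothetical names invented for this example. The key point is that each node returns a mergeable partial state (sum and count) rather than a finished average, so the coordinator can combine them correctly.

```python
from concurrent.futures import ThreadPoolExecutor

# Hypothetical shards: each holds a slice of (region, latency_ms) rows.
SHARDS = [
    [("eu", 10), ("us", 30)],
    [("eu", 20), ("us", 50), ("us", 40)],
    [("eu", 30)],
]

def partial_avg(rows):
    """Runs on one shard: return per-key (sum, count), not the average itself."""
    state = {}
    for key, value in rows:
        total, count = state.get(key, (0, 0))
        state[key] = (total + value, count + 1)
    return state

def merge(partials):
    """Runs on the coordinator: combine partial states, then finalize."""
    combined = {}
    for state in partials:
        for key, (total, count) in state.items():
            t, c = combined.get(key, (0, 0))
            combined[key] = (t + total, c + count)
    return {key: total / count for key, (total, count) in combined.items()}

def distributed_avg(shards):
    # Fan the same task out to every shard in parallel, then merge the results.
    with ThreadPoolExecutor(max_workers=len(shards)) as pool:
        partials = list(pool.map(partial_avg, shards))
    return merge(partials)

result = distributed_avg(SHARDS)
print(result)
```

Averaging the per-shard averages directly would give the wrong answer when shards hold different row counts, which is why the sketch ships sums and counts instead.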
Copyparty is a self-hosted file server that provides a browser-based interface for managing, browsing, uploading, and downloading files. It utilizes a virtual file system abstraction to map diverse storage backends and network-attached devices into a unified directory structure, allowing for consistent file access across various storage environments. The platform functions as a cloud synchronization gateway, enabling automated data backups and transfers between local storage and remote cloud providers through integration with standard command-line tools. It also serves as a distributed storag
Query several file servers simultaneously to aggregate search results into a single view, making it easier to discover and manage files across distributed storage locations.
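A rough sketch of this kind of fan-out search follows. It does not use Copyparty's real API: the `SERVERS` dictionary stands in for remote file servers, and `search_server` and `federated_search` are hypothetical helpers; a real client would replace the in-memory lookup with a network request to each server.

```python
from concurrent.futures import ThreadPoolExecutor

# Hypothetical stand-ins for remote file servers: name -> list of file paths.
SERVERS = {
    "nas": ["/music/song.flac", "/docs/report.pdf"],
    "laptop": ["/home/me/report-draft.pdf"],
    "vps": ["/backup/photos.tar"],
}

def search_server(name, term):
    """Pretend to query one server; a real client would make a network call here."""
    return [{"server": name, "path": p} for p in SERVERS[name] if term in p.lower()]

def federated_search(term, servers=SERVERS):
    """Query every server concurrently and merge the hits into one sorted list."""
    term = term.lower()
    with ThreadPoolExecutor(max_workers=len(servers)) as pool:
        futures = [pool.submit(search_server, name, term) for name in servers]
        hits = [hit for f in futures for hit in f.result()]
    # Sort so the combined view is stable regardless of which server answers first.
    return sorted(hits, key=lambda h: (h["path"], h["server"]))

hits = federated_search("report")
for hit in hits:
    print(hit["server"], hit["path"])
```

Each hit keeps the name of the server it came from, so the single merged view still shows where a file lives.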