2 repos
Systems that defer or reorder data processing operations to improve performance on large datasets.
Distinguishing note: Focuses on the execution strategy of data queries rather than the storage or retrieval mechanism itself.
Explore 2 awesome GitHub repositories matching data & databases · Query Optimization Engines. Refine with filters or upvote what's useful.
ClickHouse is a high-performance, columnar analytical database designed for real-time query execution and large-scale data aggregation. It functions as a distributed data warehouse capable of processing petabytes of information, while also providing an embedded engine that integrates directly into applications for native query capabilities without external dependencies. The system is built to handle high-throughput ingestion and complex analytical workloads, delivering millisecond-level latency for interactive dashboards and operational monitoring. The platform distinguishes itself through ad
Transforms complex analytical operations into optimized, multi-threaded execution plans to minimize resource usage during data processing.
Nushell is a cross-platform shell and programming language designed to treat all input and output as structured data rather than raw text streams. By enforcing data types and command signatures, it provides a consistent environment for building robust, pipeline-oriented workflows. The shell allows users to chain commands that pass structured objects between stages, enabling complex data processing and automation tasks that remain predictable across different operating systems. What distinguishes the project is its focus on interactive data exploration and modular extensibility. Users can quer
Defers computation of complex data transformations until the final result is requested to optimize performance for large datasets.