Tools for lazy evaluation of Parquet files.
Distinguishing note: Enables query optimization before data is loaded.
Polars is a high-performance columnar data processing library designed for efficient analytical workflows. It functions as a structured data library that organizes information into typed columns, utilizing the Apache Arrow memory format to enable zero-copy data sharing and cache-friendly, vectorized operations. The engine is built to handle large-scale tabular datasets, providing both local and distributed analytical runtimes that scale from single-machine environments to multi-node clusters. The project distinguishes itself through a sophisticated lazy query engine that constructs abstract e
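Scans Parquet files into lazy query objects (LazyFrames in Polars) instead of loading them eagerly, so the optimizer can apply predicate pushdown (skipping rows that fail a filter) and projection pushdown (reading only the needed columns) before any data is read.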