2 रिपॉजिटरी
Tools for defining, manipulating, and transforming structured tabular data schemas.
Distinguishing note: Focuses on schema definition and manipulation rather than storage or database connectivity.
Explore 2 awesome GitHub repositories matching data & databases · Data Processing Libraries. Refine with filters or upvote what's useful.
Polars is a high-performance columnar data processing library designed for efficient analytical workflows. It functions as a structured data library that organizes information into typed columns, utilizing the Apache Arrow memory format to enable zero-copy data sharing and cache-friendly, vectorized operations. The engine is built to handle large-scale tabular datasets, providing both local and distributed analytical runtimes that scale from single-machine environments to multi-node clusters. The project distinguishes itself through a sophisticated lazy query engine that constructs abstract e
Organizes information into typed columns to enable efficient memory utilization and fast query execution.
This library is a data processing framework for the JVM that provides a type-safe environment for manipulating structured tabular data. It functions as a comprehensive toolset for performing complex data transformations, aggregations, and statistical analysis, while leveraging compile-time schema validation to ensure structural integrity across data pipelines. The project distinguishes itself through its deep integration with interactive notebook environments and its use of compile-time code generation. By automatically deriving and enforcing schemas from raw inputs, it generates type-safe ac
Provides a type-safe library for manipulating structured tabular data with compile-time schema validation and IDE autocompletion.