Zarr Python

Open-source alternatives to Zarr Python

Similar open-source projects, ranked by how many features they share with Zarr Python.

apache/arrow
apache/arrow
16,529View on GitHub
Arrow is a cross-language development platform for in-memory data. It provides a standardized, language-independent columnar memory format designed to accelerate analytical operations and improve memory efficiency on modern computing hardware. By utilizing a schema-driven approach, the framework enables the efficient organization of both flat and nested data structures. The project functions as an analytical data processing engine that facilitates high-performance computation directly on memory-resident datasets. It distinguishes itself through a zero-copy architecture, which allows multiple
C++arrowparquet
View on GitHub16,529
apache/druid
apache/druid
14,020View on GitHub
Apache Druid is a real-time analytics database and distributed columnar time-series store designed for sub-second analytical queries. It functions as a data platform featuring a distributed SQL query engine and a real-time data ingestion system for moving historical and streaming data from external sources. The system is distinguished by its ability to provide low-latency analytics under high concurrency to power operational dashboards. It implements a Kerberos-secured environment for user authentication and employs a shared-nothing cluster architecture to enable horizontal scaling. The plat
Javadruid
View on GitHub14,020
apache/hudi
apache/hudi
6,097View on GitHub
Apache Hudi is an open-source table format that brings ACID transactions, incremental processing, and multi-modal indexing to data lakes. It provides atomic commits with snapshot isolation, rollback, and optimistic concurrency control for reliable data lake operations, while supporting upserts, record-level updates, and deletions in large analytical datasets. The project distinguishes itself through a timeline-based architecture that coordinates all write operations, enabling features like time-travel querying, incremental change streaming, and multi-modal query views that include snapshot, i
Javaapacheflinkapachehudiapachespark
View on GitHub6,097
alluxio/alluxio
Alluxio/alluxio
7,202View on GitHub
Alluxio is a virtual distributed file system and data orchestration layer that serves as a high-performance caching layer between cloud storage and compute clusters. It acts as a distributed data cache designed to accelerate data access for large-scale analytics and machine learning workloads. The system provides a unified interface that presents multiple heterogeneous storage backends as a single coherent namespace. This allows for the unification of diverse storage systems, enabling computation engines to access data from different providers without changing application code. The project c
Java
View on GitHub7,202

See all 30 alternatives to Zarr Python

zarr-developerszarr-python

Features

Open-source alternatives to Zarr Python

apache/arrow

apache/druid

apache/hudi

Alluxio/alluxio

Star history

Open-source alternatives to Zarr Python

apache/arrow

apache/druid

apache/hudi

Alluxio/alluxio