What are the best open-source alternatives to Iceberg?

30 open-source projects similar to apache/iceberg, ranked by shared features. Top picks: delta-io/delta, apache/hudi, lancedb/lancedb, apache/gravitino, risingwavelabs/risingwave, apache/pinot, apache/hive, prestodb/presto, prefecthq/prefect, apache/arrow.

Is delta-io/delta a good alternative to Iceberg?

Delta is a lakehouse table format that brings ACID transactions and data warehouse consistency to large scale data lakes on cloud object storage. It serves as an ACID transaction manager, coordinating atomic commits and serializable isolation for concurrent reads and writes across distributed compu…

Is apache/hudi a good alternative to Iceberg?

Apache Hudi is an open-source table format that brings ACID transactions, incremental processing, and multi-modal indexing to data lakes. It provides atomic commits with snapshot isolation, rollback, and optimistic concurrency control for reliable data lake operations, while supporting upserts, rec…

Is lancedb/lancedb a good alternative to Iceberg?

LanceDB is a vector database and columnar data store designed to function as a versioned dataset manager and vector search engine. It serves as a high-performance backend for indexing and retrieving high-dimensional embeddings, providing the foundation for machine learning data pipelines. The syst…

Is apache/gravitino a good alternative to Iceberg?

Gravitino is a federated metadata lake and unified data catalog designed to manage tables, files, and AI models across diverse data sources and cloud storage. It serves as a centralized interface for governing schemas, access controls, and tagging across relational databases, messaging queues, and…

Is risingwavelabs/risingwave a good alternative to Iceberg?

RisingWave is a cloud-native streaming database and real-time analytics engine that uses standard SQL to process continuous data streams. It functions as a streaming data lakehouse, combining the capabilities of a streaming SQL database with a platform that integrates streaming ingestion with open…

Is apache/pinot a good alternative to Iceberg?

Pinot is a distributed, columnar analytical database designed for high-concurrency, low-latency query processing. It functions as a real-time OLAP datastore, enabling interactive, user-facing analytics by ingesting and querying massive datasets from both streaming and batch sources. The system arch…

Is apache/hive a good alternative to Iceberg?

Apache Hive is a SQL-on-Hadoop data warehouse that enables querying and managing petabytes of data stored in distributed storage such as HDFS and cloud storage services. It provides a familiar SQL interface for batch analytics and reporting, supported by a core set of components including the HiveS…

Is prestodb/presto a good alternative to Iceberg?

Presto is a distributed SQL query engine designed for high-performance analytical processing across heterogeneous data sources. It functions as a data federation platform and massively parallel processing engine, allowing users to execute interactive queries against diverse storage systems without…

Is prefecthq/prefect a good alternative to Iceberg?

Prefect is a workflow orchestration platform designed to define, schedule, and monitor complex data pipelines as Python code. It functions as a container-native engine that wraps individual tasks in isolated environments, ensuring consistent dependencies and resource allocation across diverse infra…

Is apache/arrow a good alternative to Iceberg?

Arrow is a cross-language development platform for in-memory data. It provides a standardized, language-independent columnar memory format designed to accelerate analytical operations and improve memory efficiency on modern computing hardware. By utilizing a schema-driven approach, the framework en…

Back to apache/iceberg

Open-source alternatives to Iceberg

30 open-source projects similar to apache/iceberg, ranked by how many features they have in common. Compare stars, activity and what each one does to find the best Iceberg alternative.

delta-io/delta
delta-io/delta
8,596View on GitHub
Delta is a lakehouse table format that brings ACID transactions and data warehouse consistency to large scale data lakes on cloud object storage. It serves as an ACID transaction manager, coordinating atomic commits and serializable isolation for concurrent reads and writes across distributed compute engines. The project provides a multi-engine interoperability layer that uses format translation to allow diverse SQL engines and processing frameworks to read and write the same tables. It functions as a data versioning system, utilizing a transaction log to enable time travel, historical snapsh
Scalaacidanalyticsbig-data
View on GitHub8,596
apache/hudi
apache/hudi
6,097View on GitHub
Apache Hudi is an open-source table format that brings ACID transactions, incremental processing, and multi-modal indexing to data lakes. It provides atomic commits with snapshot isolation, rollback, and optimistic concurrency control for reliable data lake operations, while supporting upserts, record-level updates, and deletions in large analytical datasets. The project distinguishes itself through a timeline-based architecture that coordinates all write operations, enabling features like time-travel querying, incremental change streaming, and multi-modal query views that include snapshot, i
Javaapacheflinkapachehudiapachespark
View on GitHub6,097
lancedb/lancedb
lancedb/lancedb
9,031View on GitHub
LanceDB is a vector database and columnar data store designed to function as a versioned dataset manager and vector search engine. It serves as a high-performance backend for indexing and retrieving high-dimensional embeddings, providing the foundation for machine learning data pipelines. The system distinguishes itself through a combination of cloud-native object storage and immutable version tracking, allowing for data time-travel and reproducible AI experiments. It integrates hybrid search capabilities, merging dense vector similarity with BM25 full-text search and SQL-like scalar filters
HTMLapproximate-nearest-neighbor-searchimage-searchnearest-neighbor-search
View on GitHub9,031

Open-source alternatives to Iceberg

delta-io/delta

apache/hudi

lancedb/lancedb

apache/gravitino

risingwavelabs/risingwave

apache/pinot

apache/hive

prestodb/presto

PrefectHQ/prefect

apache/arrow

pawelsalawa/sqlitestudio

thinkaurelius/titan

jeremyevans/sequel

GreptimeTeam/greptimedb

lance-format/lance

kaminari/kaminari

enochtangg/quick-SQL-cheatsheet

facebook/rocksdb

Admol/SystemDesign

erikgrinaker/toydb

apple/turicreate

stellar/stellar-core

spotify/spark-bigquery

spotify/scio

GoogleCloudPlatform/bigquery-utils

GoogleCloudPlatform/DataflowTemplates

GoogleCloudPlatform/psq

spotify/heroic

pubkey/rxdb

Kotlin/anko