What are the best open-source alternatives to Hbase?

30 open-source projects similar to apache/hbase, ranked by shared features. Top picks: apache/hadoop, apache/hive, deepseek-ai/3fs, gluster/glusterfs, hazelcast/hazelcast, doocs/advanced-java, apache/incubator-kvrocks, apache/cassandra, rustfs/rustfs, jerrylead/sparkinternals.

Is apache/hadoop a good alternative to Hbase?

Hadoop is a big data infrastructure suite and distributed data processing framework designed to store and process massive datasets across clusters of computers. It consists of a distributed storage system for managing large files across multiple nodes and a parallel computing engine for processing…

Is apache/hive a good alternative to Hbase?

Apache Hive is a SQL-on-Hadoop data warehouse that enables querying and managing petabytes of data stored in distributed storage such as HDFS and cloud storage services. It provides a familiar SQL interface for batch analytics and reporting, supported by a core set of components including the HiveS…

Is deepseek-ai/3fs a good alternative to Hbase?

3FS is a distributed file system and RDMA storage cluster designed for high-performance AI training and inference workloads. It functions as a strongly consistent storage layer that utilizes a disaggregated architecture to pool SSDs and memory resources across multiple nodes. The system provides s…

Is gluster/glusterfs a good alternative to Hbase?

GlusterFS is a software-defined distributed file system and scale-out storage cluster that aggregates disk resources from multiple servers into a single global namespace. It functions as a unified storage platform, allowing the same underlying data to be exposed through file, block, and object stor…

Is hazelcast/hazelcast a good alternative to Hbase?

Hazelcast is a distributed data platform that combines an in-memory data grid with a stream processing engine to support real-time analytics and event-driven applications. It functions as a partitioned, distributed key-value store that replicates data across cluster nodes to provide low-latency acc…

Is doocs/advanced-java a good alternative to Hbase?

This project is a comprehensive Java backend engineering guide and technical reference focused on high-concurrency design, distributed systems, and microservices architecture. It provides detailed strategies for decomposing monolithic applications, managing service discovery, and implementing the a…

Is apache/incubator-kvrocks a good alternative to Hbase?

Kvrocks is a disk-based NoSQL database and distributed key-value store that leverages the RocksDB storage engine to persist large datasets to physical disk. It is designed to be a Redis-compatible database, utilizing the standard Redis communication protocol to ensure interoperability with existing…

Is apache/cassandra a good alternative to Hbase?

Cassandra is a distributed NoSQL database and wide-column store designed for high availability and linear scalability. It functions as a fault-tolerant distributed system that utilizes an LSM-tree storage engine to optimize write throughput and manage massive datasets. The system is a CQL-complian…

Is rustfs/rustfs a good alternative to Hbase?

Rustfs is a distributed object storage system designed for high availability and horizontal scalability. It functions as a cluster-based platform that manages data across multiple nodes, providing a self-hosted infrastructure for large-scale storage requirements. The system is built to be containe…

Is jerrylead/sparkinternals a good alternative to Hbase?

SparkInternals is a technical reference and architecture guide detailing the internal design and implementation of the Apache Spark distributed computing engine. It serves as a study of big data engine analysis, focusing on how the system manages cluster execution and the interaction between driver…

Back to apache/hbase

Open-source alternatives to Hbase

30 open-source projects similar to apache/hbase, ranked by how many features they have in common. Compare stars, activity and what each one does to find the best Hbase alternative.

apache/hadoop
apache/hadoop
15,567View on GitHub
Hadoop is a big data infrastructure suite and distributed data processing framework designed to store and process massive datasets across clusters of computers. It consists of a distributed storage system for managing large files across multiple nodes and a parallel computing engine for processing data across a distributed cluster. The framework implements a distributed file system to ensure fault tolerance and high throughput, paired with a programming model that processes large datasets in parallel. It manages the underlying hardware and software environment required for distributed big dat
Java
View on GitHub15,567
apache/hive
apache/hive
6,012View on GitHub
Apache Hive is a SQL-on-Hadoop data warehouse that enables querying and managing petabytes of data stored in distributed storage such as HDFS and cloud storage services. It provides a familiar SQL interface for batch analytics and reporting, supported by a core set of components including the HiveServer2 Thrift service for remote query execution, the Hive Metastore Service for central metadata management, the Hive ACID Transaction Engine for concurrent read-write operations, and the Hive LLAP Interactive Engine for low-latency analytical processing. The WebHCat REST API offers an HTTP interfac
Javaapachebig-datadatabase
View on GitHub6,012
deepseek-ai/3fs
deepseek-ai/3FS
9,970View on GitHub
3FS is a distributed file system and RDMA storage cluster designed for high-performance AI training and inference workloads. It functions as a strongly consistent storage layer that utilizes a disaggregated architecture to pool SSDs and memory resources across multiple nodes. The system provides specialized storage implementations including an AI training checkpoint store for parallel state preservation and a distributed key-value cache store for decoder layer vectors to optimize inference processing. It ensures data integrity through chain replication and apportioned query distribution. The
C++
View on GitHub9,970

Open-source alternatives to Hbase

apache/hadoop

apache/hive

deepseek-ai/3FS

gluster/glusterfs

hazelcast/hazelcast

doocs/advanced-java

apache/incubator-kvrocks

apache/cassandra

rustfs/rustfs

JerryLead/SparkInternals

scylladb/scylla

MariaDB/server

facebook/rocksdb

VictoriaMetrics/VictoriaMetrics

smallnest/rpcx

tikv/tikv

SystemsApproach/book

webmin/webmin

deuxfleurs-org/garage

tporadowski/redis

ServiceWeaver/weaver

geektutu/high-performance-go

airtai/faststream

happyfish100/fastdfs

buildbot/buildbot

sjqzhang/go-fastdfs

skyzh/mini-lsm

andeya/pholcus

ceph/ceph

minio/minio