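We've curated open-source GitHub repositories matching “open source database”. Results are ranked by relevance to your query; you can narrow them with the filters below or refine them with AI.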
Cockroach is a distributed SQL database designed to scale horizontally across multiple nodes while maintaining strict ACID compliance and global data consistency. It functions as a relational database engine that automatically partitions data into ranges, rebalancing them across a cluster to accommodate growing storage and throughput requirements. By utilizing a distributed consensus protocol, the system ensures that all nodes agree on the order of operations, providing fault tolerance and continuous availability even in the event of hardware failures. The system distinguishes itself through
CockroachDB is a distributed, horizontally scalable SQL database that provides ACID transactions, automatic replication, and high availability, fitting your need for an open-source DBMS with comprehensive SQL support, scaling, and security features.
YugabyteDB is a distributed SQL database and relational data store designed for horizontal scalability and high availability across multiple nodes or regions. It functions as a cloud-native system that ensures continuous availability and supports PostgreSQL compatible query languages and drivers. The system includes specialized capabilities as a vector database for AI, utilizing high-dimensional indexing to perform similarity searches. It is engineered as a multi-region cloud database that synchronizes data across different geographic locations to maintain global availability. The project co
YugabyteDB is a cloud-native distributed SQL database with PostgreSQL compatibility that provides ACID transactions, horizontal scaling, and high availability — directly meeting the need for an open-source database management system that supports SQL, replication, and multi-model data (with added vector search capabilities).
OceanBase is a distributed SQL database designed for high availability and strong consistency across multiple nodes and regions. It functions as a hybrid transactional and analytical processing engine, allowing real-time analytics and transactions to execute on a single data copy. The system also serves as a vector database engine for indexing and querying vector data to power semantic search and recommendation systems. The platform features native compatibility layers for MySQL and Oracle, enabling the migration of legacy workloads without rewriting SQL code. It utilizes a Paxos-based distri
OceanBase is a distributed SQL database with MySQL and Oracle compatibility, ACID transactions, horizontal scaling, and built-in high availability via Paxos, making it a comprehensive open-source database management system that directly matches this search.
TiDB is a horizontally scalable, distributed SQL database designed to provide consistent transactional storage and high-performance analytical processing within a single unified architecture. It utilizes a decoupled compute-storage design and a distributed key-value storage layer to ensure horizontal scalability and efficient range-based queries. By employing a consensus-based replication algorithm, the system maintains high availability and automatic failover across multiple nodes and geographical regions. The platform distinguishes itself through its hybrid transactional and analytical proc
TiDB is a horizontally scalable, distributed SQL database with ACID transactions, replication, and MySQL compatibility, squarely matching the requirements for an open-source database management system.
SurrealDB is a multi-model database engine designed to store and query document, graph, relational, and vector data within a single ACID-compliant platform. It functions as an AI-native data store, integrating vector search, graph traversal, and machine learning model execution directly into its query layer. By providing a unified declarative query language, the platform eliminates the need for external middleware to synchronize data across different storage models. The platform distinguishes itself through its ability to manage agent memory and complex workflows natively. It allows developer
SurrealDB is a multi-model database engine that supports SQL, ACID transactions, and distributed deployment, directly matching the need for a database management system for structured and semi-structured data with the required features.
This project is a distributed, document-oriented database system designed to store information in flexible, hierarchical structures. It supports horizontal scaling through automated sharding and maintains high availability across global clusters using a multi-node replication protocol. By executing multi-document operations as atomic units, the system ensures data integrity and consistency across distributed environments. The platform distinguishes itself by integrating advanced vector-based indexing, which enables semantic similarity searches alongside traditional geospatial and lexical quer
MongoDB is a distributed document-oriented database with ACID transactions and horizontal scaling, but it does not natively support SQL or multi-model data, so it only partially matches the requested features.
This project is an open source relational database management system and SQL database designed for storing and managing structured data. It functions as a relational database for ensuring consistency and reliability, while also operating as a vector database for storing and querying high-dimensional vector embeddings. The system incorporates a columnar storage engine to optimize analytical query processing and large-scale data aggregation. It further enables vector similarity search, allowing users to find similar items by querying vector embeddings. The software covers a broad capability su
MariaDB is an open-source relational DBMS that supports SQL, ACID transactions, replication/high-availability (Galera), horizontal scaling, multi-model data (JSON, vectors, columnar), numerous client drivers, and built-in security, fitting the full scope of your query.
Dolt is a relational database engine that integrates version control directly into the database management layer. It functions as a version-controlled SQL database that tracks every row and schema change using a commit-based history, allowing users to branch, merge, and audit data modifications. By implementing a wire-protocol-compatible server, the system enables standard SQL clients and tools to interact with versioned data as if they were connecting to a traditional relational database. The platform distinguishes itself by applying repository-style workflows to data management, including s
Dolt is a version-controlled SQL database engine that provides full relational database capabilities via a MySQL-compatible wire protocol, making it a genuine open-source database management system for structured and semi-structured data, though it lacks the horizontal scaling and multi-model support you may be looking for.
ToyDB is a distributed SQL database that provides a system for storing and querying data across multiple nodes. It focuses on maintaining strong consistency and fault tolerance through the implementation of a distributed consensus algorithm. The project distinguishes itself by supporting historical data versioning, enabling time-travel queries to retrieve the state of the database from a specific point in the past. It utilizes multi-version concurrency control to manage ACID transactions and ensure data integrity during concurrent operations. The system covers relational data modeling with t
ToyDB is a distributed SQL database with ACID transactions and replication, fitting the core need for a database management system, though it lacks multi-model support and multi-language client drivers.
immudb is a tamperproof database that maintains an immutable record of entries using cryptographic commit logging. It ensures verifiable database integrity by utilizing Merkle trees to generate membership and consistency proofs that detect unauthorized data alterations. The system employs a multi-model storage engine that unifies key-value, document, and relational data structures within a single immutable backend. It provides compatibility with the PostgreSQL wire protocol, allowing it to integrate with standard SQL clients, ORMs, and database tools. The project covers broad capabilities in
immudb is a tamperproof database that supports SQL through PostgreSQL wire protocol and multi-model storage (key-value, document, relational), making it a genuine database system for structured and semi-structured data; however, its focus on cryptographic immutability and compliance rather than general-purpose ACID transactions or horizontal scaling means it covers only part of the feature set you listed.
FoundationDB is an ACID-compliant distributed transactional key-value store. It functions as a scalable database engine that ensures strict serializability and data consistency across a cluster of servers using a shared-nothing architecture. The system is distinguished by its multi-region replication capabilities, allowing data to be synchronized across different datacenters for high availability and disaster recovery. It utilizes optimistic concurrency control to manage distributed transactions and employs a majority-based coordination system to maintain cluster state. The platform provides
FoundationDB is a distributed ACID-compliant transactional key-value store with replication, horizontal scaling, and client drivers, but it lacks SQL support and multi-model capabilities, so it only partly meets your search for a structured/semi-structured data database.
This project is a multi-model database system designed to store and manage information as documents, graphs, and key-value pairs within a single engine. It functions as a graph database and knowledge graph platform, providing the infrastructure to build, query, and visualize structured data models. By integrating vector search capabilities, the system serves as a vector database that supports retrieval-augmented generation for artificial intelligence applications. The platform distinguishes itself through a unified query language that allows users to perform document lookups, graph traversals
ArangoDB is a multi-model open-source database that handles documents, graphs, and key-values in a single engine, making it a solid fit for storing and querying structured and semi-structured data, though its custom query language (AQL) means it does not offer direct SQL support as listed in the features.
AliSQL is a fork of MySQL by Alibaba that extends the relational database management system with enhancements for high performance, scalability, and enterprise-grade availability. It retains the core MySQL identity as a SQL-based database for storing, organizing, and retrieving structured data, while adding optimizations for large-scale transactional and analytical workloads. The project differentiates itself through a set of Alibaba-specific improvements, including a columnar engine for accelerating analytical queries directly on MySQL tables, and a distributed, shared-nothing NDB Cluster en
AliSQL is a MySQL fork that retains MySQL's SQL support and ACID transactions while adding Alibaba's enhancements for scalability and high availability, making it a genuine open-source relational database management system that covers most of the required features.
Apache Druid is a real-time analytics database and distributed columnar time-series store designed for sub-second analytical queries. It functions as a data platform featuring a distributed SQL query engine and a real-time data ingestion system for moving historical and streaming data from external sources. The system is distinguished by its ability to provide low-latency analytics under high concurrency to power operational dashboards. It implements a Kerberos-secured environment for user authentication and employs a shared-nothing cluster architecture to enable horizontal scaling. The plat
Apache Druid is a real-time analytics database with a distributed SQL engine and horizontal scaling, which fits the open-source database management system category, though it lacks ACID transactions and multi-model support for a general-purpose store.
Garnet is a multi-threaded in-memory database and distributed key-value store. It functions as a high-performance remote cache store that implements the RESP wire protocol to maintain compatibility with existing Redis clients and libraries. The project is distinguished by a shared-memory architecture that enables parallel request processing across multiple cores for sub-millisecond latency. It features a tiered storage system that automatically offloads colder data from system memory to SSD or cloud storage layers, and includes a specialized vector search database for high-dimensional similar
Garnet is an in-memory key-value store and distributed cache that qualifies as a database management system, but it lacks SQL support and ACID transactions, making it a narrower fit for this query than a relational database.
Cassandra is a distributed NoSQL database and wide-column store designed for high availability and linear scalability. It functions as a fault-tolerant distributed system that utilizes an LSM-tree storage engine to optimize write throughput and manage massive datasets. The system is a CQL-compliant database, using a structured query language to manage and retrieve tabular data stored across multiple nodes. It organizes information into rows and columns based on a flexible schema and primary keys. The project provides capabilities for horizontal database scaling, distributed data partitioning
Cassandra is a distributed NoSQL wide-column store with CQL support, horizontal scaling, and replication, making it a valid open-source database management system, though it lacks full SQL support and ACID transactions.
Neo4j is a native graph database management system designed to store and query highly connected data using a property-graph model. It provides an ACID-compliant transaction engine that ensures data integrity, supported by a distributed cluster architecture that maintains causal consistency across nodes. Users interact with the system through a declarative query language, which allows for complex pattern matching and path traversal without requiring manual traversal logic. The platform distinguishes itself through its hybrid approach to data retrieval, combining traditional graph-based queries
Neo4j is an open-source graph database management system with ACID transactions, clustering for high availability, and multi-language client drivers, but it does not support SQL or multiple data models, making it a narrow fit if you need a general-purpose relational DBMS.
LibSQL is a high-performance, distributed SQL database engine that extends SQLite to support remote network access, edge computing, and real-time synchronization. It functions as an embedded database library that integrates directly into application processes while providing the infrastructure to maintain consistency across multiple geographic regions. The platform distinguishes itself by enabling database interaction over standard HTTP protocols, allowing applications to query remote data sources in serverless and edge environments without requiring local filesystem access. It includes nativ
LibSQL extends SQLite into a distributed SQL database with HTTP access and geo-replication, fitting the search for an open-source DBMS, though its focus on embedded/edge deployments makes it narrower than a full standalone server.
Redis is an in-memory, key-value database designed to provide sub-millisecond latency for read and write operations. It functions as a versatile data platform, serving as a distributed cache, a message broker, a NoSQL document store, and a vector database. The system utilizes an event-driven, single-threaded loop to process requests efficiently, while maintaining data durability through append-only persistence logs and asynchronous snapshotting mechanisms. What distinguishes Redis is its ability to handle complex data structures—including strings, hashes, lists, sets, and sorted sets—alongsid
Redis is an in-memory key-value database that stores structured data and offers replication, clustering, and multi-language clients, but it lacks SQL support and ACID transactions, making it a narrower, NoSQL-oriented fit for your open-source DBMS needs.
SpacetimeDB is a stateful, real-time database platform that executes application logic directly within the database engine. By unifying data storage and business logic, it allows developers to build applications where state transitions are processed through atomic, server-side functions. The platform maintains persistent connections to stream incremental updates to clients, ensuring that local caches remain synchronized with the server state at all times. The platform distinguishes itself by generating type-safe client interfaces directly from server-side schema definitions, ensuring consiste
SpacetimeDB is a real-time relational database platform that stores and queries structured data with SQL and atomic transactions, fitting the core database management role, though it may not fully cover features like replication or horizontal scaling at the same depth as traditional systems.
RisingWave is a cloud-native streaming database and real-time analytics engine that uses standard SQL to process continuous data streams. It functions as a streaming data lakehouse, combining the capabilities of a streaming SQL database with a platform that integrates streaming ingestion with open table formats. The system is distinguished by its use of the PostgreSQL wire protocol, allowing it to integrate with existing SQL tools and drivers. It employs a decoupled compute and storage architecture, persisting streaming state and materialized views in cloud object storage to enable independen
RisingWave is a cloud-native streaming database and real-time analytics engine that supports standard SQL via the PostgreSQL wire protocol, so it is a database management system, but its focus on continuous stream processing rather than general-purpose transactional workloads means it likely lacks full ACID transactions and multi-model support.
RethinkDB is a distributed, document-oriented database designed to store and manage JSON-formatted data across scalable clusters. It utilizes a custom log-structured storage engine with B-Tree indexing to ensure high-performance disk I/O and data persistence. The system maintains high availability through automatic sharding and replication, employing a primary-replica voting consensus mechanism to handle node failures and ensure consistent cluster operations. A defining characteristic of the platform is its reactive changefeed engine, which allows applications to subscribe to live data update
RethinkDB is a distributed document database for JSON data with automatic sharding and replication, so it fits the category of an open-source database system, but it does not support SQL or multiple data models as requested.
Pinot is a distributed, columnar analytical database designed for high-concurrency, low-latency query processing. It functions as a real-time OLAP datastore, enabling interactive, user-facing analytics by ingesting and querying massive datasets from both streaming and batch sources. The system architecture relies on a centralized controller for cluster coordination and a distributed segment-based storage model to ensure horizontal scalability. The platform distinguishes itself through a hybrid ingestion pipeline that unifies real-time event streams and historical batch data into a single quer
Pinot is a distributed columnar analytical database that stores and queries structured data with SQL, scales horizontally, and offers replication, so it fits the category of an open-source database management system. It lacks ACID transactions and multi-model support, which makes it narrower than a general-purpose system.
TDengine is a distributed time-series database designed for the high-speed ingestion, compression, and retrieval of timestamped metrics and sensor data. It functions as a SQL-compatible analytics engine, allowing users to perform complex operations on massive volumes of time-ordered information using standard relational syntax. The platform is built to serve as a backend foundation for industrial IoT environments, managing real-time data streams and device metadata through a cluster-based architecture. The system distinguishes itself through a distributed sharding architecture that uses consi
TDengine is a distributed time-series database with SQL support, replication, and horizontal scaling, fitting as an open-source database management system, though its focus on time-series data means it does not provide general-purpose ACID transactions or multi-model capabilities.
TimescaleDB is an open-source PostgreSQL extension that adds native time-series capabilities to the database. At its core, it transforms standard PostgreSQL tables into hypertables—automatically partitioned by time intervals—so data is stored in fixed-size chunks without manual sharding. The extension includes a library of over 200 built-in SQL functions purpose-built for time-series workloads, such as time bucketing, gap filling, percentile estimation, and time-weighted averages. What distinguishes TimescaleDB from generic PostgreSQL is its set of integrated time-series features that work th
TimescaleDB is a PostgreSQL extension that brings native time-series capabilities while inheriting PostgreSQL's full SQL support, ACID transactions, replication, and client drivers — so it genuinely is an open-source DBMS for structured data — but its specialization toward time-series workloads makes it narrower than a general-purpose multi-model database.
Dgraph is a distributed graph database designed to store and query highly connected data. It organizes information as nodes and edges to represent complex relationships between entities, providing a platform for managing and analyzing deeply linked datasets. The system functions as a horizontally scalable cluster that partitions data across multiple nodes to maintain performance and availability as information volume increases. It utilizes a specialized query language built for low-latency navigation of interconnected data points, allowing for the execution of complex queries across large-sca
Dgraph is a distributed graph database that handles structured and semi-structured data via nodes and edges, but its graph-specific query language and data model mean it lacks the SQL support and multi-model breadth this search likely expects.
ScyllaDB is a distributed NoSQL database engine designed for high-throughput data storage and low-latency performance at scale. It functions as a shard-aware platform that manages large-scale datasets across distributed clusters, providing a foundation for real-time applications that require consistent availability and operational stability. The system distinguishes itself through a shared-nothing architecture that distributes data across independent CPU cores to eliminate lock contention. It incorporates a user-space networking stack and an asynchronous event-driven engine to maximize hardwa
ScyllaDB is a distributed NoSQL database engine optimized for high-throughput and low-latency workloads, so it qualifies as an open-source database management system, but it does not support SQL or ACID transactions, making it a narrower fit for the full feature set you’re looking for.
DuckDB is an in-process analytical database engine designed to run directly within an application process. As a zero-dependency, embedded system, it provides enterprise-grade SQL data processing capabilities without the overhead of managing a dedicated database server. It is built to handle complex analytical and aggregation tasks by storing and retrieving information in columns, allowing for high-performance relational data manipulation. The engine distinguishes itself through a columnar vectorized execution model that maximizes CPU cache efficiency during query operations. It employs adapti
DuckDB is an open-source embedded analytical SQL database engine that handles structured and semi-structured data in-process, making it a genuine database management system, though it lacks replication, high availability, and horizontal scaling.
Materialize is a streaming SQL database that continuously ingests live data from sources such as Kafka, Redpanda, PostgreSQL, and MySQL, and incrementally maintains materialized views. It provides a PostgreSQL-compatible query engine that accepts standard SQL over the PostgreSQL wire protocol, enabling any existing SQL client or BI tool to query real-time data. The system also includes a Model Context Protocol (MCP) server that exposes live materialized view data to AI agents, providing fresh context without polling. Materialize distinguishes itself through its ability to offer configurable c
Materialize is a streaming SQL database that ingests live data and provides SQL querying via the PostgreSQL wire protocol, making it a valid open-source database management system for structured data, though its focus on real-time streaming means it may lack traditional ACID and multi-model support.
ClickHouse is a high-performance, columnar analytical database designed for real-time query execution and large-scale data aggregation. It functions as a distributed data warehouse capable of processing petabytes of information, while also providing an embedded engine that integrates directly into applications for native query capabilities without external dependencies. The system is built to handle high-throughput ingestion and complex analytical workloads, delivering millisecond-level latency for interactive dashboards and operational monitoring. The platform distinguishes itself through ad
ClickHouse is a high-performance columnar analytical DBMS that supports SQL, distributed querying, and horizontal scaling, making it a strong fit for storing and querying structured or semi-structured data, though it is optimized for OLAP workloads rather than full ACID transactions.
Gel is an object-relational database system that models data as a graph of interconnected objects. By utilizing a strongly typed schema, it enables complex relational queries and polymorphic data structures without the need for traditional join tables. The system integrates native vector storage and similarity search operators, allowing it to function as both a relational and a vector database for semantic data retrieval. The platform distinguishes itself through a comprehensive suite of developer-centric automation tools. It features a declarative migration system that tracks and versions sc
Gel is an open-source object-relational database system with a graph-based data model and developer automation tools, which fits the database management category, but it uses its own EdgeQL instead of standard SQL, so it does not fully cover the requested SQL support and some other features.
SQLite is a serverless relational database engine and C-based library that stores data in a single local disk file. It functions as an embedded SQL database, integrating directly into applications without the need for a separate server process. The engine includes specialized capabilities for full-text search indexing and spatial data querying using R-Tree structures for geographic or geometric coordinate ranges. The system provides broad support for SQL data manipulation, database recovery and repair, and change tracking for synchronizing modifications between databases. It also features a
SQLite is a serverless embedded relational database engine with strong SQL and ACID support and wide language bindings, which makes it a genuine open-source database management system, but it lacks the built-in replication, high availability, and horizontal scaling you listed as desired features.
HBase is a distributed, wide-column NoSQL store and big data storage engine designed for sparse datasets. It functions as a scalable columnar database built on top of the Hadoop Distributed File System to provide real-time read and write access to massive volumes of structured and unstructured data. The system acts as a cross-language database gateway, offering connectivity through native remote procedure calls, REST, and Thrift interfaces. It distinguishes itself through a master-worker coordination model that enables horizontal scaling and fault tolerance across a cluster. The project cove
Apache HBase is a distributed, wide-column NoSQL database on Hadoop, so it fits the broad category of database management systems but lacks the SQL support and ACID transactions you specifically require.
YDB is a distributed SQL database and analytical engine designed for horizontal scalability and strong consistency. It functions as a multi-model system that supports transactional and analytical workloads through a distributed architecture providing serializable ACID transactions. The system is distinguished by its broad protocol compatibility, implementing the PostgreSQL wire protocol for standard SQL drivers and the Kafka protocol for messaging and streaming. It further serves as a vector database, supporting vector indexes and approximate nearest neighbor searches for semantic search and
YDB is an open-source distributed SQL database with ACID transactions, high availability, and horizontal scaling, directly matching the core need for structured data management; its description also presents it as a multi-model system with vector indexes.
CouchDB is a NoSQL document database that stores data as flexible documents and exposes a RESTful API for data management over HTTP. It functions as a distributed document store, synchronizing and replicating data across multiple nodes to ensure high availability and consistency. The system includes a full-text search engine that transforms database records into queryable documents, supporting sorting and pagination. Data synchronization is handled via multi-master replication, which exchanges revision histories to maintain consistency across distributed nodes. The database utilizes multi-ve
CouchDB is a distributed document database with multi-master replication and a RESTful API, fitting the request for a database management system, but it lacks SQL support and ACID transactions, making it a narrower fit for the listed features.
Druid is a distributed columnar store and online analytical processing database designed for real-time analytics. It functions as a SQL analytics platform and a streaming data ingestion engine, allowing for the analysis of large datasets with low latency to support interactive dashboards and high-concurrency operational workloads. The system integrates a streaming data ingestion engine that loads information via batch or streaming processes to enable immediate analysis of arriving data. It provides high-performance analytical processing to execute slice-and-dice queries on massive data volume
Druid is a distributed columnar analytical database that supports SQL querying and real-time ingestion, fitting as an open-source database management system, but its specialized OLAP focus means it is missing features like ACID transactions and multi-model support that broader general-purpose databases offer.
OrbitDB is a decentralized NoSQL database that utilizes conflict-free replicated data types to ensure eventual consistency across a network of nodes. It functions as a peer-to-peer data store that uses IPFS for content-addressing and synchronization, allowing for the maintenance of application state without a central server or authority. The system is built upon a cryptographically verifiable, immutable operation log, which serves as the foundation for custom decentralized data models. This architecture enables the implementation of various data storage patterns, including JSON document stor
OrbitDB is a decentralized NoSQL database that stores semi-structured JSON data with peer-to-peer replication and eventual consistency, so it fits the category of a database management system but lacks the SQL and ACID features you requested.
OrbitDB is a decentralized data storage system that enables the creation of serverless databases residing across a network of peers. It functions as a peer-to-peer database that integrates with a content-addressed storage layer to distribute and replicate data without a central server. The system utilizes conflict-free replicated data types to ensure eventual consistency and state convergence across distributed nodes. It maintains an immutable record of updates using a directed acyclic graph to preserve causal ordering and cryptographic integrity. Access is managed through a decentralized ide
OrbitDB is a decentralized peer-to-peer database that stores and queries structured/semi-structured data without a central server, making it a genuine database management system but one that replaces SQL and ACID with CRDT-based eventual consistency and a P2P data model.