Why is milvus-io/milvus a recommended Distributed Database Clusters GitHub Repositories repository?

Supports distributed architecture to handle horizontal scaling across clusters for large-scale production needs.

Why is redis/go-redis a recommended Distributed Database Clusters GitHub Repositories repository?

Manages data across distributed cluster deployments to support horizontal scaling and automated request routing.

Why is dgraph-io/dgraph a recommended Distributed Database Clusters GitHub Repositories repository?

Operates as a distributed storage platform that maintains performance and availability through cluster-based partitioning.

Why is tigerbeetle/tigerbeetle a recommended Distributed Database Clusters GitHub Repositories repository?

Deploys cluster nodes across multiple sites to maintain transaction processing capabilities during site failures.

Why is codislabs/codis a recommended Distributed Database Clusters GitHub Repositories repository?

Distributes requests across a scalable group of instances to enable horizontal growth and high performance.

Why is aws/aws-cdk a recommended Distributed Database Clusters GitHub Repositories repository?

Enables the creation of multi-Region database clusters to support low-latency access and disaster recovery.

Why is yugabyte/yugabyte-db a recommended Distributed Database Clusters GitHub Repositories repository?

Employs multi-region deployment strategies to synchronize data across geographic locations for global availability.

Why is patroni/patroni a recommended Distributed Database Clusters GitHub Repositories repository?

Manages the operational deployment and coordination of distributed database clusters across multiple nodes.

Why is dimitri/pgloader a recommended Distributed Database Clusters GitHub Repositories repository?

Migrates data into Citus distributed PostgreSQL clusters with automatic shard distribution.

Why is openatomfoundation/pikiwidb a recommended Distributed Database Clusters GitHub Repositories repository?

Expands storage capacity across multiple nodes and clusters to handle massive volumes of enterprise data.

17 مستودعات

Awesome GitHub RepositoriesDistributed Database Clusters

Configurations for running database systems across multiple nodes for high availability and scale.

Distinguishing note: Focuses on the operational deployment mode for distributed systems.

Explore 17 awesome GitHub repositories matching devops & infrastructure · Distributed Database Clusters. Refine with filters or upvote what's useful.

اعثر على أفضل المستودعات باستخدام الذكاء الاصطناعي.سنبحث عن أفضل المستودعات المطابقة باستخدام الذكاء الاصطناعي.

milvus-io/milvus
milvus-io/milvus
44,804عرض على GitHub
Milvus is a specialized vector database engine designed for the indexing, management, and high-speed similarity retrieval of high-dimensional vector embeddings. It functions as a similarity search engine capable of identifying nearest neighbors within large-scale vector spaces, supporting the storage and retrieval of billions of data points while maintaining consistent performance. The system utilizes a distributed architecture that decouples storage, query, and coordination into independent services, allowing for horizontal scaling across clusters. It employs a global indexing mechanism that
Supports distributed architecture to handle horizontal scaling across clusters for large-scale production needs.
Goannscloud-nativediskann
عرض على GitHub44,804
redis/go-redis
redis/go-redis
22,159عرض على GitHub
This project is a feature-rich Go client library designed for interacting with Redis. It serves as a comprehensive interface for managing remote data stores, enabling developers to execute standard database commands, handle complex data structures, and perform asynchronous operations within Go applications. The library distinguishes itself through its support for advanced Redis capabilities, including connection pooling, pipelining, and transactional integrity. It provides specialized primitives for managing distributed clusters, including automated topology updates and request routing to sha
Manages data across distributed cluster deployments to support horizontal scaling and automated request routing.
Gogogolangredis
عرض على GitHub22,159
dgraph-io/dgraph
dgraph-io/dgraph
21,700عرض على GitHub
Dgraph is a distributed graph database designed to store and query highly connected data. It organizes information as nodes and edges to represent complex relationships between entities, providing a platform for managing and analyzing deeply linked datasets. The system functions as a horizontally scalable cluster that partitions data across multiple nodes to maintain performance and availability as information volume increases. It utilizes a specialized query language built for low-latency navigation of interconnected data points, allowing for the execution of complex queries across large-sca
Operates as a distributed storage platform that maintains performance and availability through cluster-based partitioning.
Godatabasedistributedgo
عرض على GitHub21,700
tigerbeetle/tigerbeetle
tigerbeetle/tigerbeetle
16,291عرض على GitHub
TigerBeetle is a distributed financial accounting database designed for high-volume transaction processing. It functions as a specialized transaction engine that enforces strict double-entry bookkeeping invariants, ensuring that every debit and credit is balanced and accounted for with absolute consistency. By utilizing a consensus-based replication model, the system provides high availability and data durability across geographically distributed clusters, making it suitable for mission-critical financial infrastructure. The system distinguishes itself through a performance-oriented architect
Deploys cluster nodes across multiple sites to maintain transaction processing capabilities during site failures.
Zig
عرض على GitHub16,291
codislabs/codis
CodisLabs/codis
13,223عرض على GitHub
Codis is a distributed proxy system designed for scaling Redis clusters. It provides a sharding proxy that distributes data across multiple instances and a cluster manager to oversee the environment. The system enables horizontal scaling through dynamic resharding, which allows data slots to be migrated between servers without interrupting operations. It supports multi-key atomic operations using hash tags to ensure related keys are routed to the same server. The platform includes a graphical cluster management dashboard for monitoring and administration. It implements high availability prox
Distributes requests across a scalable group of instances to enable horizontal growth and high performance.
Gogogolangnosql
عرض على GitHub13,223
aws/aws-cdk
aws/aws-cdk
12,817عرض على GitHub
The AWS Cloud Development Kit is an infrastructure-as-code framework that enables developers to define and provision cloud resources using familiar programming languages. By utilizing construct-based synthesis, it translates high-level, object-oriented code into declarative templates, allowing for the automated management of complex cloud environments through a centralized, code-driven control plane. The framework distinguishes itself through its ability to model infrastructure as a dependency-aware resource graph, ensuring that components are provisioned and updated in the correct order. It
Enables the creation of multi-Region database clusters to support low-latency access and disaster recovery.
TypeScriptawscloud-infrastructurehacktoberfest
عرض على GitHub12,817
yugabyte/yugabyte-db
yugabyte/yugabyte-db
10,349عرض على GitHub
YugabyteDB is a distributed SQL database and relational data store designed for horizontal scalability and high availability across multiple nodes or regions. It functions as a cloud-native system that ensures continuous availability and supports PostgreSQL compatible query languages and drivers. The system includes specialized capabilities as a vector database for AI, utilizing high-dimensional indexing to perform similarity searches. It is engineered as a multi-region cloud database that synchronizes data across different geographic locations to maintain global availability. The project co
Employs multi-region deployment strategies to synchronize data across geographic locations for global availability.
Ccloud-nativecppdatabase
عرض على GitHub10,349
patroni/patroni
patroni/patroni
8,182عرض على GitHub
Patroni is a high availability manager and cluster orchestrator for PostgreSQL. It functions as an automatic failover controller and replication manager that ensures continuous database availability by automating leader election and promoting standby nodes during failures. The system maintains a consistent cluster state by acting as a distributed consensus coordinator. It synchronizes configuration and manages leader elections through integration with distributed configuration stores such as etcd, ZooKeeper, or Consul. Its broader capabilities include managing both synchronous and asynchrono
Manages the operational deployment and coordination of distributed database clusters across multiple nodes.
Pythonconsuletcdfailover
عرض على GitHub8,182
dimitri/pgloader
dimitri/pgloader
6,295عرض على GitHub
pgloader is a command-line tool that automates the migration of data and schema from various source databases and file formats into PostgreSQL. It combines schema discovery, parallel data pipelines, and type casting into a single, declarative workflow, using PostgreSQL's COPY protocol for high-throughput bulk loading. The tool distinguishes itself by compiling a dedicated command language into concurrent reader-writer pipelines that handle schema introspection, data transformation, and error-resilient batch processing. It supports migrating entire databases from MySQL, MS SQL, SQLite, and Pos
Migrates data into Citus distributed PostgreSQL clusters with automatic shard distribution.
Common Lispclozure-clcommon-lispcsv
عرض على GitHub6,295
openatomfoundation/pikiwidb
OpenAtomFoundation/pikiwidb
6,113عرض على GitHub
PikiwiDB هو قاعدة بيانات NoSQL موزعة ومخزن مفتاح-قيمة يعتمد على القرص يعمل كخادم بروتوكول متوافق مع Redis. تم تصميمه للتعامل مع مجموعات بيانات أكبر من ذاكرة النظام المتاحة من خلال استخدام محرك استمرارية يخزن مجموعة البيانات الكاملة على القرص. يستخدم النظام نموذج تخزين متدرج، حيث يقوم بتخزين البيانات الساخنة التي يتم الوصول إليها بشكل متكرر في الذاكرة مع الحفاظ على الحجم الأساسي على القرص. يضمن توفراً عالياً من خلال بنية مخزن بيانات مكرر، باستخدام سجلات ثنائية غير متزامنة لمزامنة البيانات بين العقد الأساسية والثانوية. يدعم المشروع توسيع قاعدة البيانات الموزعة عبر تقسيم البيانات القائم على المجموعة (sharding) وينظم البيانات في مجموعات لتوسيع طبقة التخزين. تشمل قدراته التشغيلية مراقبة أداء النظام لتتبع استخدام الموارد ودعم النشر المعبأ في حاويات.
Expands storage capacity across multiple nodes and clusters to handle massive volumes of enterprise data.
C++nosqlnosql-data-storagenosql-databases
عرض على GitHub6,113
federatedai/fate
FederatedAI/FATE
6,048عرض على GitHub
FATE is an open-source federated learning platform that enables multiple organizations to collaboratively train machine learning models without exposing raw data to any party. It provides a complete framework for private data collaboration, allowing participants to jointly compute on sensitive information while maintaining data privacy and security guarantees through secure multi-party computation protocols. The platform distinguishes itself through its comprehensive infrastructure management capabilities, supporting automated deployment of multi-party clusters using Ansible-driven provisioni
Setting up a distributed cluster of multiple parties using automation tools for collaborative model training.
Pythonalgorithmfatefederated-learning
عرض على GitHub6,048
alibaba/alisql
alibaba/AliSQL
5,706عرض على GitHub
AliSQL is a fork of MySQL by Alibaba that extends the relational database management system with enhancements for high performance, scalability, and enterprise-grade availability. It retains the core MySQL identity as a SQL-based database for storing, organizing, and retrieving structured data, while adding optimizations for large-scale transactional and analytical workloads. The project differentiates itself through a set of Alibaba-specific improvements, including a columnar engine for accelerating analytical queries directly on MySQL tables, and a distributed, shared-nothing NDB Cluster en
Manages deployments through dedicated agent and client software for high availability and redundancy.
C++alisqldatabaseduckdb
عرض على GitHub5,706
kubeoperator/kubeoperator
KubeOperator/KubeOperator
4,982عرض على GitHub
KubeOperator هي منصة شاملة لإدارة مجموعات Kubernetes، ومنسق للبنية التحتية، ومدير للمجموعات المتعددة. تعمل كتوزيعة Kubernetes للمؤسسات مصممة لأتمتة النشر والقياس وإدارة دورة حياة مجموعات الإنتاج عبر منصات سحابية متنوعة وآلات فيزيائية. تتميز المنصة بإمكانيات متخصصة للبيئات المعزولة (air-gapped)، بما في ذلك محرك تثبيت دون اتصال يقوم بإنشاء أرشيفات برمجية وإدارة سجلات خاصة للنشر الآمن دون إنترنت. كما توفر لوحة تحكم مركزية لعمليات الأسطول، مما يسمح باستيراد مجموعات خارجية وتنسيق البنية التحتية عبر مناطق جغرافية ومناطق توافر مختلفة. يغطي النظام سطحاً تشغيلياً واسعاً، بما في ذلك توفير الأجهزة الافتراضية الآلي، وتتبع مخزون الأجهزة، وإدارة دورة الحياة التعريفية للتصحيح والقياس. كما يدمج خدمات النسخ الاحتياطي والاسترداد المتكاملة، والتحكم في الوصول القائم على الأدوار مع مزامنة LDAP، ومراقبة شاملة لصحة المجموعة ومقاييس الأداء. يتم تنفيذ المهام الإدارية وعمليات المجموعة من خلال واجهة قائمة على الويب.
Automates the deployment and lifecycle of production clusters across diverse cloud platforms and physical machines.
Goansiblecontainersdocker
عرض على GitHub4,982
autobase-tech/autobase
autobase-tech/autobase
4,275عرض على GitHub
Autobase is a self-hosted database-as-a-service platform designed to automate the deployment, scaling, and administration of high-availability PostgreSQL clusters. It functions as a cluster orchestrator that handles replication, failover, and version upgrades across multiple servers. The platform is distinguished by its GitOps-driven approach, using version control and CI/CD pipelines as the single source of truth to automate database configurations and deployments. It provides both a web-based management interface and a command-line tool for provisioning and monitoring clusters. The system
Supports automated deployment of production-ready database clusters across a mix of cloud platforms and bare-metal machines.
TypeScriptauto-failoverautobaseautomation
عرض على GitHub4,275
vitabaks/autobase
vitabaks/autobase
4,266عرض على GitHub
Autobase is a self-hosted PostgreSQL database management platform and orchestrator designed to provide database-as-a-service capabilities. It automates the full lifecycle of PostgreSQL clusters, from initial provisioning and deployment to ongoing administration and decommissioning. The system distinguishes itself by treating database infrastructure as code, allowing clusters to be deployed and updated through version control and continuous integration pipelines. It provides both a centralized web console for visual management and programmatic interfaces for automating infrastructure orchestra
Enables the deployment of database clusters across a diverse mix of public cloud providers, virtual machines, and physical servers.
TypeScript
عرض على GitHub4,266
helixdb/helix-db
HelixDB/helix-db
3,830عرض على GitHub
Helix DB is a distributed graph database and knowledge graph platform that persists nodes and edges on object storage for durable and unlimited scaling. It operates as an ACID-compliant system, ensuring data consistency through serializable snapshot isolation during concurrent operations. The project distinguishes itself by combining a vector search engine and a property graph, utilizing hybrid vector and full-text search to locate entry points for graph traversals. It enables dynamic graph querying through a domain-specific language, allowing complex logic and recursive queries to be execute
Operates as a high-availability system with auto-scaling reader nodes and gateways to prevent single points of failure.
Rustaiclidatabase
عرض على GitHub3,830
octelium/octelium
octelium/octelium
3,371عرض على GitHub
Octelium is a zero-trust network access platform and identity-aware proxy designed to secure private HTTP, SSH, and SQL resources. It functions as a secure gateway that validates human and workload identities using OIDC, SAML, and FIDO2 passkeys before granting access to internal applications and SaaS APIs. The system is distinguished by its secretless access broker, which injects credentials—such as API keys, passwords, and AWS Sigv4 signatures—at the gateway level so users can access databases and cloud resources without managing secrets. It further specializes in AI gateway administration,
Extends service availability across multiple clusters and cloud regions with unified policies.
Goabacai-gatewayapi-gateway
عرض على GitHub3,371

Awesome Distributed Database Clusters GitHub Repositories

milvus-io/milvus

redis/go-redis

dgraph-io/dgraph

tigerbeetle/tigerbeetle

CodisLabs/codis

aws/aws-cdk

yugabyte/yugabyte-db

patroni/patroni

dimitri/pgloader

OpenAtomFoundation/pikiwidb

FederatedAI/FATE

alibaba/AliSQL

KubeOperator/KubeOperator

autobase-tech/autobase

vitabaks/autobase

HelixDB/helix-db

octelium/octelium

استكشف الوسوم الفرعية