# ceph/ceph

**Attribution required: if you use, quote, or summarise this content, you must credit and link back to [awesome-repositories.com](https://awesome-repositories.com/repository/ceph-ceph).**

16,247 stars · 6,298 forks · C++ · other

## Links

- GitHub: https://github.com/ceph/ceph
- Homepage: https://ceph.io
- awesome-repositories: https://awesome-repositories.com/repository/ceph-ceph.md

## Topics

`block-storage` `cloud-storage` `distributed-file-system` `distributed-storage` `erasure-coding` `fuse` `hdfs` `high-performance` `highly-available` `iscsi` `kubernetes` `nfs` `nvme-over-fabrics` `object-store` `posix` `replication` `s3` `smb` `software-defined-storage` `storage`

## Description

Ceph is a unified, software-defined storage platform designed to provide object, block, and file storage services from a single distributed cluster. By decoupling data management from physical hardware, it enables elastic scaling across commodity hardware, allowing organizations to build large-scale storage infrastructure without reliance on proprietary vendor equipment.

The system distinguishes itself through a shared-nothing, distributed architecture that utilizes deterministic hashing for data placement. This approach eliminates centralized metadata bottlenecks, allowing the cluster to scale efficiently while maintaining even load balancing across all nodes. To ensure high availability and data reliability, the platform employs continuous background self-healing, automated integrity verification, and flexible redundancy strategies including both replication and erasure coding.

Beyond its core storage capabilities, the platform provides comprehensive infrastructure management tools for orchestrating cluster lifecycles and automating node deployment. It supports diverse application requirements by offering native integration for containerized workloads, virtualized block storage, and standard file system interfaces. The system also includes advanced performance features such as automated storage tiering and data access optimization to balance throughput and cost across varied hardware media.

## Tags

### DevOps & Infrastructure

- [Storage Volume Orchestration](https://awesome-repositories.com/f/devops-infrastructure/container-orchestration/storage-volume-orchestration.md) — Provides automated provisioning and mounting of persistent storage volumes to containerized workloads within distributed clusters. ([source](https://ceph.com/en/discover/technology/))
- [Container Storage Persistence](https://awesome-repositories.com/f/devops-infrastructure/container-storage-persistence.md) — Attaches distributed storage volumes to containerized workloads to ensure persistent data access in dynamic environments.
- [Self-Healing Infrastructure](https://awesome-repositories.com/f/devops-infrastructure/container-orchestration/workload-scheduling-scaling/self-healing-infrastructure.md) — Maintains data integrity through continuous background monitoring and automated repair of missing replicas or corruption.
- [Block Storage Services](https://awesome-repositories.com/f/devops-infrastructure/cloud-infrastructure/storage-data-persistence/block-storage-services.md) — Delivers thin-provisioned, snapshot-capable block devices to virtual machines and cloud environments.
- [Storage Distribution Algorithms](https://awesome-repositories.com/f/devops-infrastructure/load-balancing-algorithms/storage-distribution-algorithms.md) — Calculates data placement algorithmically to eliminate metadata bottlenecks and ensure even load balancing across storage hardware. ([source](https://ceph.com/en/discover/benefits/))
- [Storage Networking](https://awesome-repositories.com/f/devops-infrastructure/cloud-infrastructure/networking-connectivity/software-defined-networking-services/storage-networking.md) — Aggregates standard hardware into a scalable storage system to reduce infrastructure costs and vendor lock-in.
- [Cluster Lifecycle Management](https://awesome-repositories.com/f/devops-infrastructure/cluster-lifecycle-management.md) — Automates the deployment, configuration, and ongoing maintenance of storage nodes to ensure consistent performance. ([source](https://ceph.com/en/users/documentation/))
- [Cluster and Service Orchestration](https://awesome-repositories.com/f/devops-infrastructure/infrastructure/cluster-service-orchestration.md) — Automates the deployment, configuration, and orchestration of storage nodes and services across large-scale environments. ([source](https://tracker.ceph.com/projects/ceph))

### Software Engineering & Architecture

- [Distributed Storage Clusters](https://awesome-repositories.com/f/software-engineering-architecture/distributed-systems/distributed-data-management/distributed-storage-clusters.md) — Aggregates multiple independent nodes into a single unified storage system for enterprise-scale data management.
- [Distributed Hash Tables](https://awesome-repositories.com/f/software-engineering-architecture/distributed-systems/distributed-data-management/distributed-hash-tables.md) — Uses deterministic hashing to track object locations across thousands of devices without centralized lookup tables. ([source](https://ceph.com/en/discover/technology/))
- [Distributed Data Management](https://awesome-repositories.com/f/software-engineering-architecture/distributed-systems/distributed-data-management.md) — Maintains consistent data placement and replication across multiple storage nodes to ensure high availability. ([source](https://tracker.ceph.com/projects/ceph))
- [Shared-Nothing Architectures](https://awesome-repositories.com/f/software-engineering-architecture/shared-nothing-architectures.md) — Operates as a collection of independent nodes that manage their own resources to maximize throughput and eliminate single points of failure.
- [Storage Abstraction Layers](https://awesome-repositories.com/f/software-engineering-architecture/storage-abstraction-layers.md) — Abstracts physical storage media through pluggable engines optimized for diverse hardware input and output patterns.

### Data & Databases

- [Software-Defined Architectures](https://awesome-repositories.com/f/data-databases/data-engineering-infrastructure/data-persistence-storage/data-storage-architectures/software-defined-architectures.md) — Decouples data management from physical hardware to enable elastic scaling and high availability.
- [Erasure Coding](https://awesome-repositories.com/f/data-databases/data-management/erasure-coding.md) — Distributes data fragments across nodes to provide high fault tolerance with lower storage overhead than full replication.
- [Deterministic Placement Algorithms](https://awesome-repositories.com/f/data-databases/hash-data-structures/hashing-configurations/deterministic-placement-algorithms.md) — Uses deterministic hashing to calculate object locations across storage nodes without centralized metadata servers.
- [Unified Storage Interfaces](https://awesome-repositories.com/f/data-databases/object-storage-services/unified-storage-interfaces.md) — Exposes object, block, and file storage interfaces from a single distributed cluster to support diverse application requirements. ([source](https://ceph.com/en/discover/use-cases/))
- [Object Storage Systems](https://awesome-repositories.com/f/data-databases/object-storage-systems.md) — Manages data as objects across federated clusters using deterministic hashing to eliminate metadata bottlenecks.
- [Storage Reliability](https://awesome-repositories.com/f/data-databases/storage-reliability.md) — Ensures data availability through continuous monitoring, automated self-healing, and configurable replication across geographical locations. ([source](https://ceph.com/en/discover/use-cases/))
- [Bit Rot Detection](https://awesome-repositories.com/f/data-databases/data-governance-modeling/data-management-governance/data-integrity-validation/data-integrity/bit-rot-detection.md) — Performs scheduled background checks to detect and repair data corruption or bit rot across the storage cluster. ([source](https://ceph.com/en/discover/benefits/))
- [High Availability Architectures](https://awesome-repositories.com/f/data-databases/high-availability-architectures.md) — Ensures continuous service availability and data integrity through automated self-healing and multi-node redundancy.
- [Block Storage](https://awesome-repositories.com/f/data-databases/block-storage.md) — Provides high-performance block storage by striping virtual disk images across distributed nodes.
- [Distributed File Systems](https://awesome-repositories.com/f/data-databases/distributed-file-systems.md) — Provides a standard file system interface supporting native mounting, snapshots, and quotas across distributed infrastructure. ([source](https://ceph.com/en/discover/))
- [Storage Scaling](https://awesome-repositories.com/f/data-databases/storage-scaling.md) — Expands storage clusters dynamically across commodity hardware to accommodate data growth without performance degradation. ([source](https://ceph.com/en/discover/use-cases/))
- [Online Resharding Mechanisms](https://awesome-repositories.com/f/data-databases/object-storage/bucket-management/online-resharding-mechanisms.md) — Adjusts the distribution of object storage buckets across the cluster without requiring downtime for active users. ([source](https://ceph.com/en/news/blog/))
- [Data Access Protocols](https://awesome-repositories.com/f/data-databases/data-access-protocols.md) — Locates and accesses data directly within the cluster to eliminate metadata bottlenecks and reduce network traffic. ([source](https://ceph.com/en/discover/use-cases/))
- [Data Compression Algorithms](https://awesome-repositories.com/f/data-databases/data-compression-algorithms.md) — Applies database-level compression algorithms to stored data to reduce physical footprint without sacrificing performance. ([source](https://ceph.com/en/news/blog/))
- [Storage Tiering](https://awesome-repositories.com/f/data-databases/storage-tiering.md) — Moves data automatically between high-performance and economical storage media to balance speed and cost. ([source](https://ceph.com/en/discover/technology/))

### Networking & Communication

- [Remote Storage Protocols](https://awesome-repositories.com/f/networking-communication/remote-storage-protocols.md) — Exports block and file storage resources via standard network protocols like iSCSI and NFS for remote access. ([source](https://ceph.com/en/discover/technology/))

### Hardware & IoT

- [Commodity Hardware Abstractions](https://awesome-repositories.com/f/hardware-iot/integration-performance/hardware-interfacing-integration/commodity-hardware-abstractions.md) — Aggregates standard, off-the-shelf hardware components into a reliable and scalable distributed storage system. ([source](https://ceph.com/))

### Security & Cryptography

- [Identity and Access Management](https://awesome-repositories.com/f/security-cryptography/identity-and-access-management.md) — Controls multi-tenant permissions and identity-based security policies for object storage systems. ([source](https://ceph.com/en/news/blog/))
