1 repo
Storage architectures that persist data as immutable sequences of records for high-throughput I/O.
Distinguishing note: Focuses on the storage format rather than general database management.
Explore 1 awesome GitHub repository matching data & databases · Append-Only Storage Engines. Refine with filters or upvote what's useful.
Kafka is a distributed event streaming platform designed for capturing, storing, and processing real-time data streams across interconnected nodes. It functions as a distributed commit log, providing a fault-tolerant storage mechanism that records state changes sequentially to ensure data consistency and durability across distributed environments. The platform distinguishes itself through a partitioned commit log architecture that enables horizontal scaling and parallel processing of data streams. It integrates a stream processing engine for continuous transformations and aggregations, while
Persists data as an immutable sequence of records to allow for high-throughput sequential disk operations.