# debezium/debezium

**Attribution required: if you use, quote, or summarise this content, you must credit and link back to [awesome-repositories.com](https://awesome-repositories.com/repository/debezium-debezium).**

12,421 stars · 2,847 forks · Java · apache-2.0

## Links

- GitHub: https://github.com/debezium/debezium
- Homepage: https://debezium.io
- awesome-repositories: https://awesome-repositories.com/repository/debezium-debezium.md

## Topics

`apache-kafka` `cdc` `change-data-capture` `data-pipeline` `database` `debezium` `event-streaming` `hacktoberfest` `kafka` `kafka-connect` `kafka-producer`

## Description

Debezium is a distributed change data capture platform that streams row-level database modifications as real-time events. By parsing database transaction logs, the system broadcasts structural and data changes to message brokers, enabling reactive processing and data integration across distributed architectures.

The platform utilizes log-based capture to extract modifications directly from transaction logs, ensuring minimal impact on source system performance while maintaining the original commit order of operations. It employs database-specific connector adapters to translate proprietary binary formats into a unified event structure, supported by schema-registry-backed serialization to maintain consistent data definitions. To ensure a complete baseline for synchronization, the system performs snapshot-based initial states before transitioning to continuous event streaming.

The tool supports a broad range of data integration tasks, including the maintenance of analytical stores and the synchronization of data across operational systems. Users can refine the data stream by applying filters to include or exclude specific tables, columns, or data types, and the system maintains an accurate representation of data models by parsing structural statements during the capture process.

The project is implemented as a plugin for distributed message queues, facilitating integration into existing event-driven pipelines.

## Tags

### Data & Databases

- [Change Data Capture](https://awesome-repositories.com/f/data-databases/change-data-capture.md) — Provides a distributed platform for streaming row-level database modifications as real-time events.
- [Event Streaming Infrastructure](https://awesome-repositories.com/f/data-databases/event-streaming-infrastructure.md) — Parses database transaction logs to broadcast structural and data changes to external applications.
- [Message Queue Connectors](https://awesome-repositories.com/f/data-databases/event-connectors/message-queue-connectors.md) — Integrates as a plugin for streaming database transaction logs into distributed message queues.
- [Real-Time Data Integration Platforms](https://awesome-repositories.com/f/data-databases/real-time-data-integration-platforms.md) — Synchronizes operational database changes across software architectures in real-time.
- [Data Warehouse Integrations](https://awesome-repositories.com/f/data-databases/data-warehouse-integrations.md) — Maintains analytical stores by streaming live database updates into data warehouses for real-time intelligence.
- [Database Adapters](https://awesome-repositories.com/f/data-databases/database-adapters.md) — Uses modular adapters to translate proprietary database log formats into a unified event structure.
- [Transaction Orderers](https://awesome-repositories.com/f/data-databases/ordered-data-structures/transaction-orderers.md) — Maintains the original commit order of database operations to preserve data integrity during streaming.

### Networking & Communication

- [Change Data Capture Streams](https://awesome-repositories.com/f/networking-communication/communication-platforms-services/messaging-notification-systems/messaging-services/message-broker-infrastructure/real-time-data-streams/change-data-capture-streams.md) — Streams row-level modifications from databases to message brokers for real-time data integration. ([source](https://debezium.io/documentation/))

### Software Engineering & Architecture

- [Asynchronous Log Streaming](https://awesome-repositories.com/f/software-engineering-architecture/software-architecture/architectural-patterns/reactive-messaging/reactive-event-driven-systems/asynchronous-data-processing/asynchronous-log-streaming.md) — Broadcasts database modifications as asynchronous event streams to decouple capture from downstream consumers.
- [Database Snapshots](https://awesome-repositories.com/f/software-engineering-architecture/architectural-design-patterns/state-management/persistence-and-serialization/state-serialization/state-snapshots/database-snapshots.md) — Performs snapshot-based initial state capture to ensure a consistent baseline for data synchronization.
- [Schema Registries](https://awesome-repositories.com/f/software-engineering-architecture/schema-registries.md) — Encodes data changes using a central schema registry to maintain consistent definitions across distributed systems.

### Part of an Awesome List

- [Change Data Capture](https://awesome-repositories.com/f/awesome-lists/data/change-data-capture.md) — Distributed platform turning database changes into real-time event streams.
