What are the best open-source alternatives to Chunjun?

30 open-source projects similar to dtstack/chunjun, ranked by shared features. Top picks: hazelcast/hazelcast, apache/flink-cdc, alibaba/DataX, risingwavelabs/risingwave, dlt-hub/dlt, JerryLead/SparkInternals, airbytehq/airbyte, zendesk/maxwell, redpanda-data/connect, pentaho/pentaho-kettle.

Is hazelcast/hazelcast a good alternative to Chunjun?

Hazelcast is a distributed data platform that combines an in-memory data grid with a stream processing engine to support real-time analytics and event-driven applications. It functions as a partitioned, distributed key-value store that replicates data across cluster nodes to provide low-latency acc…

Is apache/flink-cdc a good alternative to Chunjun?

This project is a streaming data integration framework that captures real-time database changes and synchronizes them with downstream systems. It operates as a distributed streaming ETL and database synchronizer, reading database logs and snapshots to propagate row-level modifications to target sin…

Is alibaba/DataX a good alternative to Chunjun?

DataX is a distributed data integration framework and plugin-based ETL tool designed for synchronizing large datasets between heterogeneous sources and destinations. It functions as a JDBC data migration engine and offline synchronization tool, enabling the movement of data between relational datab…

Is risingwavelabs/risingwave a good alternative to Chunjun?

RisingWave is a cloud-native streaming database and real-time analytics engine that uses standard SQL to process continuous data streams. It functions as a streaming data lakehouse, combining the capabilities of a streaming SQL database with a platform that integrates streaming ingestion with open…

Is dlt-hub/dlt a good alternative to Chunjun?

dlt is a Python data ingestion tool and ETL pipeline framework designed to fetch data from diverse sources and persist it into structured destinations. It functions as a schema inference engine that automatically detects data types and flattens nested JSON structures into relational tables, moving…

Is JerryLead/SparkInternals a good alternative to Chunjun?

SparkInternals is a technical reference and architecture guide detailing the internal design and implementation of the Apache Spark distributed computing engine. It serves as a study of big data engine analysis, focusing on how the system manages cluster execution and the interaction between driver…

Is airbytehq/airbyte a good alternative to Chunjun?

Airbyte is a data integration platform designed to synchronize information between diverse applications, databases, and data warehouses. It functions as an extract, transform, and load orchestrator that manages automated data movement workflows across cloud, on-premise, and hybrid environments. The…

Is zendesk/maxwell a good alternative to Chunjun?

Maxwell is a MySQL change data capture tool and binlog streaming application that converts database modifications into structured JSON events. It functions as a data pipeline that reads MySQL binary logs to synchronize changes across external indices, search engines, and distributed messaging syste…

Is redpanda-data/connect a good alternative to Chunjun?

Connect is a Kafka data integration platform and stream processing engine used to build declarative pipelines that move and transform messages between Kafka topics and external sources. It functions as a Kafka Connect framework and a change data capture tool, streaming real-time database modificati…

Is pentaho/pentaho-kettle a good alternative to Chunjun?

Pentaho Kettle is an enterprise ETL data integration platform designed to extract, transform, and load data between disparate sources and target databases. It functions as a metadata-driven orchestrator that utilizes a visual workflow designer to create and manage complex sequences of data tasks an…

Open-source alternatives to Chunjun

30 open-source projects similar to dtstack/chunjun, ranked by how many features they have in common. Compare stars, activity and what each one does to find the best Chunjun alternative.

hazelcast/hazelcast
6,570 stars
Hazelcast is a distributed data platform that combines an in-memory data grid with a stream processing engine to support real-time analytics and event-driven applications. It functions as a partitioned, distributed key-value store that replicates data across cluster nodes to provide low-latency access and high availability. The platform also serves as a distributed SQL query engine, allowing users to execute standard SQL statements against both in-memory datasets and external data sources. What distinguishes Hazelcast is its use of a distributed consensus subsystem to maintain strongly consis
Language: Java
Topics: big-data, caching, data-in-motion
apache/flink-cdc
6,430 stars
This project is a streaming data integration framework that captures real-time database changes and synchronizes them with downstream systems. It operates as a distributed streaming ETL and database synchronizer, reading database logs and snapshots to propagate row-level modifications to target sinks. The system supports declarative data integration, allowing users to define source-to-sink data flows using SQL or YAML configurations. It distinguishes itself by automating schema evolution to maintain synchronization when source structures change and ensuring exactly-once delivery and processin
Language: Java
Topics: batch, cdc, change-data-capture
alibaba/DataX
17,241 stars
DataX is a distributed data integration framework and plugin-based ETL tool designed for synchronizing large datasets between heterogeneous sources and destinations. It functions as a JDBC data migration engine and offline synchronization tool, enabling the movement of data between relational databases, NoSQL stores, and object storage. The system utilizes a plugin-based connector architecture that decouples reader and writer logic, allowing it to map and transform data types across different storage engines using a standardized internal representation. This design supports heterogeneous data
Language: Java

Open-source alternatives to Chunjun

hazelcast/hazelcast

apache/flink-cdc

alibaba/DataX

risingwavelabs/risingwave

dlt-hub/dlt

JerryLead/SparkInternals

airbytehq/airbyte

zendesk/maxwell

redpanda-data/connect

pentaho/pentaho-kettle

Jeffail/benthos

ArroyoSystems/arroyo

benthosdev/benthos

apache/beam

pgdogdev/pgdog

pingcap/tidb

dbt-labs/dbt-core

google/osv.dev

paperclipai/paperclip

tporadowski/redis

PrefectHQ/prefect

navidrome/navidrome

getgrav/grav

nats-io/nats-server

modin-project/modin

apache/spark

apache/hadoop

databricks/learning-spark

debezium/debezium

collectiveidea/audited