What are the best Awesome Plugin-Based ETL Frameworks GitHub Repositories?

Question 1

Accepted Answer

ETL systems that use a plugin architecture for readers and writers to extend connectivity to new data sources.

**Distinct from ETL Workflows:** Focuses on the plugin-based extensibility of the ETL process, whereas candidates focus on specific ETL types like Reverse ETL or Vector ETL.

Explore 5 awesome GitHub repositories matching data & databases · Plugin-Based ETL Frameworks. Refine with filters or upvote what's useful. Top picks: alibaba/datax, pentaho/pentaho-kettle, apache/flink-cdc, apac…

Question 2

Why is alibaba/datax a recommended Plugin-Based ETL Frameworks GitHub Repositories repository?

Accepted Answer

Uses a plugin-based connector architecture to decouple reader and writer logic, allowing extensions for new heterogeneous data sources.

Question 3

Why is pentaho/pentaho-kettle a recommended Plugin-Based ETL Frameworks GitHub Repositories repository?

Accepted Answer

Provides an ETL system using a plugin architecture for readers and writers to extend connectivity to new data sources.

Question 4

Why is apache/flink-cdc a recommended Plugin-Based ETL Frameworks GitHub Repositories repository?

Accepted Answer

Implements a distributed streaming ETL framework for filtering, transforming, and routing data in flight.

Question 5

Why is apache/pinot a recommended Plugin-Based ETL Frameworks GitHub Repositories repository?

Accepted Answer

Connects distributed processing frameworks to the datastore to enable reading and writing data within complex streaming pipelines.

Question 6

Why is dlt-hub/dlt a recommended Plugin-Based ETL Frameworks GitHub Repositories repository?

Accepted Answer

Provides a pluggable framework that automates schema evolution, incremental loading, and normalization for ETL workflows.

Awesome GitHub RepositoriesPlugin-Based ETL Frameworks

alibaba/DataX

pentaho/pentaho-kettle

apache/flink-cdc

apache/pinot

dlt-hub/dlt

探索子标签