# weiye-jing/datax-web

**Attribution required: if you use, quote, or summarise this content, you must credit and link back to [awesome-repositories.com](https://awesome-repositories.com/repository/weiye-jing-datax-web).**

6,009 stars · 2,249 forks · Java · MIT

## Links

- GitHub: https://github.com/WeiYe-Jing/datax-web
- Homepage: https://segmentfault.com/u/weiye_jing/articles
- awesome-repositories: https://awesome-repositories.com/repository/weiye-jing-datax-web.md

## Description

DataX Web is a web-based management platform for scheduling, building, executing, and monitoring distributed data synchronization jobs powered by DataX. It provides a visual console for creating and managing DataX tasks without manual JSON configuration, with a distributed executor cluster that auto-registers worker nodes and supports configurable routing and blocking strategies for task distribution.

The platform offers cron-based scheduling in which tasks can be started, stopped, or have their status changed dynamically, with changes taking effect immediately. Incremental sync passes dynamic parameters to each scheduled run so that only new or changed records are extracted. It also includes role-based access control with admin and normal user tiers, real-time log streaming to a web console for monitoring running tasks, and email-based failure notifications with a pluggable extension interface for additional alert channels.

Beyond DataX tasks, the system supports executing Shell, Python, and PowerShell scripts through the same scheduling interface. It provides web-based configuration for data source connections with testing, task template creation, and JVM parameter customization, along with execution reports showing success and failure distributions.

## Tags

### Part of an Awesome List

- [Tasks and Scheduling](https://awesome-repositories.com/f/awesome-lists/devops/tasks-and-scheduling.md) — Supports cron-based scheduling with dynamic start, stop, and status changes; terminating a running task takes effect immediately. ([source](https://cdn.jsdelivr.net/gh/weiye-jing/datax-web@master/README.md))

### Data & Databases

- [Dynamic Parameter Injections](https://awesome-repositories.com/f/data-databases/change-data-capture-streams/incremental-sync/dynamic-parameter-injections.md) — Passes dynamic field values and intervals to DataX tasks for incremental data extraction.
- [Scheduled Incremental Sync Configurations](https://awesome-repositories.com/f/data-databases/change-data-capture-streams/incremental-sync/scheduled-incremental-sync-configurations.md) — Defines parent-child task dependencies and configures incremental sync with auto-calculated date ranges. ([source](https://cdn.jsdelivr.net/gh/weiye-jing/datax-web@master/README.md))
- [Scheduled Sync Engines](https://awesome-repositories.com/f/data-databases/change-data-capture-streams/incremental-sync/scheduled-sync-engines.md) — Automates periodic extraction of only new or changed records from source databases using dynamic parameter injection.
- [Multi-Source Data Aggregation](https://awesome-repositories.com/f/data-databases/data-source-connectivity-tools/multi-source-data-aggregation.md) — Connects and manages diverse data sources like RDBMS, Hive, HBase, and MongoDB through a unified web console.
- [Multi-Source Data Integration](https://awesome-repositories.com/f/data-databases/data-source-connectivity-tools/multi-source-data-integration.md) — Generates column information and simplifies configuration for multiple data sources. ([source](https://cdn.jsdelivr.net/gh/weiye-jing/datax-web@master/README.md))
- [Incremental Data Exporters](https://awesome-repositories.com/f/data-databases/incremental-data-exporters.md) — Extracts only new or changed records based on a tracked cursor or timestamp for each scheduled run. ([source](https://github.com/WeiYe-Jing/datax-web/blob/master/doc/datax-web/%E5%8A%A8%E6%80%81%E5%8F%82%E6%95%B0%E5%AE%8C%E6%88%90%E5%A2%9E%E9%87%8F%E6%8A%BD%E5%8F%96.md))
- [Incremental Data Synchronization](https://awesome-repositories.com/f/data-databases/incremental-data-synchronization.md) — Runs data synchronization tasks that only pull new or changed records since the last execution. ([source](https://github.com/WeiYe-Jing/datax-web/blob/master/doc/datax-web/%E5%8A%A8%E6%80%81%E5%8F%82%E6%95%B0%E5%AE%8C%E6%88%90%E5%A2%9E%E9%87%8F%E6%8A%BD%E5%8F%96.md))
- [Data Synchronization Consoles](https://awesome-repositories.com/f/data-databases/redis-administration-utilities/web-based-consoles/data-synchronization-consoles.md) — A web interface for creating, scheduling, and monitoring DataX data synchronization tasks across multiple data sources.
- [Incremental Sync Configurations](https://awesome-repositories.com/f/data-databases/data-modification-apis/incremental-syncing/incremental-sync-configurations.md) — Configures incremental fields and partition parameters to automatically compute data ranges for each scheduled run. ([source](https://cdn.jsdelivr.net/gh/weiye-jing/datax-web@master/README.md))
- [Incremental Update Configurations](https://awesome-repositories.com/f/data-databases/data-modification-apis/incremental-syncing/incremental-update-configurations.md) — Configures incremental fields and retry logic to automatically compute data intervals for scheduled tasks. ([source](https://cdn.jsdelivr.net/gh/weiye-jing/datax-web@master/README.md))
- [Data Source Connectivity Tools](https://awesome-repositories.com/f/data-databases/data-source-connectivity-tools.md) — Verifies data source configurations with a manual test button for JDBC sources. ([source](https://cdn.jsdelivr.net/gh/weiye-jing/datax-web@master/README.md))
- [Plugin-Based Data Source Adapters](https://awesome-repositories.com/f/data-databases/plugin-based-data-source-adapters.md) — A plugin-based adapter that connects DataX to RDBMS, Hive, HBase, ClickHouse, MongoDB, and other data sources.
- [Scheduled Execution Reports](https://awesome-repositories.com/f/data-databases/type-safe-data-abstractions/ddl-and-dml-executions/execution-status-reports/scheduled-execution-reports.md) — Provides real-time operational data and scheduling reports, including date distribution charts for successful and failed executions. ([source](https://cdn.jsdelivr.net/gh/weiye-jing/datax-web@master/README.md))
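
The incremental sync entries above describe computing a data range for each scheduled run and passing it to the task as dynamic parameters. The following is a minimal sketch of that idea, not the project's implementation: the placeholder names `lastTime` and `currentTime`, the `update_time` column, and both helper functions are hypothetical and chosen only for illustration.

```python
from datetime import datetime, timedelta
from string import Template


def incremental_window(last_trigger, now, overlap=timedelta(0)):
    """Return the (start, end) range the next run should extract.

    `overlap` optionally re-reads a short span before the last trigger
    to tolerate late-arriving rows (an assumption, not a documented feature).
    """
    start = last_trigger - overlap
    if start >= now:
        raise ValueError("window start must be earlier than window end")
    return start, now


def render_where(template, start, end, fmt="%Y-%m-%d %H:%M:%S"):
    """Substitute the window bounds into a WHERE-clause template.

    The ${lastTime} and ${currentTime} placeholders are hypothetical names.
    """
    return Template(template).substitute(
        lastTime=start.strftime(fmt),
        currentTime=end.strftime(fmt),
    )


if __name__ == "__main__":
    start, end = incremental_window(
        datetime(2024, 1, 1, 0, 0, 0), datetime(2024, 1, 1, 1, 0, 0)
    )
    where = "update_time >= '${lastTime}' AND update_time < '${currentTime}'"
    print(render_where(where, start, end))
```

Using a half-open range (inclusive start, exclusive end) means consecutive runs neither skip nor double-read a row that lands exactly on a boundary, provided the end of one window becomes the start of the next.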

### Development Tools & Productivity

- [Data Sync Job Builders](https://awesome-repositories.com/f/development-tools-productivity/build-tooling/build-orchestration-logic/build-orchestration-configuration/build-configuration-systems/web-build-configurations/data-sync-job-builders.md) — Builds DataX synchronization JSON configurations through a web interface, storing them in a database. ([source](https://cdn.jsdelivr.net/gh/weiye-jing/datax-web@master/README.md))
- [Cron Scheduling](https://awesome-repositories.com/f/development-tools-productivity/cron-scheduling.md) — Triggers data synchronization tasks on a cron schedule with dynamic start and stop.
- [Distributed Job Executors](https://awesome-repositories.com/f/development-tools-productivity/task-execution/single-task-executors/custom-task-executors/distributed-job-executors.md) — Distributes task execution across a cluster of auto-registering executors with routing strategies.
- [DataX Distributed Executors](https://awesome-repositories.com/f/development-tools-productivity/task-execution/single-task-executors/custom-task-executors/distributed-job-executors/datax-distributed-executors.md) — A cluster of worker nodes that execute DataX tasks with configurable routing, blocking, and retry strategies.
- [Task Timeout and Retry Configurations](https://awesome-repositories.com/f/development-tools-productivity/task-timeout-management/task-timeout-and-retry-configurations.md) — Sets custom timeout and retry counts for tasks, with automatic interruption on timeout and email-based failure alerts that can be extended to other channels. ([source](https://cdn.jsdelivr.net/gh/weiye-jing/datax-web@master/README.md))
- [Routing and Blocking Policies](https://awesome-repositories.com/f/development-tools-productivity/concurrent-task-execution/routing-and-blocking-policies.md) — Provides routing strategies and blocking policies for managing concurrent task execution across a cluster. ([source](https://cdn.jsdelivr.net/gh/weiye-jing/datax-web@master/README.md))
- [Parent-Child Task Triggers](https://awesome-repositories.com/f/development-tools-productivity/task-dependency-management/agent-task-dependency-resolvers/sequential-task-dependencies/parent-child-task-triggers.md) — Configures child tasks that trigger automatically after a parent task completes successfully, supporting comma-separated child task lists. ([source](https://cdn.jsdelivr.net/gh/weiye-jing/datax-web@master/README.md))
- [Configurable Routing Strategies](https://awesome-repositories.com/f/development-tools-productivity/task-execution/single-task-executors/custom-task-executors/distributed-job-executors/configurable-routing-strategies.md) — Distributes job execution across a cluster of executors using configurable routing strategies like round-robin, random, or failover. ([source](https://cdn.jsdelivr.net/gh/weiye-jing/datax-web@master/README.md))
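
The routing entries above name round-robin, random, and failover as strategies for choosing an executor. The sketch below illustrates what each strategy does in general terms; the class names, the `pick` method, and the `is_alive` health check are hypothetical and are not taken from the project's code.

```python
import itertools
import random
from typing import Callable, Optional, Sequence


class RoundRobinRouter:
    """Cycles through the registered executors in order."""

    def __init__(self) -> None:
        self._counter = itertools.count()

    def pick(self, executors: Sequence[str]) -> str:
        if not executors:
            raise LookupError("no executors registered")
        return executors[next(self._counter) % len(executors)]


class RandomRouter:
    """Chooses an executor uniformly at random."""

    def __init__(self, rng: Optional[random.Random] = None) -> None:
        self._rng = rng or random.Random()

    def pick(self, executors: Sequence[str]) -> str:
        if not executors:
            raise LookupError("no executors registered")
        return self._rng.choice(executors)


class FailoverRouter:
    """Returns the first executor that passes a health check."""

    def __init__(self, is_alive: Callable[[str], bool]) -> None:
        self._is_alive = is_alive  # hypothetical heartbeat probe

    def pick(self, executors: Sequence[str]) -> str:
        for address in executors:
            if self._is_alive(address):
                return address
        raise LookupError("no healthy executor available")


if __name__ == "__main__":
    nodes = ["10.0.0.1:9999", "10.0.0.2:9999"]
    router = RoundRobinRouter()
    print([router.pick(nodes) for _ in range(3)])
```

Keeping every strategy behind the same `pick` method is what makes the choice configurable per task: the scheduler only needs to look up a router by name before dispatching.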

### DevOps & Infrastructure

- [Task & Job Management](https://awesome-repositories.com/f/devops-infrastructure/automation-orchestration/task-execution-frameworks/task-job-management.md) — Provides a web interface to create, configure, and monitor DataX data synchronization jobs without writing code. ([source](https://segmentfault.com/u/weiye_jing/articles))
- [Cron-Based](https://awesome-repositories.com/f/devops-infrastructure/automation-orchestration/task-execution-frameworks/task-job-management/task-schedulers/os-scheduling-simulations/task-schedulers/cron-based.md) — Supports cron-based scheduling, dynamic task status changes, start/stop operations, and termination of running tasks, all taking effect immediately. ([source](https://cdn.jsdelivr.net/gh/weiye-jing/datax-web@master/README.md))
- [Distributed Job Execution](https://awesome-repositories.com/f/devops-infrastructure/distributed-job-execution.md) — Distributes DataX job execution across multiple worker nodes to handle large-scale data synchronization workloads. ([source](https://segmentfault.com/u/weiye_jing/articles))
- [Visual Job Builders](https://awesome-repositories.com/f/devops-infrastructure/job-scheduling/job-behavior-configurations/visual-job-builders.md) — Builds DataX synchronization jobs through a web interface without writing JSON configuration files manually.
- [Recurring Job Scheduling](https://awesome-repositories.com/f/devops-infrastructure/recurring-job-scheduling.md) — Runs DataX tasks on a recurring schedule, automating data transfers between sources and targets. ([source](https://segmentfault.com/u/weiye_jing/articles))
- [Cron Job Automation](https://awesome-repositories.com/f/devops-infrastructure/scheduling/cron-job-automation.md) — Schedules recurring data synchronization jobs with dependency chains, retry logic, and timeout controls.
- [DataX Job Schedulers](https://awesome-repositories.com/f/devops-infrastructure/scheduling/cron-job-automation/datax-job-schedulers.md) — A cron-based scheduler that automates DataX data synchronization tasks with distributed execution and real-time monitoring.
- [Web-Based Job Management](https://awesome-repositories.com/f/devops-infrastructure/web-based-job-management.md) — Generates DataX JSON configuration files through a web interface for job management.
- [Task Schedulers](https://awesome-repositories.com/f/devops-infrastructure/automation-orchestration/task-execution-frameworks/task-job-management/task-schedulers.md) — Extends task types beyond DataX to include Shell, Python, and PowerShell scripts, all managed through the same scheduling interface. ([source](https://cdn.jsdelivr.net/gh/weiye-jing/datax-web@master/README.md))
- [Script Execution Tasks](https://awesome-repositories.com/f/devops-infrastructure/automation-orchestration/task-execution-frameworks/task-job-management/task-schedulers/os-scheduling-simulations/task-schedulers/script-execution-tasks.md) — Extends task types beyond DataX to include Shell, Python, and PowerShell scripts, all managed through the same scheduling system. ([source](https://cdn.jsdelivr.net/gh/weiye-jing/datax-web@master/README.md))

### Security & Cryptography

- [User Role Management](https://awesome-repositories.com/f/security-cryptography/identity-access-management/identity-management/user-management/user-role-management.md) — Administers system users with admin and normal roles for role-based access control. ([source](https://cdn.jsdelivr.net/gh/weiye-jing/datax-web@master/README.md))
- [Role-Based Access Control](https://awesome-repositories.com/f/security-cryptography/role-based-access-control.md) — Manages system access with admin and normal user roles controlling feature visibility.

### Software Engineering & Architecture

- [Web-Based Configuration Generators](https://awesome-repositories.com/f/software-engineering-architecture/application-lifecycle-management/configuration-management/configuration-interfaces-and-editors/web-based-configuration-generators.md) — Generates DataX JSON configuration files through a web interface for job management.
- [Clustered Task Distribution](https://awesome-repositories.com/f/software-engineering-architecture/load-balancing-architectures/clustered-task-distribution.md) — Assigns tasks to multiple executors that auto-register with a central scheduler for cluster deployment. ([source](https://cdn.jsdelivr.net/gh/weiye-jing/datax-web@master/README.md))
- [Pluggable Data Source Architectures](https://awesome-repositories.com/f/software-engineering-architecture/pluggable-data-source-architectures.md) — Connects to diverse data sources like RDBMS, Hive, HBase, and MongoDB through a pluggable adapter interface.
- [Automatic Task Retries](https://awesome-repositories.com/f/software-engineering-architecture/task-retry-policies/manual-task-retries/automatic-task-retries.md) — Sets custom timeout and retry counts for tasks, automatically interrupting or retrying failed jobs based on configuration. ([source](https://cdn.jsdelivr.net/gh/weiye-jing/datax-web@master/README.md))
- [Configurable Blocking Strategies](https://awesome-repositories.com/f/software-engineering-architecture/blocking-task-submission/configurable-blocking-strategies.md) — Applies configurable blocking strategies when the scheduler dispatches tasks faster than the executor can process them. ([source](https://cdn.jsdelivr.net/gh/weiye-jing/datax-web@master/README.md))
- [Partition Parameter Passings](https://awesome-repositories.com/f/software-engineering-architecture/default-configuration-values/execution-parameter-configurations/application-parameter-configurators/runtime-parameter-configurators/metadata-parameter-passings/partition-parameter-passings.md) — Passes partition field values as runtime parameters so DataX tasks target the correct table partitions during execution. ([source](https://github.com/WeiYe-Jing/datax-web/blob/master/doc/datax-web/%E5%88%86%E5%8C%BA%E5%8A%A8%E6%80%81%E4%BC%A0%E5%8F%82%E4%BD%BF%E7%94%A8.md))
- [Parent-Child Dependencies](https://awesome-repositories.com/f/software-engineering-architecture/job-continuation-chaining/parent-child-dependencies.md) — A system for defining parent-child task dependencies and triggering child tasks automatically after parent completion.
- [Task Templates](https://awesome-repositories.com/f/software-engineering-architecture/task-templates.md) — Builds reusable templates from common JSON configurations for creating new jobs. ([source](https://cdn.jsdelivr.net/gh/weiye-jing/datax-web@master/README.md))
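
The parent-child entries above describe child tasks, given as a comma-separated list, that are triggered only after the parent completes successfully. A minimal sketch of that behaviour follows; the function names, the integer task IDs, and the `run` callback are hypothetical, and the cycle guard is an added assumption rather than a documented feature.

```python
from typing import Callable, Dict, List, Optional, Set


def parse_child_ids(raw: str) -> List[int]:
    """Parse a comma-separated child task list such as '2, 3'."""
    return [int(part) for part in (p.strip() for p in raw.split(",")) if part]


def run_chain(
    task_id: int,
    child_config: Dict[int, str],
    run: Callable[[int], bool],
    _visited: Optional[Set[int]] = None,
) -> List[int]:
    """Run a task, then its children if it succeeded.

    Returns the IDs of every task that was executed, in order.
    """
    visited = set() if _visited is None else _visited
    if task_id in visited:
        return []  # already ran in this chain; avoids infinite loops
    visited.add(task_id)

    executed = [task_id]
    if not run(task_id):
        return executed  # parent failed, so children are not triggered

    for child in parse_child_ids(child_config.get(task_id, "")):
        executed.extend(run_chain(child, child_config, run, visited))
    return executed


if __name__ == "__main__":
    config = {1: "2, 3", 2: "4"}
    print(run_chain(1, config, run=lambda task: True))
```

In this sketch a failed task stops only its own branch: sibling tasks of the same parent still run, because the parent itself succeeded.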

### System Administration & Monitoring

- [Task Execution Monitoring](https://awesome-repositories.com/f/system-administration-monitoring/activity-monitors/task-execution-monitoring.md) — Monitors real-time execution logs and the status of data synchronization tasks through a visual web interface. ([source](https://segmentfault.com/u/weiye_jing/articles))
- [Live Log Streaming](https://awesome-repositories.com/f/system-administration-monitoring/notification-channels/slack-notifications/email-alert-notifications/live-log-streaming.md) — Streams live execution logs and sends failure notifications via email with extensible alert channels.
- [Real-Time Application Log Monitoring](https://awesome-repositories.com/f/system-administration-monitoring/real-time-application-log-monitoring.md) — Streams live execution logs to a web console for monitoring and terminating running tasks.
- [Email Alert Notifications](https://awesome-repositories.com/f/system-administration-monitoring/notification-channels/slack-notifications/email-alert-notifications.md) — Sends failure notifications via email with a pluggable interface for additional alert channels.
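
The notification entries above mention email alerts on failure plus a pluggable interface for additional channels. One common way to structure such an extension point is sketched below, under the assumption that each channel implements a single `send` method; `AlertChannel`, `EmailChannel`, `RecordingChannel`, and `notify_failure` are hypothetical names, and the email channel only builds the message rather than sending it.

```python
from abc import ABC, abstractmethod
from email.message import EmailMessage
from typing import Iterable, List, Tuple


class AlertChannel(ABC):
    """Extension point: one implementation per notification channel."""

    @abstractmethod
    def send(self, task_name: str, error: str) -> None:
        ...


class EmailChannel(AlertChannel):
    """Builds a failure email; delivery (e.g. via smtplib) is left out."""

    def __init__(self, sender: str, recipients: List[str]) -> None:
        self.sender = sender
        self.recipients = recipients
        self.outbox: List[EmailMessage] = []

    def send(self, task_name: str, error: str) -> None:
        message = EmailMessage()
        message["Subject"] = f"Task failed: {task_name}"
        message["From"] = self.sender
        message["To"] = ", ".join(self.recipients)
        message.set_content(error)
        self.outbox.append(message)


class RecordingChannel(AlertChannel):
    """Stand-in for an additional channel such as a chat webhook."""

    def __init__(self) -> None:
        self.events: List[Tuple[str, str]] = []

    def send(self, task_name: str, error: str) -> None:
        self.events.append((task_name, error))


def notify_failure(channels: Iterable[AlertChannel], task_name: str, error: str) -> int:
    """Send the alert on every channel and return how many succeeded."""
    delivered = 0
    for channel in channels:
        try:
            channel.send(task_name, error)
            delivered += 1
        except Exception:
            continue  # one broken channel should not block the others
    return delivered
```

Adding a new alert channel under this design means writing one more `AlertChannel` subclass and registering it in the list passed to `notify_failure`.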

### Testing & Quality Assurance

- [Scheduled Execution Reports](https://awesome-repositories.com/f/testing-quality-assurance/general-testing-utilities/test-reporters/test-result-reporters/post-execution-report-generation/scheduled-execution-reports.md) — Provides real-time operational data and scheduling reports, including distribution charts for successful and failed executions. ([source](https://cdn.jsdelivr.net/gh/weiye-jing/datax-web@master/README.md))

### User Interface & Experience

- [Reusable Task Templates](https://awesome-repositories.com/f/user-interface-experience/layout-utilities/presentation-engines/template-engines/configuration-and-integration/custom-template-integrations/generation-templates/configuration-data-templates/reusable-task-templates.md) — Builds reusable templates from common JSON configurations for creating new jobs. ([source](https://cdn.jsdelivr.net/gh/weiye-jing/datax-web@master/README.md))
