1 repo
Platforms for defining, scheduling, and monitoring complex sequences of data processing tasks and dependencies.
Distinguishing note: this category specifically targets data pipeline orchestration and task dependency management.
Explore 1 GitHub repository matching Data & Databases · Workflow Orchestration Engines.
Airflow is a platform for programmatically authoring, scheduling, and monitoring complex data pipelines. It functions as a workflow automation engine that manages the lifecycle of recurring business processes by executing code-defined task dependencies. By representing workflows as directed acyclic graphs, the system ensures that task execution order and data flow are explicitly defined and reliably maintained across distributed computing environments. The platform distinguishes itself through a highly modular, provider-based architecture that decouples core orchestration logic from external
Managing the lifecycle of recurring business processes by executing code-defined task dependencies and handling state persistence across distributed environments.
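To illustrate how a directed acyclic graph fixes task execution order, here is a minimal sketch in plain Python using the standard library's graphlib module. It is not Airflow's API or scheduler. The task names (extract, transform, validate, load) and the execution_order helper are hypothetical and chosen only for illustration.

```python
from graphlib import CycleError, TopologicalSorter

# Hypothetical pipeline: each key is a task, each value is the set of
# tasks that must finish before it can start.
dependencies = {
    "extract": set(),
    "transform": {"extract"},
    "validate": {"extract"},
    "load": {"transform", "validate"},
}


def execution_order(deps):
    """Return one valid run order for the tasks in deps.

    Raises graphlib.CycleError if the dependencies contain a cycle,
    i.e. if the graph is not actually acyclic.
    """
    return list(TopologicalSorter(deps).static_order())


order = execution_order(dependencies)
print(order)  # "extract" comes first and "load" comes last

# A cyclic definition is rejected instead of being scheduled.
try:
    execution_order({"a": {"b"}, "b": {"a"}})
except CycleError:
    print("cycle detected: not a valid DAG")
```

The relative order of transform and validate is not fixed by the graph, since neither depends on the other. An orchestrator is free to run such tasks in parallel.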