What are the best open-source alternatives to Nifi?

30 open-source projects similar to apache/nifi, ranked by shared features. Top picks: orchest/orchest, prefecthq/prefect, opendcai/dataflow, unstructured-io/unstructured, rudderlabs/rudder-server, matz/streem, iterative/dvc, apache/incubator-devlake, maiot-io/zenml, dora-rs/dora.

Is orchest/orchest a good alternative to Nifi?

Orchest is a data pipeline orchestrator and containerized workflow manager. It provides a platform for designing, scheduling, and executing complex data processing sequences through a combination of a graphical interface and scripting. The platform distinguishes itself by using containers to manag…

Is prefecthq/prefect a good alternative to Nifi?

Prefect is a workflow orchestration platform designed to define, schedule, and monitor complex data pipelines as Python code. It functions as a container-native engine that wraps individual tasks in isolated environments, ensuring consistent dependencies and resource allocation across diverse infra…

Is opendcai/dataflow a good alternative to Nifi?

DataFlow is an agent-based workflow orchestrator and data pipeline designed to synthesize, clean, and augment large-scale datasets for training large language models. It functions as a synthetic data generator and text curation tool, utilizing an intelligent assistant to assemble modular processing…

Is unstructured-io/unstructured a good alternative to Nifi?

Unstructured is an enterprise-grade data orchestration engine designed to transform raw, unstructured files into structured, machine-readable formats. It functions as a comprehensive platform for document ingestion, partitioning, and enrichment, specifically engineered to prepare complex data for r…

Is rudderlabs/rudder-server a good alternative to Nifi?

Rudder Server is a customer data platform and event routing pipeline designed to collect, transform, and route customer event data from various sources to data warehouses and business tools. It functions as a customer identity resolver, linking identifiers from multiple sources to build a unified i…

Is matz/streem a good alternative to Nifi?

Streem is a stream-based programming language and data pipeline orchestrator. It provides a domain-specific language for defining concurrent data flows, allowing users to link data sources to destinations through a sequence of operations that transform and filter individual stream elements. The sy…

Is iterative/dvc a good alternative to Nifi?

DVC is a data versioning tool and pipeline orchestrator designed to track large datasets and machine learning models. It functions as a system for managing large data artifacts by storing lightweight metadata in version control while keeping the actual binaries in a separate cache. The project ser…

Is apache/incubator-devlake a good alternative to Nifi?

DevLake is a DevOps data platform and analytics tool designed to orchestrate data pipelines that ingest, transform, and sync metadata from external development tools into a unified database. It functions as a system for collecting and normalizing data from source control, CI/CD pipelines, and issue…

Is maiot-io/zenml a good alternative to Nifi?

ZenML is an extensible machine learning orchestration framework designed to manage the end-to-end lifecycle of data pipelines and AI agent workflows. It functions as a durable orchestrator that executes machine learning tasks as directed acyclic graphs, ensuring that every step is containerized for…

Is dora-rs/dora a good alternative to Nifi?

Dora is a robotics dataflow framework and distributed orchestrator used to build and manage processing pipelines. It enables the deployment of robotics workloads across clusters with remote node execution and provides a real-time data pipeline for predictable performance. The system is distinguish…

Back to apache/nifi

Open-source alternatives to Nifi

30 open-source projects similar to apache/nifi, ranked by how many features they have in common. Compare stars, activity and what each one does to find the best Nifi alternative.

orchest/orchest
orchest/orchest
4,138View on GitHub
Orchest is a data pipeline orchestrator and containerized workflow manager. It provides a platform for designing, scheduling, and executing complex data processing sequences through a combination of a graphical interface and scripting. The platform distinguishes itself by using containers to manage software dependencies, ensuring consistent execution across different environments. It features a polyglot task scheduler capable of triggering jobs written in multiple programming languages and includes a version control system that tracks historical snapshots of project configurations and code.
TypeScriptairflowclouddag
View on GitHub4,138
prefecthq/prefect
PrefectHQ/prefect
21,640View on GitHub
Prefect is a workflow orchestration platform designed to define, schedule, and monitor complex data pipelines as Python code. It functions as a container-native engine that wraps individual tasks in isolated environments, ensuring consistent dependencies and resource allocation across diverse infrastructure. By utilizing a state-machine-based orchestration model, the system tracks execution progress through discrete transitions and persistent event logs to maintain reliable and observable task processing. The platform distinguishes itself through a decoupled worker-API architecture, which sep
Pythonautomationdatadata-engineering
View on GitHub21,640
opendcai/dataflow
OpenDCAI/DataFlow
2,926View on GitHub
DataFlow is an agent-based workflow orchestrator and data pipeline designed to synthesize, clean, and augment large-scale datasets for training large language models. It functions as a synthetic data generator and text curation tool, utilizing an intelligent assistant to assemble modular processing operators into functional pipelines based on user requirements. The project distinguishes itself through a low-code approach, providing a web-based visual interface for designing and monitoring multi-stage execution flows. It features an operator-based registry system that allows for the integratio
Pythondatadata-agentdata-cleaning
View on GitHub2,926

Open-source alternatives to Nifi

orchest/orchest

PrefectHQ/prefect

OpenDCAI/DataFlow

Unstructured-IO/unstructured

rudderlabs/rudder-server

matz/streem

iterative/dvc

apache/incubator-devlake

maiot-io/zenml

dora-rs/dora

enso-org/enso

redpanda-data/connect

business-science/ai-data-science-team

apache/iggy

dbt-labs/dbt-core

apache/airflow

dagster-io/dagster

ArroyoSystems/arroyo

spotify/luigi

airbytehq/airbyte

kestra-io/kestra

benthosdev/benthos

kedro-org/kedro

vectordotdev/vector

redis/RedisInsight

snailyp/gemini-balance

redpanda-data/redpanda

apache/seatunnel

infinyon/fluvio

cocoindex-io/cocoindex