What are the best open-source alternatives to EsProc?

30 open-source projects similar to splware/esproc, ranked by shared features. Top picks: pentaho/pentaho-kettle, alibaba/otter, clickhouse/clickhouse, evidence-dev/evidence, ucbepic/docetl, jruby/jruby, apache/datafusion, zipstack/unstract, cube2222/octosql, apache/seatunnel.

Is pentaho/pentaho-kettle a good alternative to EsProc?

Pentaho Kettle is an enterprise ETL data integration platform designed to extract, transform, and load data between disparate sources and target databases. It functions as a metadata-driven orchestrator that utilizes a visual workflow designer to create and manage complex sequences of data tasks an…

Is alibaba/otter a good alternative to EsProc?

Otter is a distributed database synchronization system and change data capture tool designed to replicate data between databases across multiple geographic regions. It functions as a synchronization orchestrator and ETL data pipeline that mirrors records and associated files in real time. The syst…

Is clickhouse/clickhouse a good alternative to EsProc?

ClickHouse is a high-performance, columnar analytical database designed for real-time query execution and large-scale data aggregation. It functions as a distributed data warehouse capable of processing petabytes of information, while also providing an embedded engine that integrates directly into…

Is evidence-dev/evidence a good alternative to EsProc?

evidence-dev/evidence is an open-source alternative to EsProc.

Is ucbepic/docetl a good alternative to EsProc?

docetl is an AI-powered document ETL tool and map-reduce orchestrator designed to transform large collections of unstructured documents into structured, queryable tables using language models. It provides a declarative pipeline framework for extracting, cleaning, and transforming data from sources…

Is jruby/jruby a good alternative to EsProc?

JRuby is a Ruby language implementation that runs on the Java Virtual Machine. It serves as a cross-language runtime and execution environment, allowing Ruby code to run on the JVM and share memory with Java applications. The project functions as a bridge between Ruby and Java, enabling Ruby scrip…

Is apache/datafusion a good alternative to EsProc?

Apache DataFusion is an extensible, columnar SQL query engine that runs embedded within a host application without requiring a separate server process. It processes data in columnar batches using Apache Arrow for memory-efficient analytics, and can scale analytic workloads across multiple nodes for…

Is zipstack/unstract a good alternative to EsProc?

Unstract is an unstructured data extraction system and ETL pipeline orchestrator that uses large language models to convert documents, images, and scans into structured JSON. It provides a document extraction API for integrating these capabilities into external automation tools and includes a Model…

Is cube2222/octosql a good alternative to EsProc?

Octosql is a federated SQL query engine, data transformer, and streaming SQL processor. It allows users to execute single SQL statements across multiple disparate data sources, including different database types and file formats, to merge and transform results into a unified set. The system distin…

Is apache/seatunnel a good alternative to EsProc?

SeaTunnel is a distributed data integration engine designed to synchronize structured and unstructured data across diverse sources and sinks. It functions as a multi-engine execution framework that can run data integration tasks across different distributed computing backends to optimize workload p…

Back to splware/esproc

Open-source alternatives to EsProc

30 open-source projects similar to splware/esproc, ranked by how many features they have in common. Compare stars, activity and what each one does to find the best EsProc alternative.

pentaho/pentaho-kettle
pentaho/pentaho-kettle
8,353View on GitHub
Pentaho Kettle is an enterprise ETL data integration platform designed to extract, transform, and load data between disparate sources and target databases. It functions as a metadata-driven orchestrator that utilizes a visual workflow designer to create and manage complex sequences of data tasks and transformation pipelines. The system is distinguished by its distributed data processing engine, which executes workloads across clusters of server nodes to increase throughput. It employs a plugin-based architecture, allowing the platform to be extended via external JAR files to provide connectiv
Java
View on GitHub8,353
alibaba/otter
alibaba/otter
8,127View on GitHub
Otter is a distributed database synchronization system and change data capture tool designed to replicate data between databases across multiple geographic regions. It functions as a synchronization orchestrator and ETL data pipeline that mirrors records and associated files in real time. The system employs incremental log parsing to capture database changes and utilizes a consistency-based convergence algorithm and loop-avoidance logic to manage bi-directional replication. It processes data through a pipeline of selection, extraction, transformation, and loading to handle joins and format co
Java
View on GitHub8,127
clickhouse/clickhouse
ClickHouse/ClickHouse
48,229View on GitHub
ClickHouse is a high-performance, columnar analytical database designed for real-time query execution and large-scale data aggregation. It functions as a distributed data warehouse capable of processing petabytes of information, while also providing an embedded engine that integrates directly into applications for native query capabilities without external dependencies. The system is built to handle high-throughput ingestion and complex analytical workloads, delivering millisecond-level latency for interactive dashboards and operational monitoring. The platform distinguishes itself through ad
C++aianalyticsbig-data
View on GitHub48,229

Open-source alternatives to EsProc

pentaho/pentaho-kettle

alibaba/otter

ClickHouse/ClickHouse

evidence-dev/evidence

ucbepic/docetl

jruby/jruby

apache/datafusion

Zipstack/unstract

cube2222/octosql

apache/seatunnel

dlt-hub/dlt

mage-ai/mage-ai

TurboWay/bigdata_analyse

databricks/Spark-The-Definitive-Guide

pathwaycom/llm-app

hellodigua/ChatLab

ltsopensource/light-task-scheduler

lonng/nano

google/perfetto

lunatic-solutions/lunatic

MicrosoftDocs/azure-docs

emitter-io/emitter

apache/hadoop

google/cayley

h2oai/h2o-3

canonical/lxd

apache/druid

AlaSQL/alasql

datawhalechina/joyful-pandas

Exrick/xmall