What are the best open-source alternatives to Arroyo?

30 open-source projects similar to arroyosystems/arroyo, ranked by shared features. Top picks: risingwavelabs/risingwave, hazelcast/hazelcast, apache/pinot, apache/flink, greptimeteam/greptimedb, zhisheng17/flink-learning, robinhood/faust, netflix/metaflow, apache/spark, apache/incubator-storm.

Is risingwavelabs/risingwave a good alternative to Arroyo?

RisingWave is a cloud-native streaming database and real-time analytics engine that uses standard SQL to process continuous data streams. It functions as a streaming data lakehouse, combining the capabilities of a streaming SQL database with a platform that integrates streaming ingestion with open…

Is hazelcast/hazelcast a good alternative to Arroyo?

Hazelcast is a distributed data platform that combines an in-memory data grid with a stream processing engine to support real-time analytics and event-driven applications. It functions as a partitioned, distributed key-value store that replicates data across cluster nodes to provide low-latency acc…

Is apache/pinot a good alternative to Arroyo?

Pinot is a distributed, columnar analytical database designed for high-concurrency, low-latency query processing. It functions as a real-time OLAP datastore, enabling interactive, user-facing analytics by ingesting and querying massive datasets from both streaming and batch sources. The system arch…

Is apache/flink a good alternative to Arroyo?

Apache Flink is a distributed processing engine designed for both high-throughput, low-latency data streams and finite batch workloads. It functions as a stateful stream processor and a SQL stream processing engine, providing a unified runtime to execute relational queries and event-based transform…

Is greptimeteam/greptimedb a good alternative to Arroyo?

GreptimeDB is a distributed, open-source time-series database built for unified observability. It stores and queries metrics, logs, and traces together in a single columnar engine, supporting both SQL and PromQL for analysis. The database is designed as a Kubernetes-native operator with a decoupled…

Is zhisheng17/flink-learning a good alternative to Arroyo?

This project is a collection of educational resources and reference implementations for the Apache Flink stream processing framework. It provides a learning resource focused on mastering distributed stream processing through implementation guides, performance tuning tutorials, and practical example…

Is robinhood/faust a good alternative to Arroyo?

Faust is a Python library for building distributed stream processing applications that integrate with Kafka. It functions as an asynchronous stream processor designed to handle high-throughput event streams and real-time data analysis using asynchronous functions. The system operates as a distribu…

Is netflix/metaflow a good alternative to Arroyo?

Metaflow is a Python machine learning framework and MLOps workflow orchestrator designed to manage the lifecycle of data pipelines from local prototyping to production. It serves as a distributed compute manager and an experiment tracking system, enabling the creation of reproducible pipelines that…

Is apache/spark a good alternative to Arroyo?

Apache Spark is a unified distributed data processing engine designed for large-scale data analysis and computation graphs. It functions as a distributed machine learning framework, a graph processing system, a real-time stream processor, and a SQL analytics engine. The system enables the executio…

Is apache/incubator-storm a good alternative to Arroyo?

Apache Storm is a distributed stream processing framework and real-time data processing engine. It functions as a fault-tolerant distributed computing system designed to analyze data in motion across a cluster of machines for continuous stream computation. The system enables the creation of fault-…

Back to arroyosystems/arroyo

Open-source alternatives to Arroyo

30 open-source projects similar to arroyosystems/arroyo, ranked by how many features they have in common. Compare stars, activity and what each one does to find the best Arroyo alternative.

risingwavelabs/risingwave
risingwavelabs/risingwave
9,093View on GitHub
RisingWave is a cloud-native streaming database and real-time analytics engine that uses standard SQL to process continuous data streams. It functions as a streaming data lakehouse, combining the capabilities of a streaming SQL database with a platform that integrates streaming ingestion with open table formats. The system is distinguished by its use of the PostgreSQL wire protocol, allowing it to integrate with existing SQL tools and drivers. It employs a decoupled compute and storage architecture, persisting streaming state and materialized views in cloud object storage to enable independen
Rustapache-icebergdata-engineeringdatabase
View on GitHub9,093
hazelcast/hazelcast
hazelcast/hazelcast
6,570View on GitHub
Hazelcast is a distributed data platform that combines an in-memory data grid with a stream processing engine to support real-time analytics and event-driven applications. It functions as a partitioned, distributed key-value store that replicates data across cluster nodes to provide low-latency access and high availability. The platform also serves as a distributed SQL query engine, allowing users to execute standard SQL statements against both in-memory datasets and external data sources. What distinguishes Hazelcast is its use of a distributed consensus subsystem to maintain strongly consis
Javabig-datacachingdata-in-motion
View on GitHub6,570
apache/pinot
apache/pinot
6,098View on GitHub
Pinot is a distributed, columnar analytical database designed for high-concurrency, low-latency query processing. It functions as a real-time OLAP datastore, enabling interactive, user-facing analytics by ingesting and querying massive datasets from both streaming and batch sources. The system architecture relies on a centralized controller for cluster coordination and a distributed segment-based storage model to ensure horizontal scalability. The platform distinguishes itself through a hybrid ingestion pipeline that unifies real-time event streams and historical batch data into a single quer
Java
View on GitHub6,098

Open-source alternatives to Arroyo

risingwavelabs/risingwave

hazelcast/hazelcast

apache/pinot

apache/flink

GreptimeTeam/greptimedb

zhisheng17/flink-learning

robinhood/faust

Netflix/metaflow

apache/spark

apache/incubator-storm

apache/doris

fluent/fluent-bit

oceanbase/oceanbase

FasterXML/jackson

apache/nifi

apache/storm

infinyon/fluvio

vectordotdev/vector

redpanda-data/connect

electric-sql/electric

datahub-project/datahub

feast-dev/feast

ydb-platform/ydb

maiot-io/zenml

lancedb/lancedb

boto/boto3

rethinkdb/rethinkdb

StarRocks/starrocks

design-first/system-designer

json-schema-org/json-schema-spec