What are the best open-source alternatives to Apache Spark Tutorial?

Question 1

Accepted Answer

20 open-source projects similar to waylau/apache-spark-tutorial, ranked by shared features. Top picks: akka/alpakka-kafka, ambster-public/awesome-qlik, apache/flink, apache/kafka, apache/spark, awesome-spark/awesome-spark, awesomedata/awesome-public-datasets, briatte/awesome-network-analysis, datawhalechina/wonderful-sql, elasticsearch/elasticsearch-definitive-guide.

Question 2

Is akka/alpakka-kafka a good alternative to Apache Spark Tutorial?

Accepted Answer

Alpakka Kafka connector - Alpakka is a Reactive Enterprise Integration library for Java and Scala, based on Reactive Streams and Akka.

Question 3

Is ambster-public/awesome-qlik a good alternative to Apache Spark Tutorial?

Accepted Answer

A curated list of awesome Qlik extensions and resources for Qlik Sense and QlikView

Question 4

Is apache/flink a good alternative to Apache Spark Tutorial?

Accepted Answer

Apache Flink is a distributed processing engine designed for both high-throughput, low-latency data streams and finite batch workloads. It functions as a stateful stream processor and a SQL stream processing engine, providing a unified runtime to execute relational queries and event-based transform…

Question 5

Is apache/kafka a good alternative to Apache Spark Tutorial?

Accepted Answer

Kafka is a distributed event streaming platform designed for capturing, storing, and processing real-time data streams across interconnected nodes. It functions as a distributed commit log, providing a fault-tolerant storage mechanism that records state changes sequentially to ensure data consisten…

Question 6

Is apache/spark a good alternative to Apache Spark Tutorial?

Accepted Answer

Apache Spark is a unified distributed data processing engine designed for large-scale data analysis and computation graphs. It functions as a distributed machine learning framework, a graph processing system, a real-time stream processor, and a SQL analytics engine.

The system enables the executio…

Question 7

Is awesome-spark/awesome-spark a good alternative to Apache Spark Tutorial?

Accepted Answer

A curated list of awesome Apache Spark packages and resources.

Question 8

Is awesomedata/awesome-public-datasets a good alternative to Apache Spark Tutorial?

Accepted Answer

This project is a community-maintained, open-access directory of high-quality public datasets. It serves as a centralized reference point for researchers, developers, and data scientists to locate reliable information sources across a wide spectrum of industries and scientific fields. By providing…

Question 9

Is briatte/awesome-network-analysis a good alternative to Apache Spark Tutorial?

Accepted Answer

A curated list of awesome network analysis resources.

Question 10

Is datawhalechina/wonderful-sql a good alternative to Apache Spark Tutorial?

Accepted Answer

datawhalechina/wonderful-sql is an open-source alternative to Apache Spark Tutorial.

Question 11

Is elasticsearch/elasticsearch-definitive-guide a good alternative to Apache Spark Tutorial?

Accepted Answer

elasticsearch/elasticsearch-definitive-guide is an open-source alternative to Apache Spark Tutorial.

Open-source alternatives to Apache Spark Tutorial

akka/alpakka-kafka

ambster-public/awesome-qlik

apache/flink

apache/kafka

apache/spark

awesome-spark/awesome-spark

awesomedata/awesome-public-datasets

briatte/awesome-network-analysis

datawhalechina/wonderful-sql

elasticsearch/elasticsearch-definitive-guide

galliaproject/gallia-core

igorbarinov/awesome-data-engineering

looly/elasticsearch-definitive-guide-cn

manuzhang/awesome-streaming

openmole/gridscale

oxnr/awesome-bigdata

sduff/awesome-splunk

spotify/scio

TouK/nussknacker

youngwookim/awesome-hadoop