Real-time Machine Learning with Apache Spark on Twitter Public Stream
-
Updated
Apr 27, 2017 - Python
8000
Real-time Machine Learning with Apache Spark on Twitter Public Stream
A Simple Real-Time Detector of DDOS Attacks with Apache Kafka And Spark Streaming
A real-time sales data analysis Application using Spark Structured Streaming, Kafka as a messaging system, PostgreSQL as a storage for processed data, and Superset for creating a dashboard.
A Realtime Stock Portfolio Manager built using Apache Distributed Technologies!
Big Data Project - SSML - Spark Streaming for Machine Learning
Track trending hashtags on Twitter in real time.
a streaming app and a dashboard for visualizing cryptocurrency data fetched from the CoinGecko API. The streaming app retrieves real-time cryptocurrency information using Spark Streaming and stores it in a PostgreSQL database.
Big data computing tasks conducted with PySpark. The problems involve MapReduce and Streaming algorithms.
Developed a real-time streaming analytics pipeline using Apache Spark to calculate and store KPIs for e-commerce sales data, including total volume of sales, orders per minute, rate of return, and average transaction size. Used Spark Streaming to read data from Kafka, Spark SQL to calculate KPIs, and Spark DataFrame to write KPIs to JSON files.
This project gets data from Spotify API , ingests into kafka for streaming and processes it through spark streaming. All this is done on Azure.
Add a description, image, and links to the sparkstreaming topic page so that developers can more easily learn about it.
To associate your repository with the sparkstreaming topic, visit your repo's landing page and select "manage topics."