Create a data lake on AWS S3 to store dimensional tables after processing data with Spark on an AWS EMR cluster
Updated Oct 10, 2019 - Python
Build a data warehouse from scratch, including schema design, full load, daily incremental load, and SCD Type 1 and Type 2.
This is a Flask application that converts an informational model of a decision problem to a snowflaked star schema
Model a star schema from raw, normalized Olympic Games data using dbt with Postgres, Airflow, and Docker
Building a data warehouse and ETL pipelines using Amazon S3 and Redshift
Data Modeling with Apache Cassandra
Simple scripts for data cleaning, ETL transformations, and data reorganisation
ETL pipeline that extracts and transforms student athlete academic performance data, then populates a data warehouse using a star schema dimensional model.
Udacity project: implementing an ETL process on a PostgreSQL DB to create a star schema data model
University lab exercises on big data processing.
Creating a data warehouse using AWS Redshift.
Batch and streaming data pipelines built on Databricks with PySpark that take Formula 1 racing data from multiple data sources and APIs and model it into a star schema for analysis in Power BI.
All-in-one slice-and-dice module
ETL pipeline that scrapes, cleans, and loads book data into PostgreSQL, then builds a star-schema data warehouse for optimized analysis.
This project builds a real-time food delivery analytics pipeline using AWS Kinesis, PySpark, Redshift, and QuickSight, with automated deployments via CodeBuild.
A Postgres database using a star schema to facilitate the analysis of user behaviour on a music streaming app.
Udacity Data Engineering Nanodegree - Project #5
A data warehouse on Amazon Redshift using a star schema to facilitate the analysis of user behaviour on a music streaming app.
This repository showcases a robust end-to-end data pipeline for the American Community Survey dataset, utilizing tools like Python, SparkSQL, and Docker. 🚀 Explore the architecture that transforms raw data into valuable insights through a Bronze / Silver / Gold framework. 🐙