Stars
This Beam pipeline ingests CSV files from Google Cloud Storage (GCS) and efficiently loads them into a Cloud SQL PostgreSQL database using the `COPY` command. The template is designed for parallel…
Predict whether a reservation will be canceled using robust machine learning pipelines built with Airflow and MLflow
This is a repo with links to everything you'd ever want to learn about data engineering
Data Engineering Zoomcamp is a free nine-week course that covers the fundamentals of data engineering.
Supplemental material for Udacity's "Writing READMEs" course
SQL data analysis and visualization projects using MySQL, PostgreSQL, SQLite, Tableau, Apache Spark, and PySpark.