8000 Shegzimus (Oluwasegun) Β· GitHub
[go: up one dir, main page]
More Web Proxy on the site http://driver.im/
Skip to content
View Shegzimus's full-sized avatar

Block or report Shegzimus

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
shegzimus/README.md

Deutsch

🌟 About Me

  • βš™οΈ Data Engineer skilled in building modular cloud-based ETL pipelines.
  • πŸ”’ Academic background in Mathematics (BSc) & Data Science (MSc)
  • πŸ’Ό Brief career in Management & Public Health Consulting
  • πŸ’» Enjoys designing data architecture, and containerized workflows.

πŸ› οΈ Tech Stack:

My Skills

  • Specialty: Using algorithms to design ETL packages/modules

πŸ’‘ My Design Philosophy (what you can expect)

  • Modular & Maintainable Architecture: I design ETL pipelines as loosely coupled, reusable components to simplify debugging, enable parallel development, and support future scalability.

  • Cloud-Native & Cost-Efficient: I leverage managed cloud services (e.g., AWS Glue, BigQuery, GCS, S3) and infrastructure-as-code to optimize for reliability, performance, and cost, while maintaining security best practices.

  • Data Quality & Observability First: I embed validation, logging, and monitoring at each pipeline stage to ensure data integrity, enable fast issue detection, and support confident decision-making downstream.

πŸ”­ What I’m Learning

  • RegEx
  • Apache Pulsar
  • Go (for writing Kafka producers and multithreading)

πŸ“« Connect With Me

Pinned Loading

  1. DE_Fashion_Product_Images DE_Fashion_Product_Images Public

    Apache Airflow powered ETL Pipeline for moving about 133k images from Kaggle to GCS and BigQuery

    Python

  2. DE_NASA_NeoW_Pipeline DE_NASA_NeoW_Pipeline Public

    Airflow powered ETL pipeline for moving Near-Earth-Object data from NASA to Google Cloud

    Python

  3. ML-Video-Game-Sales-Prediction ML-Video-Game-Sales-Prediction Public

    Jupyter Notebook

  4. Masters-Thesis Masters-Thesis Public

    Python

0