youzaina001 (Yassine Ouzaina) · GitHub
youzaina001/README.md

Yassine OUZAINA 👨‍💻

Data Engineer | ML Industrialization | Cloud Architecture (AWS, Azure, GCP)

ouzaina.yassine.ai@gmail.com | LinkedIn | GitHub | Paris, France


👋 Hello! I'm Yassine, a Data Engineer with proven experience in designing robust data pipelines and industrializing machine learning models, notably at re:mind (start-up). I specialize in transforming complex data into actionable insights and building intelligent, scalable systems for real-world applications. My work has included processing tens of gigabytes of data daily and significantly improving data access times (e.g., by 40% in a SaaS environment). I am skilled in cloud architecture across GCP, AWS, and Azure, and the automation of ETL workflows.

I'm actively seeking new opportunities in Data & AI where I can leverage my expertise to drive innovation and business value.


🚀 Core Competencies & Skills

My expertise spans the full data lifecycle, from ingestion and processing to deployment and monitoring.

🔧 Data Engineering & ETL

  • Data Pipelines & Processing: Apache Spark (PySpark, Scala), Apache Airflow, SQL, dbt, Apache Kafka
  • ETL Automation & Optimization: Talend, Custom Python scripting
  • Achievements:
    • Developed and optimized data pipelines for a SaaS product, processing tens of gigabytes of data daily.
    • Automated ETL processes, reducing data access time by 40%.


☁️ Cloud Architecture & Platforms

  • Azure: Azure Data Factory, Databricks, General cloud architecture
  • AWS: S3, Lambda, Redshift
  • GCP: BigQuery, Vertex AI
  • Experience: Contributed to designing scalable cloud architectures tailored to analytical needs, ensuring optimal performance and cost efficiency.


🧠 Machine Learning & AI

  • Model Development: Scikit-Learn, TensorFlow, Pandas, NumPy
  • MLOps & Deployment: Deployment and monitoring of ML models in production (Docker, GitHub Actions).
  • Experience:
    • Developed supervised and unsupervised ML models for large datasets.
    • Optimized prediction algorithms, enhancing model accuracy by 20%.


βš™οΈ DevOps, Databases & Tools

  • Programming: Python, Scala, Bash scripting, C
  • DevOps & Deployment: Docker, GitHub Actions, CI/CD, Azure DevOps
  • Databases: PostgreSQL, Microsoft SQL Server, MongoDB
  • Version Control & Collaboration: Git, Jira, Scrum
  • HPC: Slurm, MPI, OpenMP, CUDA, OpenCL (from HPC internship)
  • Data Visualization: Power BI



🎓 Education & Certifications

  • Master's, Big Data & AI - ESG, France (2023-2024)
  • Master of Engineering in Applied Mathematics - Enseirb-Matmeca, France (2020-2023)
  • Two-year highly selective preparatory classes for French engineering schools - Morocco (2018-2020)

📜 Certifications:

  • Building RAG Agents with LLMs - NVIDIA (December 2024)
  • Large Language Models: Application through Production - Databricks (May 2024)

💡 Projects & Portfolio

Explore my repositories below for a deeper dive into my work! I particularly recommend checking out my pinned projects for a curated showcase of my capabilities in action.


🌟 Let's Connect!

I'm always open to discussing new projects, collaborations, or opportunities in the Data and AI space. Feel free to reach out!

📧 Email: ouzaina.yassine.ai@gmail.com 🔗 LinkedIn: linkedin.com/in/youzaina001/


Thanks for visiting my GitHub profile!

Pinned

  1. ML_supervised Public

    Implementing and Comparing Regression Models Using Scikit-Learn & PyCaret

    Jupyter Notebook 1

  2. fine_tuned_trocr_small_stage1_OCR Public

    Fine-tuning microsoft/trocr-base-handwritten on an OCR dataset

    Jupyter Notebook 1

  3. VertexAI_OpenAI_RAG_ChatBot Public

    VertexAI_OpenAI_RAG_ChatBot is a Python-based chatbot developed for a datathon event. This project demonstrates the integration of OpenAI and VertexAI technologies to create an interactive and inte…

    Python 1

  4. airflow-dbt-duckdb-ELT-pipeline Public

    An ELT pipeline with Airbyte and DBT

    Python
