Data Engineer | ML Industrialization | Cloud Architecture (AWS, Azure, GCP)
ouzaina.yassine.ai@gmail.com | LinkedIn | GitHub | Paris, France
π Hello! I'm Yassine, a Data Engineer with proven experience in designing robust data pipelines and industrializing machine learning models, notably at re:mind (start-up). I specialize in transforming complex data into actionable insights and building intelligent, scalable systems for real-world applications. My work has included processing tens of gigabytes of data daily and significantly improving data access times (e.g., by 40% in a SaaS environment). I am skilled in cloud architecture across GCP, AWS, and Azure, and the automation of ETL workflows.
I'm actively seeking new opportunities in Data & AI where I can leverage my expertise to drive innovation and business value.
My expertise spans across the full data lifecycle, from ingestion and processing to deployment and monitoring.
- Data Pipelines & Processing: Apache Spark (PySpark, Scala), Apache Airflow, SQL, dbt, Apache Kafka
- ETL Automation & Optimization: Talend, Custom Python scripting
- Achievements:
- Developed and optimized data pipelines for a SaaS product, processing tens of gigabytes of data daily.
- Automated ETL processes, reducing data access time by 40%.
- Azure: Azure Data Factory, Databricks, General cloud architecture
- AWS: S3, Lambda, Redshift
- GCP: BigQuery, Vertex AI
- Experience: Contributed to designing scalable cloud architectures tailored to analytical needs, ensuring optimal performance and cost efficiency.
- Model Development: Scikit-Learn, TensorFlow, Pandas, NumPy
- MLOps & Deployment: Deployment and monitoring of ML models in production (Docker, GitHub Actions).
- Experience:
- Developed supervised and unsupervised ML models for large datasets.
- Optimized prediction algorithms, enhancing model accuracy by 20%.
- Programming: Python, Scala, Bash scripting, C
- DevOps & Deployment: Docker, GitHub Actions, CI/CD, Azure DevOps
- Databases: PostgreSQL, Microsoft SQL Server, MongoDB
- Version Control & Collaboration: Git, Jira, Scrum
- HPC: Slurm, MPI, OpenMP, CUDA, OpenCL (from HPC internship)
- Data Visualization: PowerBI
- Master's, Big Data & AI - ESG, France (2023-2024)
- Master of Engineering in Applied Mathematics - Enseirb-Matmeca, France (2020-2023)
- 2-year highly selective classes to prepare for French Engineering schools - Morocco (2018-2020)
π Certifications:
- Building RAG Agents with LLMs - NVIDIA (December 2024)
- Large Language Models: Application through Production - Databricks (May 2024)
Explore my repositories below for a deeper dive into my work! I particularly recommend checking out my pinned projects for a curated showcase of my capabilities in action.
I'm always open to discussing new projects, collaborations, or opportunities in the Data and AI space. Feel free to reach out!
π§ Email: ouzaina.yassine.ai@gmail.com π LinkedIn: linkedin.com/in/youzaina001/
Thanks for visiting my GitHub profile!