π Principal Engineer | Data & ML Architect | Big Data Enthusiast
πΉ 13+ years of experience in backend architecture, big data engineering, and ML pipelines
πΉ Passionate about building scalable data systems, cloud computing, and ML-driven solutions
- Languages: Java, Scala, Python, SQL
- Big Data & ML: Apache Spark, Hadoop, Hive, Presto/Trino
- Cloud & Infrastructure: AWS, Oracle Cloud, Kubernetes, Docker
- Databases: PostgreSQL, Oracle Autonomous Database
- Tools & Frameworks: Microservices, CI/CD, Kafka
- Real-time ML Pipelines β Optimized Spark ML pipelines with 80%+ accuracy and 30% runtime reduction
- Cloud-based Data Platform β Designed a scalable data architecture on Oracle Cloud for predictive analytics
- Big Data Processing Engine β Built high-performance ETL pipelines with Apache Spark & Trino
- Cloudera Certified Developer for Apache Hadoop (CCDH)
- Professional Scrum Masterβ’ (PSM 1)
- Patents:
- Dynamic Data Selection for Machine Learning Models
- Hyperparameter Tuning for ML Models