- GitHub: github.com/ashish-parjapat
- LinkedIn: linkedin.com/in/ashish-kumar
- Email: ashish.kalyan007@gmail.com
- Phone: +91-8307414508
- Degree: Bachelors of Technology in Computer Science and Engineering
- University: Maharshi Dayanand University, Rohtak, India
- Duration: Jul 2020 - May 2024
- GPA: 8.5/10.0
- Position: Associate Software Engineer
- Duration: Jun 2024 - Present
- Location: Gurugram, India
- Description:
- Managed ETL processes to ensure efficient data transformation and integration.
- Developed and maintained applications, ensuring they meet client requirements and internal standards.
- Experienced in ensuring software quality and reliability throughout the development lifecycle.
- Position: Trainee
- Duration: Jan 2024 - May 2024
- Location: Delhi, India
- Description:
- Proficient in Java, SQL, and J2EE with hands-on experience in application development.
- Skilled in manual testing, including test case design and defect tracking.
- Experienced in ensuring software quality and reliability throughout the development lifecycle.
- Position: MERN stack Training
- Duration: Dec 2022 - Jun 2023
- Location: Remote, India
- Description:
- Developed full-stack applications using MERN (MongoDB, Express, React, Node.js) technologies.
- Implemented RESTful APIs for seamless communication between the front-end and back-end.
- Utilized Express and Node.js to build a robust and scalable server-side architecture.
- Duration: March 2025 - April 2024
- Link: git.io/employee-etl
- Description:
- Developed an end-to-end ETL pipeline in Databricks using Medallion Architecture to process employee data across Bronze, Silver, and Gold layers with Delta Lake.
- Implemented secure data lake access using Azure App Registration and Key Vault to mount ADLS Gen2 on DBFS.
- Optimized PySpark workflows for data cleaning, transformation, and aggregation, enabling scalable and modular data processing.
- Description:
- Built and maintained an end-to-end data pipeline to ingest global automobile data from JATO, enabling real-time data availability for various markets.
- Designed and developed a dynamic dashboard using GoogleSQL to visualize car availability across countries, enhancing data accessibility for business teams.
- Ensured smooth integration of ingested data by collaborating with stakeholders and optimizing data transformations for accuracy and performance.
- Duration: Jan 2023 - Feb 2023
- Link: git.io/salon-appointment
- Description:
- Constructed a salon appointment management system using bash programming and PostgreSQL.
- Validated the functionality and performance of the bash script through rigorous testing methods.
- Programming Languages: Python, SQL, Javascript
- Big Data: PySpark, Delta Lake, Databricks
- Cloud: Azure (ADF, Blob, SQL DB), GCP (BigQuery, GCS)
- Workflow Orchestration: Apache Airflow, Google Apps Script
- Databases: PostgreSQL, MongoDB, MySQL
- Tools: Git, VSCode, Looker Studio, Linux
- ETL and Data Pipelines with Shell, Airflow and Kafka
- Introduction to Big Data with Spark and Hadoop
- Relational Database Administration (DBA)
- Data Management with Databricks: Big Data with Delta Lakes
- Optimization
- Discrete Maths
- Probability and Random Processes
- Number Theory
- Linear Algebra