-
Globallogic India
- Gurugram
-
19:12
(UTC +05:30) - https://www.datascienceportfol.io/evansajumathew
- in/evansajumathew
-
Northwind-Traders Public
SQL-powered analysis of sales, employee performance, and customer behavior using PostgreSQL window functions. This project uncovers key business insights to optimize decision-making.
Jupyter Notebook UpdatedFeb 13, 2025 -
-
-
netflix_sql_data_analysis Public
This project explores the Netflix dataset using SQL to answer complex analytical questions. It involves data cleansing, aggregation, ranking, and advanced SQL techniques to uncover insights such as…
UpdatedNov 30, 2024 -
Integrated Apache Kafka (KRaft mode) with Apache Druid for real-time streaming and high-performance analytics.
Python UpdatedNov 18, 2024 -
SQL-50-Leetcode-Problems Public
The SQL 50 collection on LeetCode offers a diverse set of problems aimed at evaluating and improving your SQL skills. It covers a broad spectrum of concepts, including fundamental queries, subqueri…
1 UpdatedNov 6, 2024 -
This project implements a real-time data pipeline for EURO 2024 football data, utilizing Apache Kafka for streaming, Apache Pinot for fast querying, and Apache Superset for data visualization. The …
Python MIT License UpdatedOct 25, 2024 -
This project automates the extraction of university course details (e.g., schedules, professors, course codes) from text files using Regex pattern and SpaCy NLP Model and , processes them using PyS…
Python UpdatedSep 30, 2024 -
Data-Analysis-Projects Public
This repository hosts multiple data analysis projects, showcasing a variety of real-time and batch processing pipelines. Each project highlights different tools and technologies, offering comprehen…
-
Reddit_ETL_DE Public
This project demonstrates a complete data pipeline for extracting, transforming, and loading (ETL) Reddit data into an Amazon Redshift data warehouse. The pipeline uses various AWS services and too…
-
In This Repo, it contains code for Data Analysis / Business Analyst SET -B by Geeks for Geeks PAT
Jupyter Notebook UpdatedOct 14, 2023 -
In This Repo, it contains code for Data Analysis / Business Analyst SET -A by Geeks for Geeks PTA
Jupyter Notebook UpdatedOct 13, 2023 -
e2e-data-engineering Public
Forked from airscholar/e2e-data-engineeringAn end-to-end data engineering pipeline that orchestrates data ingestion, processing, and storage using Apache Airflow, Python, Apache Kafka, Apache Zookeeper, Apache Spark, and Cassandra. All comp…
Python UpdatedOct 5, 2023 -
-
-
-
-
-
-
-
-
py Public
Forked from codebasics/pyRepository to store sample python programs for python learning
Jupyter Notebook UpdatedFeb 1, 2023 -