Stars
Apache Spark 3 - Spark Programming in Python for Beginners
This repo is mostly created for pyspark and hive related interview questions.
This repo is mostly created for pyspark and hive related interview questions.
This is the Curriculum for "Learn Data Science in 3 Months" By Siraj Raval on Youtube
Data science Python notebooks: Deep learning (TensorFlow, Theano, Caffe, Keras), scikit-learn, Kaggle, big data (Spark, Hadoop MapReduce, HDFS), matplotlib, pandas, NumPy, SciPy, Python essentials,…
A Spark-based data comparison tool at scale which facilitates software development engineers to compare a plethora of pair combinations of possible data sources. Multiple execution modes in multipl…
TransmogrifAI (pronounced trăns-mŏgˈrə-fī) is an AutoML library for building modular, reusable, strongly typed machine learning workflows on Apache Spark with minimal hand-tuning
Your favorite language gets closer to bare metal.
AWS commands, snippets, scripts, and more.
Python Data Science Handbook: full text in Jupyter Notebooks
Practice your pandas skills!
VeeRAnji0425 / Databricks-Apache-Spark-2X-Certified-Developer
Forked from vivek-bombatkar/Databricks-Apache-Spark-2X-Certified-DeveloperDatabricks - Apache Spark™ - 2X Certified Developer
Because its never late to start taking notes and 'public' it...
🚚 Agile Data Preparation Workflows made easy with Pandas, Dask, cuDF, Dask-cuDF, Vaex and PySpark
Examples used in the Learning Apache Spark Course
Contains source files used in the Spark with Python course
An SBT plugin for dangerously fast development turnaround in Scala
A collections library created specifically for educational purposes. Do NOT use in production!
A repository of my work for Coursera's Machine Learning Specialization via University of Washington
Course on Udemy by Jose Portilla
These are projects that I worked on while enrolled in "Scala and Spark for Big Data and Machine Learning," from instructor, Jose Portilla.
First try at the Scala programming language and Apache Spark. Based on Udemy Course by Jose Portilla.
Source code for James Lee's Aparch Spark with Java course
Project for James' Apache Spark with Scala course