8000 GitHub - SumedhSankhe/EDA-H1B-application: This project was created to showcase the skills learnt in the DA-5020 Collecting Storing and Retrieving Data Course in the Spring 2017 Semester.
[go: up one dir, main page]
More Web Proxy on the site http://driver.im/
Skip to content

This project was created to showcase the skills learnt in the DA-5020 Collecting Storing and Retrieving Data Course in the Spring 2017 Semester.

Notifications You must be signed in to change notification settings

SumedhSankhe/EDA-H1B-application

Repository files navigation

EDA-H1B-application

This project was created to showcase the skills learnt in the DA-5020 Collecting Storing and Retrieving Data Course in the Spring 2017 Semester. The main aim of this project was to identify the errors in the data and tidy them as much as possible. After tidying the data, creating a SQL database and storing the data in it. Run some basic low level database queries and extract the data back from the database. And the plot the data into graphs. All the above processes to be carried out in R programming language using the dplyr, ggplot2, RSQ-Lite packages.

Updates 08/09/2018

Functions created to read/transform/combine data into a single data table Significant improvement in reading and transformation speed since transformation are done for every file read as opposed to combine dataframe in the previous iteration

About

This project was created to showcase the skills learnt in the DA-5020 Collecting Storing and Retrieving Data Course in the Spring 2017 Semester.

Topics

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages

0