play_store_eda

EDA on Play Store Data

Summary

In this project we will perform Exploratory Data Analysis on Play Store Data. Play Store dataset contains information like reviews, ratings, size, category, etc. on around a 10,000 apps. In this project we will do some data cleaning, wrangling to clean and ready the data for exploratory analysis. Then, we will chart various variable to find any relationships between them, and identify top performing categories.

Dataset information

The dataset contains following columns: App, Category, Rating, Reviews, Size, Installs, Type, Price, Content Rating, Genres, Last Updated, Current Ver, Android Ver
Dataset shape: (10841, 13)
Libraries used: pandas, numpy, matplotlib, seaborn, missingno

Data Wrangling

De-duplication: The dataset contains duplicate rows. We first remove the duplicate rows. Then, we identify duplicate app names, which are listed in more than one categories. We keep the most reviewed app in each category.
Missing values: We visualize missing values with the missingno library. There are missing values in the dataset. We don't choose to impute the values right now. We will drop them or fill them with zeros when we do the plotting.
Custom performance metric: We add a column for a custom performance metric, which measures performance by calculating (rating * reviews / installs)

Charting:

We plot various graphs to find relationships between variables and identify top performing categories. The charts can be found in the ipython notebook.

Conclusion:

The EDA performed above reveals top categories by various metrics among which the top categories by our custom performance metric are:

Game Social Medical Weather Family The focus should be on developing apps in these top categories for maximum reach and number of installs. Also, in order to get high number of installs, developers should focus on getting more number of positive reviews with positive rating. The price of the app should be kept as low as possible, and preferrably the app should be free.

Name		Name	Last commit message	Last commit date
Latest commit History 14 Commits
Play Store Data.csv		Play Store Data.csv
Play_Store_Data_EDA.ipynb		Play_Store_Data_EDA.ipynb
README.md		README.md

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Repository files navigation

play_store_eda

Summary

Dataset information

Data Wrangling

Charting:

Conclusion:

About

Uh oh!

Releases

Packages

Languages

giramakshay/play_store_eda

Folders and files

Latest commit

History

Repository files navigation

play_store_eda

Summary

Dataset information

Data Wrangling

Charting:

Conclusion:

About

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages