This project aims to reduce churn rate from 16.8% to 10% by exploiting both diagnostic and predictive analytics. Using the final model, the churn rate can be reduced to even below 10% based on a simulation.

LingAdeu/customer-churn-prediction

Addressing Customer Churn in an E-Commerce Company

About

In this project, our goal is to reduce the customer churn rate of an e-commerce company from 16.8% to 10%. Although the current rate of 16.8% is below the global churn rate for the e-commerce industry, it will pose a financial risk to the company if it is not addressed soon and properly. We first ran a diagnostic analysis to understand the factors that may drive churn, comparing the demographics, satisfaction, and behaviors of churned and non-churned customers. We then developed a classification model to predict churn, prioritizing the minimization of false negatives because of their financial impact (a false negative costs five times as much as a false positive).
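The 5:1 cost asymmetry and the recall-weighted F2 score mentioned in this README can be illustrated with a small, dependency-free sketch. The function names, the unit cost of 1.0, and the toy labels below are hypothetical and are not taken from the project notebook; only the 5x ratio and the choice of beta = 2 come from the text.

```python
def confusion_counts(y_true, y_pred):
    """Count TP, FP, FN, TN for binary labels where 1 means churn."""
    pairs = list(zip(y_true, y_pred))
    tp = sum(1 for t, p in pairs if t == 1 and p == 1)
    fp = sum(1 for t, p in pairs if t == 0 and p == 1)
    fn = sum(1 for t, p in pairs if t == 1 and p == 0)
    tn = sum(1 for t, p in pairs if t == 0 and p == 0)
    return tp, fp, fn, tn


def fbeta(tp, fp, fn, beta=2.0):
    """F-beta score; beta=2 weights recall more heavily than precision."""
    b2 = beta ** 2
    denom = (1 + b2) * tp + b2 * fn + fp
    return (1 + b2) * tp / denom if denom else 0.0


def misclassification_cost(fp, fn, fp_cost=1.0, fn_ratio=5.0):
    """Total cost when one false negative costs fn_ratio false positives.

    fp_cost is a hypothetical unit cost; only the 5x ratio is from the README.
    """
    return fp * fp_cost + fn * fp_cost * fn_ratio


# Toy labels for illustration only (not project data).
y_true = [1, 1, 1, 0, 0, 0, 0, 0]
y_pred = [1, 1, 0, 1, 0, 0, 0, 0]

tp, fp, fn, tn = confusion_counts(y_true, y_pred)
f2 = fbeta(tp, fp, fn, beta=2.0)
cost = misclassification_cost(fp, fn)
print(f"F2 = {f2:.3f}, cost = {cost:.1f}")
```

In this toy case the single missed churner contributes five of the six cost units, which is why a metric that favors recall, such as F2, is a natural fit for the stated business costs.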

After a series of experiments with resampling techniques (i.e., SMOTE, ADASYN, NearMiss v3) and 10 machine learning algorithms, we selected XGBoost as the final model based on both business metrics (customer acquisition cost and retention cost) and a machine learning metric (F2 score). We also identified two key factors that are open to intervention for reducing the churn rate, namely tenure and cashback amount. For customers the model predicts will churn, the business can intervene by increasing the cashback amount and lengthening the tenure. A simulation assuming a 50% intervention success rate demonstrated that reducing the churn rate to 10% is possible by combining the machine learning model with an intervention from the business stakeholder side.
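The arithmetic behind such a simulation can be sketched as follows. This is a simplified back-of-the-envelope model, not the simulation from the notebook: it assumes every churner flagged by the model receives the intervention, and the recall value of 0.9 is a hypothetical placeholder rather than the final model's reported performance. Only the 16.8% baseline, the 10% target, and the 50% success rate come from the text.

```python
def projected_churn_rate(base_rate, recall, success_rate):
    """Churn rate after intervening on the churners the model catches.

    Only true churners who are flagged (recall) and whose intervention
    succeeds (success_rate) are retained; everyone else is unchanged.
    """
    return base_rate * (1.0 - recall * success_rate)


def required_recall(base_rate, target_rate, success_rate):
    """Smallest recall that brings base_rate down to target_rate."""
    return (1.0 - target_rate / base_rate) / success_rate


BASE_RATE = 0.168      # current churn rate (from the README)
TARGET_RATE = 0.10     # goal (from the README)
SUCCESS_RATE = 0.50    # intervention success rate (from the README)
ASSUMED_RECALL = 0.90  # hypothetical value, for illustration only

new_rate = projected_churn_rate(BASE_RATE, ASSUMED_RECALL, SUCCESS_RATE)
min_recall = required_recall(BASE_RATE, TARGET_RATE, SUCCESS_RATE)
print(f"projected churn rate: {new_rate:.4f}")
print(f"recall needed to reach target: {min_recall:.4f}")
```

Under these assumptions the model needs a recall of roughly 0.81 for the 10% target to be reachable, which again shows why missed churners (false negatives) matter more than false alarms here.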

Important

The notebook documenting every step of this project is in the notebook folder. Alternatively, see the NBViewer version for a better display. To view our Tableau dashboard, use this URL.

Data

The dataset, which covers an online retail company, was obtained from Kaggle. It consists of 15 numerical columns, including the target variable Churn, and 5 categorical columns.

Folder Organization

.
├── README.Md                                   <- The top-level README for using this project.
├── data
│   ├── E Commerce Dataset.xlsx                 <- The raw dataset for the analysis on the notebook.
│   └── clean_data_with_updated_values.csv      <- The clean dataset for a Tableau analysis.
├── img                                         <- Folder containing images for the notebook
├── model
│   └── clf_final.pkl                           <- The final model
├── notebook
│   └── notebook.ipynb                          <- Jupyter notebook file for data analysis and modeling
├── requirements.txt                            <- The requirements file for reproducing the environment.
├── src
│   └── app.py                                  <- Streamlit app
└── tableau
    └── workbook.twb                            <- The Tableau workbook
