KAR-NG

💭

I may be slow to respond.

KAR KAR-NG

💭

I may be slow to respond.

Data Analyst - Statistical Comparison, Mapping, Text Analytics, Forecast, Machine Learning (Regression, Classification, Clustering, Principal Component)

3 followers · 0 following

Brisbane, Australia

Achievements

nasa Public

nasa

Updated Dec 31, 2022
superstore.sales Public

superstore.sales

HTML Updated Oct 4, 2022
student Public

Updated Sep 25, 2022
Life-Expectancy-Statistical-Analysis-WHO- Public

Statistically answered 8 research questions using Multiple Factor Analysis (MFA), Principal Component Analysis (PCA), Multiple Linear Regression, Welch's t-test, Wilcoxon signed-rank test, and Long…

r statistical-analysis principal-component-analysis unsupervised-machine-learning multiple-linear-regression mixed-effects-models longitudinal-analysis

HTML Updated Aug 20, 2022
Student-Retention-Rate-of-AUS-Universities Public

student

JavaScript Updated Jul 23, 2022
KAR-NG Public

My Personal Repository

1 Updated Jul 23, 2022
soil Public

soil

Updated Jul 16, 2022
Human-Resource-Data-Mining Public

5 analytical tasks have been completed using VAT validated gower-PAM clustering, Correspondence Analysis (CA), Asym-Biplot, Multiple Correspondence Analysis (MCA), Chi-Squared test, Regression, and…

data machine-learning r clustering regression classification datamining

R Updated Jul 6, 2022
ecar Public

ecar

HTML Updated Jun 23, 2022
regression Public

regressionbook

HTML Updated Jun 8, 2022
Credit-Card-Market-Segmentation Public

VEV model from Mclust among 5 clustering algorithms has optimal performance and detected 8 distinct groups of users. Data was cleaned, standardized and feature-selected, PCA’s biplot, Ggplot, Radar…

r clustering fuzzy pca dbscan clara machinelearning-r

R Updated May 27, 2022
Food-Poison-Survey-Analysis-using-Multiple-Correspondence-Analysis Public

This project applies multiple correspondence analysis (MCA) with the techniques in scree plot, variable plots, individual plots, biplot, cosine square (CO2) and contribution statistcs (contrib) to …

ai machinelearning-r principalcomponentanalysis

Updated May 8, 2022
pima Public

pima

Updated Feb 23, 2022
Loan-EDA-and-Machine-Learning-Prediction Public

Solved 7 business tasks and identified statistical important variables related to loan application. Many plots were synthesised during EDA and machine learning. Models built include Logistic regres…

R Updated Feb 9, 2022
Brisbane_Real_Estate_Sales_2020 Public

320k obs and 11 vars cleaned and manipulated for EDA and mapping (choropleth, cluster, points) to find a new home for a Brisbane family.

r mapping eda

R 1 Updated Nov 16, 2021
Analysis-of-Titanic-Mortality Public

Data manipulation, imputation, feature engineering, and machine learning algorithms (K-Nearest neightbour, random forest, and extreme-gradient boosting) were applied to clean the dataset. A final, …

HTML Updated Oct 26, 2021
Sales-of-Summer-Clothes-in-E-commerce- Public

Solve 9 analysis tasks and identified the most important variables in driving the success of clothes sales. Achieved via 22 plots, multiple linear regression and random forest

text-mining r random-forest eda inferential-statistics

R Updated Oct 21, 2021
Predicting-House-Prices-in-Boston_UniqueVersion Public

Extracted statistical relationships between house prices and many factors, applicationised the 90% R2 Random Forest model that outcompeted MLR, Lasso, PLS, KNN, and DT into production.

machine-learning r inferencial-statistics

R Updated Oct 10, 2021
SimpleTalkDemo_R Public
Forked from SQLSuperGuru/SimpleTalkDemo_R

Demo data and R script for Simple Talk aricle

R Updated Oct 5, 2021
Dirty-Data-Challenge- Public

Clean, manipulate, transform, and join 4 messy datasets

r data-manipulation data-cleaning

R Updated Oct 3, 2021
Houston_Avocado_Prices_EDA_-_Forecast Public

18k obs & 14 vars cleaned and manipulated for EDA, assumption tests, PP, WO, Ljung-Box, and forecasting (ETS & ARIMA) for avocado prices in the US and Houston.

r time-series eda forecasting

R 1 Updated Sep 28, 2021
Marketing_Analytics Public

Solved 9 biz tasks by 18 graphs and 10 statistical methods include dummy data partitioning (RMSE & R2), stepwise model selection, multicollinearity (correlation, VIF), MLR, GLM for logistic regress…

machine-learning r eda statistical-analysis group-comparison

R 1 Updated Sep 27, 2021
Recommendation_of_Crop_Classes_by_Predictive_Model Public

Built an ML API that recommends crop classes with 99.5% accuracy; Trained 13 models included Discriminants analyses, KNN, SVMs, Naive Bayers, Decision Tree, Random Forest (RF), and Boosted RF.

git machine-learning r eda

R 1 Updated Sep 24, 2021
Bike-Share_Big_Data_Analysis Public

12 datasets, 3.7 million obs, & 13 vars were cleaned and manipulated for 6 graphs, dynamic map, and statistics to convert casual riders into members.

r mapping eda

R 1 Updated Sep 14, 2021
Oats_Variety-Fertilizer_SplitPlot_Field_Experiment Public

A factorial Split-plot system analysed by Shapiro-Wilk test, Levene’s test, Q-Q plot, CI plot, Mixed-Effect Model, ANOVA, and Tukey test.

visualization r agriculture statistical-analysis

R 2 Updated Sep 14, 2021
ResortHotel_versus_CityHotel Public

119k obs & 32 vars cleaned and manipulated to create 14 distinct graphs and statistic tables for an extensive EDA to draw insights.

r eda hotel hospitality hotel-booking

R 1 Updated Sep 14, 2021
Maize_Soil_Nutrient_CRD_Glasshouse_Experiment- Public

A CRD system (8 treatments & 3 harvests) analysed by Shapiro-Wilk test, Q-Q plot, Levene’s test, Kruskal-Wallis test, and Dunn’s Post-hoc test.

visualization r sql agriculture statistical-analysis

R 3 Updated Sep 14, 2021
Cucumber_Multi-Env_LatinSquare_Field_Experiment Public

A multi-environment Latin Square designed trial analysed by ANOVA, Two-way ANOVA, Fully Random Model, Mixed Effect Model, and Tukey test.

visualization r sql agriculture statistical-analysis

R 2 Updated Sep 14, 2021

KAR KAR-NG

Achievements

Achievements

nasa Public

Uh oh!

superstore.sales Public

Uh oh!

student Public

Uh oh!

Life-Expectancy-Statistical-Analysis-WHO- Public

Uh oh!

Student-Retention-Rate-of-AUS-Universities Public

Uh oh!

KAR-NG Public

Uh oh!

soil Public

Uh oh!

Human-Resource-Data-Mining Public

Uh oh!

ecar Public

Uh oh!

regression Public

Uh oh!

Credit-Card-Market-Segmentation Public

Uh oh!

Food-Poison-Survey-Analysis-using-Multiple-Correspondence-Analysis Public

Uh oh!

pima Public

Uh oh!

Loan-EDA-and-Machine-Learning-Prediction Public

Uh oh!

Brisbane_Real_Estate_Sales_2020 Public

Uh oh!

Analysis-of-Titanic-Mortality Public

Uh oh!

Sales-of-Summer-Clothes-in-E-commerce- Public

Uh oh!

Predicting-House-Prices-in-Boston_UniqueVersion Public

Uh oh!

SimpleTalkDemo_R Public

Uh oh!

Dirty-Data-Challenge- Public

Uh oh!

Houston_Avocado_Prices_EDA_-_Forecast Public

Uh oh!

Marketing_Analytics Public

Uh oh!

Recommendation_of_Crop_Classes_by_Predictive_Model Public

Uh oh!

Bike-Share_Big_Data_Analysis Public

Uh oh!

Oats_Variety-Fertilizer_SplitPlot_Field_Experiment Public

Uh oh!

ResortHotel_versus_CityHotel Public

Uh oh!

Maize_Soil_Nutrient_CRD_Glasshouse_Experiment- Public

Uh oh!

Cucumber_Multi-Env_LatinSquare_Field_Experiment Public

Uh oh!