Predictive-Modelling-For-Customer-Engagement-At-VMware

Built a supervised multi-class predictive model to bucket customers based on the events and actions recorded during their interactions with the VMWare's customer engagement portals

OBJECTIVE

Improve Customer engagement with website portals of VM Ware

Suggest business on come up with a set of segment rules to identify top individuals for a digital asset and to target them with personalization on the website.
Have substantiated marketing and sales implications.
Fine tune predictive models with appropriate parameters to predict for customer segments

TECHNOLOGIES

R programming - SMOTE, LiblineaR, ggplot2, randomForest, RRF, gbm, xgboost

ALGORITHMS

SMOTE
Random Forest
LASSO
Ridge Regression
XGBoost

DATA

The dataset has 700+ variables and 50K+ records of customer interactions. The data set is confidential and could be bought from Harvard Business Publication.

DATA PRE PROCESSING

Removed null valued rows and performed SMOTE to balance the dataset for customer converts vs non customer converts. Loaded the datas set for modeling. The code can be accessed here.

Random Forest Model - Variable importance

Post exploratory analysis, we decide on important variables as predictors for our model. We use the mean decrease in Gini Index to pick on the important variables and reduce the number of dimensions in the feature set.

Post running this model, we use the mean decrease in gini index to list the important variables. Below is the list of variables which are potential predictors for a model.

We built a Lasso regression model using the top 200 variables that came out significant from the Random Forest model. We performed Cross-validation to get the best cost paramater for the LASSO regression.

We also built XGBoost model using the top 200 variables from the Random Forest model. The XGBoost model outperformed the LASSO regression in terms of accuracy and recall by 6% and 4% . However the model lacks interpretability in understanding what veriables influence the marketing and sales of products on the digital portals.

RESULTS

Interms of factors influencing the conversion of a vistor to customer, - "product page views, first data of download, top resources and pdf downloads" are top variables with high importance.
Vistors who view the product page more than average views for the page, should be priortized and personalization is required.
Vistors downloading more PDFs of various products are interested in understanding the details further, hence persuing them will add up for conversion rate.

Name		Name	Last commit message	Last commit date
Latest commit History 14 Commits
Code		Code
Images		Images
README.md		README.md

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Predictive-Modelling-For-Customer-Engagement-At-VMware

TABLE OF CONTENTS

OBJECTIVE

Improve Customer engagement with website portals of VM Ware

TECHNOLOGIES

ALGORITHMS

DATA

DATA PRE PROCESSING

Random Forest Model - Variable importance

RESULTS

About

Releases

Packages

Languages

skotak2/Predictive-Modelling-For-Customer-Engagement-At-VMware

Folders and files

Latest commit

History

Repository files navigation

Predictive-Modelling-For-Customer-Engagement-At-VMware

TABLE OF CONTENTS

OBJECTIVE

Improve Customer engagement with website portals of VM Ware

TECHNOLOGIES

ALGORITHMS

DATA

DATA PRE PROCESSING

Random Forest Model - Variable importance

RESULTS

About

Resources

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages