Azure Machine Learning designer (preview) gives you a cloud-based interactive, visual workspace that you can use to easily and quickly prep data, train and deploy machine learning models. It supports Azure Machine Learning compute, GPU or CPU. Machine Learning designer also supports publishing models as web services on Azure Kubernetes Service that can easily be consumed by other applications.
In this lab, we will be compare the performance of two binary classifiers: Two-Class Boosted Decision Tree and Two-Class Logistic Regression for predicting customer churn. The goal is to run an expensive marketing campaign for high risk customers; thus, the precision metric is going to be key in evaluating performance of these two algorithms. We will do all of this from the Azure Machine Learning designer without writing a single line of code.
-
In Azure portal, open the available machine learning workspace.
-
Select Launch studio under the Try the new Azure Machine Learning studio message.
-
When you first launch the studio, you may need to set the directory and subscription. If so, you will see this screen:
For the directory, select Udacity and for the subscription, select Azure Sponsorship. For the machine learning workspace, you may see multiple options listed. Select any of these (it doesn't matter which) and then click Get started.
-
From the studio, select Designer, Sample 5: Binary Classification – Customer Relationship Prediction.
-
In the settings panel on the right, select Select compute target.
-
In the
Set up compute target
editor, select the available compute, and then select Save.
Note: If you are facing difficulties in accessing pop-up windows or buttons in the user interface, please refer to the Help section in the lab environment.
-
From the
right-hand-side
of the pipeline, select the Two-Class Boosted Decision Tree module and then select the Delete Icon. -
From the
right-hand-side
of the pipeline, select the SMOTE module and then select the Delete Icon.
-
Select Machine Learning Algorithms section in the left navigation. Follow the steps outlined below:
-
Select the Two-Class Logistic Regression prebuilt module
-
Drag and drop the selected module on to the canvas
-
Connect the
Two-Class Logistic Regression
module to the first input of theTrain Model
module -
Connect the first output of the
Split Data
module to the second input of theTrain Model
module
-
-
Select Submit to open the
Setup pipeline run
editor.Please note that the button name in the UI is changed from Run to Submit.
-
In the
Setup pipeline run editor
, select Experiment, Create new and provideNew experiment name:
Churn-Predictor, and then select Submit. -
Wait for pipeline run to complete. It will take around 45 minutes to complete the run.
-
While you wait for the model training to complete, you can learn more about the evaluation metrics for the classification algorithm used in this lab by selecting Metrics for classification models.
-
From the
left-hand-side
of the pipeline, select Evaluate Model, Outputs, Visualize to open theEvaluate Model result visualization
dialog for theTwo-Class Boosted Decision Tree
module.
-
Scroll down to review model performance metrics for
Two-Class Boosted Decision Tree
. Observe that the Precision value is around 0.7.
-
From the
right-hand-side
of the pipeline, select Evaluate Model, Outputs, Visualize to open theEvaluate Model result visualization
dialog for theTwo-Class Logistic Regression
module.
-
Scroll down to review model performance metrics for
Two-Class Logistic Regression
. Observe that the Precision value is around 0.3.
- Based on the primary performance metric,
Precision
, it shows that theTwo-Class Boosted Decision Tree
algorithm outperforms theTwo-Class Logistic Regression
algorithm.
Congratulations! You have trained and compared performance of two different classification machine learning models. You can continue to experiment in the environment but are free to close the lab environment tab and return to the Udacity portal to continue with the lesson.