Classification Tree

Context
[Diagram: in a classification problem, a model maps the inputs X1, X2, X3 to a qualitative output Y, producing predicted class labels or class probabilities.]
Default Data Set
A data set on ten thousand customers.
Variables
• Default: A factor with levels “No” and “Yes” indicating whether the customer
defaulted on their debt.
• Student: A factor with levels “No” and “Yes” indicating whether the customer is a
student.
• Balance: The average balance that the customer has remaining on their credit card
after making their monthly payment.
• Income: Income of the customer.
Default Dataset
[Diagram: the classification model maps the inputs Balance, Income, and Student to the predicted Default status, as a class label or a class probability.]
Objective
• Inference: the relationship between the output (i.e., default) and the input variables (i.e., balance, income, student).
• Prediction: whether an individual will default on his or her credit card payment.
Classification Tree
• A classification tree is very similar to a regression tree, except that we try to predict a categorical response rather than a continuous one.
Classification Tree Output
• In a regression tree, the predicted response for an observation is given by the average response of the training observations that belong to the same terminal node.
• In a classification tree, we predict that each observation belongs to the most commonly occurring class of the training observations in the region to which it belongs.
Algorithm
• The tree is grown in the same manner as a regression tree.
• However, in a classification tree, minimizing the MSE no longer makes sense.
• A natural alternative is the classification error rate: the fraction of the training observations in a region that do not belong to the most common class.
• Several other criteria are available as well, such as the Gini index and cross-entropy.
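As a sketch of how these criteria compare, the snippet below (plain Python, not tied to any particular library, and using a toy region rather than the Default data) computes all three from the class proportions p_k of a region: error rate = 1 − max p_k, Gini = Σ p_k(1 − p_k), cross-entropy = −Σ p_k log p_k.

```python
import math

def class_proportions(labels):
    """Proportion of each class among the observations in a region."""
    n = len(labels)
    return [labels.count(c) / n for c in sorted(set(labels))]

def error_rate(p):
    """Classification error rate: fraction not in the most common class."""
    return 1 - max(p)

def gini(p):
    """Gini index: sum_k p_k * (1 - p_k)."""
    return sum(pk * (1 - pk) for pk in p)

def cross_entropy(p):
    """Cross-entropy (deviance): -sum_k p_k * log(p_k)."""
    return -sum(pk * math.log(pk) for pk in p if pk > 0)

# A toy region with 8 "No" and 2 "Yes" observations:
p = class_proportions(["No"] * 8 + ["Yes"] * 2)   # [0.8, 0.2]
print(round(error_rate(p), 4))     # 0.2
print(round(gini(p), 4))           # 0.32
print(round(cross_entropy(p), 4))  # 0.5004
```

Note that the Gini index and cross-entropy are more sensitive to node purity than the error rate, which is why they are usually preferred when growing the tree.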
Steps (Default Data Set)
1. Divide the data set into two parts: one part to be used for training the model and the other part to test it.
2. Build a large tree on the training data set.
3. Prune the tree to improve accuracy.
4. Check the performance of the pruned tree on the test data set.
Step 1
We have data from 10,000 customers in total.
We randomly split the observations into two parts: a training set containing 8,000 observations and a test set containing 2,000 observations.
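A minimal sketch of such a random split, using only the Python standard library (the fixed seed is an assumption added so the split is reproducible):

```python
import random

def train_test_split(n_obs, n_train, seed=1):
    """Randomly partition observation indices into a training and a test set."""
    rng = random.Random(seed)        # fixed seed so the split is reproducible
    indices = list(range(n_obs))
    rng.shuffle(indices)
    return indices[:n_train], indices[n_train:]

train_idx, test_idx = train_test_split(10_000, 8_000)
print(len(train_idx), len(test_idx))   # 8000 2000
```

The two index lists can then be used to select the training and test rows of the Default data set.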
Step 2
Build a large tree on the
training data set.
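The core operation in growing the tree is the recursive split search. As a sketch (toy data, not the actual Default data set): for a single numeric predictor, find the threshold that minimizes the weighted Gini index of the two resulting regions. A real tree repeats this over all predictors and all current regions.

```python
def region_gini(labels):
    """Gini index of a region: sum_k p_k * (1 - p_k) over its class labels."""
    n = len(labels)
    if n == 0:
        return 0.0
    return sum((labels.count(c) / n) * (1 - labels.count(c) / n)
               for c in set(labels))

def best_split(x, y):
    """Return (threshold, weighted_gini) of the best binary split on x."""
    order = sorted(range(len(x)), key=lambda i: x[i])
    xs = [x[i] for i in order]
    ys = [y[i] for i in order]
    best = (None, float("inf"))
    for i in range(1, len(xs)):
        if xs[i] == xs[i - 1]:
            continue                          # no boundary between equal values
        thr = (xs[i - 1] + xs[i]) / 2         # midpoint candidate threshold
        left, right = ys[:i], ys[i:]
        score = (len(left) * region_gini(left)
                 + len(right) * region_gini(right)) / len(ys)
        if score < best[1]:
            best = (thr, score)
    return best

# Toy example: default tends to occur at high balances.
balance = [500, 800, 1200, 1800, 2000]
default = ["No", "No", "No", "Yes", "Yes"]
thr, score = best_split(balance, default)
print(thr, score)   # 1500.0 0.0 -- a perfect split on this toy data
```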
Step 3:
Tree Pruning
Step 3:
Pruned Tree
Step 4: Compute the Test Error

                            True Default Status
                            No      Yes     Total
Predicted        No         1932    36      1968
Default Status   Yes        12      20      32
                 Total      1944    56      2000

Classification Error Rate = (12 + 36) / 2000 = 0.024
Sensitivity = 20 / 56 = 0.357
Specificity = 1932 / 1944 = 0.9938
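The three test-set metrics above can be reproduced directly from the cells of the confusion matrix (plain Python, using the counts from the slide):

```python
# Confusion matrix for the pruned tree (rows: predicted, columns: true):
tp, fn = 20, 36      # true "Yes": 20 predicted Yes, 36 predicted No
tn, fp = 1932, 12    # true "No": 1932 predicted No, 12 predicted Yes
total = tp + fn + tn + fp

error_rate  = (fp + fn) / total   # fraction of misclassified observations
sensitivity = tp / (tp + fn)      # fraction of true "Yes" correctly flagged
specificity = tn / (tn + fp)      # fraction of true "No" correctly cleared

print(round(error_rate, 4))    # 0.024
print(round(sensitivity, 3))   # 0.357
print(round(specificity, 4))   # 0.9938
```

Note the trade-off: overall error is low, but sensitivity is modest because defaulters ("Yes") are rare in the data.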
Logistic Regression Model
Step 1
We have data from 10,000 customers in total.
We randomly split the observations into two parts: a training set containing 8,000 observations and a test set containing 2,000 observations.
Step 2
Build a logistic regression model on the training data set.

               Coeff.      Std. Error   z-stat    p-value
Intercept      −11.1300    0.5551       −20.04    <0.0001
Balance          0.0057    0.0002        22.59    <0.0001
Income           0.0000    0.0000         1.04     0.2985
Student[Yes]    −0.5406    0.2658        −2.03     0.0419
Step 3: Build the Refined Logistic Regression Model
• Use the stepwise method with AIC as the model selection criterion.

               Coeff.      Std. Error   z-stat    p-value
Intercept      −10.7400    0.4062       −26.44    <0.0001
Balance          0.0057    0.0002        22.61    <0.0001
Student[Yes]    −0.7565    0.1645        −4.59    <0.0001
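The refined model can be read off directly: P(Default = Yes) = 1 / (1 + exp(−η)) with η = −10.74 + 0.0057·Balance − 0.7565·Student[Yes]. A small sketch using the coefficients from the table (the example balance of $2,000 is an illustrative assumption, not a value from the slides):

```python
import math

# Coefficients of the refined logistic regression model (from the table above).
intercept = -10.74
b_balance = 0.0057
b_student = -0.7565     # applies when Student = "Yes"

def default_probability(balance, student):
    """P(Default = Yes) under the fitted model: 1 / (1 + exp(-eta))."""
    eta = intercept + b_balance * balance + (b_student if student else 0.0)
    return 1 / (1 + math.exp(-eta))

# A non-student vs. a student, both with a $2,000 balance:
print(round(default_probability(2000, student=False), 3))   # 0.659
print(round(default_probability(2000, student=True), 3))    # 0.476
```

At the same balance, students are predicted to be less likely to default, matching the negative Student[Yes] coefficient.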
Step 4: Compute the Test Error

                            True Default Status
                            No      Yes     Total
Predicted        No         1939    40      1979
Default Status   Yes        5       16      21
                 Total      1944    56      2000

Classification Error Rate = (5 + 40) / 2000 = 0.0225
Sensitivity = 16 / 56 = 0.2857
Specificity = 1939 / 1944 = 0.9974
Model Comparison
• Classification Tree: ROC curve for the test data, AUC = 0.9159
• Logistic Regression: ROC curve for the test data, AUC = 0.9374
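AUC has a direct interpretation: the probability that the model scores a randomly chosen defaulter above a randomly chosen non-defaulter. A minimal stdlib sketch of this pairwise computation (toy scores, not the actual test-set predictions):

```python
def auc(scores, labels, positive="Yes"):
    """AUC as the fraction of (positive, negative) pairs the model
    ranks correctly; ties count as half a correct ranking."""
    pos = [s for s, y in zip(scores, labels) if y == positive]
    neg = [s for s, y in zip(scores, labels) if y != positive]
    wins = sum((p > n) + 0.5 * (p == n) for p in pos for n in neg)
    return wins / (len(pos) * len(neg))

# Toy predicted default probabilities and true labels:
scores = [0.10, 0.40, 0.35, 0.80]
labels = ["No", "No", "Yes", "Yes"]
print(auc(scores, labels))   # 0.75
```

Unlike the error rate, AUC does not depend on a particular classification cutoff, which makes it a fairer way to compare the tree and the logistic regression model here.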
Which model is better?

Trees vs. Linear Models
◦ If the relationship between the predictors and the response is linear, then classical linear models such as linear regression would outperform regression trees.
◦ On the other hand, if the relationship between the predictors and the response is non-linear, then decision trees would outperform classical approaches.
Trees vs. Linear Models: Classification Example
Top row: the true decision boundary is linear
◦ Left: linear model (better)
◦ Right: decision tree
Bottom row: the true decision boundary is non-linear
◦ Left: linear model
◦ Right: decision tree (better)
Advantages and Disadvantages of Decision Trees

Advantages:
◦ Trees are very easy to explain to people (even easier than linear regression).
◦ Trees can be plotted graphically, and hence can be easily communicated even to a non-expert.
◦ They work well for both classification and regression problems.

Disadvantages:
◦ Trees generally do not have the same prediction accuracy as some of the more flexible approaches available in practice.
Assignment
Consider the case “HR Analytics at ScaleneWorks - Behavioural Modelling to
predict Renege.” Fit an appropriate classification tree model for the data set
provided with the case. Compare its performance with the logistic regression
model.
Reading Material
• James, G., Witten, D., Hastie, T. & Tibshirani, R. (2013). An Introduction to
Statistical Learning: with Applications in R. New York: Springer-Verlag. (web:
http://www-bcf.usc.edu/~gareth/ISL/).
✓ Chapter 8: Sub-sections 8.1.2, 8.1.3, 8.3.1.