Decision Trees
Decision trees are models that make predictions by splitting examples according to a sequence of questions about their features.
Here's an overview:
Cat Classification Example
Consider an example where you want to classify if an animal is a cat or not. You might use
features like ear shape, face shape, and presence of whiskers to make this classification.
Here's an image that shows the decision-making process:
A decision tree consists of:
Root Node: The initial decision point.
Decision Nodes: Intermediate nodes that split the data based on features.
Leaf Nodes: Final nodes that give the prediction.
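As a rough illustration (not from the course material), such a tree could be represented in Python as follows; the class and field names here are hypothetical:

```python
from dataclasses import dataclass
from typing import Optional

@dataclass
class TreeNode:
    feature: Optional[int] = None        # feature index this decision node splits on
    left: Optional["TreeNode"] = None    # branch for examples where the feature is 1 ("yes")
    right: Optional["TreeNode"] = None   # branch for examples where the feature is 0 ("no")
    prediction: Optional[int] = None     # class stored at a leaf (e.g. 1 = cat, 0 = not cat)

def classify(node: TreeNode, x) -> int:
    """Walk from the root node down to a leaf, following x's feature values."""
    while node.prediction is None:
        node = node.left if x[node.feature] == 1 else node.right
    return node.prediction

# Tiny tree: the root splits on feature 0 (say, ear shape); its children are leaves.
root = TreeNode(feature=0,
                left=TreeNode(prediction=1),
                right=TreeNode(prediction=0))
print(classify(root, [1]))  # -> 1 (classified as cat)
```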
⚙️ Decision Tree Learning Process
To build a decision tree, you need to learn how to make splits based on the features. For
example, you might start by splitting based on ear shape:
The image above shows how ear shape (pointy vs floppy) can be used to begin classifying
animal faces.
You continue splitting until you reach a point where you can confidently classify the data.
The image above provides a simple visual guide for distinguishing between cats and
other animals.
🤔 Key Decisions in Decision Tree Learning
There are two critical decisions to make when learning a decision tree:
1. How to choose what feature to split on at each node?
Maximize purity (or minimize impurity).
2. When do you stop splitting?
When a node is 100% one class.
When splitting a node will result in the tree exceeding a maximum depth.
When improvements in purity score are below a threshold.
When the number of examples in a node is below a threshold.
The image below shows a tree data structure with nodes labeled with "Depth" values,
ranging from 0 at the top to 2 at the bottom, illustrating a basic hierarchical structure:
📏 Measuring Purity: Entropy
Entropy is a measure of impurity. In the context of decision trees, it helps determine the
best splits to maximize information gain. Entropy is measured by the following formula:
H(p1) = -p1 log2(p1) - (1 - p1) log2(1 - p1)
Where p1 is the fraction of examples that are cats (using the convention 0 * log2(0) = 0).
The image below shows a graphical representation of the relationship
between p1 and H(p1).
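A minimal sketch of this computation in Python (assuming binary labels and the convention 0 * log2(0) = 0):

```python
import numpy as np

def entropy(p1):
    """Entropy H(p1) of a node where a fraction p1 of the examples are cats."""
    if p1 == 0 or p1 == 1:
        return 0.0  # a pure node has zero entropy
    return -p1 * np.log2(p1) - (1 - p1) * np.log2(1 - p1)

print(entropy(0.5))  # 1.0 -> maximally impure (half cats, half not cats)
print(entropy(1.0))  # 0.0 -> perfectly pure node
```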
📈 Choosing a Split: Information Gain
Information gain is used to determine the best feature to split on. It measures the reduction
in entropy (impurity) obtained by splitting a node on a given feature. The general formula
is:
Information Gain = H(p1_root) - ( n_left / (n_left + n_right) * H(p1_left) + n_right / (n_left + n_right) * H(p1_right) )
Where:
H(p1_root) is the entropy of the node being split.
H(p1_left) is the entropy of the left child node.
H(p1_right) is the entropy of the right child node.
n_left and n_right are the numbers of examples that go to the left and right branches.
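As a rough illustration, the formula can be computed with a small helper like the one below (the function names are illustrative, not from the course):

```python
import numpy as np

def entropy(p1):
    """Binary entropy H(p1), with the convention 0 * log2(0) = 0."""
    return 0.0 if p1 in (0, 1) else -p1 * np.log2(p1) - (1 - p1) * np.log2(1 - p1)

def information_gain(y_root, y_left, y_right):
    """Entropy at the parent minus the weighted entropy of the two children."""
    w_left = len(y_left) / len(y_root)
    w_right = len(y_right) / len(y_root)
    return (entropy(np.mean(y_root))
            - (w_left * entropy(np.mean(y_left)) + w_right * entropy(np.mean(y_right))))

# Example: splitting 10 examples (5 cats) into two branches of 5 examples each.
y_root = np.array([1, 1, 1, 1, 1, 0, 0, 0, 0, 0])
y_left, y_right = np.array([1, 1, 1, 1, 0]), np.array([1, 0, 0, 0, 0])
print(information_gain(y_root, y_left, y_right))  # about 0.28
```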
🧩 Putting It Together
Here’s the process for constructing a decision tree:
Start with all examples at the root node.
Calculate information gain for all possible features and pick the one with the highest
information gain.
Split the dataset according to the selected feature, and create left and right branches
of the tree.
Keep repeating the splitting process until stopping criteria are met:
When a node is 100% one class.
When splitting a node will result in the tree exceeding a maximum depth.
When information gain from additional splits is less than a threshold.
When the number of examples in a node is below a threshold.
The image below shows how face shape can be used to classify cats, with ear shape as a
secondary factor.
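As a rough sketch of this recursive procedure (assuming binary 0/1 features, and that the entropy and information_gain helpers sketched earlier are in scope; this is an illustration, not the course's reference implementation):

```python
import numpy as np

def build_tree(X, y, depth=0, max_depth=2, min_examples=1):
    """Recursively split on the feature with the highest information gain.

    X: (num_examples, num_features) array of 0/1 features; y: 0/1 labels.
    Returns a nested dict representing the tree.
    """
    # Stopping criteria: pure node, maximum depth reached, or too few examples.
    if len(np.unique(y)) == 1 or depth == max_depth or len(y) <= min_examples:
        return {"prediction": int(round(float(np.mean(y))))}

    def gain_for(f):
        left, right = y[X[:, f] == 1], y[X[:, f] == 0]
        if len(left) == 0 or len(right) == 0:
            return 0.0  # degenerate split carries no information
        return information_gain(y, left, right)

    gains = [gain_for(f) for f in range(X.shape[1])]
    best = int(np.argmax(gains))
    if gains[best] <= 0:  # no split improves purity
        return {"prediction": int(round(float(np.mean(y))))}

    mask = X[:, best] == 1
    return {"feature": best,
            "left": build_tree(X[mask], y[mask], depth + 1, max_depth, min_examples),
            "right": build_tree(X[~mask], y[~mask], depth + 1, max_depth, min_examples)}
```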
Using One-Hot Encoding for Categorical Features
When dealing with categorical features, it's common to use one-hot encoding.
One-Hot Encoding: If a categorical feature can take on n values, create n binary features (0
or 1 valued).
Here's an example of how to apply one-hot encoding to the ear shape feature:
| Ear shape | Pointy | Floppy | Oval |
|-----------|--------|--------|------|
| Pointy    | 1      | 0      | 0    |
| Floppy    | 0      | 1      | 0    |
| Oval      | 0      | 0      | 1    |
The image below shows a table with cat icons and features that have been one-hot encoded.
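In practice this is often done with pandas; a minimal sketch (the column name ear_shape and the sample values are illustrative):

```python
import pandas as pd

df = pd.DataFrame({"ear_shape": ["Pointy", "Floppy", "Oval", "Pointy"]})

# Each of the 3 categories becomes its own 0/1 column.
one_hot = pd.get_dummies(df, columns=["ear_shape"], dtype=int)
print(one_hot)
#    ear_shape_Floppy  ear_shape_Oval  ear_shape_Pointy
# 0                 0               0                 1
# 1                 1               0                 0
# 2                 0               1                 0
# 3                 0               0                 1
```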
⚖️ Continuous Valued Features
For continuous features, you can split the data based on a threshold. For example, if you
have a weight feature, you might split the data into examples where the weight is less than
or equal to a certain value, and examples where the weight is greater than that value. The
image below shows how a weight feature can be used to classify an animal.
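One common way to pick the threshold (a sketch that reuses the information_gain helper from the earlier section; the function name best_threshold is illustrative) is to try the midpoints between consecutive sorted values and keep the one with the highest information gain:

```python
import numpy as np

def best_threshold(values, y):
    """Return (threshold, gain) maximizing information gain for a continuous feature.

    Assumes the information_gain helper from the entropy section is in scope.
    """
    order = np.argsort(values)
    v, labels = np.asarray(values)[order], np.asarray(y)[order]
    best_t, best_gain = None, 0.0
    for i in range(len(v) - 1):
        t = (v[i] + v[i + 1]) / 2                 # candidate midpoint threshold
        left, right = labels[v <= t], labels[v > t]
        if len(left) == 0 or len(right) == 0:
            continue
        gain = information_gain(labels, left, right)
        if gain > best_gain:
            best_t, best_gain = t, gain
    return best_t, best_gain
```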
🌲 Regression Trees (Optional)
Decision trees can also be used for regression tasks, where the goal is to predict a
continuous value. In this case, the leaf nodes would contain the average value of the target
variable for the examples that fall into that node.
The image below shows how ear and face shape could be used to predict a dog's weight.
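For regression trees, splits are typically chosen to maximize the reduction in the variance of the target values rather than entropy, and each leaf predicts the mean target value of its examples. A rough sketch:

```python
import numpy as np

def variance_reduction(y_root, y_left, y_right):
    """Weighted decrease in target variance from a split (regression analogue of information gain)."""
    w_left = len(y_left) / len(y_root)
    w_right = len(y_right) / len(y_root)
    return np.var(y_root) - (w_left * np.var(y_left) + w_right * np.var(y_right))

# A leaf simply predicts the average target value of the examples it contains,
# e.g. the mean weight of the animals that ended up in that leaf.
weights_in_leaf = np.array([7.2, 8.4, 7.6])
leaf_prediction = weights_in_leaf.mean()
```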
🌳🌲🌴 Tree Ensembles
To improve the performance and robustness of decision trees, you can use tree ensembles.
These methods combine multiple decision trees to make predictions. The image below
shows the faces of a group of cats.
Tree ensembles can reduce overfitting and improve generalization performance.
Here's how a prediction might look using a tree ensemble:
Sampling with Replacement
Sampling with replacement involves selecting items from a dataset where each item, once
chosen, is returned to the pool, allowing it to be selected again.
Here's an image demonstrating the concept of sampling with replacement:
This image displays a grid of colored blocks, each representing an item that can be sampled.
The colors are randomly distributed, illustrating how, with replacement, the same item
(color) can be chosen multiple times in a sample.
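As a quick illustration in code (indices stand in for training examples):

```python
import numpy as np

rng = np.random.default_rng(seed=0)
dataset = np.arange(10)  # pretend these are the indices of 10 training examples

# Draw a new training set of the same size; because replace=True,
# some examples appear multiple times and others not at all.
bootstrap_sample = rng.choice(dataset, size=len(dataset), replace=True)
print(bootstrap_sample)
```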
🌲 Random Forest Algorithm
The random forest algorithm is an ensemble learning method that operates by constructing
multiple decision trees during training and outputting the mode of the classes (classification)
or mean prediction (regression) of the individual trees.
The process can be described as follows:
1. Given a training set of size m.
2. For b = 1 to B (the number of trees in the ensemble):
Use sampling with replacement to create a new training set of size m.
Train a decision tree on the new dataset.
Randomizing the Feature Choice
At each node, when choosing a feature to use to split:
1. If n features are available, pick a random subset of k < n features (a common choice is k ≈ √n when n is large).
2. Allow the algorithm to only choose from that subset of features.
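As a minimal sketch, scikit-learn's RandomForestClassifier implements both ideas (bootstrap sampling and random feature subsets); the parameter values below are illustrative, and X_train / y_train / X_test are assumed to be defined as in the XGBoost snippet later in these notes:

```python
from sklearn.ensemble import RandomForestClassifier

# n_estimators = B, the number of trees; max_features="sqrt" considers
# roughly sqrt(n) randomly chosen features at each split.
model = RandomForestClassifier(n_estimators=100, max_features="sqrt", random_state=0)
model.fit(X_train, y_train)
y_pred = model.predict(X_test)
```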
🚀 XGBoost (eXtreme Gradient Boosting)
XGBoost is an optimized distributed gradient boosting library designed to be
highly efficient, flexible and portable. It implements machine learning algorithms under the
Gradient Boosting framework. XGBoost provides parallel tree boosting (also known as a
Gradient Boosting Machine) that solves many data science problems in a fast and accurate
way.
Boosted Trees Intuition
1. Given a training set of size m.
2. For b = 1 to B (the number of trees in the ensemble):
Use sampling with replacement to create a new training set of size m. But
instead of picking from all examples with equal (1/m) probability, make it
more likely to pick examples that the previously trained trees misclassify.
Train a decision tree on the new dataset.
XGBoost Features:
Open-source implementation of boosted trees
Fast, efficient implementation
Good choice of default splitting criteria and criteria for when to stop splitting
Built-in regularization to prevent overfitting
Highly competitive algorithm for machine learning competitions (e.g., Kaggle
competitions)
This image shows a decision tree diagram for classifying animals as cats or not-cats based on
physical characteristics:
Here are the code snippets for using XGBoost in classification and regression tasks:
# Classification: predict a discrete label (e.g. cat vs. not cat)
from xgboost import XGBClassifier
model = XGBClassifier()
model.fit(X_train, y_train)     # fit the boosted trees on the training set
y_pred = model.predict(X_test)  # predicted class labels

# Regression: predict a continuous value (e.g. an animal's weight)
from xgboost import XGBRegressor
model = XGBRegressor()
model.fit(X_train, y_train)
y_pred = model.predict(X_test)  # predicted continuous values
🤔 When to Use Decision Trees
Decision Trees vs Neural Networks
| Feature | Decision Trees and Tree Ensembles | Neural Networks |
|---|---|---|
| Data type | Works well on tabular (structured) data | Works well on all types of data, including tabular (structured) and unstructured data |
| Data recommendation | Not recommended for unstructured data (images, audio, text) | |
| Speed | Fast to train | May be slower than a decision tree |
| Transfer learning | | Works with transfer learning |
| Interpretability | Small decision trees may be human interpretable | |
| System building | | When building a system of multiple models working together, it may be easier to string together multiple neural networks |