Decision Tree - Notes

Uploaded by

Ritik chauhan

Decision Tree Introduction with example

• Decision tree algorithms fall under the category of supervised learning. They can be used to
solve both regression and classification problems.

• A decision tree uses a tree representation to solve the problem: each leaf node
corresponds to a class label, and attributes are tested at the internal nodes of the tree.

• We can represent any boolean function on discrete attributes using a decision tree.

Below are some assumptions that we make when using a decision tree:

• At the beginning, we consider the whole training set as the root.

• Feature values are preferred to be categorical. If the values are continuous, then they are
discretized prior to building the model.

• On the basis of attribute values, records are distributed recursively.

• We use statistical methods for ordering attributes as the root or an internal node.

As the above image shows, a decision tree works on the Sum of Products form, which is
also known as Disjunctive Normal Form. In the above image, we are predicting the use of computers
in people's daily lives.
In a decision tree, the major challenge is identifying the attribute to place at the root node of each
level. This process is known as attribute selection. We have two popular attribute selection
measures:

1. Information Gain

2. Gini Index

1. Information Gain
When we use a node in a decision tree to partition the training instances into smaller subsets, the
entropy changes. Information gain is a measure of this change in entropy.
Definition: Suppose S is a set of instances, A is an attribute, S_v is the subset of S with A = v, and
Values(A) is the set of all possible values of A, then

$$\mathrm{Gain}(S, A) = \mathrm{Entropy}(S) - \sum_{v \in \mathrm{Values}(A)} \frac{|S_v|}{|S|}\,\mathrm{Entropy}(S_v)$$

Entropy
Entropy is a measure of the uncertainty of a random variable; it characterizes the impurity of an
arbitrary collection of examples. The higher the entropy, the greater the information content.
Definition: Suppose S is a set of instances and p_i is the proportion of instances in S that belong
to class i, then

$$\mathrm{Entropy}(S) = -\sum_{i} p_i \log_2 p_i$$

Example:

For the set X = {a,a,a,b,b,b,b,b}

Total instances: 8

Instances of b: 5

Instances of a: 3

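$$\begin{aligned}
\mathrm{Entropy}(X) &= -\left[\tfrac{3}{8}\log_2\tfrac{3}{8} + \tfrac{5}{8}\log_2\tfrac{5}{8}\right] \\
&= -[0.375 \times (-1.415) + 0.625 \times (-0.678)] \\
&= -(-0.53 - 0.424) \\
&= 0.954
\end{aligned}$$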
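The arithmetic above can be checked with a few lines of Python. This is a minimal sketch; the function name `entropy` is an assumption for illustration, and it uses the base-2 logarithm as in the worked example.

```python
from collections import Counter
from math import log2

def entropy(labels):
    """Entropy (in bits) of a collection of class labels."""
    total = len(labels)
    return -sum((n / total) * log2(n / total) for n in Counter(labels).values())

X = list("aaabbbbb")          # three a's and five b's, as in the example
print(round(entropy(X), 3))   # 0.954
```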

Building Decision Tree using Information Gain

The essentials:

• Start with all training instances associated with the root node.

• Use information gain to choose which attribute to label each node with.

• Note: No root-to-leaf path should contain the same discrete attribute twice.

• Recursively construct each subtree on the subset of training instances that would be
classified down that path in the tree.

The border cases:

• If all positive or all negative training instances remain, label that node “yes” or “no”
accordingly.

• If no attributes remain, label with a majority vote of the training instances left at that node.

• If no instances remain, label with a majority vote of the parent’s training instances.
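The essentials and border cases above can be sketched as a short recursive function. This is an illustrative sketch, not a reference ID3 implementation: the function names, the dict-based rows, the `domains` argument (the possible values of each attribute) and the toy weather-style data are all assumptions made for the example.

```python
from collections import Counter
from math import log2

def entropy(labels):
    total = len(labels)
    return -sum((n / total) * log2(n / total) for n in Counter(labels).values())

def information_gain(rows, labels, attribute):
    """Entropy of the labels minus the weighted entropy after splitting."""
    remainder = 0.0
    for value in set(row[attribute] for row in rows):
        subset = [lab for row, lab in zip(rows, labels) if row[attribute] == value]
        remainder += len(subset) / len(rows) * entropy(subset)
    return entropy(labels) - remainder

def majority(labels):
    return Counter(labels).most_common(1)[0][0]

def build_tree(rows, labels, attributes, domains, parent_labels=None):
    if not rows:                       # no instances remain: parent's majority
        return majority(parent_labels)
    if len(set(labels)) == 1:          # all instances share one class
        return labels[0]
    if not attributes:                 # no attributes remain: majority vote
        return majority(labels)
    best = max(attributes, key=lambda a: information_gain(rows, labels, a))
    remaining = [a for a in attributes if a != best]   # never reuse on this path
    branches = {}
    for value in domains[best]:
        pairs = [(r, l) for r, l in zip(rows, labels) if r[best] == value]
        branches[value] = build_tree([r for r, _ in pairs], [l for _, l in pairs],
                                     remaining, domains, labels)
    return (best, branches)

# Hypothetical toy data; "cloudy" never occurs, which exercises the
# "no instances remain" case.
rows = [
    {"outlook": "sunny", "windy": "no"},
    {"outlook": "sunny", "windy": "yes"},
    {"outlook": "rainy", "windy": "no"},
    {"outlook": "rainy", "windy": "yes"},
]
labels = ["yes", "yes", "yes", "no"]
domains = {"outlook": ["sunny", "rainy", "cloudy"], "windy": ["no", "yes"]}
tree = build_tree(rows, labels, ["outlook", "windy"], domains)
print(tree)
```

Passing the attribute domains explicitly is one possible design choice: it lets the function create a branch for a value that has no training instances and label it with the parent's majority class.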

Example:
Now, let's draw a decision tree for the following data using information gain.

Training set: 3 features and 2 classes

X   Y   Z   C
1   1   1   I
1   1   0   I
0   0   1   II
1   0   0   II

Here, we have 3 features and 2 output classes.

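To build a decision tree using information gain, we take each feature in turn and calculate the
information gain of splitting on it. The entropy of the full training set is 1, since the two classes
are equally represented.

Split on feature X: X = 1 gives {I, I, II} and X = 0 gives {II}, so the gain is 1 - (3/4)(0.918) ≈ 0.311.

Split on feature Y: Y = 1 gives {I, I} and Y = 0 gives {II, II}; both subsets are pure, so the gain is 1.

Split on feature Z: Z = 1 gives {I, II} and Z = 0 gives {I, II}, so the gain is 0.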

From the above splits we can see that the information gain is maximum when we split on
feature Y. So, the best-suited feature for the root node is feature Y. We can also see that when
the dataset is split by feature Y, each child contains a pure subset of the target variable, so we don’t
need to split the dataset any further.
The final tree for the above dataset has feature Y at the root, with the branch Y = 1 leading to a
leaf labelled class I and the branch Y = 0 leading to a leaf labelled class II.
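The information gain of each candidate split can be recomputed from the four training rows. The sketch below is only a check of the example; the helper names `entropy` and `information_gain` and the tuple layout of the rows are assumptions made for illustration.

```python
from collections import Counter
from math import log2

# Training rows from the example: (X, Y, Z, class).
rows = [(1, 1, 1, "I"), (1, 1, 0, "I"), (0, 0, 1, "II"), (1, 0, 0, "II")]
features = {"X": 0, "Y": 1, "Z": 2}

def entropy(labels):
    total = len(labels)
    return -sum((n / total) * log2(n / total) for n in Counter(labels).values())

def information_gain(column):
    """Gain from splitting the rows on the feature stored in `column`."""
    labels = [row[3] for row in rows]
    remainder = 0.0
    for value in set(row[column] for row in rows):
        subset = [row[3] for row in rows if row[column] == value]
        remainder += len(subset) / len(rows) * entropy(subset)
    return entropy(labels) - remainder

gains = {name: information_gain(col) for name, col in features.items()}
for name, gain in gains.items():
    print(name, round(gain, 3))
print("best split:", max(gains, key=gains.get))
```

Under these assumptions the largest gain belongs to feature Y, which separates the two classes completely.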

2. Gini Index

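• The Gini index is a metric that measures how often a randomly chosen element would be
incorrectly identified if it were labelled at random according to the class distribution.

• It means an attribute with a lower Gini index should be preferred.

• Scikit-learn supports the “gini” criterion for the Gini index, and it is the default value of the
criterion parameter.

• The formula for the calculation of the Gini index is given below, where p_j is the proportion of
elements that belong to class j.

$$\mathrm{Gini} = 1 - \sum_{j} p_j^2$$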
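As a minimal sketch, the Gini index of a node can be computed directly from its class counts as one minus the sum of squared class proportions, and the index of a split as the size-weighted average over the child nodes. The function names `gini` and `weighted_gini` and the counts used in the calls are hypothetical, chosen only for illustration.

```python
def gini(counts):
    """Gini index of a node given the number of instances in each class."""
    total = sum(counts)
    return 1.0 - sum((c / total) ** 2 for c in counts)

def weighted_gini(groups):
    """Gini index of a split: each child's Gini weighted by its share of instances."""
    n = sum(sum(group) for group in groups)
    return sum(sum(group) / n * gini(group) for group in groups)

print(gini([5, 5]))                      # evenly mixed node
print(gini([10, 0]))                     # pure node
print(weighted_gini([[5, 5], [10, 0]]))  # split into one mixed and one pure child
```

A pure node scores 0 and an evenly mixed two-class node scores 0.5, which is why a lower value is preferred.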

Example:
Let's consider the dataset below and draw a decision tree using the Gini index.

Index   A     B     C     D     E
1       4.8   3.4   1.9   0.2   positive
2       5     3     1.6   1.2   positive
3       5     3.4   1.6   0.2   positive
4       5.2   3.5   1.5   0.2   positive
5       5.2   3.4   1.4   0.2   positive
6       4.7   3.2   1.6   0.2   positive
7       4.8   3.1   1.6   0.2   positive
8       5.4   3.4   1.5   0.4   positive
9       7     3.2   4.7   1.4   negative
10      6.4   3.2   4.7   1.5   negative
11      6.9   3.1   4.9   1.5   negative
12      5.5   2.3   4     1.3   negative
13      6.5   2.8   4.6   1.5   negative
14      5.7   2.8   4.5   1.3   negative
15      6.3   3.3   4.7   1.6   negative
16      4.9   2.4   3.3   1     negative

In the dataset above there are 5 attributes, of which attribute E is the target to be predicted; it
contains 2 classes (positive and negative). Both classes are present in equal proportion.
To use the Gini index, we have to choose a threshold value, here picked arbitrarily, to categorize each
continuous attribute. The values for this dataset are:

A       B        C        D
>= 5    >= 3.0   >= 4.2   >= 1.4
< 5     < 3.0    < 4.2    < 1.4

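Calculating Gini Index for Var A:

Value >= 5: 12

Attribute A >= 5 & class = positive: 5/12

Attribute A >= 5 & class = negative: 7/12

$$\mathrm{Gini}(5, 7) = 1 - \left[\left(\tfrac{5}{12}\right)^2 + \left(\tfrac{7}{12}\right)^2\right] \approx 0.486$$

Value < 5: 4

Attribute A < 5 & class = positive: 3/4

Attribute A < 5 & class = negative: 1/4

$$\mathrm{Gini}(3, 1) = 1 - \left[\left(\tfrac{3}{4}\right)^2 + \left(\tfrac{1}{4}\right)^2\right] = 0.375$$

Weighting each Gini index by its share of the instances and summing:

$$\mathrm{Gini}(A) = \tfrac{12}{16} \times 0.486 + \tfrac{4}{16} \times 0.375 \approx 0.458$$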

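Calculating Gini Index for Var B:

Value >= 3: 12

Attribute B >= 3 & class = positive: 8/12

Attribute B >= 3 & class = negative: 4/12

$$\mathrm{Gini}(8, 4) = 1 - \left[\left(\tfrac{8}{12}\right)^2 + \left(\tfrac{4}{12}\right)^2\right] \approx 0.444$$

Value < 3: 4

Attribute B < 3 & class = positive: 0/4

Attribute B < 3 & class = negative: 4/4

$$\mathrm{Gini}(0, 4) = 1 - \left[\left(\tfrac{0}{4}\right)^2 + \left(\tfrac{4}{4}\right)^2\right] = 0$$

Weighting each Gini index by its share of the instances and summing:

$$\mathrm{Gini}(B) = \tfrac{12}{16} \times 0.444 + \tfrac{4}{16} \times 0 \approx 0.333$$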

Using the same approach we can calculate the Gini index for the C and D attributes.

                 Positive   Negative
For A | >= 5.0   5          7
      | < 5.0    3          1

Gini Index of A = 0.458

                 Positive   Negative
For B | >= 3.0   8          4
      | < 3.0    0          4

Gini Index of B = 0.333

                 Positive   Negative
For C | >= 4.2   0          6
      | < 4.2    8          2

Gini Index of C = 0.2

                 Positive   Negative
For D | >= 1.4   0          5
      | < 1.4    8          3

Gini Index of D = 0.273

Attribute C has the lowest Gini index, so it is the preferred attribute for the first split.
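The four Gini indices can be recomputed from the dataset and the chosen thresholds. The sketch below is a check of the example only; the helper names `gini` and `split_gini` and the tuple layout of the rows are assumptions made for illustration.

```python
# Rows of the example dataset: (A, B, C, D, E).
data = [
    (4.8, 3.4, 1.9, 0.2, "positive"), (5.0, 3.0, 1.6, 1.2, "positive"),
    (5.0, 3.4, 1.6, 0.2, "positive"), (5.2, 3.5, 1.5, 0.2, "positive"),
    (5.2, 3.4, 1.4, 0.2, "positive"), (4.7, 3.2, 1.6, 0.2, "positive"),
    (4.8, 3.1, 1.6, 0.2, "positive"), (5.4, 3.4, 1.5, 0.4, "positive"),
    (7.0, 3.2, 4.7, 1.4, "negative"), (6.4, 3.2, 4.7, 1.5, "negative"),
    (6.9, 3.1, 4.9, 1.5, "negative"), (5.5, 2.3, 4.0, 1.3, "negative"),
    (6.5, 2.8, 4.6, 1.5, "negative"), (5.7, 2.8, 4.5, 1.3, "negative"),
    (6.3, 3.3, 4.7, 1.6, "negative"), (4.9, 2.4, 3.3, 1.0, "negative"),
]
columns = {"A": 0, "B": 1, "C": 2, "D": 3}
thresholds = {"A": 5.0, "B": 3.0, "C": 4.2, "D": 1.4}

def gini(labels):
    """Gini index of a list of class labels."""
    total = len(labels)
    return 1.0 - sum((labels.count(c) / total) ** 2 for c in set(labels))

def split_gini(name):
    """Weighted Gini index of splitting attribute `name` at its threshold."""
    col, t = columns[name], thresholds[name]
    high = [row[4] for row in data if row[col] >= t]
    low = [row[4] for row in data if row[col] < t]
    return sum(len(g) / len(data) * gini(g) for g in (high, low) if g)

scores = {name: split_gini(name) for name in thresholds}
for name, score in scores.items():
    print(name, round(score, 4))
best = min(scores, key=scores.get)
print("lowest Gini index:", best)
```

Because the code keeps full precision in the intermediate steps, its values can differ in the third or fourth decimal place from hand calculations that round each step.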

The most notable types of decision tree algorithms are:

1. Iterative Dichotomiser 3 (ID3): This algorithm uses information gain to decide which attribute is
to be used to classify the current subset of the data. For each level of the tree, information gain is
calculated recursively for the remaining data.

2. C4.5: This algorithm is the successor of the ID3 algorithm. It uses either information
gain or gain ratio to decide upon the classifying attribute. It is a direct improvement on the ID3
algorithm, as it can handle both continuous and missing attribute values.

3. Classification and Regression Tree (CART): It is a dynamic learning algorithm that can produce either a
regression tree or a classification tree, depending on whether the dependent variable is continuous
or categorical.

