Entropy and Information Gain
Entropy
Entropy is a measure of disorder or uncertainty, and the goal of machine learning models, and of Data Scientists in general, is to reduce that uncertainty.
High vs. Low Entropy
“High Entropy”
X is drawn from a uniform-like distribution
Flat histogram
Values sampled from it are less predictable
“Low Entropy”
X is drawn from a varied (peaks and valleys) distribution
Histogram has distinct highs and lows
Values sampled from it are more predictable
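To make the contrast concrete, here is a minimal Python sketch; the probability vectors below are made-up illustrations, not data from the lecture. It computes the Shannon entropy of a flat histogram versus a peaked one.

import math

def shannon_entropy(probs):
    # Shannon entropy H(X) = -sum(p * log2(p)) over the non-zero probabilities
    return -sum(p * math.log2(p) for p in probs if p > 0)

# Flat (uniform-like) histogram: every outcome equally likely -> high entropy
uniform = [0.25, 0.25, 0.25, 0.25]
# Peaked histogram: one outcome dominates -> low entropy
peaked = [0.85, 0.05, 0.05, 0.05]

print(shannon_entropy(uniform))  # 2.0 bits, the maximum for 4 outcomes
print(shannon_entropy(peaked))   # about 0.85 bits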
Decision Tree Classification
To build a decision tree, we need to calculate two types of entropy using frequency tables, as follows:
a) Entropy using the frequency table of one attribute:
H(S) = -Σ p(c) log2 p(c), summed over the target classes c
b) Entropy using the frequency table of two attributes:
H(S|A) = Σ P(v) H(S_v), summed over the values v of attribute A
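Both table types can be evaluated with a few lines of Python. The sketch below is illustrative only; the counts are loosely based on the classic play-tennis data and are an assumption, not the lecture's tables.

import math

def entropy_from_counts(counts):
    # Entropy from a one-attribute frequency table, e.g. {class: count}
    total = sum(counts.values())
    return -sum((n / total) * math.log2(n / total) for n in counts.values() if n > 0)

def conditional_entropy(table):
    # Weighted branch entropy H(S|A) from a two-attribute frequency table:
    # {attribute value: {class: count}}
    grand_total = sum(sum(branch.values()) for branch in table.values())
    return sum(
        (sum(branch.values()) / grand_total) * entropy_from_counts(branch)
        for branch in table.values()
    )

# One attribute: class counts of the target only (9 Yes / 5 No)
print(entropy_from_counts({"Yes": 9, "No": 5}))   # about 0.940

# Two attributes: target counts split by the values of one attribute
table = {
    "Sunny":    {"Yes": 2, "No": 3},
    "Overcast": {"Yes": 4, "No": 0},
    "Rain":     {"Yes": 3, "No": 2},
}
print(conditional_entropy(table))                 # about 0.69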
Information Gain
Information gain is based on the decrease in entropy after a dataset is split on an attribute. Constructing a decision tree is all about finding the attribute that returns the highest information gain (i.e., the most homogeneous branches).
Information gain
Step 1: Calculate the entropy of the target.
Information gain cont…
Step 2:
The dataset is then split on each of the attributes. The entropy of each branch is calculated.
The branch entropies are then added proportionally (weighted by branch size) to get the total entropy for the split.
The resulting entropy is subtracted from the entropy before the split.
The result is the Information Gain, or decrease in entropy: IG(S, A) = H(S) - H(S|A).
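Steps 1 and 2 fit naturally into two small functions. A hedged Python sketch follows; the rows below are a made-up mini dataset, not the lecture's table.

import math
from collections import Counter

def entropy(labels):
    # Step 1: entropy of a list of class labels
    total = len(labels)
    return -sum((n / total) * math.log2(n / total) for n in Counter(labels).values())

def information_gain(rows, attribute, target):
    # Step 2: split on the attribute, weight each branch's entropy by its size,
    # then subtract the weighted total from the entropy before the split
    before = entropy([r[target] for r in rows])
    after = 0.0
    for value in {r[attribute] for r in rows}:
        branch = [r[target] for r in rows if r[attribute] == value]
        after += (len(branch) / len(rows)) * entropy(branch)
    return before - after

# Hypothetical mini dataset: a perfectly clean split on Wind
rows = [
    {"Wind": "Weak",   "Play": "Yes"},
    {"Wind": "Weak",   "Play": "Yes"},
    {"Wind": "Weak",   "Play": "Yes"},
    {"Wind": "Strong", "Play": "No"},
    {"Wind": "Strong", "Play": "No"},
]
print(information_gain(rows, "Wind", "Play"))  # about 0.971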
Information gain cont…
Step 3: Choose the attribute with the largest information gain as the decision node, divide the dataset by its branches, and repeat the same process on every branch.
Information gain cont…
Step 4a: A branch with an entropy of 0 is a leaf node.
Information gain cont…
Step 4b: A branch with an entropy greater than 0 needs further splitting.
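Steps 3, 4a, and 4b together describe a recursion. Below is a compact ID3-style sketch of that recursion; it assumes the entropy and information_gain helpers from the earlier snippet are in scope, and it is an illustration rather than the lecture's code.

from collections import Counter

def build_tree(rows, attributes, target):
    # Recursive construction following Steps 3, 4a and 4b
    # (assumes information_gain(rows, attribute, target) from the previous sketch)
    labels = [r[target] for r in rows]
    # Step 4a: entropy 0 (only one class left) or no attributes left -> leaf node
    if len(set(labels)) == 1 or not attributes:
        return Counter(labels).most_common(1)[0][0]
    # Step 3: choose the attribute with the largest information gain as the decision node
    best = max(attributes, key=lambda a: information_gain(rows, a, target))
    tree = {best: {}}
    remaining = [a for a in attributes if a != best]
    # Step 4b: each branch with entropy > 0 is split further by the recursive call
    for value in {r[best] for r in rows}:
        branch_rows = [r for r in rows if r[best] == value]
        tree[best][value] = build_tree(branch_rows, remaining, target)
    return tree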
Information gain cont…
A decision tree can easily be transformed into a set of rules by mapping from the root node to the leaf nodes one by one.
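One way to read such rules off a nested-dict tree like the one built above (again a sketch under the same assumptions, not the lecture's code):

def tree_to_rules(tree, conditions=()):
    # Walk from the root to every leaf, emitting one IF ... THEN ... rule per path
    if not isinstance(tree, dict):  # a leaf holds the predicted class
        clause = " AND ".join(f"{a} = {v}" for a, v in conditions)
        return [f"IF {clause} THEN {tree}"]
    rules = []
    for attribute, branches in tree.items():
        for value, subtree in branches.items():
            rules.extend(tree_to_rules(subtree, conditions + ((attribute, value),)))
    return rules

# Hypothetical fragment of a tennis-style tree
example_tree = {"Outlook": {"Overcast": "Yes",
                            "Rain": {"Wind": {"Weak": "Yes", "Strong": "No"}}}}
for rule in tree_to_rules(example_tree):
    print(rule)
# IF Outlook = Overcast THEN Yes
# IF Outlook = Rain AND Wind = Weak THEN Yes
# IF Outlook = Rain AND Wind = Strong THEN No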
Decision Trees
When do I play tennis?
Decision Tree
Is the decision tree correct?
Let’s check whether the split on the Wind attribute is correct.
We need to show that the Wind attribute has the highest information gain.
When do I play tennis?
Wind attribute – 5 records match
Note: calculate the entropy only on the examples that got “routed” into our branch of the tree (Outlook = Rain).
Calculation
Let S = {D4, D5, D6, D10, D14}
Entropy:
H(S) = -3/5 log2(3/5) - 2/5 log2(2/5) = 0.971
Information Gain:
IG(S, Temp) = H(S) - H(S|Temp) = 0.01997
IG(S, Humidity) = H(S) - H(S|Humidity) = 0.01997
IG(S, Wind) = H(S) - H(S|Wind) = 0.971
Wind has the highest information gain, so splitting on Wind at this node is indeed correct.
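These numbers can be verified numerically. The sketch below uses the five Outlook = Rain records of the classic play-tennis dataset; the attribute values are taken from that standard dataset as an assumption, since the slides' data table is not reproduced here.

import math
from collections import Counter

def entropy(labels):
    total = len(labels)
    return -sum((n / total) * math.log2(n / total) for n in Counter(labels).values())

def information_gain(rows, attribute):
    before = entropy([r["Play"] for r in rows])
    after = 0.0
    for value in {r[attribute] for r in rows}:
        branch = [r["Play"] for r in rows if r[attribute] == value]
        after += (len(branch) / len(rows)) * entropy(branch)
    return before - after

# S = {D4, D5, D6, D10, D14}: the Outlook = Rain subset
S = [
    {"Temp": "Mild", "Humidity": "High",   "Wind": "Weak",   "Play": "Yes"},  # D4
    {"Temp": "Cool", "Humidity": "Normal", "Wind": "Weak",   "Play": "Yes"},  # D5
    {"Temp": "Cool", "Humidity": "Normal", "Wind": "Strong", "Play": "No"},   # D6
    {"Temp": "Mild", "Humidity": "Normal", "Wind": "Weak",   "Play": "Yes"},  # D10
    {"Temp": "Mild", "Humidity": "High",   "Wind": "Strong", "Play": "No"},   # D14
]

print(round(entropy([r["Play"] for r in S]), 5))   # 0.97095, i.e. 0.971
print(round(information_gain(S, "Temp"), 5))       # 0.01997
print(round(information_gain(S, "Humidity"), 5))   # 0.01997
print(round(information_gain(S, "Wind"), 5))       # 0.97095, i.e. 0.971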
Assignment #01
Imagine your own example for classification
Everyone should have a different example
What will be the root node?
Make rules after finalizing the decision tree.
Calculate entropy and IG
Note:
23rd Feb 2021 is the last date to submit.
No handwritten assignment will be accepted.
Copied assignment will be graded “0”.
No late submission will be accepted.