
Lab Session - 5: Create a Decision Tree using the ID3 Algorithm

A Decision Tree is a powerful and popular machine learning algorithm used for both classification and regression tasks. It is a graphical representation of a series of decisions and their possible outcomes, making it easy to understand and interpret. The ID3 (Iterative Dichotomiser 3) algorithm is one of the earliest and most widely used algorithms for creating decision trees from a given dataset.
What is a Decision Tree?
A Decision Tree is a popular machine learning algorithm used for both
classification and regression tasks. It is a tree-like structure that represents a
series of decisions and their possible outcomes. Each internal node of the tree
corresponds to a feature or attribute, each branch represents a decision based
on that attribute, and each leaf node represents the final outcome or class
label. Decision Trees are interpretable and easy to understand, making them
useful for both analysis and prediction.
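
For intuition, the Python program later in this lab represents such a tree as a nested dictionary: internal nodes are attribute names, branches are attribute values, and leaves are class labels. A minimal hand-written sketch of that representation (the tree shown here is the one derived by hand in the solved example below):

toy_tree = {
    'Weather': {
        'Overcast': 'Yes',                                        # leaf node: class label
        'Sunny': {'Humidity': {'High': 'No', 'Normal': 'Yes'}},   # internal node: split on Humidity
        'Rainy': {'Windy': {False: 'Yes', True: 'No'}}            # internal node: split on Windy
    }
}

# Classifying an instance means following one path from the root to a leaf
print(toy_tree['Weather']['Sunny']['Humidity']['Normal'])  # -> 'Yes'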

What is the ID3 Algorithm?


The ID3 (Iterative Dichotomiser 3) algorithm is one of the earliest and most
widely used algorithms to create Decision Trees from a given dataset. It uses
the concept of entropy and information gain to select the best attribute for
splitting the data at each node. Entropy measures the uncertainty or
randomness in the data, and information gain quantifies the reduction in
uncertainty achieved by splitting the data on a particular attribute. The ID3
algorithm recursively splits the dataset based on the attributes with the highest
information gain until a stopping criterion is met, resulting in a Decision Tree
that can be used for classification tasks.
Understanding the ID3 Algorithm:
The ID3 algorithm uses the concept of entropy and information gain to
construct a decision tree. Entropy measures the amount of uncertainty or
randomness in a dataset, while information gain quantifies the reduction in
entropy achieved by splitting the data on a specific attribute. The attribute with
the highest information gain is selected as the decision node for the tree.

Steps to Create a Decision Tree using the ID3 Algorithm:


Step 1: Data Preprocessing:​
Clean and preprocess the data. Handle missing values and convert categorical
variables into numerical representations if needed.
Step 2: Calculating Entropy:
Calculate the entropy of the target variable (class labels) for the dataset. The formula for entropy is:
Entropy(S) = -Σ (p_i * log2(p_i))
where p_i is the proportion of instances belonging to class i.
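To make the formula concrete, here is a small sketch of an entropy helper (essentially the same entropy function used in the full program later in this lab); for a split of 9 positive and 5 negative instances it returns roughly 0.940, the value used in the solved example below:

import numpy as np

def entropy(target_col):
    # Entropy(S) = -sum(p_i * log2(p_i)) over the classes in target_col
    _, counts = np.unique(target_col, return_counts=True)
    probs = counts / counts.sum()
    return -np.sum(probs * np.log2(probs))

labels = ['Yes'] * 9 + ['No'] * 5   # 9 positive, 5 negative instances
print(round(entropy(labels), 3))    # ≈ 0.940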
Step 3: Calculating Information Gain:​
For each attribute in the dataset, calculate the information gain when the
dataset is split on that attribute. The formula for information gain is:​
Information Gain(S, A) = Entropy(S) - Σ ((|S_v| / |S|) * Entropy(S_v))​
where S_v is the subset of instances for each possible value of attribute A,
and |S_v| is the number of instances in that subset.
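Reusing the entropy helper sketched above, information gain can be computed with a few lines of pandas; this is a minimal sketch that mirrors the information_gain function in the full program below:

import pandas as pd

def information_gain(data, split_attribute, target):
    # Gain(S, A) = Entropy(S) - sum(|S_v| / |S| * Entropy(S_v)) over the values v of A
    total_entropy = entropy(data[target])
    weighted_entropy = 0.0
    for value, subset in data.groupby(split_attribute):
        weighted_entropy += (len(subset) / len(data)) * entropy(subset[target])
    return total_entropy - weighted_entropy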
Step 4: Selecting the Best Attribute:​
Choose the attribute with the highest information gain as the decision node for
the tree.
Step 5: Splitting the Dataset:​
Split the dataset based on the values of the selected attribute.
Step 6: Repeat the Process:​
Recursively repeat steps 2 to 5 for each subset until a stopping criterion is met
(e.g., the tree depth reaches a maximum limit or all instances in a subset
belong to the same class).
Solved Example:

Weather    Temperature    Humidity    Windy    Play Tennis
Sunny      Hot            High        False    No
Sunny      Hot            High        True     No
Overcast   Hot            High        False    Yes
Rainy      Mild           High        False    Yes
Rainy      Cool           Normal      False    Yes
Rainy      Cool           Normal      True     No
Overcast   Cool           Normal      True     Yes
Sunny      Mild           High        False    No
Sunny      Cool           Normal      False    Yes
Rainy      Mild           Normal      False    Yes
Sunny      Mild           Normal      True     Yes
Overcast   Mild           High        True     Yes
Overcast   Hot            Normal      False    Yes
Rainy      Mild           High        True     No

Step 1: Data Preprocessing:
The dataset does not require any preprocessing, as it is already in a suitable format.

Step 2: Calculating Entropy:
To calculate entropy, we first determine the proportion of positive and negative instances in the dataset:
● Positive instances (Play Tennis = Yes): 9
● Negative instances (Play Tennis = No): 5

Entropy(S) = -(9/14) * log2(9/14) - (5/14) * log2(5/14) ≈ 0.940
Step 3: Calculating Information Gain:​
We calculate the information gain for each attribute (Weather, Temperature,
Humidity, Windy) and choose the attribute with the highest information gain as
the root node.

Information Gain(S, Weather) = Entropy(S) - [(5/14) * Entropy(Sunny) + (4/14) * Entropy(Overcast) + (5/14) * Entropy(Rainy)] ≈ 0.246
Information Gain(S, Temperature) = Entropy(S) - [(4/14) * Entropy(Hot) + (6/14) * Entropy(Mild) + (4/14) * Entropy(Cool)] ≈ 0.029
Information Gain(S, Humidity) = Entropy(S) - [(7/14) * Entropy(High) + (7/14) * Entropy(Normal)] ≈ 0.152
Information Gain(S, Windy) = Entropy(S) - [(8/14) * Entropy(False) + (6/14) * Entropy(True)] ≈ 0.048
Step 4: Selecting the Best Attribute:​
The “Weather” attribute has the highest information gain, so we select it as the
root node for our decision tree.
Step 5: Splitting the Dataset:​
We split the dataset based on the values of the “Weather” attribute into three
subsets (Sunny, Overcast, Rainy).
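In pandas, this split is a one-line groupby over the chosen attribute (a sketch, assuming the DataFrame df built in the Python code further below):

# Partition the dataset into one subset per Weather value: Sunny, Overcast, Rainy
subsets = {value: subset for value, subset in df.groupby('Weather')}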

Step 6: Repeat the Process:
We repeat the entropy and information gain calculations within each subset. The Overcast subset is pure (every instance is Yes), so it becomes a leaf. The Sunny subset is split further on Humidity (High → No, Normal → Yes) and the Rainy subset on Windy (False → Yes, True → No), after which every subset is pure and the recursion stops. The resulting decision tree has Weather at the root, an Overcast leaf labelled Yes, a Humidity split under Sunny, and a Windy split under Rainy.
Python code for creating a decision tree using the ID3 algorithm:
import pandas as pd
import numpy as np
import random

# Define the dataset
data = {
    'Weather': ['Sunny', 'Sunny', 'Overcast', 'Rainy', 'Rainy', 'Rainy', 'Overcast',
                'Sunny', 'Sunny', 'Rainy', 'Sunny', 'Overcast', 'Overcast', 'Rainy'],
    'Temperature': ['Hot', 'Hot', 'Hot', 'Mild', 'Cool', 'Cool', 'Cool', 'Mild', 'Cool',
                    'Mild', 'Mild', 'Mild', 'Hot', 'Mild'],
    'Humidity': ['High', 'High', 'High', 'High', 'Normal', 'Normal', 'Normal', 'High',
                 'Normal', 'Normal', 'Normal', 'High', 'Normal', 'High'],
    'Windy': [False, True, False, False, False, True, True, False, False, False, True,
              True, False, True],
    'Play Tennis': ['No', 'No', 'Yes', 'Yes', 'Yes', 'No', 'Yes', 'No', 'Yes', 'Yes', 'Yes',
                    'Yes', 'Yes', 'No']
}

df = pd.DataFrame(data)

def entropy(target_col):
    # Entropy(S) = -sum(p_i * log2(p_i)) over the classes in the target column
    elements, counts = np.unique(target_col, return_counts=True)
    entropy_val = -np.sum([(counts[i] / np.sum(counts)) * np.log2(counts[i] / np.sum(counts))
                           for i in range(len(elements))])
    return entropy_val

def information_gain(data, split_attribute_name, target_name):
    # Gain(S, A) = Entropy(S) - sum(|S_v| / |S| * Entropy(S_v)) over the values v of A
    total_entropy = entropy(data[target_name])
    vals, counts = np.unique(data[split_attribute_name], return_counts=True)
    weighted_entropy = np.sum([
        (counts[i] / np.sum(counts)) *
        entropy(data.where(data[split_attribute_name] == vals[i]).dropna()[target_name])
        for i in range(len(vals))])
    information_gain_val = total_entropy - weighted_entropy
    return information_gain_val

def id3_algorithm(data, original_data, features, target_attribute_name, parent_node_class):
    # Base cases
    if len(np.unique(data[target_attribute_name])) <= 1:
        # All remaining instances share one class: return that class label
        return np.unique(data[target_attribute_name])[0]
    elif len(data) == 0:
        # Empty subset: return the majority class of the original dataset
        return np.unique(original_data[target_attribute_name])[
            np.argmax(np.unique(original_data[target_attribute_name], return_counts=True)[1])]
    elif len(features) == 0:
        # No features left to split on: return the parent node's majority class
        return parent_node_class
    else:
        # Majority class of the current subset, passed down as the parent class
        parent_node_class = np.unique(data[target_attribute_name])[
            np.argmax(np.unique(data[target_attribute_name], return_counts=True)[1])]
        # Choose the feature with the highest information gain
        item_values = [information_gain(data, feature, target_attribute_name)
                       for feature in features]
        best_feature_index = np.argmax(item_values)
        best_feature = features[best_feature_index]
        tree = {best_feature: {}}
        features = [i for i in features if i != best_feature]
        # Grow one subtree per value of the best feature
        for value in np.unique(data[best_feature]):
            sub_data = data.where(data[best_feature] == value).dropna()
            subtree = id3_algorithm(sub_data, data, features,
                                    target_attribute_name, parent_node_class)
            tree[best_feature][value] = subtree
        return tree

def predict(query, tree, default='Yes'):
    # Follow the attribute values of the query down the tree until a leaf is reached
    for key in list(query.keys()):
        if key in list(tree.keys()):
            try:
                result = tree[key][query[key]]
            except KeyError:
                # The tree has no branch for this attribute value
                return default
            if isinstance(result, dict):
                return predict(query, result, default)
            else:
                return result
    return default

def train_test_split(df, test_size):
    if isinstance(test_size, float):
        test_size = round(test_size * len(df))
    indices = df.index.tolist()
    test_indices = random.sample(population=indices, k=test_size)
    test_df = df.loc[test_indices]
    train_df = df.drop(test_indices)
    return train_df, test_df

# Note: no random seed is set, so the split (and the resulting tree and accuracy) varies per run
train_data, test_data = train_test_split(df, test_size=0.2)

def fit(df, target_attribute_name, features):
    return id3_algorithm(df, df, features, target_attribute_name, None)

def get_accuracy(df, tree):
    df["classification"] = df.apply(predict, axis=1, args=(tree, 'Yes'))
    df["classification_correct"] = df["classification"] == df["Play Tennis"]
    accuracy = df["classification_correct"].mean()
    return accuracy

tree = fit(train_data, 'Play Tennis', ['Weather', 'Temperature', 'Humidity', 'Windy'])
accuracy = get_accuracy(test_data, tree)
print("Decision Tree:")
print(tree)
print("Accuracy:", accuracy)
Output:
Decision Tree: {'Weather': {'Overcast': 'Yes', 'Rainy': {'Windy': {False: 'Yes', True:
'No'}}, 'Sunny': {'Temperature': {'Cool': 'Yes', 'Hot': 'No', 'Mild': 'No'}}}}
Accuracy: 0.6666666666666666
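
Because no random seed is set, the exact tree and accuracy will vary between runs. As a quick usage sketch, the fitted tree and the helper functions above can also be used to classify a single new day and to check the hand-calculated gains from Step 3 (the query values below are hypothetical):

# A hypothetical new day, using the same categorical values as the dataset
query = {'Weather': 'Sunny', 'Temperature': 'Cool', 'Humidity': 'High', 'Windy': True}
print(predict(query, tree, 'Yes'))

# Information gains on the full dataset; these should approximately match Step 3
for attribute in ['Weather', 'Temperature', 'Humidity', 'Windy']:
    print(attribute, round(information_gain(df, attribute, 'Play Tennis'), 3))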

Limitations of the ID3 Algorithm and Decision Trees:


While decision trees and the ID3 algorithm offer several advantages, they also
have some limitations that need to be considered before using them in certain
scenarios:
1.​ Overfitting: Decision trees are prone to overfitting, especially when the
tree becomes too deep or complex. Overfitting occurs when the tree
captures noise or random fluctuations in the training data, leading to
poor performance on unseen data.
2.​ Instability: Small changes in the data can lead to different tree
structures, making decision trees less stable. A small variation in the data
might cause a split at a different attribute or threshold, potentially
affecting the entire tree.
3.​ Inability to Capture Linear Relationships: Decision trees are not
well-suited to capture linear relationships between variables. They
partition the data into distinct regions, making it challenging to represent
linear patterns.
4.​ Bias towards Features with More Levels: Attributes with more levels or
categories tend to have higher information gain simply due to having
more possible splits. This can bias the decision tree towards such
attributes, even if they might not be the most informative.
5.​ Lack of Robustness to Noise: Decision trees can be sensitive to noisy
data, as they might create splits based on noise or outliers that do not
generalize well to new data.
6. Difficulty in Handling Continuous Variables: The ID3 algorithm and basic decision trees are designed to handle categorical variables. Continuous variables must be preprocessed into discrete intervals (a simple binning sketch follows this list), or other algorithms such as CART (Classification and Regression Trees) must be used.
7.​ Exponential Growth of Tree Size: Decision trees can grow rapidly,
especially when dealing with large datasets or a high number of features.
This can lead to complex trees that are difficult to interpret.
8.​ Limited Expressiveness: While decision trees can represent simple
decision boundaries, they might struggle to capture complex
relationships in the data.
9.​ Difficulty in Handling Missing Values: The ID3 algorithm does not handle
missing values well. Imputation methods or other algorithms need to be
used to handle missing data.
10. Class Imbalance: Decision trees can have difficulty dealing with
imbalanced class distributions in the data, potentially favoring the
majority class and performing poorly on the minority class.
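To illustrate the discretization mentioned in point 6, a continuous column can be binned into categorical intervals before running ID3. A minimal sketch using pandas, with hypothetical numeric temperatures standing in for a continuous attribute (the lab dataset above is already categorical):

import pandas as pd

# Hypothetical numeric temperatures standing in for a continuous attribute
df_numeric = pd.DataFrame({'Temperature': [30, 32, 18, 22, 15, 12, 28, 25, 10, 20]})

# Bin the continuous values into the three categories used in the lab dataset
df_numeric['Temperature'] = pd.cut(
    df_numeric['Temperature'],
    bins=[-float('inf'), 15, 25, float('inf')],
    labels=['Cool', 'Mild', 'Hot']
)
print(df_numeric['Temperature'].value_counts())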
Despite these limitations, decision trees and the ID3 algorithm remain widely
used and can still be effective in various scenarios, especially when combined
with techniques like pruning, ensemble methods (e.g., Random Forests,
Gradient Boosting), or when used as part of more sophisticated machine
learning pipelines. Understanding the strengths and weaknesses of decision
trees helps data scientists and machine learning practitioners make informed
decisions about their use in different applications.
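
As one example of such a technique, a library implementation like scikit-learn's DecisionTreeClassifier exposes pruning-style controls such as max_depth and min_samples_leaf. A minimal sketch, assuming scikit-learn is installed and reusing the DataFrame df from the lab code above (scikit-learn is not part of this lab's ID3 implementation):

from sklearn.tree import DecisionTreeClassifier
import pandas as pd

# One-hot encode the categorical attributes so scikit-learn can use them
X = pd.get_dummies(df[['Weather', 'Temperature', 'Humidity', 'Windy']])
y = df['Play Tennis']

# Limiting the depth is a simple way to control overfitting
clf = DecisionTreeClassifier(criterion='entropy', max_depth=2, random_state=0)
clf.fit(X, y)
print(clf.score(X, y))  # training accuracy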
