3_ID3_algorithm_updated
January 20, 2025
1 Experiment 3
2 Write a program to demonstrate the working of a decision
tree based on the ID3 algorithm. Use an appropriate data set for
building the decision tree, and apply this knowledge to classify
a new sample.
import pandas as pd
import numpy as np
# Load the classic "play tennis" dataset; expects tennis.csv in the
# working directory with columns Outlook, Temperature, Humidity,
# Windy and the class label PlayTennis (see the printed table below).
df_tennis = pd.read_csv("tennis.csv")
df_tennis
[10]: Outlook Temperature Humidity Windy PlayTennis
0 Sunny Hot High Weak No
1 Sunny Hot High Strong No
2 Overcast Hot High Weak Yes
3 Rainy Mild High Weak Yes
4 Rainy Cool Normal Weak Yes
5 Rainy Cool Normal Strong No
6 Overcast Cool Normal Strong Yes
7 Sunny Mild High Weak No
8 Sunny Cool Normal Weak Yes
9 Rainy Mild Normal Weak Yes
10 Sunny Mild Normal Strong Yes
11 Overcast Mild High Strong Yes
12 Overcast Hot Normal Weak Yes
13 Rainy Mild High Strong No
from collections import Counter


def entropy_list(a_list):
    """Return the Shannon entropy (in bits) of the label distribution in *a_list*.

    Counts how often each distinct value occurs, converts the counts to
    probabilities, and delegates the actual entropy computation to
    ``entropy`` (defined in the next cell).
    """
    counts = Counter(a_list)
    total = float(len(a_list))
    probabilities = [c / total for c in counts.values()]
    return entropy(probabilities)
1
import math


def entropy(probs):
    """Return the Shannon entropy, in bits, of a probability distribution.

    Parameters
    ----------
    probs : iterable of float
        Probabilities of each outcome (should sum to 1).

    Returns
    -------
    float
        ``sum(-p * log2(p))`` over all probabilities. Zero probabilities
        are skipped, since lim(p->0) p*log(p) = 0 — the original version
        raised ``ValueError: math domain error`` on a 0 entry.
    """
    return sum(-p * math.log(p, 2) for p in probs if p > 0)
def info_gain(df, split, target, trace=0):
    """Information gain of splitting *df* on column *split* w.r.t. *target*.

    Computed as H(target) minus the weighted average entropy of the
    target within each group produced by grouping on *split*. The
    ``trace`` parameter is accepted for interface compatibility but is
    unused (as in the original).
    """
    total_rows = float(len(df.index))

    # Expected entropy after the split: each group's entropy weighted
    # by the fraction of rows that fall into that group.
    remainder = 0.0
    for _, subset in df.groupby(split):
        weight = len(subset) / total_rows
        remainder += weight * entropy_list(subset[target])

    # Gain = entropy before the split - expected entropy after it.
    return entropy_list(df[target]) - remainder
def id3(df, target, attribute_name, default_class=None):
    """Build an ID3 decision tree from *df*.

    Parameters
    ----------
    df : pandas.DataFrame
        Training instances.
    target : str
        Name of the class-label column.
    attribute_name : list of str
        Candidate attributes still available for splitting.
    default_class : optional
        Label to return for an empty subset / exhausted attributes.

    Returns
    -------
    Either a class label (leaf) or a nested dict of the form
    ``{attribute: {value: subtree_or_label, ...}}``.
    """
    cnt = Counter(df[target])
    if len(cnt) == 1:
        # Pure node: every instance has the same label.
        return next(iter(cnt))
    elif df.empty or (not attribute_name):
        return default_class
    else:
        # Majority class of this node becomes the fallback label.
        # Bug fix: the original used max(cnt.keys()), i.e. the
        # lexicographically greatest label, not the most frequent one.
        default_class = cnt.most_common(1)[0][0]

        # Split on the attribute with the highest information gain.
        gains = [info_gain(df, attr, target) for attr in attribute_name]
        best_attr = attribute_name[gains.index(max(gains))]

        tree = {best_attr: {}}
        remaining_attr = [a for a in attribute_name if a != best_attr]
        for attr_val, data_subset in df.groupby(best_attr):
            tree[best_attr][attr_val] = id3(
                data_subset, target, remaining_attr, default_class
            )
        return tree
def classify(instance, tree, default=None):
    """Classify one *instance* by walking a decision *tree*.

    Parameters
    ----------
    instance : mapping (e.g. a dict or pandas Series)
        Maps attribute names to the instance's values.
    tree : dict or label
        Tree of the form ``{attribute: {value: subtree_or_label}}``.
    default : optional
        Returned when the instance's attribute value has no branch
        in the tree (an unseen value).

    Returns
    -------
    The predicted class label, or *default* for an unmatched branch.
    """
    attribute = next(iter(tree))  # root attribute of this (sub)tree
    branches = tree[attribute]
    if instance[attribute] in branches:
        result = branches[instance[attribute]]
        if isinstance(result, dict):
            # Bug fix: propagate *default* into the recursion; the
            # original dropped it, so unseen values deep in the tree
            # returned None instead of the caller's default.
            return classify(instance, result, default)
        return result
    return default
# Every column except the class label is a candidate split attribute.
attribute_names = list(df_tennis.columns)
attribute_names.remove('PlayTennis')

# Hold out the last 4 rows for testing and train on the rest.
# Fixes vs. the original: iloc[1:-4] silently dropped row 0 from
# training (and the comments claimed "thousand" instances), and
# .copy() on the test slice avoids the SettingWithCopyWarning that
# the original raised when assigning the 'predicted2' column.
training_data = df_tennis.iloc[:-4]
test_data = df_tennis.iloc[-4:].copy()

train_tree = id3(training_data, 'PlayTennis', attribute_names)
print("\n\nThe Resultant Decision train_tree is :\n")
print(train_tree)

# Predict each held-out row; 'Yes' is the default for unseen branches.
test_data['predicted2'] = test_data.apply(classify, axis=1, args=(train_tree, 'Yes'))

print('\n\n Training the model for a few samples, and again predicting \'Playtennis\' for remaining attribute')
print('The Accuracy for new trained data is : '
      + str(sum(test_data['PlayTennis'] == test_data['predicted2'])
            / (1.0 * len(test_data.index))))
The Resultant Decision train_tree is :
{'Outlook': {'Overcast': 'Yes', 'Rainy': {'Windy': {'Strong': 'No', 'Weak':
'Yes'}}, 'Sunny': {'Temperature': {'Cool': 'Yes', 'Hot': 'No', 'Mild': 'No'}}}}
Training the model for a few samples, and again predicting 'Playtennis' for
remaining attribute
The Accuracy for new trained data is : 0.75
C:\Users\Admin\AppData\Local\Temp\ipykernel_4940\150528394.py:8:
SettingWithCopyWarning:
A value is trying to be set on a copy of a slice from a DataFrame.
Try using .loc[row_indexer,col_indexer] = value instead
See the caveats in the documentation: https://pandas.pydata.org/pandas-
docs/stable/user_guide/indexing.html#returning-a-view-versus-a-copy
test_data['predicted2'] =
test_data.apply(classify,axis=1,args=(train_tree,'Yes') )
[ ]: