Model Evaluation Methods
Evaluation Methods for Classification
1. Accuracy:
- Correct predictions / Total predictions
- Best for balanced datasets
Code:
from sklearn.metrics import accuracy_score
accuracy = accuracy_score(y_test, y_pred)
2. Precision:
- Correct positive predictions / Total positive predictions made
- Best when false positives are costly
Code:
from sklearn.metrics import precision_score
precision = precision_score(y_test, y_pred)
3. Recall (Sensitivity):
- Correct positive predictions / All actual positives
- Best when false negatives are costly
Code:
from sklearn.metrics import recall_score
recall = recall_score(y_test, y_pred)
4. F1 Score:
- Harmonic mean of Precision and Recall
- Best when you need a single score that balances precision and recall (e.g. on imbalanced classes)
Code:
from sklearn.metrics import f1_score
f1 = f1_score(y_test, y_pred)
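The harmonic-mean relationship can be checked directly. A minimal sketch with toy labels (the data is invented for illustration, not from the notes):

```python
from sklearn.metrics import precision_score, recall_score, f1_score

# Toy binary labels: 1 = positive class (hypothetical example data)
y_test = [0, 1, 1, 0, 1, 0, 1, 1]
y_pred = [0, 1, 0, 0, 1, 1, 1, 1]

p = precision_score(y_test, y_pred)  # tp / (tp + fp)
r = recall_score(y_test, y_pred)     # tp / (tp + fn)
f1 = f1_score(y_test, y_pred)

# F1 is the harmonic mean of precision and recall
assert abs(f1 - 2 * p * r / (p + r)) < 1e-9
```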
5. Confusion Matrix:
- Matrix showing TP, FP, TN, FN
Code:
from sklearn.metrics import confusion_matrix
cm = confusion_matrix(y_test, y_pred)
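For a binary problem, the four counts can be unpacked from the matrix with `ravel()`; sklearn orders rows/columns by sorted label, so for labels {0, 1} the flattened order is (tn, fp, fn, tp). A sketch with toy labels (invented for illustration):

```python
from sklearn.metrics import confusion_matrix

# Toy binary labels: 1 = positive class (hypothetical example data)
y_test = [0, 1, 1, 0, 1, 0, 1, 1]
y_pred = [0, 1, 0, 0, 1, 1, 1, 1]

# Rows = true class, columns = predicted class; ravel() flattens the
# 2x2 matrix into (tn, fp, fn, tp)
tn, fp, fn, tp = confusion_matrix(y_test, y_pred).ravel()
```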
6. ROC Curve & AUC:
- ROC: True Positive Rate vs False Positive Rate
- AUC: Area under the ROC curve; a single score summarizing performance across all thresholds (1.0 = perfect, 0.5 = random)
Code:
from sklearn.metrics import roc_auc_score, roc_curve
auc = roc_auc_score(y_test, y_pred_prob)  # needs predicted probabilities, not class labels
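Unlike the metrics above, ROC/AUC needs scores or probabilities for the positive class (e.g. from `predict_proba`), not hard labels. A minimal sketch with invented probabilities:

```python
from sklearn.metrics import roc_auc_score, roc_curve

# Hypothetical true labels and predicted positive-class probabilities
y_test = [0, 0, 1, 1]
y_pred_prob = [0.1, 0.4, 0.35, 0.8]

# roc_curve returns the FPR and TPR at each probability threshold
fpr, tpr, thresholds = roc_curve(y_test, y_pred_prob)
auc = roc_auc_score(y_test, y_pred_prob)
```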
Evaluation Methods for Regression
1. Mean Absolute Error (MAE):
- Average of the absolute differences between predictions and actual values
Code:
from sklearn.metrics import mean_absolute_error
mae = mean_absolute_error(y_test, y_pred)
2. Mean Squared Error (MSE):
- Average of squared differences
Code:
from sklearn.metrics import mean_squared_error
mse = mean_squared_error(y_test, y_pred)
3. Root Mean Squared Error (RMSE):
- Square root of MSE (penalizes large errors)
Code:
import numpy as np
rmse = np.sqrt(mse)  # mse computed in the previous step
4. R² Score:
- Proportion of variance in the target explained by the model (1 = perfect, 0 = no better than predicting the mean; can be negative)
Code:
from sklearn.metrics import r2_score
r2 = r2_score(y_test, y_pred)
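The four regression metrics above can be computed together on one toy dataset (the numbers are invented for illustration):

```python
import numpy as np
from sklearn.metrics import mean_absolute_error, mean_squared_error, r2_score

# Hypothetical regression targets and predictions
y_test = [3.0, -0.5, 2.0, 7.0]
y_pred = [2.5, 0.0, 2.0, 8.0]

mae = mean_absolute_error(y_test, y_pred)  # mean(|error|)
mse = mean_squared_error(y_test, y_pred)   # mean(error^2)
rmse = np.sqrt(mse)                        # same units as the target
r2 = r2_score(y_test, y_pred)              # 1 - SS_res / SS_tot
```

Note how MSE weights the single error of 1.0 more heavily than the two errors of 0.5, which is exactly the "penalizes large errors" behavior described above.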
Summary Table
| Type           | Method    | Best For                        |
|----------------|-----------|---------------------------------|
| Classification | Accuracy  | Balanced datasets               |
| Classification | Precision | Avoiding false positives        |
| Classification | Recall    | Avoiding false negatives        |
| Classification | F1 Score  | Balance of precision and recall |
| Classification | ROC/AUC   | Overall classification quality  |
| Regression     | MAE       | Interpretable average error     |
| Regression     | MSE/RMSE  | Penalizing larger errors        |
| Regression     | R² Score  | Goodness of fit                 |
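As an end-to-end sketch, the classification metrics above can all be computed from one fitted model. This uses a synthetic dataset and logistic regression purely for illustration; any classifier and dataset work the same way:

```python
from sklearn.datasets import make_classification
from sklearn.model_selection import train_test_split
from sklearn.linear_model import LogisticRegression
from sklearn.metrics import accuracy_score, f1_score, roc_auc_score

# Synthetic binary dataset (illustrative only)
X, y = make_classification(n_samples=500, random_state=0)
X_train, X_test, y_train, y_test = train_test_split(X, y, random_state=0)

model = LogisticRegression(max_iter=1000).fit(X_train, y_train)
y_pred = model.predict(X_test)                   # hard labels for accuracy/F1
y_pred_prob = model.predict_proba(X_test)[:, 1]  # probabilities for AUC

accuracy = accuracy_score(y_test, y_pred)
f1 = f1_score(y_test, y_pred)
auc = roc_auc_score(y_test, y_pred_prob)
```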