Model Evaluation Metrics (Regression):
1. MAE – Mean Absolute Error
Definition: The average of the absolute differences between predicted
and actual values.
Formula:
MAE = (1/n) Σ |yᵢ − ŷᵢ|, where yᵢ is the actual value, ŷᵢ the predicted value, and n the number of observations.
Interpretation: Measures average magnitude of the errors in a set of
predictions, without considering their direction.
Pros: Easy to interpret; less sensitive to outliers than MSE/RMSE.
Cons: Doesn’t penalize large errors as heavily as MSE or RMSE.
2. MSE – Mean Squared Error
Definition: The average of the squared differences between predicted
and actual values.
Formula:
MSE = (1/n) Σ (yᵢ − ŷᵢ)²
Interpretation: Gives more weight to larger errors (squares them), so
it penalizes big mistakes more.
Pros: Good for when large errors are especially undesirable.
Cons: Sensitive to outliers; harder to interpret due to squared units.
3. RMSE – Root Mean Squared Error
Definition: The square root of the MSE.
Formula:
RMSE = √MSE = √[ (1/n) Σ (yᵢ − ŷᵢ)² ]
Interpretation: Similar to MSE but in the same units as the target
variable.
Pros: Easier to interpret than MSE; still penalizes large errors.
Cons: Still sensitive to outliers.
Summary Table:
Metric   Penalizes Large Errors   Same Units as Target    Sensitive to Outliers
MAE      No                       Yes                     Less
MSE      Yes                      No (squared units)      Yes
RMSE     Yes                      Yes                     Yes
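A minimal sketch of computing these three regression metrics with scikit-learn; the y_true and y_pred arrays below are made-up illustrative values, not outputs of any model in these notes.
python
# Illustrative sketch: MAE, MSE, and RMSE with scikit-learn (example values only).
import numpy as np
from sklearn.metrics import mean_absolute_error, mean_squared_error

y_true = np.array([3.0, 5.0, 2.5, 7.0])   # actual values (made-up)
y_pred = np.array([2.5, 5.0, 4.0, 8.0])   # predicted values (made-up)

mae = mean_absolute_error(y_true, y_pred)   # average absolute error
mse = mean_squared_error(y_true, y_pred)    # average squared error
rmse = np.sqrt(mse)                         # back in the target's units

print(mae, mse, rmse)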
Model Evaluation Metrics (Classification): Accuracy, Precision, Recall, F1-Score, ROC-AUC.
1. Accuracy
Definition: The ratio of correctly predicted instances
to the total instances.
Formula:
Accuracy = (TP + TN) / (TP + TN + FP + FN)
Use case: Best used when classes are balanced.
Limitation: Misleading in imbalanced datasets.
2. Precision (Positive Predictive Value)
Definition: The ratio of correctly predicted positive
observations to the total predicted positives.
Formula:
Precision = TP / (TP + FP)
Use case: Useful when the cost of false positives is
high (e.g., spam detection).
3. Recall (Sensitivity or True Positive Rate)
Definition: The ratio of correctly predicted positive
observations to all actual positives.
Formula:
Recall = TP / (TP + FN)
Use case: Useful when the cost of false negatives is
high (e.g., disease diagnosis).
4. F1-Score
Definition: Harmonic mean of precision and recall.
Balances the two metrics.
Formula:
F1 = 2 × (Precision × Recall) / (Precision + Recall)
Use case: Best when you need a balance between
precision and recall.
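A minimal sketch of how these four metrics are computed with scikit-learn; the y_true and y_pred label lists are made-up examples, not from any dataset in these notes.
python
# Illustrative sketch: accuracy, precision, recall, and F1 on made-up labels.
from sklearn.metrics import accuracy_score, precision_score, recall_score, f1_score

y_true = [1, 0, 1, 1, 0, 1, 0, 0]   # actual classes (example values)
y_pred = [1, 0, 1, 0, 0, 1, 1, 0]   # predicted classes (example values)

print(accuracy_score(y_true, y_pred))    # (TP + TN) / total
print(precision_score(y_true, y_pred))   # TP / (TP + FP)
print(recall_score(y_true, y_pred))      # TP / (TP + FN)
print(f1_score(y_true, y_pred))          # harmonic mean of precision and recall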
5. ROC-AUC (Receiver Operating Characteristic – Area
Under Curve)
Definition: Measures the model’s ability to
distinguish between classes.
ROC Curve: Plots True Positive Rate (Recall) against
False Positive Rate.
AUC (Area Under Curve): AUC = 1 means perfect
classifier; AUC = 0.5 means random guessing.
Use case: Good for evaluating models across all
classification thresholds.
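A minimal sketch of computing ROC-AUC; unlike the metrics above, it takes predicted probabilities rather than hard class labels. The y_true and y_score values are illustrative placeholders.
python
# Illustrative sketch: ROC-AUC from predicted probabilities (example values only).
from sklearn.metrics import roc_auc_score, roc_curve

y_true = [1, 0, 1, 1, 0, 1, 0, 0]                     # actual classes (made-up)
y_score = [0.9, 0.2, 0.8, 0.4, 0.3, 0.7, 0.6, 0.1]    # predicted probabilities (made-up)

auc = roc_auc_score(y_true, y_score)                # area under the ROC curve
fpr, tpr, thresholds = roc_curve(y_true, y_score)   # points along the ROC curve
print(auc)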
Summary Table:
Metric      Focuses On               Best When...
Accuracy    Overall correctness      Classes are balanced
Precision   False positives          False positives are costly
Recall      False negatives          False negatives are costly
F1-Score    Precision & Recall       Balance needed between FP and FN
ROC-AUC     Classification ability   You care about ranking, not threshold
Model Training and Evaluation
1. Train-Test Split
This is the basic method for evaluating the performance of a
machine learning model.
You split your dataset into:
o Training set (e.g., 80%) – used to train the model.
o Test set (e.g., 20%) – used to evaluate model performance
on unseen data.
python
from sklearn.model_selection import train_test_split

# Hold out 20% of the data as an unseen test set;
# random_state makes the split reproducible.
X_train, X_test, y_train, y_test = train_test_split(X, y, test_size=0.2, random_state=42)
2. Cross-Validation
Cross-validation improves reliability by splitting the data into
multiple folds.
The most common method is k-Fold Cross-Validation:
o The dataset is divided into k subsets (folds), e.g., 5 or 10.
o The model is trained on k-1 folds and tested on the remaining fold.
o This repeats k times, each time with a different test fold.
o The final performance metric is the average score across all folds.
python
from sklearn.model_selection import cross_val_score

# Score the model with 5-fold cross-validation and report the mean.
scores = cross_val_score(model, X, y, cv=5)
print(scores.mean())
3. Hyperparameter Tuning using GridSearchCV
Hyperparameters are parameters that are not learned during
training (e.g., the number of trees in a Random Forest).
GridSearchCV performs an exhaustive search over a grid of
hyperparameter values using cross-validation.
python
from sklearn.ensemble import RandomForestClassifier
from sklearn.model_selection import GridSearchCV

# Grid of hyperparameter values to search over.
param_grid = {
    'n_estimators': [100, 200],
    'max_depth': [None, 10, 20]
}

# Exhaustively try every combination, scoring each with 5-fold cross-validation.
grid_search = GridSearchCV(estimator=RandomForestClassifier(),
                           param_grid=param_grid, cv=5)
grid_search.fit(X_train, y_train)

print(grid_search.best_params_)  # best hyperparameter combination found
print(grid_search.best_score_)   # its mean cross-validated score
4. Overfitting and Underfitting
Overfitting
Description: Model learns the training data too well, including noise.
Symptoms: High accuracy on training, poor on test.
Solution: Regularization, more data, simpler model, cross-validation.
Underfitting
Description: Model is too simple to capture patterns.
Symptoms: Poor performance on both training and test.
Solution: Use a more complex model, feature engineering, reduce regularization.
Visual Example:
Underfitting: Straight line on a curved pattern.
Overfitting: Complex zig-zag curve trying to fit every point.
Good fit: Smooth curve that generalizes well.
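As a rough illustration (not part of the notes above), one common way to spot over- or underfitting is to compare training and test scores. This sketch assumes the X_train, X_test, y_train, y_test split from the train-test section; the DecisionTreeClassifier and its settings are made-up examples.
python
# Illustrative sketch: comparing train vs. test scores to spot over/underfitting.
from sklearn.tree import DecisionTreeClassifier

model = DecisionTreeClassifier(max_depth=None)   # very deep tree: prone to overfitting
model.fit(X_train, y_train)

train_score = model.score(X_train, y_train)      # accuracy on the training data
test_score = model.score(X_test, y_test)         # accuracy on unseen data

# A large gap (high train score, much lower test score) suggests overfitting;
# low scores on both suggest underfitting.
print(train_score, test_score)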