
How good is your model?
Supervised Learning with scikit-learn

Andreas Müller
Core developer, scikit-learn
Classification metrics
Measuring model performance with accuracy:
Fraction of correctly classified samples

Not always a useful metric

Class imbalance example: Emails
Spam classification
99% of emails are real; 1% of emails are spam

Could build a classifier that predicts ALL emails as real

99% accurate!

But horrible at actually classifying spam

Fails at its original purpose

Need more nuanced metrics
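
A minimal sketch of this accuracy trap (illustrative data, not from the slides): a baseline that always predicts "real" scores 99% accuracy on a 99/1 class split, yet catches no spam at all.

import numpy as np
from sklearn.dummy import DummyClassifier
from sklearn.metrics import accuracy_score, recall_score

# Hypothetical labels: 990 real emails (0) and 10 spam emails (1)
y = np.array([0] * 990 + [1] * 10)
X = np.zeros((1000, 1))  # features are irrelevant for this baseline

clf = DummyClassifier(strategy='most_frequent')  # always predicts the majority class
clf.fit(X, y)
y_pred = clf.predict(X)

print(accuracy_score(y, y_pred))             # 0.99
print(recall_score(y, y_pred, pos_label=1))  # 0.0 -- no spam caught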

Diagnosing classification predictions
Confusion matrix (for spam classification, with spam as the positive class):

                     Predicted: spam      Predicted: real
Actual: spam         True positive        False negative
Actual: real         False positive       True negative

Accuracy: (TP + TN) / (TP + TN + FP + FN)


Metrics from the confusion matrix
Precision: TP / (TP + FP)

Recall: TP / (TP + FN)

F1 score: 2 * (precision * recall) / (precision + recall)

High precision: Not many real emails predicted as spam

High recall: Predicted most spam emails correctly
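
These metrics are also available as individual functions in sklearn.metrics; a minimal sketch with illustrative labels (not from the slides):

from sklearn.metrics import precision_score, recall_score, f1_score

y_true = [1, 0, 1, 1, 0, 1, 0, 0]   # 1 = spam, 0 = real
y_pred = [1, 0, 1, 0, 0, 1, 1, 0]

print(precision_score(y_true, y_pred))  # TP / (TP + FP) = 3 / 4 = 0.75
print(recall_score(y_true, y_pred))     # TP / (TP + FN) = 3 / 4 = 0.75
print(f1_score(y_true, y_pred))         # 0.75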

Confusion matrix in scikit-learn
from sklearn.metrics import classification_report
from sklearn.metrics import confusion_matrix
from sklearn.model_selection import train_test_split
from sklearn.neighbors import KNeighborsClassifier

knn = KNeighborsClassifier(n_neighbors=8)

X_train, X_test, y_train, y_test = train_test_split(X, y,
                                                    test_size=0.4, random_state=42)

knn.fit(X_train, y_train)

y_pred = knn.predict(X_test)


Confusion matrix in scikit-learn
print(confusion_matrix(y_test, y_pred))

[[ 52   7]
 [  3 112]]

print(classification_report(y_test, y_pred))

             precision    recall  f1-score   support

          0       0.95      0.88      0.91        59
          1       0.94      0.97      0.96       115
avg / total       0.94      0.94      0.94       174


Let's practice!

Logistic regression and the ROC curve
Supervised Learning with scikit-learn

Hugo Bowne-Anderson
Data Scientist, DataCamp
Logistic regression for binary classification
Logistic regression outputs probabilities

If the probability 'p' is greater than 0.5:
The data is labeled '1'

If the probability 'p' is less than 0.5:
The data is labeled '0'



Linear decision boundary



Logistic regression in scikit-learn
from sklearn.linear_model import LogisticRegression
from sklearn.model_selection import train_test_split

logreg = LogisticRegression()

X_train, X_test, y_train, y_test = train_test_split(X, y,
                                                    test_size=0.4, random_state=42)

logreg.fit(X_train, y_train)
y_pred = logreg.predict(X_test)


Probability thresholds
By default, logistic regression threshold = 0.5

Not specific to logistic regression

k-NN classifiers also have thresholds

What happens if we vary the threshold?
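
A minimal sketch (assuming the fitted logreg and X_test from the previous slides): threshold the predicted probabilities yourself instead of calling .predict().

y_pred_prob = logreg.predict_proba(X_test)[:, 1]  # probability of class 1

for threshold in [0.3, 0.5, 0.7]:
    y_pred = (y_pred_prob > threshold).astype(int)
    print(threshold, y_pred.sum(), "samples labeled 1")

Lowering the threshold tends to label more samples as '1' (higher recall, lower precision); raising it does the opposite. The ROC curve, next, traces out all of these trade-offs.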



The ROC curve
(figure: the ROC curve plots the true positive rate against the false positive rate for every possible classification threshold)


Plotting the ROC curve
import matplotlib.pyplot as plt
from sklearn.metrics import roc_curve

y_pred_prob = logreg.predict_proba(X_test)[:,1]
fpr, tpr, thresholds = roc_curve(y_test, y_pred_prob)

plt.plot([0, 1], [0, 1], 'k--')
plt.plot(fpr, tpr, label='Logistic Regression')
plt.xlabel('False Positive Rate')
plt.ylabel('True Positive Rate')
plt.title('Logistic Regression ROC Curve')
plt.show()


Plotting the ROC curve
logreg.predict_proba(X_test)[:,1]

predict_proba returns a probability for each class; column 1 is the predicted probability of the positive class, which is what roc_curve needs.


Let's practice!

Area under the ROC curve
Supervised Learning with scikit-learn

Hugo Bowne-Anderson
Data Scientist, DataCamp
Area under the ROC curve (AUC)
Larger area under the ROC curve = better model
(figure: a perfect classifier has an AUC of 1, while a model no better than random guessing follows the diagonal with an AUC of 0.5)


AUC in scikit-learn
from sklearn.metrics import roc_auc_score

logreg = LogisticRegression()

X_train, X_test, y_train, y_test = train_test_split(X, y,
                                                    test_size=0.4, random_state=42)

logreg.fit(X_train, y_train)

y_pred_prob = logreg.predict_proba(X_test)[:,1]

roc_auc_score(y_test, y_pred_prob)

0.997466216216


AUC using cross-validation
from sklearn.model_selection import cross_val_score
cv_scores = cross_val_score(logreg, X, y, cv=5,
scoring='roc_auc')

print(cv_scores)

[ 0.99673203 0.99183007 0.99583796 1. 0.96140652]



Let's practice!

Hyperparameter tuning
Supervised Learning with scikit-learn

Hugo Bowne-Anderson
Data Scientist, DataCamp
Hyperparameter tuning
Linear regression: Choosing parameters

Ridge/lasso regression: Choosing alpha

k-Nearest Neighbors: Choosing n_neighbors

Parameters like alpha and k: Hyperparameters

Hyperparameters cannot be learned by fitting the model



Choosing the correct hyperparameter
Try a bunch of different hyperparameter values

Fit all of them separately

See how well each performs

Choose the best performing one

It is essential to use cross-validation



Grid search cross-validation
(figure: a grid of candidate hyperparameter values; each combination is scored with cross-validation and the best-performing combination is chosen)


GridSearchCV in scikit-learn
import numpy as np
from sklearn.model_selection import GridSearchCV
from sklearn.neighbors import KNeighborsClassifier

param_grid = {'n_neighbors': np.arange(1, 50)}

knn = KNeighborsClassifier()

knn_cv = GridSearchCV(knn, param_grid, cv=5)

knn_cv.fit(X, y)

knn_cv.best_params_

{'n_neighbors': 12}

knn_cv.best_score_

0.933216168717


Let's practice!

Hold-out set for final evaluation
Supervised Learning with scikit-learn

Hugo Bowne-Anderson
Data Scientist, DataCamp
Hold-out set reasoning
How well can the model perform on never-before-seen data?

Using ALL data for cross-validation is not ideal

Split data into training and hold-out set at the beginning

Perform grid search cross-validation on training set

Choose best hyperparameters and evaluate on hold-out set, as sketched below
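
A minimal sketch of this workflow (assuming X and y from earlier slides; the split size is illustrative):

import numpy as np
from sklearn.model_selection import train_test_split, GridSearchCV
from sklearn.neighbors import KNeighborsClassifier

# 1. Set aside a hold-out set before any tuning
X_train, X_holdout, y_train, y_holdout = train_test_split(X, y,
                                                          test_size=0.3, random_state=42)

# 2. Tune hyperparameters with cross-validation on the training set only
param_grid = {'n_neighbors': np.arange(1, 50)}
knn_cv = GridSearchCV(KNeighborsClassifier(), param_grid, cv=5)
knn_cv.fit(X_train, y_train)

# 3. Report final performance on the untouched hold-out set
print(knn_cv.best_params_)
print(knn_cv.score(X_holdout, y_holdout))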



Let's practice!
Supervised Learning with scikit-learn
