Hyperparameter Tuning

Hyperparameter tuning is essential for optimizing machine learning algorithms by selecting the best hyperparameters before training begins. It involves understanding the trade-off between bias and variance to prevent under-fitting and over-fitting, and utilizes various search algorithms like grid search and random search for efficiency. Model evaluation metrics vary based on the algorithm type and are crucial for selecting the right model and hyperparameters for optimal performance.

Uploaded by

textque5

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as DOCX, PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

38 views3 pages

Hyperparameter Tuning

Uploaded by

textque5

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as DOCX, PDF, TXT or read online on Scribd

You are on page 1/ 3

Hyperparameter Tuning

Hyperparameter tuning is choosing a set of optimal hyperparameters for a learning

algorithm. A hyperparameter is a model argument whose value is set before the
learning process begins. The key to machine learning algorithms is hyperparameter
tuning.

Hyperparameter types:
 K in K-NN
 Regularization constant, kernel type, and constants in SVMs
 Number of layers, number of units per layer, regularization in neural network

Generalization (test) error of learning algorithms has

two main components:
 Bias: error due to simplifying model assumptions
 Variance: error due to randomness of the training set

The trade-off between these components is determined by the complexity of the model
and the amount of training data. The optimal hyperparameters help to avoid under-fitting
(training and test error are both high) and over-fitting (Training error is low but test error
is high)

Introduction
Workflow: One of the core tasks of developing an ML model is to evaluate its
performance. There are multiple stages in developing an ML model for use in software
applications.

Figure 1: Workflow
Evaluation: Model evaluation and ongoing evaluation may have different matrices. For

example, model evaluation may include Accuracy or AUROC and ongoing evaluation

may include customer lifetime value. Also, the distribution of the data might change

between the historical data and live data. One way to detect distribution drift is through

continuous model monitoring.

Hyper-parameters: Model parameters are learned from data and hyper-parameters are

tuned to get the best fit. Searching for the best hyper-parameter can be tedious, hence

search algorithms like grid search and random search are used.

Figure 2: Hyper-parameter tuning vs Model training

Model Evaluation

Evaluation Matrices: These are tied to ML tasks. There are different matrices
for supervised algorithms (classification and regression) and unsupervised
algorithms. For example, the performance of classification of the binary class
is measured using Accuracy, AUROC, Log-loss, and KS.

Evaluation Mechanism: Model selection refers to the process of selecting

the right model that fits the data. This is done using test evaluation matrices.
The results from the test data are passed back to the hyper-parameter tuner
to get the most optimal hyperparameters.
Figure 3: Evaluation Mechanism

Hyperparameter Tuning
Hyperparameters: Vanilla linear regression does not have any hyperparameters.
Variants of linear regression (ridge and lasso) have regularization as a hyperparameter.
The decision tree has max depth and min number of observations in leaf as
hyperparameters.

Optimal Hyperparameters: Hyperparameters control the over-fitting and under-fitting

of the model. Optimal hyperparameters often differ for different datasets. To get the best
hyperparameters the following steps are followed:

1. For each proposed hyperparameter setting the model is evaluated

2. The hyperparameters that give the best model are selected.

Hyperparameters Search: Grid search picks out a grid of hyperparameter values and
evaluates all of them. Guesswork is necessary to specify the min and max values for
each hyperparameter. Random search randomly values a random sample of points on
the grid. It is more efficient than grid search. Smart hyperparameter tuning picks a few
hyperparameter settings, evaluates the validation matrices, adjusts the
hyperparameters, and re-evaluates the validation matrices. Examples of smart hyper-
parameter are Spearmint (hyperparameter optimization using Gaussian processes) and
Hyperopt (hyperparameter optimization using Tree-based estimators).

Lecture 9 Model Selection
No ratings yet
Lecture 9 Model Selection
15 pages
PHYPER
No ratings yet
PHYPER
3 pages
Hyper Parameter New
No ratings yet
Hyper Parameter New
4 pages
Tunability: Importance of Hyperparameters of Machine Learning Algorithms
No ratings yet
Tunability: Importance of Hyperparameters of Machine Learning Algorithms
32 pages
Module2.3 Hyperparameter Optimization
No ratings yet
Module2.3 Hyperparameter Optimization
29 pages
Hyperparameter Tuning
No ratings yet
Hyperparameter Tuning
4 pages
Hyperparameter Tuning - GeeksforGeeks
No ratings yet
Hyperparameter Tuning - GeeksforGeeks
23 pages
Hyper Parameter Turning
No ratings yet
Hyper Parameter Turning
4 pages
The Importance of Hyperparameters in Machine Learning
No ratings yet
The Importance of Hyperparameters in Machine Learning
8 pages
Hyperparameter Tuning Guide
No ratings yet
Hyperparameter Tuning Guide
9 pages
Unit 4
No ratings yet
Unit 4
34 pages
Hyper Parameters
No ratings yet
Hyper Parameters
7 pages
Model Training: (Anything Done While We Train The Model)
No ratings yet
Model Training: (Anything Done While We Train The Model)
194 pages
Lecture 4.1 AML
No ratings yet
Lecture 4.1 AML
12 pages
Model Evaluation and Selection
No ratings yet
Model Evaluation and Selection
49 pages
ML Chap 5
No ratings yet
ML Chap 5
14 pages
Machine Learning Process
No ratings yet
Machine Learning Process
2 pages
Algorithm Comparison Guide
No ratings yet
Algorithm Comparison Guide
14 pages
Training Evaluation
No ratings yet
Training Evaluation
42 pages
Tadlo MCL
No ratings yet
Tadlo MCL
11 pages
Model Parameters
No ratings yet
Model Parameters
26 pages
1 s2.0 S1674862X19300047 Main
No ratings yet
1 s2.0 S1674862X19300047 Main
15 pages
Hyperparameter Optimization For Machine Learning Models Based On Bayesian Optimization
No ratings yet
Hyperparameter Optimization For Machine Learning Models Based On Bayesian Optimization
15 pages
Hyperparameter Optimization of ML Algorithms
No ratings yet
Hyperparameter Optimization of ML Algorithms
69 pages
Hyperparameters
No ratings yet
Hyperparameters
2 pages
ML Algorithms
No ratings yet
ML Algorithms
2 pages
Hyperparameter Tuning
No ratings yet
Hyperparameter Tuning
3 pages
Hyperparameter Optimization Survey
No ratings yet
Hyperparameter Optimization Survey
13 pages
D2 Deep Learning Workshop Session 3
No ratings yet
D2 Deep Learning Workshop Session 3
5 pages
Hyperparameter Tuning for ML Experts
No ratings yet
Hyperparameter Tuning for ML Experts
22 pages
Unit 3 ML
No ratings yet
Unit 3 ML
40 pages
Hyperparameter Tuning
No ratings yet
Hyperparameter Tuning
17 pages
On Hyperparameter Optimization of Machine Learning Algorithms: Theory and Practice
No ratings yet
On Hyperparameter Optimization of Machine Learning Algorithms: Theory and Practice
69 pages
Automl: A Perspective Where Industry Meets Academy
No ratings yet
Automl: A Perspective Where Industry Meets Academy
154 pages
Hyper Parameters
No ratings yet
Hyper Parameters
24 pages
Hyperparameters and Parameters
No ratings yet
Hyperparameters and Parameters
8 pages
Parameters and LL
No ratings yet
Parameters and LL
6 pages
Unit 4 A
No ratings yet
Unit 4 A
16 pages
Machine Learning # 2
No ratings yet
Machine Learning # 2
17 pages
EE353 - 769 06 Intro To ML
No ratings yet
EE353 - 769 06 Intro To ML
27 pages
SML Updated UNIT 4
No ratings yet
SML Updated UNIT 4
44 pages
Topic 3
No ratings yet
Topic 3
48 pages
Dda3020 ML Fall24 Jia l13
No ratings yet
Dda3020 ML Fall24 Jia l13
45 pages
Hyperparameters
No ratings yet
Hyperparameters
11 pages
Ai Unit 5
No ratings yet
Ai Unit 5
13 pages
ML Unit IV
No ratings yet
ML Unit IV
70 pages
ML Individual Assigenment 1
No ratings yet
ML Individual Assigenment 1
11 pages
Lec - 4
No ratings yet
Lec - 4
43 pages
MC4301 - ML Unit 2 (Model Evaluation and Feature Engineering)
No ratings yet
MC4301 - ML Unit 2 (Model Evaluation and Feature Engineering)
40 pages
Hyperparameter Tuning Is The Process of Optimizing The Model
No ratings yet
Hyperparameter Tuning Is The Process of Optimizing The Model
3 pages
Lecture6c HyperparameterOptimization
No ratings yet
Lecture6c HyperparameterOptimization
19 pages
Lecture 5 - Feature Extraction, Model Building & Evaluation
No ratings yet
Lecture 5 - Feature Extraction, Model Building & Evaluation
35 pages
Unit 5
No ratings yet
Unit 5
11 pages
Machine Learning Model ENG
No ratings yet
Machine Learning Model ENG
16 pages
Step-4ML GWP 1
No ratings yet
Step-4ML GWP 1
1 page
Model Selection, Evaluation Metrics, and Learning From Imbalanced Data
No ratings yet
Model Selection, Evaluation Metrics, and Learning From Imbalanced Data
31 pages
ML Answer Key (M.tech)
No ratings yet
ML Answer Key (M.tech)
31 pages
CD Lab
No ratings yet
CD Lab
21 pages
Difference Between ANN, CNN and RNN
No ratings yet
Difference Between ANN, CNN and RNN
4 pages
BPMN
No ratings yet
BPMN
3 pages
Unit 3 - Managing Innovation and Entrepreneurship - WWW - Rgpvnotes.in
No ratings yet
Unit 3 - Managing Innovation and Entrepreneurship - WWW - Rgpvnotes.in
9 pages
Unit 5 - Managing Innovation and Entrepreneurship - WWW - Rgpvnotes.in
No ratings yet
Unit 5 - Managing Innovation and Entrepreneurship - WWW - Rgpvnotes.in
9 pages
Unit 2 - Managing Innovation and Entrepreneurship - WWW - Rgpvnotes.in
No ratings yet
Unit 2 - Managing Innovation and Entrepreneurship - WWW - Rgpvnotes.in
10 pages
Artificial Intelligence-Based Inventory Management For Retail Supply
No ratings yet
Artificial Intelligence-Based Inventory Management For Retail Supply
14 pages
Correct Validation WP Final V
No ratings yet
Correct Validation WP Final V
26 pages
8 Weeks Main ML Plan
No ratings yet
8 Weeks Main ML Plan
11 pages
Pravin 2022 Piper
No ratings yet
Pravin 2022 Piper
26 pages
1 s2.0 S095219762401100X Main
No ratings yet
1 s2.0 S095219762401100X Main
11 pages
Data Science Checklist
No ratings yet
Data Science Checklist
22 pages
Hyperparameter Tuning Guide
No ratings yet
Hyperparameter Tuning Guide
2 pages
Scikit-Learn for Data Scientists
No ratings yet
Scikit-Learn for Data Scientists
27 pages
Deep Learning and Genetic Algorithms For Cosmological Bayesian Inference Speed-Up
No ratings yet
Deep Learning and Genetic Algorithms For Cosmological Bayesian Inference Speed-Up
16 pages
Automated Diabetic Retinopathy Detection
No ratings yet
Automated Diabetic Retinopathy Detection
80 pages
May2024 - KT (2) - 1
No ratings yet
May2024 - KT (2) - 1
10 pages
12 - 23ECE216 - Nearest Neighbors
No ratings yet
12 - 23ECE216 - Nearest Neighbors
29 pages
Student Name: Course: Machine Learning Group: E27-24 Date: 16.01.2025
No ratings yet
Student Name: Course: Machine Learning Group: E27-24 Date: 16.01.2025
10 pages
Randomized Optimization Assignment
No ratings yet
Randomized Optimization Assignment
5 pages
Monitoring, Detection and Classification of Rabbit Livestock Activities Using The Internet of Things (IoT) and Support Vector Machine (SVM)
No ratings yet
Monitoring, Detection and Classification of Rabbit Livestock Activities Using The Internet of Things (IoT) and Support Vector Machine (SVM)
6 pages
Deep Fake Detection Vtu Report
No ratings yet
Deep Fake Detection Vtu Report
41 pages
EAAI-24-11159 R1 Reviewer
No ratings yet
EAAI-24-11159 R1 Reviewer
97 pages
13416-Article Text-95882-2-10-20250111
No ratings yet
13416-Article Text-95882-2-10-20250111
14 pages
What Is Epoch in Machine Learning - Simplilearn
No ratings yet
What Is Epoch in Machine Learning - Simplilearn
10 pages
An Enhanced Swin Transformer For Soccer Player Reidentification
No ratings yet
An Enhanced Swin Transformer For Soccer Player Reidentification
14 pages
Transition to Data Scientist in 6 Weeks
No ratings yet
Transition to Data Scientist in 6 Weeks
5 pages
Using Machine Learning For Detection and Prediction of Chronic Diseases
No ratings yet
Using Machine Learning For Detection and Prediction of Chronic Diseases
17 pages
A Gradient Boosting Model To Predict The Milk Production
No ratings yet
A Gradient Boosting Model To Predict The Milk Production
8 pages
Applsci 12 09820 v2
No ratings yet
Applsci 12 09820 v2
15 pages
Azure DP-100 Exam Skills Guide
No ratings yet
Azure DP-100 Exam Skills Guide
9 pages
Optimizing Healthcare Outcomes Through Data-Driven Predictive Modeling
No ratings yet
Optimizing Healthcare Outcomes Through Data-Driven Predictive Modeling
19 pages
Empowering Glioma Prognosis With Transparent Machine Learning and Interpretative Insights Using Explainable AI
No ratings yet
Empowering Glioma Prognosis With Transparent Machine Learning and Interpretative Insights Using Explainable AI
22 pages
Fake News Detection with AI Methods
No ratings yet
Fake News Detection with AI Methods
11 pages
Himanshu Sharma: Projects Tech Stack
No ratings yet
Himanshu Sharma: Projects Tech Stack
1 page
OCI Data Science Exam Answers
No ratings yet
OCI Data Science Exam Answers
10 pages