
Key Machine Learning Terminologies and Their Explanations
This study guide extracts and defines the core terms introduced in “Hands-On Machine
Learning with Scikit-Learn, Keras, and TensorFlow” (2nd Ed.), organized by theme for clarity.

1. Supervised Learning
Training Set
A labeled dataset used to fit (train) a model; each example includes input features and the
correct output label.
Features (Attributes)
Measurable properties or characteristics of the data (e.g., petal length, income).
Labels (Targets)
The desired outputs in supervised learning (e.g., price, class).
Regression
Predicting a continuous quantity (e.g., house price, life satisfaction).
Classification
Predicting discrete categories (e.g., spam vs. ham, digit 0–9).
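The two supervised task types can be contrasted in a few lines of scikit-learn, the library the book uses. This is a minimal sketch on made-up toy data (the values and threshold are illustrative, not from the book):

```python
import numpy as np
from sklearn.linear_model import LinearRegression, LogisticRegression

# Regression: predict a continuous quantity (here y = 2x + 1, noise-free).
X = np.array([[1.0], [2.0], [3.0], [4.0]])
y_cont = np.array([3.0, 5.0, 7.0, 9.0])
reg = LinearRegression().fit(X, y_cont)
print(reg.predict([[5.0]])[0])  # ≈ 11.0

# Classification: predict a discrete category (0 below a threshold, 1 above).
y_cls = np.array([0, 0, 1, 1])
clf = LogisticRegression().fit(X, y_cls)
print(clf.predict([[4.0]])[0])  # 1
```

Note that both models consume the same feature matrix X; only the type of label (continuous vs. discrete) changes the task.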

2. Unsupervised Learning
Clustering
Grouping similar instances without labels (e.g., K-Means, DBSCAN).
Dimensionality Reduction
Reducing the number of features while preserving structure (e.g., PCA, Kernel PCA, LLE).
Anomaly Detection
Identifying unusual instances that deviate from the norm (e.g., One-class SVM, Isolation
Forest).
Association Rule Learning
Discovering relationships between variables in large datasets (e.g., Apriori).
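Clustering and dimensionality reduction can both be sketched with scikit-learn. The two "blobs" below are invented toy data; the point is only that K-Means recovers the grouping with no labels, and PCA compresses the features:

```python
import numpy as np
from sklearn.cluster import KMeans
from sklearn.decomposition import PCA

# Two obvious blobs: points near (0, 0) and points near (10, 10).
X = np.array([[0, 0], [0.5, 0.2], [0.1, 0.4],
              [10, 10], [10.2, 9.8], [9.9, 10.1]])

# Clustering: group similar instances without any labels.
km = KMeans(n_clusters=2, n_init=10, random_state=42).fit(X)
print(km.labels_)  # first three points share one label, last three the other

# Dimensionality reduction: project 2-D data onto its top principal component.
X_1d = PCA(n_components=1).fit_transform(X)
print(X_1d.shape)  # (6, 1)
```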

3. Model Evaluation
Overfitting
A model fits the training data too closely and fails to generalize to new data.
Underfitting
A model is too simple and cannot capture underlying patterns in the data.
Bias–Variance Trade-off
Balancing underfitting (high bias) vs. overfitting (high variance).
Cross-Validation
Partitioning data into folds to reliably estimate generalization error.
Holdout Test Set
A final subset of data (e.g., 20%) held aside to assess the model after training.
Precision / Recall
– Precision: TP/(TP + FP), the fraction of positive predictions that are correct.
– Recall: TP/(TP + FN), the fraction of actual positives correctly identified.
F₁ Score
The harmonic mean of precision and recall: F₁ = 2 × (precision × recall) / (precision + recall).
Receiver Operating Characteristic (ROC) Curve
Plots True Positive Rate vs. False Positive Rate at various thresholds.
Area Under the ROC Curve (AUC)
A summary scalar of ROC performance; 1.0 is perfect, 0.5 is random.
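The metrics above map directly onto `sklearn.metrics`. A small worked example (the labels and scores below are made up so the counts are easy to check by hand):

```python
from sklearn.metrics import precision_score, recall_score, f1_score, roc_auc_score

y_true  = [0, 0, 1, 1, 1, 0]               # ground-truth labels
y_pred  = [0, 1, 1, 1, 0, 0]               # hard predictions: TP=2, FP=1, FN=1
y_score = [0.1, 0.6, 0.8, 0.9, 0.4, 0.2]   # probability scores for the ROC curve

print(precision_score(y_true, y_pred))  # 2/(2+1) ≈ 0.667
print(recall_score(y_true, y_pred))     # 2/(2+1) ≈ 0.667
print(f1_score(y_true, y_pred))         # harmonic mean of the two ≈ 0.667
print(roc_auc_score(y_true, y_score))   # 8/9 ≈ 0.889: one positive outranked by one negative
```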

4. Training Algorithms
Normal Equation
Direct closed-form solution for Linear Regression: θ̂ = (XᵀX)⁻¹ Xᵀ y.
Singular Value Decomposition (SVD)
Factorizes X to compute the pseudoinverse X⁺ for regression (θ̂ = X⁺y) when XᵀX is singular.
Gradient Descent (GD)
Iterative optimization: θ ← θ − η ∇θMSE(θ).
– Batch GD: uses all instances per step.
– Stochastic GD: uses one instance per step.
– Mini-batch GD: uses small batches per step.
Learning Rate (η)
Step size in gradient descent; too small slows convergence, too large may diverge.
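The batch GD update can be written out in a few lines of NumPy. This is a sketch on noise-free toy data (y = 4 + 3x; the true parameters are chosen so convergence is easy to verify):

```python
import numpy as np

# Batch gradient descent for linear regression on y = 4 + 3x.
rng = np.random.default_rng(42)
X = rng.uniform(0, 2, size=(100, 1))
y = 4 + 3 * X[:, 0]

X_b = np.c_[np.ones((100, 1)), X]  # add a bias column of ones
eta = 0.1                          # learning rate η
theta = np.zeros(2)
for _ in range(2000):
    gradients = 2 / 100 * X_b.T @ (X_b @ theta - y)  # ∇θ MSE(θ), over ALL instances
    theta -= eta * gradients                         # θ ← θ − η ∇θ MSE(θ)

print(theta.round(2))  # ≈ [4. 3.]
```

Stochastic and mini-batch GD change only which rows of `X_b` enter the gradient at each step; the update rule itself is identical.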
Regularization
Constraining model parameters to reduce overfitting:
– Ridge (ℓ₂) Regression: adds α Σᵢ θᵢ² to the cost function.
– Lasso (ℓ₁) Regression: adds α Σᵢ |θᵢ|, promotes sparsity.
– Elastic Net: mix of ℓ₁ and ℓ₂ penalties.
Early Stopping
Ceasing training when validation error stops improving to prevent overfitting.
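The qualitative difference between the ℓ₂ and ℓ₁ penalties shows up directly in fitted weights. A minimal sketch, assuming invented data where only the first of three features matters (the α values are arbitrary, chosen to make the effect visible):

```python
import numpy as np
from sklearn.linear_model import Ridge, Lasso

# Toy data: y depends only on x0; x1 and x2 are pure noise features.
rng = np.random.default_rng(0)
X = rng.normal(size=(200, 3))
y = 2 * X[:, 0]

ridge = Ridge(alpha=1.0).fit(X, y)
lasso = Lasso(alpha=0.5).fit(X, y)

print(ridge.coef_.round(3))  # all weights shrunk, none exactly zero
print(lasso.coef_.round(3))  # irrelevant weights driven exactly to zero (sparsity)
```

This is why Lasso can act as a built-in feature selector, while Ridge merely dampens all weights smoothly.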

5. Support Vector Machines (SVM)


Hyperplane
Decision boundary separating classes; in n-dimensional space it is (n − 1)-dimensional.
Margin
Distance between the hyperplane and the nearest training instances.
Support Vectors
Training instances that lie on the margin edges, which determine the hyperplane.
Hard Margin
Assumes perfect separability; no margin violations allowed.
Soft Margin
Allows some margin violations via slack variables and the penalty hyperparameter C.
Kernel Trick
Computing dot products in transformed feature spaces without explicit mapping; e.g.:
– Polynomial Kernel: K(a, b) = (γ aᵀb + r)ᵈ.
– Gaussian RBF: K(a, b) = exp(−γ ‖a − b‖²).
Dual Problem
Reformulation expressing optimization in terms of instance weights αᵢ, enabling kernels.
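The ideas above come together in scikit-learn's `SVC`. A minimal sketch on an XOR-style toy layout (linearly inseparable, so a linear hyperplane fails but the RBF kernel succeeds; the C and gamma values are illustrative):

```python
import numpy as np
from sklearn.svm import SVC

# XOR-style data: no straight line separates the two classes.
X = np.array([[0, 0], [1, 1], [0, 1], [1, 0]], dtype=float)
y = np.array([0, 0, 1, 1])

# C is the soft-margin penalty; gamma is the RBF kernel's γ.
clf = SVC(kernel="rbf", C=10.0, gamma=1.0).fit(X, y)

print(clf.predict([[0.1, 0.1], [0.9, 0.1]]))  # → [0 1]
print(len(clf.support_vectors_))  # the instances that determine the boundary
```

Thanks to the kernel trick, the separating hyperplane lives in the RBF feature space; no explicit mapping is ever computed.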

6. Decision Trees
Node Impurity
– Gini Impurity: G = 1 − Σᵢ pᵢ².
– Entropy: H = −Σᵢ pᵢ log₂(pᵢ).
CART Algorithm
Binary tree induction by minimizing weighted impurity of splits.
Max Depth / Min Samples Split / Min Samples Leaf
Hyperparameters controlling tree growth to avoid overfitting.
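Both the impurity formula and the growth-limiting hyperparameters are easy to check concretely. A sketch with made-up labels (3 of class 0, 2 of class 1) and a trivially separable 1-D dataset:

```python
import numpy as np
from sklearn.tree import DecisionTreeClassifier

# Gini impurity of a node by hand: G = 1 − Σ pᵢ²
labels = np.array([0, 0, 0, 1, 1])          # class proportions 0.6 and 0.4
p = np.bincount(labels) / len(labels)
gini = 1 - np.sum(p ** 2)
print(round(gini, 2))  # 1 − (0.6² + 0.4²) = 0.48

# max_depth=1 allows a single CART split, curbing overfitting.
X = np.array([[1], [2], [3], [10], [11], [12]])
y = np.array([0, 0, 0, 1, 1, 1])
tree = DecisionTreeClassifier(max_depth=1, random_state=42).fit(X, y)
print(tree.predict([[2.5], [10.5]]))  # → [0 1]
```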

7. Ensemble Methods
Bagging (Bootstrap Aggregation)
Training predictors on bootstrap-sampled subsets and aggregating votes/predictions.
Random Forests
Bagged Decision Trees with feature subsampling at each split for extra diversity.
Extra-Trees
Like Random Forests but with random split thresholds as well.
Boosting
Sequentially training predictors to correct predecessors’ errors, e.g., AdaBoost, Gradient
Boosting.
Stacking
Training a meta-learner (blender) on base learners’ predictions to optimally combine them.
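Bagging and Random Forests can be compared side by side on the moons dataset (a common choice in the book; the hyperparameter values below are illustrative):

```python
from sklearn.datasets import make_moons
from sklearn.ensemble import BaggingClassifier, RandomForestClassifier
from sklearn.tree import DecisionTreeClassifier
from sklearn.model_selection import train_test_split

X, y = make_moons(n_samples=500, noise=0.30, random_state=42)
X_tr, X_te, y_tr, y_te = train_test_split(X, y, random_state=42)

# Bagging: 200 trees, each trained on a bootstrap sample of the training set.
bag = BaggingClassifier(DecisionTreeClassifier(), n_estimators=200,
                        random_state=42).fit(X_tr, y_tr)

# Random Forest: bagging plus random feature subsampling at each split.
rf = RandomForestClassifier(n_estimators=200, random_state=42).fit(X_tr, y_tr)

print(bag.score(X_te, y_te), rf.score(X_te, y_te))  # typically ≈ 0.85–0.92 here
```

Because the individual trees are decorrelated by bootstrapping (and, in the forest, by feature subsampling), averaging their votes lowers variance relative to any single deep tree.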

Practice Flashcards
1. What is the bias–variance trade-off?
Balancing model simplicity (high bias, underfitting) vs. model complexity (high variance,
overfitting).
2. How does ℓ₁ regularization differ from ℓ₂?
ℓ₁ promotes sparse weights (feature selection), ℓ₂ shrinks weights smoothly.
3. What does the kernel trick enable in SVMs?
Applying a linear algorithm in a high-dimensional feature space without explicitly mapping to
it.
4. When do you use early stopping?
To stop training once validation error plateaus or rises, preventing overfitting.
5. How do Random Forests reduce overfitting compared to a single tree?
Aggregating many decorrelated trees lowers variance.
6. Define precision and recall.
Precision = TP/(TP+FP); recall = TP/(TP+FN).
7. What is PCA’s objective?
Find orthogonal axes (principal components) that maximize data variance, then project onto
them.
8. Why scale features before distance-based algorithms?
To ensure all features contribute equally and prevent distortion.
9. What is a confusion matrix?
A table showing counts of TP, TN, FP, FN for binary classification.
10. Describe Bagging vs. Boosting.
Bagging trains models independently on random subsets; boosting trains sequentially,
focusing on previous errors.
