ANOVA IS NOT IN MINING SYLLABUS
//////////////////////////////////////////////////////////////////////////////////////////////////////////
NOT DONE BY MAM
/////////////////////////////////////////////////////////////////////////////////////////////////////////
MACHINE LEARNING ALGORITHM
5. Root Mean Squared Error
Root Mean Squared Error (RMSE) is the square root of the Mean Squared Error (MSE), i.e. the square root of the mean of the squared differences between actual and predicted values.
RMSE = √( (1/N) Σ (Yi − Ŷi)² )
Here, N = total number of data points, Yi = actual value, Ŷi = predicted value
The higher the RMSE, the larger the deviation between actual and predicted values. The lower the RMSE, the better the model is at its predictions.
Advantages of RMSE:
i) The value of RMSE is in the same unit as the output, which makes the interpretation of the loss easy.
Disadvantages of RMSE:
i) Not robust to outliers.
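The RMSE formula above can be sketched in plain Python (a minimal sketch; the function name rmse is illustrative, not from a library):

```python
import math

def rmse(actual, predicted):
    """Root Mean Squared Error: square root of the mean of the
    squared differences between actual and predicted values."""
    n = len(actual)
    return math.sqrt(sum((y - y_hat) ** 2
                         for y, y_hat in zip(actual, predicted)) / n)

print(rmse([3.0, 5.0, 2.5], [2.5, 5.0, 3.0]))  # ≈ 0.408
```

Because the squaring step weights large errors heavily, a single outlier can dominate the result, which is the lack of robustness noted above.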
6. Mean Squared Log Error (MSLE)
MSLE is a variation of Mean Squared Error. Use MSLE when you do not want to heavily penalize large differences between actual and predicted values.
The logarithm was introduced to capture the relative difference between actual and predicted values. To avoid taking the natural log of a possible 0 value, add 1 to both the actual and predicted values before taking the logarithm.
MSLE = (1/N) Σ (log(Yi + 1) − log(Ŷi + 1))²
Here, N = total number of data points, Yi = actual value, Ŷi = predicted value
Advantages of MSLE:
i) Treats small differences between small actual and predicted values the same as big differences between large actual and predicted values.
Disadvantages of MSLE:
i) Penalizes underestimates more than overestimates.
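A minimal sketch of the MSLE formula in plain Python (the function name msle is illustrative), including a check of the underestimate-vs-overestimate behaviour noted above:

```python
import math

def msle(actual, predicted):
    """Mean Squared Log Error: 1 is added to both values
    before taking the log, to avoid log(0)."""
    n = len(actual)
    return sum((math.log(y + 1) - math.log(y_hat + 1)) ** 2
               for y, y_hat in zip(actual, predicted)) / n

# For the same absolute error of 1, underestimating is penalized more:
under = msle([2.0], [1.0])  # predicted below actual
over = msle([2.0], [3.0])   # predicted above actual
# under > over
```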
Linear Regression:
Linear Regression is a supervised machine learning algorithm that performs regression by fitting the straight line that best fits the data points.
Y = b0 + b1X1 + b2X2 + ... + bnXn
Assumptions:
Assumes a linear relationship between the independent variables 'x' and the dependent variable 'y' (Linearity)
Assumes no correlation between the independent variables 'x' (No Multicollinearity)
Assumes residuals have constant variance at every level of x (Homoscedasticity)
Assumes residuals of the model are normally distributed (Normality)
Assumes no pattern is formed when residuals are plotted
Advantages:
Simple implementation
Performs best on linear data
Overfitting can be reduced by regularization
Disadvantages:
Prone to underfitting
Sensitive to outliers
Assumes that data is independent
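For the single-feature case Y = b0 + b1X1, the best-fit coefficients have a closed form (ordinary least squares). A minimal sketch in plain Python (the function name fit_simple_linear is illustrative):

```python
def fit_simple_linear(x, y):
    """Ordinary least squares for y = b0 + b1 * x.
    b1 = covariance(x, y) / variance(x); b0 = mean_y - b1 * mean_x."""
    n = len(x)
    mean_x = sum(x) / n
    mean_y = sum(y) / n
    b1 = (sum((xi - mean_x) * (yi - mean_y) for xi, yi in zip(x, y))
          / sum((xi - mean_x) ** 2 for xi in x))
    b0 = mean_y - b1 * mean_x
    return b0, b1

# Data lying exactly on y = 1 + 2x recovers those coefficients:
b0, b1 = fit_simple_linear([1.0, 2.0, 3.0, 4.0], [3.0, 5.0, 7.0, 9.0])
```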
Statistical tests to check for normal distribution:
Chi-square test
Kolmogorov-Smirnov test
Shapiro-Wilk test
Graphical methods to check for normal distribution:
Histogram
Quantile-Quantile Plot
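Of the tests above, the Kolmogorov-Smirnov statistic is simple enough to sketch in plain Python: it is the largest gap between the empirical CDF of the data and the CDF of a normal distribution fitted to it (a minimal sketch only; a statistics library's KS test would also return a p-value):

```python
import math
import statistics

def normal_cdf(x, mu, sigma):
    """CDF of N(mu, sigma^2), via the error function."""
    return 0.5 * (1 + math.erf((x - mu) / (sigma * math.sqrt(2))))

def ks_statistic(data):
    """One-sample Kolmogorov-Smirnov statistic against a normal
    distribution with the sample's own mean and standard deviation.
    Smaller values mean the data looks closer to normal."""
    xs = sorted(data)
    n = len(xs)
    mu = statistics.mean(xs)
    sigma = statistics.stdev(xs)
    d = 0.0
    for i, x in enumerate(xs):
        cdf = normal_cdf(x, mu, sigma)
        # Compare the fitted CDF to the empirical CDF just before
        # and just after the step at x.
        d = max(d, abs((i + 1) / n - cdf), abs(cdf - i / n))
    return d

# A symmetric sample scores lower (closer to normal) than a skewed one:
d_sym = ks_statistic([1.0, 2.0, 3.0, 4.0, 5.0])
d_skew = ks_statistic([1.0, 1.0, 1.0, 1.0, 10.0])
```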
μ = mean
σ² = variance
σ = standard deviation
Assumption:
Features are continuous
Other variants of Naive Bayes are:
Bernoulli Naive Bayes (Bernoulli distribution) and Multinomial Naive Bayes (multinomial distribution)
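Under the continuous-features assumption, Gaussian Naive Bayes models each feature per class with the μ and σ² defined above, then picks the class maximizing log P(c) + Σ log N(xi; μ, σ²). A minimal sketch in plain Python (function names are illustrative, not from a library):

```python
import math
from statistics import mean, variance

def train_gaussian_nb(X, y):
    """Estimate per-class prior and per-feature (mu, sigma^2) pairs."""
    model = {}
    for c in set(y):
        rows = [x for x, label in zip(X, y) if label == c]
        prior = len(rows) / len(X)
        stats = [(mean(col), variance(col)) for col in zip(*rows)]
        model[c] = (prior, stats)
    return model

def predict(model, x):
    """Return the class with the highest log-posterior score."""
    def log_gauss(v, mu, var):
        # log of the Gaussian density N(v; mu, var)
        return -0.5 * math.log(2 * math.pi * var) - (v - mu) ** 2 / (2 * var)
    best, best_score = None, float("-inf")
    for c, (prior, stats) in model.items():
        score = math.log(prior) + sum(log_gauss(v, mu, var)
                                      for v, (mu, var) in zip(x, stats))
        if score > best_score:
            best, best_score = c, score
    return best

# Toy data: one continuous feature, two well-separated classes.
X = [(1.0,), (1.2,), (0.8,), (5.0,), (5.2,), (4.8,)]
y = [0, 0, 0, 1, 1, 1]
model = train_gaussian_nb(X, y)
```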