
Machine Learning Techniques

(KAI-601)
Unit 2: Regression,
Bayesian Learning and
Support Vector Machine

Mr. Waseem Ahmed


Assistant Professor
CSE-AIML
ABES Engineering College
Regression in ML
• Regression is a supervised learning problem: we are given a dataset of
examples in which both the input and output variables are known.
• The task is to learn a function that predicts the value of Y for a given X.
• In regression, Y is always continuous.
• In linear regression, the prediction is made using the best-fitting straight line.
Independent and Dependent Variables
• Here ‘X’ is the independent
variable and ‘Y’ is the dependent
variable, whose value needs to be
predicted.
• For example, time may be X and the
price of a product may be Y, whose
value changes with the time
variable X.
• The dependent variable is always
continuous.
• In this case, the price of the
product is continuous, as it can
take any value within a range.
Positive and Negative slope
Linear Regression Line
Understanding linear regression line
Understanding Linear Regression Algorithm
Logistic regression
• It analyzes relationships between variables.
• It assigns probabilities to discrete outcomes using the sigmoid function, which
converts a numerical score into a probability between 0 and 1.
• The outcome itself is binary (0 or 1), depending on whether the event happens
or not; the model predicts how likely each outcome is.
• For binary predictions, you can divide the population into two groups with a cut-off
of 0.5.
• Everything above 0.5 is assigned to group A, and everything below to group B.
How Does the Logistic Regression Algorithm Work?
An organization wants to determine an employee’s salary increase based on their
performance.
For this purpose, a linear regression algorithm will help them decide.
Plotting a regression line, with the employee’s performance as the independent
variable and the salary increase as the dependent variable, will make their task easier.
What if the organization wants to know whether an employee will get a promotion or not based
on their performance? This is a binary (yes/no) question, so logistic regression is the right tool.
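The promotion question above can be sketched numerically. This is a minimal illustration of the sigmoid and the 0.5 cut-off; the weight `w`, bias `b`, and the performance scores are made-up values, not fitted from any real data.

```python
import math

def sigmoid(z):
    """Squash a real-valued score into the (0, 1) probability range."""
    return 1.0 / (1.0 + math.exp(-z))

# Hypothetical linear score: w * performance + b (w and b are illustrative).
w, b = 1.5, -6.0

def promotion_probability(performance):
    return sigmoid(w * performance + b)

def predict(performance, cutoff=0.5):
    """Group A ('promoted') above the cut-off, group B below it."""
    return "promoted" if promotion_probability(performance) >= cutoff else "not promoted"

print(predict(5.0))  # score = 1.5, P ≈ 0.82 -> promoted
print(predict(3.0))  # score = -1.5, P ≈ 0.18 -> not promoted
```

In a real logistic regression, `w` and `b` would be learned from labelled examples; only the decision rule is shown here.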
Bayes Theorem

Prerequisites of Bayes' Theorem


1. Experiment
An experiment is a planned operation carried out under controlled conditions,
such as tossing a coin, drawing a card, or rolling a die.
2. Sample Space
The results we can get from an experiment are called its possible outcomes, and the
set of all possible outcomes of an experiment is known as its sample space.
S1 = {1, 2, 3, 4, 5, 6} (rolling a die)
S2 = {Head, Tail} (tossing a coin)
3. Event
An event is defined as a subset of the sample space of an experiment;
it is a set of outcomes.
Bayes Theorem
Consider the following example of tossing two coins.
If we toss two coins and look at all the different possibilities, we have the sample space
as {HH, HT, TH, TT}
Probability (X)= Number of favorable outcomes / Total number of possible outcomes
When calculating probabilities, we usually denote a probability by P. Some of
the probabilities in this experiment are as follows:
•The probability of getting two heads = 1/4
•The probability of at least one tail = 3/4
•The probability of the second coin being head given the first coin is tail = 1/2
•The probability of getting two heads given the first coin is a head = 1/2
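The four probabilities above can be verified by enumerating the sample space {HH, HT, TH, TT} directly; a minimal sketch using only the standard library:

```python
from itertools import product
from fractions import Fraction

# Sample space for tossing two coins: HH, HT, TH, TT
space = list(product("HT", repeat=2))

def prob(event):
    """P(event) = number of favorable outcomes / total number of outcomes."""
    favorable = [o for o in space if event(o)]
    return Fraction(len(favorable), len(space))

print(prob(lambda o: o == ("H", "H")))  # two heads -> 1/4
print(prob(lambda o: "T" in o))         # at least one tail -> 3/4

# Conditional probability: P(second coin is head | first coin is tail)
tail_first = [o for o in space if o[0] == "T"]
head_second = [o for o in tail_first if o[1] == "H"]
print(Fraction(len(head_second), len(tail_first)))  # 1/2
```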
Bayes Theorem
Assume that in our experiment of rolling a die there are two events A and B such that:
A = event that an even number is obtained = {2, 4, 6}
B = event that a number greater than 4 is obtained = {5, 6}
Probability of event A: P(A) = number of favorable outcomes / total number of
possible outcomes
P(A) = 3/6 = 1/2 = 0.5
Probability of event B: P(B) = number of favorable outcomes / total number of
possible outcomes
P(B) = 2/6 = 1/3 ≈ 0.333
Bayes Theorem
Conditional Probability:
Conditional probability is the probability of an event A given that another
event B has already occurred (i.e., A given B). This is written P(A|B) and
defined as:
P(A|B) = P(A ∩ B) / P(B)
Since P(A|B) P(B) = P(A ∩ B) = P(B|A) P(A), we obtain Bayes' theorem:
P(A|B) = P(B|A) P(A) / P(B)
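The definition can be checked against the die events A (even number) and B (number greater than 4) defined above, using exact fractions:

```python
from fractions import Fraction

S = {1, 2, 3, 4, 5, 6}             # sample space of one die roll
A = {x for x in S if x % 2 == 0}   # even number: {2, 4, 6}
B = {x for x in S if x > 4}        # greater than 4: {5, 6}

P_B = Fraction(len(B), len(S))             # 2/6 = 1/3
P_A_and_B = Fraction(len(A & B), len(S))   # A ∩ B = {6} -> 1/6
P_A_given_B = P_A_and_B / P_B              # (1/6) / (1/3) = 1/2
print(P_A_given_B)  # 1/2
```

So given that the roll is greater than 4, the chance that it is even is 1/2 (only 6 of {5, 6} is even).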
Naïve Bayes Classifier
Based on Bayes' theorem, the Naive Bayes classifier computes the conditional probability of a class A given
the observed features B, under the "naive" assumption that the features are conditionally independent given the class.

Suppose we have a dataset of weather conditions and the corresponding target variable "Play".
So using this dataset we need to decide whether we should play or not on a particular day according to the
weather conditions.

To solve this problem, we follow these steps:
1. Convert the given dataset into frequency tables.
2. Generate a likelihood table by finding the probabilities of the given features.
3. Use Bayes' theorem to calculate the posterior probability.
Problem: If the weather is sunny, should the player play or not?
DataSet
Frequency table for the weather conditions:

Weather     Yes   No
Overcast      5    0
Rainy         2    2
Sunny         3    2
Total        10    4
Likelihood table for the weather conditions:

Weather     No            Yes            P(Weather)
Overcast    0             5              5/14 = 0.35
Rainy       2             2              4/14 = 0.29
Sunny       2             3              5/14 = 0.35
All         4/14 = 0.29   10/14 = 0.71
Applying Bayes' theorem:
P(Yes|Sunny) = P(Sunny|Yes) * P(Yes) / P(Sunny)
P(Sunny|Yes) = 3/10 = 0.3
P(Sunny) = 0.35
P(Yes) = 0.71
So P(Yes|Sunny) = 0.3 * 0.71 / 0.35 ≈ 0.60
P(No|Sunny) = P(Sunny|No) * P(No) / P(Sunny)
P(Sunny|No) = 2/4 = 0.5
P(No) = 0.29
P(Sunny) = 0.35
So P(No|Sunny) = 0.5 * 0.29 / 0.35 ≈ 0.41
Since P(Yes|Sunny) > P(No|Sunny),
on a sunny day the player can play the game.
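The calculation above can be reproduced from the frequency table alone. Working with the exact counts (rather than the rounded 0.35/0.71 values) gives P(Yes|Sunny) = 0.6 and P(No|Sunny) = 0.4 exactly:

```python
# Counts taken from the frequency table above (14 days total).
counts = {
    "Overcast": {"Yes": 5, "No": 0},
    "Rainy":    {"Yes": 2, "No": 2},
    "Sunny":    {"Yes": 3, "No": 2},
}
total = 14
n_yes = sum(c["Yes"] for c in counts.values())  # 10
n_no = sum(c["No"] for c in counts.values())    # 4

def posterior(weather):
    """Return (P(Yes|weather), P(No|weather)) via Bayes' theorem."""
    p_weather = (counts[weather]["Yes"] + counts[weather]["No"]) / total
    p_yes = (counts[weather]["Yes"] / n_yes) * (n_yes / total) / p_weather
    p_no = (counts[weather]["No"] / n_no) * (n_no / total) / p_weather
    return p_yes, p_no

p_yes, p_no = posterior("Sunny")
print(round(p_yes, 2), round(p_no, 2))  # 0.6 0.4
```

Because P(Yes|Sunny) > P(No|Sunny), the classifier predicts "play" for a sunny day, matching the result above.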
Bayesian Belief Networks
• A Bayesian Belief Network is a graphical representation of the
probabilistic relationships among a set of random variables.
• It is a "probabilistic graphical model" that represents the
conditional dependencies between random variables
through a Directed Acyclic Graph (DAG).
• Each probability in a Bayesian Belief Network is conditioned on a
node's parents: P(attribute | parents)
(the probability of an attribute given the values of its parent attributes).
• Unlike the Naive Bayes classifier, which treats all attributes as conditionally
independent, a BBN explicitly models the dependencies among attributes.
• The graph of a BBN consists of nodes (variables) and arcs (causal
relationships).
Bayesian Belief Networks
A BBN enables modelling of, and reasoning about, the uncertainty
among these random variables with the help of the dependencies
captured via the arcs of the DAG.
[Figure: example DAGs over variables such as Weather, Rainy, Umbrella Sales, Health, Tea, Green Leaves]
Bayesian Belief Networks
• A BBN works with joint and conditional probabilities.
• The joint probability factorizes over the graph as:
P(X1, X2, ..., Xn) = ∏ i=1..n P(Xi | Parents(Xi))
• where P(Xi | Parents(Xi)) is the probability of each feature
given its parents.
BBN
Find the probability that ‘P1 (JohnCalls)’ is true and ‘P2 (MaryCalls)’ is
true when the alarm ‘A’ rang, but no burglary ‘B’ and no fire ‘F’
occurred.
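Using the factorization P(X1, ..., Xn) = ∏ P(Xi | Parents(Xi)), this query is a single product. The conditional probability values are not given in the slides; the numbers below are the standard textbook (Russell & Norvig burglary-alarm) values, used here only for illustration:

```python
# Assumed textbook CPT values (not from the slides):
P_B = 0.001        # P(Burglary)
P_F = 0.002        # P(Fire); often "Earthquake" in textbooks
P_A_nB_nF = 0.001  # P(Alarm | no burglary, no fire)
P_J_A = 0.90       # P(JohnCalls | Alarm)
P_M_A = 0.70       # P(MaryCalls | Alarm)

# Chain rule over the DAG: each node is conditioned only on its parents.
# P(J, M, A, ¬B, ¬F) = P(J|A) P(M|A) P(A|¬B,¬F) P(¬B) P(¬F)
p = P_J_A * P_M_A * P_A_nB_nF * (1 - P_B) * (1 - P_F)
print(p)  # ≈ 0.00062811
```

Note that JohnCalls and MaryCalls depend only on Alarm, and Alarm only on Burglary and Fire, which is exactly what the DAG structure encodes.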
Support Vector Machine
• SVM is a supervised machine learning algorithm, used to solve
regression and (mainly) classification problems.
• Here, the "vectors" are the training examples with
whose help the classifier is constructed.
• In SVM, a subset of the training data is used to represent the decision
boundary.
• The objective of the Support Vector Machine algorithm is to find
a hyperplane in an N-dimensional space (where N is the number of
features) that distinctly classifies the data points.
• There are many possible hyperplanes that could separate the two classes
of data points.
• Margin: the gap between the two lines through the closest data points of the
different classes. It can be calculated as the perpendicular distance from the
separating line to the support vectors. A large margin is considered good and a
small margin bad.
• The objective is to find the plane with the maximum margin, i.e. the maximum
distance between data points of both classes.
Hyperplanes and Support Vectors
• Hyperplanes are decision boundaries that help classify the data
points. Data points falling on either side of the hyperplane can be
attributed to different classes.
• The dimension of the hyperplane depends upon the number of
features.
• If the number of input features is 2, then the hyperplane is just a
line. If the number of input features is 3, then the hyperplane
becomes a two-dimensional plane.
Hyperplanes in 2D and 3D feature space
Support Vectors
• Support vectors are the data points that lie closest to the hyperplane;
they influence the position and orientation of the hyperplane.
• Using these support vectors, we maximize the margin of the
classifier.
• Deleting a support vector changes the position of the
hyperplane.
• These are the points that help us build the SVM.
SVM Slope / Line Equation
The equation of a straight line is y = mx + c,
or ax + by + c = 0,
or y = wx + b.
In SVM, this equation becomes y = wᵀx + b.
Rearranging ax + by + c = 0 gives
by = -ax - c, i.e. y = -(a/b)x - c/b,
where -a/b is the slope and -c/b is the intercept.
Linear Kernel

Given two vectors x1 and x2, the linear kernel is defined as the dot product of the
two vectors: K(x1, x2) = x1 · x2
Polynomial Kernel
• We can define a polynomial kernel with this equation:
K(x1, x2) = (x1 · x2 + 1)^d
• Here, x1 and x2 are vectors and d represents the degree of the
polynomial.
• K(x1, x2) defines the decision boundary that separates the given classes.
Gaussian Kernel
• The Gaussian kernel is an example of a radial basis function
kernel.
• It can be represented with this equation:
K(x1, x2) = exp(-γ ‖x1 - x2‖²)
• It is used when there is no prior knowledge of the data.
• The value of gamma is typically tuned between 0 and 1.
• We must provide the value of gamma in the code manually.
• A commonly used starting value for gamma is 0.1.
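The three kernels above can be sketched as plain functions. This is a minimal illustration, not a full SVM; the sample vectors x1 and x2 are arbitrary:

```python
import math

def linear_kernel(x1, x2):
    """K(x1, x2) = x1 · x2"""
    return sum(a * b for a, b in zip(x1, x2))

def polynomial_kernel(x1, x2, d=2):
    """K(x1, x2) = (x1 · x2 + 1)^d"""
    return (linear_kernel(x1, x2) + 1) ** d

def gaussian_kernel(x1, x2, gamma=0.1):
    """K(x1, x2) = exp(-gamma * ||x1 - x2||^2)"""
    sq_dist = sum((a - b) ** 2 for a, b in zip(x1, x2))
    return math.exp(-gamma * sq_dist)

x1, x2 = [1.0, 2.0], [3.0, 1.0]
print(linear_kernel(x1, x2))      # 1*3 + 2*1 = 5.0
print(polynomial_kernel(x1, x2))  # (5 + 1)^2 = 36.0
print(round(gaussian_kernel(x1, x2), 4))  # exp(-0.1 * 5) ≈ 0.6065
```

Each kernel returns the dot product of the two inputs in some (possibly implicit) feature space, which is what lets an SVM draw non-linear boundaries without ever computing the mapping explicitly.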