
Naïve Bayes

Naïve Bayes is a supervised machine learning algorithm used to classify data into predefined classes. It uses the concept of conditional probability to classify test data.
Conditional Probability: - It helps us find the probability that one event will happen given that another event has already happened.
Consider two events A and B. Then,

P(A ∩ B) = P(A) × P(B | A)

Here, P(B | A) is the probability that B happens given that A has already happened. For example, the probability of drawing two aces in a row from a standard deck is P(first ace) × P(second ace | first ace) = 4/52 × 3/51.
The fundamental assumptions of Naïve Bayes are that each feature makes an independent and equal contribution to the outcome.

Theorem: - Bayes' theorem tells us how often A happens given that B has already happened, i.e. P(A | B), provided we know how often B happens given that A has already happened, i.e. P(B | A).
In other words, it finds the probability of an event occurring based on the probability of another event that has already occurred.
Mathematically, we can state the theorem as: -

P(A | B) = [ P(B | A) × P(A) ] / P(B)

Here, P(B) ≠ 0.
B is called the evidence.
P(A) is known as the prior probability of A, i.e. the probability of the event before the evidence is seen.
P(A | B) is known as the posterior probability of A, i.e. the probability of the event after the evidence is seen.
In simple terms, Naïve Bayes computes, for each class, the probability of the class given the features, and then selects the class with the highest probability.
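
To make the theorem concrete, here is a minimal Python sketch with made-up numbers; the spam-filter framing and every probability value are purely illustrative:

    # Minimal numeric illustration of Bayes' theorem; all values are made up.
    p_a = 0.01          # prior P(A), e.g. the probability an email is spam
    p_b_given_a = 0.9   # likelihood P(B|A): how often evidence B is seen when A holds
    p_b = 0.05          # evidence P(B): overall probability of seeing B

    p_a_given_b = p_b_given_a * p_a / p_b   # posterior P(A|B) by Bayes' theorem
    print(p_a_given_b)                      # approximately 0.18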
The dataset used by the Naïve Bayes algorithm is divided into two parts:
1. Feature Matrix: - It contains all the vectors (rows) of the dataset, in which each vector consists of the values of the independent features.
2. Response Vector: - It contains the value of the class variable (the prediction or output) for each row of the feature matrix.
Now, let's discuss the working of the Naïve Bayes algorithm mathematically with the help of an example: -

S.No.   Colour   Type     Origin     Stolen (Class)
1       Red      Sports   Domestic   Yes
2       Yellow   Sports   Domestic   No
3       Red      SUV      Imported   No
4       Red      Sports   Imported   Yes
5       Yellow   SUV      Imported   Yes
6       Red      Sports   Imported   Yes

In the above dataset, {"Colour", "Type", "Origin"} are the features and {"Stolen"} is the class variable.
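
In code, the split into a feature matrix and a response vector looks as follows; a minimal Python sketch of the dataset above (variable names are illustrative):

    # Feature matrix: one row per sample, one column per feature.
    X = [
        ["Red",    "Sports", "Domestic"],
        ["Yellow", "Sports", "Domestic"],
        ["Red",    "SUV",    "Imported"],
        ["Red",    "Sports", "Imported"],
        ["Yellow", "SUV",    "Imported"],
        ["Red",    "Sports", "Imported"],
    ]
    # Response vector: the class label for each row of X.
    y = ["Yes", "No", "No", "Yes", "Yes", "Yes"]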
Now, let’s find the output result of Naïve Bayes algorithm on above dataset by using the
following steps: -
Step – 1: - Probability of class variable: -
P(Yes) = 4/6 = 2/3   and   P(No) = 2/6 = 1/3
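
Step 1 can be checked with a few lines of Python; a minimal sketch assuming the response vector y from the dataset above:

    from collections import Counter

    y = ["Yes", "No", "No", "Yes", "Yes", "Yes"]   # class labels from the table

    counts = Counter(y)                            # Counter({'Yes': 4, 'No': 2})
    priors = {cls: n / len(y) for cls, n in counts.items()}
    print(priors)                                  # {'Yes': 0.666..., 'No': 0.333...}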
Step – 2: - Probability of each feature value w.r.t. each class value: -
1. For the Colour feature:

   P(Red | Yes) = 3/4        P(Red | No) = 1/2
   P(Yellow | Yes) = 1/4     P(Yellow | No) = 1/2

2. For the Type feature:

   P(Sports | Yes) = 3/4     P(Sports | No) = 1/2
   P(SUV | Yes) = 1/4        P(SUV | No) = 1/2

3. For the Origin feature:

   P(Domestic | Yes) = 1/4   P(Domestic | No) = 1/2
   P(Imported | Yes) = 3/4   P(Imported | No) = 1/2

(For example, P(Red | Yes) = 3/4 because three of the four stolen cars in the table are red.)
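
These conditional probabilities can likewise be verified by counting; a minimal Python sketch (the helper name likelihood is illustrative):

    X = [
        ["Red", "Sports", "Domestic"], ["Yellow", "Sports", "Domestic"],
        ["Red", "SUV", "Imported"],    ["Red", "Sports", "Imported"],
        ["Yellow", "SUV", "Imported"], ["Red", "Sports", "Imported"],
    ]
    y = ["Yes", "No", "No", "Yes", "Yes", "Yes"]

    def likelihood(feature_index, value, cls):
        # P(feature = value | class = cls), estimated by counting rows of class cls.
        rows = [x for x, label in zip(X, y) if label == cls]
        return sum(1 for x in rows if x[feature_index] == value) / len(rows)

    print(likelihood(0, "Red", "Yes"))      # 0.75 -> P(Red | Yes) = 3/4
    print(likelihood(2, "Domestic", "No"))  # 0.5  -> P(Domestic | No) = 1/2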

Step – 3: - Classify the unknown data using the conditional probability formula of Bayes' theorem.
X = <Yellow, SUV, Domestic>
For this we build a classifier model that computes the probability of the unknown sample for every possible value of the class variable and picks the output with the maximum probability, i.e.

Y = argmax_y [ P(y) × Π_i P(x_i | y) ]

Here, P(y) is the class (prior) probability and P(x_i | y) is the conditional probability of feature x_i given class y.

Now the final calculation will be:

P(X | Yes) = P(Yellow | Yes) × P(SUV | Yes) × P(Domestic | Yes) = 1/4 × 1/4 × 1/4 = 1/64

P(X | No) = P(Yellow | No) × P(SUV | No) × P(Domestic | No) = 1/2 × 1/2 × 1/2 = 1/8

Multiplying in the class priors from Step 1:

P(Yes | X) ∝ P(X | Yes) × P(Yes) = 1/64 × 2/3 = 1/96
P(No | X) ∝ P(X | No) × P(No) = 1/8 × 1/3 = 1/24

Clearly, P(Yes | X) < P(No | X).

Thus, we can say that the yellow SUV of domestic origin has not been stolen.
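
Putting the three steps together, here is a minimal, self-contained Python sketch of the whole classification (no smoothing; names are illustrative), which reproduces the result above:

    from collections import Counter

    X = [
        ["Red", "Sports", "Domestic"], ["Yellow", "Sports", "Domestic"],
        ["Red", "SUV", "Imported"],    ["Red", "Sports", "Imported"],
        ["Yellow", "SUV", "Imported"], ["Red", "Sports", "Imported"],
    ]
    y = ["Yes", "No", "No", "Yes", "Yes", "Yes"]

    def predict(sample):
        # argmax over classes of P(class) * product_i P(x_i | class).
        best_class, best_score = None, -1.0
        for cls, n in Counter(y).items():
            rows = [x for x, label in zip(X, y) if label == cls]
            score = n / len(y)                    # prior P(cls)
            for i, value in enumerate(sample):    # multiply in each likelihood
                score *= sum(1 for x in rows if x[i] == value) / len(rows)
            if score > best_score:
                best_class, best_score = cls, score
        return best_class

    print(predict(["Yellow", "SUV", "Domestic"]))  # 'No' -> predicted not stolen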

Gaussian Naïve Bayes Classifier


In this classifier, the continuous values associated with each feature are assumed to be distributed according to a Gaussian distribution.
Q. What is a Gaussian distribution?
Ans: - It is a distribution which, when plotted, gives a bell-shaped curve that is symmetric about the mean of the feature values.
The likelihood of the features is assumed to be Gaussian, hence the conditional probability is given by:

P(x_i | y) = ( 1 / √(2πσ_y²) ) × exp( −(x_i − μ_y)² / (2σ_y²) )
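
As a sketch, the formula translates directly into Python; mu and sigma stand for the per-class mean and standard deviation of the feature, and the example values below are made up:

    import math

    def gaussian_likelihood(x, mu, sigma):
        # P(x | y) under the Gaussian assumption, with class mean mu and
        # class standard deviation sigma for this feature.
        coeff = 1.0 / math.sqrt(2.0 * math.pi * sigma ** 2)
        return coeff * math.exp(-((x - mu) ** 2) / (2.0 * sigma ** 2))

    print(gaussian_likelihood(4.2, mu=5.0, sigma=1.5))  # likelihood of x = 4.2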
Other popular Naïve Bayes classifiers are also available in the machine learning community. Some of these are (a short usage sketch follows the list):
1. Multinomial Naïve Bayes Classifier
2. Bernoulli Naïve Bayes Classifier
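
In practice these classifiers are rarely hand-rolled; scikit-learn, for instance, ships GaussianNB, MultinomialNB and BernoulliNB in sklearn.naive_bayes. A minimal sketch with made-up word-count data (assumes scikit-learn and NumPy are installed):

    import numpy as np
    from sklearn.naive_bayes import MultinomialNB

    # Toy word-count matrix: rows are documents, columns are vocabulary counts.
    X = np.array([[2, 0, 1], [0, 3, 0], [1, 0, 2], [0, 2, 1]])
    y = np.array(["spam", "ham", "spam", "ham"])

    model = MultinomialNB().fit(X, y)   # applies Laplace smoothing (alpha=1) by default
    print(model.predict([[1, 0, 1]]))   # ['spam'] on this toy data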

Pros: -
1. Requires only a small amount of training data.
2. Easy and fast to predict the class variable.
Cons: -
1. Does not work well when features are correlated, since the independence assumption is violated.
2. It requires some evidence (training examples) for every feature value; an unseen feature value and class combination otherwise gets zero probability unless smoothing is applied.
3. Not well suited to larger, more complex datasets, where its simple model can underfit.

Applications: - Document classification, spam filtering, etc.
