Bayesian Classification
Bayes' Theorem - Thomas Bayes
Bayes' Theorem - Basics
Bayesian Classification
A statistical classifier: performs probabilistic prediction,
i.e., predicts class membership probabilities
Foundation: Based on Bayes’ Theorem.
Performance: A simple Bayesian classifier, the naïve Bayesian
classifier, has performance comparable to decision tree
and selected neural network classifiers
Incremental: Each training example can incrementally
increase/decrease the probability that a hypothesis is
correct — prior knowledge can be combined with observed
data
Standard: Even when Bayesian methods are
computationally intractable, they can provide a standard of
optimal decision making against which other methods can
be measured
Bayes’ Theorem: Basics
Total probability theorem:
P(B) = \sum_{i=1}^{M} P(B | A_i)\, P(A_i)
Bayes' theorem:
P(H | X) = \frac{P(X | H)\, P(H)}{P(X)} = P(X | H)\, P(H) / P(X)
Let X be a data sample (“evidence”): class label is unknown
Let H be a hypothesis that X belongs to class C
Classification is to determine P(H|X), (i.e., posteriori probability):
the probability that the hypothesis holds given the observed
data sample X
P(H) (prior probability): the initial probability
E.g., X will buy computer, regardless of age, income, …
P(X): probability that sample data is observed
P(X|H) (likelihood): the probability of observing the sample X,
given that the hypothesis holds
E.g., Given that X will buy computer, the prob. that X is
31..40 with medium income
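Below is a minimal Python sketch of Bayes' theorem; the prior, likelihood, and evidence values are illustrative assumptions, not figures from the slides.

def posterior(prior_h, likelihood_x_given_h, evidence_x):
    # P(H|X) = P(X|H) * P(H) / P(X)
    return likelihood_x_given_h * prior_h / evidence_x

# Hypothetical numbers: H = "customer buys a computer", X = the observed attributes.
p_h = 0.6          # P(H): prior probability of buying, regardless of attributes
p_x_given_h = 0.3  # P(X|H): probability of observing X among buyers
p_x = 0.25         # P(X): probability of observing X overall

print(posterior(p_h, p_x_given_h, p_x))  # 0.3 * 0.6 / 0.25 = 0.72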
Prediction Based on Bayes’ Theorem
Given training data X, posteriori probability of a
hypothesis H, P(H|X), follows the Bayes’ theorem
P(H | X) = \frac{P(X | H)\, P(H)}{P(X)} = P(X | H)\, P(H) / P(X)
Informally, this can be viewed as
posteriori = likelihood x prior/evidence
Predicts X belongs to Ci iff the probability P(Ci|X) is the
highest among all the P(Ck|X) for all the k classes
Practical difficulty: It requires initial knowledge of
many probabilities, involving significant computational
cost
Naïve Bayes Classification
Let D be a training set of tuples and their associated
class labels, and each tuple is represented by an n-D
attribute vector X = (x1, x2, …, xn)
Suppose there are m classes C1, C2, …, Cm.
Given a tuple X, the classifier will predict that X
belongs to the class having the highest posterior
probability, conditioned on X. That is, the naïve
Bayesian classifier predicts that tuple X belongs to the
class Ci if and only if
P(C_i | X) > P(C_j | X) \quad \text{for } 1 \le j \le m,\ j \ne i
Classification Is to Derive the Maximum Posteriori
Classification is to derive the maximum posteriori, i.e., the
maximal P(Ci|X). This can be derived from Bayes' theorem:
P(C_i | X) = \frac{P(X | C_i)\, P(C_i)}{P(X)}
Since P(X) is constant for all classes, only P(X | C_i)\, P(C_i)
needs to be maximized:
P(C_i | X) \propto P(X | C_i)\, P(C_i)
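Below is a minimal Python sketch of this maximum-posteriori rule; the per-class priors and likelihoods are illustrative placeholders. Because P(X) is the same for every class, it can be dropped without changing which class wins.

priors = {"C1": 0.643, "C2": 0.357}       # P(Ci), illustrative values
likelihoods = {"C1": 0.044, "C2": 0.019}  # P(X|Ci) for one fixed sample X

# Maximizing P(X|Ci)P(Ci)/P(X) picks the same class as maximizing P(X|Ci)P(Ci),
# because P(X) is a constant shared by all classes.
scores = {c: likelihoods[c] * priors[c] for c in priors}
predicted = max(scores, key=scores.get)

print(scores)     # {'C1': 0.028..., 'C2': 0.0067...}
print(predicted)  # 'C1'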
Naïve Bayes Classifier
A simplified assumption: attributes are conditionally
independent (i.e., no dependence relation between
attributes):
P(X | C_i) = \prod_{k=1}^{n} P(x_k | C_i) = P(x_1 | C_i) \times P(x_2 | C_i) \times \cdots \times P(x_n | C_i)
This greatly reduces the computation cost: only the class
distribution needs to be counted
If Ak is categorical, P(xk|Ci) is the number of tuples in
Ci having value xk for Ak divided by |Ci, D| (number of
tuples of Ci in D)
If Ak is continuous-valued, P(xk|Ci) is usually computed
based on a Gaussian distribution with a mean μ and
standard deviation σ:
g(x, \mu, \sigma) = \frac{1}{\sqrt{2\pi}\,\sigma}\, e^{-\frac{(x-\mu)^2}{2\sigma^2}}
and P(xk|Ci) is
P(x_k | C_i) = g(x_k, \mu_{C_i}, \sigma_{C_i})
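Below is a minimal Python sketch of both estimates for a single class Ci, assuming (hypothetically) that the class's training tuples are held in a list of dicts with one categorical and one continuous attribute.

import math

# Hypothetical training tuples belonging to one class Ci.
class_tuples = [
    {"income": "medium", "age_years": 34.0},
    {"income": "low",    "age_years": 41.0},
    {"income": "medium", "age_years": 29.0},
]

def categorical_likelihood(attr, value):
    # P(xk|Ci) for a categorical Ak: tuples of Ci with value xk, divided by |Ci,D|.
    count = sum(1 for t in class_tuples if t[attr] == value)
    return count / len(class_tuples)

def gaussian_likelihood(attr, value):
    # P(xk|Ci) for a continuous Ak: g(xk, mu_Ci, sigma_Ci), with mu and sigma
    # estimated from the class's own tuples.
    xs = [t[attr] for t in class_tuples]
    mu = sum(xs) / len(xs)
    sigma = math.sqrt(sum((x - mu) ** 2 for x in xs) / len(xs))
    return math.exp(-((value - mu) ** 2) / (2 * sigma ** 2)) / (math.sqrt(2 * math.pi) * sigma)

print(categorical_likelihood("income", "medium"))  # 2/3
print(gaussian_likelihood("age_years", 30.0))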
How to Predict a Class Label Using Naïve Bayesian Classification?
Given class-labeled training tuples from the
AllElectronics customer database.
The data tuples are described by the
attributes age, income, student, and
credit_rating.
Naïve Bayes Classifier: Training Dataset
Class: C1: buys_computer = 'yes' & C2: buys_computer = 'no'

age      income   student  credit_rating  buys_computer
<=30     high     no       fair           no
<=30     high     no       excellent      no
31…40    high     no       fair           yes
>40      medium   no       fair           yes
>40      low      yes      fair           yes
>40      low      yes      excellent      no
31…40    low      yes      excellent      yes
<=30     medium   no       fair           no
<=30     low      yes      fair           yes
>40      medium   yes      fair           yes
<=30     medium   yes      excellent      yes
31…40    medium   no       excellent      yes
31…40    high     yes      fair           yes
>40      medium   no       excellent      no

P(Ci): P(buys_computer = “yes”) = 9/14 = 0.643
       P(buys_computer = “no”) = 5/14 = 0.357
Naïve Bayes Classifier: An Example
The tuple we wish to classify is
X = (age = youth (i.e., <=30), income = medium, student = yes, credit_rating = fair)
Compute P(X|Ci) for each class:
P(age = “youth” | buys_computer = “yes”) = 2/9 = 0.222
P(age = “youth” | buys_computer = “no”) = 3/5 = 0.600
P(income = “medium” | buys_computer = “yes”) = 4/9 = 0.444
P(income = “medium” | buys_computer = “no”) = 2/5 = 0.400
P(student = “yes” | buys_computer = “yes”) = 6/9 = 0.667
P(student = “yes” | buys_computer = “no”) = 1/5 = 0.200
P(credit_rating = “fair” | buys_computer = “yes”) = 6/9 = 0.667
P(credit_rating = “fair” | buys_computer = “no”) = 2/5 = 0.400
X = (age = “youth”, income = “medium”, student = “yes”, credit_rating = “fair”)
P(X | buys_computer = “yes”) = 0.222 × 0.444 × 0.667 × 0.667 = 0.044
P(X | buys_computer = “no”) = 0.600 × 0.400 × 0.200 × 0.400 = 0.019
P(X | buys_computer = “yes”) P(buys_computer = “yes”) = 0.044 × 0.643 = 0.028
P(X | buys_computer = “no”) P(buys_computer = “no”) = 0.019 × 0.357 = 0.007
The condition P(X | C1)P(C1) > P(X | C2)P(C2) is satisfied, therefore the
naïve Bayesian classifier predicts buys_computer = “yes” for tuple X.
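Below is a minimal Python sketch (not part of the original slides) that re-derives these numbers directly from the AllElectronics training table using the counting formulas above.

from collections import Counter

# (age, income, student, credit_rating, buys_computer)
data = [
    ("<=30",   "high",   "no",  "fair",      "no"),
    ("<=30",   "high",   "no",  "excellent", "no"),
    ("31..40", "high",   "no",  "fair",      "yes"),
    (">40",    "medium", "no",  "fair",      "yes"),
    (">40",    "low",    "yes", "fair",      "yes"),
    (">40",    "low",    "yes", "excellent", "no"),
    ("31..40", "low",    "yes", "excellent", "yes"),
    ("<=30",   "medium", "no",  "fair",      "no"),
    ("<=30",   "low",    "yes", "fair",      "yes"),
    (">40",    "medium", "yes", "fair",      "yes"),
    ("<=30",   "medium", "yes", "excellent", "yes"),
    ("31..40", "medium", "no",  "excellent", "yes"),
    ("31..40", "high",   "yes", "fair",      "yes"),
    (">40",    "medium", "no",  "excellent", "no"),
]

def classify(x):
    class_counts = Counter(row[-1] for row in data)   # 9 'yes', 5 'no'
    scores = {}
    for c, n_c in class_counts.items():
        rows_c = [row for row in data if row[-1] == c]
        score = n_c / len(data)                        # prior P(Ci)
        for k, value in enumerate(x):                  # product of P(xk|Ci)
            count = sum(1 for row in rows_c if row[k] == value)
            score *= count / n_c
        scores[c] = score
    return scores, max(scores, key=scores.get)

# X = (age = youth/<=30, income = medium, student = yes, credit_rating = fair)
scores, label = classify(("<=30", "medium", "yes", "fair"))
print(scores)  # approximately {'no': 0.007, 'yes': 0.028}
print(label)   # 'yes'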
Avoiding the Zero-Probability Problem
Naïve Bayesian prediction requires each conditional
probability to be non-zero; otherwise, the predicted
probability will be zero:
P(X | C_i) = \prod_{k=1}^{n} P(x_k | C_i)
Ex. Suppose a dataset with 1000 tuples: income = low
(0), income = medium (990), and income = high (10)
Use Laplacian correction (or Laplacian estimator)
Adding 1 to each case
Prob(income = low) = 1/1003
Prob(income = medium) = 991/1003
Prob(income = high) = 11/1003
The “corrected” prob. estimates are close to their
“uncorrected” counterparts
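Below is a minimal Python sketch of the Laplacian correction applied to the income example above.

counts = {"low": 0, "medium": 990, "high": 10}  # income counts in the 1000-tuple dataset
total = sum(counts.values())                    # 1000

uncorrected = {v: c / total for v, c in counts.items()}
# Laplacian correction: add 1 to each of the 3 value counts,
# so the denominator grows by 3 (1000 + 3 = 1003).
corrected = {v: (c + 1) / (total + len(counts)) for v, c in counts.items()}

print(uncorrected["low"])   # 0.0 -> would zero out the whole product P(X|Ci)
print(corrected["low"])     # 1/1003
print(corrected["medium"])  # 991/1003
print(corrected["high"])    # 11/1003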
Naïve Bayes Classifier: Comments
Advantages
Easy to implement
Good results obtained in most of the cases
Disadvantages
Assumption: class conditional independence,
therefore loss of accuracy
Practically, dependencies exist among variables
E.g., hospital patient data: Profile (age, family history, etc.),
Symptoms (fever, cough, etc.), Disease (lung cancer,
diabetes, etc.)
Dependencies among these cannot be modeled by the naïve
Bayes classifier
Thank you….