Lecture Slides for
INTRODUCTION
TO
MACHINE
LEARNING
3RD EDITION
ETHEM ALPAYDIN
© The MIT Press, 2014
alpaydin@boun.edu.tr
http://www.cmpe.boun.edu.tr/~ethem/i2ml3e
CHAPTER 8:
NONPARAMETRIC
METHODS
Nonparametric Estimation
Parametric (single global model), semiparametric
(small number of local models)
Nonparametric: Similar inputs have similar outputs
Functions (pdf, discriminant, regression) change
smoothly
Keep the training data; "let the data speak for
itself"
Given x, find a small number of closest training
instances and interpolate from these
Aka lazy/memory-based/case-based/instance-based learning
Density Estimation
Given the training set $X=\{x^t\}_{t=1}^{N}$ drawn iid from $p(x)$
Divide data into bins of size h
Histogram:
$$\hat{p}(x) = \frac{\#\{x^t \text{ in the same bin as } x\}}{Nh}$$
Naive estimator:
$$\hat{p}(x) = \frac{\#\{x - h < x^t \le x + h\}}{2Nh}$$
or
$$\hat{p}(x) = \frac{1}{Nh}\sum_{t=1}^{N} w\!\left(\frac{x - x^t}{h}\right), \qquad
w(u) = \begin{cases} 1/2 & \text{if } |u| < 1 \\ 0 & \text{otherwise} \end{cases}$$
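Below is a minimal sketch (not from the slides) of the naive estimator in Python, assuming one-dimensional data and the window half-width h defined above:

```python
import numpy as np

# Illustrative sketch: naive density estimator
# p_hat(x) = #{x - h < x^t <= x + h} / (2 N h)
def naive_estimator(x, data, h):
    """Estimate p(x) by counting training points within a window of half-width h."""
    data = np.asarray(data)
    N = len(data)
    count = np.sum((data > x - h) & (data <= x + h))
    return count / (2 * N * h)

# Example: estimate the density of a standard normal sample at x = 0
rng = np.random.default_rng(0)
sample = rng.standard_normal(1000)
print(naive_estimator(0.0, sample, h=0.5))   # close to 1/sqrt(2*pi) ~ 0.40
```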
Kernel Estimator
Kernel function, e.g., Gaussian kernel:
$$K(u) = \frac{1}{\sqrt{2\pi}} \exp\!\left[-\frac{u^2}{2}\right]$$
Kernel estimator (Parzen windows)
$$\hat{p}(x) = \frac{1}{Nh}\sum_{t=1}^{N} K\!\left(\frac{x - x^t}{h}\right)$$
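A corresponding sketch of the Parzen-window estimator with a Gaussian kernel (illustrative, not the book's code):

```python
import numpy as np

# Illustrative sketch: Parzen-window / kernel density estimate
# p_hat(x) = (1 / (N h)) * sum_t K((x - x^t) / h) with a Gaussian kernel
def gaussian_kernel(u):
    return np.exp(-0.5 * u**2) / np.sqrt(2 * np.pi)

def kernel_estimator(x, data, h):
    data = np.asarray(data)
    N = len(data)
    return np.sum(gaussian_kernel((x - data) / h)) / (N * h)

rng = np.random.default_rng(0)
sample = rng.standard_normal(1000)
print(kernel_estimator(0.0, sample, h=0.3))   # ~0.4 for a standard normal
```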
k-Nearest Neighbor Estimator
Instead of fixing the bin width h and counting the number of instances in it, fix the number of nearest instances (neighbors) k and compute the corresponding bin width
$$\hat{p}(x) = \frac{k}{2N d_k(x)}$$
where $d_k(x)$ is the distance to the kth closest training instance to x
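A minimal sketch of the k-NN density estimate for one-dimensional data (illustrative only; the choice of k is an assumption):

```python
import numpy as np

# Illustrative sketch: k-NN density estimate
# p_hat(x) = k / (2 N d_k(x)), where d_k(x) is the distance to the kth nearest training point
def knn_estimator(x, data, k):
    data = np.asarray(data)
    N = len(data)
    d_k = np.sort(np.abs(data - x))[k - 1]   # distance to the kth closest instance
    return k / (2 * N * d_k)

rng = np.random.default_rng(0)
sample = rng.standard_normal(1000)
print(knn_estimator(0.0, sample, k=30))
```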
Multivariate Data
Kernel density estimator
$$\hat{p}(x) = \frac{1}{Nh^d}\sum_{t=1}^{N} K\!\left(\frac{x - x^t}{h}\right)$$
Multivariate Gaussian kernel
spheric:
$$K(u) = \left(\frac{1}{\sqrt{2\pi}}\right)^{\!d} \exp\!\left[-\frac{\|u\|^2}{2}\right]$$
ellipsoid:
$$K(u) = \frac{1}{(2\pi)^{d/2}\,|S|^{1/2}} \exp\!\left[-\frac{1}{2}\, u^T S^{-1} u\right]$$
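An illustrative sketch of the multivariate estimator with the spheric Gaussian kernel (not from the slides):

```python
import numpy as np

# Illustrative sketch: multivariate kernel density estimate
# with a spheric Gaussian kernel, p_hat(x) = (1/(N h^d)) * sum_t K((x - x^t)/h)
def multivariate_kde(x, data, h):
    data = np.asarray(data)            # shape (N, d)
    N, d = data.shape
    u = (x - data) / h                 # scaled differences, shape (N, d)
    k = np.exp(-0.5 * np.sum(u**2, axis=1)) / (2 * np.pi) ** (d / 2)
    return np.sum(k) / (N * h**d)

rng = np.random.default_rng(0)
sample = rng.standard_normal((1000, 2))
print(multivariate_kde(np.zeros(2), sample, h=0.5))   # roughly 1/(2*pi) at the mode
```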
Nonparametric Classification
Estimate p(x|Ci) and use Bayes’ rule
Kernel estimator
$$\hat{p}(x|C_i) = \frac{1}{N_i h^d}\sum_{t=1}^{N} K\!\left(\frac{x - x^t}{h}\right) r_i^t, \qquad \hat{P}(C_i) = \frac{N_i}{N}$$
$$g_i(x) = \hat{p}(x|C_i)\,\hat{P}(C_i) = \frac{1}{N h^d}\sum_{t=1}^{N} K\!\left(\frac{x - x^t}{h}\right) r_i^t$$
k-NN estimator
$$\hat{p}(x|C_i) = \frac{k_i}{N_i V^k(x)}, \qquad \hat{p}(x) = \frac{k}{N V^k(x)}$$
$$\hat{P}(C_i|x) = \frac{\hat{p}(x|C_i)\,\hat{P}(C_i)}{\hat{p}(x)} = \frac{k_i}{k}$$
where $V^k(x)$ is the volume of the hypersphere centered at $x$ with radius $d_k(x)$, and $k_i$ of the $k$ nearest neighbors of $x$ belong to class $C_i$
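Since $\hat{P}(C_i|x) = k_i/k$, k-NN classification reduces to a majority vote among the k nearest neighbors. A minimal sketch (illustrative; Euclidean distance is an assumption):

```python
import numpy as np

# Illustrative sketch: k-NN classification.
# P_hat(C_i | x) = k_i / k, so predict the class with the most votes
# among the k nearest training instances.
def knn_classify(x, data, labels, k):
    data = np.asarray(data)                       # shape (N, d)
    labels = np.asarray(labels)                   # shape (N,)
    dists = np.linalg.norm(data - x, axis=1)      # distances to all instances
    nearest = np.argsort(dists)[:k]               # indices of the k closest instances
    votes = np.bincount(labels[nearest])          # k_i for each class i
    return np.argmax(votes)                       # class with the largest k_i / k

rng = np.random.default_rng(0)
X0 = rng.normal(loc=0.0, size=(50, 2))
X1 = rng.normal(loc=3.0, size=(50, 2))
X = np.vstack([X0, X1]); y = np.array([0]*50 + [1]*50)
print(knn_classify(np.array([2.5, 2.5]), X, y, k=5))   # expected: 1
```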
Condensed Nearest Neighbor
Time/space complexity of k-NN is O(N)
Find a subset Z of X that is small and is accurate in
classifying X (Hart, 1968)
$$E'(Z|X) = E(X|Z) + \lambda|Z|$$
Condensed Nearest Neighbor
Incremental algorithm: add an instance to Z only if it is misclassified by the current Z (see the sketch below)
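A sketch of the incremental condensing pass, under the assumption that 1-NN with Euclidean distance is used and instances are visited in random order:

```python
import numpy as np

# Illustrative sketch: condensed 1-NN (Hart, 1968).
# Add an instance to the stored subset Z only if the current Z misclassifies it,
# and repeat passes over X until no more instances are added.
def condense(X, y, rng=np.random.default_rng(0)):
    X, y = np.asarray(X), np.asarray(y)
    Z_idx = [0]                                   # start with one stored instance
    changed = True
    while changed:
        changed = False
        for i in rng.permutation(len(X)):         # pass over X in random order
            Z = X[Z_idx]
            nearest = Z_idx[np.argmin(np.linalg.norm(Z - X[i], axis=1))]
            if y[nearest] != y[i]:                # misclassified by current Z: store it
                Z_idx.append(i)
                changed = True
    return X[Z_idx], y[Z_idx]
```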
Distance-based Classification
Find a distance function $D(x^r, x^s)$ such that if $x^r$ and $x^s$ belong to the same class, the distance is small, and if they belong to different classes, the distance is large
Assume a parametric model and learn its parameters using data, e.g., a quadratic distance parameterized by a positive semidefinite matrix M:
$$D(x^r, x^s \mid M) = (x^r - x^s)^T M (x^r - x^s)$$
Learning a Distance Function
The three-way relationship between distances,
dimensionality reduction, and feature extraction.
$M = L^T L$ is $d \times d$ and $L$ is $k \times d$
Similarity-based representation using similarity
scores
Large-margin nearest neighbor (chapter 13)
Figure: Euclidean distance (circle) is not suitable; Mahalanobis distance using an M (ellipse) is suitable. After the data is projected along L, Euclidean distance can be used.
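A small sketch showing that the learned quadratic distance with $M = L^T L$ equals the squared Euclidean distance after projecting along L (the matrices below are random placeholders, not learned):

```python
import numpy as np

# Illustrative sketch: quadratic distance D(x, y | M) = (x - y)^T M (x - y)
# with M = L^T L, which equals the squared Euclidean distance between L x and L y.
def quadratic_distance(x, y, M):
    d = x - y
    return d @ M @ d

rng = np.random.default_rng(0)
L = rng.standard_normal((2, 5))       # k x d projection matrix (k=2, d=5)
M = L.T @ L                           # d x d, positive semidefinite
x, y = rng.standard_normal(5), rng.standard_normal(5)
print(quadratic_distance(x, y, M))
print(np.sum((L @ x - L @ y) ** 2))   # same value: distance in the projected space
```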
Outlier Detection
Find outlier/novelty points
Not a two-class problem because outliers are very
few, of many types, and seldom labeled
Instead, one-class classification problem: Find
instances that have low probability
In nonparametric case: Find instances far away from
other instances
Local Outlier Factor
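As a rough sketch (not from the slides): LOF compares the local density around an instance with the local densities around its k nearest neighbors; a score well above 1 flags an outlier.

```python
import numpy as np

# Illustrative sketch: local outlier factor (LOF).
# An instance whose local density is much lower than that of its neighbors
# gets LOF >> 1 and is flagged as an outlier.
def lof_scores(X, k=5):
    X = np.asarray(X)
    N = len(X)
    D = np.linalg.norm(X[:, None, :] - X[None, :, :], axis=-1)   # pairwise distances
    np.fill_diagonal(D, np.inf)                                  # a point is not its own neighbor
    neighbors = np.argsort(D, axis=1)[:, :k]                     # k nearest neighbors
    k_dist = np.sort(D, axis=1)[:, k - 1]                        # distance to the kth neighbor

    # reachability distances and local reachability density (lrd)
    lrd = np.empty(N)
    for i in range(N):
        reach = np.maximum(k_dist[neighbors[i]], D[i, neighbors[i]])
        lrd[i] = 1.0 / np.mean(reach)

    # LOF: average ratio of the neighbors' lrd to the instance's own lrd
    return np.array([np.mean(lrd[neighbors[i]]) / lrd[i] for i in range(N)])

rng = np.random.default_rng(0)
X = np.vstack([rng.standard_normal((100, 2)), [[8.0, 8.0]]])     # one far-away point
print(lof_scores(X, k=5)[-1])                                    # large score for the outlier
```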
Nonparametric Regression
Aka smoothing models
Regressogram
$$\hat{g}(x) = \frac{\sum_{t=1}^{N} b(x, x^t)\, r^t}{\sum_{t=1}^{N} b(x, x^t)}$$
where
$$b(x, x^t) = \begin{cases} 1 & \text{if } x^t \text{ is in the same bin as } x \\ 0 & \text{otherwise} \end{cases}$$
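A minimal regressogram sketch for one-dimensional inputs (the bin origin and width are assumptions):

```python
import numpy as np

# Illustrative sketch: regressogram.
# g_hat(x) = average of r^t over the training instances x^t in the same bin as x.
def regressogram(x, data_x, data_r, origin=0.0, h=0.5):
    data_x, data_r = np.asarray(data_x), np.asarray(data_r)
    same_bin = np.floor((data_x - origin) / h) == np.floor((x - origin) / h)
    if not np.any(same_bin):
        return np.nan                      # empty bin: estimate undefined
    return np.mean(data_r[same_bin])

rng = np.random.default_rng(0)
xs = rng.uniform(0, 5, 200)
rs = np.sin(xs) + rng.normal(scale=0.1, size=200)
print(regressogram(1.0, xs, rs, h=0.5))    # average r^t over the bin containing x = 1.0
```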
Running Mean/Kernel Smoother
Running mean smoother:
$$\hat{g}(x) = \frac{\sum_{t=1}^{N} w\!\left(\frac{x - x^t}{h}\right) r^t}{\sum_{t=1}^{N} w\!\left(\frac{x - x^t}{h}\right)}, \qquad
w(u) = \begin{cases} 1 & \text{if } |u| < 1 \\ 0 & \text{otherwise} \end{cases}$$
Kernel smoother:
$$\hat{g}(x) = \frac{\sum_{t=1}^{N} K\!\left(\frac{x - x^t}{h}\right) r^t}{\sum_{t=1}^{N} K\!\left(\frac{x - x^t}{h}\right)}$$
where K(·) is Gaussian
Additive models (Hastie and Tibshirani, 1990)
Running line smoother
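A sketch of the Gaussian kernel smoother (illustrative; the bandwidth h is an assumption):

```python
import numpy as np

# Illustrative sketch: Gaussian kernel smoother.
# g_hat(x) = weighted average of r^t with weights K((x - x^t)/h).
def kernel_smoother(x, data_x, data_r, h):
    data_x, data_r = np.asarray(data_x), np.asarray(data_r)
    w = np.exp(-0.5 * ((x - data_x) / h) ** 2)    # Gaussian weights
    return np.sum(w * data_r) / np.sum(w)

rng = np.random.default_rng(0)
xs = rng.uniform(0, 5, 200)
rs = np.sin(xs) + rng.normal(scale=0.1, size=200)
print(kernel_smoother(1.0, xs, rs, h=0.25))       # close to sin(1.0) ~ 0.84
```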
How to Choose k or h ?
When k or h is small, single instances matter; bias is
small, variance is large (undersmoothing): High
complexity
As k or h increases, we average over more instances
and variance decreases but bias increases
(oversmoothing): Low complexity
Cross-validation is used to fine-tune k or h.
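A sketch of choosing h for a kernel smoother by k-fold cross-validation (illustrative; the candidate grid and fold count are assumptions):

```python
import numpy as np

# Illustrative sketch: choose the smoother width h by k-fold cross-validation,
# keeping the h with the smallest average validation squared error.
def smooth(x, tx, tr, h):
    w = np.exp(-0.5 * ((x - tx) / h) ** 2)        # Gaussian kernel smoother
    return np.sum(w * tr) / np.sum(w)

def cv_choose_h(xs, rs, candidates, n_folds=5, seed=0):
    xs, rs = np.asarray(xs), np.asarray(rs)
    idx = np.random.default_rng(seed).permutation(len(xs))
    folds = np.array_split(idx, n_folds)
    errors = []
    for h in candidates:
        errs = []
        for f in folds:
            train = np.setdiff1d(idx, f)          # training indices for this fold
            preds = np.array([smooth(x, xs[train], rs[train], h) for x in xs[f]])
            errs.append(np.mean((rs[f] - preds) ** 2))
        errors.append(np.mean(errs))
    return candidates[int(np.argmin(errors))]

rng = np.random.default_rng(0)
xs = rng.uniform(0, 5, 200)
rs = np.sin(xs) + rng.normal(scale=0.1, size=200)
print(cv_choose_h(xs, rs, candidates=[0.1, 0.25, 0.5, 1.0]))
```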