MA5232 Modeling and Numerical Simulations: Iterative Methods for Mixture-Model Segmentation (8 Apr 2015)

This document summarizes a lecture on iterative methods for mixture-model segmentation and subspace clustering. It reviews K-means clustering and the Expectation-Maximization (EM) algorithm for central clustering, then discusses modeling data with a mixture of subspaces and formulates the K-subspaces and EM algorithms for subspace segmentation. The EM algorithm estimates subspace membership probabilities and model parameters iteratively through E and M steps. Key differences between K-subspaces and EM are noted, and homework is assigned from exercises in the lecture handout.



MA5232 Modeling and Numerical Simulations
Lecture 2: Iterative Methods for Mixture-Model Segmentation
8 Apr 2015
National University of Singapore

Last time
PCA reduces the dimensionality of a data set while retaining as much of the data variation as possible.
Statistical view: the leading PCs are given by the leading eigenvectors of the covariance matrix.
Geometric view: fitting a d-dimensional subspace model via the SVD.

Extensions of PCA
Probabilistic PCA via MLE
Kernel PCA via kernel functions and kernel matrices

This lecture
Review basic iterative algorithms for central clustering
Formulation of the subspace segmentation problem


Segmentation by Clustering

From: Object Recognition as Machine Translation, Duygulu, Barnard, de Freitas, Forsyth, ECCV02


Example 4.1
Euclidean distance-based clustering is not invariant to linear transformations.
The distance metric needs to be adjusted after a linear transformation.


Central Clustering
Assume the data are sampled from a mixture of Gaussians.
The classical distance metric between a sample and the mean of the j-th cluster is the Mahalanobis distance.
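In standard notation (a sketch consistent with the usual definition, not necessarily the handout's symbols), the Mahalanobis distance between a sample $\boldsymbol{x}$ and the mean $\boldsymbol{\mu}_j$ of the $j$-th cluster with covariance $\Sigma_j$ is

$$ d_{\Sigma_j}(\boldsymbol{x}, \boldsymbol{\mu}_j) = \sqrt{(\boldsymbol{x} - \boldsymbol{\mu}_j)^\top \Sigma_j^{-1} (\boldsymbol{x} - \boldsymbol{\mu}_j)}, $$

which reduces to the Euclidean distance when $\Sigma_j = I$; this is exactly the adjustment under a linear transformation mentioned in Example 4.1.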


Central Clustering: K-Means
Assume a map function provides each i-th sample with a cluster label.
An optimal clustering minimizes the within-cluster scatter, i.e., the average distance of all samples to their respective cluster means.
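With labels $c(i) \in \{1, \dots, K\}$ and cluster means $\boldsymbol{\mu}_j$, the within-cluster scatter takes the standard form (a sketch of the usual objective):

$$ \min_{\{\boldsymbol{\mu}_j\},\, c} \; \frac{1}{n} \sum_{i=1}^{n} \big\| \boldsymbol{x}_i - \boldsymbol{\mu}_{c(i)} \big\|^2 . $$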

Central Clustering: K-Means
However, since K is user-defined, the within-cluster scatter can always be driven to zero by letting each point become a cluster by itself (K = n).
In this chapter, we assume the true K is known.

Algorithm
A chicken-and-egg view


Two-Step Iteration
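A minimal NumPy sketch of the standard two-step iteration (assignment, then re-estimation); the function and variable names are illustrative, not the handout's notation:

```python
import numpy as np

def kmeans(X, K, n_iters=100, seed=0):
    """Minimal K-means sketch. X is an (n, D) data matrix, K the number of clusters."""
    rng = np.random.default_rng(seed)
    # Initialize the cluster means with K distinct samples chosen at random.
    means = X[rng.choice(len(X), size=K, replace=False)]
    for _ in range(n_iters):
        # Segmentation step: assign each sample to its nearest cluster mean.
        dists = np.linalg.norm(X[:, None, :] - means[None, :, :], axis=2)
        labels = dists.argmin(axis=1)
        # Estimation step: recompute each mean from the samples assigned to it.
        new_means = np.array([X[labels == j].mean(axis=0) if np.any(labels == j)
                              else means[j] for j in range(K)])
        if np.allclose(new_means, means):   # stop when the means no longer move
            break
        means = new_means
    return labels, means
```

For example, `labels, means = kmeans(X, K=3)` segments an (n, D) array `X` into three clusters.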



Example
http://util.io/k-means


Feature Space

Source: K. Grauman


Results of K-Means Clustering
[Figure: original image; clusters on intensity; clusters on color]
K-means clustering using intensity alone and color alone.
* From Marc Pollefeys, COMP 256, 2003
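Segmentations like these can be reproduced, up to the choice of K and features, by running K-means directly on the pixel values. A hedged sketch, reusing the `kmeans` function sketched earlier and assuming `img` is an (H, W, 3) RGB array loaded elsewhere (hypothetical):

```python
pixels = img.reshape(-1, 3).astype(float)     # one 3-D color feature per pixel
labels, means = kmeans(pixels, K=4)           # cluster on color alone
segmented = means[labels].reshape(img.shape)  # replace each pixel by its cluster mean
```

For clustering on intensity alone, one would instead use a single gray-level feature per pixel.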


A bad local optimum


Characteristics of K-Means
It is a greedy algorithm and is not guaranteed to converge to the global optimum.
Given fixed initial clusters/Gaussian models, the iterative process is deterministic.
Results may be improved by running K-means multiple times with different starting conditions (see the sketch below).
The segmentation-estimation process can be treated as a generalized expectation-maximization algorithm.
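A brief sketch of the multiple-restart heuristic, reusing the `kmeans` sketch above and scoring each run by its within-cluster scatter (names are illustrative):

```python
def kmeans_restarts(X, K, n_restarts=10):
    best = None
    for s in range(n_restarts):
        labels, means = kmeans(X, K, seed=s)
        scatter = np.sum((X - means[labels]) ** 2)   # within-cluster scatter of this run
        if best is None or scatter < best[0]:
            best = (scatter, labels, means)
    return best[1], best[2]
```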


EM Algorithm [Dempster-Laird-Rubin 1977]
Expectation Maximization (EM) estimates the model parameters and the segmentation in a maximum-likelihood (ML) sense.
Assume the samples are independently drawn from a mixture distribution, with the component of each sample indicated by a hidden discrete variable z.
The conditional distribution given z can be Gaussian.

The Maximum-Likelihood Estimation
The unknown parameters are the mixing weights and the component (e.g., Gaussian) parameters.
The likelihood function is the product of the mixture densities of the independent samples.
The optimal solution maximizes the log-likelihood.
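With mixing weights $\pi_j$ and component densities $p(\boldsymbol{x} \mid \theta_j)$, the log-likelihood of the $n$ independent samples has the standard mixture form (a sketch in common notation):

$$ \ell(\theta) = \sum_{i=1}^{n} \log p(\boldsymbol{x}_i \mid \theta) = \sum_{i=1}^{n} \log \sum_{j=1}^{K} \pi_j \, p(\boldsymbol{x}_i \mid \theta_j). $$

The log of a sum couples all the components, which is what makes direct maximization difficult.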




The Maximum-Likelihood Estimation
Directly maximizing the log-likelihood function is a high-dimensional nonlinear optimization problem.


Define a new function:
The first term is called the expected complete log-likelihood function;
the second term is the conditional entropy.
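In the usual EM notation (a sketch; the handout's symbols may differ), with membership weights $w_{ij}$ over the hidden labels $z_i$, the new function can be written as

$$ g(\theta, w) \;=\; \sum_{i=1}^{n}\sum_{j=1}^{K} w_{ij} \log\!\big(\pi_j\, p(\boldsymbol{x}_i \mid \theta_j)\big) \;-\; \sum_{i=1}^{n}\sum_{j=1}^{K} w_{ij} \log w_{ij}, $$

where the first sum is the expected complete log-likelihood and the second term is the conditional entropy of the hidden labels. This $g$ lower-bounds the log-likelihood, with equality when $w_{ij}$ equals the posterior $p(z_i = j \mid \boldsymbol{x}_i, \theta)$.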


Observation:


The Maximum-Likelihood Estimation
Regard the (incomplete) log-likelihood as a function of two variables.
Maximize g iteratively (an E step followed by an M step).



The iteration converges to a stationary point.


Prop 4.2: Update



Update
Recall the definition of g above.
Assume the membership probabilities (from the E step) are fixed; then maximize the expected complete log-likelihood.


To maximize the expected log-likelihood, assume as an example that each cluster is an isotropic normal distribution.
Eliminate the constant term in the objective.



Exercise 4.2
Compared to K-means, EM assigns the samples softly to each cluster according to a set of probabilities.
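Concretely (a sketch for the isotropic-Gaussian model above), the E step assigns sample $\boldsymbol{x}_i$ to cluster $j$ with probability

$$ w_{ij} = \frac{\pi_j \, \mathcal{N}(\boldsymbol{x}_i \mid \boldsymbol{\mu}_j, \sigma_j^2 I)}{\sum_{l=1}^{K} \pi_l \, \mathcal{N}(\boldsymbol{x}_i \mid \boldsymbol{\mu}_l, \sigma_l^2 I)}, $$

whereas K-means replaces this soft weight by a hard 0/1 assignment to the nearest mean.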


EM Algorithm
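A minimal NumPy sketch of the resulting EM iteration for an isotropic Gaussian mixture (an assumed model consistent with the example above; names are illustrative, not the handout's code):

```python
import numpy as np

def em_isotropic_gmm(X, K, n_iters=100, seed=0):
    """Minimal EM sketch for a K-component isotropic Gaussian mixture."""
    n, D = X.shape
    rng = np.random.default_rng(seed)
    mu = X[rng.choice(n, size=K, replace=False)]   # component means
    var = np.full(K, X.var())                      # isotropic variances sigma_j^2
    pi = np.full(K, 1.0 / K)                       # mixing weights
    for _ in range(n_iters):
        # E step: posterior membership probabilities w_ij (soft assignments).
        sq = ((X[:, None, :] - mu[None, :, :]) ** 2).sum(axis=2)            # (n, K)
        logp = -0.5 * sq / var - 0.5 * D * np.log(2 * np.pi * var) + np.log(pi)
        logp -= logp.max(axis=1, keepdims=True)                              # numerical stability
        w = np.exp(logp)
        w /= w.sum(axis=1, keepdims=True)
        # M step: re-estimate weights, means, and variances from the soft assignments.
        Nj = w.sum(axis=0) + 1e-12
        pi = Nj / n
        mu = (w.T @ X) / Nj[:, None]
        sq = ((X[:, None, :] - mu[None, :, :]) ** 2).sum(axis=2)
        var = (w * sq).sum(axis=0) / (D * Nj)
    return pi, mu, var, w
```

Convergence can be monitored via the log-likelihood, which is non-decreasing across iterations.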



Example 4.3: The global maximum may not exist


Alternative view of EM: Coordinate ascent
[Figure sequence: successive coordinate-ascent steps over the variables w1 and w2 of w]

Visual example of EM


Potential Problems
Incorrect number of Mixture Components

Singularities

Incorrect Number of Gaussians


Incorrect Number of Gaussians

Singularities
A minority of the data can have a disproportionate effect on the model likelihood.
For example:


GMM example

Singularities
When a mixture component collapses onto a given point, its mean becomes that point and its variance goes to zero.
Consider the likelihood function as the covariance goes to zero: the likelihood approaches infinity.
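For instance, if one component's mean sits exactly on a data point $\boldsymbol{x}_i$ and its covariance is $\sigma^2 I$ in $D$ dimensions, that point's contribution to the likelihood is

$$ \mathcal{N}(\boldsymbol{x}_i \mid \boldsymbol{x}_i, \sigma^2 I) = (2\pi\sigma^2)^{-D/2} \;\longrightarrow\; \infty \quad \text{as } \sigma^2 \to 0, $$

so the likelihood is unbounded and a global maximum does not exist (cf. Example 4.3).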


K-means vs. EM
K-means clustering and EM clustering on an artificial data set ("mouse"). The tendency of K-means to produce equal-sized clusters leads to bad results, while EM benefits from the Gaussian distributions present in the data set.


So far
K-means
Expectation Maximization



Next up
Multiple-Subspace Segmentation
K-subspaces
EM for Subspaces


Multiple-Subspace Segmentation
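In the standard formulation (a sketch; details are in the handout), the data are assumed to lie close to a union of $K$ linear subspaces with orthonormal bases $U_j$:

$$ \boldsymbol{x}_i \approx U_{j}\, \boldsymbol{y}_i, \qquad U_j \in \mathbb{R}^{D \times d_j}, \quad j \in \{1, \dots, K\}, $$

and the segmentation task is to recover both the bases $U_j$ and the assignment of each sample to a subspace.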




K-subspaces
With noise, we minimize the total squared distance from the samples to their assigned subspaces.
Unfortunately, unlike PCA, there is no constructive solution to this minimization problem. The main difficulty is that the objective is hybrid: it is a combination of minimization over the continuous variables {Uj} and the discrete assignment variable j.
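In symbols (a sketch consistent with the description above, assuming linear subspaces with orthonormal bases $U_j \in \mathbb{R}^{D \times d_j}$), the objective is

$$ \min_{\{U_j\}} \; \sum_{i=1}^{n} \; \min_{j \in \{1,\dots,K\}} \; \big\| \boldsymbol{x}_i - U_j U_j^\top \boldsymbol{x}_i \big\|^2 , $$

which mixes continuous optimization over the bases $\{U_j\}$ with a discrete choice of $j$ for each sample.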



K-subspaces
Given the segmentation, estimating each subspace basis is exactly the same as in PCA.
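A minimal NumPy sketch of the K-subspaces iteration (assuming linear subspaces through the origin and a known common dimension d; illustrative, not the handout's algorithm listing):

```python
import numpy as np

def k_subspaces(X, K, d, n_iters=50, seed=0):
    """Minimal K-subspaces sketch. X is (n, D); each subspace has dimension d."""
    n, D = X.shape
    rng = np.random.default_rng(seed)
    # Initialize each basis with a random orthonormal D x d matrix.
    U = [np.linalg.qr(rng.standard_normal((D, d)))[0] for _ in range(K)]
    for _ in range(n_iters):
        # Segmentation step: assign each sample to the subspace with the
        # smallest residual ||x - U_j U_j^T x||.
        resid = np.stack([np.linalg.norm(X - X @ Uj @ Uj.T, axis=1) for Uj in U], axis=1)
        labels = resid.argmin(axis=1)
        # Estimation step: refit each subspace by PCA/SVD of its assigned samples.
        for j in range(K):
            Xj = X[labels == j]
            if len(Xj) >= d:
                _, _, Vt = np.linalg.svd(Xj, full_matrices=False)
                U[j] = Vt[:d].T
    return labels, U
```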




EM for Subspaces
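One common way to instantiate EM here (a hedged sketch; the handout's model may differ in detail) treats each subspace as a degenerate Gaussian with isotropic noise of variance $\sigma_j^2$ in the directions orthogonal to the subspace. The E step then computes membership probabilities from the residuals to each subspace:

$$ w_{ij} \;\propto\; \pi_j\,(2\pi\sigma_j^2)^{-(D-d_j)/2} \exp\!\Big(-\tfrac{1}{2\sigma_j^2}\,\big\|\boldsymbol{x}_i - U_j U_j^\top \boldsymbol{x}_i\big\|^2\Big), \qquad \textstyle\sum_{j} w_{ij} = 1 . $$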

EM for Subspaces
In the M step
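Under the same degenerate-Gaussian sketch, the M step reduces to a membership-weighted PCA per subspace (again a sketch in standard notation, not necessarily the handout's exact updates):

$$ U_j \leftarrow \text{top } d_j \text{ eigenvectors of } \sum_{i} w_{ij}\,\boldsymbol{x}_i\boldsymbol{x}_i^\top, \qquad \sigma_j^2 \leftarrow \frac{\sum_{i} w_{ij}\,\big\|\boldsymbol{x}_i - U_j U_j^\top\boldsymbol{x}_i\big\|^2}{(D-d_j)\sum_{i} w_{ij}}, \qquad \pi_j \leftarrow \frac{1}{n}\sum_{i} w_{ij} . $$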


Relationship between K-subspaces and EM
At each iteration,
the K-subspaces algorithm gives a definite assignment of every data point to one of the subspaces;
the EM algorithm views the membership as a random variable and uses its expected value to give a probabilistic assignment of each data point.

Homework
Read the handout, Chapter 4, "Iterative Methods for Multiple-Subspace Segmentation."
Complete Exercise 4.2 (page 111) of the handout.

