function Example161
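% Generates N=500 two-dimensional points from a three-component gaussian
% mixture (means m1, m2, m3, covariances 0.1*I, 0.2*I, 0.3*I, mixing
% probabilities 0.4, 0.4, 0.2), plots them, and then estimates the mixture
% parameters with the EM algorithm, starting from the initial guesses
% m_ini, s_ini and Pa_ini.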
randn('seed',0);
m1=[1, 1]'; m2=[3, 3]';m3=[2, 6]';
m=[m1 m2 m3];
S(:,:,1)=0.1*eye(2);
S(:,:,2)=0.2*eye(2);
S(:,:,3)=0.3*eye(2);
P=[0.4 0.4 0.2];
N=500;
sed=0;
[X,y]=mixt_model(m,S,P,N,sed);
plot_data(X,y,m,1)
m1_ini=[0; 2];m2_ini=[5; 2];m3_ini=[5; 5];
m_ini=[m1_ini m2_ini m3_ini];
s_ini=[.15 .27 .4];
Pa_ini=[1/3 1/3 1/3];
e_min=10^(-5);
[m_hat,s_hat,Pa,iter,Q_tot,e_tot]=em_alg_function(X,m_ini,s_ini,Pa_ini,e_min)
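% Optional follow-up (not part of the original example): em_alg_function
% returns the estimated means as a Jxl matrix, so they can be visualized
% against the data on a separate figure with
% plot_data(X,y,m_hat',2)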
function [X,y]=mixt_model(m,S,P,N,sed)
%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%
% FUNCTION
% [X,y]=mixt_model(m,S,P,N,sed)
% Generates a set of data vectors that stem from a mixture of normal
% distributions (also used in Chapter 2).
%
% INPUT ARGUMENTS:
% m: lxc matrix whose i-th column contains the
% (l-dimensional) mean of the i-th normal distribution.
% S: lxlxc matrix whose i-th lxl two-dimensional "slice" is the
% covariance matrix corresponding to the i-th normal distribution.
% P: c-dimensional vector whose i-th coordinate contains
% the a priori probability for the i-th normal distribution.
% N: the total number of points to be generated by the mixture
% distribution.
% sed: the seed used for the initialization of the built-in MATLAB
% random generator function "rand".
%
% OUTPUT ARGUMENTS:
% X: lxN matrix whose columns are the produced vectors.
% y: N-dimensional vector whose i-th element indicates the
% distribution that generated the i-th vector.
%
% (c) 2010 S. Theodoridis, A. Pikrakis, K. Koutroumbas, D. Cavouras
%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%
rand('seed',sed);
[l,c]=size(m);
%Constructing the P_acc vector. This is necessary for picking randomly
%one of the c normal distributions in order to generate a point.
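%For example, with P=[0.4 0.4 0.2] (as in the script above), P_acc becomes
%[0.4 0.8 1.0] and a uniform random number t in [0,1] selects distribution
%sum(t>P_acc)+1.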
P_acc=P(1);
for i=2:c
t=P_acc(i-1)+P(i);
P_acc=[P_acc t];
end
% Generation of the data set
X=[];
y=[];
for i=1:N
t=rand;
ind=sum(t>P_acc)+1; % Index of the normal distribution that will generate the i-th vector
X=[X; mvnrnd(m(:,ind)',S(:,:,ind),1)];
y=[y ind];
end
X=X';
function plot_data(X,y,m,h)
%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%
% FUNCTION
% plot_data(X,y,m,h)
% Plotting utility, capable of visualizing 2-dimensional datasets that
% consist of, at most, 7 classes. It plots with different colors: (a) the
% vectors of a data set that belong to different classes and (b) the mean
% vectors of the classes, provided that the data are 2-dimensional and the
% total number of classes is at most 7.
%
% INPUT ARGUMENTS:
% X: lxN matrix, whose columns are the data vectors to be plotted.
% y: N-dimensional vector whose i-th component is the class label
% of the i-th data vector.
% m: lxc matrix, whose j-th column corresponds to the
% mean vector of the j-th class.
% h: the handle of the figure on which the data will be plotted.
%
% (c) 2010 S. Theodoridis, A. Pikrakis, K. Koutroumbas, D. Cavouras
%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%
[l,N]=size(X); % N=no. of data vectors, l=dimensionality
[l,c]=size(m); % c=no. of classes
if(l~=2) || (c>7)
fprintf('NO PLOT CAN BE GENERATED\n')
return
else
pale=['r.'; 'g.'; 'b.'; 'y.'; 'm.'; 'c.';'co'];
figure(h)
% Plot of the data vectors
hold on
for i=1:N
plot(X(1,i),X(2,i),pale(y(i),:))
hold on
end
% Plot of the class centroids
for j=1:c
plot(m(1,j),m(2,j),'k+')
hold on
end
end
function [m,s,Pa,iter,Q_tot,e_tot]=em_alg_function(x,m,s,Pa,e_min)
%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%
% FUNCTION
% [m,s,Pa,iter,Q_tot,e_tot]=em_alg_function(x,m,s,Pa,e_min)
% EM algorithm for estimating the parameters of a mixture of normal
% distributions, with diagonal covariance matrices.
% WARNING: IT ONLY RUNS FOR THE CASE WHERE THE COVARIANCE MATRICES
% ARE OF THE FORM sigma^2*I. IN ADDITION, IF sigma_i^2=0 FOR SOME
% DISTRIBUTION AT AN ITERATION, IT IS ARBITRARILY SET EQUAL TO 0.001.
%
% INPUT ARGUMENTS:
% x: lxN matrix, each column of which is a feature vector.
% m: lxJ matrix, whose j-th column is the initial
% estimate for the mean of the j-th distribution.
% s: 1xJ vector, whose j-th element is the initial estimate
% of the variance for the j-th distribution.
% Pa: J-dimensional vector, whose j-th element is the initial
% estimate of the a priori probability of the j-th distribution.
% e_min: threshold used in the termination condition of the EM
% algorithm.
%
% OUTPUT ARGUMENTS:
% m: it has the same structure as the input argument m and contains
% the final estimates of the means of the normal distributions.
% s: it has the same structure as the input argument s and contains
% the final estimates of the variances of the normal
% distributions.
% Pa: J-dimensional vector, whose j-th element is the final estimate
% of the a priori probability of the j-th distribution.
% iter: the number of iterations required for the convergence of the
% EM algorithm.
% Q_tot: vector containing the value of the EM Q-function (expected
% complete-data log-likelihood) at each iteration.
% e_tot: vector containing the error value at each iteration.
%
% (c) 2010 S. Theodoridis, A. Pikrakis, K. Koutroumbas, D. Cavouras
%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%
x=x';
m=m';
[p,n]=size(x);
[J,n]=size(m);
e=e_min+1;
Q_tot=[];
e_tot=[];
iter=0;
while (e>e_min)
iter=iter+1;
e;
P_old=Pa;
m_old=m;
s_old=s;
% Determine P(j|x_k; theta(t))
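% P(j|x_k) = Pa(j)*N(x_k; m_j, s_j*I) / sum_i Pa(i)*N(x_k; m_i, s_i*I)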
for k=1:p
temp=gauss(x(k,:),m,s);
P_tot=temp*Pa';
for j=1:J
P(j,k)=temp(j)*Pa(j)/P_tot;
end
end
% Determine the Q-function (expected complete-data log-likelihood)
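% Q = sum_k sum_j P(j|x_k)*( log N(x_k; m_j, s_j*I) + log Pa(j) ), where
% log N(x_k; m_j, s_j*I) = -(n/2)*log(2*pi*s_j) - ||x_k - m_j||^2/(2*s_j)
% and n denotes the dimensionality of the feature vectors.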
Q=0;
for k=1:p
for j=1:J
Q=Q+P(j,k)*(-(n/2)*log(2*pi*s(j)) - sum((x(k,:)-m(j,:)).^2)/(2*s(j)) + log(Pa(j)));
end
end
Q_tot=[Q_tot Q];
% Determine the means
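% m_j = sum_k P(j|x_k)*x_k / sum_k P(j|x_k)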
for j=1:J
a=zeros(1,n);
for k=1:p
a=a+P(j,k)*x(k,:);
end
m(j,:)=a/sum(P(j,:));
end
% Determine the variances
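% s_j = sum_k P(j|x_k)*||x_k - m_j||^2 / ( n*sum_k P(j|x_k) ); the weighted
% squared distances are averaged over the n dimensions because the
% covariance matrices are constrained to the isotropic form s_j*I.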
for j=1:J
b=0;
for k=1:p
b=b+ P(j,k)*((x(k,:)-m(j,:))*(x(k,:)-m(j,:))');
end
s(j)=b/(n*sum(P(j,:)));
if(s(j)<10^(-10))
s(j)=0.001;
end
end
% Determine the a priori probabilities
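% Pa(j) = (1/N)*sum_k P(j|x_k), where N (denoted p here) is the number of
% data vectors.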
for j=1:J
a=0;
for k=1:p
a=a+P(j,k);
end
Pa(j)=a/p;
end
e=sum(abs(Pa-P_old))+sum(sum(abs(m-m_old)))+sum(abs(s-s_old));
e_tot=[e_tot e];
end
function [z]=gauss(x,m,s)
%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%
% FUNCTION (auxiliary)
% [z]=gauss(x,m,s)
% Takes as input the mean values and the variances of a number of Gaussian
% distributions and a vector x and computes the value of each
% Gaussian at x.
%
% NOTE: It is assumed that the covariance matrix of each gaussian
% distribution is diagonal with equal diagonal elements, i.e., of the
% form sigma^2*I, where I is the identity matrix.
%
% INPUT ARGUMENTS:
% x: l-dimensional row vector, on which the values of the J
% gaussian distributions will be calculated
% m: Jxl matrix, whose j-th row corresponds to the
% mean of the j-th gaussian distribution
% s: J-dimensional row vector whose j-th component corresponds to
% the variance for the j-th gaussian distribution (it is assumed
% that the covariance matrices of the distributions are of the
% form sigma^2*I, where I is the lxl identity matrix)
%
% OUTPUT ARGUMENTS:
% z: J-dimensional vector whose j-th component is the value of the
% j-th gaussian distribution at x.
%
% (c) 2010 S. Theodoridis, A. Pikrakis, K. Koutroumbas, D. Cavouras
%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%
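% The value of the j-th gaussian at x is z(j) = (2*pi*s(j))^(-l/2) *
% exp( -||x-m(j,:)||^2 / (2*s(j)) ).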
[J,l]=size(m);
[p,l]=size(x);
z=[];
for j=1:J
t=(x-m(j,:))*(x-m(j,:))';
c=1/(2*pi*s(j))^(l/2);
z=[z c*exp(-t/(2*s(j)))];
end
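function test_gauss_against_mvnpdf(x,m,s)
% Auxiliary consistency check (a minimal sketch, not part of the original
% code): compares the output of the gauss subfunction above with MATLAB's
% mvnpdf, assuming the Statistics Toolbox is available (it is already
% required for mvnrnd). The j-th gaussian uses the isotropic covariance
% matrix s(j)*I.
[J,l]=size(m);
z=gauss(x,m,s);
z_ref=zeros(1,J);
for j=1:J
z_ref(j)=mvnpdf(x,m(j,:),s(j)*eye(l));
end
disp(max(abs(z-z_ref))) % should be numerically zero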