
A Support Vector Clustering Method

Asa Ben-Hur
Faculty of Industrial Engineering and Management
Technion, Haifa 32000, Israel

David Horn
School of Physics and Astronomy
Raymond and Beverly Sackler Faculty of Exact Sciences
Tel Aviv University, Tel Aviv 69978, Israel

Hava T. Siegelmann
Faculty of Industrial Engineering and Management
Technion, Haifa 32000, Israel

Vladimir Vapnik
AT&T Labs Research
100 Schultz Dr., Red Bank, NJ 07701, USA

Abstract

We present a novel kernel method for data clustering using a description of the data by support vectors. The kernel reflects a projection of the data points from data space to a high dimensional feature space. Cluster boundaries are defined as spheres in feature space, which represent complex geometric shapes in data space. We utilize this geometric representation of the data to construct a simple clustering algorithm.

1. Introduction

Clustering of data sets is a problem that has been studied extensively [3, 4]. Since different types of data sets are often more amenable to one clustering method than another, there exists continuing interest in developing yet better algorithms that will have general appeal and success.

A recent algorithm by Lipson and Siegelmann [5] represents data points in a higher dimensional space built out of direct products of the original data space. Their algorithm relies on tensorial manipulations in the higher dimensional space, and allows for cluster boundaries with rich geometric shapes. When cluster boundaries overlap, the corresponding data points are interpreted as belonging to both clusters.

The algorithm developed here shares with [5] the property that a cluster is associated with a specific geometric form that outlines its boundaries. It also uses an extension to a higher dimensional space. However, it employs a kernel method, and hence it can be implemented by algebraic manipulations in the original (input) space. Kernel methods are the basis of the support vector machine (SVM) approach that was developed for classification problems [2]; the decision boundary in the input space is represented by a hyperplane in the higher dimensional feature space. We use a similar idea, representing an arbitrary closed curve that encloses a cluster of data points by a sphere in high dimensional feature space. Our formalism is similar to the analysis of [10, 11], in which the authors characterize the support of a distribution represented by a finite data set and detect its outliers. This paper presents the general ideas and an application in clustering. Further developments will be found in [1].

2. Describing Cluster Boundaries with Support Vectors

In this section we develop a method for representing the boundary of a cluster of data points using the formalism of support vectors. Let {x_i} ⊂ X be the data set, with X ⊆ R^d the input space. The boundary of this cluster may be rather complicated.


Using a nonlinear transformation Φ from X to some high dimensional space, the cluster of points may take on a much simpler form. In fact, it turns out that with an appropriate choice of the mapping Φ, a sphere in the high dimensional space which encloses the transformed data points corresponds to a complex geometrical shape which encloses the points in input space (see figures). We want the shape in input space to be a tight fit around the data points, so we look for the smallest enclosing sphere. An enclosing sphere is represented by the constraints:

\| \Phi(x_i) - a \|^2 \le R^2 \quad \forall i \qquad (1)

where ‖·‖ is the Euclidean norm and a is the center of the sphere. Our goal is to minimize R² over all choices of a that satisfy these constraints. To solve this problem we introduce the Lagrangian

L = R^2 - \sum_j \beta_j \left( R^2 - \| \Phi(x_j) - a \|^2 \right) \qquad (2)

where β_j ≥ 0 are Lagrange multipliers. Minimizing L with respect to the radius R leads to the normalization condition

\sum_j \beta_j = 1 \qquad (3)

while minimization with respect to a leads to

a = \sum_j \beta_j \, \Phi(x_j). \qquad (4)

Using these relations we may eliminate the variables R and a, turning the Lagrangian into the Wolfe dual, which is a function of the variables β_j:

W = \sum_j \Phi(x_j)^2 \, \beta_j - \sum_{i,j} \beta_i \beta_j \, \Phi(x_i) \cdot \Phi(x_j). \qquad (5)

Maximizing W with respect to β_j leads to the emergence of two types of data points: those for which β_j = 0, which include all points that lie inside the sphere and some that may lie on it, and those for which β_j > 0, which lie on the boundary of the sphere. The latter are the support vectors, which define the center of the sphere as seen above.

We follow the standard method of SV and represent the dot products Φ(x_i)·Φ(x_j) by an appropriate Mercer kernel [7]. Common choices are polynomial kernels

K(x_i, x_j) = (x_i \cdot x_j + 1)^d \qquad (6)

or the Gaussian kernel

K(x_i, x_j) = e^{-q \| x_i - x_j \|^2}, \qquad (7)

which is used here. It was noted in [11] that the polynomial kernel is not appropriate, since it gives a weight to points with large coordinate values. A comparison with the Laplacian kernel is given elsewhere [1]. The Lagrangian W is now written as:

W = \sum_j K(x_j, x_j) \, \beta_j - \sum_{i,j} \beta_i \beta_j \, K(x_i, x_j). \qquad (8)

This formalism can be generalized to allow for outliers [12, 8, 9] obeying

\| \Phi(x_j) - a \|^2 \le R^2 + \xi_j \qquad (9)

with ξ_j ≥ 0, by extending the Lagrangian into

L_\xi = L - C \sum_j \xi_j. \qquad (10)

This is implemented with the same dual Lagrangian W as above, with the added constraints

0 \le \beta_j \le C. \qquad (11)

The interpretation is that for all points lying inside or on the sphere ξ_j = 0 and β_j < C, while for the outliers β_j = C.

For each point x we define the distance of its image in feature space from the center of the sphere:

R^2(x) = \| \Phi(x) - a \|^2. \qquad (12)

In view of (4) and the definition of the kernel we have:

R^2(x) = K(x, x) - 2 \sum_j \beta_j K(x_j, x) + \sum_{i,j} \beta_i \beta_j K(x_i, x_j). \qquad (13)

The radius of the cluster is defined as:

R = \{ R(x_i) \mid x_i \text{ is a support vector} \}. \qquad (14)

In practice, one takes the average over all support vectors. The contour that encloses the cluster in data space is the set

\{ x \mid R(x) = R \}. \qquad (15)

A point x is an outlier if R(x) > R.

The shape of the contour is governed by two parameters, q and C. The figures below demonstrate that as q is increased, this shape represents a tighter fit to the cluster of points. For C = 1 no outliers are allowed due to the constraint of Eq. (3). The size of C determines the number of outliers. As C is decreased, the number of outliers increases. Moreover, as can be seen from the three q = 50 examples, the influence of outliers on the shape of the cluster boundary decreases too. The number of support vectors depends on both q and C. We observed that for fixed q, as C is decreased, their number decreases, since the increased number of outliers makes the shape smoother and thus easier to describe with fewer support vectors. Quantitative results on the parameter dependence of the number of outliers and support vectors can be found in [1].
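To make the construction concrete, the following is a minimal sketch (not the authors' implementation) of the single-cluster description: it maximizes the Wolfe dual of Eq. (8) under the constraints of Eqs. (3) and (11) with the Gaussian kernel of Eq. (7), and evaluates R²(x) via Eq. (13). The helper names (gaussian_kernel, fit_sphere) and the use of SciPy's general-purpose SLSQP solver in place of a dedicated SMO-type optimizer are illustrative assumptions.

```python
import numpy as np
from scipy.optimize import minimize


def gaussian_kernel(X, Y, q):
    """K(x, y) = exp(-q ||x - y||^2), the Gaussian kernel of Eq. (7)."""
    d2 = ((X[:, None, :] - Y[None, :, :]) ** 2).sum(-1)
    return np.exp(-q * d2)


def fit_sphere(X, q=1.0, C=1.0, tol=1e-6):
    """Return the multipliers beta, the cluster radius R (Eq. 14), and R2(.)."""
    n = len(X)
    assert n * C >= 1.0, "Eq. (3) needs C >= 1/n to be feasible"
    K = gaussian_kernel(X, X, q)

    # Wolfe dual of Eq. (8): W = sum_j K_jj beta_j - sum_ij beta_i beta_j K_ij.
    def neg_W(beta):
        return -(np.diag(K) @ beta - beta @ K @ beta)

    res = minimize(neg_W, np.full(n, 1.0 / n), method="SLSQP",
                   bounds=[(0.0, C)] * n,                           # Eq. (11)
                   constraints=[{"type": "eq",
                                 "fun": lambda b: b.sum() - 1.0}])  # Eq. (3)
    beta = res.x

    def R2(Y):
        """Squared feature-space distance from the sphere's center, Eq. (13)."""
        Kxy = gaussian_kernel(np.atleast_2d(Y), X, q)
        # K(y, y) = 1 for the Gaussian kernel.
        return 1.0 - 2.0 * Kxy @ beta + beta @ K @ beta

    # Support vectors have 0 < beta_j < C; R is their average distance.
    sv = (beta > tol) & (beta < C - tol)
    R = np.sqrt(R2(X[sv])).mean() if sv.any() else np.sqrt(R2(X)).max()
    return beta, R, R2
```

For example, beta, R, R2 = fit_sphere(X, q=20.0, C=1.0) describes a single cluster; the contour of Eq. (15) is then the level set R2(x) = R**2, and a point is flagged as an outlier when R2(x) > R**2.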

3. The Clustering Algorithm

We extend the single cluster case to a multiple cluster problem in a straightforward fashion. We choose the number of clusters and pick data points as initial centers of those clusters. Then we test the data points one by one, and assign each point x_i to the cluster which minimizes R(x_i). This assigns each point according to the nearest center of a feature space sphere. After each data point is added, the parameters β_j are re-evaluated with the algorithm presented above. There are efficient algorithms tailored especially for the SVM quadratic programming problem. The Sequential Minimal Optimization (SMO) algorithm [6] can be adapted for the Lagrangian (10) [10]. Another advantage is that a smart initialization that uses the result of the previous iteration gives fast convergence.
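The loop below sketches this multiple-cluster procedure under the same assumptions. The fit_sphere argument is assumed to behave like the hypothetical helper in the previous sketch, returning (beta, R, R2); re-fitting every candidate sphere for each new point stands in for the incremental SMO-style update mentioned above.

```python
import numpy as np


def sv_clustering(X, n_clusters, fit_sphere, q=20.0, C=0.07, seed=0):
    """Assign each point to the cluster whose feature-space sphere is nearest."""
    rng = np.random.default_rng(seed)
    # Pick data points as the initial members (centers) of the clusters.
    init = rng.choice(len(X), size=n_clusters, replace=False)
    members = [[int(i)] for i in init]

    for i in range(len(X)):
        if i in init:
            continue
        dists = []
        for m in members:
            # Eq. (3) needs C >= 1/n, so raise C for small, freshly seeded clusters.
            _, _, R2 = fit_sphere(X[m], q=q, C=max(C, 1.0 / len(m)))
            dists.append(float(R2(X[i])[0]))
        # Assign x_i to the cluster which minimizes R(x_i).
        members[int(np.argmin(dists))].append(i)

    labels = np.empty(len(X), dtype=int)
    for k, m in enumerate(members):
        labels[m] = k
    return labels
```

For the two-cluster data set of the figures one would call, for example, labels = sv_clustering(X, n_clusters=2, fit_sphere=fit_sphere, q=20.0, C=0.07).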

Figure 1. q = 6, C = 1.
Figure 2. q = 9, C = 1.
Figure 3. q = 50, C = 1.
Figure 4. q = 50, C = 0.09.

Examples of the results of this procedure are shown in Figures 1-6. The data set on which we tested the algorithm is composed of 144 points in R^2, with 66 points in the upper cluster and 78 points in the lower one. The data points are denoted by dots; those data points that are support vectors or outliers are surrounded by diamonds or squares, respectively. The curves are the projections of the spheres in the high dimensional feature space onto the data space in which the problem is defined. We see that for low values of q, 6 and 9, the shapes of the cluster boundaries do not form tight fits to the data. For such values of q the result also depends on the choice of initial points and the order in which the data is presented. Higher values of q, 20 and 50, do much better. Here the treatment of outliers becomes important. Thus, the three figures with q = 50 show that only as C is decreased to 0.07 is the effect of the outliers reduced and smooth curves obtained for the cluster boundaries. The optimal value of q required to represent the contour of a cluster is data dependent, and increases as the desired curvature increases. In the problem at hand q = 50 seems to be too high, since the lower cluster splits into two pieces. Hence we think that q = 20, C = 0.07 gives a better solution.
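The dependence on q and C discussed above can be examined with a short sketch in the same spirit (again assuming the hypothetical fit_sphere helper from the earlier sketches): for each setting, count the support vectors (0 < β_j < C) and the outliers (β_j = C).

```python
import numpy as np


def count_sv_and_outliers(X, fit_sphere, settings, tol=1e-6):
    """Count support vectors and outliers for each (q, C) setting."""
    rows = []
    for q, C in settings:
        beta, _, _ = fit_sphere(X, q=float(q), C=float(C))
        n_sv = int(np.sum((beta > tol) & (beta < C - tol)))   # boundary points
        n_out = int(np.sum(beta >= C - tol))                  # outliers
        rows.append((q, C, n_sv, n_out))
    return rows


# The (q, C) pairs of Figures 1-6.
settings = [(6, 1.0), (9, 1.0), (50, 1.0), (50, 0.09), (50, 0.07), (20, 0.07)]
```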

The algorithm presented here utilizes the support vector approach for representing cluster boundaries in a very simple way. Work on a more elaborate clustering scheme is now in progress.

Our algorithm utilizes the support vector approach, hence it puts special emphasis on cluster boundaries. This can be contrasted with a standard clustering algorithm, like K-means [4], which separates data into clusters by minimizing data distances from cluster centers, a criterion that yields very simple cluster boundaries. Applying K-means to our data set with K = 2 we obtain the results shown in Figure 7. Obviously this clustering algorithm is less suitable to this problem than ours, because it does not reflect the intuitive notion that proximity between points within a cluster should be a key criterion. This highlights the flexibility of the SV-algorithm, which takes proximity into account and can represent clusters with arbitrary shapes.

Figure 5. q = 50, C = 0.07.
Figure 6. q = 20, C = 0.07.
Figure 7. Clustering produced by K-means.

Acknowledgments

This work was partially supported by the Israel Ministry of Science.

References

[1] A. Ben-Hur, D. Horn, H. Siegelmann, and V. Vapnik. A support vector method for hierarchical clustering. Preprint.
[2] B. Boser, I. Guyon, and V. Vapnik. A training algorithm for optimal margin classifiers. In Proceedings of the 5th Annual ACM Workshop on COLT, pages 144-152. ACM Press, 1992.
[3] R. Duda and P. Hart. Pattern Classification and Scene Analysis. Wiley-Interscience, 1973.
[4] A. Jain and R. Dubes. Algorithms for Clustering Data. Prentice Hall, Englewood Cliffs, NJ, 1988.
[5] H. Lipson and H. Siegelmann. Clustering irregular shapes using high-order neurons. Neural Computation, 2000, to be published.
[6] J. Platt. Fast training of SVMs using sequential minimal optimization. In B. Scholkopf, C. Burges, and A. Smola, editors, Advances in Kernel Methods - Support Vector Learning, pages 185-208. MIT Press, Cambridge, MA, 1999.
[7] S. Saitoh. Theory of Reproducing Kernels and its Applications. Longman Scientific & Technical, 1988.
[8] B. Scholkopf. Support Vector Learning. R. Oldenbourg Verlag, 1997.
[9] B. Scholkopf, C. Burges, and A. Smola, editors. Advances in Kernel Methods - Support Vector Learning. MIT Press, 1999.
[10] B. Scholkopf, J. Platt, J. Shawe-Taylor, A. Smola, and R. Williamson. Estimating the support of a high dimensional distribution. In Proceedings of the Annual Conference on Neural Information Processing Systems 1999 (NIPS'99). MIT Press, 2000.
[11] D. Tax and R. Duin. Support vector domain description. Pattern Recognition Letters, 20:1991-1999, 1999.
[12] V. Vapnik. The Nature of Statistical Learning Theory. Springer Verlag, 1995.

