Abstract
A method to estimate the probability density function of multivariate distributions is presented. The classical Parzen window approach builds a spherical Gaussian density around every input sample. This choice of the kernel density yields poor robustness for real input datasets. We use multivariate Student-t distributions in order to improve the adaptation capability of the model. Our method has a first stage where hard neighbourhoods are determined for every sample. Then soft clusters are considered to merge the information coming from several hard neighbourhoods. Hence, a specific mixture component is learned for each soft cluster. This leads to outperform other proposals where the local kernel is not as robust and/or there are no smoothing strategies, like the manifold Parzen windows.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Preview
Unable to display preview. Download preview PDF.
Similar content being viewed by others
References
Bezdek, J.C.: Numerical taxonomy with fuzzysets. J. Math. Biol. 1, 57–71 (1974)
Bishop, C.: Neural Networks for Pattern Recognition. Oxford University Press, Oxford (1995)
Hjort, N.L., Jones, M.C.: Locally Parametric Nonparametric Density Estimation. Annals of Statistics 24(4), 1619–1647 (1996)
Izenman, A.J.: Recent developments in nonparametric density estimation. Journal of the American Statistical Association 86(413), 205–224 (1991)
Kanzow, C., Yamashita, N., Fukushima, M.: Levenberg-Marquardt methods for constrained nonlinear equations with strong local convergence properties. Journal of Computational and Applied Mathematics 172, 375–397 (2004)
Lejeune, M., Sarda, P.: Smooth estimators of distribution and density functions. Computational Statistics and Data Analysis 14, 457–471 (1992)
McLachlan, G., Peel, D.: Finite Mixture Models. Wiley, Chichester (2000)
Newman, D.J., Hettich, S., Blake, C.L., Merz, C.J.: UCI Repository of machine learning databases. Department of Information and Computer Science, University of California, Irvine (1998), http://www.ics.uci.edu/~mlearn/MLRepository.html
Parzen, E.: On the Estimation of a Probability Density Function and Mode. Annals of Mathematical Statistics 33, 1065–1076 (1962)
Shoham, S.: Robust clustering by deterministic agglomeration EM of mixtures of multivariate t-distributions. Pattern Recognition 35, 1127–1142 (2002)
Silverman, B.: Density Estimation for Statistics and Data Analysis. Chapman and Hall, New York (1986)
Svensén, M., Bishop, C.M.: Robust Bayesian mixture modeling. Neurocomputing 64, 235–252 (2005)
Tipping, M.E., Bishop, C.M.: Mixtures of Probabilistic Principal Components Analyzers. Neural Computation 11, 443–482 (1999)
Vapnik, V.N.: Statistical Learning Theory. John Wiley and Sons, New York (1998)
Vincent, P., Bengio, Y.: Manifold Parzen Windows. Advances in Neural Information Processing Systems 15, 825–832 (2003)
Wang, H., Zhang, Q., Luo, B., Wei, S.: Robust mixture modelling using multivariate t-distribution with missing information. Pattern Recognition Letters 25, 701–710 (2004)
Author information
Authors and Affiliations
Editor information
Rights and permissions
Copyright information
© 2008 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
López-Rubio, E., Ortiz-de-Lazcano-Lobato, J.M., López-Rodríguez, D., del Carmen Vargas-Gonzalez, M. (2008). Robust Nonparametric Probability Density Estimation by Soft Clustering. In: Kůrková, V., Neruda, R., Koutník, J. (eds) Artificial Neural Networks - ICANN 2008. ICANN 2008. Lecture Notes in Computer Science, vol 5163. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-540-87536-9_17
Download citation
DOI: https://doi.org/10.1007/978-3-540-87536-9_17
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-87535-2
Online ISBN: 978-3-540-87536-9
eBook Packages: Computer ScienceComputer Science (R0)