Abstract
Since in practical data mining problems high-dimensional data are clustered, the resulting clusters are high-dimensional geometrical objects, which are difficult to analyze and interpret. Cluster validity measures try to solve this problem by providing a single numerical value. As a low dimensional graphical representation of the clusters could be much more informative than such a single value, this paper proposes a new tool for the visualization of fuzzy clustering results. By using the basic properties of fuzzy clustering algorithms, this new tool maps the cluster centers and the data such that the distances between the clusters and the data-points are preserved. During the iterative mapping process, the algorithm uses the membership values of the data and minimizes an objective function similar to the original clustering algorithm. Comparing to the original Sammon mapping not only reliable cluster shapes are obtained but the numerical complexity of the algorithm is also drastically reduced. The developed tool has been applied for visualization of reconstructed phase space trajectories of chaotic systems. The case study demonstrates that proposed FUZZSAMM algorithm is a useful tool in user-guided clustering.
Similar content being viewed by others
Explore related subjects
Discover the latest articles, news and stories from top researchers in related subjects.References
Abonyi J, Lakatos BG, Ulbert ZS (2002) Modelling and control of isothermal crystallizers by self organising maps. Chem Eng Trans 1(3):1149
Abonyi J, Feil B, Babuška R (2004) State-space reconstruction and prediction of chaotic time series based on fuzzy clustering. IEEE SMC 2004, October 10–13
Ashkenazy Y (1999) The use of generalized information dimension in measuring fractal dimension of time series. Physica A 271:427–447
Ataei M, Lohmann B, Khaki-Sedigh A, Lucas C (2004) Model based method for estimating an attractor dimension from uni/ multivariate chaotic time series with application to Bremen climatic dynamics. Chaos Solitons Fractals 19:1131–1139
Babuška R, Verbruggen HB (1991) Fuzzy set methods for local modeling and identification. Multiple model approaches to nonlinear modeling and control. Taylor & Francis, London, pp 75–100
Bezdek JC (1981) Pattern recognition with fuzzy objective function algorithms. Plenum, New York
Cao C, (1997) Practical method for determining the minimum embedding dimension of a scalar time series. Physica D 110:43–50
Casdagli M (1989) Nonlinear prediction of chaotic time series. Physica D 35:335–356
Farmer JD, Sidorowich JJ (1987) Predicting chaotic time series. Phys Rev Lett 59(8):845–848
Grassberger P, Procaccia I (1983) Measuring the strangeness of strange attractors. Physica D 9:189–208
Hanesch M, Scholger R, Dekkers MJ (2001) The application of fuzzy c-means cluster analysis and non-linear mapping to a soil data set for the detection of polluted sites. Phys Chem Earth 26:885–891
Hunter Jr, Norman F (1992) Nonlinear prediction of speach signals. In: Casdagli M, Eubanks S (eds) Nonlinear modelling and forecasting. Addison-Wesley, pp 467–492
Jiang X, Adeli H (2003) Fuzzy clustering approach for accurate embedding dimension identification in chaotic time series. Integrated Comput Aided Eng 10:287–302
Klawonn F, Chekhtman V, Janz E (2003) Visual inspection of fuzzy clustering results. Adv Soft Comput Eng Des Manuf 65–76
Kovacs A, Abonyi J (2002) Vizualization of fuzzy clustering results by modified Sammon mapping. In: Proceedings of the 3rd international symposium of Hungarian researchers on computational intelligence, pp 177–188
Lakatos BG (1994) Stability and dynamics of continuous crystallizers. Comput Chem Eng 18:427–431
Lakatos BG (1995) Sustained oscillation in isothermal CMSMPR crystallizers: effect of size-dependent crystall growth rate. Sapundzhiev TsJ (1995) ACH Models Chem 132:379–394
Lerner B, Guterman H, Aladjem M, Dinstein I, Romem Y (1998) On pattern classification with Sammon’s nonlinear mapping an experimental study. Pattern Recognit 31:371–381
Lillekjendlie B, Kugiumtzis D, Christophersen N (1994) Chaotic time series—Part II: System identification and prediction. Modeling Identification Control 15(4):225–243
Mao J, Jain AK (1995) Artifical neural networks for feature extraction and multivariate data projection. IEEE Trans Neural Netw 629–637
Pal NR, Eluri VK (1998) Two efficient connectionist schemes for structure preserving dimensionality reduction. IEEE Trans Neural Netw 9:1143–1153
de Ridder D, Duin RPW (1997) Sammon’s mapping using neural networks: a comparison. Pattern Recognit Lett 18:1307–1316
Sauer T, Yorke JA, Casdagli M (1991) Embedology. J Stat Phys 65:579–615
Takens F (1981) Detecting strange attractor in turbulence. In: Rand DA, Young LS (eds). Dynamical systems and turbulence. Springer, Berlin Heidelberg New York, pp 366–381
Vesanto J (2000) Neural network tool for data mining: SOM Toolbox. In: Proceedings of symposium on tool environments and development methods for intelligent systems (TOOLMET2000), pp 184–196
Author information
Authors and Affiliations
Corresponding author
Rights and permissions
About this article
Cite this article
Feil, B., Balasko, B. & Abonyi, J. Visualization of fuzzy clusters by fuzzy Sammon mapping projection: application to the analysis of phase space trajectories. Soft Comput 11, 479–488 (2007). https://doi.org/10.1007/s00500-006-0111-5
Published:
Issue Date:
DOI: https://doi.org/10.1007/s00500-006-0111-5