A Fuzzy Subspace Algorithm for Clustering High Dimensional Data

Guojun Gan²²,
Jianhong Wu²² &
Zijiang Yang²³

Part of the book series: Lecture Notes in Computer Science ((LNAI,volume 4093))

Included in the following conference series:

International Conference on Advanced Data Mining and Applications

3144 Accesses
23 Citations

Abstract

In fuzzy clustering algorithms each object has a fuzzy membership associated with each cluster indicating the degree of association of the object to the cluster. Here we present a fuzzy subspace clustering algorithm, FSC, in which each dimension has a weight associated with each cluster indicating the degree of importance of the dimension to the cluster. Using fuzzy techniques for subspace clustering, our algorithm avoids the difficulty of choosing appropriate cluster dimensions for each cluster during the iterations. Our analysis and simulations strongly show that FSC is very efficient and the clustering results produced by FSC are very high in accuracy.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Subscribe and save

Springer+ Basic

$34.99 /Month

Get 10 units per month
Download Article/Chapter or eBook
1 Unit = 1 Article or 1 Chapter
Cancel anytime

Buy Now

Chapter: USD 29.95; Price excludes VAT (USA)

eBook: USD 84.99; Price excludes VAT (USA)

Softcover Book: USD 109.99; Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

Fuzzy Knowledge-Based Subspace Clustering for Life Science Data Analysis

Viewpoint-Driven Subspace Fuzzy C-Means Algorithm

Subspace Clustering and Some Soft Variants

References

Jain, A., Murty, M., Flynn, P.: Data clustering: A review. ACM Computing Surveys 31, 264–323 (1999)
Article Google Scholar
Cao, Y., Wu, J.: Projective ART for clustering data sets in high dimensional spaces. Neural Networks 15, 105–120 (2002)
Article Google Scholar
Agrawal, R., Gehrke, J., Gunopulos, D., Raghavan, P.: Automatic subspace clustering of high dimensional data for data mining applications. In: SIGMOD Record ACM Special Interest Group on Management of Data, pp. 94–105 (1998)
Google Scholar
Aggarwal, C., Wolf, J., Yu, P., Procopiuc, C., Park, J.: Fast algorithms for projected clustering. In: Proceedings of the 1999 ACM SIGMOD international conference on Management of data, pp. 61–72. ACM Press, New York (1999)
Chapter Google Scholar
Domeniconi, C., Papadopoulos, D., Gunopulos, D., Ma, S.: Subspace clustering of high dimensonal data. In: Proceedings of the SIAM International Conference on Data Mining, Lake Buena Vista, Florida (2004)
Google Scholar
Goil, S., Nagesh, H., Choudhary, A.: MAFIA: Efficient and scalable subspace clustering for very large data sets. Technical Report CPDC-TR-9906-010, Center for Parallel and Distributed Computing, Department of Electrical & Computer Engineering, Northwestern University (1999)
Google Scholar
Aggarwal, C., Yu, P.: Finding generalized projected clusters in high dimensional spaces. In: Chen, W., Naughton, J.F., Bernstein, P.A. (eds.) Proceedings of the 2000 ACM SIGMOD International Conference on Management of Data, Dallas, Texas, USA, May 16-18, 2000, vol. 29, pp. 70–81. ACM, New York (2000)
Chapter Google Scholar
Woo, K., Lee, J.: FINDIT: a fast and intelligent subspace clustering algorithm using dimension voting. PhD thesis, Korea Advanced Institue of Science and Technology, Department of Electrical Engineering and Computer Science (2002)
Google Scholar
Cheng, C., Fu, A., Zhang, Y.: Entropy-based subspace clustering for mining numerical data. In: Proceedings of the fifth ACM SIGKDD international conference on Knowledge discovery and data mining, pp. 84–93. ACM Press, New York (1999)
Chapter Google Scholar
Kaufman, L., Rousseeuw, P.: Finding Groups in Data–An Introduction to Cluster Analysis. Wiley series in probability and mathematical statistics. John Wiley & Sons, Inc., New York (1990)
Google Scholar
Yang, J., Wang, W., Wang, H., Yu, P.: δ-clusters: capturing subspace correlation in a large data set. In: Proceedings. 18th International Conference on Data Engineering, pp. 517–528 (2002)
Google Scholar
Procopiuc, C., Jones, M., Agarwal, P., Murali, T.: A monte carlo algorithm for fast projective clustering. In: Proceedings of the 2002 ACM SIGMOD international conference on Management of data, pp. 418–427. ACM Press, New York (2002)
Chapter Google Scholar
Gan, G., Wu, J.: Subspace clustering for high dimensional categorical data. ACM SIGKDD Explorations Newsletter 6, 87–94 (2004)
Article Google Scholar
Agarwal, P., Mustafa, N.: k-means projective clustering. In: Proceedings of the Twenty-third ACM SIGACT-SIGMOD-SIGART Symposium on Principles of Database Systems(PODS), Paris, France, pp. 155–165. ACM Press, New York (2004)
Chapter Google Scholar
Liu, B., Xia, Y., Yu, P.: Clustering through decision tree construction. In: Proceedings of the ninth international conference on Information and knowledge management, McLean, Virginia, USA, pp. 20–29. ACM Press, New York (2000)
Google Scholar
Hartigan, J.: Clustering Algorithms. John Wiley & Sons, Toronto (1975)
MATH Google Scholar
Huang, Z., Ng, M.: A fuzzy k-modes algorithm for clustering categorical data. IEEE Transactions on Fuzzy Systems 7, 446–452 (1999)
Article Google Scholar

Download references

Author information

Authors and Affiliations

Department of Mathematics and Statistics, York University, Toronto, Ontario, M3J 1P3, Canada
Guojun Gan & Jianhong Wu
School of Information Technology, Atkinson Faculty of Liberal and Professional Studies, York University, Toronto, Ontario, M3J 1P3, Canada
Zijiang Yang

Authors

Guojun Gan
View author publications
You can also search for this author in PubMed Google Scholar
Jianhong Wu
View author publications
You can also search for this author in PubMed Google Scholar
Zijiang Yang
View author publications
You can also search for this author in PubMed Google Scholar

Editor information

Editors and Affiliations

School of Information Technology and Electronic Engineering, The University of Queensland, Queensland, Australia
Xue Li
University of Alberta, Canada
Osmar R. Zaïane
Northwest Polytechnical University, China
Zhanhuai Li

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Gan, G., Wu, J., Yang, Z. (2006). A Fuzzy Subspace Algorithm for Clustering High Dimensional Data. In: Li, X., Zaïane, O.R., Li, Z. (eds) Advanced Data Mining and Applications. ADMA 2006. Lecture Notes in Computer Science(), vol 4093. Springer, Berlin, Heidelberg. https://doi.org/10.1007/11811305_30

Download citation

DOI: https://doi.org/10.1007/11811305_30
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-37025-3
Online ISBN: 978-3-540-37026-0
eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics

A Fuzzy Subspace Algorithm for Clustering High Dimensional Data

Abstract

Access this chapter

Subscribe and save

Buy Now

Preview

Similar content being viewed by others

Fuzzy Knowledge-Based Subspace Clustering for Life Science Data Analysis

Viewpoint-Driven Subspace Fuzzy C-Means Algorithm

Subspace Clustering and Some Soft Variants

References

Author information

Authors and Affiliations

Editor information

Editors and Affiliations

Rights and permissions

Copyright information

About this paper

Cite this paper

Download citation

Publish with us

Subscribe and save

Buy Now

Navigation

A Fuzzy Subspace Algorithm for Clustering High Dimensional Data

Abstract

Access this chapter

Subscribe and save

Buy Now

Preview

Similar content being viewed by others

Fuzzy Knowledge-Based Subspace Clustering for Life Science Data Analysis

Viewpoint-Driven Subspace Fuzzy C-Means Algorithm

Subspace Clustering and Some Soft Variants

References

Author information

Authors and Affiliations

Editor information

Editors and Affiliations

Rights and permissions

Copyright information

About this paper

Cite this paper

Download citation

Share this paper

Publish with us

Search

Navigation