Abstract
In recent years, folksonomy becomes a hot topic in many research fields such as complex systems, information retrieval, and recommending systems. It is essential to study the semantic relationships among tags in folksonomy applications. The main contributions of this paper includes: (a) proposes a general framework for the analysis of the semantic relationships among tags based on their co-occurrence. (b)investigates eight correlation measurements from various fields; then appliying these measurements to searching similar tags for a given tag on datasets from del.icio.us. (c) conducts a comparative study on both accuracy and time performance of the eight measurements. From the comparison, a best overall correlation measurement is concluded for similar tags searching in the applications of folksonomy.
Supported by the 11th Five Years Key Programs for Sci. &Tech. Development of China under grant No. 2006BAI05A01, the National Science Foundation under grant No 60773169, the Software Innovation Project of Sichuan Youth under Grant No 2007AA0155 and the Development Foundation of Chengdu Univeristy of Information Technology(KYTZ200811).
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Preview
Unable to display preview. Download preview PDF.
Similar content being viewed by others
References
Shirky, C.: Ontology is overrated: Categories, links, and tags. Clay Shirky’s Writings About the Internet Website (2005)
del.icio.us, http://del.icio.us/
Cattuto, C., Loreto, V., Pietronero, L.: Semiotic dynamics and collaborative tagging. Proceedings of the National Academy of Sciences United States of America 104, 1461 (2007)
Cattuto, C., Loreto, V., Servedio, V.D.: A yule-simon process with memory. Europhysics Letters 76(2), 208–214 (2006)
Lux, M., Granitzer, M., Kern, R.: Aspects of Broad Folksonomies. In: 18th International Conference on Database and Expert Systems Applications (2007)
Brin, S., Motwani, R., Silverstein, C.: Beyond Market Baskets: Generalizing Association Rules to Correlations. In: Proceedings ACM SIGMOD International Conference on Management of Data, Tucson, Arizona, USA, May 13-15 (1997)
Chan, P.K.: A non-invasive learning approach to building web user profiles. In: KDD 1999 Workshop on Web Usage Analysis and User Profiling (1999)
Jeh, G., Widom, J.: SimRank: A measure of structural-context similarity. In: Proc. 8th ACM SIGKDD Intl. Conf. on Knowledge Discovery and Data Mining (July 2002)
Pythagorean Theorem, http://mathworld.wolfram.com/PythagoreanTheorem.html
Quinlan, J.R.: C4.5: Programs for Machine Learning. Morgan Kaufmann, San Francisco (1993)
Deshpande, M., Karypis, G.: Item-based top-n recommendation algorithms. ACM Trans. Inf. Syst. 22(1), 143–177 (2004)
Thorsten, J.: Probabilistic Analysis of the Rocchio Algorithm with TFIDF for Text Categorization. In: Proceedings of 14th International Conference on Machine Learning (1996)
Rosenfeld, R.: A maximum entropy approach to adaptive statistical language modeling. Computer,speech, and language 10 (1996)
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2008 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Xu, K., Chen, Y., Jiang, Y., Tang, R., Liu, Y., Gong, J. (2008). A Comparative Study of Correlation Measurements for Searching Similar Tags. In: Tang, C., Ling, C.X., Zhou, X., Cercone, N.J., Li, X. (eds) Advanced Data Mining and Applications. ADMA 2008. Lecture Notes in Computer Science(), vol 5139. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-540-88192-6_75
Download citation
DOI: https://doi.org/10.1007/978-3-540-88192-6_75
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-88191-9
Online ISBN: 978-3-540-88192-6
eBook Packages: Computer ScienceComputer Science (R0)