A Neighborhood Search Method for Link-Based Tag Clustering

Cui, Jianwei; Li, Pei; Liu, Hongyan; He, Jun; Du, Xiaoyong

doi:10.1007/978-3-642-03348-3_12

Jianwei Cui^25,26,
Pei Li^25,26,
Hongyan Liu²⁷,
Jun He^25,26 &
…
Xiaoyong Du^25,26

Part of the book series: Lecture Notes in Computer Science ((LNAI,volume 5678))

Included in the following conference series:

International Conference on Advanced Data Mining and Applications

2225 Accesses
3 Citations

Abstract

Recently tagging has been a flexible and important way to share and categorize web resources. However, ambiguity and large quantities of tags restrict its value for resource sharing and navigation. Tag clustering could help alleviate these problems by gathering relevant tags. In this paper, we introduce a link-based method to measure the relevance between tags based on random walk on graphs. We also propose a new clustering method which could address several challenges in tag clustering. The experimental results based on del.icio.us show that our methods achieve good accuracy and acceptable performance on tag clustering.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Log in via an institution

Chapter: USD 29.95; Price excludes VAT (USA)

eBook: USD 84.99; Price excludes VAT (USA)

Softcover Book: USD 109.99; Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

References

Simpson, E.: Clustering Tags in Enterprise and Web Folksonomies. Technical report, HP Labs (2008)
Google Scholar
Newzingo: Your Map to Google News, http://www.newzingo.com
Grigory, B., Philipp, K., Frank, S: Automated Tag Clustering: Improving search and exploration in the tag space. WWW (2006)
Google Scholar
Celine, V.D., Martin, H., Katharina, S.: Folksontology: An integrated approach for turning folksomomies into ontology. SemNet, 57–70 (2007)
Google Scholar
Leonard, K., Peter, J.R.: Finding Groups in Data: an Introduction to Cluster Analysis. Wiley Interscience, Hoboken (1990)
Google Scholar
Martin, E., Hans-Peter, K., Jorg, S., Xiaowei, X.: A Density-Based Algorithm for Discovering Clusters in Large Spatial Databases with Noise. In: SIGKDD 1996 (1996)
Google Scholar
Christopher, H.B., Nancy, M.: Improved Annotation of the Blogopshere via Autotagging and Hierarchical Clustering. WWW (2006)
Google Scholar
Glen, J., Jennifer, W.: SimRank: A measure of structural-context similarity. In: SIGKDD, pp. 538–543 (2002)
Google Scholar
Kallenberg, O.: Foundations of Modern Probability. Springer, New York (1997)
MATH Google Scholar
Page, L., Brin, S., Motwani, R., Winograd, T.: The PageRank citation ranking: Bringing order to the Web. Technical report, Stanford University Database Group (1998)
Google Scholar
Pei, L., Zhixu, L., Li, H., Jun, H., Xiaoyong, D.: Using Link-Based Content Analysis to Measure Document Similarity Effectively. APWeb/WAIM, 455–467 (2009)
Google Scholar
Del.icio.us, http://delicious.com
Tian, Z., Raghu, R., Miron, L.: BIRCH: An Efficient Data Clustering Method for very Large Databases. In: SIGMOD, pp. 103–114 (1996)
Google Scholar
Porter, M.: An algorithm for suffix stripping. Program 14(3), 130–137 (1980), http://www.tartarus.org/~martin/PorterStemmer
Article Google Scholar
The stop-words list, http://members.unine.ch/jacques.savoy/clef/englishST.txt

Download references

Author information

Authors and Affiliations

Key Labs of Data Engineering and Knowledge Engineering, Ministry of Education, China
Jianwei Cui, Pei Li, Jun He & Xiaoyong Du
School of Information, Renmin University of China, 100872, Beijing
Jianwei Cui, Pei Li, Jun He & Xiaoyong Du
Department of Management Science and Engineering, Tsinghua University, 100084, Beijing
Hongyan Liu

Authors

Jianwei Cui
View author publications
You can also search for this author in PubMed Google Scholar
Pei Li
View author publications
You can also search for this author in PubMed Google Scholar
Hongyan Liu
View author publications
You can also search for this author in PubMed Google Scholar
Jun He
View author publications
You can also search for this author in PubMed Google Scholar
Xiaoyong Du
View author publications
You can also search for this author in PubMed Google Scholar

Editor information

Editors and Affiliations

Knowledge Science & Engineering Institute, School of Education Technology, Beijing Normal University, Xinjiekouwai Ave. 19, 100875, Beijing, China
Ronghuai Huang
The Hong Kong University of Science and Technology, Clear Water Bay,, Hong Kong, Hong Kong
Qiang Yang
School of Computing Science, Simon Fraser University, 8888 University Drive, V5A 1S6, Burnaby, BC, Canada
Jian Pei
Faculty of Economics, University of Porto, Rua Dr. Roberto Frias, 4200-465, Porto, Portugal
João Gama
School of Information, Zhongguancum, Renmin University, 100872, Beijing, China
Xiaofeng Meng
School of Information Technology and Electrical Engineering, The University of Queensland, 4072, St. Lucia, Queensland, Australia
Xue Li

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Cui, J., Li, P., Liu, H., He, J., Du, X. (2009). A Neighborhood Search Method for Link-Based Tag Clustering. In: Huang, R., Yang, Q., Pei, J., Gama, J., Meng, X., Li, X. (eds) Advanced Data Mining and Applications. ADMA 2009. Lecture Notes in Computer Science(), vol 5678. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-03348-3_12

Download citation

DOI: https://doi.org/10.1007/978-3-642-03348-3_12
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-642-03347-6
Online ISBN: 978-3-642-03348-3
eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics