Abstract
Social tagging is an increasingly popular way to describe latent semantic information of web resources and thus is widely used to improve the performance of information retrieval system. However, there also has been significant variance of the quality of social tags because they can be annotated by folks on the web freely. As a consequence, how to measure the quality of social tags (referred to as social tag confidence) becomes an important issue. In this paper, we propose a statistic model to measure the confidence of social tags by utilizing a combination of three attributes of a social tag: web resource, tag, and tagging user. In order to evaluate the effectiveness of our model, two experiments are performed with datasets crawled from del.icio.us. Experimental results show that our model has a better performance than other approaches with respect to Normalized Discounted Cumulated Gain (NDCG). In addition, F-1 measure of tagged web page clustering performance is also increased when our model is applied to filter the noisy social tags with low tag confidence.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Preview
Unable to display preview. Download preview PDF.
References
Bao, S., Xue, G., Wu, X., Yu, Y., Fei, B., Su, Z.: Optimizing web search using social annotations. In: The 16th International Conference on World Wide Web (WWW 2007), pp. 501–510. ACM Press, Banff (2007)
Park, L., Ramamohanarao, K.: Mining Web Multi-resolution Community-based Popularity for Information Retrieval. In: The 16th International ACM Conference on Conference on Information and Knowledge Management (CIKM 2007), pp. 545–554. ACM Press, Lisboa (2007)
Wang, C., Zhang, L., Zhang, H.: Learning to Reduce the Semantic Gap in Web Image Retrieval and Annotation. In: The 31th Annual International ACM SIGIR Conference on Research and Development in Information Retrieval (SIGIR 2008), pp. 355–362. ACM Press, Singapore (2008)
Noll, M.G., Meinel, C.: Exploring social annotations for web document classification. In: The 2008 ACM Symposium on Applied Computing (SAC 2008), pp. 2315–2320. ACM Press, Brazil (2008)
Ramage, D., Heymann, P.: Clustering the Tagged Web. In: The Second ACM International Conference on Web Search and Data Mining, pp. 54–63. ACM Press, Barcelona (2009)
Xu, S., Bao, S., Cao, Y., Yu, Y.: Using social annotations to improve language model for information retrieval. In: The 16th International ACM Conference on Conference on Information and Knowledge Management (CIKM 2007), pp. 1003–1006. ACM Press, Lisboa (2007)
Mathes, A.: Folksonomies - cooperative classification and communication through shared metadata. Computer Mediated Communication, LIS590CMC (Doctoral Seminar), Graduate School of Library and Information Science, University of Illinois Urbana-Champaign (2004)
Lee, S., Min, H., Lee, Y.B., Ro, Y.M.: Measurement of Tag Confidence in User Generated Contents Retrieval. In: Proc. of SPIE, pp. 7257–7262 (2009)
Wu, L., Yang, L., Yu, N.: Learning to Tag. In: The 18th International Conference on World Wide Web (WWW 2009), pp. 361–370. ACM Press, Marid (2009)
Zhou, D., Bian, J., Zheng, S., Zha, H., Giles, C.L.: Exploring social annotations for information retrieval. In: The 17th International Conference on World Wide Web (WWW 2008), pp. 715–724. ACM Press, Beijing (2008)
Schenkel, R., Crecelius, T., Kacimi, M., Michel, S., Neumann, T., Parreira, J.X., Weikum, G.: Efficient top-k querying over social-tagging networks. In: The 31th Annual International ACM SIGIR Conference on Research and Development in Information Retrieval (SIGIR 2008), pp. 523–530. ACM Press, Singapore (2008)
Wu, X., Zhang, L., Yu, Y.: Exploring social annotations for the semantic web. In: The 15th International Conference on World Wide Web (WWW 2006), pp. 417–426. ACM Press, Edinburgh (2006)
Brooks, C.H., Montanez, N.: Improved annotation of the blogosphere via autotagging and hierarchical clustering. In: The 15th International Conference on World Wide Web (WWW 2006), pp. 625–632. ACM Press, Edinburgh (2006)
Li, X., Guo, L., Zhao, Y.E.: Tag-based social interest discovery. In: The 17th International Conference on World Wide Web (WWW 2008), pp. 675–684. ACM Press, Beijing (2008)
Krestel, R., Fankhauser, P., Nejdl, W.: Latent Dirichlet Allocation for Tag Recommendation. In: The 3rd ACM Conference on Recommender Systems, New York, USA (2009)
Körner, C., Benz, D., Hotho, A., Strohmaier, M., Stumme, B.: Stop Thinking, Start Tagging: Tag Semantics Emerge from Collaborative Verbosity. In: The International Conference on World Wide Web (WWW 2010), pp. 251–260. ACM Press, USA (2010)
Koutrika, G., Effendi, F.A., Gyöngyi, Z., Heymann, P., Molina, H.G.: Combating Spam in Tagging Systems. In: The Third International Workshop on Adversarial Information Retrieval on the Web (AIRWeb 2007), Alberta, Canada (2007)
Wu, L., Yang, L., Yu, N.: Learning to Tag. In: The 18th International Conference on World Wide Web (WWW 2009), pp. 361–370. ACM Press, Marid (2009)
Liu, D., Hua, X., Yang, L.: Tag Ranking. In: The 18th International Conference on World Wide Web (WWW 2009), pp. 351–360. ACM Press, Marid (2009)
Akamine, S., Kawahara, D., Kato, Y.: WISDOM: A Web Information Credibility Analysis System. In: Proceedings of the ACL-IJCNLP, Suntec, Singapore, pp. 1–4 (2009)
Zhu, J., Wang, C., He, X., Bu, J., Chen, C., Qu, M., Lu, G.: Tag-Oriented Document Summarization. In: The 18th International Conference on World Wide Web (WWW 2009), pp. 1195–1196. ACM Press, Marid (2009)
Xu, Z., Fu, Y., Mao, J., Su, D.: Towards the Semantic Web: Collaborative Tag Suggestions. In: The 15th International Conference on World Wide Web (WWW 2006). ACM Press, Edinburgh (2006)
Markines, B., Cattuto, C., Menczer, F., Benz, D., Hotho, A., Stumme, G.: Evaluating similarity measures for emergent semantics of social tagging. In: The 18th International Conference on World Wide Web (WWW 2009), pp. 641–650. ACM Press, Marid (2009)
Jarvelin, K., Kekalainen, J.: Cumulated gain-based evaluation of IR techniques. ACM Transactions on Information Systems 20(4), 422–466 (2002)
Manning, C., Raghavan, P., Schütze, H.: Introduction to information retrieval. Cambridge University Press, Cambridge (2008)
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2011 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Gu, X., Wang, X., Li, R., Wen, K., Yang, Y., Xiao, W. (2011). Measuring Social Tag Confidence: Is It a Good or Bad Tag?. In: Wang, H., Li, S., Oyama, S., Hu, X., Qian, T. (eds) Web-Age Information Management. WAIM 2011. Lecture Notes in Computer Science, vol 6897. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-23535-1_10
Download citation
DOI: https://doi.org/10.1007/978-3-642-23535-1_10
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-642-23534-4
Online ISBN: 978-3-642-23535-1
eBook Packages: Computer ScienceComputer Science (R0)