Abstract
Social tagging has become a very important mechanism for organizing information on the Web. Usually, people tag a web page manually, just as what they do on a social bookmarking web site. In this paper, we will demonstrate a brand-new perspective - tagging web pages automatically by mining search logs. In order to keep diversity, we first classify web queries into different categories and then extract tags from queries to depict each category. Thereafter we describe a web page with all queries which are related to this page, and finally we get the recommended tags for each web page after mapping the related queries into corresponding diverse tags. The experiments conducted on a real search log show that our method can dig out accurate and meaningful diverse tags for web pages more effectively.
Supported by NSFC under Grant No. 61073081.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Preview
Unable to display preview. Download preview PDF.
References
Belém, F.M., Martins, E.F., Almeida, J.M., Gon?alves, M.A., Pappa, G.L.: Exploiting co-occurrence and information quality metrics to recommend tags in web 2.0 applications. In: CIKM (2010)
Bing, L., Sun, B., Jiang, S., Zhang, Y., Lam, W.: Learning ontology resolution for document representation and its applications in text mining. In: CIKM (2010)
Blei, D.M., Ng, A.Y., Jordan, M.I.: Latent dirichlet allocation. The Journal of Machine Learning Research (3), 993–1022 (2003)
Bordino, I., Castillo, C., Donato, D., Gionis, A.: Query similarity by projecting the query-flow graph. In: SIGIR (2010)
Carman, M.J., Baillie, M., Gwadera, R., Crestani, F.: A statistical comparison of tag and query logs. In: SIGIR (2009)
Chien, L.-F.: Pat-tree-based keyword extraction for chinese information retrieval. In: SIGIR (1997)
Gupta, M., Li, R., Yin, Z., Han, J.: Survey on social tagging techniques. In: SIGKDD (2010)
Hu, J., Wang, G., Lochovsky, F., Sun, J.-t., Chen, Z.: Understanding user’s query intent with wikipedia. In: WWW (2009)
Jansen, B.J., Booth, D.L., Spink, A.: Determining the informational, navigational, and transactional intent of web queries. Information Processing and Management 44(3), 1251–1266 (2008)
Jiang, S., Bin, L., Sun, B., Zhang, Y., Lam, W.: Ontology enhancement and concept granularity learning: keeping yourself current and adaptive. In: SIGKDD (2011)
Liu, D., Hua, X.-S., Yang, L., Wang, M., Zhang, H.-J.: Tag ranking. In: WWW (2009)
Marlow, C., Naaman, M., Boyd, D., Davis, M.: Ht06, tagging paper, taxonomy, flickr, academic article, to read. In: HYPERTEXT (2006)
Scriver, A.D.: Semantic distance in wordnet: A simplified and improved measure of semantic relatedness. Master Thesis, University of Waterloo, Canada (2006)
Shen, D., Pan, R., Sun, J.-T., Pan, J.J., Wu, K., Yin, J., Yang, Q.: Query enrichment for web-query classification. ACM TOIS 24(3), 320–352 (2006)
Sigurbjornsson, B., van Zwol, R.: Flickr tag recommendation based on collective knowledge. In: WWW (2008)
White, R.W., Bilenko, M., Cucerzan, a.S.: Studying the use of popular destinations to enhance web search interaction. In: SIGIR (2007)
Xu, Z., Fu, Y., Mao, J., Su, D.: Towards the semantic web: Collaborative tag suggestions. In: WWW (2006)
Zhou, D., Bian, J., Zheng, S., Zha, H., Giles, C.L.: Exploring social annotations for information retrieval. In: WWW (2009)
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2012 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Yan, L., Huang, C., Zhang, Y. (2012). Actively Mining Search Logs for Diverse Tags. In: Hou, Y., Nie, JY., Sun, L., Wang, B., Zhang, P. (eds) Information Retrieval Technology. AIRS 2012. Lecture Notes in Computer Science, vol 7675. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-35341-3_48
Download citation
DOI: https://doi.org/10.1007/978-3-642-35341-3_48
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-642-35340-6
Online ISBN: 978-3-642-35341-3
eBook Packages: Computer ScienceComputer Science (R0)