Skip to main content

Categorizing Search Results Using WordNet and Wikipedia

  • Conference paper
Web-Age Information Management (WAIM 2012)

Part of the book series: Lecture Notes in Computer Science ((LNISA,volume 7418))

Included in the following conference series:

Abstract

Terms used in search queries often have multiple meanings and usages. Consequently, search results corresponding to different meanings or usages may be retrieved, making identifying relevant results inconvenient and time-consuming. In this paper, we study the problem of grouping the search results based on the different meanings and usages of a query. We build on a previous work that identifies and ranks possible categories of any user query based on the meanings and common usages of the terms and phrases within the query. We use these categories to group search results. In this paper, we study different methods, including several new methods, to assign search result record (SRRs) to the categories. Our SRR grouping framework supports a combination of categorization, clustering and query rewriting techniques. Our experimental results show that some of our grouping methods can achieve high accuracy.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 39.99
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 54.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.

References

  1. Agrawal, R., Gollapudi, S., Halverson, A., Ieong, S.: Diversifying search results. In: Proc. 2nd ACM Intl. Conf. on Web Search and Data Mining (2009)

    Google Scholar 

  2. Carpineto, C., Osinski, S., Romano, G., Weiss, D.: A survey of web clustering engines. ACM Computing Surveys 41(3), Article No. 17 (2009)

    Google Scholar 

  3. He, M., Cutler, M., Wu, K.: Categorizing Queries by Topic Directory. In: WAIM Conference, pp. 278–284 (2008)

    Google Scholar 

  4. Hemayati, R., Meng, W., Yu, C.: Semantic-Based Grouping of Search Engine Results Using WordNet. In: Dong, G., Lin, X., Wang, W., Yang, Y., Yu, J.X. (eds.) APWeb/WAIM 2007. LNCS, vol. 4505, pp. 678–686. Springer, Heidelberg (2007)

    Chapter  Google Scholar 

  5. Hemayati, R.T., Meng, W., Yu, C.: Identifying and Ranking Possible Semantic and Common Usage Categories of Search Engine Queries. In: Chen, L., Triantafillou, P., Suel, T. (eds.) WISE 2010. LNCS, vol. 6488, pp. 254–261. Springer, Heidelberg (2010)

    Chapter  Google Scholar 

  6. Hemayati, R., Meng, W., Yu, C.: Categorizing Search Results. Technical report (2012), http://cs.binghamton.edu/~rtaghiz1/

  7. Hotho, A., Staab, S., Stumme, G.: WordNet Improves Text Document Clustering. In: ACM SIGIR Semantic Web Workshop (2003)

    Google Scholar 

  8. Jain, A.K., Murty, M.N.: Data Clustering: A Review. ACM Computing Surveys (1999)

    Google Scholar 

  9. Liu, F., Yu, C., Meng, W.: Personalize Web Search by Mapping User Queries to Categories. In: ACM CIKM Conference (2002)

    Google Scholar 

  10. Liu, S., Yu, C., Meng, W.: Word Sense Disambiguation in Queries. In: ACM CIKM Conference, pp. 525–532 (2005)

    Google Scholar 

  11. de Luca, E., Nürnberger, A., von-Guericke, O: Ontology-Based Semantic Online Classification of Documents: Supporting Users in Searching the Web. University of Magdeburg, Universitätsplatz 2, 39106 Magdeburg, Germany, AMR (2004)

    Google Scholar 

  12. Pitler, E., Church, K.: Using word-sense disambiguation methods to classify web queries by intent. In: Conference on Empirical Methods in NLP, vol. 3 (2009)

    Google Scholar 

  13. Santos, R.L.T., Macdonald, C., Ounis, I.: Intentaware search result diversification. In: SIGIR (2011)

    Google Scholar 

  14. de Simone, T., Kazakov, D.: Using WordNet Similarity and Antonymy Relations to Aid Document Retrieval. In: Recent Advances in Natural Language Processing, RANLP (2005)

    Google Scholar 

  15. Song, M.-H., Lim, S.Y., Kang, D.-J., Lee, S.-J.: Ontology-Based Automatic Classification of Web Documents. In: Huang, D.-S., Li, K., Irwin, G.W. (eds.) ICIC 2006, Part II. LNCS (LNAI), vol. 4114, pp. 690–700. Springer, Heidelberg (2006)

    Chapter  Google Scholar 

  16. Xing, D., Xue, G., Yang, Q., Yu, Y.: Deep Classifier: Automatically Categorizing Search Results into Large-scale Hierarchies. In: Int’l. Conf. on Web Search & Data Mining (2008)

    Google Scholar 

  17. Zamir, O., Etzioni, O.: Grouper: A Dynamic Clustering Interface to Web Search Results. In: World Wide Web Conference (1999)

    Google Scholar 

  18. Zeng, H., He, Q., Chen, Z., Ma, W.: Learning To Cluster Web Search Results. In: ACM SIGIR (2004)

    Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2012 Springer-Verlag Berlin Heidelberg

About this paper

Cite this paper

Hemayati, R.T., Meng, W., Yu, C. (2012). Categorizing Search Results Using WordNet and Wikipedia. In: Gao, H., Lim, L., Wang, W., Li, C., Chen, L. (eds) Web-Age Information Management. WAIM 2012. Lecture Notes in Computer Science, vol 7418. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-32281-5_18

Download citation

  • DOI: https://doi.org/10.1007/978-3-642-32281-5_18

  • Publisher Name: Springer, Berlin, Heidelberg

  • Print ISBN: 978-3-642-32280-8

  • Online ISBN: 978-3-642-32281-5

  • eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics