Skip to main content

A Learning Approach to Hierarchical Search Result Diversification

  • Conference paper
  • First Online:
Web and Big Data (APWeb-WAIM 2017)

Part of the book series: Lecture Notes in Computer Science ((LNISA,volume 10367))

  • 1717 Accesses

Abstract

The queries in search engine that issued by users are often ambiguous. By returning diverse ranking results we can satisfy different information needs as far as possible. Recently, a hierarchical structure are proposed to represent user intents instead of a flat list of subtopics. Although the hierarchical diversification model performs better than previous models, it utilizes a predefined function to calculate the diversity score, which may not reach the optimal result. The model’s parameters need to be tuned manually and repeatedly without intention, which cause a time-consuming problem. In this paper, we introduce a learning based hierarchical diversification model. Benefit from the learning model, the parameter values are determined automatically and more optimal. Experiments show that our approach outperform several existing diversification models significantly.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 39.99
Price excludes VAT (USA)
  • Available as EPUB and PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 54.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Notes

  1. 1.

    http://www.lemurproject.org/clueweb09.php/.

  2. 2.

    http://www.lemurproject.org/indri.php.

  3. 3.

    http://projects.yisongyue.com/svmdiv/.

References

  1. Agrawal, R., Gollapudi, S., Halverson, A., Ieong, S.: Diversifying search results. In: Proceedings of the Second ACM International Conference on Web Search and Data Mining, pp. 5–14. ACM (2009)

    Google Scholar 

  2. Carbonell, J., Goldstein, J.: The use of MMR, diversity-based reranking for reordering documents and producing summaries. In: Proceedings of the 21st Annual International ACM SIGIR Conference on Research and Development in Information Retrieval, pp. 335–336. ACM (1998)

    Google Scholar 

  3. Chapelle, O., Metlzer, D., Zhang, Y., Grinspan, P.: Expected reciprocal rank for graded relevance. In: Proceedings of the 18th ACM Conference on Information and Knowledge Management, pp. 621–630. ACM (2009)

    Google Scholar 

  4. Clarke, C.L., Craswell, N., Soboroff, I.: Overview of the TREC 2009 web track. Technical report, DTIC Document (2009)

    Google Scholar 

  5. Clarke, C.L., Craswell, N., Soboroff, I.: Preliminary report on the TREC 2009 web track. In: Proceeding of TREC (2009)

    Google Scholar 

  6. Clarke, C.L., Kolla, M., Cormack, G.V., Vechtomova, O., Ashkan, A., Büttcher, S., MacKinnon, I.: Novelty and diversity in information retrieval evaluation. In: Proceedings of the 31st Annual International ACM SIGIR Conference on Research and Development in Information Retrieval, pp. 659–666. ACM (2008)

    Google Scholar 

  7. Clarke, C.L.A., Kolla, M., Vechtomova, O.: An effectiveness measure for ambiguous and underspecified queries. In: Azzopardi, L., Kazai, G., Robertson, S., Rüger, S., Shokouhi, M., Song, D., Yilmaz, E. (eds.) ICTIR 2009. LNCS, vol. 5766, pp. 188–199. Springer, Heidelberg (2009). doi:10.1007/978-3-642-04417-5_17

    Chapter  Google Scholar 

  8. Collins-Thompson, K., Macdonald, C., Bennett, P., Diaz, F., Voorhees, E.M.: TREC 2014 web track overview. Technical report, DTIC Document (2015)

    Google Scholar 

  9. Dang, V., Croft, W.B.: Diversity by proportionality: an election-based approach to search result diversification. In: Proceedings of the 35th International ACM SIGIR Conference on Research and Development in Information Retrieval, pp. 65–74. ACM (2012)

    Google Scholar 

  10. Dou, Z., Hu, S., Chen, K., Song, R., Wen, J.R.: Multi-dimensional search result diversification. In: Proceedings of the Fourth ACM International Conference on Web Search and Data Mining, pp. 475–484. ACM (2011)

    Google Scholar 

  11. Drosou, M., Pitoura, E.: Search result diversification. ACM SIGMOD Rec. 39(1), 41–47 (2010)

    Article  Google Scholar 

  12. Hu, S., Dou, Z., Wang, X., Sakai, T., Wen, J.R.: Search result diversification based on hierarchical intents. In: Proceedings of the 24th ACM International on Conference on Information and Knowledge Management, pp. 63–72. ACM (2015)

    Google Scholar 

  13. Marden, J.I.: Analyzing and Modeling Rank Data. CRC Press, London (1996)

    MATH  Google Scholar 

  14. Santos, R.L., Macdonald, C., Ounis, I.: Exploiting query reformulations for web search result diversification. In: Proceedings of the 19th International Conference on World Wide Web, pp. 881–890. ACM (2010)

    Google Scholar 

  15. Vargas, S., Castells, P., Vallet, D.: Explicit relevance models in intent-oriented information retrieval diversification. In: Proceedings of the 35th International ACM SIGIR Conference on Research and Development in Information Retrieval, pp. 75–84. ACM (2012)

    Google Scholar 

  16. Yue, Y., Joachims, T.: Predicting diverse subsets using structural SVMs. In: Proceedings of the 25th International Conference on Machine Learning, pp. 1224–1231. ACM (2008)

    Google Scholar 

  17. Zhu, Y., Lan, Y., Guo, J., Cheng, X., Niu, S.: Learning for search result diversification. In: Proceedings of the 37th International ACM SIGIR Conference on Research & Development in Information Retrieval, pp. 293–302. ACM (2014)

    Google Scholar 

Download references

Ackonwloedgements

This research is supported by National Natural Science Foundation of China (Grant No. 61375054), Natural Science Foundation of Guangdong Province Grant No. 2014A030313745, Basic Scientific Research Program of Shenzhen City (Grant No. JCYJ20160331184440545), and Cross fund of Graduate School at Shenzhen, Tsinghua University (Grant No. JC20140001).

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Hai-Tao Zheng .

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2017 Springer International Publishing AG

About this paper

Cite this paper

Zheng, HT., Wang, Z., Xiao, X. (2017). A Learning Approach to Hierarchical Search Result Diversification. In: Chen, L., Jensen, C., Shahabi, C., Yang, X., Lian, X. (eds) Web and Big Data. APWeb-WAIM 2017. Lecture Notes in Computer Science(), vol 10367. Springer, Cham. https://doi.org/10.1007/978-3-319-63564-4_25

Download citation

  • DOI: https://doi.org/10.1007/978-3-319-63564-4_25

  • Published:

  • Publisher Name: Springer, Cham

  • Print ISBN: 978-3-319-63563-7

  • Online ISBN: 978-3-319-63564-4

  • eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics