Skip to main content

Query Performance Prediction and Classification for Information Search Systems

  • Conference paper
  • First Online:
Book cover Web and Big Data (APWeb-WAIM 2018)

Abstract

Automatic performance prediction and classification for information search results is useful in different scenarios. In this paper, we propose two score-based post-retrieval performance prediction methods. Both of them take magnitude and variance of resultant document scores into consideration at the same time. We also try to classify queries into three different classes: easy, medium, and hard by using a support vector machine-based approach. The experimental results show that the proposed predictors in this paper are very competitive compared with other predictors in the same category, and the support vector machine-based approach is effective for query classification.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 39.99
Price excludes VAT (USA)
  • Available as EPUB and PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 54.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Notes

  1. 1.

    https://en.wikipedia.org/wiki/Brier_score.

  2. 2.

    http://www.csie.ntu.edu.tw/~cjlin/libsvm/.

References

  1. Yom-Tov, E., Fine, S., Carmel, D., Darlow, A.: Learning to estimate query difficulty: including applications to missing content detection and distributed information retrieval. In: 28th SIGIR, pp. 512–519 (2005)

    Google Scholar 

  2. Lang, H., Wang, B., Li, J., et al.: Predicting query performance for text retrieval. J. Softw. 19(2), 291–300 (2008)

    Article  Google Scholar 

  3. Hauff, C., Hiemstra, D., de Jong, F.: A survey of pre-retrieval query performance predictors. In: 17th CIKM, pp. 1419–1420 (2008)

    Google Scholar 

  4. Katz, G., Shtok, A., Kurland, O., Shapira, B., Rokach, L.: Wikipedia-based query performance prediction. In: 37th SIGIR, pp. 1235–1238 (2014)

    Google Scholar 

  5. Cronen-Townsend, S., Zhou, Y., Croft, W.B.: Predicting query performance. In: 25th SIGIR, pp. 299–306 (2002)

    Google Scholar 

  6. Zhou, Y., Croft, W.B.: Ranking robustness: a novel framework to predict query performance. In: 15th CIKM, pp. 567–574 (2006)

    Google Scholar 

  7. Cummins, R., Jose, J.M., O’Riordan, C.: Improved query performance prediction using standard deviation. In: 34th SIGIR, pp. 24–28 (2011)

    Google Scholar 

  8. Zhou, Y., Croft, W.B.: Query performance prediction in web search environments. In: 30th SIGIR, pp. 543–550 (2007)

    Google Scholar 

  9. Shtok, A., Kurland, O., Carmel, D., Raiber, F., Markovits, G.: Predicting query performance by query-drift estimation. ACM Trans. Inf. Syst. 30(2), 305–312 (2009)

    Google Scholar 

  10. Tao, Y., Wu, S.: Query performance prediction by considering score magnitude and variance together. In: 23th CIKM, pp. 1891–1894 (2014)

    Google Scholar 

  11. Voorhees, E.M.: Overview of the TREC 2004 robust track. In: 23rd TREC, 13 (2004)

    Google Scholar 

  12. Cortes, C., Vapnik, V.: Support-vector networks. Mach. Learn. 20(3), 273–297 (1995)

    MATH  Google Scholar 

  13. Chang, C.-C., Lin, C.-J.: LIBSVM: a library for support vector machines. ACM Trans. Intell. Syst. Technol. 2(27), 1–27 (2011)

    Article  Google Scholar 

  14. Zhou, Y.: Retrieval performance prediction and document quality. University of Massachusetts, Massachusetts (2007)

    Google Scholar 

  15. Diaz, F., Jones, R.: Using temporal profiles of queries for precision prediction. In: 27th SIGIR, pp. 18–24 (2004)

    Google Scholar 

  16. He, B., Ounis, I.,: Inferring query performance using pre-retrieval predictors. In: 11th SPIRE, pp. 43–54 (2004)

    Chapter  Google Scholar 

  17. Vinay, V., Cox, I.J., Milic-Frayling, N., Wood, K.R.: On ranking the effectiveness of searches. In: 29th SIGIR, pp. 398–404 (2006)

    Google Scholar 

  18. Perez-Iglesias, J., Araujo, L.: Standard deviation as a query hardness estimator. In: 17th SPIRE, pp. 207–212 (2010)

    Chapter  Google Scholar 

  19. Mizzaro, S., Mothe, J.: Why do you think this query is difficult? A user study on human query prediction. In: 39th SIGIR, pp. 1073–1076 (2016)

    Google Scholar 

  20. Shtok, A., Kurland, O., Carmel, D.: Query performance prediction using reference lists. ACM Trans. Inf. Syst. 34(4), 19–52 (2016)

    Article  Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Shengli Wu .

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2018 Springer International Publishing AG, part of Springer Nature

About this paper

Check for updates. Verify currency and authenticity via CrossMark

Cite this paper

Zhang, Z., Chen, J., Wu, S. (2018). Query Performance Prediction and Classification for Information Search Systems. In: Cai, Y., Ishikawa, Y., Xu, J. (eds) Web and Big Data. APWeb-WAIM 2018. Lecture Notes in Computer Science(), vol 10987. Springer, Cham. https://doi.org/10.1007/978-3-319-96890-2_23

Download citation

  • DOI: https://doi.org/10.1007/978-3-319-96890-2_23

  • Published:

  • Publisher Name: Springer, Cham

  • Print ISBN: 978-3-319-96889-6

  • Online ISBN: 978-3-319-96890-2

  • eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics