Query Performance Prediction and Classification for Information Search Systems

Zhang, Zhongmin; Chen, Jiawei; Wu, Shengli

doi:10.1007/978-3-319-96890-2_23

Zhongmin Zhang¹⁶,
Jiawei Chen¹⁶ &
Shengli Wu¹⁶

Part of the book series: Lecture Notes in Computer Science ((LNISA,volume 10987))

Included in the following conference series:

Asia-Pacific Web (APWeb) and Web-Age Information Management (WAIM) Joint International Conference on Web and Big Data

1324 Accesses
1 Citations

Abstract

Automatic performance prediction and classification for information search results is useful in different scenarios. In this paper, we propose two score-based post-retrieval performance prediction methods. Both of them take magnitude and variance of resultant document scores into consideration at the same time. We also try to classify queries into three different classes: easy, medium, and hard by using a support vector machine-based approach. The experimental results show that the proposed predictors in this paper are very competitive compared with other predictors in the same category, and the support vector machine-based approach is effective for query classification.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Log in via an institution

Chapter: USD 29.95; Price excludes VAT (USA)

eBook: USD 39.99; Price excludes VAT (USA)

Softcover Book: USD 54.99; Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Notes

References

Yom-Tov, E., Fine, S., Carmel, D., Darlow, A.: Learning to estimate query difficulty: including applications to missing content detection and distributed information retrieval. In: 28th SIGIR, pp. 512–519 (2005)
Google Scholar
Lang, H., Wang, B., Li, J., et al.: Predicting query performance for text retrieval. J. Softw. 19(2), 291–300 (2008)
Article Google Scholar
Hauff, C., Hiemstra, D., de Jong, F.: A survey of pre-retrieval query performance predictors. In: 17th CIKM, pp. 1419–1420 (2008)
Google Scholar
Katz, G., Shtok, A., Kurland, O., Shapira, B., Rokach, L.: Wikipedia-based query performance prediction. In: 37th SIGIR, pp. 1235–1238 (2014)
Google Scholar
Cronen-Townsend, S., Zhou, Y., Croft, W.B.: Predicting query performance. In: 25th SIGIR, pp. 299–306 (2002)
Google Scholar
Zhou, Y., Croft, W.B.: Ranking robustness: a novel framework to predict query performance. In: 15th CIKM, pp. 567–574 (2006)
Google Scholar
Cummins, R., Jose, J.M., O’Riordan, C.: Improved query performance prediction using standard deviation. In: 34th SIGIR, pp. 24–28 (2011)
Google Scholar
Zhou, Y., Croft, W.B.: Query performance prediction in web search environments. In: 30th SIGIR, pp. 543–550 (2007)
Google Scholar
Shtok, A., Kurland, O., Carmel, D., Raiber, F., Markovits, G.: Predicting query performance by query-drift estimation. ACM Trans. Inf. Syst. 30(2), 305–312 (2009)
Google Scholar
Tao, Y., Wu, S.: Query performance prediction by considering score magnitude and variance together. In: 23th CIKM, pp. 1891–1894 (2014)
Google Scholar
Voorhees, E.M.: Overview of the TREC 2004 robust track. In: 23rd TREC, 13 (2004)
Google Scholar
Cortes, C., Vapnik, V.: Support-vector networks. Mach. Learn. 20(3), 273–297 (1995)
MATH Google Scholar
Chang, C.-C., Lin, C.-J.: LIBSVM: a library for support vector machines. ACM Trans. Intell. Syst. Technol. 2(27), 1–27 (2011)
Article Google Scholar
Zhou, Y.: Retrieval performance prediction and document quality. University of Massachusetts, Massachusetts (2007)
Google Scholar
Diaz, F., Jones, R.: Using temporal profiles of queries for precision prediction. In: 27th SIGIR, pp. 18–24 (2004)
Google Scholar
He, B., Ounis, I.,: Inferring query performance using pre-retrieval predictors. In: 11th SPIRE, pp. 43–54 (2004)
Chapter Google Scholar
Vinay, V., Cox, I.J., Milic-Frayling, N., Wood, K.R.: On ranking the effectiveness of searches. In: 29th SIGIR, pp. 398–404 (2006)
Google Scholar
Perez-Iglesias, J., Araujo, L.: Standard deviation as a query hardness estimator. In: 17th SPIRE, pp. 207–212 (2010)
Chapter Google Scholar
Mizzaro, S., Mothe, J.: Why do you think this query is difficult? A user study on human query prediction. In: 39th SIGIR, pp. 1073–1076 (2016)
Google Scholar
Shtok, A., Kurland, O., Carmel, D.: Query performance prediction using reference lists. ACM Trans. Inf. Syst. 34(4), 19–52 (2016)
Article Google Scholar

Download references

Author information

Authors and Affiliations

Jiangsu University, Zhenjiang, 212013, China
Zhongmin Zhang, Jiawei Chen & Shengli Wu

Authors

Zhongmin Zhang
View author publications
You can also search for this author in PubMed Google Scholar
Jiawei Chen
View author publications
You can also search for this author in PubMed Google Scholar
Shengli Wu
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Shengli Wu .

Editor information

Editors and Affiliations

South China University of Technology, Guangzhou, China
Yi Cai
Nagoya University, Nagoya, Japan
Yoshiharu Ishikawa
Hong Kong Baptist University, Kowloon Tong, Hong Kong, China
Jianliang Xu

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Zhang, Z., Chen, J., Wu, S. (2018). Query Performance Prediction and Classification for Information Search Systems. In: Cai, Y., Ishikawa, Y., Xu, J. (eds) Web and Big Data. APWeb-WAIM 2018. Lecture Notes in Computer Science(), vol 10987. Springer, Cham. https://doi.org/10.1007/978-3-319-96890-2_23

Download citation

DOI: https://doi.org/10.1007/978-3-319-96890-2_23
Published: 19 July 2018
Publisher Name: Springer, Cham
Print ISBN: 978-3-319-96889-6
Online ISBN: 978-3-319-96890-2
eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics