Abstract
Automatic performance prediction and classification for information search results is useful in different scenarios. In this paper, we propose two score-based post-retrieval performance prediction methods. Both of them take magnitude and variance of resultant document scores into consideration at the same time. We also try to classify queries into three different classes: easy, medium, and hard by using a support vector machine-based approach. The experimental results show that the proposed predictors in this paper are very competitive compared with other predictors in the same category, and the support vector machine-based approach is effective for query classification.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
References
Yom-Tov, E., Fine, S., Carmel, D., Darlow, A.: Learning to estimate query difficulty: including applications to missing content detection and distributed information retrieval. In: 28th SIGIR, pp. 512–519 (2005)
Lang, H., Wang, B., Li, J., et al.: Predicting query performance for text retrieval. J. Softw. 19(2), 291–300 (2008)
Hauff, C., Hiemstra, D., de Jong, F.: A survey of pre-retrieval query performance predictors. In: 17th CIKM, pp. 1419–1420 (2008)
Katz, G., Shtok, A., Kurland, O., Shapira, B., Rokach, L.: Wikipedia-based query performance prediction. In: 37th SIGIR, pp. 1235–1238 (2014)
Cronen-Townsend, S., Zhou, Y., Croft, W.B.: Predicting query performance. In: 25th SIGIR, pp. 299–306 (2002)
Zhou, Y., Croft, W.B.: Ranking robustness: a novel framework to predict query performance. In: 15th CIKM, pp. 567–574 (2006)
Cummins, R., Jose, J.M., O’Riordan, C.: Improved query performance prediction using standard deviation. In: 34th SIGIR, pp. 24–28 (2011)
Zhou, Y., Croft, W.B.: Query performance prediction in web search environments. In: 30th SIGIR, pp. 543–550 (2007)
Shtok, A., Kurland, O., Carmel, D., Raiber, F., Markovits, G.: Predicting query performance by query-drift estimation. ACM Trans. Inf. Syst. 30(2), 305–312 (2009)
Tao, Y., Wu, S.: Query performance prediction by considering score magnitude and variance together. In: 23th CIKM, pp. 1891–1894 (2014)
Voorhees, E.M.: Overview of the TREC 2004 robust track. In: 23rd TREC, 13 (2004)
Cortes, C., Vapnik, V.: Support-vector networks. Mach. Learn. 20(3), 273–297 (1995)
Chang, C.-C., Lin, C.-J.: LIBSVM: a library for support vector machines. ACM Trans. Intell. Syst. Technol. 2(27), 1–27 (2011)
Zhou, Y.: Retrieval performance prediction and document quality. University of Massachusetts, Massachusetts (2007)
Diaz, F., Jones, R.: Using temporal profiles of queries for precision prediction. In: 27th SIGIR, pp. 18–24 (2004)
He, B., Ounis, I.,: Inferring query performance using pre-retrieval predictors. In: 11th SPIRE, pp. 43–54 (2004)
Vinay, V., Cox, I.J., Milic-Frayling, N., Wood, K.R.: On ranking the effectiveness of searches. In: 29th SIGIR, pp. 398–404 (2006)
Perez-Iglesias, J., Araujo, L.: Standard deviation as a query hardness estimator. In: 17th SPIRE, pp. 207–212 (2010)
Mizzaro, S., Mothe, J.: Why do you think this query is difficult? A user study on human query prediction. In: 39th SIGIR, pp. 1073–1076 (2016)
Shtok, A., Kurland, O., Carmel, D.: Query performance prediction using reference lists. ACM Trans. Inf. Syst. 34(4), 19–52 (2016)
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2018 Springer International Publishing AG, part of Springer Nature
About this paper
Cite this paper
Zhang, Z., Chen, J., Wu, S. (2018). Query Performance Prediction and Classification for Information Search Systems. In: Cai, Y., Ishikawa, Y., Xu, J. (eds) Web and Big Data. APWeb-WAIM 2018. Lecture Notes in Computer Science(), vol 10987. Springer, Cham. https://doi.org/10.1007/978-3-319-96890-2_23
Download citation
DOI: https://doi.org/10.1007/978-3-319-96890-2_23
Published:
Publisher Name: Springer, Cham
Print ISBN: 978-3-319-96889-6
Online ISBN: 978-3-319-96890-2
eBook Packages: Computer ScienceComputer Science (R0)