Abstract
In an information retrieval system, predicting query performance, for keyword based queries is important in giving early feedback to the user which can result in an improved query which in turn results in a better query result. There exists clarity score based and ranking robustness score based techniques to solve this problem. Both these, eventhough shows good performance, suffers from high computational time needs and are post-retrieval methods. In contrast to this, there do exist several pre-retrieval parameters which can judge the query without executing it. Pre-retrieval parameters based on distribution of information in query terms, which basically depends on inverse document frequency (idf) of query terms, are shown to be good predictors. Among these, the standard-deviation of idf values of query terms is known to be better. This paper generalizes this and proposes to use joint idf for a set of terms together, than using each term’s idf individually. Empirical studies are done using some standard data sets. The parameters based on the proposed method are shown to be better than the previous method which is nothing but a special case of the proposed method.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
References
Allan J, Ballesteros L, Callan JP, Croft WB, Lu Z (1995) Recent experiments with inquery. In: Proceedings of the 4th Text Retrieval Conference, pp 49–64
Amati G, Carpineto C, Romano G (2004) Fondazione Ugo Bordoni. Query difficulty, robustness, and selective application of query expansion. In: ECIR, vol 4. Springer, pp 127–137
Amati G, Van Rijsbergen CJ (2002) Probabilistic models of information retrieval based on measuring the divergence from randomness. ACM Trans Inf Syst (TOIS) 20(4):357–389
Cronen-Townsend S, Zhou Y, Croft WB (2002) Predicting query performance. In: Proceedings of the 25th annual international ACM SIGIR conference on Research and development in information retrieval. ACM, pp 299–306
Jiawei Han, MK, Pei J (2011) Data mining: concepts and techniques: concepts and techniques. Elsevier
He B, Ounis l (2003) A study of parameter tuning for term frequency normalization. In: Proceedings of the twelfth international conference on Information and knowledge management. ACM, pp 10–16
He B, Ounis l (2004) Inferring query performance using pre-retrieval predictors. In: String processing and information retrieval. Springer, pp 43–54
Pirkola Ari, Järvelin Kalervo (2001) Employing the resolution power of search keys. J Am Soc Inf Sci Technol 52(7):575–583
Plachouras V, Ounis I, van Rijsbergen CJ, Cacheda F (2003) University of glasgow at the web track: dynamic application of hyperlink analysis using the query scope. In: TREC, vol 3, pp 636–642
Robertson SE, Walker S, Jones S, Hancock-Beaulieu MM, Gatford M, et al (1995) Okapi at trec-3. NIST SPECIAL PUBLICATION SP, pp 109–109
Zhai C, Lafferty J (2001) A study of smoothing methods for language models applied to ad hoc information retrieval. In: Proceedings of the 24th annual international ACM SIGIR conference on Research and development in information retrieval. ACM, pp 334–342
Zhou Y, Croft WB (2006) Ranking robustness: a novel framework to predict query performance. In: Proceedings of the 15th ACM international conference on Information and knowledge management. ACM, pp 567–574
Acknowledgments
This work is supported by a UGC-SERO Minor Project with Reference No. MRP-4609/14.
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2017 Springer Science+Business Media Singapore
About this paper
Cite this paper
Viswanath, P., Rohini, J., Reddy, Y.C.A.P. (2017). Query Performance Prediction Using Joint Inverse Document Frequency of Multiple Terms. In: Attele, K., Kumar, A., Sankar, V., Rao, N., Sarma, T. (eds) Emerging Trends in Electrical, Communications and Information Technologies. Lecture Notes in Electrical Engineering, vol 394. Springer, Singapore. https://doi.org/10.1007/978-981-10-1540-3_10
Download citation
DOI: https://doi.org/10.1007/978-981-10-1540-3_10
Published:
Publisher Name: Springer, Singapore
Print ISBN: 978-981-10-1538-0
Online ISBN: 978-981-10-1540-3
eBook Packages: EngineeringEngineering (R0)