Abstract
We propose a modification to the traditional average precision metric. The proposal rests on a theoretical analysis of populations and sampling, for both topics and documents, and of the parameters with which we hope to characterise the effectiveness of different systems. The modification involves both transformation and, in the estimation of the parameter, smoothing. On a substantial dataset, the modified version is shown to have certain distributional advantages. In particular, the distribution of values of the modified metric over topics, for a given system/run, is approximately normal.
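The abstract does not state the formula for the modified metric, so the following is only an illustrative sketch and not the paper's definition. It computes traditional (non-interpolated) average precision for one topic, then applies a logit transformation with an additive smoothing constant. The function names, the choice of the logit, and the constant `eps` are all assumptions made for illustration.

```python
import math


def average_precision(ranked_relevance, num_relevant):
    """Traditional non-interpolated average precision for a single topic.

    ranked_relevance: sequence of 0/1 judgements in rank order.
    num_relevant: total number of relevant documents for the topic.
    """
    if num_relevant <= 0:
        raise ValueError("num_relevant must be positive")
    hits = 0
    precision_sum = 0.0
    for rank, rel in enumerate(ranked_relevance, start=1):
        if rel:
            hits += 1
            # Precision at this rank, counted only at relevant documents.
            precision_sum += hits / rank
    return precision_sum / num_relevant


def smoothed_logit(ap, eps=0.01):
    """Hypothetical transformation: logit of AP with additive smoothing.

    eps is an assumed constant (not taken from the paper) that keeps the
    result finite when ap is exactly 0 or 1.
    """
    return math.log((ap + eps) / (1.0 - ap + eps))


# One topic: relevant documents at ranks 1 and 3, two relevant in total.
ap = average_precision([1, 0, 1, 0], num_relevant=2)
score = smoothed_logit(ap)
```

A transformation of this kind maps values from the bounded interval [0, 1] onto the real line, which is one plausible route to a per-topic distribution that is closer to normal. Whether it matches the paper's construction would need to be checked against the full text.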
Copyright information
© 2012 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Robertson, S. (2012). On Smoothing Average Precision. In: Baeza-Yates, R., et al. Advances in Information Retrieval. ECIR 2012. Lecture Notes in Computer Science, vol 7224. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-28997-2_14
DOI: https://doi.org/10.1007/978-3-642-28997-2_14
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-642-28996-5
Online ISBN: 978-3-642-28997-2
eBook Packages: Computer Science (R0)