Abstract
Score standardization is a necessary step for many different types of Web search tasks in which results from multiple components need to be combined or re-ranked. Some recent studies suggest that score standardization may have impact on the performance of some typical explicit search result diversification methods such as XQuAD. In this paper, we evaluate the performance of six score standardization methods. Experiments with TREC data are carried out with two typical explicit result diversification methods XQuAD and PM2. We find that the reciprocal standardization method performs better than other score standardization methods in all the cases. Furthermore, we improve the reciprocal standardization method by scaling those scores up so as to better satisfy the requirement of probability scores and obtain better results with XQuAD. We confirm that score standardization has significant impact on the performance of explicit search result diversification methods and such a fact can be used to obtain more profitable score standardization methods and result diversification methods.
This is a preview of subscription content, log in via an institution.
Buying options
Tax calculation will be finalised at checkout
Purchases are for personal use only
Learn about institutional subscriptionsNotes
- 1.
Its webpage is located at http://lemurproject.org/clueweb09/.
- 2.
Its webpage is located at http://www.lemurproject.org/indri/. Its retrieval model is based on a combination of the language modeling and inference network retrieval frameworks.
References
Carbonell, J., Goldstein, J.: The use of MMR, diversity-based reranking for reordering documents and producing summaries. In: SIGIR, pp. 335–336 (1998)
Cormack, G.V., Clarke, C.L.A., Büttcher, S.: Reciprocal rank fusion outperforms condorcet and individual rank learning methods. In: SIGIR, pp. 758–759 (2009)
Dang, V., Croft, W.B.: Diversity by proportionality: an election-based approach to search result diversification. In: SIGIR, pp. 65–74 (2012)
Lee, J.H.: Analyses of multiple evidence combination. In: SIGIR, pp. 267–276 (1997)
Montague, M., Aslam, J.A.: Relevance score standardization for meta-search. In: CIKM, pp. 427–433 (2001)
Ozdemiray, A.M., Altingovde, I.S.: Explicit search result diversification using score and rank aggregation methods. JASIST 66(6), 1212–1228 (2015)
Renda, M.E., Straccia, U.: Web meta-search: rank vs. score based rank aggregation methods. In: SAC, pp. 841–846 (2003)
Santos, R.L.T., Macdonald, C., Ounis, C.I.: Exploiting query reformulations for web search result diversification. In: WWW, pp. 881–890 (2010)
Wu, S.: Data Fusion in Information Retrieval. Springer, Berlin (2012). https://doi.org/10.1007/978-3-642-28866-1
Zhai, C.X., Cohen, W.W., Lafferty, J.: Beyond independent relevance: methods and evaluation metrics for subtopic retrieval. In: SIGIR, pp. 10–17 (2003)
Shokouhi, M., Si, L.: Federated search. Found. Trends Inf. Retr. 5(1), 1–102 (2011)
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2018 Springer Nature Switzerland AG
About this paper
Cite this paper
Zhang, Z., Xu, C., Wu, S. (2018). Evaluation of Score Standardization Methods for Web Search in Support of Results Diversification. In: Meng, X., Li, R., Wang, K., Niu, B., Wang, X., Zhao, G. (eds) Web Information Systems and Applications. WISA 2018. Lecture Notes in Computer Science(), vol 11242. Springer, Cham. https://doi.org/10.1007/978-3-030-02934-0_17
Download citation
DOI: https://doi.org/10.1007/978-3-030-02934-0_17
Published:
Publisher Name: Springer, Cham
Print ISBN: 978-3-030-02933-3
Online ISBN: 978-3-030-02934-0
eBook Packages: Computer ScienceComputer Science (R0)