Abstract
The aim of a Personalised Ranking Model (PRM) is to filter the top-k set of documents from a number of relevant documents matching the search query. Dwell times of previously clicked results have been shown to be valuable for estimating documents’ relevance. The indexing structure of the dwell time is an important parameter. We propose a dwell time-based scoring scheme called Dwell-tf-idf to index text and non-text data, based on which search results are ranked. The effectiveness of incorporating into the ranking process the proposed Dwell-tf-idf scheme is validated by a controlled experiment which shows a significant improvement in the search results within the top-k rank.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Notes
- 1.
- 2.
- 3.
The terms ‘F-Measure’ and ‘F-Score’ are used interchangeably for convenience throughout this paper as in some literature reviews.
References
Kelly, D., Belkin, N.J.: Display time as implicit feedback: understanding task effects. In: Proceedings of the 27th Annual International SIGIR Conference on Research and Development in Information Retrieval, pp. 377–384, ACM (2004)
Agichtein, E., Brill, E., Dumais, S.: Improving web search ranking by incorporating user behaviour information. In: Proceedings of the 29th Annual International SIGIR Conference on Research and Development in Information Retrieval, pp. 19–26, ACM (2006)
Collins-Thompson, K., Bennett, P.N., White, R.W., De La Chica, S., Sontag, D.: Personalising web search results by reading level. In: Proceedings of the 20th International Conference on Information and Knowledge Management, pp. 403–412, ACM (2011)
Hassan, A., White, R.W.: Personalised models of search satisfaction. In: Proceedings of the 22nd International Conference on Information and Knowledge Management, pp. 2009–2018, ACM (2013)
Al Sharji, S., Beer, M., Uruchurtu, E.: Enhancing the degree of personalisation through vector space model and profile ontology. In: IEEE RIVF International Conference Computing and Communication Technologies, Research, Innovation, and Vision for the Future (RIVF), pp. 248–252, IEEE (2013)
Xu, S., Jiang, H., Lau, F.: Mining user dwell time for personalised web search re-ranking. In: Proceedings of the 22nd International Joint Conference on Artificial Intelligence, vol. 3, pp. 2367–2372, AAAI Press (2011)
Khodaei, A., Shahabi, C., Li, C.: SKIF-P: a point-based indexing and ranking of web documents for spatial-keyword search. GeoInformatica 16(3), 563–596 (2012). GeoInformatica
Zobel, J., Moffat, A.: Inverted files for text search engines. Comput. Surv. (CSUR) 38(2), 6 (2006). ACM
Jiang, D., Pei, J., Li, H.: Mining search and browse logs for web search: a survey. Trans. Intell. Syst. Technol. (TIST) 4(4), 57 (2013). ACM
Guo, Q., Agichtein, E.: Beyond dwell time: estimating document relevance from cursor movements and other post-click searcher behaviour. In: Proceedings of the 21st International Conference on World Wide Web, pp. 569–578, ACM (2012)
Silverstein, C., Marais, H., Henzinger, M., Moricz, M.: Analysis of a very large web search engine query log. SIGIR Forum 33(1), 6–12 (1999). ACM
Manning, C.D., Raghavan, P., Schütze, H.: Introduction to Information Retrieval. Cambridge University Press, Cambridge (2008)
Hassan, A., Jones, R., Klinkner, K.L.: Beyond DCG: user behaviour as a predictor of a successful search. In: Proceedings of the 3rd International Conference on Web Search and Data Mining, pp. 221–230, ACM (2010)
Salton, G.: Automatic text processing: the transformation, analysis, and retrieval of information by computer. Addison-Wesley (1989)
Hu, Y., Qian, Y., Li, H., Jiang, D., Pei, J., Zheng, Q.: Mining query subtopics from search log data. In: Proceedings of the 35th International SIGIR Conference on Research and Development in Information Retrieval, pp. 305–314, ACM (2012)
Ageev, M., Guo, Q., Lagun, D., Agichtein, E.: Find it if you can: a game for modeling different types of web search success using interaction data. In: Proceedings of the 34th International SIGIR Conference on Research and Development in Information Retrieval, pp. 345–354, ACM (2011)
Elsweiler, D., Ruthven, I.: Towards task-based personal information management evaluations. In: Proceedings of the 30th Annual International SIGIR Conference on Research and Development in Information Retrieval, pp. 23–30, ACM (2007)
Chia-Jung, L., Teevan, J., De La Chica, S.: Characterising multi-click search behavior and the risks and opportunities of changing results during use. In: Proceedings of the 37th International SIGIR Conference on Research and Development in Information Retrieval, pp. 515–524, ACM (2014)
Bengio, Y., Grandvalet, Y.: No unbiased estimator of the variance of k-fold cross-validation. J. Mach. Learn. Res. 5, 1089–1105 (2004)
Acknowledgement
The authors extend their sincere thanks to the Dean, the Head of ETC and staff at the NCT in Oman for their cooperation and support during the data collection.
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2015 Springer International Publishing Switzerland
About this paper
Cite this paper
Al-Sharji, S., Beer, M., Uruchurtu, E. (2015). A Dwell Time-Based Technique for Personalised Ranking Model. In: Chen, Q., Hameurlain, A., Toumani, F., Wagner, R., Decker, H. (eds) Database and Expert Systems Applications. Globe DEXA 2015 2015. Lecture Notes in Computer Science(), vol 9262. Springer, Cham. https://doi.org/10.1007/978-3-319-22852-5_18
Download citation
DOI: https://doi.org/10.1007/978-3-319-22852-5_18
Published:
Publisher Name: Springer, Cham
Print ISBN: 978-3-319-22851-8
Online ISBN: 978-3-319-22852-5
eBook Packages: Computer ScienceComputer Science (R0)