Abstract
In order to make query suggestion mechanisms more efficient, it is important to have metrics that will estimate query suggestions quality well. Recently, Kharitonov et al. [7] proposed a family of metrics that showed much better alignment with user satisfaction than previously known metrics. However, they did not address the problem of computing the proposed metrics. In this paper we show that the problem can be reduced to one of the two string problems which we call Top-k and Sorted-Top-k. Given an integer k and two sets of pairwise distinct strings (queries) with weights, Q and Q test , the Top-k problem is to find, for each query q ∈ Q test , its shortest prefix q[1..i] such that q belongs to the list of k heaviest queries in Q starting with q[1..i]. The Sorted-Top-k problem is to retrieve, for each q ∈ Q test and 1 ≤ i ≤ |q|, a position of q in the sorted list of the k heaviest queries in Q starting with q[1..i]. We show several linear-time solutions to these problems and compare them experimentally.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Preview
Unable to display preview. Download preview PDF.
References
Bar-Yossef, Z., Kraus, N.: Context-sensitive query auto-completion. In: Proceedings of the 20th International Conference on World Wide Web, pp. 107–116. ACM, New York (2011)
Brodal, G.S., Fagerberg, R., Greve, M., López-Ortiz, A.: Online sorted range reporting. In: Proceedings of the 20th International Symposium on Algorithms and Computation, pp. 173–182 (2009)
Duan, H., Hsu, B.-J.P.: Online spelling correction for query completion. In: Proceedings of the 20th International Conference on World Wide Web, pp. 117–126. ACM, New York (2011)
Frederickson, G.N.: An optimal algorithm for selection in a min-heap. Inf. Comput. 104(2), 197–214 (1993)
Gusfield, D.: Algorithms on Strings, Trees and Sequences: Computer Science and Computational Biology. Cambridge University Press (1997)
Harel, D., Tarjan, R.E.: Fast algorithms for finding nearest common ancestors. SIAM J. Comput. 13(2), 338–355 (1984)
Kharitonov, E., Macdonald, C., Serdyukov, P., Ounis, I.: User model-based metrics for offline query suggestion evaluation. In: Proceedings of the 36th International ACM SIGIR Conference on Research and Development in Information Retrieval, pp. 633–642. ACM, New York (2013)
Shokouhi, M., Radinsky, K.: Time-sensitive query auto-completion. In: Proceedings of the 35th International ACM SIGIR Conference on Research and Development in Information Retrieval, pp. 601–610. ACM, New York (2012)
Vuillemin, J.: A unifying look at data structures. Commun. ACM 23(4), 229–239 (1980)
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2014 Springer International Publishing Switzerland
About this paper
Cite this paper
Loptev, A., Selugina, A., Starikovskaya, T. (2014). Simple and Efficient String Algorithms for Query Suggestion Metrics Computation. In: Moura, E., Crochemore, M. (eds) String Processing and Information Retrieval. SPIRE 2014. Lecture Notes in Computer Science, vol 8799. Springer, Cham. https://doi.org/10.1007/978-3-319-11918-2_26
Download citation
DOI: https://doi.org/10.1007/978-3-319-11918-2_26
Publisher Name: Springer, Cham
Print ISBN: 978-3-319-11917-5
Online ISBN: 978-3-319-11918-2
eBook Packages: Computer ScienceComputer Science (R0)