Abstract
In this paper, we focus on the efficient processing of a given XML keyword query based on SLCA semantics. We assign to each node an ID equal to its visiting order when traversing the XML document in depth-first order, and on this basis we construct two kinds of indexes. The first index is an inverted list L of IDDewey labels for each keyword k, where each IDDewey label \(l \in L\) represents a node v that directly contains k, and l consists of the node IDs of all nodes on the path from the document root to v. The second index is a hash table that records, for each pair of node v and keyword k, the number of occurrences of k in the subtree rooted at v. Based on the two indexes, we propose an algorithm, namely TDHS, that takes the shortest inverted IDDewey label list as the working list and computes all SLCA results in a top-down manner based on hash search. Compared with existing methods, our method achieves a worst-case time complexity of \(O(m\cdot|L^{ID}_1|)\) for a given keyword query Q, where \(|L^{ID}_1|\) is the number of distinct node IDs in the shortest inverted IDDewey label list of Q. Our experimental results verify the performance advantages of our method according to various evaluation metrics.
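The two indexes and the top-down idea can be illustrated with a small Python sketch. It is not the authors' TDHS implementation, whose details the abstract does not give; it is a simplified illustration under stated assumptions. The function names `build_indexes` and `slca`, the sample document, and the choice to treat only an element's leading text as "directly contained" keywords are all hypothetical. The sketch walks each label of the shortest inverted list from the root downward, probes the hash table to test whether a node's subtree contains every keyword, and keeps the deepest such nodes.

```python
import xml.etree.ElementTree as ET
from collections import defaultdict

def build_indexes(root):
    """Build the two indexes described in the abstract (simplified).

    inverted[k]: IDDewey labels (tuples of node IDs from root to v) of the
                 nodes v whose own text contains keyword k.
    counts[(v, k)]: occurrences of k in the subtree rooted at node ID v.
    Assumption: only element.text is tokenized; tail text is ignored.
    """
    inverted = defaultdict(list)
    counts = defaultdict(int)
    next_id = 1  # IDs follow depth-first (preorder) visiting order

    def visit(node, path):
        nonlocal next_id
        label = path + (next_id,)
        next_id += 1
        words = (node.text or "").lower().split()
        for k in set(words):
            inverted[k].append(label)
        for k in words:
            for ancestor_or_self in label:
                counts[(ancestor_or_self, k)] += 1
        for child in node:
            visit(child, label)

    visit(root, ())
    return dict(inverted), dict(counts)

def slca(query, inverted, counts):
    """Return node IDs of SLCA results using top-down hash probes."""
    keywords = [k.lower() for k in query]
    if not keywords or any(k not in inverted for k in keywords):
        return []
    # The shortest inverted list serves as the working list.
    working = min((inverted[k] for k in keywords), key=len)
    contains_all = {}  # cache so each distinct node ID is probed once
    candidates = []
    for label in working:
        depth = 0
        for node_id in label:  # root first, then downward
            if node_id not in contains_all:
                contains_all[node_id] = all(
                    counts.get((node_id, k), 0) > 0 for k in keywords)
            if not contains_all[node_id]:
                break
            depth += 1
        if depth:
            candidates.append(label[:depth])
    # A candidate with another candidate below it is not "smallest".
    ancestors = {a for c in candidates for a in c[:-1]}
    return sorted({c[-1] for c in candidates} - ancestors)

# Hypothetical sample document; preorder IDs are bib=1, book=2, title=3,
# author=4, book=5, title=6, author=7, paper=8, title=9.
doc = ET.fromstring(
    "<bib>"
    "<book><title>XML search</title><author>Tom</author></book>"
    "<book><title>XML index</title><author>Ann</author></book>"
    "<paper><title>keyword search</title></paper>"
    "</bib>")
inverted, counts = build_indexes(doc)
```

With this sample, `slca(["xml", "tom"], inverted, counts)` yields the first `book` node (ID 2), because it is the deepest node whose subtree contains both keywords. The cache `contains_all` mirrors the abstract's emphasis on distinct node IDs, although the sketch makes no claim about matching the stated complexity bound.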
Copyright information
© 2012 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Zhou, J., Lan, G., Chen, Z., Tang, X., Guo, J. (2012). Top-Down SLCA Computation Based on Hash Search. In: Gao, H., Lim, L., Wang, W., Li, C., Chen, L. (eds) Web-Age Information Management. WAIM 2012. Lecture Notes in Computer Science, vol 7418. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-32281-5_27
DOI: https://doi.org/10.1007/978-3-642-32281-5_27
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-642-32280-8
Online ISBN: 978-3-642-32281-5
eBook Packages: Computer Science, Computer Science (R0)