XTree: A New XML Keyword Retrieval Model

Ji, Cong-Rui; Deng, Zhi-Hong; Xiang, Yong-Qing; Yu, Hang; Tang, Shi-Wei

doi:10.1007/978-3-642-03996-6_16

Part of the book series: Lecture Notes in Computer Science (LNISA, volume 5731)

Abstract

As more and more data are represented and stored in XML format, how to query XML data has become an increasingly important research issue. Keyword search is a proven, user-friendly way of querying HTML documents, and it is well suited to XML trees as well. However, two problems remain open in XML keyword retrieval: which XML nodes are meaningful and reasonable answers to a query, and how to find these nodes effectively and efficiently. In recent years, many XML keyword retrieval models, such as XRANK and SLCA, have been proposed to address these problems. These models usually return only the most specific results and discard most ancestor nodes, so the returned results may not contain enough information for users to understand them easily. In this paper, we present a new XML keyword retrieval model, XTree, which covers every keyword node and returns comprehensive result trees. For the XTree model, we propose the Xscan algorithm for processing keyword queries and the GenerateTree algorithm for constructing results. We evaluate the performance of our algorithms both analytically and experimentally, and the experiments show that they are efficient.
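To make the "most specific results" behaviour of the earlier models concrete, the following is a minimal brute-force sketch of SLCA (smallest lowest common ancestor) semantics. It is not the XTree model, nor the Xscan or GenerateTree algorithms, whose details the abstract does not give. The Dewey-style labels (tuples of child positions from the root) and the function names `lca`, `is_proper_ancestor`, and `slca` are assumptions made for illustration only.

```python
from itertools import product

def lca(labels):
    """Lowest common ancestor of Dewey labels = their longest common prefix."""
    prefix = []
    for parts in zip(*labels):
        if len(set(parts)) != 1:
            break
        prefix.append(parts[0])
    return tuple(prefix)

def is_proper_ancestor(a, b):
    """True if node a is a strict ancestor of node b."""
    return len(a) < len(b) and b[:len(a)] == a

def slca(keyword_lists):
    """Brute-force SLCA: take the LCA of every combination of one node per
    keyword, then drop any LCA that has another LCA below it."""
    candidates = {lca(combo) for combo in product(*keyword_lists)}
    return sorted(
        c for c in candidates
        if not any(is_proper_ancestor(c, other) for other in candidates)
    )

# Hypothetical document: root (0,) with two <paper> children (0,0) and (0,1).
# Each paper has a title (child 0) and an author (child 1).
xml_nodes = [(0, 0, 0), (0, 1, 0)]   # titles containing "xml"
deng_nodes = [(0, 0, 1)]             # author containing "deng"

print(slca([xml_nodes, deng_nodes]))  # [(0, 0)]
```

In this toy document the root (0,) also contains both keywords, but SLCA discards it because the first paper (0, 0) is a more specific match. This is the discarding of ancestor nodes that the abstract describes as leaving users with too little context.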

Supported by the National High-Tech Research and Development Plan of China under Grant No.2009AA01Z136.

References

Clark, J., DeRose, S.: XML Path Language (XPath) 1.0 (November 1999), http://www.w3.org/TR/xpath
XQuery 1.0: An XML Query Language (June 2001), http://www.w3.org/XML/Query
Shanmugasundaram, J., Tufte, K., Zhang, C., Gang, H., DeWitt, D.J., Naughton, J.F.: Relational databases for querying XML documents: Limitations and opportunities. In: VLDB (1999)
Kanne, C.C., Moerkotte, G.: Efficient Storage of XML Data. In: ICDE (2000)
Schmidt, A., Kersten, M.L., Windhouwer, M., Waas, F.: Efficient relational storage and retrieval of XML documents. In: Suciu, D., Vossen, G. (eds.) WebDB 2000. LNCS, vol. 1997, p. 137. Springer, Heidelberg (2001)
Cooper, B.F., Sample, N., Franklin, M.J., Hjaltason, G.R., Shadmon, M.: A Fast Index for Semistructured Data. In: VLDB (2001)
Chien, S.Y., Vagena, Z., Zhang, D.H., Tsotras, V.J., Zaniolo, C.: Efficient Structural Joins on Indexed XML Documents. In: VLDB (2002)
McHugh, J., Widom, J., Abiteboul, S., Luo, Q., Rajaraman, A.: Indexing Semistructured Data. Technical Report (1998)
Bohannon, P., Freire, J., Roy, P., Simeon, J.: From XML schema to relations: A cost-based approach to XML storage. In: ICDE (2002)
Cho, J., Rajagopalan, S.: A fast regular expression indexing engine. In: ICDE (2002)
Guo, L., Shao, F., Botev, C., Shanmugasundaram, J.: XRANK: Ranked Keyword Search over XML Documents. In: Proceedings of SIGMOD, pp. 16–27 (2003)
Xu, Y., Papakonstantinou, Y.: Efficient Keyword Search for Smallest LCAs in XML Databases. In: Proceedings of SIGMOD, pp. 527–538 (2005)
Liu, Z., Chen, Y.: Identifying Meaningful Return Information for XML Keyword Search. In: Proceedings of SIGMOD, pp. 329–340 (2007)
Xerces-C, http://xerces.apache.org/xerces-c/
Berkeley DB, http://www.oracle.com/technology/products/berkeley-db/index.html

Author information

Authors and Affiliations

Key Laboratory of Machine Perception (Ministry of Education), School of Electronics Engineering and Computer Science, Peking University, Beijing, 100871, China
Cong-Rui Ji, Zhi-Hong Deng, Yong-Qing Xiang, Hang Yu & Shi-Wei Tang

Editor information

Editors and Affiliations

Hong Kong University of Science and Technology, Hong Kong
Lei Chen
Swinburne University of Technology, Melbourne, Australia
Chengfei Liu
Renmin University of China, China
Xiao Zhang
Renmin University of China, China
Shan Wang
Dept. of Industrial Economics and Technology Management, NTNU, Norway
Darijus Strasunskas
NTNU, Norway
Stein L. Tomassen
AOL, China
Jinghai Rao
SAP Research China, China
Wen-Syan Li
Comp. Sci. and Eng. Dept., Arizona State University, 85287, Tempe, AZ
K. Selçuk Candan
Dickson Computer Systems, 7A Victory Avenue 4th floor, Homantin, Kln, P.O. Box, Hong Kong
Dickson K. W. Chiu
Zhejiang Gongshang University, China
Yi Zhuang
University of Colorado at Boulder, USA
Clarence A. Ellis
Kyonggi University, Korea
Kwang-Hoon Kim

About this paper

Cite this paper

Ji, CR., Deng, ZH., Xiang, YQ., Yu, H., Tang, SW. (2009). XTree: A New XML Keyword Retrieval Model. In: Chen, L., et al. Advances in Web and Network Technologies, and Information Management. APWeb/WAIM 2009. Lecture Notes in Computer Science, vol 5731. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-03996-6_16

DOI: https://doi.org/10.1007/978-3-642-03996-6_16
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-642-03995-9
Online ISBN: 978-3-642-03996-6
eBook Packages: Computer Science, Computer Science (R0)
