Abstract
Many XML query languages use regular path expressions to query XML data. Retrieval of XML data is like retrieval of relational databases, but the difference is that relational data are stored in 2-D flat tables whereas XML data are organized in a tree-like structure. Hence, fast tree traversal is a key to XML query processing. This is commonly accomplished using indexing. In this paper, we present a design and implementation of an efficient and quick solution to indexing XML data. Our approach is based on a numbering scheme that encodes the XML elements for not only indexing to the elements but also for quick determination of the ancestor-descendant relationship between elements in the tree hierarchy. This approach also allows efficiently inserting and updating the index system, which is an improvement over several existing XML tree node numbering methods.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Preview
Unable to display preview. Download preview PDF.
References
Berkeley, D.B.: http://www.sleepycat.com/docs/ref/refs/bdb-usenix.html
Chamberlin, D., Florescu, D., Robie, J., Simon, J., Stefanescu, M.: XQuery: A Query Language for XML. W3C working Draft, Technical Report WD-xquery-20010215, World Wide Web Consortium (2001)
Clark, J., DeRose, S.: XML Path Language (XPath) Version 1.0. W3C Recommendation. Technical Report REC-xpath-19991116, World Wide Web Consortium (1999)
Cooper, B.F., Sample, N., Franklin, M.J., Hjaltason, G.R., Shadmon, M.: A Fast Index for Semistructured Data. In: Proceedings of the 27th International Conference on Very Large Databases, Rome, Italy (2001)
Dietz, P.F.: Maintaining order in a Linked List. In: Proceedings of the Fourteenth Annual ACM Symposium on Theory of Computing, San Francisco, California (1982)
Egnor, D., Lord, R.: Structured Information Retrieval Using XML, http://www.haifa.il.ibm.com/sigir00-xml/final-papers/Egnor
Katz, H.: XQEngine, http://www.fatdog.com
Kha, D., Yoshikawa, M., Uemura, S.: An XML Indexing Structure with Relative Region Coordinates. In: Proceedings of the 17th International Conference on Data Engineering, Heidelberg, Germany (2001)
Larson, R.R.: Cheshire II Project, http://cheshire.lib.berkeley.edu
Li, Q., Moon, B.: Indexing and Querying XML Data for Regular Path Expressions. In: Proceedings of the 27th International Conference on Very Large Databases, Rome, Italy (2001)
Luk, R., Chan, A., Dillon, T.S., Leong, H.V.: A survey of search engines for XML documents. In: SIGIR 2000 Workshop on XML and Information Retrieval, Athens, Greece, pp. 1–9 (2000)
McHugh, J., Widom, J., Abiteboul, S., Luo, Q., Rajaraman, A.: Indexing Semistructured Data. Technical Report, Stanford University (1998)
Rizzolo, F., Mendelzon, A.: Indexing XML Data with ToXin. In: Fourth International Workshop on the Web and Databases (in conjunction with ACM SIGMOD 2001), Santa Barbara, California (2001)
Shanmugasundaram, J., et al.: Relational Databases for Querying XML Documents: Limitations and Opportunities. In: Proc. Of the 25th International Conference on VLDB, Edinburgh, Scotland (1999)
Structured Information Manager, http://www.simdb.com
Tian, F., DeWitt, D.J., Chen, J.: The Design and Performan ce Evaluation of Alternative XML Storage Strategies: SIGMOD Record special issue on Data Management Issues in E-Commerce (2002)
Turau, V.: Making Legacy Data Accessible for XML Applications, http://www.informatik.fh-wiesbaden.de/turau/veroeff.html
Yoon, J., Raghavan, V., Chakilam, V., Kerschberg, L.: BitCube: A Three-Dimensional Bitmap Indexing for XML Documents. Journal of Intelligent Information Systems 17 (2001)
Yoshikawa, M., Amasaga, T., Shimura, T., Uemura, S.: XRel: A Path-Based Approach to Storage and Retrieval of XML Documents using Relational Databases. ACM Trans. On Internet Technology (2001)
XDisect - XML Refined Search, http://www.pybiz.com/products/xdisect
Xset Enabling XML Applications, http://www.cs.berkeley.edu
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2004 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Hu, G., Tang, C. (2004). Indexing XML Data for Path Expression Queries. In: Ramamoorthy, C.V., Lee, R., Lee, K.W. (eds) Software Engineering Research and Applications. SERA 2003. Lecture Notes in Computer Science, vol 3026. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-540-24675-6_26
Download citation
DOI: https://doi.org/10.1007/978-3-540-24675-6_26
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-21975-0
Online ISBN: 978-3-540-24675-6
eBook Packages: Springer Book Archive