Skip to main content
Log in

RPE query processing and optimization techniques for XML databases

  • Knowledge and Data Processing
  • Published:
Journal of Computer Science and Technology Aims and scope Submit manuscript

Abstract

Anextent join to compute path expressions containing parent-children and ancestor-descendent operations and two path expression optimization rules,path-shortening andpath-complementing, are presented in this paper.Path-shortening reduces the number of joins by shortening the path whilepath-complementing optimizes the path execution by, using an equivalent complementary path expression to compute the original one. Experimental results show that the algorithms proposed are more efficient than traditional algorithms.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Similar content being viewed by others

References

  1. Florescu D, Kossmann D. A performance evaluation of alternative mapping schemes for storing XML data in a relational database. INRIA Tech. Report, INRIA, No.3680, 1999.

  2. Florescu D, Kossmann D. Storing and querying XML data using an RDBMS.IEEE Data Engineering Bulletin, 1999, 3: 27–34.

    Google Scholar 

  3. Deutsch A, Fernandez M, Suciu D. Storing semistructured data with STORED. InProc. the 1999 SIGMOD Conference, Philadelphia, USA, 1999, pp.431–442.

  4. Manolescu I, Florescu D, Kossmann Det al. Agora: Living with XML and relational InProc. the 26th VLDB Conference, Cairo, Egypt, 2000, pp.623–626.

  5. Schmidt A, Kersten M, Windhouwer Met al. Efficient relational storage and retrieval of XML documents. InProc. the 3rd International Workshop WebDB, Dallas, USA, 2000, pp.137–150.

  6. Zhou A, Lu H, Zheng Set al. VXMLR: A visual XML-relational database system. InProc. the 27th VLDB Conference, Roma, Italy, 2001, pp.719–720.

  7. Fegaras L, Elmasri R. Query engines for Web-accessible XML data. InProc. the 27th VLDB Conference, Roma, Italy, 2001, pp.251–260.

  8. Hou J, Zhang Y, Kambayashi Y. Object-oriented representation for XML data. InProc. the 3rd CODAS Conference, Beijing, China, 2001, pp.43–52.

  9. Renner A. XML data and object databases: A perfect couple? InProc. the 17th ICDE Conference, Heidelberg, 2001, pp.143–148.

  10. McHugh J, Abiteboul S, Goldman Ret al. Lore: A database management system for semistructured data.SIGMOD Record, 1997, 3: 54–66.

    Article  Google Scholar 

  11. Shoens K, Luniewski A, Schwarz Pet al. The Rufus system: Information organization for semi-structured data. InProc. the 19th VLDB Conference, Dublin, 1993, pp.97–107.

  12. Fernandez M. Florescu D, Kang Jet al. Catching the boat with Strudel: Experiences with a Web-site management system. InProc. the 1998 SIGMOD Conference, Seattle, USA, 1998, pp.414–425.

  13. Schoning H. Tamino — A DBMS designed for XML. InProc. the 17th ICDE Conference, Heidelberg, Germany, 2001, pp.149–154.

  14. Goldman R, Widom J. DataGuides: Enabling query formulation and optimization in semistructured databases. InProc. the 23rd VLDB conference, Athens, Greece, 1997, pp.436–445.

  15. Milo T, Suciu D. Index structures for path expressions. InProc. the International Conference on Database Theory, Jerusalem, Israel, 1999, pp.277–295.

  16. Li Q, Moon B. Indexing and querying XML data for regular path expressions. InProc. the 27th VLDB Conference, Roma, Italy, 2001, pp.361–370.

  17. Chamberlin D, Robie J, Florescu D.: Quilt: An XML query language for heterogeneous data sources. InProc. 3rd International Workshop WebDB, Dallas, 2000, pp.1–25.

  18. Fankhauser P. XQuery formal semantics: State and challenges.SIGMOD Record, 2001, 3: 14–19.

    Article  Google Scholar 

  19. Deutsch A, Fernandez M, Florescu Det al. Xmlql: A query language for XML. 1999, http://www.w3.org/TR/NOTE-xml-ql/.

  20. Robie J, Lapp J, Schach D. XML query language (XQL). 1998, http://www.w3.org/TandS/QL/QL98/cfp.

  21. Cark J, DeRose S. XMP path language (XPath). Technical Report REC-xpath-19991116, W3C, 1999.

  22. Abiteboul S, Quass D, McHugh Jet al. The Lorel query language for semistructured data.International Journal on Digital Libraries, 1997, 1: 68–88.

    Article  Google Scholar 

  23. Schmidt A, Waas M, Kersten M Let al. XMark: A benchmark for XML data management. InProc. 28th VLDB Conference. Hong Kong, China, 2002, pp.974–985.

  24. Wang G, Lu H, Yu G, Bao Y. Managing very large document collections using semantics.Journal of Computer Science and Technology, 2003, 18(3): 403–406.

    Article  Google Scholar 

  25. Cattel R G G, Barry D, Berier Met al. The Object Data Standard: ODMG 3.0. Morgan Kaufmann, 2000.

  26. Rahm E, Bohme T. XMach-1: Multi-user evaluation of XML data management systems with XMach-1. InProc. 1st VLDB Workshop on Efficiency and Effectiveness of XML Tools, and Techniques (EEXTT2002), Hong Kong, China, 2002, pp.148–158.

Download references

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Guo-Ren Wang.

Additional information

This research is partially supported by the National Natural Science Foundation of China (Grant No.60273079) and the Teaching and Research Award Program for Outstanding Young Teachers in High Education Institution of the Ministry of Education, China.

Guo-Ren Wang is a professor at Northeastern University, China. He received his B.E. degree, M.E. degree and Ph.D. degree from Northeasern University in 1988, 1991 and 1996, respectively. His research interests include XML data management, query processing and optimization, bioinformatics, high-dimensional indexing, and parallel database systems.

Bing Sun is a Ph.D. candidate at Northeastern University, China. His research interest include XML data management, query processing and optimization.

Jian-Hua Lv is a Ph.D. candidate at Northeastern University, China. His research interests include XML data management, query processing and optimization.

Ge Yu is a professor at Northeastern University, China, and a supervisor of Ph.D. students. He received his B.E. degree and M.E. degree from Northeasern University in 1982 and 1986, respectively, and his Ph.D. degree from Kyushu University, Japan in 1996. He is a member of IPSJ, ACM, and ACM SIGMOD. His research interests include distributed and parallel databases system, objected-oriented database system, multi-database and information integration, data warehousing and data mining, transactional workflow management, and Web-service.

Rights and permissions

Reprints and permissions

About this article

Cite this article

Wang, GR., Sun, B., Lv, JH. et al. RPE query processing and optimization techniques for XML databases. J. Comput. Sci. & Technol. 19, 224–237 (2004). https://doi.org/10.1007/BF02944801

Download citation

  • Received:

  • Revised:

  • Issue Date:

  • DOI: https://doi.org/10.1007/BF02944801

Keywords

Navigation