Skip to main content

Efficient Relational Storage and Retrieval of XML Documents

  • Conference paper
  • First Online:
Book cover The World Wide Web and Databases (WebDB 2000)

Part of the book series: Lecture Notes in Computer Science ((LNCS,volume 1997))

Included in the following conference series:

Abstract

In this paper, we present a data and an execution model that allow for efficient storage and retrieval of XML documents in a relational database. The data model is strictly based on the notion of binary associations: by decomposing XML documents into small, flexible and semantically homogeneous units we are able to exploit the performance potential of vertical fragmentation. Moreover, our approach provides clear and intuitive semantics, which facilitates the definition of a declarative query algebra. Our experimental results with large collections of XML documents demonstrate the effectiveness of the techniques proposed.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 39.99
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 54.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.

References

  1. S. Abiteboul, D. Quass, J. McHugh, J. Widom, and J. L. Wiener. The Lorel Query Language for Semistructured Data. International Journal on Digital Libraries, 1(1):68–88, 1997.

    Article  Google Scholar 

  2. C. Beeri and Y. Tzaban. SAL: An Algebra for Semistructured Data and XML. In International Workshop on the Web and Databases, pages 37–42, Pennsylvania, USA, 1999.

    Google Scholar 

  3. P. A. Boncz and M. L. Kersten. MIL Primitives for Querying a Fragmented World. The VLDB Journal, 8(2):101–119, 1999.

    Article  Google Scholar 

  4. A. Bonifati and S. Ceri. Comparative Analysis of Five XML Query Languages. ACM SIGMOD Record, 1(29):68–79, 2000.

    Article  Google Scholar 

  5. J. Bosak. Sample XML documents. shakespeare.1.01.xml.zip, available at ftp://sunsite.unc.edu/pub/sun-info/standards/xml/eg/.

  6. P. Buneman, S. B. Davidson, G. G. Hillebrand, and D. Suciu. A Query Language and Optimization Techniques for Unstructured Data. In Proc. of the ACM SIGMOD Int’l. Conf. on Management of Data, pages 505–516, Montreal, Canada, 1996.

    Google Scholar 

  7. A. Deutsch, M. F. Fernandez, and D. Suciu. Storing Semistructured Data with STORED. In Proc. of the ACM SIGMOD Int’l. Conf. on Management of Data, pages 431–442, Philadephia, PA, USA, 1999.

    Google Scholar 

  8. M. Dyck. The GNU version of The Collaborative International Dictionary of English, presented in the Extensible Markup Language. Available at http://metalab.unc.edu/webster/.

  9. D. Florescu and D. Kossmann. Storing and Querying XML Data Using an RDBMS. Data Engineering Bulletin, 22(3), 1999.

    Google Scholar 

  10. R. Goldman and J. Widom. Dataguides: Enabling Query Formulation and Optimization in Semistructured Databases. In Proc. of the Int’l. Conf. on Very Large Data Bases, pages 436–445, Athens, Greece, 1997.

    Google Scholar 

  11. C. Kanne and G. Moerkotte. Efficient Storage of XML Data. In Proceedings of the 16th International Conference on Data Engineering, page 198, 2000.

    Google Scholar 

  12. M. Ley. DBLP Bibliography. http://www.informatik.uni-trier.de:8000/~ley/db/.

  13. J. McHugh, S. Abiteboul, R. Goldman, D. Quass, and J. Widom. Lore: A Database Management System for Semistructured Data. ACM SIGMOD Record, 3(26), 1997.

    Google Scholar 

  14. J. Shanmugasundaram, K. Tufte, G. He, C. Zhang, D. DeWitt, and J. Naughton. Relational Databases for Querying XML Documents: Limitations and Opportunities. In Proc. of the Int’l. Conf. on Very Large Data Bases, pages 302–314, Edinburgh, UK, 1999.

    Google Scholar 

  15. T. Shimura, M. Yoshikawa, and S. Uemura. Storage and Retrieval of XML Documents Using Object-Relational Databases. In Database and Expert Systems Applications, pages 206–217. Springer, 1999.

    Google Scholar 

  16. Software AG. Tamino-Technical Description. Available at http://www.softwareag.com/tamino/technical/description.htm.

  17. R. van Zwol, P. Apers, and A. Wilschutz. Implementing semi-structured data with MOA. In Workshop on Query Processing for Semistructured data and Non-Standard Data Formats (in conjunction with ICDT), 1999.

    Google Scholar 

  18. W3C. Document Object Model (DOM). Available at http://www.w3.org/DOM/.

  19. W3C. Extensible Markup Language (XML) 1.0. Available at http://www.w3.org/TR/1998/REC-xml-19980210.

Download references

Author information

Authors and Affiliations

Authors

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2001 Springer-Verlag Berlin Heidelberg

About this paper

Cite this paper

Schmidt, A., Kersten, M., Windhouwer, M., Waas, F. (2001). Efficient Relational Storage and Retrieval of XML Documents. In: Goos, G., Hartmanis, J., van Leeuwen, J., Suciu, D., Vossen, G. (eds) The World Wide Web and Databases. WebDB 2000. Lecture Notes in Computer Science, vol 1997. Springer, Berlin, Heidelberg. https://doi.org/10.1007/3-540-45271-0_9

Download citation

  • DOI: https://doi.org/10.1007/3-540-45271-0_9

  • Published:

  • Publisher Name: Springer, Berlin, Heidelberg

  • Print ISBN: 978-3-540-41826-9

  • Online ISBN: 978-3-540-45271-3

  • eBook Packages: Springer Book Archive

Publish with us

Policies and ethics