Skip to main content

Adaptive XML Shredding: Architecture, Implementation, and Challenges

  • Conference paper
  • First Online:
Efficiency and Effectiveness of XML Tools and Techniques and Data Integration over the Web (DIWeb 2002, EEXTT 2002)

Abstract

As XML data becomes central to business-critical applications, there is a growing need for efficient and reliable XML storage. Two main approaches have been proposed for storing XML data: native and colonial systems. Native systems (e.g., [19], [20]) are designed from the ground up specifically for XML and XML query languages. Colonial systems (e.g., [5],[7], [19]), on the other hand, attempt to reuse existing commercial database systems (DBMS) by mapping XML into the underlying model used by the DBMS. Colonial systems can thus leverage features, such as concurrency control, crash recovery, scalability, and highly optimized query processors available in the DMBS, making them an attractive alternative for managing XML data. However, several technical challenges need to be addressed in terms of architecture, algorithms, and implementation of these systems.In this paper, we described how these issues are addressed in the context of colonial systems that use relational databases as the underlying DBMS.

Current address: juliana@cse.ogi.edu.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 39.99
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 54.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.

References

  1. S. Agrawal, S. Chaudhuri, and V.R. Narasayya. Automated selection of materialized views and indexes in SQL databases. In Proc. of VLDB, 2000. 115

    Google Scholar 

  2. S. Agrawal, S. Chaudhuri, and V.R. Narasayya. Materialized view and index selection tool for microsoft sql server 2000. In Proc. of SIGMOD, 2001. 115

    Google Scholar 

  3. S. Boag, D.Chamberlin, M. Fernandez, D. Florescu, J. Robie, J. Siméon, and M. Stefanescu. XQuery 1.0: An XML query language. W3C Working Draft, June 2001. 106

    Google Scholar 

  4. P. Bohannon, J. Freire, J. Haritsa, M. Ramanath, P. Roy, and J. Siméon. Legodb: Customizing relational storage for xml documents. In Proc. of VLDB, 2002. 104, 115

    Google Scholar 

  5. P. Bohannon, J. Freire, P. Roy, and J. Siméon. From XML schema to relations: A cost-based approach to XML storage. In Proc. of ICDE, 2002. 104, 108, 109, 113, 115

    Google Scholar 

  6. M. J. Carey, J. Kiernan, J. Shanmugasundaram, E. J. Shekita, and S. N. Subramanian. XPERANTO: Middleware for publishing object-relational data as xml documents. In Proc. of VLDB, 2000. 104, 110

    Google Scholar 

  7. A. Deutsch, M. Fernandez, and D. Suciu. Storing semi-structured data with STORED. In Proc. of SIGMOD, 1999. 104

    Google Scholar 

  8. D. Draper, P. Fankhauser, M. Fernandez, A. Malhotra, K. Rose, M. Rys, J. Siméon, and P. Wadler. The XQuery 1.0 formal semantics, March 2002. W3C Working Draft. 109

    Google Scholar 

  9. M. Fernandez, W.C. Tan, and D. Suciu. Silkroute: trading between relations and XML. Computer Networks, 33(1-6):723–745, 2000. 104, 106, 108, 110

    Article  Google Scholar 

  10. D. Florescu and D. Kossmann. A performance evaluation of alternative mapping schemes for storing XML in a relational database. Technical Report 3680, INRIA, 1999. 104

    Google Scholar 

  11. J. Freire, J. Haritsa, M. Ramanath, P. Roy, and J. Siméon. Statix: Making XML count. In Proc. of SIGMOD, 2002. 111

    Google Scholar 

  12. G. Graefe and W. J. McKenna. The volcano optimizer generator: Extensibility and efficient search. In Proc. of ICDE, 1993. 115

    Google Scholar 

  13. G. Piatetsky-Shapiro and C. Connell. Accurate estimation of the number of tuples satisfying a condition. In Proc. of SIGMOD, 1984. 112

    Google Scholar 

  14. Raghu Ramakrishnan and Johannes Gehrke. Database Management Systems. McGraw-Hill, 2000. 112

    Google Scholar 

  15. P. Roy, S. Seshadri, S. Sudarshan, and S. Bhobe. Efficient and extensible algorithms for multi query optimization. In Proc. of SIGMOD, 2000. 111, 115

    Google Scholar 

  16. J. Shanmugasundaram, K. Tufte, G. He, C. Zhang, D. DeWitt, and J. Naughton. Relational databases for querying XML documents: Limitations and opportunities. In Proc. of VLDB, 1999. 104, 106, 113

    Google Scholar 

  17. Tamino. http://www.softwareag.com/tamino 104

  18. G. Valentin, M. Zuliani, D.C. Zilio, G.M. Lohman, and A. Skelley. Db2 advisor: An optimizer smart enough to recommend its own indexes. In Proc. of ICDE, 2000. 115

    Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2003 Springer-Verlag Berlin Heidelberg

About this paper

Cite this paper

Freire, J., Siméon, J. (2003). Adaptive XML Shredding: Architecture, Implementation, and Challenges. In: Bressan, S., Lee, M.L., Chaudhri, A.B., Yu, J.X., Lacroix, Z. (eds) Efficiency and Effectiveness of XML Tools and Techniques and Data Integration over the Web. DIWeb EEXTT 2002 2002. Lecture Notes in Computer Science, vol 2590. Springer, Berlin, Heidelberg. https://doi.org/10.1007/3-540-36556-7_7

Download citation

  • DOI: https://doi.org/10.1007/3-540-36556-7_7

  • Published:

  • Publisher Name: Springer, Berlin, Heidelberg

  • Print ISBN: 978-3-540-00736-4

  • Online ISBN: 978-3-540-36556-3

  • eBook Packages: Springer Book Archive

Publish with us

Policies and ethics