Skip to main content

Essential Performance Drivers in Native XML DBMSs

  • Conference paper
SOFSEM 2010: Theory and Practice of Computer Science (SOFSEM 2010)

Abstract

As a multi-layered XML database management system, we have designed, implemented, and optimized over the recent five years our prototype system XTC, a native XDBMS providing multi-lingual query interfaces (XQuery, XPath, DOM). In particular in higher system layers, we have compared competing concepts and iteratively found salient solutions which drastically improved the overall XDBMS performance. XML query processing is critically affected by the smooth interplay of concepts and methods on all architectural layers: node labeling and mapping options for storage structures; availability of suitable index mechanisms; provision of a spectrum of path processing operators; query language compilation and optimization. Furthermore, effective and efficient locking protocols must be present to guarantee the ACID properties for XML processing and to achieve high transaction throughput.

In this survey, we outline our experiences gained during the implementation and optimization of XTC. We figure out the “key drivers” to maximize throughput while keeping the response times at an acceptable level. Because we have implemented all options and alternatives in XTC, dedicated benchmark runs allow for comparisons in identical environments and illustrate the benefit of all implementation decisions.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 84.99
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 109.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.

References

  1. Aguiar Moraes Filho, J., Härder, T.: EXsum – An XML Summarization Framework. In: Proc. IDEAS, pp. 139–148 (2008)

    Google Scholar 

  2. Bächle, S., Härder, T.: Tailor-Made Lock Protocols and Their DBMS Integration. In: Proc. EDBT 2008 Workshop on Software Engineering for Tailor-made Data Management, pp. 18–23 (2008)

    Google Scholar 

  3. Bächle, S., Härder, T.: The Real Performance Drivers Behind XML Lock Protocols. In: Bhowmick, S.S., Küng, J., Wagner, R. (eds.) DEXA 2009. LNCS, vol. 5690, pp. 38–52. Springer, Heidelberg (2009)

    Google Scholar 

  4. Bruno, N., Koudas, N., Srivastava, D.: Holistic Twig Joins: Optimal XML Pattern Matching. In: Proc. SIGMOD, pp. 310–321 (2002)

    Google Scholar 

  5. Chen, T., Lu, J., Ling, T.W.: On Boosting Holism in XML Twig Pattern Matching Using Structural Indexing Techniques. In: Proc. SIGMOD, pp. 455–466 (2005)

    Google Scholar 

  6. Chen, S., Li, H.-G., Tatemura, J., Hsiung, W.-P., Agrawal, D., Candan, K.S.: Twig2Stack: Bottom-Up Processing of Generalized-Tree-Pattern Queries over XML Documents. In: Proc. VLDB, pp. 283–294 (2006)

    Google Scholar 

  7. Christophides, W., Plexousakis, D., Scholl, M., Tourtounis, S.: On Labeling Schemes for the Semantic Web. In: Proc.12th Int. WWW Conf., pp. 544–555 (2003)

    Google Scholar 

  8. Document Object Model (DOM) Level 2 / Level 3 Core Specification. W3C Recommendation (2004), http://www.w3.org/TR/DOM-Level-3-Core

  9. Fontoura, M., Josifovski, V., Shekita, E.J., Yang, B.: Optimizing Cursor Movement in Holistic Twig Joins. In: Proc. CIKM, pp. 784–791 (2005)

    Google Scholar 

  10. Graefe, G.: Hierarchical Locking in B-Tree Indexes. In: Proc. German National Database Conf. (BTW). LNI, vol. 65, pp. 18–42. Springer, Heidelberg (2007)

    Google Scholar 

  11. Gray, J., Reuter, A.: Transaction Processing: Concepts and Techniques. Morgan Kaufmann, San Francisco (1993)

    MATH  Google Scholar 

  12. Haas, L., Freytag, J.-C., Lohman, G.M., Pirahesh, H.: Extensible Query Processing in Starburst. In: Proc. SIGMOD, pp. 377–388 (1989)

    Google Scholar 

  13. Härder, T.: XML Databases and Beyond – Plenty of Architectural Challenges Ahead. In: Eder, J., Haav, H.-M., Kalja, A., Penjam, J. (eds.) ADBIS 2005. LNCS, vol. 3631, pp. 1–16. Springer, Heidelberg (2005)

    Chapter  Google Scholar 

  14. Härder, T., Haustein, M.P., Mathis, C., Wagner, M.: Node Labeling Schemes for Dynamic XML Documents Reconsidered. Data & Knowl. Eng. 60(1), 126–149 (2007)

    Article  Google Scholar 

  15. Härder, T., Mathis, C., Schmidt, K.: Comparison of Complete and Elementless Native Storage of XML Documents. In: Proc. IDEAS, pp. 102–113 (2007)

    Google Scholar 

  16. Härder, T., Schmidt, K., Ou, Y., Bächle, S.: Towards Flash Disk Use in Databases – Keeping Performance While Saving Energy? In: Proc. German National Database Conf. (BTW). LNI, vol. 144, pp. 167–186. Springer, Heidelberg (2009)

    Google Scholar 

  17. Härder, T., Reuter, A.: Concepts for Implementing a Centralized Database Management System. In: Proc. Int. Computing Symposium on Application Systems Development, pp. 28–60. B.G. Teubner-Verlag (1983)

    Google Scholar 

  18. Haustein, M.P.: Fine-Granular Transaction Isolation in Native XML Database Management Systems (in German), Ph.D. Thesis, Univ. of Kaiserslautern, Verlag Dr. Hut, München (2006)

    Google Scholar 

  19. Haustein, M.P., Härder, T., Mathis, C., Wagner, M.: DeweyIDs – The Key to Fine-Grained Management of XML Documents. In: Proc. SBBD, pp. 85–99 (2005)

    Google Scholar 

  20. Haustein, M.P., Härder, T., Luttenberger, K.: Contest of XML Lock Protocols. In: Proc. VLDB, pp. 1069–1080 (2006)

    Google Scholar 

  21. Haustein, M.P., Härder, T.: Optimizing Lock Protocols for Native XML Processing. Data & Knowl. Eng. 65(1), 147–173 (2008)

    Google Scholar 

  22. Jagadish, H.V., Al-Khalifa, S., Chapman, A.: TIMBER: A Native XML Database. The VLDB Journal 11(4), 274–291 (2002)

    Article  MATH  Google Scholar 

  23. Jiang, H., Wang, W., Lu, H., Yu, J.X.: Holistic Twig Joins on Indexed XML Documents. In: Proc. VLDB, pp. 273–284 (2003)

    Google Scholar 

  24. Jiao, E., Ling, T.W., Chan, C.Y.: PathStack¬: A Holistic Path Join Algorithm for Path Query with Not-Predicates on XML Data. In: Zhou, L.-z., Ooi, B.-C., Meng, X. (eds.) DASFAA 2005. LNCS, vol. 3453, pp. 113–124. Springer, Heidelberg (2005)

    Google Scholar 

  25. Li, C., Ling, T.W., Hu, M.: Efficient Updates in Dynamic XML Data: from Binary String to Quaternary String. VLDB J. 17(3), 573–601 (2008)

    Article  Google Scholar 

  26. Loeser, H., Nicola, M., Fitzgerald, J.: Index Challenges in Native XML Database Systems. In: Proc. German National Database Conf. (BTW). LNI, vol. 144, pp. 508–523. Gesellschaft für Informatik (2009)

    Google Scholar 

  27. Lu, J., Chen, T., Ling, T.W.: Efficient Processing of XML Twig Patterns with Parent-Child Edges: a Look-Ahead Approach. In: Proc. CIKM, pp. 533–542 (2004)

    Google Scholar 

  28. Lu, J., Chen, T., Ling, T.W.: TJFast: Effective Processing of XML Twig Pattern Matching. In: Proc. WWW, pp. 1118–1119 (2005)

    Google Scholar 

  29. Mathis, C.: Storing, Indexing, and Querying XML Documents in Native XML Database Management Systems, Ph.D. Thesis, Univ. of Kaiserslautern, Verlag Dr. Hut, München (2009)

    Google Scholar 

  30. Mathis, C., Härder, T., Haustein, M.P.: Locking-Aware Structural Join Operators for XML Query Processing. In: Proc. SIGMOD, pp. 467–478 (2006)

    Google Scholar 

  31. Mathis, C., Härder, T., Schmidt, K., Bächle, S.: XML Indexing and Storage: Fulfilling the Wish List (submitted, 2009)

    Google Scholar 

  32. Miklau, G.: XML Data Repository, http://www.cs.washington.edu/research/xmldatasets

  33. Mohan, C.: ARIES/KVL: A Key-Value Locking Method for Concurrency Control of Multiaction Transactions Operating on B-Tree Indexes. In: Proc. VLDB, pp. 392–405 (1990)

    Google Scholar 

  34. O’Neil, P., O’Neil, E., Pal, S., Cseri, I., Schaller, G., Westbury, N.: OrdPaths: Insert-Friendly XML Node Labels. In: Proc. SIGMOD, pp. 903–908 (2004)

    Google Scholar 

  35. Özcan, F., Seemann, N., Wang, L.: XQuery Rewrite Optimization in IBM DB2 pureXML. Data Engineering Bulletin 31(4), 25–32 (2008)

    Google Scholar 

  36. Qin, L., Yu, J.X., Ding, B.: TwigList: Make Twig Pattern Matching Fast. In: Kotagiri, R., Radha Krishna, P., Mohania, M., Nantajeewarawat, E. (eds.) DASFAA 2007. LNCS, vol. 4443, pp. 850–862. Springer, Heidelberg (2007)

    Chapter  Google Scholar 

  37. Schmidt, A., Waas, F., Kersten, M.L., Carey, M.J., Manolescu, I., Busse, R.: XMark: A Benchmark for XML Data Management. In: Proc. VLDB, pp. 974–985 (2002)

    Google Scholar 

  38. Schmidt, K., Härder, T.: Usage-Driven Storage Structures for Native XML Databases. In: Proc. IDEAS, pp. 169–178 (2008)

    Google Scholar 

  39. Schöning, H.: Tamino–A DBMS Designed for XML. In: Proc. ICDE, pp. 149–154 (2001)

    Google Scholar 

  40. Siirtola, A., Valenta, M.: Verifying Parameterized taDOM+ Lock Managers. In: Geffert, V., Karhumäki, J., Bertoni, A., Preneel, B., Návrat, P., Bieliková, M. (eds.) SOFSEM 2008. LNCS, vol. 4910, pp. 460–472. Springer, Heidelberg (2008)

    Chapter  Google Scholar 

  41. Weiner, A.M., Härder, T.: Using Structural Joins and Holistic Twig Joins for Native XML Query Optimization. In: Grundspenkis, J., Morzy, T., Vossen, G. (eds.) ADBIS 2009. LNCS, vol. 5739, pp. 149–163. Springer, Heidelberg (2009)

    Google Scholar 

  42. Weiner, A.M., Härder, T.: A Framework for Cost-Based Query Optimization in Native XML Database Management Systems. In: Li, C., Ling, T.W. (eds.) Advanced Applications and Structures in XML Processing: Label Streams, Semantics Utilization, and Data Query Technologies. IGI Global (2010)

    Google Scholar 

  43. XML on Wall Street, Financial XML Projects, http://lighthouse-partners.com/xml

  44. Yu, J.X., Luo, D., Meng, X., Lu, H.: Dynamically Updating XML Data: Numbering Scheme Revisited. World Wide Web 8(1), 5–26 (2005)

    Article  Google Scholar 

  45. Yu, T., Ling, T.W., Lu, J.: TwigStackList¬: A Holistic Twig Join Algorithm for Twig Query with Not-Predicates on XML Data. In: Li Lee, M., Tan, K.-L., Wuwongse, V. (eds.) DASFAA 2006. LNCS, vol. 3882, pp. 249–263. Springer, Heidelberg (2006)

    Chapter  Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2010 Springer-Verlag Berlin Heidelberg

About this paper

Cite this paper

Härder, T., Mathis, C., Bächle, S., Schmidt, K., Weiner, A.M. (2010). Essential Performance Drivers in Native XML DBMSs. In: van Leeuwen, J., Muscholl, A., Peleg, D., Pokorný, J., Rumpe, B. (eds) SOFSEM 2010: Theory and Practice of Computer Science. SOFSEM 2010. Lecture Notes in Computer Science, vol 5901. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-11266-9_3

Download citation

  • DOI: https://doi.org/10.1007/978-3-642-11266-9_3

  • Publisher Name: Springer, Berlin, Heidelberg

  • Print ISBN: 978-3-642-11265-2

  • Online ISBN: 978-3-642-11266-9

  • eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics