Abstract
Over the past five years, we have designed, implemented, and optimized our prototype system XTC, a multi-layered native XML database management system (XDBMS) providing multi-lingual query interfaces (XQuery, XPath, DOM). Particularly in the higher system layers, we compared competing concepts and iteratively arrived at solutions that drastically improved overall XDBMS performance. XML query processing depends critically on the smooth interplay of concepts and methods on all architectural layers: node labeling and mapping options for storage structures; availability of suitable index mechanisms; provision of a spectrum of path processing operators; query language compilation and optimization. Furthermore, effective and efficient locking protocols must be present to guarantee the ACID properties for XML processing and to achieve high transaction throughput.
In this survey, we outline the experience gained during the implementation and optimization of XTC. We identify the “key drivers” that maximize throughput while keeping response times at an acceptable level. Because we have implemented all options and alternatives in XTC, dedicated benchmark runs allow comparisons in identical environments and illustrate the benefit of each implementation decision.
© 2010 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Härder, T., Mathis, C., Bächle, S., Schmidt, K., Weiner, A.M. (2010). Essential Performance Drivers in Native XML DBMSs. In: van Leeuwen, J., Muscholl, A., Peleg, D., Pokorný, J., Rumpe, B. (eds) SOFSEM 2010: Theory and Practice of Computer Science. SOFSEM 2010. Lecture Notes in Computer Science, vol 5901. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-11266-9_3
DOI: https://doi.org/10.1007/978-3-642-11266-9_3
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-642-11265-2
Online ISBN: 978-3-642-11266-9
eBook Packages: Computer Science, Computer Science (R0)