Abstract
The Extensible Markup Language (XML) has attracted much attention as a data model for data exchange, data integration and rich data representation. A challenging question is how to manage native XML data in distributed databases. This leads to the problem of how to obtain a suitable distribution design for XML documents. In this paper we present a design approach for vertical fragmentation to minimise total query costs. Our approach is based on a cost model that takes the complex structure of queries on XML data into account. We show that system performance can be improved after vertical fragmentation using our approach, which is based on user access patterns.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Preview
Unable to display preview. Download preview PDF.
References
Abraham, J., Chaudhari, N.S., Prakash, E.C.: XML query algebra operators, and strategies for their implementation. In: IEEE Region 10 Conference, pp. 286–289. IEEE Computer Society Press, Los Alamitos (2004)
Andrade, A., Ruberg, G., Baião, F.A., Braganholo, V.P., Mattoso, M.: Efficiently processing xml queries over fragmented repositories with partix. In: Ioannidis, Y., Scholl, M.H., Schmidt, J.W., Matthes, F., Hatzopoulos, M., Boehm, K., Kemper, A., Grust, T., Boehm, C. (eds.) EDBT 2006. LNCS, vol. 3896, pp. 150–163. Springer, Heidelberg (2006)
Bellatreche, L., Simonet, A., Simonet, M.: Vertical fragmentation in distributed object database systems with complex attributes and methods. In: Thoma, H., Wagner, R.R. (eds.) DEXA 1996. LNCS, vol. 1134, pp. 15–21. Springer, Heidelberg (1996)
Bex, G.J., Neven, F., Van den Bussche, J.V.D.: Dtds versus xml schema: a practical study. In: WebDB, pp. 79–84. ACM Press, New York (2004)
Bremer, J.-M., Gertz, M.: On distributing xml repositories. In: WebDB, pp. 73–78 (2003)
Cornell, D., Yu, P.: A vertical partitioning algorithm for relational databases. In: ICDE, pp. 30–35 (1987)
Ezeife, C.I., Barker, K.: Vertical fragmentation for advanced object models in a distributed object based system. In: ICCI, pp. 613–632 (1995)
Frasincar, F., Houben, G.-J., Pau, C.: Xal: an algebra for xml query optimization. In: ADC, pp. 49–56 (2002)
Goldman, R., McHugh, J., Widom, J.: From semistructured data to xml: Migrating the lore data model and query language. In: WebDB
Hoffer, J.A., Severance, D.G.: The use of cluster analysis in physical database design. In: VLDB, pp. 69–86 (1975)
Karlapalem, K., Navathe, S.B., Morsi, M.M.A.: Issues in distribution design of object-oriented databases. In: IWDOM, pp. 148–164 (1992)
Ma, H.: Distribution design in object oriented databases. Master’s thesis, Massey University (2003)
Ma, H., Schewe, K.-D.: A heuristic approach to cost-efficient horizontal fragmentation of XML documents. In: Pastor, Ó., Falcão e Cunha, J. (eds.) CAiSE 2005. LNCS, vol. 3520, pp. 131–136. Springer, Heidelberg (2005)
Ma, H., Schewe, K.-D., Kirchberg, M.: A heuristic approach to vertical fragmentation incorporating query information. In: Baltic Conference on Databases and Information Systems, pp. 69–76 (2006)
Ma, H., Schewe, K.-D., Wang, Q.: A heuristic approach to cost-efficient fragmentation and allocation of complex value databases. In: ADC, pp. 119–128 (2006)
Ma, H., Schewe, K.-D., Wang, Q.: Distribution Design for Higher-Order Data Models. Data. and Knowledge Engineering 60, 400–434 (2007)
Miklau, G., Suciu, D.: Containment and equivalence for a fragment of XPath. J. ACM 51(1), 2–45 (2004)
Muthuraj, J., Chakravarthy, S., Varadarajan, R., Navathe, S.B.: A formal approach to the vertical partitioning problem in distributed database design. In: International Conference on Parallel and Distributed Information Systems, pp. 26–34 (1993)
Navathe, S., Karlapalem, K., Ra, M.: A mixed fragmentation methodology for initial distributed database design. Journal of Computer and Software Engineering 3 (1995)
Navathe, S.B., Ceri, S., Wiederhold, G., Dour, J.: Vertical Partitioning Algorithms for Database Design. ACM TODS 9, 680–710 (1984)
Navathe, S.B., Ra, M.: Vertical Partitioning for Database Design: A Graphical Algorithm. SIGMOD Record 14, 440–450 (1989)
Nicola, M., van der Linden, B.: Native xml support in db2 universal database. In: VLDB, pp. 1164–1174 (2005)
Özsu, M., Valduriez, P.: Principles of Distributed Database Systems (1999)
Schewe, K.-D.: On the unification of query algebras and their extension to rational tree structures. In: ADC, pp. 52–59 (2001)
Weigel, F., Schulz, K.U., Meuss, H.: Node identification schemes for efficient xml retrieval. In: Foundations of Semistructured Data (2005)
Author information
Authors and Affiliations
Editor information
Rights and permissions
Copyright information
© 2007 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Hartmann, S., Ma, H., Schewe, KD. (2007). Cost-Based Vertical Fragmentation for XML. In: Chang, K.CC., et al. Advances in Web and Network Technologies, and Information Management. APWeb WAIM 2007 2007. Lecture Notes in Computer Science, vol 4537. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-540-72909-9_2
Download citation
DOI: https://doi.org/10.1007/978-3-540-72909-9_2
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-72908-2
Online ISBN: 978-3-540-72909-9
eBook Packages: Computer ScienceComputer Science (R0)