Skip to main content

Cost-Based Vertical Fragmentation for XML

  • Conference paper
Advances in Web and Network Technologies, and Information Management (APWeb 2007, WAIM 2007)

Part of the book series: Lecture Notes in Computer Science ((LNISA,volume 4537))

Abstract

The Extensible Markup Language (XML) has attracted much attention as a data model for data exchange, data integration and rich data representation. A challenging question is how to manage native XML data in distributed databases. This leads to the problem of how to obtain a suitable distribution design for XML documents. In this paper we present a design approach for vertical fragmentation to minimise total query costs. Our approach is based on a cost model that takes the complex structure of queries on XML data into account. We show that system performance can be improved after vertical fragmentation using our approach, which is based on user access patterns.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 84.99
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 109.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.

References

  1. Abraham, J., Chaudhari, N.S., Prakash, E.C.: XML query algebra operators, and strategies for their implementation. In: IEEE Region 10 Conference, pp. 286–289. IEEE Computer Society Press, Los Alamitos (2004)

    Chapter  Google Scholar 

  2. Andrade, A., Ruberg, G., Baião, F.A., Braganholo, V.P., Mattoso, M.: Efficiently processing xml queries over fragmented repositories with partix. In: Ioannidis, Y., Scholl, M.H., Schmidt, J.W., Matthes, F., Hatzopoulos, M., Boehm, K., Kemper, A., Grust, T., Boehm, C. (eds.) EDBT 2006. LNCS, vol. 3896, pp. 150–163. Springer, Heidelberg (2006)

    Google Scholar 

  3. Bellatreche, L., Simonet, A., Simonet, M.: Vertical fragmentation in distributed object database systems with complex attributes and methods. In: Thoma, H., Wagner, R.R. (eds.) DEXA 1996. LNCS, vol. 1134, pp. 15–21. Springer, Heidelberg (1996)

    Google Scholar 

  4. Bex, G.J., Neven, F., Van den Bussche, J.V.D.: Dtds versus xml schema: a practical study. In: WebDB, pp. 79–84. ACM Press, New York (2004)

    Chapter  Google Scholar 

  5. Bremer, J.-M., Gertz, M.: On distributing xml repositories. In: WebDB, pp. 73–78 (2003)

    Google Scholar 

  6. Cornell, D., Yu, P.: A vertical partitioning algorithm for relational databases. In: ICDE, pp. 30–35 (1987)

    Google Scholar 

  7. Ezeife, C.I., Barker, K.: Vertical fragmentation for advanced object models in a distributed object based system. In: ICCI, pp. 613–632 (1995)

    Google Scholar 

  8. Frasincar, F., Houben, G.-J., Pau, C.: Xal: an algebra for xml query optimization. In: ADC, pp. 49–56 (2002)

    Google Scholar 

  9. Goldman, R., McHugh, J., Widom, J.: From semistructured data to xml: Migrating the lore data model and query language. In: WebDB

    Google Scholar 

  10. Hoffer, J.A., Severance, D.G.: The use of cluster analysis in physical database design. In: VLDB, pp. 69–86 (1975)

    Google Scholar 

  11. Karlapalem, K., Navathe, S.B., Morsi, M.M.A.: Issues in distribution design of object-oriented databases. In: IWDOM, pp. 148–164 (1992)

    Google Scholar 

  12. Ma, H.: Distribution design in object oriented databases. Master’s thesis, Massey University (2003)

    Google Scholar 

  13. Ma, H., Schewe, K.-D.: A heuristic approach to cost-efficient horizontal fragmentation of XML documents. In: Pastor, Ó., Falcão e Cunha, J. (eds.) CAiSE 2005. LNCS, vol. 3520, pp. 131–136. Springer, Heidelberg (2005)

    Google Scholar 

  14. Ma, H., Schewe, K.-D., Kirchberg, M.: A heuristic approach to vertical fragmentation incorporating query information. In: Baltic Conference on Databases and Information Systems, pp. 69–76 (2006)

    Google Scholar 

  15. Ma, H., Schewe, K.-D., Wang, Q.: A heuristic approach to cost-efficient fragmentation and allocation of complex value databases. In: ADC, pp. 119–128 (2006)

    Google Scholar 

  16. Ma, H., Schewe, K.-D., Wang, Q.: Distribution Design for Higher-Order Data Models. Data. and Knowledge Engineering 60, 400–434 (2007)

    Article  Google Scholar 

  17. Miklau, G., Suciu, D.: Containment and equivalence for a fragment of XPath. J. ACM 51(1), 2–45 (2004)

    Article  MathSciNet  Google Scholar 

  18. Muthuraj, J., Chakravarthy, S., Varadarajan, R., Navathe, S.B.: A formal approach to the vertical partitioning problem in distributed database design. In: International Conference on Parallel and Distributed Information Systems, pp. 26–34 (1993)

    Google Scholar 

  19. Navathe, S., Karlapalem, K., Ra, M.: A mixed fragmentation methodology for initial distributed database design. Journal of Computer and Software Engineering 3 (1995)

    Google Scholar 

  20. Navathe, S.B., Ceri, S., Wiederhold, G., Dour, J.: Vertical Partitioning Algorithms for Database Design. ACM TODS 9, 680–710 (1984)

    Article  Google Scholar 

  21. Navathe, S.B., Ra, M.: Vertical Partitioning for Database Design: A Graphical Algorithm. SIGMOD Record 14, 440–450 (1989)

    Article  Google Scholar 

  22. Nicola, M., van der Linden, B.: Native xml support in db2 universal database. In: VLDB, pp. 1164–1174 (2005)

    Google Scholar 

  23. Özsu, M., Valduriez, P.: Principles of Distributed Database Systems (1999)

    Google Scholar 

  24. Schewe, K.-D.: On the unification of query algebras and their extension to rational tree structures. In: ADC, pp. 52–59 (2001)

    Google Scholar 

  25. Weigel, F., Schulz, K.U., Meuss, H.: Node identification schemes for efficient xml retrieval. In: Foundations of Semistructured Data (2005)

    Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Editor information

Kevin Chen-Chuan Chang Wei Wang Lei Chen Clarence A. Ellis Ching-Hsien Hsu Ah Chung Tsoi Haixun Wang

Rights and permissions

Reprints and permissions

Copyright information

© 2007 Springer-Verlag Berlin Heidelberg

About this paper

Cite this paper

Hartmann, S., Ma, H., Schewe, KD. (2007). Cost-Based Vertical Fragmentation for XML. In: Chang, K.CC., et al. Advances in Web and Network Technologies, and Information Management. APWeb WAIM 2007 2007. Lecture Notes in Computer Science, vol 4537. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-540-72909-9_2

Download citation

  • DOI: https://doi.org/10.1007/978-3-540-72909-9_2

  • Publisher Name: Springer, Berlin, Heidelberg

  • Print ISBN: 978-3-540-72908-2

  • Online ISBN: 978-3-540-72909-9

  • eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics