Skip to main content

XML in the Stars: A New Approach to Data Warehouses

  • Chapter
  • 157 Accesses

Abstract

XML is just becoming a standard for document processing and interchange in Internet. Relatively less attention has been paid to its direct usage in data warehousing. We focus on the following questions. How can XML influence a DW design? How can dimensionality be mapped into XML data? We suppose collections of XML data described by Document Type Definitions (DTDs). This data has been generated by applications and plays a role of OLTP database(s). A star schema, a well-known technique used in data warehousing, can be applied. Then dimension information is supposed to be contained in XML data. We will use the notions of subDTD and view, and formulate referential integrity constraints in XML environment. We use simple pattern matching capabilities of current XML query languages for XML view specification and tree embedding algorithms for these purposes. Then a dimension hierarchy is defined as a set of logically connected collections of XML data. Due to the structural complexity of XML data the approach requires subtler formal model than it is done with conventional dimension and fact tables in classical star schemes.

This is a preview of subscription content, log in via an institution.

Buying options

Chapter
USD   29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD   229.00
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD   299.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info
Hardcover Book
USD   299.99
Price excludes VAT (USA)
  • Durable hardcover edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Learn about institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.

References

  • Anutariya, C., Wuwongse, V., Nantajeewarawat, E., and Akama, K., Towards a Foundation for XML Documents Databases, in: Proc. of the 1st Int. Conf. on Electrical Commerce and Web technologies (EC-Web 2000), London UK, LCNS 1875, Springer Verlag, 2000, pp. 324–333.

    Chapter  Google Scholar 

  • Apparao, V. et al, Document object model (DOM) level 1 specification, October 1988; http://www.w3.org/TR/REC-DOM Level-1.

    Google Scholar 

  • Abiteboul, S., Vianu, V., Segoufin, L.: Representing and Querying XML with Incomplete Information, in: Proc. of the ACM SIGACT-SIGMOD-SIGART Symp. on Principles of Database Systems, 2001, pp. 150–161.

    Google Scholar 

  • Bourret, R., XML and Databases; http://www.rpbourret.com/xml/XMLAndDatabases.htm.

    Google Scholar 

  • Florescu, D. Deutsch, A. Levy, A. Fernandez, M. and Suciu D., A Query Language for XML, in: Proc. of Eighth Int. WWW Conference, 1999, pp. 77–91.

    Google Scholar 

  • Kilpeläinen, P., Tree Matching Problems with Applications to Structured Text Databases, Rep. A–1992–6, University of Helsinki, Finland, 1992.

    Google Scholar 

  • Kimball R., The Data Warehouse Toolkit: Practical Techniques for Building Dimensional Data Warehouses, John Wiley, 1996.

    Google Scholar 

  • Kimball R., 1997, A dimensional manifesto, DBMS, August.

    Google Scholar 

  • Papakonstantinou, Y., Vianu, V., DTD Inference for Views of XML Data, in: Proc. ACM SIGACT-SIGMOD-SIGART Symp. on Principles of Database Systems, 2000, pp. 35–46.

    Google Scholar 

  • Pokorny, J., Data Warehouses: a Modelling Perspective, in: Evolution and Challenges in System Development, edited by W.G.Wojtkowski, S. Wrycza, and J. Zupancic, Kluwer Academic/Plenum Press Publ., 1999a, pp.59–71.

    Chapter  Google Scholar 

  • Pokorny J., To the Stars through Dimensions and Facts, in: Proc. of the 3rd International Conference on Business Information Systems (BIS’99), Springer Verlag, London, 1999b, pp. 135–147.

    Google Scholar 

  • Pokorny, J., Dealing with Dimensions in Data Warehousing, in: Knowledge Discovery for Business Information Systems, edited by W. Abramowicz and Jozef Zurada, Kluwer Academic Publishers, Boston, 2000a, pp. 307–324.

    Google Scholar 

  • Pokorny, J., XML Functionally, in: Proc. of IDEAS2000, edited by B.C.Desai, Y. Kioki, and M. Toyama, IEEE Comp. Society, 2000b, pp. 266–274.

    Google Scholar 

  • Schlieder, T., Naumann, F., Approximate Tree Embedding for Querying XML Data, ACM SIGIR Workshop on XML and IR, Athens, 2000.

    Google Scholar 

  • W3C Extensible Markup Language (XML) 1.0. 1998; http://www.w3.org.

    Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Editor information

G. Harindranath W. Gregory Wojtkowski Jože Zupančič Duska Rosenberg Wita Wojtkowski Stanislaw Wrycza John A. A. Sillince

Rights and permissions

Reprints and permissions

Copyright information

© 2002 Springer Science+Business Media New York

About this chapter

Cite this chapter

Pokorný, J. (2002). XML in the Stars: A New Approach to Data Warehouses. In: Harindranath, G., et al. New Perspectives on Information Systems Development. Springer, Boston, MA. https://doi.org/10.1007/978-1-4615-0595-2_40

Download citation

  • DOI: https://doi.org/10.1007/978-1-4615-0595-2_40

  • Publisher Name: Springer, Boston, MA

  • Print ISBN: 978-1-4613-5149-8

  • Online ISBN: 978-1-4615-0595-2

  • eBook Packages: Springer Book Archive

Publish with us

Policies and ethics