Abstract
XML is just becoming a standard for document processing and interchange in Internet. Relatively less attention has been paid to its direct usage in data warehousing. We focus on the following questions. How can XML influence a DW design? How can dimensionality be mapped into XML data? We suppose collections of XML data described by Document Type Definitions (DTDs). This data has been generated by applications and plays a role of OLTP database(s). A star schema, a well-known technique used in data warehousing, can be applied. Then dimension information is supposed to be contained in XML data. We will use the notions of subDTD and view, and formulate referential integrity constraints in XML environment. We use simple pattern matching capabilities of current XML query languages for XML view specification and tree embedding algorithms for these purposes. Then a dimension hierarchy is defined as a set of logically connected collections of XML data. Due to the structural complexity of XML data the approach requires subtler formal model than it is done with conventional dimension and fact tables in classical star schemes.
This is a preview of subscription content, log in via an institution.
Buying options
Tax calculation will be finalised at checkout
Purchases are for personal use only
Learn about institutional subscriptionsPreview
Unable to display preview. Download preview PDF.
References
Anutariya, C., Wuwongse, V., Nantajeewarawat, E., and Akama, K., Towards a Foundation for XML Documents Databases, in: Proc. of the 1st Int. Conf. on Electrical Commerce and Web technologies (EC-Web 2000), London UK, LCNS 1875, Springer Verlag, 2000, pp. 324–333.
Apparao, V. et al, Document object model (DOM) level 1 specification, October 1988; http://www.w3.org/TR/REC-DOM Level-1.
Abiteboul, S., Vianu, V., Segoufin, L.: Representing and Querying XML with Incomplete Information, in: Proc. of the ACM SIGACT-SIGMOD-SIGART Symp. on Principles of Database Systems, 2001, pp. 150–161.
Bourret, R., XML and Databases; http://www.rpbourret.com/xml/XMLAndDatabases.htm.
Florescu, D. Deutsch, A. Levy, A. Fernandez, M. and Suciu D., A Query Language for XML, in: Proc. of Eighth Int. WWW Conference, 1999, pp. 77–91.
Kilpeläinen, P., Tree Matching Problems with Applications to Structured Text Databases, Rep. A–1992–6, University of Helsinki, Finland, 1992.
Kimball R., The Data Warehouse Toolkit: Practical Techniques for Building Dimensional Data Warehouses, John Wiley, 1996.
Kimball R., 1997, A dimensional manifesto, DBMS, August.
Papakonstantinou, Y., Vianu, V., DTD Inference for Views of XML Data, in: Proc. ACM SIGACT-SIGMOD-SIGART Symp. on Principles of Database Systems, 2000, pp. 35–46.
Pokorny, J., Data Warehouses: a Modelling Perspective, in: Evolution and Challenges in System Development, edited by W.G.Wojtkowski, S. Wrycza, and J. Zupancic, Kluwer Academic/Plenum Press Publ., 1999a, pp.59–71.
Pokorny J., To the Stars through Dimensions and Facts, in: Proc. of the 3rd International Conference on Business Information Systems (BIS’99), Springer Verlag, London, 1999b, pp. 135–147.
Pokorny, J., Dealing with Dimensions in Data Warehousing, in: Knowledge Discovery for Business Information Systems, edited by W. Abramowicz and Jozef Zurada, Kluwer Academic Publishers, Boston, 2000a, pp. 307–324.
Pokorny, J., XML Functionally, in: Proc. of IDEAS2000, edited by B.C.Desai, Y. Kioki, and M. Toyama, IEEE Comp. Society, 2000b, pp. 266–274.
Schlieder, T., Naumann, F., Approximate Tree Embedding for Querying XML Data, ACM SIGIR Workshop on XML and IR, Athens, 2000.
W3C Extensible Markup Language (XML) 1.0. 1998; http://www.w3.org.
Author information
Authors and Affiliations
Editor information
Rights and permissions
Copyright information
© 2002 Springer Science+Business Media New York
About this chapter
Cite this chapter
Pokorný, J. (2002). XML in the Stars: A New Approach to Data Warehouses. In: Harindranath, G., et al. New Perspectives on Information Systems Development. Springer, Boston, MA. https://doi.org/10.1007/978-1-4615-0595-2_40
Download citation
DOI: https://doi.org/10.1007/978-1-4615-0595-2_40
Publisher Name: Springer, Boston, MA
Print ISBN: 978-1-4613-5149-8
Online ISBN: 978-1-4615-0595-2
eBook Packages: Springer Book Archive