Abstract
Data warehouses integrate several operational sources to provide a multidimensional (MD) analysis of data. Developing a data warehouse therefore calls for an in-depth analysis of these data sources. Several approaches have been proposed to obtain MD structures from data sources in order to guide this development. However, these approaches assume that extensive documentation of the data sources is available, and they provide only informal guidelines to support the discovery of MD elements. As a result, this task can become very difficult for large and complex data sources (e.g. legacy systems). To overcome these problems, we treat the development of the data warehouse as a modernization scenario that addresses the analysis of the available data sources, discovering MD structures in order either to derive a data-driven conceptual MD model or to reconcile a requirement-driven conceptual MD model with the data sources. Specifically, we use concepts from Architecture Driven Modernization (ADM) to automatically perform the following tasks: (i) obtain a logical representation of the data sources, (ii) mark this logical representation with MD concepts, and (iii) derive a conceptual MD model from the marked model. Finally, we provide a case study based on a real-world project to exemplify the application of our approach.
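The three tasks named in the abstract can be pictured with a minimal sketch. It is only an illustration under stated assumptions, not the transformations defined in the paper: the `Table` class, the `mark` and `derive` functions, the example schema, and the marking heuristic (a table with at least two foreign keys and one non-key numeric column is treated as a fact) are all hypothetical, and the hand-written `SOURCE` list merely stands in for task (i), which in practice would be obtained by reverse engineering the data sources.

```python
from dataclasses import dataclass, field

NUMERIC = {"INTEGER", "DECIMAL", "FLOAT"}  # assumed set of numeric SQL types


@dataclass
class Table:
    """Hypothetical logical representation of one relational table."""
    name: str
    columns: dict                                      # column name -> SQL type
    primary_key: set = field(default_factory=set)      # key column names
    foreign_keys: dict = field(default_factory=dict)   # column -> referenced table

    def measure_candidates(self):
        # Numeric columns that are neither primary-key nor foreign-key columns.
        return [c for c, t in self.columns.items()
                if t in NUMERIC
                and c not in self.primary_key
                and c not in self.foreign_keys]


def mark(tables):
    """Task (ii): label each table FACT or DIMENSION (toy heuristic, an assumption)."""
    marks = {}
    for t in tables:
        is_fact = len(t.foreign_keys) >= 2 and bool(t.measure_candidates())
        marks[t.name] = "FACT" if is_fact else "DIMENSION"
    return marks


def derive(tables, marks):
    """Task (iii): build a simple conceptual MD model from the marked tables."""
    model = {}
    for t in tables:
        if marks[t.name] == "FACT":
            model[t.name] = {
                "measures": t.measure_candidates(),
                "dimensions": sorted(set(t.foreign_keys.values())),
            }
    return model


# Task (i) stand-in: a hand-written logical representation of a tiny source schema.
SOURCE = [
    Table("sales",
          {"id": "INTEGER", "product_id": "INTEGER",
           "store_id": "INTEGER", "amount": "DECIMAL"},
          primary_key={"id"},
          foreign_keys={"product_id": "product", "store_id": "store"}),
    Table("product", {"id": "INTEGER", "name": "VARCHAR"}, primary_key={"id"}),
    Table("store", {"id": "INTEGER", "city": "VARCHAR"}, primary_key={"id"}),
]

if __name__ == "__main__":
    marks = mark(SOURCE)
    print(marks)
    print(derive(SOURCE, marks))
```

Keeping the marking step separate from the derivation step mirrors the abstract's split between a marked logical model and the conceptual MD model derived from it, so a different marking heuristic could be swapped in without touching `derive`.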
Copyright information
© 2007 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Mazón, JN., Trujillo, J. (2007). A Model Driven Modernization Approach for Automatically Deriving Multidimensional Models in Data Warehouses. In: Parent, C., Schewe, KD., Storey, V.C., Thalheim, B. (eds) Conceptual Modeling - ER 2007. ER 2007. Lecture Notes in Computer Science, vol 4801. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-540-75563-0_6
DOI: https://doi.org/10.1007/978-3-540-75563-0_6
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-75562-3
Online ISBN: 978-3-540-75563-0
eBook Packages: Computer Science, Computer Science (R0)