Abstract
The Data Warehouse is a data storage medium with the purpose to produce accurate and useful information to support business stakeholders to conduct data analysis that helps with performing decision making processes and improving information resources. The data warehouse provides a single and detailed view of the organization, and it is intended to be exploited by means of OLAP (On-line Analytical Processing) tools. These tools facilitate information analysis and navigation through the business data based on the multidimensional paradigm. A crucial decision for designing multidimensional models concerns the grain of facts, determined by fact–dimension relationships. This means, that the accuracy of the information can depend on how the data model is structured to support multi-value dimensions and avoid double-counting’s. The paper presents a technique used to overcome these constraints enabling designers to abstract complexity at a conceptual level without taking into account of more complex schema structures (like bridge table) to deal with non-strict fact–dimension relationships at different granularities. The technique is demonstrated using the Pentaho tool and lessons learned from our case study, an information system to monitor the execution of public works contracts.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Preview
Unable to display preview. Download preview PDF.
References
Kimball, R., et al.: The Data Warehouse Lifecycle Toolkit, 2nd edn. Wiley (2008)
Mazón, J., Lechtenbörger, J., Trujillo, J.: A survey on summarizability issues in multidimensional modeling. Data & Knowledge Engineering 68, 1452–1469 (2009)
Adamson, C.: Star Schema: The Complete Reference. McGraw-Hill (2010)
Romero, O., Abelló, A.: A Survey of Multidimensional Modeling Methodologies. Int. Journal of Data Warehousing & Mining 5(2), 1–23 (2009)
Song, I.Y., Rowen, W., Medsker, C., Ewen, E.F.: An analysis of many-to-many relationships between fact and dimension tables in dimensional modeling. In: Proc. of DMDW (2001)
Guo, Y., Tang, S., Tong, Y., Yang, D.: Triple-Driven Data Modeling Methodology in Data Warehousing: A Case Study. In: Proc. of DOLAP (2006)
Dori, D., Feldman, R., Sturm, A.: Transforming an operational system model to a data warehouse model: a survey of techniques. In: Int. Conf. on Software- Science, Technology and Engineering, pp. 47–56. IEEE Computer Society (2005)
Thenmozhi, M., Vivekanandan, K.: A Tool for Data Warehouse Multidimensional Schema Design using Ontology. Int. Journal of Computer Science Issues 10(2(3)) (March 2013)
Mazón, J., Trujillo, J.: A hybrid model driven development framework for the multidimensional mod-eling of data warehouses. Proc. of SIGMOD Record 38(2) (2009)
Serrano, M., et al.: Metrics for data warehouse conceptual models understandability. Information and Software Technology 49, 851–870 (2007)
Romero, O., Simitsis, A., Abelló, A.: GEM: Requirement-driven generation of ETL and multidimensional conceptual designs. In: Cuzzocrea, A., Dayal, U. (eds.) DaWaK 2011. LNCS, vol. 6862, pp. 80–95. Springer, Heidelberg (2011)
Talwar, K., Gosain, A.: Hierarchy classification for Data Warehouse: A Survey. In: Proc. of ICCCS (2012)
Prat, N., Akoka, J., Comyn-Wattiau, I.: A UML-based data warehouse design method. Decision Support Systems 42, 1449–1473 (2006)
Song, Y., et al.: An Analysis of Many-to-Many Relationships Between Fact and Dimension Tables in Dimensional Modeling. In: Proc. of DMDW (2001)
Chee Tahir, A., Darton, R.C.: The Process Analysis Method of selecting indicators to quantify the sustainability performance of a business operation. Int. Journal of Cleaner Production 18, 1598–1607 (2010)
Pentaho, “Mondrian Schema Documentation”, online documentation available at the Pentaho website: http://mondrian.pentaho.com/documentation/schema.php;
Zhijuan, W., Hongchang, W.: A Data Warehouse Design Method. In: International Conference on Computer Science and Service System (2012)
Lechtenbörger, J., Vossen, G.: Multidimensional normal forms for data warehouse design. Inf. Syst. 28(5), 415–434 (2003)
Caniupán, M., Bravo, L., Hurtado, C.A.: Repairing inconsistent dimensions in data warehouses. Data & Knowledge Engineering 79-80, 17–39 (2012)
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2015 Springer International Publishing Switzerland
About this paper
Cite this paper
Pestana, G., Catelas, P., Rosa, I. (2015). A Multi-driven Approach to Improve Data Analytics for Multi-value Dimensions. In: Rocha, A., Correia, A., Costanzo, S., Reis, L. (eds) New Contributions in Information Systems and Technologies. Advances in Intelligent Systems and Computing, vol 354. Springer, Cham. https://doi.org/10.1007/978-3-319-16528-8_12
Download citation
DOI: https://doi.org/10.1007/978-3-319-16528-8_12
Publisher Name: Springer, Cham
Print ISBN: 978-3-319-16527-1
Online ISBN: 978-3-319-16528-8
eBook Packages: EngineeringEngineering (R0)