A Multi-driven Approach to Improve Data Analytics for Multi-value Dimensions
The Data Warehouse is a data storage medium with the purpose to produce accurate and useful information to support business stakeholders to conduct data analysis that helps with performing decision making processes and improving information resources. The data warehouse provides a single and detailed view of the organization, and it is intended to be exploited by means of OLAP (On-line Analytical Processing) tools. These tools facilitate information analysis and navigation through the business data based on the multidimensional paradigm. A crucial decision for designing multidimensional models concerns the grain of facts, determined by fact–dimension relationships. This means, that the accuracy of the information can depend on how the data model is structured to support multi-value dimensions and avoid double-counting’s. The paper presents a technique used to overcome these constraints enabling designers to abstract complexity at a conceptual level without taking into account of more complex schema structures (like bridge table) to deal with non-strict fact–dimension relationships at different granularities. The technique is demonstrated using the Pentaho tool and lessons learned from our case study, an information system to monitor the execution of public works contracts.
KeywordsMultidimensional Schema Design Requirements Analysis Multi- Value Dimensions
Unable to display preview. Download preview PDF.
- 1.Kimball, R., et al.: The Data Warehouse Lifecycle Toolkit, 2nd edn. Wiley (2008)Google Scholar
- 3.Adamson, C.: Star Schema: The Complete Reference. McGraw-Hill (2010)Google Scholar
- 5.Song, I.Y., Rowen, W., Medsker, C., Ewen, E.F.: An analysis of many-to-many relationships between fact and dimension tables in dimensional modeling. In: Proc. of DMDW (2001)Google Scholar
- 6.Guo, Y., Tang, S., Tong, Y., Yang, D.: Triple-Driven Data Modeling Methodology in Data Warehousing: A Case Study. In: Proc. of DOLAP (2006)Google Scholar
- 7.Dori, D., Feldman, R., Sturm, A.: Transforming an operational system model to a data warehouse model: a survey of techniques. In: Int. Conf. on Software- Science, Technology and Engineering, pp. 47–56. IEEE Computer Society (2005)Google Scholar
- 8.Thenmozhi, M., Vivekanandan, K.: A Tool for Data Warehouse Multidimensional Schema Design using Ontology. Int. Journal of Computer Science Issues 10(2(3)) (March 2013)Google Scholar
- 9.Mazón, J., Trujillo, J.: A hybrid model driven development framework for the multidimensional mod-eling of data warehouses. Proc. of SIGMOD Record 38(2) (2009)Google Scholar
- 12.Talwar, K., Gosain, A.: Hierarchy classification for Data Warehouse: A Survey. In: Proc. of ICCCS (2012)Google Scholar
- 14.Song, Y., et al.: An Analysis of Many-to-Many Relationships Between Fact and Dimension Tables in Dimensional Modeling. In: Proc. of DMDW (2001)Google Scholar
- 16.Pentaho, “Mondrian Schema Documentation”, online documentation available at the Pentaho website: http://mondrian.pentaho.com/documentation/schema.php;
- 17.Zhijuan, W., Hongchang, W.: A Data Warehouse Design Method. In: International Conference on Computer Science and Service System (2012)Google Scholar