Abstract
Business Intelligence systems rely on an integrated, consistent, and certified information repository called the Data Warehouse (DW) that is periodically fed with operational data. In the decision-making process, the analyzed data are usually stored in the DW in the form of multidimensional cubes. These cubes are queried interactively by the decision makers, according to the online analytical processing paradigm. In larger companies with multiple subsidiaries, the frequent expression of new business needs requires the creation of new data cubes which generate a large number of cubes to be manipulated. The inevitable complexity and heterogeneity of data cubes make it difficult to design data cubes. The decision maker can precisely express his needs through a query in natural language which consists of a set of analysis indicators (measures, dimensions) separated by the AND operator. However, the decision maker’s need may be incomplete. Indeed, he usually has a cube that represents part of his needs and he may want to complete it or enrich it with other cubes that are unknown to him. To deal with these situations, we propose in this paper an approach that addresses the problem of designing and constructing data cubes where the expressed need is scattered over more than one cube. Our goal is to enable decision makers to analyze all of their needs using just one cube. Our approach consists of two variants: a variant that is based on analysis indicators, and another based on the known cube. We present the validation of our approach by means of a tool, called “Design-Cubes-Query” that implements our approach and we show its use through a case study.
Similar content being viewed by others
References
Abelló, A., Samos, J., Saltor, F.: On relationships offering new drill-across possibilities. In: Proceedings of the 5th ACM International Workshop on Data Warehousing and OLAP, pp. 7–13. ACM (2002)
Alberto, D.: Fusion cubes: towards self-service business intelligence. Int. J. Data Warehous. Min. 9(2), 66–88 (2013)
Bimonte, S., Sautot, L., Journaux, L., Faivre, B.: Multidimensional model design using data mining: a rapid prototyping methodology. Int. J. Data Warehous. Min. 13(1), 1–35 (2017)
Boukraâ, D., Boussaïd, O., Bentayeb, F.: OLAP operators for complex object data cubes. In: ADBIS, pp. 103–116. Springer (2010)
Cheung, D.W., Zhou, B., Kao, B., Lu, H., Lam, T.W., Ting, H.F.: Requirement-based data cube schema design. In: Proceedings of the Eighth International Conference on Information and Knowledge Management, pp. 162–169. ACM (1999)
Chhabra, R., Pahwa, P.: Data mart designing and integration approaches. Int. J. Comput. Sci. Mob. Comput. 3(4), 74–79 (2014)
Cohen, W., Ravikumar, P., Fienberg, S.: A comparison of string metrics for matching names and records. In: KDD Workshop on Data Cleaning and Object Consolidation, vol. 3, pp. 73–78 (2003)
Djiroun, R., Bimonte, S., Boukhalfa, K.: A first framework for top-k cubes queries. In: International Conference on Conceptual Modeling, pp. 187–197. Springer (2015)
Etcheverry, L., Vaisman, A., Zimányi, E.: Modeling and querying data warehouses on the semantic web using QB4OLAP. In: International Conference on Data Warehousing and Knowledge Discovery, pp. 45–56. Springer (2014)
Gardner, S.R.: Building the data warehouse: the tough questions project managers have to ask their companies’ executives–and themselves–and the guidelines needed to sort out the answers. Commun. ACM 41(9), 52–61 (1998)
Ghrab, A., Romero, O., Skhiri, S., Vaisman, A., Zimányi, E.: A framework for building OLAP cubes on graphs. In: East European Conference on Advances in Databases and Information Systems, pp. 92–105. Springer (2015)
Golfarelli, M., Rizzi, S.: A methodological framework for data warehouse design. In: Proceedings of the 1st ACM International Workshop on Data Warehousing and OLAP, pp. 3–9. ACM (1998)
Golfarelli, M., Rizzi, S., Biondi, P.: myOLAP: an approach to express and evaluate OLAP preferences. IEEE Trans. Knowl. Data Eng. 23(7), 1050–1064 (2011)
Gomaa, W.H., Fahmy, A.A.: A survey of text similarity approaches. Int. J. Comput. Appl. (2013). https://doi.org/10.5120/11638-7118
Hung, E., Cheung, D.W., Kao, B.: Optimization in data cube system design. J. Intell. Inf. Syst. 23(1), 17–45 (2004)
Hüsemann, B., Lechtenbörger, J., Vossen, G.: Conceptual Data Warehouse Design. Universität Münster, Angewandte Mathematik und Informatik (2000)
Islam, A., Inkpen, D.: Semantic text similarity using corpus-based word similarity and string similarity. ACM Trans. Knowl. Discov. Data 2(2), 10 (2008)
Jindal, R., Taneja, S.: Comparative study of data warehouse design approaches: a survey. Int. J. Database Manag. Syst. 4(1), 33 (2012)
Kimball, R., Ross, M.: The Data Warehouse Toolkit: The Complete Guide to Dimensional Modeling. Wiley, New York (2011)
Kimball, R., Ross, M.: The Data Warehouse Toolkit: The Definitive Guide to Dimensional Modeling. Wiley, New York (2013)
Masuma, M.R., Losarwar, V.: Text classification and clustering through similarity measures. IJLTEMAS 5(3), 91–94 (2016)
Nedelcu, B.: Business intelligence systems. Database Syst. J. 4(4), 12–20 (2013)
Niemi, T., Nummenmaa, J., Thanisch, P.: Constructing OLAP cubes based on queries. In: Proceedings of the 4th ACM International Workshop on Data Warehousing and OLAP, pp. 9–15. ACM (2001)
Parimala, N., Pahwa, P.: Coalescing data marts. In: Proceedings of XVI International Conference on Computer and Information Science and Engineering, pp. 280–285 (2006)
Djiroun, R., Boukhalfa, K., Alimazighi, Z., et al.: A data cube design and construction methodology based on OLAP queries. In: 13th IEEE/ACS International Conference of Computer Systems and Applications, AICCSA 2016, Agadir, Morocco, pp. 1–8 (2016)
Riazati, D., Thom, J.A., Zhang, X.: Drill across & visualization of cubes with non-conformed dimensions. In: Proceedings of the Nineteenth Conference on Australasian Database, vol. 75, pp. 97–105. Australian Computer Society, Inc. (2008)
Rizzi, S., Abelló, A., Lechtenbörger, J., Trujillo, J.: Research in data warehouse modeling and design: dead or alive? In: Proceedings of the 9th ACM International Workshop on Data Warehousing and OLAP, pp. 3–10. ACM (2006)
Sabaini, A., Zimányi, E., Combi, C.: Extending the multidimensional model for linking cubes. In: EDA, pp. 17–32 (2015)
Bimonte, S., Schneider, M.: Merging spatial data cubes using the GIS overlay operator. J. Decis. Syst. 19(3), 261–290 (2010)
Torlone, R.: Two approaches to the integration of heterogeneous data warehouses. Distrib. Parallel Databases 23(1), 69–97 (2008)
Author information
Authors and Affiliations
Corresponding author
Rights and permissions
About this article
Cite this article
Djiroun, R., Boukhalfa, K. & Alimazighi, Z. Designing data cubes in OLAP systems: a decision makers’ requirements-based approach. Cluster Comput 22, 783–803 (2019). https://doi.org/10.1007/s10586-018-2883-7
Received:
Revised:
Accepted:
Published:
Issue Date:
DOI: https://doi.org/10.1007/s10586-018-2883-7