Skip to main content
Log in

Designing data cubes in OLAP systems: a decision makers’ requirements-based approach

  • Published:
Cluster Computing Aims and scope Submit manuscript

Abstract

Business Intelligence systems rely on an integrated, consistent, and certified information repository called the Data Warehouse (DW) that is periodically fed with operational data. In the decision-making process, the analyzed data are usually stored in the DW in the form of multidimensional cubes. These cubes are queried interactively by the decision makers, according to the online analytical processing paradigm. In larger companies with multiple subsidiaries, the frequent expression of new business needs requires the creation of new data cubes which generate a large number of cubes to be manipulated. The inevitable complexity and heterogeneity of data cubes make it difficult to design data cubes. The decision maker can precisely express his needs through a query in natural language which consists of a set of analysis indicators (measures, dimensions) separated by the AND operator. However, the decision maker’s need may be incomplete. Indeed, he usually has a cube that represents part of his needs and he may want to complete it or enrich it with other cubes that are unknown to him. To deal with these situations, we propose in this paper an approach that addresses the problem of designing and constructing data cubes where the expressed need is scattered over more than one cube. Our goal is to enable decision makers to analyze all of their needs using just one cube. Our approach consists of two variants: a variant that is based on analysis indicators, and another based on the known cube. We present the validation of our approach by means of a tool, called “Design-Cubes-Query” that implements our approach and we show its use through a case study.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Fig. 1
Fig. 2
Fig. 3
Fig. 4
Fig. 5
Fig. 6
Fig. 7
Fig. 8
Fig. 9
Fig. 10
Fig. 11
Fig. 12
Fig. 13
Fig. 14

Similar content being viewed by others

Notes

  1. http://okfn.org/opendata/.

  2. http://opengovdata.org/.

  3. https://github.com/Simmetrics/simmetrics.

References

  1. Abelló, A., Samos, J., Saltor, F.: On relationships offering new drill-across possibilities. In: Proceedings of the 5th ACM International Workshop on Data Warehousing and OLAP, pp. 7–13. ACM (2002)

  2. Alberto, D.: Fusion cubes: towards self-service business intelligence. Int. J. Data Warehous. Min. 9(2), 66–88 (2013)

    Article  MathSciNet  Google Scholar 

  3. Bimonte, S., Sautot, L., Journaux, L., Faivre, B.: Multidimensional model design using data mining: a rapid prototyping methodology. Int. J. Data Warehous. Min. 13(1), 1–35 (2017)

    Article  Google Scholar 

  4. Boukraâ, D., Boussaïd, O., Bentayeb, F.: OLAP operators for complex object data cubes. In: ADBIS, pp. 103–116. Springer (2010)

  5. Cheung, D.W., Zhou, B., Kao, B., Lu, H., Lam, T.W., Ting, H.F.: Requirement-based data cube schema design. In: Proceedings of the Eighth International Conference on Information and Knowledge Management, pp. 162–169. ACM (1999)

  6. Chhabra, R., Pahwa, P.: Data mart designing and integration approaches. Int. J. Comput. Sci. Mob. Comput. 3(4), 74–79 (2014)

    Google Scholar 

  7. Cohen, W., Ravikumar, P., Fienberg, S.: A comparison of string metrics for matching names and records. In: KDD Workshop on Data Cleaning and Object Consolidation, vol. 3, pp. 73–78 (2003)

  8. Djiroun, R., Bimonte, S., Boukhalfa, K.: A first framework for top-k cubes queries. In: International Conference on Conceptual Modeling, pp. 187–197. Springer (2015)

  9. Etcheverry, L., Vaisman, A., Zimányi, E.: Modeling and querying data warehouses on the semantic web using QB4OLAP. In: International Conference on Data Warehousing and Knowledge Discovery, pp. 45–56. Springer (2014)

  10. Gardner, S.R.: Building the data warehouse: the tough questions project managers have to ask their companies’ executives–and themselves–and the guidelines needed to sort out the answers. Commun. ACM 41(9), 52–61 (1998)

    Article  Google Scholar 

  11. Ghrab, A., Romero, O., Skhiri, S., Vaisman, A., Zimányi, E.: A framework for building OLAP cubes on graphs. In: East European Conference on Advances in Databases and Information Systems, pp. 92–105. Springer (2015)

  12. Golfarelli, M., Rizzi, S.: A methodological framework for data warehouse design. In: Proceedings of the 1st ACM International Workshop on Data Warehousing and OLAP, pp. 3–9. ACM (1998)

  13. Golfarelli, M., Rizzi, S., Biondi, P.: myOLAP: an approach to express and evaluate OLAP preferences. IEEE Trans. Knowl. Data Eng. 23(7), 1050–1064 (2011)

    Article  Google Scholar 

  14. Gomaa, W.H., Fahmy, A.A.: A survey of text similarity approaches. Int. J. Comput. Appl. (2013). https://doi.org/10.5120/11638-7118

  15. Hung, E., Cheung, D.W., Kao, B.: Optimization in data cube system design. J. Intell. Inf. Syst. 23(1), 17–45 (2004)

    Article  MATH  Google Scholar 

  16. Hüsemann, B., Lechtenbörger, J., Vossen, G.: Conceptual Data Warehouse Design. Universität Münster, Angewandte Mathematik und Informatik (2000)

  17. Islam, A., Inkpen, D.: Semantic text similarity using corpus-based word similarity and string similarity. ACM Trans. Knowl. Discov. Data 2(2), 10 (2008)

    Article  Google Scholar 

  18. Jindal, R., Taneja, S.: Comparative study of data warehouse design approaches: a survey. Int. J. Database Manag. Syst. 4(1), 33 (2012)

    Article  Google Scholar 

  19. Kimball, R., Ross, M.: The Data Warehouse Toolkit: The Complete Guide to Dimensional Modeling. Wiley, New York (2011)

    Google Scholar 

  20. Kimball, R., Ross, M.: The Data Warehouse Toolkit: The Definitive Guide to Dimensional Modeling. Wiley, New York (2013)

    Google Scholar 

  21. Masuma, M.R., Losarwar, V.: Text classification and clustering through similarity measures. IJLTEMAS 5(3), 91–94 (2016)

    Google Scholar 

  22. Nedelcu, B.: Business intelligence systems. Database Syst. J. 4(4), 12–20 (2013)

    Google Scholar 

  23. Niemi, T., Nummenmaa, J., Thanisch, P.: Constructing OLAP cubes based on queries. In: Proceedings of the 4th ACM International Workshop on Data Warehousing and OLAP, pp. 9–15. ACM (2001)

  24. Parimala, N., Pahwa, P.: Coalescing data marts. In: Proceedings of XVI International Conference on Computer and Information Science and Engineering, pp. 280–285 (2006)

  25. Djiroun, R., Boukhalfa, K., Alimazighi, Z., et al.: A data cube design and construction methodology based on OLAP queries. In: 13th IEEE/ACS International Conference of Computer Systems and Applications, AICCSA 2016, Agadir, Morocco, pp. 1–8 (2016)

  26. Riazati, D., Thom, J.A., Zhang, X.: Drill across & visualization of cubes with non-conformed dimensions. In: Proceedings of the Nineteenth Conference on Australasian Database, vol. 75, pp. 97–105. Australian Computer Society, Inc. (2008)

  27. Rizzi, S., Abelló, A., Lechtenbörger, J., Trujillo, J.: Research in data warehouse modeling and design: dead or alive? In: Proceedings of the 9th ACM International Workshop on Data Warehousing and OLAP, pp. 3–10. ACM (2006)

  28. Sabaini, A., Zimányi, E., Combi, C.: Extending the multidimensional model for linking cubes. In: EDA, pp. 17–32 (2015)

  29. Bimonte, S., Schneider, M.: Merging spatial data cubes using the GIS overlay operator. J. Decis. Syst. 19(3), 261–290 (2010)

    Article  Google Scholar 

  30. Torlone, R.: Two approaches to the integration of heterogeneous data warehouses. Distrib. Parallel Databases 23(1), 69–97 (2008)

    Article  Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Kamel Boukhalfa.

Rights and permissions

Reprints and permissions

About this article

Check for updates. Verify currency and authenticity via CrossMark

Cite this article

Djiroun, R., Boukhalfa, K. & Alimazighi, Z. Designing data cubes in OLAP systems: a decision makers’ requirements-based approach. Cluster Comput 22, 783–803 (2019). https://doi.org/10.1007/s10586-018-2883-7

Download citation

  • Received:

  • Revised:

  • Accepted:

  • Published:

  • Issue Date:

  • DOI: https://doi.org/10.1007/s10586-018-2883-7

Keywords

Navigation