Abstract
In Data Warehouse (DW) technology, On-line Analytical Processing (OLAP) is a class of applications that lets decision makers explore and navigate a multidimensional structure of precomputed measures, referred to as a data cube. However, OLAP is poorly equipped for forecasting and for predicting the empty measures of data cubes. Empty measures usually correspond to nonexistent facts in the DW and, in most cases, are a source of frustration for enterprise management, especially when strategic decisions must be taken. In recent years, various studies have tried to add prediction capabilities to OLAP applications, generally by using Data Mining and Machine Learning methods to predict new measure values in DWs. In this paper, we introduce a novel approach that extends OLAP with prediction. Our approach operates in two main stages. The first is a preprocessing stage that uses Principal Component Analysis (PCA) to reduce the dimensionality of the data cube and then generates ad hoc training sets. The second stage proposes a novel OLAP-oriented architecture of Multilayer Perceptron (MLP) networks that learns from each training set and outputs predicted measures for the nonexistent facts. Experiments demonstrate the effectiveness of our proposal and the performance of its predictive capabilities.
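The two stages described above can be illustrated with a minimal sketch. It is not the authors' architecture, which the abstract does not detail. It assumes a single 2-D slice of a cube held as a NumPy array with NaN marking empty cells, uses PCA via SVD for the reduction, and trains one small one-hidden-layer MLP per target column. The names `pca_reduce` and `train_mlp`, the toy data, and all hyperparameters are hypothetical.

```python
import numpy as np

def pca_reduce(X, n_components):
    """Stage 1 (assumed form): project rows of X onto the top principal components."""
    mean = X.mean(axis=0)
    Xc = X - mean
    # Rows of Vt are principal directions, sorted by decreasing variance.
    _, _, Vt = np.linalg.svd(Xc, full_matrices=False)
    components = Vt[:n_components]
    return Xc @ components.T, mean, components

def train_mlp(X, y, hidden=8, epochs=2000, lr=0.01, seed=0):
    """Stage 2 (assumed form): one-hidden-layer MLP regressor, full-batch gradient descent."""
    rng = np.random.default_rng(seed)
    W1 = rng.normal(0.0, 0.5, (X.shape[1], hidden))
    b1 = np.zeros(hidden)
    W2 = rng.normal(0.0, 0.5, (hidden, 1))
    b2 = np.zeros(1)
    losses = []
    for _ in range(epochs):
        H = np.tanh(X @ W1 + b1)              # hidden activations
        pred = (H @ W2 + b2).ravel()          # linear output unit
        err = pred - y
        losses.append(float(np.mean(err ** 2)))
        # Backpropagate the mean squared error.
        g_pred = (2.0 / len(y)) * err[:, None]
        g_W2 = H.T @ g_pred
        g_b2 = g_pred.sum(axis=0)
        g_H = (g_pred @ W2.T) * (1.0 - H ** 2)
        g_W1 = X.T @ g_H
        g_b1 = g_H.sum(axis=0)
        W1 -= lr * g_W1
        b1 -= lr * g_b1
        W2 -= lr * g_W2
        b2 -= lr * g_b2

    def predict(X_new):
        return (np.tanh(X_new @ W1 + b1) @ W2 + b2).ravel()

    return predict, losses

# Hypothetical cube slice: 30 rows (e.g. products) x 6 columns (e.g. months).
rng = np.random.default_rng(42)
base = rng.normal(size=(30, 1))
cube = base @ rng.normal(size=(1, 6)) + 0.05 * rng.normal(size=(30, 6))
cube[25:, 5] = np.nan                         # empty measures to predict

target_col = 5
known = ~np.isnan(cube[:, target_col])
features = np.delete(cube, target_col, axis=1)  # the remaining, fully populated columns

# Stage 1: reduce the features, then build the training set from the known cells.
Z, mean, comps = pca_reduce(features, n_components=2)
# Stage 2: learn from the training set and fill in the empty cells.
predict, losses = train_mlp(Z[known], cube[known, target_col])
filled = cube.copy()
filled[~known, target_col] = predict(Z[~known])
```

In this sketch the training set for a column is simply the set of rows whose measure is known, and the reduced PCA scores serve as network inputs. How the paper itself derives its training sets from the cube may differ.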
Copyright information
© 2012 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Abdelbaki, W., Ben Messaoud, R., Ben Yahia, S. (2012). A Neural-Based Approach for Extending OLAP to Prediction. In: Cuzzocrea, A., Dayal, U. (eds) Data Warehousing and Knowledge Discovery. DaWaK 2012. Lecture Notes in Computer Science, vol 7448. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-32584-7_10
DOI: https://doi.org/10.1007/978-3-642-32584-7_10
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-642-32583-0
Online ISBN: 978-3-642-32584-7
eBook Packages: Computer Science, Computer Science (R0)