Abstract
Alert-raising and deviation detection in OLAP and explora-tory search concerns calling the user’s attention to variations and non-uniform data distributions, or directing the user to the most interesting exploration of the data. In this paper, we are interested in the ability of a data warehouse to monitor continuously new data, and to update accordingly a particular type of materialized views recording statistics, called baselines. It should be possible to detect deviations at various levels of aggregation, and baselines should be fully integrated into the database. We propose Multi-level Baseline Materialized Views (BMV), including the mechanisms to build, refresh and detect deviations. We also propose an incremental approach and formula for refreshing baselines efficiently. An experimental setup proves the concept and shows its efficiency.
This work was done while Pedro Furtado was visiting University of Tours.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Notes
- 1.
For the sake of readability, baseline expressions are simplified in the examples: the top levels are dropped in the group-by sets.
References
Aligon, J., Gallinucci, E., Golfarelli, M., Marcel, P., Rizzi, S.: A collaborative filtering approach for recommending OLAP sessions. Decis. Support Syst. 69, 20–30 (2015)
Fabris, C.C., Freitas, A.A.: Incorporating deviation-detection functionality into the OLAP paradigm. In: XVI Simpósio Brasileiro de Banco de Dados, 1–3 Outubro 2001, Rio de Janeiro, Brasil, Anais/Proceedings, pp. 274–285 (2001)
Furtado, P.: Reduced representations of multidimensional datasets, phd thesis, u. coimbra, December 2000
Ganti, V., Gehrke, J., Ramakrishnan, R., Loh, W.: A framework for measuring differences in data characteristics. J. Comput. Syst. Sci. 64(3), 542–578 (2002)
Geng, L., Hamilton, H.J.: Interestingness measures for data mining: a survey. ACM Comput. Surv. (CSUR) 38(3), 9 (2006)
Han, J., Cheng, H., Xin, D., Yan, X.: Frequent pattern mining: current status and future directions. Data Min. Knowl. Disc. 15(1), 55–86 (2007)
Harinarayan, V., Rajaraman, A., Ullman, J.D.: Implementing data cubes efficiently. SIGMOD Rec. 25(2), 205–216 (1996)
Lòpez, M., Nadal, S., Djedaini, M., Marcel, P., Peralta, V., Furtado, P.: An approach for raising alert in real-time data warehouses. Journes francophones sur les Entrepts de Donnes et lAnalyse en ligne Bruxelles, Belgique, 2–3 avril 2015, 2015(1), 55–86 (2015)
Mumick, I.S., Quass, D., Mumick, B.S.: Maintenance of data cubes and summary tables in a warehouse. In: SIGMOD 1997, Proceedings ACM SIGMOD International Conference on Management of Data, May 13–15, 1997, Tucson, Arizona, USA, pp. 100–111 (1997)
O’Neil, P., O’Neil, E., Chen, X., Revilak, S.: The star schema benchmark and augmented fact table indexing. In: Nambiar, R., Poess, M. (eds.) TPCTC 2009. LNCS, vol. 5895, pp. 237–252. Springer, Heidelberg (2009)
Sarawagi, S.: Explaining differences in multidimensional aggregates. In: VLDB 1999, Proceedings of 25th International Conference on Very Large Data Bases, September 7–10, 1999, Edinburgh, Scotland, UK, pp. 42–53 (1999)
Sarawagi, S.: idiff: Informative summarization of differences in multidimensional aggregates. Data Min. Knowl. Discov. 5(4), 255–276 (2001)
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2015 Springer International Publishing Switzerland
About this paper
Cite this paper
Furtado, P., Nadal, S., Peralta, V., Djedaini, M., Labroche, N., Marcel, P. (2015). Materializing Baseline Views for Deviation Detection Exploratory OLAP. In: Madria, S., Hara, T. (eds) Big Data Analytics and Knowledge Discovery. DaWaK 2015. Lecture Notes in Computer Science(), vol 9263. Springer, Cham. https://doi.org/10.1007/978-3-319-22729-0_19
Download citation
DOI: https://doi.org/10.1007/978-3-319-22729-0_19
Published:
Publisher Name: Springer, Cham
Print ISBN: 978-3-319-22728-3
Online ISBN: 978-3-319-22729-0
eBook Packages: Computer ScienceComputer Science (R0)