Skip to main content

Materializing Baseline Views for Deviation Detection Exploratory OLAP

  • Conference paper
  • First Online:
Book cover Big Data Analytics and Knowledge Discovery (DaWaK 2015)

Part of the book series: Lecture Notes in Computer Science ((LNISA,volume 9263))

Included in the following conference series:

Abstract

Alert-raising and deviation detection in OLAP and explora-tory search concerns calling the user’s attention to variations and non-uniform data distributions, or directing the user to the most interesting exploration of the data. In this paper, we are interested in the ability of a data warehouse to monitor continuously new data, and to update accordingly a particular type of materialized views recording statistics, called baselines. It should be possible to detect deviations at various levels of aggregation, and baselines should be fully integrated into the database. We propose Multi-level Baseline Materialized Views (BMV), including the mechanisms to build, refresh and detect deviations. We also propose an incremental approach and formula for refreshing baselines efficiently. An experimental setup proves the concept and shows its efficiency.

This work was done while Pedro Furtado was visiting University of Tours.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 39.99
Price excludes VAT (USA)
  • Available as EPUB and PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 54.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Notes

  1. 1.

    For the sake of readability, baseline expressions are simplified in the examples: the top levels are dropped in the group-by sets.

References

  1. Aligon, J., Gallinucci, E., Golfarelli, M., Marcel, P., Rizzi, S.: A collaborative filtering approach for recommending OLAP sessions. Decis. Support Syst. 69, 20–30 (2015)

    Article  Google Scholar 

  2. Fabris, C.C., Freitas, A.A.: Incorporating deviation-detection functionality into the OLAP paradigm. In: XVI Simpósio Brasileiro de Banco de Dados, 1–3 Outubro 2001, Rio de Janeiro, Brasil, Anais/Proceedings, pp. 274–285 (2001)

    Google Scholar 

  3. Furtado, P.: Reduced representations of multidimensional datasets, phd thesis, u. coimbra, December 2000

    Google Scholar 

  4. Ganti, V., Gehrke, J., Ramakrishnan, R., Loh, W.: A framework for measuring differences in data characteristics. J. Comput. Syst. Sci. 64(3), 542–578 (2002)

    Article  MathSciNet  MATH  Google Scholar 

  5. Geng, L., Hamilton, H.J.: Interestingness measures for data mining: a survey. ACM Comput. Surv. (CSUR) 38(3), 9 (2006)

    Article  Google Scholar 

  6. Han, J., Cheng, H., Xin, D., Yan, X.: Frequent pattern mining: current status and future directions. Data Min. Knowl. Disc. 15(1), 55–86 (2007)

    Article  MathSciNet  Google Scholar 

  7. Harinarayan, V., Rajaraman, A., Ullman, J.D.: Implementing data cubes efficiently. SIGMOD Rec. 25(2), 205–216 (1996)

    Article  Google Scholar 

  8. Lòpez, M., Nadal, S., Djedaini, M., Marcel, P., Peralta, V., Furtado, P.: An approach for raising alert in real-time data warehouses. Journes francophones sur les Entrepts de Donnes et lAnalyse en ligne Bruxelles, Belgique, 2–3 avril 2015, 2015(1), 55–86 (2015)

    Google Scholar 

  9. Mumick, I.S., Quass, D., Mumick, B.S.: Maintenance of data cubes and summary tables in a warehouse. In: SIGMOD 1997, Proceedings ACM SIGMOD International Conference on Management of Data, May 13–15, 1997, Tucson, Arizona, USA, pp. 100–111 (1997)

    Google Scholar 

  10. O’Neil, P., O’Neil, E., Chen, X., Revilak, S.: The star schema benchmark and augmented fact table indexing. In: Nambiar, R., Poess, M. (eds.) TPCTC 2009. LNCS, vol. 5895, pp. 237–252. Springer, Heidelberg (2009)

    Google Scholar 

  11. Sarawagi, S.: Explaining differences in multidimensional aggregates. In: VLDB 1999, Proceedings of 25th International Conference on Very Large Data Bases, September 7–10, 1999, Edinburgh, Scotland, UK, pp. 42–53 (1999)

    Google Scholar 

  12. Sarawagi, S.: idiff: Informative summarization of differences in multidimensional aggregates. Data Min. Knowl. Discov. 5(4), 255–276 (2001)

    Article  MATH  Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Patrick Marcel .

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2015 Springer International Publishing Switzerland

About this paper

Cite this paper

Furtado, P., Nadal, S., Peralta, V., Djedaini, M., Labroche, N., Marcel, P. (2015). Materializing Baseline Views for Deviation Detection Exploratory OLAP. In: Madria, S., Hara, T. (eds) Big Data Analytics and Knowledge Discovery. DaWaK 2015. Lecture Notes in Computer Science(), vol 9263. Springer, Cham. https://doi.org/10.1007/978-3-319-22729-0_19

Download citation

  • DOI: https://doi.org/10.1007/978-3-319-22729-0_19

  • Published:

  • Publisher Name: Springer, Cham

  • Print ISBN: 978-3-319-22728-3

  • Online ISBN: 978-3-319-22729-0

  • eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics