Skip to main content

Elimination of Redundant Views in Multidimensional Aggregates

  • Conference paper
  • First Online:
Data Warehousing and Knowledge Discovery (DaWaK 2000)

Part of the book series: Lecture Notes in Computer Science ((LNCS,volume 1874))

Included in the following conference series:

Abstract

On-line analytical processing provides multidimensional data analysis, through extensive computation based on aggregation, along many dimensions and hierarchies. To accelerate query-response time, pre-computed results are often stored for later retrieval. This adds a prohibitive storage overhead when applied to the whole set of aggregates. In this paper we describe a novel approach which provides the means for the efficient selection, computation and storage of multidimensional aggregates. The approach identifies redundant aggregates, by inspection, thus allowing only distinct aggregates to be computed and stored. We propose extensions to relational theory and also present new algorithms for implementing the approach, providing a solution which is both scalable and low in complexity. The experiments were conducted using real and synthetic datasets and demonstrate that significant savings in computation time and storage space can be achieved when redundant aggregates are eliminated. Savings have also been shown to increase as dimensionality increases. Finally, the implications of this work affect the indexing and maintenance of views and the user interface.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 39.99
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 54.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.

References

  1. S. Agarwal, R. Agrawal, P.M. Deshpande, A. Gupta, J.F. Naughton, R. Ramakrishnan, and S. Sarawagi. On the Computation of Multidimensional Aggregates. In Proceedings of the 22nd International Conference on Very Large Databases, pages 506–521, Mumbai, Sept. 1996.

    Google Scholar 

  2. E. Baralis, S. Paraboschi, E. Teniente. Materialized View Selection in a Multidimensional Database. Proceedings of the 23rd International Conference on Very Large Databases, pages 156–165, Athens 1997.

    Google Scholar 

  3. D. Barbara, M. Sullivan. Quasi-Cubes: A space-efficient way to support approximate multidimensional databases. Technical Report-Dept of Inf and Soft. Eng. George Mason University 1998.

    Google Scholar 

  4. K., Beyer, R., Ramakrishnan Bottom-Up Computation and Iceberg CUBEs. Proc. ACM SIGMOD International. Conf. on Management of Data 1999, pages 359–370, Philadelphia PA, USA, June 1999.

    Google Scholar 

  5. E.F., Codd, “A relational model for large shared data banks. Communications of the ACM, 13(6):377–387, 1970.

    Article  MATH  Google Scholar 

  6. E.F., Codd, S.B. Codd, C.T. Salley. Providing OLAP (On-Line Analytical Processing) to User Analyst: An IT Mandate. Arbor Software at http://www.arborsoft.com/OLAP.html.

  7. S. Chaudhuri, U. Dayal. An Overview of Data Warehousing and OLAP Technology. Technical Report MSR-TR-97-14, Microsoft Research Advanced Technology, March 1997.

    Google Scholar 

  8. P.M. Despande, A. Shukla, J.F. Naughton, K. Ramaswamy. Storage Estimation of the Multidimensional Aggregates. In Proceedings of the 22nd International Conference on Very Large Databases, pages 522–531, Mumbai, Sept. 1996.

    Google Scholar 

  9. J. Gray, A. Bosworth, A. Layman, and H. Pirahesh. Data Cube: A Relational Aggregation Operator Generalizing Group-By, Cross-Tab, and Sub-Totals. Proceedings of the 21st International Conference on Data Engineering. New Orleans (LA), USA, Feb. 26-March 1), 1996.

    Google Scholar 

  10. H. Gupta. Selection of Views to Materialize in a Data Warehouse. Proc. of the 6th International Conference in Database Theory (ICDT), pages 98–112, Delphi, Jan 1997.

    Google Scholar 

  11. V. Harinarayan, A. Rajaraman, J.D. Ullman. Implementing Data Cubes Efficiently. Proceedings of ACM SIGMOD International Conference on Management of Data, pages 205–227, 1996.

    Google Scholar 

  12. C.J. Hahn, S.G. Warren, J. London. Edited synoptic cloud reports over the globe 1982–1991. Available from http://cdiac.esd.ornl.gov/cdiac/ndps/ndp026b.html, 1994.

  13. R. Kimball. The Data Warehouse Toolkit. John Wiley 1996.

    Google Scholar 

  14. N. Kotsis, D.R. McGregor. Compact Representation: An Efficient Implementation for the Data Warehouse Architecture. 1st International Conference, DaWAK’ 99 pages 78–85. Florence, Italy, August 99.

    Google Scholar 

  15. N. Kotsis. Multidimensional Aggregation in OLAP systems. PhD thesis. Department of Computer Science, University of Strathclyde, February 2000.

    Google Scholar 

  16. C. Lucchesi, S. Osborn. Candidate keys for relations. J. Computer and System Science, 17(2:270–279, 1978.

    Article  MATH  MathSciNet  Google Scholar 

  17. O’Neil, P., Graefe, G., Multi-Table Joins through Bitmapped Join Indices. Proceedings of ACM SIGMOD International Conference on Management of Data, 1996.

    Google Scholar 

  18. K.A. Ross, D. Srivastava Fast Computation of Sparse Datacubes. Proc. of the 23rd International Conference on Very Large Databases, pages 116–125, Athens 1997.

    Google Scholar 

  19. S. Sarawagi, R. Agrawal, A. Gupta. On Computing the Data Cube. it Research report 10026, IBM Almaden Research Center, San Jose, California, 1996.

    Google Scholar 

  20. A. Shukla, P.M. Despande, J.F. Naughton. Materialized View Selection for Multidimensional Datasets, Proc. of 24rd International Conference on Very Large Databases, pages 488–499, New York 1998.

    Google Scholar 

  21. F. Raab, editor. TPC Benchmark TM-Standard Specification Revision 1.3.1 Transaction Processing Council 1998

    Google Scholar 

  22. J.S. Vitter and M. Wang. Approximate Computation of Multidimensional Aggregates of Sparse Data Using Wavelets. Proc. ACM SIGMOD International Conference on Management of Data 1999, p. 193–204, Philadelphia PA, USA, June 1999.

    Google Scholar 

  23. J. Widom. Research Problems in Data Warehousing. Proc. of the 4th Intl. Conference of CIKM, pages 25–30, Nov. 1995.

    Google Scholar 

  24. M.C. Wu and A. Buchmann. Encoded bitmap indexing for dataWarehouses. International Conference on data Engineering, pages 220–230, 1998.

    Google Scholar 

  25. Y. Zhao, P.M. Deshpande, J.F. Naughton. An Array based Algorithm for Simultaneous Multidimensional Aggregates, Proceedings of ACM SIGMOD International Conference On Management of Data, pages 159–170, 1997.

    Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2000 Springer-Verlag Berlin Heidelberg

About this paper

Cite this paper

Kotsis, N., McGregor, D.R. (2000). Elimination of Redundant Views in Multidimensional Aggregates. In: Kambayashi, Y., Mohania, M., Tjoa, A.M. (eds) Data Warehousing and Knowledge Discovery. DaWaK 2000. Lecture Notes in Computer Science, vol 1874. Springer, Berlin, Heidelberg. https://doi.org/10.1007/3-540-44466-1_15

Download citation

  • DOI: https://doi.org/10.1007/3-540-44466-1_15

  • Published:

  • Publisher Name: Springer, Berlin, Heidelberg

  • Print ISBN: 978-3-540-67980-6

  • Online ISBN: 978-3-540-44466-4

  • eBook Packages: Springer Book Archive

Publish with us

Policies and ethics