Skip to main content

Detecting Redundancy in Data Warehouse Evolution

  • Conference paper
  • First Online:
Conceptual Modeling — ER ’99 (ER 1999)

Part of the book series: Lecture Notes in Computer Science ((LNCS,volume 1728))

Included in the following conference series:

Abstract

A Data Warehouse (DW) can be abstractly seen as a set of materialized views defined over a set of remote data sources. A DW is intended to satisfy a set of queries. The views materialized in a DWrelate to each other in a complex manner, through common subexpressions, in order to guarantee high query performance and low view maintenance cost. DWs are time varying. As time passes new materialized views are added in order to satisfy new queries or for performance reasons while old queries are dropped. The evolution of a DWcan result in a redundant set of materialized views.

In this paper we address the problem of detecting redundant views in a given DW view selection, that is, views that can be removed from the DW without negatively affecting the query evaluation or the view maintenance process. Using an AND/OR dag representation for multiple queries and views, we first provide a method for detecting materialized views that are not needed in the process of propagating source relation changes to the DW. Then, we use this method to detect materialized views that are redundant. As a side effect, our approach shows how source relation changes can be propagated to the DW materialized views by exploiting common subexpressions between views and by using other materialized views that are not affected by these changes.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 39.99
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 54.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.

References

  1. S. Chaudhuri, R. Krishnamurthy, S. Potamianos, and K. Shim. Optimizing Queries with Materialized Views. In Proc. of the 11th ICDE, pages 190–200, 1995.

    Google Scholar 

  2. T. Griffin and L. Libkin. Incremental maintenance of views with duplicates. In Proc. of the ACM SIGMOD Intl. Conf. on Management of Data, 1995.

    Google Scholar 

  3. A. Gupta, V. Harinarayan, and D. Quass. Aggregate-Query Processing in Data Warehousing Environments. In Proc. of the 21st VLDB, pages 358–369, 1995.

    Google Scholar 

  4. A. Gupta, I. Mumick, and V. Subrahmanian. Maintaining views incrementally. In Proc. of the ACM SIGMOD Intl. Conf. on Management of Data, 1993.

    Google Scholar 

  5. A. Gupta and I. S. Mumick. Maintenance of materialized views: Problems, techniques and applications. Data Engineering, 18(2):3–18, 1995.

    Google Scholar 

  6. H. Gupta. Selection of Views to Materialize in a Data Warehouse. In Proc. of the 6th Intl. Conf. on Database Theory, pages 98–112, 1997.

    Google Scholar 

  7. H. Gupta and I. S. Mumick. Selection of Views to Materialize Under a Maintenance Cost Constraint. In Proc. of the 7th Intl. Conf. on Database Theory, 1999.

    Google Scholar 

  8. W. Inmon. Building the Data Warehouse. John Wiley & Sons, 2nd edition, 1996.

    Google Scholar 

  9. A. Levy, A. O. Mendelson, Y. Sagiv, and D. Srivastava. Answering Queries using Views. In Proc. of the ACM Symp. on Principles of Database Systems, 1995.

    Google Scholar 

  10. D. Quass. Maintenance Expressions for Views with Aggregation. In Workshop on Materialized Views: Techniques and Applications, pages 110–118, 1996.

    Google Scholar 

  11. D. Quass, A. Gupta, I. S. Mumick, and J. Widom. Making Views Self Maintainable for Data Warehousing. In Proc. of the 4th Intl. Conf. on Parallel and Distributed Information Systems, pages 158–169, 1996.

    Google Scholar 

  12. K. A. Ross, D. Srivastava, and S. Sudarshan. Materialized View Maintenance and Integrity Constraint Checking: Trading Space for Time. In Proc. of the ACM SIGMOD Intl. Conf. on Management of Data, pages 447–458, 1996.

    Google Scholar 

  13. N. Roussopoulos and Y. Kang. Principles and techniques in the design of ADMS. Computer, Dec 1986.

    Google Scholar 

  14. T. K. Sellis. Multiple Query Optimization. ACM Transactions on Database Systems, 13(1):23–52, 1988.

    Article  Google Scholar 

  15. D. Theodoratos and M. Bouzeghoub. Data Currency Quality Factors in Data Warehouse Design. In Proc. of the Intl. Workshop on Design and Management of Data Warehouses, Heidelberg, Germany, June 1999.

    Google Scholar 

  16. D. Theodoratos, S. Ligoudistianos, and T. Sellis. Designing the Global Data Warehouse with SPJ Views. In Proc. of the 11th Intl. Conf. on Advanced Information Systems Engineering, Springer-Verlag, LNCS No 1626, pages 180–194, 1999.

    Google Scholar 

  17. D. Theodoratos and T. Sellis. Data Warehouse Configuration. In Proc. of the 23rd Intl. Conf. on Very Large Data Bases, pages 126–135, 1997.

    Google Scholar 

  18. D. Theodoratos and T. Sellis. Data Warehouse Schema and Instance Design. In Proc. of the 17th Intl. Conf. on Conceptual Modeling, Springer LNCS 1507, pages 363–376, 1998.

    Google Scholar 

  19. D. Theodoratos and T. Sellis. Dynamic Data Warehouse Design. In Proc. of the 1st Intl. Conf. on Data Warehousing and Knowledge Discovery, Springer-Verlag, LNCS, 1999.

    MATH  Google Scholar 

  20. J. Yang, K. Karlapalem, and Q. Li. Algorithms for Materialized View Design in Data Warehousing Environment. In Proc. of the 23rd Intl. Conf. on Very Large Data Bases, pages 136–145, 1997.

    Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 1999 Springer-Verlag Berlin Heidelberg

About this paper

Cite this paper

Theodoratos, D. (1999). Detecting Redundancy in Data Warehouse Evolution. In: Akoka, J., Bouzeghoub, M., Comyn-Wattiau, I., Métais, E. (eds) Conceptual Modeling — ER ’99. ER 1999. Lecture Notes in Computer Science, vol 1728. Springer, Berlin, Heidelberg. https://doi.org/10.1007/3-540-47866-3_23

Download citation

  • DOI: https://doi.org/10.1007/3-540-47866-3_23

  • Published:

  • Publisher Name: Springer, Berlin, Heidelberg

  • Print ISBN: 978-3-540-66686-8

  • Online ISBN: 978-3-540-47866-9

  • eBook Packages: Springer Book Archive

Publish with us

Policies and ethics