Abstract
A Data Warehouse (DW) can be abstractly seen as a set of materialized views defined over a set of remote data sources. A DW is intended to satisfy a set of queries. The views materialized in a DWrelate to each other in a complex manner, through common subexpressions, in order to guarantee high query performance and low view maintenance cost. DWs are time varying. As time passes new materialized views are added in order to satisfy new queries or for performance reasons while old queries are dropped. The evolution of a DWcan result in a redundant set of materialized views.
In this paper we address the problem of detecting redundant views in a given DW view selection, that is, views that can be removed from the DW without negatively affecting the query evaluation or the view maintenance process. Using an AND/OR dag representation for multiple queries and views, we first provide a method for detecting materialized views that are not needed in the process of propagating source relation changes to the DW. Then, we use this method to detect materialized views that are redundant. As a side effect, our approach shows how source relation changes can be propagated to the DW materialized views by exploiting common subexpressions between views and by using other materialized views that are not affected by these changes.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Preview
Unable to display preview. Download preview PDF.
References
S. Chaudhuri, R. Krishnamurthy, S. Potamianos, and K. Shim. Optimizing Queries with Materialized Views. In Proc. of the 11th ICDE, pages 190–200, 1995.
T. Griffin and L. Libkin. Incremental maintenance of views with duplicates. In Proc. of the ACM SIGMOD Intl. Conf. on Management of Data, 1995.
A. Gupta, V. Harinarayan, and D. Quass. Aggregate-Query Processing in Data Warehousing Environments. In Proc. of the 21st VLDB, pages 358–369, 1995.
A. Gupta, I. Mumick, and V. Subrahmanian. Maintaining views incrementally. In Proc. of the ACM SIGMOD Intl. Conf. on Management of Data, 1993.
A. Gupta and I. S. Mumick. Maintenance of materialized views: Problems, techniques and applications. Data Engineering, 18(2):3–18, 1995.
H. Gupta. Selection of Views to Materialize in a Data Warehouse. In Proc. of the 6th Intl. Conf. on Database Theory, pages 98–112, 1997.
H. Gupta and I. S. Mumick. Selection of Views to Materialize Under a Maintenance Cost Constraint. In Proc. of the 7th Intl. Conf. on Database Theory, 1999.
W. Inmon. Building the Data Warehouse. John Wiley & Sons, 2nd edition, 1996.
A. Levy, A. O. Mendelson, Y. Sagiv, and D. Srivastava. Answering Queries using Views. In Proc. of the ACM Symp. on Principles of Database Systems, 1995.
D. Quass. Maintenance Expressions for Views with Aggregation. In Workshop on Materialized Views: Techniques and Applications, pages 110–118, 1996.
D. Quass, A. Gupta, I. S. Mumick, and J. Widom. Making Views Self Maintainable for Data Warehousing. In Proc. of the 4th Intl. Conf. on Parallel and Distributed Information Systems, pages 158–169, 1996.
K. A. Ross, D. Srivastava, and S. Sudarshan. Materialized View Maintenance and Integrity Constraint Checking: Trading Space for Time. In Proc. of the ACM SIGMOD Intl. Conf. on Management of Data, pages 447–458, 1996.
N. Roussopoulos and Y. Kang. Principles and techniques in the design of ADMS. Computer, Dec 1986.
T. K. Sellis. Multiple Query Optimization. ACM Transactions on Database Systems, 13(1):23–52, 1988.
D. Theodoratos and M. Bouzeghoub. Data Currency Quality Factors in Data Warehouse Design. In Proc. of the Intl. Workshop on Design and Management of Data Warehouses, Heidelberg, Germany, June 1999.
D. Theodoratos, S. Ligoudistianos, and T. Sellis. Designing the Global Data Warehouse with SPJ Views. In Proc. of the 11th Intl. Conf. on Advanced Information Systems Engineering, Springer-Verlag, LNCS No 1626, pages 180–194, 1999.
D. Theodoratos and T. Sellis. Data Warehouse Configuration. In Proc. of the 23rd Intl. Conf. on Very Large Data Bases, pages 126–135, 1997.
D. Theodoratos and T. Sellis. Data Warehouse Schema and Instance Design. In Proc. of the 17th Intl. Conf. on Conceptual Modeling, Springer LNCS 1507, pages 363–376, 1998.
D. Theodoratos and T. Sellis. Dynamic Data Warehouse Design. In Proc. of the 1st Intl. Conf. on Data Warehousing and Knowledge Discovery, Springer-Verlag, LNCS, 1999.
J. Yang, K. Karlapalem, and Q. Li. Algorithms for Materialized View Design in Data Warehousing Environment. In Proc. of the 23rd Intl. Conf. on Very Large Data Bases, pages 136–145, 1997.
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 1999 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Theodoratos, D. (1999). Detecting Redundancy in Data Warehouse Evolution. In: Akoka, J., Bouzeghoub, M., Comyn-Wattiau, I., Métais, E. (eds) Conceptual Modeling — ER ’99. ER 1999. Lecture Notes in Computer Science, vol 1728. Springer, Berlin, Heidelberg. https://doi.org/10.1007/3-540-47866-3_23
Download citation
DOI: https://doi.org/10.1007/3-540-47866-3_23
Published:
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-66686-8
Online ISBN: 978-3-540-47866-9
eBook Packages: Springer Book Archive