The Lord of the Rings: Efficient Maintenance of Views at Data Warehouses
Data warehouses have become extremely important to support online analytical processing (OLAP) queries in databases. Since the data view that is obtained at a data warehouse is derived from multiple data sources that are continuously updated, keeping a data warehouse up-to-date becomes a crucial problem. An approach referred to as the incremental view maintenance is widely used. Unfortunately, a precise and formal definition of view maintenance (which can actually be seen as a distributed computation problem) does not exist. This paper develops a formal model for maintaining views at data warehouses in a distributed asynchronous system. We start by formulating the view maintenance problem in terms of abstract update and data integration operations and state the notions of correctness associated with data warehouse views. We then present a basic protocol and establish its proof of correctness. Finally, we present an efficient version of the proposed protocol by incorporating several optimizations. So, this paper is mainly concerned with basic principles of distributed computing and their use to solve database related problems.
KeywordsData Object Data Warehouse Multiple Data Source Basic Protocol Data Node
Unable to display preview. Download preview PDF.
- 1.Agrawal D., Alonso G., El Abbadi A. and Stanoi I., Exploiting Atomic Broadcast in Replicated Databases. Proc. of the International Conference on Parallelism (EUROPAR), pp. 496–503, August 1997.Google Scholar
- 2.Agrawal D., El Abbadi A., Singh A. and Yurek T., Efficient Data View Maintenance Warehouses. Proc. ACM SIGMOD, pp. 417–427, 1997.Google Scholar
- 3.Agrawal A., El Abbadi A., Mostéfaoui A., Raynal R. and Roy M., The Lord of the Rings: Efficient Maintenance of Views at Data Warehouses. IRISA Research Report #1441, IRISA, Rennes, France. Available at: http://www.irisa.fr/bibli/publi/pi/2002/1441/1441.html.
- 4.Agrawal D., El Abbadi A. and Steinke R.C., Epidemic Algorithms in Replicated Databases. Proc. ACM PODS, pp. 161–172, 1997.Google Scholar
- 6.Colby L.S., Griffin T., Libkin L., Mumick I.S. and Trickey H., Algorithms for Deferred View Maintenance. Proc. ACM SIGMOD, acm Press, pp. 469–480, 1996.Google Scholar
- 7.Demers A. et al., Epidemic Algorithms for Replicated Database Maintenance. Proc. ACM PODC, acm Press, pp. 1–12, 1987.Google Scholar
- 8.Gupta A. and Mumick I.S., Maintenance of Materialized Views: Problems, Techniques and Applications. IEEE Bulletin of the Technical Committee on Data Engineering, 18(2):3–18, June 1995.Google Scholar
- 9.Gupta A., Mumick I.S. and Subramanian V.S., Maintaining Views Incrementally. Proc. ACM SIGMOD, pp. 157–166, 1993.Google Scholar
- 11.Hull R. and Zhou G., A Framework for Supporting Data Integration Using the Materialized and Virtual Approaches. Proc. ACM SIGMOD, pp. 481–492, 1996.Google Scholar
- 13.Powell D. (Guest Editor). Special Issue on Group Communication. Communications of the ACM, 39(4):50–97, 1996.Google Scholar
- 15.Stanoi I., Agrawal D. and El Abbadi A., Modeling and Maintaining Multi-View Data Warehouses. Proc. 18th. Int. Conference on Conceptual Modeling, Paris, France, pp. 161–175, 1999.Google Scholar
- 16.Wuu G. T. and Bernstein A. J., Efficient Solutions to the Replicated Log and Dictionary Problems. Proc. ACM PODC, pp. 233–242, 1984.Google Scholar
- 17.Zhuge Y., Garcia-Molina H., Hammer J. and Widom J., View Maintenance in a Warehousing Environment. Proc. ACM SIGMOD Int. Conf. on Management of Data, pp. 316–327, 1995.Google Scholar
- 18.Zhuge Y., Garcia-Molina H. and Wiener J.L., The Strobe Algorithms for Multi-Source Warehouse Consistency. Proc. Int. Conference on Parallel and Distributed Information Systems, 1996.Google Scholar