Scalable Maintenance of Multiple Interrelated Data Warehousing Systems

  • Conference paper

Part of the book series: Lecture Notes in Computer Science ((LNCS,volume 1874))

Abstract

The maintenance of data warehouses (DWs) is becoming an increasingly important topic due to the growing use, derivation and integration of digital information. Most previous work has dealt with one centralized data warehouse only. In this paper, we focus on environments with multiple DWs that are possibly derived from other DWs. In such a large-scale environment, data updates from base sources may arrive at individual data warehouses in different orders, resulting in inconsistent data warehouse extents. We propose to address this problem by employing a registry agent responsible for establishing one unique order for the propagation of updates from the base sources to the DWs. With this solution, individual DW managers can still maintain their respective extents autonomously and independently of each other, which allows them to apply any existing incremental maintenance algorithm from the literature. We demonstrate that this registry-based coordination approach (RyCo) indeed achieves consistency across all DWs.
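The ordering idea in the abstract can be illustrated with a minimal sketch. It is not the RyCo algorithm from the paper: the class names `Registry`, `Warehouse` and `Update`, the sequence-number scheme, and the buffering strategy are all assumptions made for illustration, and the sketch further assumes that every DW receives every update. It only shows how one globally assigned order lets two DWs that receive updates in different arrival orders still apply them identically.

```python
import heapq
from dataclasses import dataclass
from itertools import count


@dataclass(frozen=True, order=True)
class Update:
    seq: int       # global sequence number assigned by the registry (assumed scheme)
    source: str    # base source that produced the update
    payload: str   # opaque description of the change


class Registry:
    """Hypothetical registry agent: stamps every update with one global order."""

    def __init__(self):
        self._next = count(1)

    def register(self, source, payload):
        return Update(next(self._next), source, payload)


class Warehouse:
    """Hypothetical DW manager: buffers updates and applies them in seq order."""

    def __init__(self, name):
        self.name = name
        self.applied = []      # updates applied so far, in order
        self._pending = []     # min-heap of updates that arrived early
        self._expected = 1     # next sequence number to apply

    def receive(self, update):
        heapq.heappush(self._pending, update)
        # Drain the buffer while the next expected update is available.
        while self._pending and self._pending[0].seq == self._expected:
            self.applied.append(heapq.heappop(self._pending))
            self._expected += 1


registry = Registry()
u1 = registry.register("source_A", "insert r1")
u2 = registry.register("source_B", "delete r7")
u3 = registry.register("source_A", "update r1")

dw1, dw2 = Warehouse("DW1"), Warehouse("DW2")
for u in (u1, u2, u3):      # DW1 sees the updates in registry order
    dw1.receive(u)
for u in (u3, u1, u2):      # DW2 sees them in a different arrival order
    dw2.receive(u)

same_order = dw1.applied == dw2.applied
```

In this sketch each warehouse would hand the ordered updates to whatever incremental maintenance routine it already uses; the registry only fixes the order and does not take part in the maintenance itself.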

This work was supported in part by several grants from NSF, namely, the NSF NYI grant #IRI 97-96264, the NSF CISE Instrumentation grant #IRIS 97-29878, and the NSF grant #IIS 97-32897. Dr. Rundensteiner would like to thank our industrial sponsors, in particular, IBM for the IBM partnership award, and GTE for partial support of Xin Zhang.


Copyright information

© 2000 Springer-Verlag Berlin Heidelberg

About this paper

Cite this paper

Ding, L., Zhang, X., Rundensteiner, E.A. (2000). Scalable Maintenance of Multiple Interrelated Data Warehousing Systems. In: Kambayashi, Y., Mohania, M., Tjoa, A.M. (eds) Data Warehousing and Knowledge Discovery. DaWaK 2000. Lecture Notes in Computer Science, vol 1874. Springer, Berlin, Heidelberg. https://doi.org/10.1007/3-540-44466-1_11

  • DOI: https://doi.org/10.1007/3-540-44466-1_11

  • Publisher Name: Springer, Berlin, Heidelberg

  • Print ISBN: 978-3-540-67980-6

  • Online ISBN: 978-3-540-44466-4

  • eBook Packages: Springer Book Archive
