Skip to main content

Consistency Algorithms for Multi-Source Warehouse View Maintenance

  • Chapter
Parallel and Distributed Information Systems

Abstract

A warehouse is a data repository containing integrated information for efficient querying and analysis. Maintaining the consistency of warehouse data is challenging, especially if the data sources are autonomous and views of the data at the warehouse span multiple sources. Transactions containing multiple updates at one or more sources, e.g., batch updates, complicate the consistency problem. In this paper we identify and discuss three fundamental transaction processing scenarios for data warehousing. We define four levels of consistency for warehouse data and present a new family of algorithms, the Strobe family, that maintain consistency as the warehouse is updated, under the various warehousing scenarios. All of the algorithms are incremental and can handle a continuous and overlapping stream of updates from the sources. Our implementation shows that the algorithms are practical and realistic choices for a wide variety of update scenarios.

This work was partially supported by Rome Laboratories under Air Force Contract F30602-94-C-0237; by an equipment grant from Digital Equipment Corporation; and by the Advanced Research and Development Committee of the Community Management Staff, as a project in the Massive Digital Data Systems Program.

This paper is an extended version of a paper published in International Conference on Parallel and Distributed Information Systems, December 1996. The main additions are in Section 6.2 and Section 8. Also, we included the full proof of correctness of Strobe and C-Strobe algorithm in Appendix A. Proof of Correctness for Strobe Algorithm and B. Proof of Correctness for Complete Strobe Algorithm.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 84.99
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 109.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info
Hardcover Book
USD 109.99
Price excludes VAT (USA)
  • Durable hardcover edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.

References

  1. R. Alonso, D. Barbara, and H. Garcia-Molina. Data caching issues in an information retrieval system. ACM Transaction on Database Systems, 15 (3): 359–384, September 1990.

    Article  Google Scholar 

  2. E. Baralis, S. Ceri, and S Paraboschi. Conservative timestamp revised for materialized view maintenance in a data warehouse. In Proceedings of the Workshop on Materialized Views, Techniques and Applications, pages 1–9, Montreal, Canada, June 1996.

    Google Scholar 

  3. Y. Breitbart, H. Garcia-Molina, and A. Silberschatz. Overview of multidatabase transaction management. VLDB Journal, 1 (2): 181–239, October 1992.

    Article  Google Scholar 

  4. P.A. Bernstein, V. Hadzilacos, and N. Goodman. Concurrency Control and Recovery in Database Systems. Addison-Wesley, Reading, Massachusetts, 1987.

    Google Scholar 

  5. J.A. Blakeley, P.-A. Larson, and F.W. Tompa. Efficiently updating materialized views. In Proceedings of ACM SIGMOD Conference, pages 61–71, Washington, D.C., June 1986.

    Google Scholar 

  6. M. Cochinwala and J. Bradley. A multidatabase system for tracking and retrieval of financial data. In VLDB Conference, pages 714–721, 1994.

    Google Scholar 

  7. L.S. Colby, T. Griffin, L. Libkin, I.S. Mumick, and H. Trickey. Algorithms for deferred view maintenance. In Proceedings of ACM SIGMOD Conference, pages 469–480, Montreal, Quebec, Canada, June 1996.

    Google Scholar 

  8. A. Courtney, W. Janssen, D. Severson, M. Spreitzer, and F. Wymore. Inter–language unification, release 1.5. Technical Report ISTL–CSA–94–01–01 (Xerox accession number P94–00058, Xerox PARC, May 1994.

    Google Scholar 

  9. S. Ceri and J. Widom. Deriving production rules for incremental view maintenance. In VLDB Conference, pages 577–589, Barcelona, Spain, September 1991.

    Google Scholar 

  10. Rob Goldring and Beth Hamel, January 1996. Personal correspondence about IBM’s data warehouse customer needs.

    Google Scholar 

  11. A. Gupta and I.S. Mumick. Maintenance of materialized views: Problems, techniques, and applications. IEEE Data Engineering Bulletin, Special Issue on Materialized Views and Data Warehousing, 18(2):3–18, June 1995.

    Google Scholar 

  12. A. Gupta, I. Mumick, and V. Subrahmanian. Maintaining views incrementally. In Proceedings of ACM SIGMOD Conference, pages 157–166, Washington, D.C., May 1993.

    Google Scholar 

  13. H. Garcia-Molina and G. Wiederhold. Read-only transactions in a distributed database. ACM Transaction on Database Systems, 7 (2): 209–234, June 1982.

    Article  MATH  Google Scholar 

  14. R. Gallersdorfer and M. Nicola. Improving performance in replicated databases through relaxed coherency. In VLDB Conference, pages 445–456, Zurich, Switzerland, September 1995.

    Google Scholar 

  15. A. Gupta and J. Widom. Local verification of global integrity constraints in distributed databases. In Proceedings of ACM SIGMOD Conference, pages 49–58, Washington, D.C., May 1993.

    Google Scholar 

  16. J.V. Harrison and S.W. Dietrich. Maintenance of materialized views in a deductive database: An update propagation approach. In Proceedings ofthe 1992 JICLSP Workshop on Deductive Databases, pages 56–65, 1992.

    Google Scholar 

  17. R. Hull and G. Zhou. A framework for supporting data integration using the materialized and virtual approaches. In Proceedings of ACM SIGMOD Conference, pages 481–492, Montreal, Quebec, Canada, June 1996.

    Google Scholar 

  18. W.H. Inmon and C. Kelley. Rdb/VMS: Developing the Data Warehouse. QED Publishing Group, Boston, Massachusetts, 1993.

    Google Scholar 

  19. W. Labio and H. Garcia-Molina. Efficient snapshot differential algorithms in data warehousing. In VLDB Conference, pages 63–74, September 1996.

    Google Scholar 

  20. B. Lindsay, L.M. Haas, C. Mohan, H. Pirahesh, and P. Wilms. A snapshot differential refresh algorithm. In Proceedings of ACMSIGMOD Conference, Washington, D.C., May 1986.

    Google Scholar 

  21. X. Qian and G. Wiederhold. Incremental recomputation of active relational expressions. IEEE Transactions on Knowledge and Data Engineering, 3 (3): 337–34I, September 1991.

    Article  Google Scholar 

  22. A. Segev and W. Fang. Currency-based updates to distributed materialized views. In ICDE Conference, pages 512–520, Los Alamitos, February 1990.

    Google Scholar 

  23. O. Shmueli and A. Itai. Maintenance of views. In Proceedings of ACM SIGMOD Conference, pages 240–255, Boston, Massachusetts, May 1984.

    Google Scholar 

  24. A. Segev and J. Park. Updating distributed materialized views. IEEE Transactions on Knowledge and Data Engineering, 1(2):173–184,June 1989.

    Google Scholar 

  25. Sybase, Inc. Command Reference Manual, release 4.9 edition, 1992.

    Google Scholar 

  26. J.L. Wiener, H. Gupta, W.J. Labio, Y. Zhuge, H. Garcia-Molina, and J. Widom. A system prototype for warehouse view maintenance. In Proceedings of the Workshop on Materialized Views, Techniques and Applications, pages 26–33, Montreal, Canada, June 1996.

    Google Scholar 

  27. G. Wiederhold and X. Qian. Consistency control of replicated data in federated databases. In Proceedings of the IEEE Workshop on Management of Replicated Data, pages 130–132, Houston, Texas, November 1990.

    Chapter  Google Scholar 

  28. Y. Zhuge, H. Garcia-Molina, J. Hammer, and J. Widom. View maintenance in a warehousing environment. In Proceedings of ACM SIGMOD Conference, pages 316–327, San Jose, Califomia, May 1995.

    Google Scholar 

  29. Y. Zhuge, J. L. Wiener, and H. Garcia-Molina. Multiple view consistency for data warehousing. In ICDE Conference, Birmingham, UK, April 1997.

    Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 1998 Springer Science+Business Media New York

About this chapter

Cite this chapter

Zhuge, Y., Garcia-Molina, H., Wiener, J.L. (1998). Consistency Algorithms for Multi-Source Warehouse View Maintenance. In: Naughton, J.F., Weikum, G. (eds) Parallel and Distributed Information Systems. Springer, Boston, MA. https://doi.org/10.1007/978-1-4757-6132-0_2

Download citation

  • DOI: https://doi.org/10.1007/978-1-4757-6132-0_2

  • Publisher Name: Springer, Boston, MA

  • Print ISBN: 978-1-4419-5026-0

  • Online ISBN: 978-1-4757-6132-0

  • eBook Packages: Springer Book Archive

Publish with us

Policies and ethics