Encyclopedia of Database Systems

2018 Edition
Editors: Ling Liu, M. Tamer Özsu

Distributed Recovery

  • Kian-Lee Tan
Reference work entry
DOI: https://doi.org/10.1007/978-1-4614-8265-9_712


Recovery in distributed commit protocols; Recovery in distributed database systems; Recovery in replicated database systems


In a distributed database system (DDBS), a failure during transaction processing (such as the failure of a site where a subtransaction is being processed) may leave the database in an inconsistent state. A recovery subsystem is therefore an essential component of a DDBS [13]. To ensure correctness, recovery mechanisms must guarantee transaction atomicity and durability even when failures occur.

Distributed recovery is more complicated than centralized database recovery because failures can also occur on the communication links or at a remote site. Ideally, a recovery system should be simple, incur tolerable overhead, maintain system consistency, provide partial operability, and avoid global rollback [6].
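
To make the atomicity requirement concrete, the following is a minimal sketch of how a restarted site might decide what to do with each transaction found in its log. It assumes a two-phase-commit style of logging, which this entry does not itself specify; the record names (BEGIN, READY, COMMIT, ABORT) and the functions `recovery_action` and `recover` are hypothetical and chosen only for illustration.

```python
from enum import Enum


class LogRecord(Enum):
    # Hypothetical record types, assuming a two-phase-commit style log.
    BEGIN = "begin"
    READY = "ready"    # site voted yes and is waiting for the global decision
    COMMIT = "commit"
    ABORT = "abort"


class Action(Enum):
    UNDO = "undo"                        # roll back the local effects
    REDO = "redo"                        # reapply the local effects
    ASK_COORDINATOR = "ask_coordinator"  # outcome cannot be decided locally


def recovery_action(log):
    """Decide what a restarted site does for one transaction.

    `log` is the ordered list of LogRecord values that reached stable
    storage for that transaction before the crash.
    """
    if LogRecord.COMMIT in log:
        return Action.REDO
    if LogRecord.ABORT in log:
        return Action.UNDO
    if LogRecord.READY in log:
        # The site voted yes but never learned the outcome, so it
        # cannot decide on its own without risking inconsistency.
        return Action.ASK_COORDINATOR
    # The site never voted, so no global commit can have happened.
    return Action.UNDO


def recover(site_log):
    """Map every transaction id in a site's log to its recovery action."""
    return {txn: recovery_action(records) for txn, records in site_log.items()}
```

Under these assumptions, only a transaction whose last record is READY forces the site to contact another site. Every other case is resolved locally, which is one way to read the goals of partial operability and avoiding global rollback.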

Historical Background

A DDBS must be reliable for it to be useful. In particular, a reliable DDBS must guarantee transaction atomicity and...


Recommended Reading

  1. Chrysanthis PK, Samaras G, Al-Houmaily YJ. Recovery and performance of atomic commit processing in distributed database systems. In: Kumar V, Hsu M, editors. Recovery mechanisms in database systems. Upper Saddle River: Prentice-Hall; 1998. Chapter 13.
  2. Gore M, Ghosh RK. Recovery of mobile transactions. In: Proceedings of the 11th International Workshop on Database and Expert Systems Applications; 2000. p. 23–7.
  3. Gray J. Notes on data base operating systems. In: Bayer R, Graham R, Seegmuller G, editors. Operating systems – an advanced course. LNCS, vol. 60, Springer; 1978. p. 393–481.
  4. Gray J, et al. The recovery manager of the System R database manager. ACM Comput Surv. 1981;13(2):223–43.
  5. Hvasshovd S, Torbjornsen O, Bratsberg S, Holager P. The ClustRa telecom database: high availability, high throughput, and real-time response. In: Proceedings of the 21st International Conference on Very Large Data Bases; 1995. p. 469–77.
  6. Isloor SS, Marsland TA. System recovery in distributed databases. In: Proceedings of the 3rd International Computer Software and Applications Conference; 1979. p. 421–26.
  7. Jimenez-Peris R, Patino-Martinez M, Alonso G. An algorithm for non-intrusive, parallel recovery of replicated data and its correctness. In: Proceedings of the 21st Symposium on Reliable Distributed Systems; 2002. p. 150–9.
  8. Lampson B, Sturgis H. Crash recovery in a distributed data storage system. Technical report, Computer Science Laboratory, Xerox Palo Alto Research Center, California; 1976.
  9. Lau E, Madden S. An integrated approach to recovery and high availability in an updatable, distributed data warehouse. In: Proceedings of the 32nd International Conference on Very Large Data Bases; 2006. p. 12–5.
  10. Lin J, Dunham MH. A low-cost checkpointing technique for distributed databases. Distrib Parall Databases. 2001;10(3):241–68.
  11. Lomet D. Consistent timestamping for transactions in distributed systems. Technical Report CRL90/3, Cambridge Research Laboratory, Digital Equipment Corporation; 1990.
  12. Mohan C, Lindsay B, Obermarck R. Transaction management in the R* distributed data base management system. ACM Trans Database Syst. 1986;11(4):378–96.
  13. Özsu MT, Valduriez P. Principles of distributed database systems. 2nd ed. New York: Prentice-Hall; 1999.
  14. Skeen D. Non-blocking commit protocols. In: Proceedings of the ACM SIGMOD International Conference on Management of Data; 1981. p. 133–42.
  15. Wang Y, Liu X. Agent based dynamic recovery protocol in distributed databases. In: Proceedings of the 2nd International Symposium on Parallel and Distributed Computing; 2003.

Copyright information

© Springer Science+Business Media, LLC, part of Springer Nature 2018

Authors and Affiliations

  1. Department of Computer Science, National University of Singapore, Singapore

Section editors and affiliations

  • Kian-Lee Tan
    • 1
  1. Department of Computer Science, National University of Singapore, Singapore