Cloud Standby: Disaster Recovery of Distributed Systems in the Cloud

  • Alexander Lenk
  • Stefan Tai
Conference paper
Part of the Lecture Notes in Computer Science book series (LNCS, volume 8745)


Disaster recovery planning and securing business processes against outtakes have been essential parts of running a company for decades. As IT systems became more important, and especially since more and more revenue is generated over the Internet, securing the IT systems that support the business processes against outages is essential. Using fully operational standby sites with periodically updated standby systems is a well-known approach to prepare against disasters. Setting up and maintaining a second datacenter is, however, expensive. In this work, we present Cloud Standby, a warm standby approach for setting up and updating a standby system in the Cloud. We describe the architecture of Cloud Standby and its methods for deploying and updating the standby system. We show that by using Cloud Standby the recovery time and long-term costs of disaster recovery can significantly be reduced.


Cloud Standby IaaS Warm Standby Disaster Recovery Distributed Systems 


  1. 1.
    AWS Inc.: Amazon Web Services, Cloud Computing: Compute, Storage, Database,
  2. 2.
    Ayari, N., et al.: Fault tolerance for highly available internet services: concepts, approaches, and issues. IEEE Commun. Surv. Tutor. 10(2), 34–46 (2008)CrossRefGoogle Scholar
  3. 3.
    Cully, B., et al.: Remus: High Availability Via Asynchronous Virtual Machine Replication. In: Proceedings of the 5th USENIX Symposium on Networked Systems Design and Implementation, pp. 161–174 (2008)Google Scholar
  4. 4.
    Henderson, C.: Building scalable web sites. O’reilly (2008)Google Scholar
  5. 5.
    Hiles, A.: The definitive handbook of business continuity management. Wiley (2010)Google Scholar
  6. 6.
    Klems, M., et al.: Automating the delivery of IT Service Continuity Management through cloud service orchestration. In: 2010 IEEE Network Operations and Management Symposium (NOMS), pp. 65–72 (2010)Google Scholar
  7. 7.
  8. 8.
    Lenk, A., et al.: What’s inside the Cloud? An architectural map of the Cloud landscape. In: ICSE Workshop on Software Engineering Challenges of Cloud Computing, CLOUD 2009, pp. 23–31 (2009)Google Scholar
  9. 9.
    Lenk, A., Pallas, F.: Cloud Standby System and Quality Model. Int. J. Cloud Comput. IJCC. 1(2), 48–59 (2013)Google Scholar
  10. 10.
    Lenk, A., Pallas, F.: Modeling Quality Attributes of Cloud-Standby-Systems. In: Lau, K.-K., Lamersdorf, W., Pimentel, E. (eds.) ESOCC 2013. LNCS, vol. 8135, pp. 49–63. Springer, Heidelberg (2013)CrossRefGoogle Scholar
  11. 11.
    Li, Z., et al.: The Cloud’s Cloudy Moment: A Systematic Survey of Public Cloud Service Outage. ArXiv Prepr. ArXiv13126485 (2013)Google Scholar
  12. 12.
    Mell, P., Grance, T.: The NIST definition of cloud computing. NIST Spec. Publ. 800, 145 (2011)Google Scholar
  13. 13.
    Menken, I., et al.: Itil V3 Malc-Managing Across the Lifecycle Full Certification Online Learning and Study Book Course: The Itil V3 Intermediate Malc Complete Certification Kit. Emereo Pty Limited (2009)Google Scholar
  14. 14.
    Pokharel, M., et al.: Disaster Recovery for System Architecture Using Cloud Computing. Presented at the July (2010)Google Scholar
  15. 15.
    Rajagopalan, S., et al.: SecondSite: disaster tolerance as a service. In: Proceedings of the 8th ACM SIGPLAN/SIGOPS Conference on Virtual Execution Environments, pp. 97–108 (2012)Google Scholar
  16. 16.
    Schmidt, K.: High availability and disaster recovery. Springer (2006)Google Scholar
  17. 17.
    Symantec: 2011 SMB Disaster Preparedness Survey - Global Results (2011)Google Scholar
  18. 18.
    Whitman, M., et al.: Principles of incident response and disaster recovery. Cengage Learning (2013)Google Scholar
  19. 19.
    Wittern, E., et al.: Feature-based Configuration of Vendor-independent Deployments on IaaS. In: 18th IEEE International Enterprise Distributed Object Computing Conference, EDOC (2014)Google Scholar
  20. 20.
    Wood, T., et al.: Disaster recovery as a cloud service: Economic benefits & deployment challenges. In: 2nd USENIX Workshop on Hot Topics in Cloud Computing (2010)Google Scholar
  21. 21.
    Wood, T., et al.: PipeCloud: using causality to overcome speed-of-light delays in cloud-based disaster recovery. In: Proceedings of the 2nd ACM Symposium on Cloud Computing, p. 17 (2011)Google Scholar
  22. 22.
    Ensembl Genome Browser,
  23. 23.
    Topology and Orchestration Specification for Cloud Applications Version 1.0,

Copyright information

© International Federation for Information Processing 2014

Authors and Affiliations

  • Alexander Lenk
    • 1
  • Stefan Tai
    • 2
  1. 1.FZI Forschungszentrum InformatikBerlinGermany
  2. 2.Technische Universität BerlinBerlinGermany

Personalised recommendations