Skip to main content

Fault-Tolerant Routing

  • Chapter
  • First Online:
  • 2065 Accesses

Part of the book series: SpringerBriefs in Computer Science ((BRIEFSCOMPUTER))

Abstract

Failures are inevitable considering the large scale of DCN. Most DCN topologies are designed with redundancy to recover from failures and maintain performance. We introduce the taxonamy of common failures in DCNs, and review the fault-tolerance techniques. An overview of the fault-tolerance characteristics of some DCN topologies based on simulation results is also presented.

This is a preview of subscription content, log in via an institution.

Buying options

Chapter
USD   29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD   39.99
Price excludes VAT (USA)
  • Available as EPUB and PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD   54.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Learn about institutional subscriptions

References

  1. Al-Fares, M., Loukissas, A., Vahdat, A.: A scalable, commodity data center network architecture. In: Proceedings of the ACM SIGCOMM 2008 Conference on Data Communication, Seattle, pp. 63–74. ACM (2008)

    Google Scholar 

  2. Al-Fares, M., Radhakrishnan, S., Raghavan, B., Huang, N., Vahdat, A.: Hedera: dynamic flow scheduling for data center networks. In: Proceedings of the 7th USENIX Conference on Networked Systems Design and Implementation, San Jose, p. 19. USENIX Association (2010)

    Google Scholar 

  3. Duato, J., Yalamanchili, S., Ni, L.: Interconnection Networks: An Engineering Approach. Morgan Kaufmann, San Francisco (2003)

    Google Scholar 

  4. Greenberg, A., Hamilton, J.R., Jain, N., Kandula, S., Kim, C., Lahiri, P., Maltz, D.A., Patel, P., Sengupta, S.: VL2: a scalable and flexible data center network. SIGCOMM Comput. Commun. Rev. 39(4), 51–62 (2009). doi:http://doi.acm.org/10.1145/1594977.1592576

  5. Guo, C., Wu, H., Tan, K., Shi, L., Zhang, Y., Lu, S.: DCell: a scalable and fault-tolerant network structure for data centers. ACM SIGCOMM Comput. Commun. Rev. 38(4), 75–86 (2008)

    Article  Google Scholar 

  6. Guo, C., Lu, G., Li, D., Wu, H., Zhang, X., Shi, Y., Tian, C., Zhang, Y., Lu, S.: BCube: a high performance, server-centric network architecture for modular data centers. ACM SIGCOMM Comput. Commun. Rev. 39(4), 63–74 (2009)

    Article  Google Scholar 

  7. Katz, D., Ward, D.: Bfd for ipv4 and ipv6 (single hop). draft-ietf-bfd-v4v6-1hop-09 (work in progress) (2009)

    Google Scholar 

  8. Liu, Y., Muppala, J.: DCNSim: a data center network simulator. In: 33rd International Conference on Distributed Computing Systems Workshops (ICDCSW), Philadelphia, 2013, pp. 1–6. IEEE (2013)

    Google Scholar 

  9. Liu, Y., Muppala, J.: Fault-tolerance characteristics of data center network topologies using fault regions. In: IEEE/IFIP 43rd International Conference on Dependable Systems and Networks Workshops (DSN-W), Budapest, 2013, pp. 1–6. IEEE (2013)

    Google Scholar 

  10. Liu, Y., Lin, D., Muppala, J., Hamdi, M.: A study of fault-tolerance characteristics of data center networks. In: IEEE/IFIP 42nd International Conference on Dependable Systems and Networks Workshops (DSN-W), Boston, 2012, pp. 1–6. IEEE (2012)

    Google Scholar 

  11. Niranjan Mysore, R., Pamboris, A., Farrington, N., Huang, N., Miri, P., Radhakrishnan, S., Subramanya, V., Vahdat, A.: Portland: a scalable fault-tolerant layer 2 data center network fabric. ACM SIGCOMM Comput. Commun. Rev. 39(4), 39–50 (2009)

    Article  Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Rights and permissions

Reprints and permissions

Copyright information

© 2013 The Author(s)

About this chapter

Cite this chapter

Liu, Y., Muppala, J.K., Veeraraghavan, M., Lin, D., Hamdi, M. (2013). Fault-Tolerant Routing. In: Data Center Networks. SpringerBriefs in Computer Science. Springer, Cham. https://doi.org/10.1007/978-3-319-01949-9_6

Download citation

  • DOI: https://doi.org/10.1007/978-3-319-01949-9_6

  • Published:

  • Publisher Name: Springer, Cham

  • Print ISBN: 978-3-319-01948-2

  • Online ISBN: 978-3-319-01949-9

  • eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics