Abstract
Failures are inevitable given the large scale of data center networks (DCNs). Most DCN topologies are designed with redundancy so that they can recover from failures and maintain performance. We introduce a taxonomy of common failures in DCNs and review fault-tolerance techniques. We also present an overview of the fault-tolerance characteristics of several DCN topologies, based on simulation results.
Copyright information
© 2013 The Author(s)
About this chapter
Cite this chapter
Liu, Y., Muppala, J.K., Veeraraghavan, M., Lin, D., Hamdi, M. (2013). Fault-Tolerant Routing. In: Data Center Networks. SpringerBriefs in Computer Science. Springer, Cham. https://doi.org/10.1007/978-3-319-01949-9_6
DOI: https://doi.org/10.1007/978-3-319-01949-9_6
Published:
Publisher Name: Springer, Cham
Print ISBN: 978-3-319-01948-2
Online ISBN: 978-3-319-01949-9
eBook Packages: Computer Science, Computer Science (R0)