Abstract
Highly available and resilient networks play a decisive role in today’s networked world. Because network faults are inevitable and networks are becoming increasingly intricate, finding effective fault recovery solutions in a timely manner is a challenging task for administrators. An automated mechanism to support fault resolution is therefore essential to an efficient fault-handling process. In this paper, we propose an architecture to support automated fault recovery in terms of traffic engineering, recovery knowledge discovery, and automated recovery planning. We base our discussion on an application scenario: recovery from a border router failure while maintaining an optimized configuration of outbound inter-domain traffic.
Copyright information
© 2008 IFIP International Federation for Information Processing
About this paper
Cite this paper
Liu, F., Hadjiantonis, A.M., Tran, H.M., Amin, M. (2008). An Architecture for Supporting Network Fault Recovery Management. In: Hausheer, D., Schönwälder, J. (eds) Resilient Networks and Services. AIMS 2008. Lecture Notes in Computer Science, vol 5127. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-540-70587-1_9
DOI: https://doi.org/10.1007/978-3-540-70587-1_9
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-70586-4
Online ISBN: 978-3-540-70587-1
eBook Packages: Computer Science (R0)