Abstract
The main intention of this paper is to introduce the proposition of a new role-based approach to self-healing monitoring. This is preceded by an overview of existing approaches to the monitoring of distributed systems using self-healing features. Starting with a discussion of autonomous monitoring systems, we will come to self-healing systems. These systems should be able to automatically resolve the problems that occur in a system under monitoring. The paper provides insight into various aspects of self-healing monitoring systems at the software and hardware level. A detailed description of a new agent-based system, AgeMon, is covered later on. The system is based on the roles played by different types of agents. The self-healing features can be achieved by a form of cooperation of agents, e.g. monitoring agents, rule agents, database agents. The paper discusses the roles and gives an implementation background.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Preview
Unable to display preview. Download preview PDF.
References
Predictive Self-Healing in the Solaris 10 Operating System - A Technical Introduction (September 2004), http://www.sun.com/bigadmin/content/selfheal/selfheal_overview.pdf
Alonso, J., Torres, J., Silva, L.M., Griffith, R., Kaiser, G.: Towards Self-adaptable monitoring framework for self-healing, CoreGRID TR-0150, July 3 (2008), http://www.coregrid.net/mambo/images/stories/-TechnicalReports/tr-0150.pdf
Griffith, R., Kaiser, G.: Adding self-healing capabilities to the common language runtime. Technical report, Columbia University (2005)
Using Nagios to monitor faults in a self-healing environment, by Mikko A.T. Pervilä (2007), http://www.cs.helsinki.fi/u/niklande/opetus/SemK07/-paper/pervila.pdf
Amin, M.: Toward self-healing energy infrastructure systems. Computer Applications in Power 14(1), 20–28 (2001)
Sidiroglou, S., Laadan, O., Keromytis, A.D., Nieh, J.: Using Rescue Points to Navigate Software Recovery (Short Paper). In: Proceedings of the IEEE Symposium on Security and Privacy (May 2007)
The Intelligent Software Agents Lab – Home Page, http://www.cs.cmu.edu/~softagents/intro.htm
Baresi, L., Guinea, S., Pasquale, L.: Self-healing BPEL Processes with Dynamo and the JBoss Rule Engine. In: Int. Workshop on Engineering of Software Services for Pervasive Environments: in Conjunction with the 6th ESEC/FSE Joint Meeting, Dubrovnik, Croatia, pp. 11–20 (2007)
Glassbox – How It Works, http://www.glassbox.com/glassbox/HowItWorks.html
AspectJ – Home page, http://www.eclipse.org/aspectj/
Guttman, E., Perkins, C., Veizades, J., Day, M.: RFC 2608 Service Location Protocol, Version 2 (June 1999), http://tools.ietf.org/html/rfc2608
Goland, Y.Y., Cai, T., Leach, P., Gu, Y., Albright, S.: Simple Service Discovery Protocol/1.0. (October 28, 1999), http://coherence.beebits.net/chrome/site/draft-cai-ssdp-v1-03.txt
Jini Discovery and Join Specification v3. September 4 (2006), http://www.jini.org/wiki/Jini_Discovery_and_Join_Specification
PANACEA - Towards a Self-healing Development Framework. In: 10th IFIP/IEEE International Symposium on Integrated Network Management, IM 2007, May 21, pp. 169–178 (2007), ISBN: 1-4244-0798-2
HP Open View Self-Healing Services: Overview and Technical Introduction, HP Labs (2006), http://managementsoftware.hp.com/services/selfhealing_whitepaper.pdf
Funika, W., Godowski, P., Pȩgiel, P.: A Semantic-Oriented Platform for Performance Monitoring of Distributed Java Applications. In: Bubak, M., van Albada, G.D., Dongarra, J., Sloot, P.M.A. (eds.) ICCS 2008, Part III. LNCS, vol. 5103, pp. 233–242. Springer, Heidelberg (2008)
Wuttke, J.: An approach to detecting failures automatically. In: Fourth International Workshop on Software Quality Assurance: in Conjunction With the 6th ESEC/FSE Joint Meeting, Dubrovnik, Croatia, pp. 17–24 (2007)
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2010 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Funika, W., Pȩgiel, P. (2010). A Role-Based Approach to Self-healing in Autonomous Monitoring Systems. In: Wyrzykowski, R., Dongarra, J., Karczewski, K., Wasniewski, J. (eds) Parallel Processing and Applied Mathematics. PPAM 2009. Lecture Notes in Computer Science, vol 6068. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-14403-5_14
Download citation
DOI: https://doi.org/10.1007/978-3-642-14403-5_14
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-642-14402-8
Online ISBN: 978-3-642-14403-5
eBook Packages: Computer ScienceComputer Science (R0)