Reducing Network Downtime by Intelligent Fault Analysis

  • B. Bipin
Conference paper
Part of the Lecture Notes in Networks and Systems book series (LNNS, volume 98)

Abstract

A fault management system remains a critical component of any network management system in the telecom industry. The ultimate goal of any telecom service provider is to offer its customers a highly available service. Every telecom operator therefore runs a module, known as the alarm system or fault management system, to handle faults in the network. These faults range from hardware faults and security breaches to software errors, or anything else that threatens the high availability of the service. With the number of customers increasing daily, the telecom network behaves like an evolving graph, and its complexity grows as nodes are added.

Software faults aside, the number of hardware faults is increasing at an alarming rate. This is because most operators build their hardware infrastructure in a hybrid fashion, i.e., with different hardware from different providers. Many of these hardware faults are of critical severity and can lead to denial of service for customers. For the service provider, a single hardware fault can impact connected nodes in the network and may bring down a large part of it. This paper presents a solution from the service provider’s point of view: when the fault management system reports multiple faults, it should prioritize them in the order in which they need to be resolved so that the loss incurred is minimized.
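
The prioritization idea can be illustrated with a minimal sketch. The abstract does not describe the paper's actual scoring method, so everything here is an assumption made for illustration: the topology is a hypothetical dependency graph mapping each node to the nodes that depend on it, each fault carries a hypothetical numeric severity weight, and a fault's priority score is that weight multiplied by the number of nodes it could affect.

```python
from collections import deque


def impacted_nodes(dependents, start):
    """Return every node reachable from `start` (itself included) via BFS."""
    seen = {start}
    queue = deque([start])
    while queue:
        node = queue.popleft()
        for nxt in dependents.get(node, ()):
            if nxt not in seen:
                seen.add(nxt)
                queue.append(nxt)
    return seen


def prioritize_faults(dependents, faults):
    """Order faulty nodes by (severity weight x impacted node count), highest first.

    `faults` is a list of (node, severity_weight) pairs. The scoring rule is an
    illustrative assumption, not the method from the paper.
    """
    scored = []
    for node, weight in faults:
        score = weight * len(impacted_nodes(dependents, node))
        scored.append((score, node))
    # Highest score first; ties are broken by node name for determinism.
    scored.sort(key=lambda item: (-item[0], item[1]))
    return [node for _, node in scored]


# Hypothetical topology: each key lists the nodes that depend on it.
dependents = {
    "core-router": ["agg-switch-1", "agg-switch-2"],
    "agg-switch-1": ["cell-site-a", "cell-site-b"],
    "agg-switch-2": ["cell-site-c"],
}
# Hypothetical fault reports: (node, severity weight).
faults = [("cell-site-a", 3), ("agg-switch-1", 2), ("core-router", 2)]

order = prioritize_faults(dependents, faults)
print(order)
```

In this sketch a fault on a node near the root of the dependency graph outranks a more severe fault on a leaf, because it can take down more of the network. A real deployment would likely replace the node count with a loss estimate such as affected subscribers or revenue.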

Keywords

Networks, Telecom, Topology, Fault management, Dependency graphs, Network graphs

Copyright information

© Springer Nature Switzerland AG 2020

Authors and Affiliations

  1. Nokia Networks, Bangalore, India
