A Novel Fault Diagnosis Approach to Path-Protected WDM Mesh Networks

  • Chi-Shih Chao
Conference paper
Part of the Lecture Notes in Computer Science book series (LNCS, volume 5297)


In today’s cyber-world, network faults can not only cause unexpected damages and consternation to enterprises, but also result in huge loss of customers and a lot of negative effects. For this reason, ISPs (Internet Service Providers) do their best to achieve the goal of higher network reliability, service availability, and accessible bandwidth. To do so, most of them construct or rent carrier-grade backbone networks, which use WDM (Wavelength Division Multiplexing) with path restoration/protection techniques, as one of the means toward the goal. However, network faults can still impact largely to these wellprotected WDM networks since the actual network reliability would fall down as a fault occurs, even if the network could function as normal with protection mechanisms. In addition, the protection/restoration mechanisms would usually get fault diagnosis into trouble because they can cover faults in most cases. In this paper, a novel and competent fault diagnosis approach which can be used in path-protected WDM mesh networks for a high and constant network availability is proposed. This diagnosis approach uses the trimmed/extracted major alarm propagation behavior of a fault and our leader major alarm domain concept to deduce and isolate the most suspicious network area of the fault. Additionally, to enhance the diagnosis accuracy, our approach also refers to some certain information with respect to SRLG (Shared Risk Link Group) and dynamic network sessions of the diagnosed network to further narrow down the suspicious area. Last, simulation results associated with discussions are shown to demonstrate the performance of our diagnosis approach.


Network fault management WDM mesh networks SRLG trimmed major alarm propagation pattern leader major alarm domain 


Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.


  1. 1.
    Perros, H.G.: Connection-Oriented Networks: SONET/SDH, ATM, MPLS, and Optical Networks, 1st edn. Wiley, Chichester (2005)CrossRefGoogle Scholar
  2. 2.
    Zhou, D., Subramaniam, S.: Survivability in Optical Networks. IEEE Network, 16–23 (November/December 2000)Google Scholar
  3. 3.
    Ramamuthy, S., Mukherjee, B.: Survivable WDM Mesh Networks: Part I – Protection. In: Proc. IEEE INFOCOM 1999, New York, pp. 744–751 (March 1999)Google Scholar
  4. 4.
    Ho, P.H., Mouftah, H.T.: Shared Protection in Mesh WDM Networks. IEEE Communications 42(1), 70–76 (2004)CrossRefGoogle Scholar
  5. 5.
    Mas, C., Thiran, P.: An Efficient Algorithm for Locating Soft and Hard Failures in WDM Networks. IEEE Journal on Selected Areas in Communications 18(10), 1900–1911 (2000)CrossRefGoogle Scholar
  6. 6.
    Mas, C., Tomkos, I., Tonguz, O.K.: Failure Location Algorithm for Transparent Optical Networks. IEEE Journal on Selected Areas in Communications 23(8) (August 2005)Google Scholar
  7. 7.
    Sahasrabuddhe, L., Ramamurthy, S., Mukherjee, B.: Fault Management in IP-over-WDM Networks: WDM Protection vs. IP Restoration. IEEE Journal on Selected Areas in Communications 20(1), 21–33 (2002)CrossRefGoogle Scholar
  8. 8.
    Ho, P.H., Tapolcai, J., Mouftah, H.T.: On Achieving Optimal Survivable Routing for Shared Protection in Survivable Next-Generation Internet. IEEE Trans. on Reliability 53(2) (June 2004)Google Scholar
  9. 9.
    Zhu, K., Mukherjee, B.: On-Line Approaches for Provisioning Connections of Different Bandwidth Granularities in WDM Mesh Networks. IEEE OFC, 549–551 (2002)Google Scholar
  10. 10.
    Brodie, M., Rish, I., Ma, S.: Intelligent Probing: A Cost-Effective Approach to Fault Diagnosis in Computer Networks. IBM System Journal 41(3), 372–385 (2002)CrossRefGoogle Scholar
  11. 11.
    Zhang, J., Mukherjee, B.: A Review of Fault Management in WDM Mesh Networks: Basic Concepts and Research Challenges. IEEE Networks, 41–48 (March/April 2004)Google Scholar
  12. 12.
    Link Management Protocol – IETF Draft (draft-ietf-ccamp-lmp-03.txt) (2007) Google Scholar
  13. 13.
    Steinder, M., Sethi, A.S.: A Survey of Fault Localization Techniques in Computer Networks. Science of Computer Programming 53(2), 165–194 (2004)MathSciNetCrossRefzbMATHGoogle Scholar
  14. 14.
    Chao, C.S., Yang, D.L., Liu, A.C.: A Time-Aware Fault Diagnosis System in LAN. In: Proceedings of the 2001 IFIP/IEEE International Symposium on Integrated Network Management (IM 2001), Seattle, USA, May 14-18, pp. 499–512 (2001)Google Scholar
  15. 15.
    Katzela, I., Schwartz, M.: Schemes for Fault Identification in Communication Networks. IEEE/ACM Transactions on Networking 3(6), 753–764 (1995)CrossRefGoogle Scholar
  16. 16.
    Suurballe, J.W., Tarjan, R.E.: A Quick Method for Finding Shortest Pairs of Disjoint Paths. Networks 14, 325–336 (1984)MathSciNetCrossRefzbMATHGoogle Scholar
  17. 17.
    Zuckerman, D.N.: Keynote Speech: Optical Control Plane – Management Included. In: The 10th Asia-Pacific Network Operations and Management Symposium (APNOMS 2007), Sapporo, Japan (November 2007)Google Scholar

Copyright information

© Springer-Verlag Berlin Heidelberg 2008

Authors and Affiliations

  • Chi-Shih Chao
    • 1
  1. 1.Department of Communications EngineeringFeng Chia UniversityTaiwan, ROC

Personalised recommendations