Proactive network fault detection
- 22 November 2002
- conference paper
- Published by Institute of Electrical and Electronics Engineers (IEEE)
- Vol. 3 (0743166X) , 1147-1155
- https://doi.org/10.1109/infcom.1997.631137
Abstract
The increasing role of communication networks in today's society results in a demand for higher levels of network availability and reliability. At the same time, fault management is becoming more difficult due to the dynamic nature and heterogeneity of networks. We propose an intelligent monitoring system using adaptive statistical techniques. The system continually learns the normal behavior of the network and detects deviations from the norm. Within the monitoring system, the measurements are segmented, and features extracted from the segments are used to describe the normal behavior of the measurement variables. This information is combined in the structure of a Bayesian network. The proposed system is thereby able to detect unknown or unseen faults. Experimental results on real network data demonstrate that the proposed system can detect abnormal behavior before a fault actually occurs.Keywords
This publication has 11 references indexed in Scilit:
- Decision-theoretic troubleshootingCommunications of the ACM, 1995
- Markov monitoring with unknown statesIEEE Journal on Selected Areas in Communications, 1994
- Fault management tools for a cooperative and decentralized network operations environmentIEEE Journal on Selected Areas in Communications, 1994
- A probabilistic approach to fault diagnosis in linear lightwave networksIEEE Journal on Selected Areas in Communications, 1993
- Alarm correlationIEEE Network, 1993
- Bayesian Analysis in Expert SystemsStatistical Science, 1993
- Models and algorithms for network fault detection and identification: a reviewPublished by Institute of Electrical and Electronics Engineers (IEEE) ,1992
- Managing communication networks by monitoring databasesIEEE Transactions on Software Engineering, 1991
- A case study of Ethernet anomalies in a distributed computing environmentIEEE Transactions on Reliability, 1990
- Adaptive sequential segmentation of piecewise stationary time seriesInformation Sciences, 1983