Automatic alarm correlation for fault identification
- 19 November 2002
- conference paper
- Published by Institute of Electrical and Electronics Engineers (IEEE)
- Vol. 2, pp. 553-561
- https://doi.org/10.1109/infcom.1995.515921
Abstract
In communication networks, a large number of alarms exist to signal any abnormal behavior of the network. As network faults typically result in a number of alarms, correlating these different alarms and identifying their source is a major problem in fault management. The alarm correlation problem is of major practical significance. Alarms that have not been correlated may not only lead to significant misdirected efforts, based on insufficient information, but may also cause multiple (possibly contradictory) corrective actions, as each alarm is handled independently. The paper proposes a general framework to solve the alarm correlation problem. The authors introduce a new model for faults and alarms based on probabilistic finite state machines. They propose two algorithms. The first one acquires the fault models starting from possibly incomplete and incorrect data. The second one correlates alarms in the presence of multiple faults and noisy information. Both algorithms have polynomial time complexity, use an extension of the Viterbi algorithm to deal with the corrupted data, and can be implemented in hardware. As an example, they are applied to analyse faults using data generated by the ANS (Advanced Network and Services, Inc.)/NSF T3 network.
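For orientation, the abstract's reference to the Viterbi algorithm can be illustrated with a minimal sketch of the standard (unextended) algorithm, which recovers the most likely hidden state sequence from a sequence of observations. This is not the authors' extension for corrupted data, and the two-state model, its state and alarm names, and all probabilities below are hypothetical values chosen only for illustration.

```python
import math


def viterbi(observations, states, start_p, trans_p, emit_p):
    """Return (log_prob, path) for the most likely hidden state sequence.

    Standard Viterbi decoding in log space. Assumes every probability
    used is strictly positive, so math.log never receives zero.
    """
    # best[s] = (log-probability of the best path ending in s, that path)
    best = {
        s: (math.log(start_p[s]) + math.log(emit_p[s][observations[0]]), [s])
        for s in states
    }
    for obs in observations[1:]:
        nxt = {}
        for s in states:
            # Pick the predecessor state that maximizes the path score into s.
            score, path = max(
                ((best[p][0] + math.log(trans_p[p][s]), best[p][1]) for p in states),
                key=lambda t: t[0],
            )
            nxt[s] = (score + math.log(emit_p[s][obs]), path + [s])
        best = nxt
    return max(best.values(), key=lambda t: t[0])


# Hypothetical toy model: a network element is either "normal" or in "fault",
# and at each time step we observe either "quiet" or "alarm".
states = ["normal", "fault"]
start_p = {"normal": 0.9, "fault": 0.1}
trans_p = {
    "normal": {"normal": 0.9, "fault": 0.1},
    "fault": {"normal": 0.2, "fault": 0.8},
}
emit_p = {
    "normal": {"quiet": 0.9, "alarm": 0.1},
    "fault": {"quiet": 0.3, "alarm": 0.7},
}

log_prob, path = viterbi(["quiet", "alarm", "alarm", "quiet"],
                         states, start_p, trans_p, emit_p)
print(path)
```

In this toy model the decoder attributes the two consecutive alarms to a fault that begins at the second step, and it treats the final quiet observation as a missed alarm rather than a recovery, because the assumed fault state is persistent.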
This publication has 12 references indexed in Scilit:
- Inference of a probabilistic finite state machine from its output. IEEE Transactions on Systems, Man, and Cybernetics, 1995
- Alarm correlation and fault identification in communication networks. IEEE Transactions on Communications, 1994
- Correcting dependent errors in sequences generated by finite-state processes. IEEE Transactions on Information Theory, 1993
- Fault identification using a finite state machine model with unreliable partially observed data sequences. IEEE Transactions on Communications, 1993
- A Probabilistic Causal Model for Diagnostic Problem Solving Part II: Diagnostic Strategy. IEEE Transactions on Systems, Man, and Cybernetics, 1987
- On Communicating Finite-State Machines. Journal of the ACM, 1983
- Formal Methods in Communication Protocol Design. IEEE Transactions on Communications, 1980
- Protocol Representation with Finite-State Models. IEEE Transactions on Communications, 1980
- The Viterbi algorithm. Proceedings of the IEEE, 1973
- A Maximization Technique Occurring in the Statistical Analysis of Probabilistic Functions of Markov Chains. The Annals of Mathematical Statistics, 1970