Coping with network failures
- 1 June 2004
- journal article
- Published by Association for Computing Machinery (ACM) in ACM SIGMETRICS Performance Evaluation Review
- Vol. 32 (1), 270-281
- https://doi.org/10.1145/1012888.1005719
Abstract
Link and node failures in IP networks pose a challenge for network control algorithms. Routing restoration, which computes new routes that avoid failed links, involves fundamental tradeoffs between efficient use of network resources, complexity of the restoration strategy and disruption to network traffic. In order to achieve a balance between these goals, obtaining routings that provide good performance guarantees under failures is desirable.

In this paper, building on previous work that provided performance guarantees under uncertain (and potentially unknown) traffic demands, we develop algorithms for computing optimal restoration paths and a methodology for evaluating the performance guarantees of routing under failures. We then study the performance of route restoration on a diverse collection of ISP networks. Our evaluation uses a competitive-analysis-type framework, where the performance of routing with restoration paths under failures is compared to the best possible performance on the failed network. We conclude that with careful selection of restoration paths one can obtain restoration strategies that retain nearly optimal performance on the failed network while minimizing disruptions to traffic flows that did not traverse the failed parts of the network.
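The competitive-analysis comparison described in the abstract can be illustrated with a minimal sketch. It assumes that performance is measured as maximum link utilization, which the abstract does not state. The topology, capacities, demand, and the names `max_utilization` and `competitive_ratio` are hypothetical and are not taken from the paper. In this symmetric toy case the optimal routing on the failed network is written down by hand; in general it would have to be computed, for example with a linear program.

```python
from typing import Dict, List, Tuple

Link = Tuple[str, str]
Pair = Tuple[str, str]
# A routing maps each origin-destination pair to (path, fraction) entries.
Routing = Dict[Pair, List[Tuple[List[Link], float]]]


def max_utilization(routing: Routing,
                    demands: Dict[Pair, float],
                    capacity: Dict[Link, float]) -> float:
    """Return the highest load/capacity ratio over all surviving links."""
    load = {link: 0.0 for link in capacity}
    for pair, paths in routing.items():
        for path, fraction in paths:
            for link in path:
                load[link] += demands[pair] * fraction
    return max(load[link] / capacity[link] for link in capacity)


def competitive_ratio(restored_util: float, optimal_util: float) -> float:
    """Ratio of the restoration routing's utilization to the best possible."""
    return restored_util / optimal_util


# Hypothetical toy network after the direct link A-D has failed:
# two disjoint surviving paths A-B-D and A-C-D, each link with capacity 10.
capacity = {("A", "B"): 10.0, ("B", "D"): 10.0,
            ("A", "C"): 10.0, ("C", "D"): 10.0}
demands = {("A", "D"): 10.0}

# Restoration strategy under test: move all A->D traffic onto A-B-D.
restoration = {("A", "D"): [([("A", "B"), ("B", "D")], 1.0)]}

# Best routing on the failed network (by symmetry): split the demand evenly.
optimal = {("A", "D"): [([("A", "B"), ("B", "D")], 0.5),
                        ([("A", "C"), ("C", "D")], 0.5)]}

restored_util = max_utilization(restoration, demands, capacity)
optimal_util = max_utilization(optimal, demands, capacity)
ratio = competitive_ratio(restored_util, optimal_util)
print(restored_util, optimal_util, ratio)
```

A ratio close to 1 would indicate that the restoration routing performs nearly as well as the best routing available on the failed network, which is the sense of "nearly optimal" used in the abstract.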