A theory of fault-tolerant routing in wormhole networks
- 1 August 1997
- journal article
- Published by Institute of Electrical and Electronics Engineers (IEEE) in IEEE Transactions on Parallel and Distributed Systems
- Vol. 8 (8) , 790-802
- https://doi.org/10.1109/71.605766
Abstract
Fault-tolerant systems aim at providing continuous operation in the presence of faults. Multicomputers rely on an interconnection network between processors to support the message-passing mechanism. Therefore, the reliability of the interconnection network is very important for the reliability of the whole system. This paper analyzes the effective redundancy available in a wormhole network by combining connectivity and deadlock freedom. Redundancy is defined at the channel level. We propose a sufficient condition for channel redundancy, also computing the set of redundant channels. The redundancy level of the network is also defined, proposing a theorem that supplies its value. This theory is developed on top of our necessary and sufficient condition for deadlock-free adaptive routing. The new theory also considers the failure of physical channels when virtual channels are used. Finally, we propose a methodology for the design of fault-tolerant routing algorithms, showing its application to n-dimensional meshes.Keywords
This publication has 43 references indexed in Scilit:
- Free dimensions-an effective approach to achieving fault tolerance in hypercubePublished by Institute of Electrical and Electronics Engineers (IEEE) ,2003
- Fault-tolerant routing with non-adaptive wormhole algorithms in mesh networksPublished by Institute of Electrical and Electronics Engineers (IEEE) ,2002
- Fault-tolerant wormhole routing algorithms for mesh networksIEEE Transactions on Computers, 1995
- A THEORY TO INCREASE THE EFFECTIVE REDUNDANCY IN WORMHOLE NETWORKSParallel Processing Letters, 1994
- Fault-tolerant wormhole routing in toriPublished by Association for Computing Machinery (ACM) ,1994
- A fault-tolerant communication scheme for hypercube computersIEEE Transactions on Computers, 1992
- An adaptive and fault tolerant wormhole routing strategy for k-ary n-cubesIEEE Transactions on Computers, 1991
- On the Design of Deadlock-Free Adaptive Routing Algorithms for Multicomputers: Design MethodologiesPublished by Springer Nature ,1991
- Reliable broadcast in hypercube multicomputersIEEE Transactions on Computers, 1988
- Dynamically Restructurable Fault-Tolerant Processor Network ArchitecturesIEEE Transactions on Computers, 1985