Bi-level reconfigurations of fault tolerant arrays
- 1 February 1992
- journal article
- Published by Institute of Electrical and Electronics Engineers (IEEE) in IEEE Transactions on Computers
- Vol. 41 (2) , 231-239
- https://doi.org/10.1109/12.123400
Abstract
Two types of algorithms are considered, namely, local algorithms and global algorithms. In a local algorithm, no processors need to know the status of all other processors in the system. The recovery process is distributed among the processors with each processor using extremely local knowledge. With these properties, the reconfiguration algorithm may achieve fast recovery and real time response but many sacrifice the optimal use of redundancy. In contrast, the goal of a global algorithm is to optimize the use of redundancy with respect to some fault tolerance criteria. This, however, requires global knowledge about other processors in the system and often necessitates extensive changes in the configuration of the system. For unmaintained, long-life systems, local fault tolerance algorithms have the advantages of fast recovery, while global fault tolerance algorithms provide better reliability and longer life expectancy. Fortunately, under certain conditions, it is possible to combine the advantages of the two types of algorithms. These conditions are described.Keywords
This publication has 12 references indexed in Scilit:
- Interstitial redundancy: an area efficient fault tolerance scheme for large area VLSI processor arraysIEEE Transactions on Computers, 1988
- Distributed Fault-Tolerance of Tree StructuresIEEE Transactions on Computers, 1987
- Reconfigurable Tree Architectures Using Subtree Oriented Fault ToleranceIEEE Transactions on Computers, 1987
- A Fault-Tolerant Modular Architecture for Binary TreesIEEE Transactions on Computers, 1986
- Fault Tolerance Techniques for Array Structures Used in SupercomputingComputer, 1986
- Fault-Tolerant Computing—Concepts and ExamplesIEEE Transactions on Computers, 1984
- Algorithm-Based Fault Tolerance for Matrix OperationsIEEE Transactions on Computers, 1984
- The Diogenes Approach to Testable Fault-Tolerant Arrays of ProcessorsIEEE Transactions on Computers, 1983
- FTMP—A highly reliable fault-tolerant multiprocess for aircraftProceedings of the IEEE, 1978
- Reliability Modeling for Fault-Tolerant ComputersIEEE Transactions on Computers, 1971