Definition and analysis of hardware- and software-fault-tolerant architectures
- 1 July 1990
- journal article
- Published by Institute of Electrical and Electronics Engineers (IEEE) in Computer
- Vol. 23 (7) , 39-51
- https://doi.org/10.1109/2.56851
Abstract
A structured definition of hardware- and software-fault-tolerant architectures is presented. Software-fault-tolerance methods are discussed, resulting in definitions for soft and solid faults. A soft software fault has a negligible likelihood or recurrence and is recoverable, whereas a solid software fault is recurrent under normal operations or cannot be recovered. A set of hardware- and software-fault-tolerant architectures is presented, and three of them are analyzed and evaluated. Architectures tolerating a single fault and architectures tolerating two consecutive faults are discussed separately. A sidebar addresses the cost issues related to software fault tolerance. The approach taken throughout is as general as possible, dealing with specific classes of faults or techniques only when necessary.Keywords
This publication has 6 references indexed in Scilit:
- Hardware and software fault tolerance: a unified architectural approachPublished by Institute of Electrical and Electronics Engineers (IEEE) ,2003
- Distributed execution of recovery blocks: an approach for uniform treatment of hardware and software faults in real-time applicationsIEEE Transactions on Computers, 1989
- Design Fault TolerancePublished by Springer Nature ,1987
- The N-Version Approach to Fault-Tolerant SoftwareIEEE Transactions on Software Engineering, 1985
- Software Engineering: Problems and PerspectivesComputer, 1984
- Design of self-checking softwarePublished by Association for Computing Machinery (ACM) ,1975