A Unified Reliability Model for Fault-Tolerant Computers
- 1 November 1980
- journal article
- Published by Institute of Electrical and Electronics Engineers (IEEE) in IEEE Transactions on Computers
- Vol. C-29 (11), 1002-1011
- https://doi.org/10.1109/tc.1980.1675495
Abstract
The diversified nature of fault-tolerant computers led to the development of a multiplicity of reliability models which are seemingly unrelated to each other. As a result, it becomes difficult to develop automated tools for reliability analysis which are both general and efficient. Thus, the potential of reliability modeling as a practical and useful tool in the design process of fault-tolerant computers has not been fully realized. This paper summarizes the results of an extended effort to develop a unified approach to reliability modeling of fault-tolerant computers which strikes a good compromise between generality and practicality. The unified model developed encompasses repairable and nonrepairable systems and models, transient as well as permanent faults, and their recovery. Based on the unified model, a powerful and efficient reliability estimation program ARIES has been developed.
This publication has 12 references indexed in Scilit:
- Reliability Models of NMR Systems. IEEE Transactions on Reliability, 1975
- A Reliability Model for Gracefully Degrading and Standby-Sparing Systems. IEEE Transactions on Computers, 1975
- Reliability of Some Redundant Systems with Repair. IEEE Transactions on Reliability, 1973
- A Unified Method for Analyzing Mission Reliability for Fault Tolerant Computer Systems. IEEE Transactions on Reliability, 1973
- A Reliability and Comparative Analysis of Two Standby System Configurations. IEEE Transactions on Reliability, 1973
- The Concept of Coverage and Its Effect on the Reliability Model of a Repairable System. IEEE Transactions on Computers, 1973
- Reliability Modeling for Fault-Tolerant Computers. IEEE Transactions on Computers, 1971
- Some relationships between failure detection probability and computer system reliability. Published by Association for Computing Machinery (ACM), 1967
- Design of a Repairable Redundant Computer. IEEE Transactions on Electronic Computers, 1962
- Upper Bounds on Mean Life of Self-Repairing Systems. IRE Transactions on Reliability and Quality Control, 1962