Experimental evaluation of the fail-silent behavior in computers without error masking
- 17 December 2002
- conference paper
- Published by Institute of Electrical and Electronics Engineers (IEEE)
- p. 350-359
- https://doi.org/10.1109/ftcs.1994.315625
Abstract
Traditionally, fail-silent computers are implemented by using massive redundancy,(hardware or software). In this research we investigate ,if it is possible to obtain ,a high degree of fail-silent behavior, from,a computer,without hardware,or software ,replication by using ,only simple behavior based error detection techniques. It is assumed that if the errors caused by a,fault are detected in time,it will be possible to stop the erroneous computer behavior, thus preventing the violation of the fail-silent model. The evaluation technique used in this research is physical fault injection at the pin level. Results obtained by the injection ofabout,20000 different faults in two ,different target systems,have shown ,that 1) in ,a system ,without error detection up to 46% of the faults caused the violation of the fail-silent model; 2) in ,a computer ,with behavior based error detection the percentage of faults that caused the violation of the fail-silent mode,was reduced to values from 2.3% to 0.4%; 3) the results are very dependent,on the target system, on the program under execution during the fault injection and on the type of faults. Keywords: Fail-silent behavior, failure models, experimentalKeywords
This publication has 19 references indexed in Scilit:
- Efficient memory access checkingPublished by Institute of Electrical and Electronics Engineers (IEEE) ,2002
- Exploiting instruction-level resource parallelism for transparent, integrated control-flow monitoringPublished by Institute of Electrical and Electronics Engineers (IEEE) ,2002
- Experimental evaluation of a set of simple error detection mechanismsMicroprocessing and Microprogramming, 1990
- Continuous signature monitoring: low-cost concurrent detection of processor control errorsIEEE Transactions on Computer-Aided Design of Integrated Circuits and Systems, 1990
- Fault injection for dependability validation: a methodology and some applicationsIEEE Transactions on Software Engineering, 1990
- The Delta-4 approach to dependability in open distributed computing systemsPublished by Institute of Electrical and Electronics Engineers (IEEE) ,1988
- Concurrent error detection using watchdog processors-a surveyIEEE Transactions on Computers, 1988
- Fault Tolerance in Tandem Computer SystemsPublished by Springer Nature ,1987
- Executable assertions and flight softwarePublished by American Institute of Aeronautics and Astronautics (AIAA) ,1984
- Byzantine generals in actionACM Transactions on Computer Systems, 1984