DEPEND: a simulation-based environment for system level dependability analysis
- 1 January 1997
- journal article
- Published by Institute of Electrical and Electronics Engineers (IEEE) in IEEE Transactions on Computers
- Vol. 46 (1) , 60-74
- https://doi.org/10.1109/12.559803
Abstract
The paper presents the rationale for a functional simulation tool, called DEPEND, which provides an integrated design and fault injection environment for system level dependability analysis. The paper discusses the issues and problems of developing such a tool, and describes how DEPEND tackles them. Techniques developed to simulate realistic fault scenarios, reduce simulation time explosion, and handle the large fault model and component domain associated with system level analysis are presented. Examples are used to motivate and illustrate the benefits of this tool. To further illustrate its capabilities, DEPEND is used to simulate the Unix-based Tandem triple-modular-redundancy (TMR) based prototype fault-tolerant system and evaluate how well it handles near-coincident errors caused by correlated and latent faults. Issues such as memory scrubbing, re-integration policies, and workload dependent repair times, which affect how the system handles near-coincident errors, are also evaluated. Unlike any other simulation-based dependability studies, the accuracy of the simulation model is validated by comparing the results of the simulations with measurements obtained from fault injection experiments conducted on a production Tandem machine.Keywords
This publication has 34 references indexed in Scilit:
- Simulation of software behavior under hardware faultsPublished by Institute of Electrical and Electronics Engineers (IEEE) ,2002
- Fast simulation of dependability models with general failure, repair and maintenance processesPublished by Institute of Electrical and Electronics Engineers (IEEE) ,2002
- DOCTOR: an integrated software fault injection environment for distributed real-time systemsPublished by Institute of Electrical and Electronics Engineers (IEEE) ,2002
- DEPEND: a simulation-based environment for system level dependability analysisIEEE Transactions on Computers, 1997
- The UltraSAN modeling environmentPerformance Evaluation, 1995
- Fault injection for dependability validation: a methodology and some applicationsIEEE Transactions on Software Engineering, 1990
- Monte Carlo simulation of complex system mission reliabilityPublished by Association for Computing Machinery (ACM) ,1989
- Influence of workload on error recovery in random access memoriesIEEE Transactions on Computers, 1988
- Analysis of Typical Fault-Tolerant Architectures using HARPIEEE Transactions on Reliability, 1987
- Reliability Modeling Using SHARPEIEEE Transactions on Reliability, 1987