DOCTOR: an integrated software fault injection environment for distributed real-time systems
- 19 November 2002
- conference paper
- Published by Institute of Electrical and Electronics Engineers (IEEE)
- p. 204-213
- https://doi.org/10.1109/ipds.1995.395831
Abstract
The paper presents an integrated software fault injection environment (DOCTOR) which is capable of (1) generating synthetic workloads under which system dependability is evaluated, (2) injecting various types of faults with different options, and (3) collecting performance and dependability data. A comprehensive graphical user interface is also provided. The software implemented fault-injection tools supports three types of faults: memory faults, CPU faults, and communication faults. Each injected fault may be permanent, transient or intermittent. A fault-injection plan can be formulated probabilistically, or based on the past event history. The modular organization of tools is particularly designed for distributed architectures. DOCTOR is implemented on a distributed real-time system called HARTS, and it capability has been tested through numerous experiments.Keywords
This publication has 12 references indexed in Scilit:
- Probabilistic diagnosis of multiprocessor systems with arbitrary connectivityPublished by Institute of Electrical and Electronics Engineers (IEEE) ,2003
- Fault injection for dependability validation of fault-tolerant computing systemsPublished by Institute of Electrical and Electronics Engineers (IEEE) ,2003
- FINE: A fault injection and monitoring environment for tracing the UNIX system behavior under faultsIEEE Transactions on Software Engineering, 1993
- A distributed real-time operating systemIEEE Software, 1992
- FERRARI: a tool for the validation of system dependability propertiesPublished by Institute of Electrical and Electronics Engineers (IEEE) ,1992
- The x-Kernel: an architecture for implementing network protocolsIEEE Transactions on Software Engineering, 1991
- Fault injection experiments using FIATIEEE Transactions on Computers, 1990
- Misconceptions about real-time computing: a serious problem for next-generation systemsComputer, 1988
- Characterization of Fault Recovery through Fault Injection on FTMPIEEE Transactions on Reliability, 1987
- Measurement and Application of Fault LatencyIEEE Transactions on Computers, 1986