A Framework for Software Fault Tolerance in Real-Time Systems
- 1 May 1983
- journal article
- Published by Institute of Electrical and Electronics Engineers (IEEE) in IEEE Transactions on Software Engineering
- Vol. SE-9 (3) , 355-364
- https://doi.org/10.1109/tse.1983.237017
Abstract
Real-time systems often have very high reliability requirements and are therefore prime candidates for the inclusion of fault tolerance techniques. In order to provide tolerance to software faults, some form of state restoration is usually advocated as a means of recovery. State restoration can be expensive and the cost is exacerbated for systems which utilize concurrent processes. The concurrency present in most real-time systems and the further difficulties introduced by timing constraints suggest that providing tolerance for software faults may be inordinately expensive or complex. We believe that this need not be the case, and propose a straightforward pragmatic approach to software fault tolerance'which is believed to be applicable to many real-time systems. The approach takes advantage of the structure of real-time systems to simplify error recovery, and a classification scheme for errors is introduced. Responses to each type of error are proposed which allow service to be maintained.Keywords
This publication has 12 references indexed in Scilit:
- A program structure for error detection and recoveryPublished by Springer Nature ,2005
- A Recovery Cache for the PDP-11IEEE Transactions on Computers, 1980
- Reliability Issues in Computing System DesignACM Computing Surveys, 1978
- A reconsideration of the recovery block schemeThe Computer Journal, 1978
- Process backup in producer-consumer systemsPublished by Association for Computing Machinery (ACM) ,1977
- Software reliabilityPublished by Association for Computing Machinery (ACM) ,1977
- Fault-Tolerant Software for Real-Time ApplicationsACM Computing Surveys, 1976
- Fault-Tolerant SystemsIEEE Transactions on Computers, 1976
- Exception handlingCommunications of the ACM, 1975
- System structure for software fault toleranceIEEE Transactions on Software Engineering, 1975