A Rigorous Approach to Fault-Tolerant Programming

1 January 1985

journal article
Published by Institute of Electrical and Electronics Engineers (IEEE) in IEEE Transactions on Software Engineering

Vol. SE-11 (1) , 23-31
https://doi.org/10.1109/tse.1985.231534

Abstract

The design of programs that are tolerant of hardware fault occurrences and processor crashes is investigated. Using a stable storage management system as a running example, a new approach is suggested for specifying, understanding, and verifying the correctness of fault-tolerant software. The approach extends previously developed axiomatic reasoning methods to the design of fault-tolerant systems by modeling faults as being operations that are performed at random time intervals on any computing system by the system's adverse environment.

Keywords

FAULT TOLERANCE
HARDWARE
LOGIC PROGRAMMING
STOCHASTIC PROCESSES
COMPUTER CRASHES
AVAILABILITY
STOCHASTIC SYSTEMS
DESIGN METHODOLOGY
FAULT TOLERANT SYSTEMS
SOFTWARE SYSTEMS