Structuring Distributed Systems for Recoverability and Crash Resistance
- 1 July 1981
- journal article
- Published by Institute of Electrical and Electronics Engineers (IEEE) in IEEE Transactions on Software Engineering
- Vol. SE-7 (4) , 436-447
- https://doi.org/10.1109/tse.1981.230846
Abstract
An object-oriented multilevel model of computation is used to discuss recoverability and crash resistance issues in distributed systems. Of particular importance are the issues that are raised when recoverability and crash resistance properties are desired from objects whose concrete representations are distributed over several nodes. The execution of a program at a node of the system can give rise to a hierarchy of processes executing various parts of the program at different nodes. Recoverability and crash resistance properties are needed to ensure that such a group of processes leave the system state consistent despite faults in the system.Keywords
This publication has 10 references indexed in Scilit:
- Concurrent Pascal with backward error recovery: Language features and examplesSoftware: Practice and Experience, 1979
- A Model of Recoverability in Multilevel SystemsIEEE Transactions on Software Engineering, 1978
- Reliability Issues in Computing System DesignACM Computing Surveys, 1978
- Reliable Resource Allocation Betvveen Unreliable ProcessesIEEE Transactions on Software Engineering, 1978
- Notes on data base operating systemsPublished by Springer Nature ,1978
- Data processing spheres of controlIBM Systems Journal, 1978
- The notions of consistency and predicate locks in a database systemCommunications of the ACM, 1976
- System structure for software fault toleranceIEEE Transactions on Software Engineering, 1975
- Recovery scenario for a DB/DC systemPublished by Association for Computing Machinery (ACM) ,1973
- Recovery semantics for a DB/DC systemPublished by Association for Computing Machinery (ACM) ,1973