Hypervisor-based fault tolerance
- 1 February 1996
- journal article
- Published by Association for Computing Machinery (ACM) in ACM Transactions on Computer Systems
- Vol. 14 (1), 80-107
- https://doi.org/10.1145/225535.225538
Abstract
Protocols to implement a fault-tolerant computing system are described. These protocols augment the hypervisor of a virtual-machine manager and coordinate a primary virtual machine with its backup. No modifications to the hardware, operating system, or application programs are required. A prototype system was constructed for HP's PA-RISC instruction-set architecture. Even though the prototype was not carefully tuned, it ran programs about a factor of 2 slower than a bare machine would.
This publication has 8 references indexed in Scilit:
- The process group approach to reliable distributed computing. Communications of the ACM, 1993
- Limits to low-latency communication on high-speed networks. ACM Transactions on Computer Systems, 1993
- Implementing fault-tolerant services using the state machine approach: a tutorial. ACM Computing Surveys, 1990
- A software instruction counter. Published by Association for Computing Machinery (ACM), 1989
- Fail-stop processors. ACM Transactions on Computer Systems, 1983
- Time, clocks, and the ordering of events in a distributed system. Communications of the ACM, 1978
- The PDP-11 virtual machine architecture. Published by Association for Computing Machinery (ACM), 1975
- A virtual machine time-sharing system. IBM Systems Journal, 1970