Execution replay of multiprocessor virtual machines
Top Cited Papers
- 5 March 2008
- proceedings article
- Published by Association for Computing Machinery (ACM)
- p. 121-130
- https://doi.org/10.1145/1346256.1346273
Abstract
Execution replay of virtual machines is a technique which has many important applications, including debugging, fault-tolerance, and security. Execution replay for single processor virtual machines is well-understood, and available commercially. With the advancement of multi-core architectures, however, multiprocessor virtual machines are becoming more important. Our system, SMP-ReVirt, is the first system to log and replay a multiprocessor virtual machine on commodity hardware. We use hardware page protection to detect and accurately replay sharing between virtual cpus of a multi-cpu virtual machine, allowing us to replay the entire operating system and all applications. We have tested our system on a variety of workloads, and find that although sharing under SMP-ReVirt is expensive, for many workloads and applications, including debugging, the overhead is acceptable.Keywords
This publication has 13 references indexed in Scilit:
- BugNet: Continuously Recording Program Execution for Deterministic Replay DebuggingPublished by Institute of Electrical and Electronics Engineers (IEEE) ,2005
- A fault-tolerant java virtual machinePublished by Institute of Electrical and Electronics Engineers (IEEE) ,2004
- Xen and the art of virtualizationPublished by Association for Computing Machinery (ACM) ,2003
- A "flight data recorder" for enabling full-system multiprocessor deterministic replayPublished by Association for Computing Machinery (ACM) ,2003
- A survey of rollback-recovery protocols in message-passing systemsACM Computing Surveys, 2002
- Deterministic replay of Java multithreaded applicationsPublished by Association for Computing Machinery (ACM) ,1998
- Hypervisor-based fault toleranceACM Transactions on Computer Systems, 1996
- The SPLASH-2 programsPublished by Association for Computing Machinery (ACM) ,1995
- Hypervisor-based fault tolerancePublished by Association for Computing Machinery (ACM) ,1995
- Hardware-assisted replay of multiprocessor programsPublished by Association for Computing Machinery (ACM) ,1991