The Replica Management System: a scheme for flexible and dynamic replication
- 17 December 2002
- conference paper
- Published by Institute of Electrical and Electronics Engineers (IEEE)
Abstract
The actual gains achieved by replication are a complex function of the number of replicas, the placement of those replicas, the replication protocol, the nature of the transactions performed on the replicas, and the availability and performance characteristics of the machines and networks composing the system. This paper describes the design and implementation of the Replica Management System, which allows a programmer to specify the quality of service required for replica groups in terms of availability and performance. From the quality of service specification, information about the replication protocol to be used, and data about the characteristics of the underlying distributed system, the RMS computes an initial placement and replication level. As machines and communications systems are detected to have failed or recovered, or performance characteristics change, the RMS can be re-invoked to compute an updated mapping of replicas which preserves the desired quality of service. The result is a flexible, dynamic and dependable replication system.
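To make the idea of deriving a placement and replication level from an availability requirement concrete, the following is a minimal sketch, not the RMS algorithm itself, which the abstract does not specify. It assumes independent machine failures and a protocol in which the group is usable whenever at least one replica is up; the function names `group_availability` and `place_replicas`, the machine names, and the availability figures are all hypothetical.

```python
from math import prod


def group_availability(avails):
    """Probability that at least one replica is up.

    Assumes machine failures are independent (an illustrative assumption).
    """
    return 1.0 - prod(1.0 - a for a in avails)


def place_replicas(machines, target):
    """Greedily pick the most available machines until the target is met.

    machines: dict mapping a (hypothetical) machine name to its availability
    target:   required group availability, between 0 and 1
    Returns the list of chosen machines, or None if the target is unreachable.
    """
    chosen = []
    ranked = sorted(machines.items(), key=lambda kv: kv[1], reverse=True)
    for name, _avail in ranked:
        chosen.append(name)
        if group_availability([machines[n] for n in chosen]) >= target:
            return chosen
    return None


# Initial placement for a 99% availability requirement.
machines = {"a": 0.90, "b": 0.80, "c": 0.95}
initial = place_replicas(machines, 0.99)

# Machine "c" is detected as failed: re-invoke with the surviving machines.
survivors = {k: v for k, v in machines.items() if k != "c"}
updated = place_replicas(survivors, 0.97)
```

Under these assumptions the initial call selects two machines, and the re-invocation after the failure computes a new mapping from the survivors, mirroring the recompute-on-change behaviour the abstract describes. A real system would also have to weigh performance characteristics and the chosen replication protocol, which this sketch ignores.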