RMP: fault-tolerant group communication
- 1 April 1996
- journal article
- Published by Institute of Electrical and Electronics Engineers (IEEE) in IEEE Micro
- Vol. 16 (2) , 59-67
- https://doi.org/10.1109/40.491463
Abstract
Fault-tolerant group communication is highly desirable for maintaining a consistent state in distributed systems. Many existing protocols are quite expensive and of limited benefit for distributed systems in terms of efficiency. This article presents a novel group communication protocol including dynamic assessment of membership. It achieves fault tolerant group communication based on a logical token ring approach. The protocol ensures total ordering of messages and atomicity of delivery in the presence of communication faults and site faults, guaranteeing that all operational members belonging to the same group observe a consistent view of ordered events. The dynamic membership and failure recovery algorithms can handle site faults and restart, group partitions and merges, dynamic members join and leave. The efficiency of the protocol is shown with some performance figures.Keywords
This publication has 15 references indexed in Scilit:
- A token-based protocol for reliable, ordered multicast communicationPublished by Institute of Electrical and Electronics Engineers (IEEE) ,2003
- Fast message ordering and membership using a logical token-passing ringPublished by Institute of Electrical and Electronics Engineers (IEEE) ,2002
- An efficient and reliable group multicast protocolPublished by Institute of Electrical and Electronics Engineers (IEEE) ,2002
- Ordered and reliable multicast communicationACM Transactions on Computer Systems, 1991
- Lightweight causal and atomic group multicastACM Transactions on Computer Systems, 1991
- Group communication in the Amoeba distributed operating systemPublished by Institute of Electrical and Electronics Engineers (IEEE) ,1991
- AMp: a highly parallel atomic multicast protocolPublished by Association for Computing Machinery (ACM) ,1989
- Impossibility of distributed consensus with one faulty processJournal of the ACM, 1985
- Reliable broadcast protocolsACM Transactions on Computer Systems, 1984
- Time, clocks, and the ordering of events in a distributed systemCommunications of the ACM, 1978