Performance engineering of the Totem group communication system
- 1 June 1998
- journal article
- Published by IOP Publishing in Distributed Systems Engineering
- Vol. 5 (2), 78-87
- https://doi.org/10.1088/0967-1846/5/2/003
Abstract
Group communication systems simplify the development of fault-tolerant distributed applications; however, concerns exist about the performance and overheads associated with such systems. This paper presents measurements of the performance of the Totem group communication system, operating on a single local-area network (LAN) and on multiple LANs interconnected by gateways. The Totem system runs in user space with standard commercial off-the-shelf (COTS) software and hardware. For 1 kbyte messages, a throughput of over 5000 messages per second has been measured for Totem on a single LAN, and an aggregate throughput of 10 000 messages per second has been measured for Totem on three LANs. At 2000 messages per second, the message latency is less than 3 ms. The paper also discusses some of what has been learned in engineering a group communication system for high performance.