On Quiescent Reliable Communication
- 1 January 2000
- journal article
- Published by Society for Industrial & Applied Mathematics (SIAM) in SIAM Journal on Computing
- Vol. 29 (6) , 2040-2073
- https://doi.org/10.1137/s0097539798341296
Abstract
We study the problem of achieving reliable communication with quiescent algorithms (i.e., algorithms that eventually stop sending messages) in asynchronous systems with process crashes and lossy links. We first show that it is impossible to solve this problem in asynchronous systems (with no failure detectors). We then show that, among failure detectors that output lists of suspects, the weakest one that can be used to solve this problem is $\diamond \cal P,$ a failure detector that cannot be implemented. To overcome this difficulty, we introduce an implementable failure detector called Heartbeat and show that it can be used to achieve quiescent reliable communication. Heartbeat is novel: in contrast to typical failure detectors, it does not output lists of suspects and it is implementable without timeouts. With Heartbeat, many existing algorithms that tolerate only process crashes can be transformed into quiescent algorithms that tolerate both process crashes and message losses. This can be applied to consensus, atomic broadcast, k-set agreement, atomic commitment, etc.
Keywords
This publication has 11 references indexed in Scilit:
- Using the heartbeat failure detector for quiescent reliable communication and consensus in partitionable networksTheoretical Computer Science, 1999
- Failure Detection and Randomization: A Hybrid Approach to Solve ConsensusSIAM Journal on Computing, 1998
- The weakest failure detector for solving consensusJournal of the ACM, 1996
- Unreliable failure detectors for reliable distributed systemsJournal of the ACM, 1996
- More Choices Allow More Faults: Set Consensus Problems in Totally Asynchronous SystemsInformation and Computation, 1993
- Simple constant-time consensus protocols in realistic failure modelsJournal of the ACM, 1989
- Effects of message loss on the termination of distributed protocolsInformation Processing Letters, 1988
- Reaching approximate agreement in the presence of faultsJournal of the ACM, 1986
- Asynchronous consensus and broadcast protocolsJournal of the ACM, 1985
- Impossibility of distributed consensus with one faulty processJournal of the ACM, 1985