Constructing Replicated Systems Using Processors With Point To Point Communication Links
- 24 August 2005
- conference paper
- Published by Institute of Electrical and Electronics Engineers (IEEE)
- No. 10636897,p. 177-184
- https://doi.org/10.1109/isca.1989.714552
Abstract
Replicated processing with majority voting is a well known method of achieving fault tolerance. We consider the problem of constructing a distributed system composed of an arbitrarily large number of N-modular redundant (NMR) nodes, where each node itself is composed of N, N = 2m + I and m /spl ges/ 1, processing and voting elements. Advanced microprocessors, such as Inmos Transputers, provide fast serial communication links for inter-processor communication, making it possible to construct large networks of processors. We describe how replicated processing with majority voting can be achieved for such processor networks. This paper will present the overall systems architecture, including voting and NMR synchronization algorithms specially developed to exploit fast point to point communication facilities.Keywords
This publication has 9 references indexed in Scilit:
- Fault tolerant parallel processor architecture overviewPublished by Institute of Electrical and Electronics Engineers (IEEE) ,2003
- The Delta-4 approach to dependability in open distributed computing systemsPublished by Institute of Electrical and Electronics Engineers (IEEE) ,1988
- Modular redundancy in a message passing systemIEEE Transactions on Software Engineering, 1986
- Replication and fault-tolerance in the ISIS systemPublished by Association for Computing Machinery (ACM) ,1985
- Implementing remote procedure callsACM Transactions on Computer Systems, 1984
- Fault-tolerant clock synchronizationPublished by Association for Computing Machinery (ACM) ,1984
- The Byzantine Generals ProblemACM Transactions on Programming Languages and Systems, 1982
- A method for obtaining digital signatures and public-key cryptosystemsCommunications of the ACM, 1978
- SIFT: Design and analysis of a fault-tolerant computer for aircraft controlProceedings of the IEEE, 1978