Wave Scheduling—Decentralized Scheduling of Task Forces in Multicomputers
- 1 September 1984
- journal article
- Published by Institute of Electrical and Electronics Engineers (IEEE) in IEEE Transactions on Computers
- Vol. C-33 (9) , 835-844
- https://doi.org/10.1109/tc.1984.1676500
Abstract
Decentralized operating systems that control large multicomputers need techniques to schedule competing parallel programs called task forces. Wave scheduling is a probabilistic technique that uses a hierarchical distributed virtual machine to schedule task forces by recursively subdividing and issuing wavefront-like commands to processing elements capable of executing individual tasks. Wave scheduling is highly resistant to processing element failures because it uses many distributed schedulers that dynamically assign scheduling responsibilities among themselves. The scheduling technique is trivially extensible as more processing elements join the host multicomputer. A simple model of scheduling cost is used by every scheduler node to distribute scheduling activity and minimize wasted processing capacity by using perceived workload to vary decentralized scheduling rules. At low to moderate levels of network activity, wave scheduling is only slightly less efficient than a central scheduler in its ability to direct processing elements to accomplish useful work.Keywords
This publication has 9 references indexed in Scilit:
- Operating Systems for the Micronet Network ComputerIEEE Micro, 1983
- The Contract Net Protocol: High-Level Communication and Control in a Distributed Problem SolverIEEE Transactions on Computers, 1980
- Micros, A Distributed Operating System for Micronet, A Reconfigurable Network ComputerIEEE Transactions on Computers, 1980
- MedusaCommunications of the ACM, 1980
- An overview of the Texas reconfigurable array computerPublished by Association for Computing Machinery (ACM) ,1980
- StarOS, a multiprocessor operating system for the support of task forcesPublished by Association for Computing Machinery (ACM) ,1979
- The Roscoe distributed operating systemPublished by Association for Computing Machinery (ACM) ,1979
- Micronet: A reconfigurable microcomputer network for distributed systems researchSIMULATION, 1978
- A Large Scale, Homogenous, Fully Distributed Parallel Machine, IIPublished by Association for Computing Machinery (ACM) ,1977