Resource management for distributed parallel systems
- 31 December 2002
- conference paper
- Published by Institute of Electrical and Electronics Engineers (IEEE)
- p. 316-323
- https://doi.org/10.1109/hpdc.1993.263828
Abstract
Multiprocessor systems should exist in the larger context of distributed systems, allowing multiprocessor resources to be shared by those that need them. Unfortunately, typical multiprocessor resource management techniques do not scale to large networks. The Prospero Resource Manager (PRM) is a scalable resource allocation system that supports the allocation of processing resources in large networks and multiprocessor systems. To manage resources in such distributed parallel systems, PRM employs three types of managers: system managers, job managers, and node managers. There exist multiple independent instances of each type of manager, reducing bottlenecks. The complexity of each manager is further reduced because each is designed to utilize information at an appropriate level of abstraction.<>Keywords
This publication has 6 references indexed in Scilit:
- Finding and exploiting parallelism in an ocean simulation program: Experience, results, and implicationsJournal of Parallel and Distributed Computing, 1992
- Scheduler activationsACM Transactions on Computer Systems, 1992
- Transparent Process Migration for Personal WorkstationsPublished by Defense Technical Information Center (DTIC) ,1989
- Finding idle machines in a workstation-based distributed systemIEEE Transactions on Software Engineering, 1989
- The Benevolent Bandit Laboratory: a testbed for distributed algorithmsIEEE Journal on Selected Areas in Communications, 1989
- Exploiting virtual synchrony in distributed systemsPublished by Association for Computing Machinery (ACM) ,1987