The interaction of parallel and sequential workloads on a network of workstations
- 1 May 1995
- journal article
- Published by Association for Computing Machinery (ACM) in ACM SIGMETRICS Performance Evaluation Review
- Vol. 23 (1), 267-278
- https://doi.org/10.1145/223586.223618
Abstract
This paper examines the plausibility of using a network of workstations (NOW) for a mixture of parallel and sequential jobs. Through simulations, our study examines issues that arise when combining these two workloads on a single platform. Starting from a dedicated NOW just for parallel programs, we incrementally relax uniprogramming restrictions until we have a multi-programmed, multi-user NOW for both interactive sequential users and parallel programs. We show that a number of issues associated with the distributed NOW environment (e.g., daemon activity, coscheduling skew) can have a small but noticeable effect on parallel program performance. We also find that efficient migration to idle workstations is necessary to maintain acceptable parallel application performance. Furthermore, we present a methodology for deriving an optimal delay time for recruiting idle machines for use by parallel programs; this recruitment threshold was just 3 minutes for the research cluster we measured. Finally, we quantify the effects of the additional parallel load upon interactive users by keeping track of the potential number of user delays in our simulations. When we limit the maximum number of delays per user, we can still maintain acceptable parallel program performance. In summary, we find that for our workloads a 2:1 rule applies: a NOW cluster of approximately 60 machines can sustain a 32-node parallel workload in addition to the sequential load placed upon it by interactive users.