A Stochastic Approach to Measuring the Robustness of Resource Allocations in Distributed Systems
- 1 August 2006
- conference paper
- Published by Institute of Electrical and Electronics Engineers (IEEE)
- No. 01903918,p. 459-470
- https://doi.org/10.1109/icpp.2006.14
Abstract
Often, parallel and distributed computing systems must operate in an environment replete with uncertainty. Determining a resource allocation that accounts for this uncertainty in a way that can provide a probabilistic guarantee that a given level of quality of service (QoS) is achieved is an important research problem. This paper defines a stochastic methodology for quantifiably determining a resource allocation's ability to satisfy QoS constraints in the midst of uncertainty in system parameters. Uncertainty in system parameters and its impact on system performance are modeled stochastically. This stochastic model is then used to derive a quantitative expression for the robustness of a resource allocation. The paper investigates the utility of the proposed stochastic robustness metric by applying the metric to resource allocations in a simulated distributed system. The simulation results are then compared with deterministically defined metrics from the literatureKeywords
This publication has 16 references indexed in Scilit:
- Measuring the robustness of a resource allocationIEEE Transactions on Parallel and Distributed Systems, 2004
- Web search for a planet: the google cluster architectureIEEE Micro, 2003
- Robust scheduling of metaprogramsJournal of Scheduling, 2002
- Statistical prediction of task execution times through analytic benchmarking for scheduling in a heterogeneous environmentIEEE Transactions on Computers, 1999
- Task Matching and Scheduling in Heterogeneous Computing Environments Using a Genetic-Algorithm-Based ApproachJournal of Parallel and Distributed Computing, 1997
- Internet Web servers: workload characterization and performance implicationsIEEE/ACM Transactions on Networking, 1997
- Determining the Execution Time Distribution for a Data Parallel Program in a Heterogeneous Computing EnvironmentJournal of Parallel and Distributed Computing, 1997
- Performance analysis and scheduling of stochastic fork-join jobs in a multicomputer systemIEEE Transactions on Parallel and Distributed Systems, 1993
- Allocating modules to processors in a distributed systemIEEE Transactions on Software Engineering, 1989
- Heuristic Algorithms for Scheduling Independent Tasks on Nonidentical ProcessorsJournal of the ACM, 1977