File assignment in parallel I/O systems with minimal variance of service time
- 1 February 2000
- journal article
- Published by Institute of Electrical and Electronics Engineers (IEEE) in IEEE Transactions on Computers
- Vol. 49 (2) , 127-140
- https://doi.org/10.1109/12.833109
Abstract
We address the problem of assigning nonpartitioned files in a parallel I/O system where the file accesses exhibit Poisson arrival rates and fixed service times. We present two new file assignment algorithms based on open queuing networks which aim at minimizing simultaneously the load balance across all disks, as well as the variance of the service time at each disk. We first present an off-line algorithm, Sort Partition, which assigns to each disk file with similar access time. Next, we show that, assuming that a perfectly balanced file assignment can be found for a given set of files, Sort Partition will find the one with minimal mean response time. We then present an on-line algorithm, Hybrid Partition, that assigns groups of files with similar service times in successive intervals while guaranteeing that the load imbalance at any point does not exceed a certain threshold. We report on synthetic experiments which exhibit skew in file accesses and sizes and we compare the performance of our new algorithms with the vanilla greedy file allocation algorithm.Keywords
This publication has 22 references indexed in Scilit:
- Supporting stored video: reducing rate variability and end-to-end resource requirements through optimal smoothingIEEE/ACM Transactions on Networking, 1998
- Database reorganization in parallel disk arrays with I/O service stealingIEEE Transactions on Knowledge and Data Engineering, 1998
- A Better Algorithm for an Ancient Scheduling ProblemJournal of Algorithms, 1996
- Allocating data and workload among multiple servers in a local area networkInformation Systems, 1995
- Allocating data and operations to nodes in distributed database designIEEE Transactions on Knowledge and Data Engineering, 1995
- NCSA's World Wide Web server: design and performanceComputer, 1995
- A self-adjusting data distribution mechanism for multidimensional load balancing in multiprocessor-based database systemsInformation Systems, 1994
- Data allocation for multidisk databasesIEEE Transactions on Knowledge and Data Engineering, 1993
- A microeconomic approach to optimal resource allocation in distributed computer systemsIEEE Transactions on Computers, 1989
- A Trace-Driven Analysis of the UNIX 4.2BSD File SystemPublished by Defense Technical Information Center (DTIC) ,1985