Highly concurrent shared storage
- 7 November 2002
- conference paper
- Published by Institute of Electrical and Electronics Engineers (IEEE)
Abstract
Switched system-area networks enable thousands of storage devices to be shared and directly accessed by end hosts, promising databases and file systems highly scalable, reliable storage. In such systems, hosts perform access tasks (read and write) and management tasks (storage migration and reconstruction of data on failed devices.) Each task translates into multiple phases of low-level device I/Os, so that concurrent host tasks accessing shared devices can corrupt redundancy codes and cause hosts to read inconsistent data. Concurrency control protocols that scale to large system sizes are required in order to coordinate on-line storage management and access tasks.In this paper, we identify the tasks that storage controllers must perform, and propose an approach which allows these tasks to be composed from basic operations-called base storage transactions (BSTs)-such that correctness requires only the serializability of the BSTs and not of the parent tasks. We present highly scalable distributed protocols, which exploit storage technology trends and BST properties to achieve serializability while coming within a few percent of ideal performance.Keywords
This publication has 16 references indexed in Scilit:
- Fault-tolerant replication management in large-scale distributed storage systemsPublished by Institute of Electrical and Electronics Engineers (IEEE) ,2003
- Fast, on-line failure recovery in redundant disk arraysPublished by Institute of Electrical and Electronics Engineers (IEEE) ,2002
- PetalPublished by Association for Computing Machinery (ACM) ,1996
- TNet: a reliable system area networkIEEE Micro, 1995
- Fine-grained sharing in a page server OODBMSPublished by Association for Computing Machinery (ACM) ,1994
- The ObjectStore database systemCommunications of the ACM, 1991
- Network Time Protocol (version 1) specification and implementationPublished by RFC Editor ,1988
- Scale and performance in a distributed file systemACM Transactions on Computer Systems, 1988
- Principles of transaction-oriented database recoveryACM Computing Surveys, 1983
- The notions of consistency and predicate locks in a database systemCommunications of the ACM, 1976