Exploring data reliability tradeoffs in replicated storage systems
Open Access
- 11 June 2009
- conference paper
- Published by Association for Computing Machinery (ACM)
- p. 217-226
- https://doi.org/10.1145/1551609.1551643
Abstract
This paper explores the feasibility of a cost-efficient storage architecture that offers the reliability and access performance characteristics of a high-end system. This architecture exploits two opportunities: First, scavenging idle storage from LAN-connected desktops not only offers a low-cost storage space, but also high I/O throughput by aggregating the I/O channels of the participating nodes. Second, the two components of data reliability - durability and availability - can be decoupled to control overall system cost. To capitalize on these opportunities, we integrate two types of components: volatile, scavenged storage and dedicated, yet low-bandwidth durable storage. On the one hand, the durable storage forms a low-cost back-end that enables the system to restore the data the volatile nodes may lose. On the other hand, the volatile nodes provide a high-throughput front-end. While integrating these components has the potential to offer a unique combination of high throughput, low cost, and durability, a number of concerns need to be addressed to architect and correctly provision the system. To this end, we develop analytical- and simulation based tools to evaluate the impact of system characteristics (e.g., bandwidth limitations on the durable and the volatile nodes) and design choices (e.g., replica placement scheme) on data availability and the associated system costs (e.g., maintenance traffic). Further, we implement and evaluate a prototype of the proposed architecture: namely a GridFTP server that aggregates volatile resources. Our evaluation demonstrates an impressive, up to 800MBps transfer throughput for the new GridFTP service.Keywords
This publication has 16 references indexed in Scilit:
- Amazon S3 for science gridsPublished by Association for Computing Machinery (ACM) ,2008
- Scalable security for petascale parallel file systemsPublished by Association for Computing Machinery (ACM) ,2007
- FreeLoader: Scavenging Desktop Storage Resources for Scientific DataPublished by Institute of Electrical and Electronics Engineers (IEEE) ,2005
- On the Impact of Replica Placement to the Reliability of Distributed Brick Storage SystemsPublished by Institute of Electrical and Electronics Engineers (IEEE) ,2005
- Modeling Machine Availability in Enterprise and Wide-Area Distributed Computing EnvironmentsPublished by Springer Nature ,2005
- Separating durability and availability in self-managed storagePublished by Association for Computing Machinery (ACM) ,2004
- The Google file systemPublished by Association for Computing Machinery (ACM) ,2003
- FarsiteACM SIGOPS Operating Systems Review, 2002
- Wide-area cooperative storage with CFSACM SIGOPS Operating Systems Review, 2001
- Hierarchical storage management in a distributed VOD systemIEEE MultiMedia, 1996