A fresh look at the reliability of long-term digital storage
- 18 April 2006
- proceedings article
- Published by Association for Computing Machinery (ACM)
- p. 221-234
- https://doi.org/10.1145/1217935.1217957
Abstract
Emerging Web services, such as email, photo sharing, and web site archives, must preserve large volumes of quickly accessible data indefinitely into the future. The costs of doing so often determine whether the service is economically viable. We make the case that these applications' demands on large scale storage systems over long time horizons require us to reevaluate traditional system designs. We examine threats to long-lived data from an end-to-end perspective, taking into account not just hardware and software faults but also faults due to humans and organizations. We present a simple model of long-term storage failures that helps us reason about various strategies for addressing some of these threats. Using this model we show that the most important strategies for increasing the reliability of long-term storage are detecting latent faults quickly, automating fault repair to make it cheaper and faster, and increasing the independence of data replicas.Keywords
All Related Versions
This publication has 12 references indexed in Scilit:
- IRON file systemsPublished by Association for Computing Machinery (ACM) ,2005
- Deep Store: An Archival Storage System ArchitecturePublished by Institute of Electrical and Electronics Engineers (IEEE) ,2005
- The LOCKSS peer-to-peer digital preservation systemACM Transactions on Computer Systems, 2005
- Commercial fault tolerance: a tale of two systemsIEEE Transactions on Dependable and Secure Computing, 2004
- TanglerPublished by Association for Computing Machinery (ACM) ,2001
- Storage management and caching in PAST, a large-scale, persistent peer-to-peer storage utilityPublished by Association for Computing Machinery (ACM) ,2001
- Wide-area cooperative storage with CFSPublished by Association for Computing Machinery (ACM) ,2001
- A prototype implementation of archival IntermemoryPublished by Association for Computing Machinery (ACM) ,1999
- RAID: high-performance, reliable secondary storageACM Computing Surveys, 1994
- Fault-tolerance in very large archival systemsPublished by Association for Computing Machinery (ACM) ,1990