ABSTRACT
The increasing popularity of cloud storage is leading organizations to consider moving data out of their own data centers and into the cloud. However, success for cloud storage providers can present a significant risk to customers: switching storage providers becomes very expensive. In this paper, we make a case for applying RAID-like techniques, as used by disks and file systems, at the cloud storage level. We argue that striping user data across multiple providers can allow customers to avoid vendor lock-in, reduce the cost of switching providers, and better tolerate provider outages or failures. We introduce RACS, a proxy that transparently spreads the storage load over many providers. We evaluate a prototype of our system and estimate the costs incurred and benefits reaped. Finally, we use trace-driven simulations to demonstrate how, by varying erasure-coding parameters, RACS can reduce the cost of switching storage vendors seven-fold or more for a large organization such as the Internet Archive.
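To make the RAID-like striping idea concrete, the following is a minimal sketch and not the RACS implementation. It assumes the simplest possible code, a single XOR parity share in the style of RAID 5, rather than the general erasure coding the abstract refers to. The function names `stripe` and `recover` are hypothetical, and each returned share stands in for an object that would be stored with a different provider.

```python
from functools import reduce
from typing import List, Optional


def xor_bytes(a: bytes, b: bytes) -> bytes:
    """Byte-wise XOR of two equal-length byte strings."""
    return bytes(x ^ y for x, y in zip(a, b))


def stripe(data: bytes, k: int) -> List[bytes]:
    """Split data into k equal-size data shares plus one XOR parity share.

    Assumption: one share per provider, so k + 1 providers in total.
    """
    share_len = -(-len(data) // k)  # ceiling division
    padded = data.ljust(share_len * k, b"\0")  # zero-pad to a multiple of k
    shares = [padded[i * share_len:(i + 1) * share_len] for i in range(k)]
    parity = reduce(xor_bytes, shares)
    return shares + [parity]


def recover(shares: List[Optional[bytes]], length: int) -> bytes:
    """Rebuild the original data when at most one share is missing (None)."""
    missing = [i for i, s in enumerate(shares) if s is None]
    if len(missing) > 1:
        raise ValueError("single parity tolerates only one lost share")
    shares = list(shares)
    if missing:
        # XOR of all surviving shares equals the lost share.
        present = [s for s in shares if s is not None]
        shares[missing[0]] = reduce(xor_bytes, present)
    # Drop the parity share, join the data shares, and strip the padding.
    return b"".join(shares[:-1])[:length]


if __name__ == "__main__":
    data = b"hello cloud storage"
    shares = stripe(data, 3)   # 3 data shares + 1 parity share
    shares[1] = None           # simulate one provider being unavailable
    print(recover(shares, len(data)) == data)
```

In this sketch each provider holds a share of roughly `len(data) / k` bytes, so total storage is `(k + 1) / k` times the original size, and the data survives the loss of any one share. Tolerating more than one lost share would require a more general erasure code than XOR parity.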
Index Terms
- RACS: a case for cloud storage diversity