skip to main content
10.1145/1807128.1807165acmconferencesArticle/Chapter ViewAbstractPublication PagesmodConference Proceedingsconference-collections
research-article

RACS: a case for cloud storage diversity

Published:10 June 2010Publication History

ABSTRACT

The increasing popularity of cloud storage is leading organizations to consider moving data out of their own data centers and into the cloud. However, success for cloud storage providers can present a significant risk to customers; namely, it becomes very expensive to switch storage providers. In this paper, we make a case for applying RAID-like techniques used by disks and file systems, but at the cloud storage level. We argue that striping user data across multiple providers can allow customers to avoid vendor lock-in, reduce the cost of switching providers, and better tolerate provider outages or failures. We introduce RACS, a proxy that transparently spreads the storage load over many providers. We evaluate a prototype of our system and estimate the costs incurred and benefits reaped. Finally, we use trace-driven simulations to demonstrate how RACS can reduce the cost of switching storage vendors for a large organization such as the Internet Archive by seven-fold or more by varying erasure-coding parameters.

References

  1. Amazon S3. http://aws.amazon.com/s3.Google ScholarGoogle Scholar
  2. Amazon S3 july 2008 outage. http://www.networkworld.com/news/2008/072108-amazon-outages.html.Google ScholarGoogle Scholar
  3. Amazon S3 SLA. http://aws.amazon.com/s3-sla.Google ScholarGoogle Scholar
  4. Cloud services outage report. http://bit.ly/cloud_outage.Google ScholarGoogle Scholar
  5. DuraCloud Project. http://www.duraspace.org/duracloud.php.Google ScholarGoogle Scholar
  6. EMC Atmos Online Storage. http://www.atmosonline.com/?page_id=7.Google ScholarGoogle Scholar
  7. GoGrid Cloud Storage. http://www.gogrid.com/cloud-hosting.Google ScholarGoogle Scholar
  8. GoGrid SLA. http://www.gogrid.com/legal/sla.php.Google ScholarGoogle Scholar
  9. Internet Archive. http://www.archive.org/.Google ScholarGoogle Scholar
  10. Nirvanix Storage Deliver Network. http://www.nirvanix.com/how-to-buy/self-service-pricing.aspx.Google ScholarGoogle Scholar
  11. Rackspace Cloud Files. http://www.rackspacecloud.com/cloud_hosting_products/files.Google ScholarGoogle Scholar
  12. Rackspace june 2009 outage. http://www.bbc.co.uk/blogs/technology/2009/10/the_sidekick_cloud_disaster.html.Google ScholarGoogle Scholar
  13. Rackspace SLA. http://www.rackspacecloud.com/legal/cloudfilessla.Google ScholarGoogle Scholar
  14. The ZooKeeper project. http://hadoop.apache.org/zookeeper.Google ScholarGoogle Scholar
  15. E. Allen and C. M. Morris. Library of Congress and DuraCloud Launch Pilot Program Using Cloud Technologies to Test Perpetual Access to Digital Content. In Library of Congress, News Release, July 14 2009. http://www.loc.gov/today/pr/2009/09-140.html.Google ScholarGoogle Scholar
  16. J. Bloemer, M. Kalfane, M. Karpinski, R. Karp, M. Luby, and D. Zuckerman. An XOR-based erasure-resilient coding scheme. Technical Report TR-95-048, The International Computer Science Institute, Berkeley, CA, 1995.Google ScholarGoogle Scholar
  17. K. Bowers, A. Juels, and A. Oprea. HAIL: A high-availability and integrity layer for cloud storage. In Proc. of ACM Conference on Computer and Communications Security (CCS), Nov. 2003. Google ScholarGoogle ScholarDigital LibraryDigital Library
  18. Y. Chen, J. Edler, A. Goldberg, A. Gottlieb, S. Sobti, and P. Yianilos. Prototype implementation of archival intermemory. In Proc. of IEEE ICDE, pages 485--495, Feb. 1996.Google ScholarGoogle Scholar
  19. B. Chun, F. Dabek, A. Haeberlen, E. Sit, H. Weatherspoon, M. F. Kaashoek, J. Kubiatowicz, and R. Morris. Efficient replica maintenance for distributed storage systems. In Proc. of USENIX NSDI, San Jose, CA, May 2006. Google ScholarGoogle ScholarDigital LibraryDigital Library
  20. F. Dabek, M. F. Kaashoek, D. Karger, R. Morris, and I. Stoica. Wide-area cooperative storage with CFS. In Proc. of ACM SOSP, Oct. 2001. Google ScholarGoogle ScholarDigital LibraryDigital Library
  21. F. Dabek, J. Li, E. Sit, J. Robertson, M. F. Kaashoek, and R. Morris. Designing a DHT for low latency and high throughput. In Proc. of USENIX NSDI, Mar. 2004. Google ScholarGoogle ScholarDigital LibraryDigital Library
  22. R. Dingledine, M. Freedman, and D. Molnar. The freehaven project: Distributed anonymous storage service. In Proc. of the Workshop on Design Issues in Anonymity and Unobservability, July 2000. Google ScholarGoogle ScholarDigital LibraryDigital Library
  23. D. Ionescu. Microsoft red-faced after massive sidekick data loss. PCWorld, Oct. 2009.Google ScholarGoogle Scholar
  24. M. Luby. Lt codes. In Proc. of FOCS Symp., pages 271--282, Nov. 2002. Google ScholarGoogle ScholarDigital LibraryDigital Library
  25. M. Luby, M. Mitzenmacher, M. Shokrollahi, D. Spielman, and V. Stemann. Practical loss-resilient codes. In Proc. of ACM STOC, pages 150--159, 1997. Google ScholarGoogle ScholarDigital LibraryDigital Library
  26. M. Luby, M. Mitzenmacher, M. Shokrollahi, D. Spielman, and V. Stemann. Analysis of low density codes and improved designs using irregular graphs. In Proc. of ACM STOC, May 1998. Google ScholarGoogle ScholarDigital LibraryDigital Library
  27. P. Maymounkov. Online codes. Technical Report TR2002-833, New York University, New York, NY, Nov. 2002.Google ScholarGoogle Scholar
  28. D. Nurmi, R. Wolski, C. Grzegorczyk, G. Obertelli, S. Soman, L. Youseff, and D. Zagorodnov. The Eucalyptus open-source cloud-computing system. In CCGRID '09: Proceedings of the 2009 9th IEEE/ACM International Symposium on Cluster Computing and the Grid, pages 124--131, Washington, DC, USA, 2009. IEEE Computer Society. Google ScholarGoogle ScholarDigital LibraryDigital Library
  29. Z. O'Whielacronx. zfec forward error correction library. http://allmydata.org/trac/zfec, 2009.Google ScholarGoogle Scholar
  30. D. Patterson, G. Gibson, and R. Katz. The case for RAID: Redundant arrays of inexpensive disks. In Proc. of ACM SIGMOD Conf., pages 106--113, May 1988. Google ScholarGoogle ScholarDigital LibraryDigital Library
  31. J. Plank. A tutorial on reed-solomon coding for fault-tolerance in RAID-like systems. Software Practice and Experience, 27(9):995--1012, Sept. 1997. Google ScholarGoogle ScholarDigital LibraryDigital Library
  32. S. Rhea, P. Eaton, D. Geels, H. Weatherspoon, B. Zhao, and J. Kubiatowicz. Pond: the OceanStore prototype. In Proc. of USENIX FAST, 2003. Google ScholarGoogle ScholarDigital LibraryDigital Library
  33. S. Rhea, C. Wells, P. Eaton, D. Geels, B. Zhao, H. Weatherspoon, and J. Kubiatowicz. Maintenance free global storage in oceanstore. In Proc. of IEEE Internet Computing. IEEE, Sept. 2001. Google ScholarGoogle ScholarDigital LibraryDigital Library
  34. L. Rizzo and L. Vicisano. A reliable multicast data distribution protocol based on software fec. In Proc. of HPCS, Greece, June 1997.Google ScholarGoogle ScholarCross RefCross Ref
  35. M. A. Shah, M. Baker, J. C. Mogul, and R. Swaminathan. Auditing to keep online storage services honest. In HOTOS'07: Proceedings of the 11th USENIX workshop on Hot topics in operating systems, pages 1--6, Berkeley, CA, USA, 2007. USENIX Association. Google ScholarGoogle ScholarDigital LibraryDigital Library
  36. A. Shokrollahi. Raptor codes. Technical Report DF2003-06-01, Digital Fountain, Inc., Fremont, CA, June 2003.Google ScholarGoogle Scholar
  37. H. Stevens and C. Pettey. Gartner Says Cloud Computing Will Be As Influential As E-business. In Gartner Newsroom, Online Ed., June 26 2008. http://www.gartner.com/it/page.jsp?id=707508.Google ScholarGoogle Scholar
  38. M. Vrable, S. Savage, and G. M. Voelker. Cumulus: Filesystem backup to the cloud. Trans. Storage, 5(4):1--28, 2009. Google ScholarGoogle ScholarDigital LibraryDigital Library
  39. H. Weatherspoon and J. Kubiatowicz. Erasure coding vs. replication: A quantitative comparison. In Proc. of IPTPS, Mar. 2002. Google ScholarGoogle ScholarDigital LibraryDigital Library

Index Terms

  1. RACS: a case for cloud storage diversity

    Recommendations

    Comments

    Login options

    Check if you have access through your login credentials or your institution to get full access on this article.

    Sign in
    • Published in

      cover image ACM Conferences
      SoCC '10: Proceedings of the 1st ACM symposium on Cloud computing
      June 2010
      264 pages
      ISBN:9781450300360
      DOI:10.1145/1807128

      Copyright © 2010 ACM

      Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

      Publisher

      Association for Computing Machinery

      New York, NY, United States

      Publication History

      • Published: 10 June 2010

      Permissions

      Request permissions about this article.

      Request Permissions

      Check for updates

      Qualifiers

      • research-article

      Acceptance Rates

      Overall Acceptance Rate169of722submissions,23%

    PDF Format

    View or Download as a PDF file.

    PDF

    eReader

    View online with eReader.

    eReader