ABSTRACT
Windows Azure Storage (WAS) is a cloud storage system that provides customers the ability to store seemingly limitless amounts of data for any duration of time. WAS customers have access to their data from anywhere at any time and only pay for what they use and store. In WAS, data is stored durably using both local and geographic replication to facilitate disaster recovery. Currently, WAS storage comes in the form of Blobs (files), Tables (structured storage), and Queues (message delivery). In this paper, we describe the WAS architecture, global namespace, and data model, as well as its resource provisioning, load balancing, and replication systems.
- J. Baker et al., "Megastore: Providing Scalable, Highly Available Storage for Interactive Services," in Conf. on Innovative Data Systems Research, 2011.Google Scholar
- Eric A. Brewer. "Towards Robust Distributed Systems. (Invited Talk)," in Principles of Distributed Computing, Portland, Oregon, 2000. Google ScholarDigital Library
- M. Burrows, "The Chubby Lock Service for Loosely-Coupled Distributed Systems," in OSDI, 2006. Google ScholarDigital Library
- F. Chang et al., "Bigtable: A Distributed Storage System for Structured Data," in OSDI, 2006. Google ScholarDigital Library
- B. Cooper et al., "PNUTS: Yahoo!'s Hosted Data Serving Platform," VLDB, vol. 1, no. 2, 2008. Google ScholarDigital Library
- G. DeCandia et al., "Dynamo: Amazon's Highly Available Key-value Store," in SOSP, 2007. Google ScholarDigital Library
- Cristian Estan and George Varghese, "New Directions in Traffic Measurement and Accounting," in SIGCOMM, 2002. Google ScholarDigital Library
- S. Ghemawat, H. Gobioff, and S. T. Leung, "The Google File System," in SOSP, 2003. Google ScholarDigital Library
- J. Gray, P. Helland, P. O'Neil, and D. Shasha, "The Dangers of Replication and a Solution," in SIGMOD, 1996. Google ScholarDigital Library
- Albert Greenberg et al., "VL2: A Scalable and Flexible Data Center Network," Communications of the ACM, vol. 54, no. 3, pp. 95--104, 2011. Google ScholarDigital Library
- Y. Hu and Q. Yang, "DCD---Disk Caching Disk: A New Approach for Boosting I/O Performance," in ISCA, 1996. Google ScholarDigital Library
- H. T. Kung and John T. Robinson, "On Optimistic Methods for Concurrency Control," ACM Transactions on Database Systems, vol. 6, no. 2, pp. 213--226, June 1981. Google ScholarDigital Library
- Leslie Lamport, "The Part-Time Parliament," ACM Transactions on Computer Systems, vol. 16, no. 2, pp. 133--169, May 1998. Google ScholarDigital Library
- A. Malik and P. Lakshman, "Cassandra: a decentralized structured storage system," SIGOPS Operating System Review, vol. 44, no. 2, 2010. Google ScholarDigital Library
- M. McKusick and S. Quinlan, "GFS: Evolution on Fast-forward," ACM File Systems, vol. 7, no. 7, 2009. Google ScholarDigital Library
- S. Mysore, B. Agrawal, T. Sherwood, N. Shrivastava, and S. Suri, "Profiling over Adaptive Ranges," in Symposium on Code Generation and Optimization, 2006. Google ScholarDigital Library
- P. O'Neil, E. Cheng, D. Gawlick, and E. O'Neil, "The Log-Structured Merge-Tree (LSM-tree)," Acta Informatica - ACTA, vol. 33, no. 4, 1996. Google ScholarDigital Library
- H. Patterson et al., "SnapMirror: File System Based Asynchronous Mirroring for Disaster Recovery," in USENIX-FAST, 2002. Google ScholarDigital Library
- Irving S. Reed and Gustave Solomon, "Polynomial Codes over Certain Finite Fields," Journal of the Society for Industrial and Applied Mathematics, vol. 8, no. 2, pp. 300--304, 1960.Google ScholarCross Ref
- R. Renesse and F. Schneider, "Chain Replication for Supporting High Throughput and Availability," in USENIX-OSDI, 2004. Google ScholarDigital Library
- J. Terrace and M. Freedman, "Object Storage on CRAQ: High-throughput chain replication for read-mostly workloads," in USENIX'09, 2009. Google ScholarDigital Library
- D. Terry, K. Petersen M. Theimer, A. Demers, M. Spreitzer, and C. Hauser, "Managing Update Conflicts in Bayou, A Weakly Connected Replicated Storage System," in ACM SOSP, 1995. Google ScholarDigital Library
- W. Vogel, "All Things Distributed - Choosing Consistency," in http://www.allthingsdistributed.com/2010/02/strong_consistency_simpledb.html, 2010.Google Scholar
Index Terms
Windows Azure Storage: a highly available cloud storage service with strong consistency
Recommendations
Erasure coding in windows azure storage
USENIX ATC'12: Proceedings of the 2012 USENIX conference on Annual Technical ConferenceWindows Azure Storage (WAS) is a cloud storage system that provides customers the ability to store seemingly limitless amounts of data for any duration of time. WAS customers have access to their data from anywhere, at any time, and only pay for what ...
A RAMCloud Storage System based on HDFS: Architecture, implementation and evaluation
Few cloud storage systems can handle random read accesses efficiently. In this paper, we present a RAMCloud Storage System, RCSS, to enable efficient random read accesses in cloud environments. Based on the Hadoop Distributed File System (HDFS), RCSS ...
Middleware enabled data sharing on cloud storage services
MW4SOC '10: Proceedings of the 5th International Workshop on Middleware for Service Oriented ComputingWith the emergence of public cloud storage platforms like Amazon, Microsoft and Google etc, individual applications and some enterprise storage are being increasingly deployed on Clouds. However, dynamic data sharing in public clouds face problems of ...
Comments