Data Storage Management in Cloud Environments: Taxonomy, Survey, and Future Directions

Published: 11 December 2017
Abstract

Storage as a Service (StaaS) is a vital component of cloud computing, offering the vision of a virtually infinite pool of storage resources. It supports a variety of cloud-based data store classes that differ in availability, scalability, ACID (Atomicity, Consistency, Isolation, Durability) properties, data models, and price options. Application providers deploy these storage classes across different cloud-based data stores not only to tackle the challenges arising from reliance on a single cloud-based data store but also to obtain higher availability, lower response time, and greater cost efficiency. Hence, in this article, we first discuss the key advantages and challenges of data-intensive applications deployed within and across cloud-based data stores. Then, we provide a comprehensive taxonomy that covers key aspects of cloud-based data stores: data model, data dispersion, data consistency, data transaction service, and data management cost. Finally, we map various cloud-based data store projects to our proposed taxonomy to validate the taxonomy and identify areas for future research.

References

  1. Daniel Abadi. 2012. Consistency tradeoffs in modern distributed database system design: CAP is only part of the story. Computer 2 (Feb. 2012). 0018-916 Google ScholarGoogle ScholarDigital LibraryDigital Library
  2. Daniel J. Abadi. 2009. Data management in the cloud: Limitations and opportunities. IEEE Data Eng. Bull. 32, 1 (2009), 3--12.Google ScholarGoogle Scholar
  3. Hussam Abu-Libdeh, Lonnie Princehouse, and Hakim Weatherspoon. 2010. RACS: A case for cloud storage diversity. In Proceedings of the 1st ACM Symposium on Cloud Computing (SoCC’10). ACM, New York, NY, 229--240. Google ScholarGoogle ScholarDigital LibraryDigital Library
  4. Atul Adya. 1999. Weak Consistency: A Generalized Theory and Optimistic Implementations for Distributed Transactions. Ph.D. Dissertation. Cambridge, MA. Google ScholarGoogle ScholarDigital LibraryDigital Library
  5. Divyakant Agrawal, Amr El Abbadi, Hatem A. Mahmoud, Faisal Nawab, and Kenneth Salem. 2013. Managing Geo-replicated Data in Multi-datacenters. Springer, Berlin, 23--43.Google ScholarGoogle Scholar
  6. Rakesh Agrawal, Michael J. Carey, and Miron Livny. 1987. Concurrency control performance modeling: Alternatives and implications. ACM Trans. Data. Syst. 12, 4 (Nov. 1987), 609--654. Google ScholarGoogle ScholarDigital LibraryDigital Library
  7. Marcos K. Aguilera, Arif Merchant, Mehul Shah, Alistair Veitch, and Christos Karamanolis. 2007. Sinfonia: A new paradigm for building scalable distributed systems. SIGOPS Oper. Syst. Rev. 6 (Oct. 2007), 5:1--5:48. Google ScholarGoogle ScholarDigital LibraryDigital Library
  8. Mustaque Ahamad, Gil Neiger, James E. Burns, Prince Kohli, and Phillip W. Hutto. 1995. Causal memory: Definitions, implementation, and programming. Distrib. Comput. 9, 1 (01 Mar 1995), 37--49.Google ScholarGoogle Scholar
  9. Mohammad Al-Fares, Sivasankar Radhakrishnan, Barath Raghavan, Nelson Huang, and Amin Vahdat. 2010. Hedera: Dynamic flow scheduling for data center networks. In Proceedings of the 7th USENIX Conference on Networked Systems Design and Implementation (NSDI’10). USENIX Association, Berkeley, CA, 19--19. Google ScholarGoogle ScholarDigital LibraryDigital Library
  10. Paulo Sérgio Almeida, Ali Shoker, and Carlos Baquero. 2015. Efficient State-Based CRDTs by Delta-Mutation. Springer International Publishing, Cham, 62--76.Google ScholarGoogle Scholar
  11. Sérgio Almeida, João Leitão, and Luís Rodrigues. 2013. ChainReaction: A causal+ consistent datastore based on chain replication. In Proceedings of the 8th ACM European Conference on Computer Systems (EuroSys’13). ACM, New York, NY, 85--98. Google ScholarGoogle ScholarDigital LibraryDigital Library
  12. Peter Alvaro, Peter Bailis, Neil Conway, and Joseph M. Hellerstein. 2013. Consistency without borders. In Proceedings of the 4th Annual Symposium on Cloud Computing (SOCC’13). ACM, New York, NY, Article 23, 23:1--23:10. Google ScholarGoogle ScholarDigital LibraryDigital Library
  13. Peter Alvaro, Neil Conway, Joseph M. Hellerstein, and David Maier. 2014. Blazes: Coordination analysis for distributed programs. In Proceedings of the 30th IEEE International Conference on Data Engineering. 52--63.Google ScholarGoogle ScholarCross RefCross Ref
  14. Peter Alvaro, Neil Conway, Joseph M. Hellerstein, and William R. Marczak. 2011. Consistency analysis in bloom: A CALM and collected approach. In Proceedings of the 5th Biennial Conference on Innovative Data Systems Research. 249--260.Google ScholarGoogle Scholar
  15. Eric Anderson, Xiaozhou Li, Mehul A. Shah, Joseph Tucek, and Jay J. Wylie. 2010. What consistency does your key-value store actually provide? In Proceedings of the 6th International Conference on Hot Topics in System Dependability (HotDep’10). USENIX Association, Berkeley, CA, 1--16. Google ScholarGoogle ScholarDigital LibraryDigital Library
  16. Masoud Saeida Ardekani, Pierre Sutra, and Marc Shapiro. 2013. Non-monotonic snapshot isolation: Scalable and strong consistency for geo-replicated transactional systems. In Proceedings of the 2013 IEEE 32Nd International Symposium on Reliable Distributed Systems (SRDS’13). IEEE Computer Society, Washington, DC, 163--172. Google ScholarGoogle ScholarDigital LibraryDigital Library
  17. Masoud Saeida Ardekani, Pierre Sutra, and Marc Shapiro. 2014. G-DUR: A middleware for assembling, analyzing, and improving transactional protocols. In Proceedings of the 15th International Middleware Conference (Middleware’14). ACM, New York, NY, 13--24. Google ScholarGoogle ScholarDigital LibraryDigital Library
  18. Masoud Saeida Ardekani, Pierre Sutra, Marc Shapiro, and Nuno M. Preguiça. 2013. On the scalability of snapshot isolation. In Proceedings Euro-Par 2013 Parallel Processing—19th International Conference. 369--381. Google ScholarGoogle ScholarDigital LibraryDigital Library
  19. J. E. Armendáriz-Iñigo, A. Mauch-Goya, J. R. González de Mendívil, and F. D. Muñoz Escoí. 2008. SIPRe: A partial database replication protocol with SI replicas. In Proceedings of the 2008 ACM Symposium on Applied Computing (SAC’08). ACM, New York, NY, 2181--2185. Google ScholarGoogle ScholarDigital LibraryDigital Library
  20. Peter Bailis, Aaron Davidson, Alan Fekete, Ali Ghodsi, Joseph M. Hellerstein, and Ion Stoica. 2013. Highly available transactions: Virtues and limitations. Proc. VLDB Endow. 7, 3 (Nov. 2013), 181--192. Google ScholarGoogle ScholarDigital LibraryDigital Library
  21. Peter Bailis, Alan Fekete, Michael J. Franklin, Ali Ghodsi, Joseph M. Hellerstein, and Ion Stoica. 2014. Coordination avoidance in database systems. Proc. VLDB Endow. 3 (Nov. 2014), 185--196. Google ScholarGoogle ScholarDigital LibraryDigital Library
  22. Peter Bailis and Ali Ghodsi. 2013. Eventual consistency today: Limitations, extensions, and beyond. Queue 11, 3, Article 20 (March 2013), 20:20--20:32.1542-7730 Google ScholarGoogle ScholarDigital LibraryDigital Library
  23. Peter Bailis, Shivaram Venkataraman, Michael J. Franklin, Joseph M. Hellerstein, and Ion Stoica. 2012. Probabilistically bounded staleness for practical partial quorums. Proc. VLDB Endow. 5, 8 (April 2012), 776--787. Google ScholarGoogle ScholarDigital LibraryDigital Library
  24. Jason Baker, Chris Bond, James C. Corbett, J. J. Furman, Andrey Khorlin, James Larson, Jean-Michel Leon, Yawei Li, Alexander Lloyd, and Vadim Yushprakh. 2011. Megastore: Providing scalable, highly available storage for interactive services. In Proceedings of the 5th Biennial Conference on Innovative Data Systems Research (CIDR’11). 223--234.Google ScholarGoogle Scholar
  25. Valter Balegas, Sérgio Duarte, Carla Ferreira, Rodrigo Rodrigues, Nuno Preguiça, Mahsa Najafzadeh, and Marc Shapiro. 2015. Putting consistency back into eventual consistency. In Proceedings of the 10th European Conference on Computer Systems (EuroSys’15). ACM, New York, NY, Article 6, 6:1--6:16. Google ScholarGoogle ScholarDigital LibraryDigital Library
  26. Hitesh Ballani, Paolo Costa, Thomas Karagiannis, and Ant Rowstron. 2011. Towards predictable datacenter networks. In Proceedings of the ACM SIGCOMM Conference (SIGCOMM’11). ACM, New York, NY, 242--253. Google ScholarGoogle ScholarDigital LibraryDigital Library
  27. Hal Berenson, Phil Bernstein, Jim Gray, Jim Melton, Elizabeth O’Neil, and Patrick O’Neil. 1995. A critique of ANSI SQL isolation levels. In Proceedings of the ACM SIGMOD International Conference on Management of Data (SIGMOD’95). ACM, New York, NY, 1--10. Google ScholarGoogle ScholarDigital LibraryDigital Library
  28. David Bermbach, Markus Klems, Stefan Tai, and Michael Menzel. 2011. MetaStorage: A federated cloud storage system to manage consistency-latency tradeoffs. In Proceedings of IEEE International Conference on Cloud Computing (CLOUD’11). 452--459. Google ScholarGoogle ScholarDigital LibraryDigital Library
  29. Alysson Bessani, Miguel Correia, Bruno Quaresma, Fernando André, and Paulo Sousa. 2011. DepSky: Dependable and secure storage in a cloud-of-clouds. In Proceedings of the 6th European Conference on Computer Systems (EuroSys’11). ACM, New York, NY, 31--46. Google ScholarGoogle ScholarDigital LibraryDigital Library
  30. C. E. Bezerra, F. Pedone, and R. V. Renesse. 2014. Scalable state-machine replication. In Proceedings of the 44th Annual IEEE/IFIP International Conference on Dependable Systems and Networks. 331--342. Google ScholarGoogle ScholarDigital LibraryDigital Library
  31. Dhruba Borthakur, Jonathan Gray, Joydeep Sen Sarma, Kannan Muthukkaruppan, Nicolas Spiegelberg, Hairong Kuang, Karthik Ranganathan, Dmytro Molkov, Aravind Menon, Samuel Rash, Rodrigo Schmidt, and Amitanand Aiyer. 2011. Apache Hadoop goes realtime at facebook. In Proceedings of the ACM SIGMOD International Conference on Management of Data (SIGMOD’11). ACM, New York, NY, 1071--1080. Google ScholarGoogle ScholarDigital LibraryDigital Library
  32. Kevin D. Bowers, Ari Juels, and Alina Oprea. 2009. HAIL: A high-availability and integrity layer for cloud storage. In Proceedings of the 16th ACM Conference on Computer and Communications Security (CCS’09). ACM, New York, NY, 187--198. Google ScholarGoogle ScholarDigital LibraryDigital Library
  33. Matthias Brantner, Daniela Florescu, David Graf, Donald Kossmann, and Tim Kraska. 2008. Building a database on S3. In Proceedings of the ACM SIGMOD International Conference on Management of Data (SIGMOD’08). ACM, New York, NY, 251--264. Google ScholarGoogle ScholarDigital LibraryDigital Library
  34. Christian Cachin, Robert Haas, and Marko Vukolic. 2010. Dependable storage in the Intercloud. Technical Report. Research Report RZ, 3783.Google ScholarGoogle Scholar
  35. Brad Calder, Ju Wang, Aaron Ogus, Niranjan Nilakantan, Arild Skjolsvold, Sam McKelvie, Yikang Xu, Shashwat Srivastav, Jiesheng Wu, Huseyin Simitci, Jaidev Haridas, Chakravarthy Uddaraju, Hemal Khatri, Andrew Edwards, Vaman Bedekar, Shane Mainali, Rafay Abbasi, Arpit Agarwal, Mian Fahim ul Haq, Muhammad Ikram ul Haq, Deepali Bhardwaj, Sowmya Dayanand, Anitha Adusumilli, Marvin McNett, Sriram Sankaran, Kavitha Manivannan, and Leonidas Rigas. 2011. Windows azure storage: A highly available cloud storage service with strong consistency. In Proceedings of the 23rd ACM Symposium on Operating Systems Principles (SOSP’11). ACM, New York, NY, 143--157. Google ScholarGoogle ScholarDigital LibraryDigital Library
  36. David G. Campbell, Gopal Kakivaya, and Nigel Ellis. 2010. Extreme scale with full SQL language support in microsoft SQL azure. In Proceedings of the ACM SIGMOD International Conference on Management of Data (SIGMOD’10). ACM, New York, NY, 1021--1024. Google ScholarGoogle ScholarDigital LibraryDigital Library
  37. Rick Cattell. 2011. Scalable SQL and NoSQL data stores. SIGMOD Record 39, 4 (May 2011), 12--27. Google ScholarGoogle ScholarDigital LibraryDigital Library
  38. A. Chan and R. Gray. 1985. Implementing distributed read-only transactions. IEEE Trans. Softw. Eng. SE-11, 2 (Feb 1985), 205--212. Google ScholarGoogle ScholarDigital LibraryDigital Library
  39. Chia-Wei Chang, Pangfeng Liu, and Jan-Jan Wu. 2012. Probability-based cloud storage providers selection algorithms with maximum availability. In Proceedings of the 41st International Conference on Parallel Processing. 199--208. Google ScholarGoogle ScholarDigital LibraryDigital Library
  40. Fay Chang, Jeffrey Dean, Sanjay Ghemawat, Wilson C. Hsieh, Deborah A. Wallach, Mike Burrows, Tushar Chandra, Andrew Fikes, and Robert E. Gruber. 2008. Bigtable: A distributed storage system for structured data. ACM Trans. Comput. Syst. 26, 2, Article 4 (June 2008), 4:1--4:26.0734-2071 Google ScholarGoogle ScholarDigital LibraryDigital Library
  41. Fangfei Chen, Katherine Guo, John Lin, and Thomas F. La Porta. 2012. Intra-cloud lightning: Building CDNs in the cloud. In Proceedings of the IEEE INFOCOM. 433--441.Google ScholarGoogle Scholar
  42. H. Chen, H. Jin, and S. Wu. 2016. Minimizing inter-server communications by exploiting self-similarity in online social networks. IEEE Trans. Parall. Distrib. Syst. 27, 4 (April 2016), 1116--1130. Google ScholarGoogle ScholarDigital LibraryDigital Library
  43. Haopeng Chen, Zhenhua Wang, and Yunmeng Ban. 2013. Access-load-aware dynamic data balancing for cloud storage service. In Proceedings of the 6th International Conference on Internet and Distributed Computing Systems—Volume 8223 (IDCS’13). Springer-Verlag, New York, 307--320. Google ScholarGoogle ScholarDigital LibraryDigital Library
  44. H. C. H. Chen, Y. Hu, P. P. C. Lee, and Y. Tang. 2014. NCCloud: A network-coding-based storage system in a cloud-of-clouds. IEEE Trans. Comput. 63, 1 (Jan 2014), 31--44. Google ScholarGoogle ScholarDigital LibraryDigital Library
  45. Kai Chen, Ankit Singla, Atul Singh, Kishore Ramachandran, Lei Xu, Yueping Zhang, Xitao Wen, and Yan Chen. 2014. OSA: An optical switching architecture for data center networks with unprecedented flexibility. IEEE/ACM Trans. Netw. 22, 2 (April 2014), 498--511. Google ScholarGoogle ScholarDigital LibraryDigital Library
  46. H. E. Chihoub, S. Ibrahim, G. Antoniu, and M. S. Prez. 2013. Consistency in the cloud: When money does matter! In Proceedings of the 13th IEEE/ACM International Symposium on Cluster, Cloud, and Grid Computing. 352--359.Google ScholarGoogle Scholar
  47. D. Chiu and G. Agrawal. 2010. Evaluating caching and storage options on the Amazon web services cloud. In Proceedings of the 11th IEEE/ACM International Conference on Grid Computing. 17--24.Google ScholarGoogle Scholar
  48. Asaf Cidon, Robert Escriva, Sachin Katti, Mendel Rosenblum, and Emin Gun Sirer. 2015. Tiered replication: A cost-effective alternative to full cluster geo-replication. In Proceedings of the USENIX Annual Technical Conference (USENIX ATC’15). USENIX Association, Santa Clara, CA, 31--43. Google ScholarGoogle ScholarDigital LibraryDigital Library
  49. Asaf Cidon, Stephen Rumble, Ryan Stutsman, Sachin Katti, John Ousterhout, and Mendel Rosenblum. 2013. Copysets: Reducing the frequency of data loss in cloud storage. In Presented as part of the 2013 USENIX Annual Technical Conference (USENIX ATC’13). USENIX, San Jose, CA, 37--48. Google ScholarGoogle ScholarDigital LibraryDigital Library
  50. Brian F. Cooper, Raghu Ramakrishnan, Utkarsh Srivastava, Adam Silberstein, Philip Bohannon, Hans-Arno Jacobsen, Nick Puz, Daniel Weaver, and Ramana Yerneni. 2008. PNUTS: Yahoo!’s hosted data serving platform. Proc. VLDB Endow.2 (Aug. 2008), 1277--1288. Google ScholarGoogle ScholarDigital LibraryDigital Library
  51. James C. Corbett, Jeffrey Dean, Michael Epstein, Andrew Fikes, Christopher Frost, J. J. Furman, Sanjay Ghemawat, Andrey Gubarev, Christopher Heiser, Peter Hochschild, Wilson Hsieh, Sebastian Kanthak, Eugene Kogan, Hongyi Li, Alexander Lloyd, Sergey Melnik, David Mwaura, David Nagle, Sean Quinlan, Rajesh Rao, Lindsay Rolig, Yasushi Saito, Michal Szymaniak, Christopher Taylor, Ruth Wang, and Dale Woodford. 2013. Spanner: Google’s globally distributed database. ACM Trans. Comput. Syst. 31, 3, Article 8 (Aug. 2013), 8:1--8:22. Google ScholarGoogle ScholarDigital LibraryDigital Library
  52. Thomas H. Cormen, Charles E. Leiserson, Ronald L. Rivest, and Clifford Stein. 2009. Introduction to Algorithms, 3rd ed. MIT Press. Google ScholarGoogle ScholarDigital LibraryDigital Library
  53. Paolo Costa, Austin Donnelly, Antony Rowstron, and Greg O’Shea. 2012. Camdoop: Exploiting in-network aggregation for big data applications. In Presented as Part of the 9th USENIX Symposium on Networked Systems Design and Implementation (NSDI’12). USENIX, San Jose, CA, 29--42. Google ScholarGoogle ScholarDigital LibraryDigital Library
  54. Anupam Das, Cristian Lumezanu, Yueping Zhang, Vishal Singh, Guofei Jiang, and Curtis Yu. 2013b. Transparent and flexible network management for big data processing in the cloud. In Proceedings of the 5th USENIX Workshop on Hot Topics in Cloud Computing. USENIX, Berkeley, CA.Google ScholarGoogle Scholar
  55. Sudipto Das, Divyakant Agrawal, and Amr El Abbadi. 2013a. ElasTraS: An elastic, scalable, and self-managing transactional database for the cloud. ACM Trans. Database Syst. 38, 1, Article 5 (April 2013), 5:1--5:45. Google ScholarGoogle ScholarDigital LibraryDigital Library
  56. DeCandia. 2007. Dynamo: Amazon’s highly available key-value store. SIGOPS Oper. Syst. Rev. 41, 6 (Oct. 2007), 205--220. Google ScholarGoogle ScholarDigital LibraryDigital Library
  57. Xavier Défago, André Schiper, and Péter Urbán. 2004. Total order broadcast and multicast algorithms: Taxonomy and survey. Comput. Surveys 36, 4 (Dec. 2004), 372--421. Google ScholarGoogle ScholarDigital LibraryDigital Library
  58. Alan Demers, Dan Greene, Carl Hauser, Wes Irish, John Larson, Scott Shenker, Howard Sturgis, Dan Swinehart, and Doug Terry. 1987. Epidemic algorithms for replicated database maintenance. In Proceedings of the 6th Annual ACM Symposium on Principles of Distributed Computing (PODC’87). ACM, New York, NY, 1--12. Google ScholarGoogle ScholarDigital LibraryDigital Library
  59. A. G. Dimakis, P. B. Godfrey, Y. Wu, M. J. Wainwright, and K. Ramchandran. 2010. Network coding for distributed storage systems. IEEE Trans. Info. Theory 56, 9 (Sept 2010), 4539--4551. Google ScholarGoogle ScholarDigital LibraryDigital Library
  60. Jiaqing Du, Sameh Elnikety, Amitabha Roy, and Willy Zwaenepoel. 2013. Orbe: Scalable causal consistency using dependency matrices and physical clocks. In Proceedings of the 4th Annual Symposium on Cloud Computing (SOCC’13). ACM, New York, NY, Article 11, 11:1--11:14. Google ScholarGoogle ScholarDigital LibraryDigital Library
  61. Jiaqing Du, Calin Iorgulescu, Amitabha Roy, and Willy Zwaenepoel. 2014b. Closing the performance gap between causal consistency and eventual consistency. In Proceedings of the Workshop on the Principles and Practice of Eventual Consistency (PaPEC’14).Google ScholarGoogle Scholar
  62. Jiaqing Du, Calin Iorgulescu, Amitabha Roy, and Willy Zwaenepoel. 2014a. Gentlerain: Cheap and scalable causal consistency with physical clocks. In Proceedings of the ACM Symposium on Cloud Computing. ACM, 1--13. Google ScholarGoogle ScholarDigital LibraryDigital Library
  63. Robert Escriva, Bernard Wong, and Emin Gün Sirer. 2012. HyperDex: A distributed, searchable key-value store. In Proceedings of the ACM SIGCOMM 2012 Conference on Applications, Technologies, Architectures, and Protocols for Computer Communication (SIGCOMM’12). ACM, New York, NY, 25--36. Google ScholarGoogle ScholarDigital LibraryDigital Library
  64. Robert Escriva, Bernard Wong, and Emin Gün Sirer. 2013. Warp: Multi-key transactions for key value stores. United Networks, LLC, Technical Report (2013).Google ScholarGoogle Scholar
  65. Yuan Feng, Baochun Li, and Bo Li. 2012. Postcard: Minimizing costs on inter-datacenter traffic with store-and-forward. In Proceedings of the 32nd International Conference on Distributed Computing Systems Workshops (ICDCSW’12). IEEE Computer Society, Washington, DC, 43--50. Google ScholarGoogle ScholarDigital LibraryDigital Library
  66. Ilir Fetai and Heiko Schuldt. 2012. Cost-based data consistency in a data-as-a-service cloud environment. In Proceedings of the IEEE 5th International Conference on Cloud Computing (CLOUD’12). IEEE Computer Society, Washington, DC, 526--533. Google ScholarGoogle ScholarDigital LibraryDigital Library
  67. Shai Fine, Yoram Singer, and Naftali Tishby. 1998. The hierarchical hidden Markov model: Analysis and applications. Mach. Learn. 32, 1 (July 1998), 41--62. Google ScholarGoogle ScholarDigital LibraryDigital Library
  68. Daniel Ford, François Labelle, Florentina I. Popovici, Murray Stokely, Van-Anh Truong, Luiz Barroso, Carrie Grimes, and Sean Quinlan. 2010. Availability in globally distributed storage systems. In Proceedings of the 9th USENIX Symposium on Operating Systems Design and Implementation. 61--74. Google ScholarGoogle ScholarDigital LibraryDigital Library
  69. Seth Gilbert and Nancy Lynch. 2002. Brewer’s conjecture and the feasibility of consistent, available, partition-tolerant web services. SIGACT News 33, 2 (June 2002), 51--59. Google ScholarGoogle ScholarDigital LibraryDigital Library
  70. Jim Gray and Leslie Lamport. 2006. Consensus on transaction commit. ACM Trans. Data. Syst. 31, 1 (March 2006), 133--160. Google ScholarGoogle ScholarDigital LibraryDigital Library
  71. Rachid Guerraoui and André Schiper. 2001. Genuine atomic multicast in asynchronous distributed systems. Theor. Comput. Sci. 254, 1--2 (March 2001), 297--316. Google ScholarGoogle ScholarDigital LibraryDigital Library
  72. Jian Guo, Fangming Liu, Xiaomeng Huang, John C. S. Lui, Mi Hu, Qiao Gao, and Hai Jin. 2014. On efficient bandwidth allocation for traffic variability in datacenters. In Proceedings of the IEEE Conference on Computer Communications (INFOCOM’14). 1572--1580.Google ScholarGoogle ScholarCross RefCross Ref
  73. R. C. Hansdah and L. M. Patnaik. 1986. Proceedings of the International Conference on Database Theory (ICDT’86). Springer, Berlin, 171--185. Google ScholarGoogle ScholarDigital LibraryDigital Library
  74. Zach Hill and Marty Humphrey. 2010. CSAL: A cloud storage abstraction layer to enable portable cloud applications. In Proceedings of the 2nd IEEE International Conference on Cloud Computing Technology and Scie (CLOUDCOM’10). IEEE Computer Society, Washington, DC, 504--511. Google ScholarGoogle ScholarDigital LibraryDigital Library
  75. Cheng Huang, Huseyin Simitci, Yikang Xu, Aaron Ogus, Brad Calder, Parikshit Gopalan, Jin Li, and Sergey Yekhanin. 2012. Erasure coding in windows azure storage. In Proceedings of the USENIX Conference on Annual Technical Conference (USENIX ATC’12). USENIX Association, Berkeley, CA, 15--26. Google ScholarGoogle ScholarDigital LibraryDigital Library
  76. Anil K. Jain and Richard C. Dubes. 1988. Algorithms for Clustering Data. Prentice-Hall, Inc., Upper Saddle River, NJ. Google ScholarGoogle ScholarDigital LibraryDigital Library
  77. Sushant Jain, Alok Kumar, Subhasree Mandal, Joon Ong, Leon Poutievski, Arjun Singh, Subbaiah Venkata, Jim Wanderer, Junlan Zhou, Min Zhu, Jon Zolla, Urs Hölzle, Stephen Stuart, and Amin Vahdat. 2013. B4: Experience with a globally deployed software defined wan. SIGCOMM Comput. Commun. Rev. 43, 4 (Aug. 2013), 3--14. Google ScholarGoogle ScholarDigital LibraryDigital Library
  78. Vimalkumar Jeyakumar, Mohammad Alizadeh, David Mazières, Balaji Prabhakar, Changhoon Kim, and Albert Greenberg. 2013. EyeQ: Practical network performance isolation at the edge. In Proceedings of the 10th USENIX Conference on Networked Systems Design and Implementation (NSDI’13). USENIX Association, Berkeley, CA, 297--312. Google ScholarGoogle ScholarDigital LibraryDigital Library
  79. Lei Jiao, Jun Li, Tianyin Xu, Wei Du, and Xiaoming Fu. 2016. Optimizing cost for online social networks on geo-distributed clouds. IEEE/ACM Trans. Netw. 24, 1 (Feb. 2016), 99--112. Google ScholarGoogle ScholarDigital LibraryDigital Library
  80. Joarder Kamal, Manzur Murshed, and Rajkumar Buyya. 2016. Workload-aware incremental repartitioning of shared-nothing distributed databases for scalable OLTP applications. Future Gen. Comput. Syst. 56, C (March 2016), 421--435. Google ScholarGoogle ScholarDigital LibraryDigital Library
  81. Osama Khan, Randal C. Burns, James S. Plank, William Pierce, and Cheng Huang. 2012. Rethinking erasure codes for cloud file systems: Minimizing I/O for recovery and degraded reads. In Proceedings of the 10th USENIX Conference on File and Storage Technologies (FAST’12). Google ScholarGoogle ScholarDigital LibraryDigital Library
  82. Tadeusz Kobus, Maciej Kokocinski, and Pawel T. Wojciechowski. 2013. Hybrid replication: State-machine-based and deferred-update replication schemes combined. In Proceedings of the 33rd IEEE International Conference on Distributed Computing Systems (ICDCS’13). IEEE Computer Society, Washington, DC, 286--296. Google ScholarGoogle ScholarDigital LibraryDigital Library
  83. Ramakrishna Kotla, Lorenzo Alvisi, and Mike Dahlin. 2007. SafeStore: A durable and practical storage system. In Proceedings of the USENIX Annual Technical Conference (ATC’07). USENIX Association, Berkeley, CA, Article 10, 10:1--10:14. Google ScholarGoogle ScholarDigital LibraryDigital Library
  84. Tim Kraska, Martin Hentschel, Gustavo Alonso, and Donald Kossmann. 2009. Consistency rationing in the cloud: Pay only when it matters. Proc. VLDB Endow. 1 (Aug. 2009), 253--264. Google ScholarGoogle ScholarDigital LibraryDigital Library
  85. Diego Kreutz, Fernando M. V. Ramos, Paulo Veríssimo, Christian Esteve Rothenberg, Siamak Azodolmolky, and Steve Uhlig. 2015. Software-defined networking: A comprehensive survey. Pract. Proc. IEEE 103, 1 (2015), 14--76.Google ScholarGoogle ScholarCross RefCross Ref
  86. K. Ashwin Kumar, Abdul Quamar, Amol Deshpande, and Samir Khuller. 2014. SWORD: Workload-aware data placement and replica selection for cloud data management systems. VLDB J. 23, 6 (2014), 845--870. Google ScholarGoogle ScholarDigital LibraryDigital Library
  87. H. T. Kung and John T. Robinson. 1981. On optimistic methods for concurrency control. ACM Trans. Data. Syst. 2 (June 1981), 213--226. Google ScholarGoogle ScholarDigital LibraryDigital Library
  88. Avinash Lakshman and Prashant Malik. 2010. Cassandra: A decentralized structured storage system. SIGOPS Oper. Syst. Rev. 44, 2 (April 2010), 35--40. Google ScholarGoogle ScholarDigital LibraryDigital Library
  89. Leslie Lamport. 1998. The part-time parliament. ACM Trans. Comput. Syst. 16, 2 (May 1998), 133--169. Google ScholarGoogle ScholarDigital LibraryDigital Library
  90. Leslie Lamport. 2005. Generalized Consensus and Paxos. Technical Report MSR-TR-2005-33. Microsoft Research. 60 pages.Google ScholarGoogle Scholar
  91. Leslie Lamport. 2006. Fast paxos. Distrib. Comput. 19, 2 (2006), 79--103.Google ScholarGoogle ScholarDigital LibraryDigital Library
  92. Jeongkeun Lee, Yoshio Turner, Myungjin Lee, Lucian Popa, Sujata Banerjee, Joon-Myung Kang, and Puneet Sharma. 2014. Application-driven bandwidth guarantees in datacenters. SIGCOMM Comput. Commun. Rev. 44, 4 (Aug. 2014), 467--478. Google ScholarGoogle ScholarDigital LibraryDigital Library
  93. Cheng Li, João Leitão, Allen Clement, Nuno Preguiça, Rodrigo Rodrigues, and Viktor Vafeiadis. 2014. Automating the choice of consistency levels in replicated systems. In Proceedings of the 2014 USENIX Conference on USENIX Annual Technical Conference (USENIX ATC’14). USENIX Association, Berkeley, CA, 281--292. Google ScholarGoogle ScholarDigital LibraryDigital Library
  94. Cheng Li, Daniel Porto, Allen Clement, Johannes Gehrke, Nuno Preguiça, and Rodrigo Rodrigues. 2012. Making geo-replicated systems fast as possible, consistent when necessary. In Proceedings of the 10th USENIX Conference on Operating Systems Design and Implementation (OSDI’12). USENIX Association, Berkeley, CA, 265--278. Google ScholarGoogle ScholarDigital LibraryDigital Library
  95. Jialin Li, Ellis Michael, Naveen Kr. Sharma, Adriana Szekeres, and Dan R. K. Ports. 2016. Just say NO to paxos overhead: Replacing consensus with network ordering. In Proceedings of the 12th USENIX Symposium on Operating Systems Design and Implementation (OSDI’16). USENIX Association, 467--483. Google ScholarGoogle ScholarDigital LibraryDigital Library
  96. Mingqiang Li, Chuan Qin, and Patrick P. C. Lee. 2015. CDStore: Toward reliable, secure, and cost-efficient cloud storage via convergent dispersal. In Proceedings of the USENIX Annual Technical Conference (USENIX ATC’15). USENIX Association, Santa Clara, CA, 111--124. Google ScholarGoogle ScholarDigital LibraryDigital Library
  97. Wenhao Li, Yun Yang, Jinjun Chen, and Dong Yuan. 2012. A cost-effective mechanism for cloud data reliability management based on proactive replica checking. In Proceedings of the 12th IEEE/ACM International Symposium on Cluster, Cloud and Grid Computing (CCGRID’12). IEEE Computer Society, Washington, DC, 564--571. Google ScholarGoogle ScholarDigital LibraryDigital Library
  98. Wenhao Li, Yun Yang, and Dong Yuan. 2011. A novel cost-effective dynamic data replication strategy for reliability in cloud data centres. In Proceedings of the 9th IEEE International Conference on Dependable, Autonomic and Secure Computing (DASC’11). IEEE Computer Society, Washington, DC, 496--502. Google ScholarGoogle ScholarDigital LibraryDigital Library
  99. Guanfeng Liang and Ulaş C. Kozat. 2014. Fast cloud: Pushing the envelope on delay performance of cloud storage with coding. IEEE/ACM Trans. Netw. 22, 6 (Dec. 2014), 2012--2025.
  100. G. Liang and U. C. Kozat. 2016. On throughput-delay optimal access to storage clouds via load adaptive coding and chunking. IEEE/ACM Trans. Netw. 24, 4 (Aug 2016), 2168--2181.
  101. G. Liu, H. Shen, and H. Chandler. 2013. Selective data replication for online social networks with distributed datacenters. In Proceedings of the 21st IEEE International Conference on Network Protocols (ICNP’13). 1--10.
  102. J. Liu and H. Shen. 2016. A low-cost multi-failure resilient replication scheme for high data availability in cloud storage. In IEEE 23rd International Conference on High Performance Computing (HiPC’16). 242--251.
  103. Wyatt Lloyd, Michael J. Freedman, Michael Kaminsky, and David G. Andersen. 2011. Don’t settle for eventual: Scalable causal consistency for wide-area storage with COPS. In Proceedings of the Twenty-Third ACM Symposium on Operating Systems Principles (SOSP’11). ACM, New York, NY, 401--416.
  104. Wyatt Lloyd, Michael J. Freedman, Michael Kaminsky, and David G. Andersen. 2013. Stronger semantics for low-latency geo-replicated storage. In Proceedings of the 10th USENIX Conference on Networked Systems Design and Implementation (NSDI’13). USENIX Association, Berkeley, CA, 313--328.
  105. D. Lomet. 1996. Replicated indexes for distributed data. In Proceedings of the 4th International Conference on Parallel and Distributed Information Systems. 108--119.
  106. Yadi Ma, Thyaga Nandagopal, Krishna P. N. Puttaswamy, and Suman Banerjee. 2013. An ensemble of replication and erasure codes for cloud file systems. In Proceedings of the IEEE INFOCOM. 1276--1284.
  107. Hatem Mahmoud, Faisal Nawab, Alexander Pucher, Divyakant Agrawal, and Amr El Abbadi. 2013. Low-latency multi-datacenter databases using replicated commit. Proc. VLDB Endow. 9 (July 2013), 661--672.
  108. Hatem A. Mahmoud, Vaibhav Arora, Faisal Nawab, Divyakant Agrawal, and Amr El Abbadi. 2014. MaaT: Effective and scalable coordination of distributed transactions in the cloud. Proc. VLDB Endow. 5 (Jan. 2014), 329--340.
  109. Yaser Mansouri and Rajkumar Buyya. 2016. To move or not to move: Cost optimization in a dual cloud-based storage architecture. J. Netw. Comput. Appl. 75 (2016), 223--235.
  110. Yaser Mansouri, Adel Nadjaran Toosi, and Rajkumar Buyya. 2013. Brokering algorithms for optimizing the availability and cost of cloud storage services. In Proceedings of the 5th IEEE International Conference on Cloud Computing Technology and Science (CloudCom’13). 581--589.
  111. Y. Mansouri, A. Nadjaran Toosi, and R. Buyya. 2017. Cost optimization for dynamic replication and migration of data in cloud data centers. IEEE Trans. Cloud Comput. (2017).
  112. Bo Mao, Suzhen Wu, and Hong Jiang. 2016. Exploiting workload characteristics and service diversity to improve the availability of cloud storage systems. IEEE Trans. Parall. Distrib. Syst. 27, 7 (2016), 2010--2021.
  113. John C. McCullough, John Dunagan, Alec Wolman, and Alex C. Snoeren. 2010. Stout: An adaptive interface to scalable cloud storage. In Proceedings of the USENIX Conference on USENIX Annual Technical Conference (USENIXATC’10). USENIX Association, Berkeley, CA, 4--4.
  114. Iulian Moraru, David G. Andersen, and Michael Kaminsky. 2013. There is more consensus in egalitarian parliaments. In Proceedings of the 24th ACM Symposium on Operating Systems Principles (SOSP’13). ACM, New York, NY, 358--372.
  115. Shuai Mu, Yang Cui, Yang Zhang, Wyatt Lloyd, and Jinyang Li. 2014. Extracting more concurrency from distributed transactions. In Proceedings of the 11th USENIX Conference on Operating Systems Design and Implementation (OSDI’14). USENIX Association, Berkeley, CA, 479--494.
  116. Subramanian Muralidhar, Wyatt Lloyd, Sabyasachi Roy, Cory Hill, Ernest Lin, Weiwen Liu, Satadru Pan, Shiva Shankar, Viswanath Sivakumar, Linpeng Tang, and Sanjeev Kumar. 2014. f4: Facebook’s warm BLOB storage system. In Proceedings of the 11th USENIX Symposium on Operating Systems Design and Implementation (OSDI’14). USENIX Association, 383--398.
  117. Maurizio Naldi and Loretta Mastroeni. 2013. Cloud storage pricing: A comparison of current practices. In Proceedings of International Workshop on Hot Topics in Cloud Services (HotTopiCS’13). ACM, New York, NY, 27--34.
  118. Rajesh Nishtala, Hans Fugal, Steven Grimm, Marc Kwiatkowski, Herman Lee, Harry C. Li, Ryan McElroy, Mike Paleczny, Daniel Peek, Paul Saab, David Stafford, Tony Tung, and Venkateshwaran Venkataramani. 2013. Scaling Memcache at Facebook. In Proceedings of the 10th USENIX Symposium on Networked Systems Design and Implementation (NSDI’13). USENIX, 385--398.
  119. Brian M. Oki and Barbara H. Liskov. 1988. Viewstamped replication: A new primary copy method to support highly-available distributed systems. In Proceedings of the Seventh Annual ACM Symposium on Principles of Distributed Computing (PODC’88). ACM, New York, NY, 8--17.
  120. L. Pacheco, D. Sciascia, and F. Pedone. 2014. Parallel deferred update replication. In Proceedings of the IEEE 13th International Symposium on Network Computing and Applications. 205--212.
  121. Fernando Pedone, Matthias Wiesmann, André Schiper, Bettina Kemme, and Gustavo Alonso. 2000. Understanding replication in databases and distributed systems. In Proceedings of the 20th International Conference on Distributed Computing Systems. 464--474.
  122. Sebastiano Peluso, Pedro Ruivo, Paolo Romano, Francesco Quaglia, and Luis Rodrigues. 2012. When scalability meets consistency: Genuine multiversion update-serializable partial data replication. In Proceedings of the 2012 IEEE 32nd International Conference on Distributed Computing Systems (ICDCS’12). IEEE Computer Society, Washington, DC, 455--465.
  123. Lucian Popa, Praveen Yalagandula, Sujata Banerjee, Jeffrey C. Mogul, Yoshio Turner, and Jose Renato Santos. 2013. ElasticSwitch: Practical work-conserving bandwidth guarantees for cloud computing. In Proceedings of the ACM SIGCOMM Conference on SIGCOMM (SIGCOMM’13). ACM, New York, NY, 351--362.
  124. Dan R. K. Ports, Jialin Li, Vincent Liu, Naveen Kr. Sharma, and Arvind Krishnamurthy. 2015. Designing distributed systems using approximate synchrony in data center networks. In Proceedings of the 12th USENIX Symposium on Networked Systems Design and Implementation (NSDI’15). USENIX Association, Oakland, CA, 43--57.
  125. Josep M. Pujol, Vijay Erramilli, Georgos Siganos, Xiaoyuan Yang, Nikos Laoutaris, Parminder Chhabra, and Pablo Rodriguez. 2010. The little engine(s) that could: Scaling online social networks. In Proceedings of the ACM SIGCOMM Conference (SIGCOMM’10). ACM, New York, NY, 375--386.
  126. Krishna P. N. Puttaswamy, Thyaga Nandagopal, and Murali Kodialam. 2012. Frugal storage for cloud file systems. In Proceedings of the 7th ACM European Conference on Computer Systems (EuroSys’12). ACM, New York, NY, 71--84.
  127. K. V. Rashmi, Nihar B. Shah, Dikang Gu, Hairong Kuang, Dhruba Borthakur, and Kannan Ramchandran. 2014. A “Hitchhiker’s” guide to fast and efficient data reconstruction in erasure-coded data centers. SIGCOMM Comput. Commun. Rev. 4 (Aug. 2014), 331--342.
  128. K. V. Rashmi, N. B. Shah, and P. V. Kumar. 2011. Optimal exact-regenerating codes for distributed storage at the MSR and MBR points via a product-matrix construction. IEEE Trans. Info. Theory 57, 8 (Aug 2011), 5227--5239.
  129. Ron Roth. 2006. Introduction to Coding Theory. Cambridge University Press, New York, NY.
  130. Sudip Roy, Lucja Kot, Gabriel Bender, Bailu Ding, Hossein Hojjat, Christoph Koch, Nate Foster, and Johannes Gehrke. 2015. The homeostasis protocol: Avoiding transaction coordination through program analysis. In Proceedings of the 2015 ACM SIGMOD International Conference on Management of Data (SIGMOD’15). ACM, New York, NY, 1311--1326.
  131. Stephen M. Rumble, Ankita Kejriwal, and John Ousterhout. 2014. Log-structured memory for DRAM-based storage. In Proceedings of the 12th USENIX Conference on File and Storage Technologies (FAST’14). USENIX Association, Berkeley, CA, 1--16.
  132. Yasushi Saito and Marc Shapiro. 2005. Optimistic replication. Comput. Surveys 37, 1 (March 2005), 42--81.
  133. Sherif Sakr. 2014. Cloud-hosted databases: Technologies, challenges and opportunities. Cluster Comput. 17, 2 (June 2014), 487--502.
  134. S. Sakr, A. Liu, D. M. Batista, and M. Alomari. 2011. A survey of large scale data management approaches in cloud environments. IEEE Commun. Surveys Tutor. 13, 3 (2011), 311--336.
  135. Maheswaran Sathiamoorthy, Megasthenis Asteris, Dimitris Papailiopoulos, Alexandros G. Dimakis, Ramkumar Vadali, Scott Chen, and Dhruba Borthakur. 2013. XORing elephants: Novel erasure codes for big data. Proc. VLDB Endow. 5 (March 2013), 325--336.
  136. N. Schiper, P. Sutra, and F. Pedone. 2009. Genuine versus non-genuine atomic multicast protocols for wide area networks: An empirical study. In Proceedings of the 28th IEEE International Symposium on Reliable Distributed Systems. 166--175.
  137. Nicolas Schiper, Pierre Sutra, and Fernando Pedone. 2010. P-store: Genuine partial replication in wide area networks. In Proceedings of the 2010 29th IEEE Symposium on Reliable Distributed Systems (SRDS’10). IEEE Computer Society, Washington, DC, 214--224.
  138. Daniele Sciascia, Fernando Pedone, and Flavio Junqueira. 2012. Scalable deferred update replication. In Proceedings of the 42nd Annual IEEE/IFIP International Conference on Dependable Systems and Networks (DSN’12). IEEE Computer Society, Washington, DC, 1--12.
  139. Steven S. Seiden. 2000. A guessing game and randomized online algorithms. In Proceedings of the 32nd Annual ACM Symposium on Theory of Computing (STOC’00). ACM, New York, NY, 592--601.
  140. Nihar B. Shah, Kangwook Lee, and Kannan Ramchandran. 2014. The MDS queue: Analysing the latency performance of erasure codes. In Proceedings of the IEEE International Symposium on Information Theory. 861--865.
  141. Marc Shapiro, Nuno Preguiça, Carlos Baquero, and Marek Zawirski. 2011. Conflict-free replicated data types. In Proceedings of the 13th International Conference on Stabilization, Safety, and Security of Distributed Systems (SSS’11). Springer-Verlag, Berlin, Heidelberg, 386--400.
  142. Artyom Sharov, Alexander Shraer, Arif Merchant, and Murray Stokely. 2015. Take me to your leader! Online optimization of distributed storage configurations. Proc. VLDB Endow. 8, 12 (Aug. 2015), 1490--1501.
  143. Dennis Shasha, Francois Llirbat, Eric Simon, and Patrick Valduriez. 1995. Transaction chopping: Algorithms and performance studies. ACM Trans. Database Syst. 3 (Sept. 1995), 325--363.
  144. M. Shen, A. D. Kshemkalyani, and T. Y. Hsu. 2015. Causal consistency for geo-replicated cloud storage under partial replication. In Proceedings of the IEEE International Parallel and Distributed Processing Symposium Workshop (IPDPSW’15). 509--518.
  145. Alan Shieh, Srikanth Kandula, Albert Greenberg, Changhoon Kim, and Bikas Saha. 2011. Sharing the data center network. In Proceedings of the 8th USENIX Conference on Networked Systems Design and Implementation (NSDI’11). USENIX Association, Berkeley, CA, 309--322.
  146. Youngjoo Shin, Dongyoung Koo, and Junbeom Hur. 2017. A survey of secure data deduplication schemes for cloud storage systems. Comput. Surveys 49, 4, Article 74 (Jan. 2017), 441--446.
  147. David Shue, Michael J. Freedman, and Anees Shaikh. 2012. Performance isolation and fairness for multi-tenant cloud storage. In Proceedings of the 10th USENIX Conference on Operating Systems Design and Implementation (OSDI’12). USENIX Association, Berkeley, CA, 349--362.
  148. Jeff Shute, Radek Vingralek, Bart Samwel, Ben Handy, Chad Whipkey, Eric Rollins, Mircea Oancea, Kyle Littlefield, David Menestrina, Stephan Ellner, John Cieslewicz, Ian Rae, Traian Stancescu, and Himani Apte. 2013. F1: A distributed SQL database that scales. Proc. VLDB Endow. 11 (Aug. 2013), 1068--1079.
  149. K. C. Sivaramakrishnan, Gowtham Kaki, and Suresh Jagannathan. 2015. Declarative programming over eventually consistent data stores. SIGPLAN Not. 50, 6 (June 2015), 413--424.
  150. Yair Sovran, Russell Power, Marcos K. Aguilera, and Jinyang Li. 2011. Transactional storage for geo-replicated systems. In Proceedings of the 23rd ACM Symposium on Operating Systems Principles (SOSP’11). ACM, New York, NY, 385--400.
  151. J. Spillner, G. Bombach, S. Matthischke, J. Muller, R. Tzschichholz, and A. Schill. 2011. Information dispersion over redundant arrays of optimal cloud storage for desktop users. In Proceedings of the 4th IEEE International Conference on Utility and Cloud Computing (UCC’11). 1--8.
  152. Christopher Stewart, Aniket Chakrabarti, and Rean Griffith. 2013. Zoolander: Efficiently meeting very strict, low-latency SLOs. In Proceedings of the 10th International Conference on Autonomic Computing (ICAC’13). USENIX, San Jose, CA, 265--277.
  153. C. Suh and K. Ramchandran. 2011. Exact-repair MDS code construction using interference alignment. IEEE Transactions on Information Theory 57, 3 (March 2011), 1425--1442.
  154. Lalith Suresh, Marco Canini, Stefan Schmid, and Anja Feldmann. 2015. C3: Cutting tail latency in cloud data stores via adaptive replica selection. In Proceedings of the 12th USENIX Conference on Networked Systems Design and Implementation (NSDI’15). USENIX Association, Berkeley, CA, 513--527.
  155. Andrew S. Tanenbaum and Maarten van Steen. 2006. Distributed Systems: Principles and Paradigms (2nd ed.). Prentice-Hall, Inc.
  156. J. Tang, X. Tang, and J. Yuan. 2015. Optimizing inter-server communication for online social networks. In Proceedings of the IEEE 35th International Conference on Distributed Computing Systems. 215--224.
  157. Jeff Terrace and Michael J. Freedman. 2009. Object storage on CRAQ: High-throughput chain replication for read-mostly workloads. In Proceedings of the 2009 Conference on USENIX Annual Technical Conference (USENIX’09). USENIX Association, Berkeley, CA, 11--11.
  158. Douglas B. Terry, Vijayan Prabhakaran, Ramakrishna Kotla, Mahesh Balakrishnan, Marcos K. Aguilera, and Hussam Abu-Libdeh. 2013. Consistency-based service level agreements for cloud storage. In Proceedings of the 24th ACM Symposium on Operating Systems Principles (SOSP’13). ACM, New York, NY, 309--324.
  159. Robert H. Thomas. 1979. A majority consensus approach to concurrency control for multiple copy databases. ACM Trans. Database Syst. 2 (June 1979), 180--209.
  160. Alexander Thomson, Thaddeus Diamond, Shu-Chun Weng, Kun Ren, Philip Shao, and Daniel J. Abadi. 2012. Calvin: Fast distributed transactions for partitioned database systems. In Proceedings of the 2012 ACM SIGMOD International Conference on Management of Data (SIGMOD’12). ACM, New York, NY, 1--12.
  161. A. Thusoo, J. S. Sarma, N. Jain, Z. Shao, P. Chakka, N. Zhang, S. Antony, H. Liu, and R. Murthy. 2010. Hive - A petabyte scale data warehouse using Hadoop. In Proceedings of the 26th IEEE International Conference on Data Engineering (ICDE’10). 996--1005.
  162. Duc A. Tran, Khanh Nguyen, and Cuong Pham. 2012. S-CLONE: Socially-aware data replication for social networks. Comput. Netw. 56, 7 (2012), 2001--2013.
  163. Nguyen Tran, Marcos K. Aguilera, and Mahesh Balakrishnan. 2011. Online migration for geo-distributed storage systems. In Proceedings of the USENIX Conference on USENIX Annual Technical Conference (USENIXATC’11). USENIX Association, Berkeley, CA, 15--15.
  164. Balajee Vamanan, Jahangir Hasan, and T. N. Vijaykumar. 2012. Deadline-aware datacenter TCP (D2TCP). In Proceedings of the ACM SIGCOMM Conference on Applications, Technologies, Architectures, and Protocols for Computer Communication (SIGCOMM’12). ACM, New York, NY, 115--126.
  165. Robbert van Renesse and Fred B. Schneider. 2004. Chain replication for supporting high throughput and availability. In Proceedings of the 6th Conference on Symposium on Operating Systems Design & Implementation - Volume 6 (OSDI’04). USENIX Association, Berkeley, CA, 7.
  166. Srikumar Venugopal, Rajkumar Buyya, and Kotagiri Ramamohanarao. 2006. A taxonomy of data grids for distributed data sharing, management, and processing. ACM Comput. Surv. 38, 1, Article 3 (June 2006), 1--53.
  167. Hoang Tam Vo, Sheng Wang, Divyakant Agrawal, Gang Chen, and Beng Chin Ooi. 2012. LogBase: A scalable log-structured database system in the cloud. Proc. VLDB Endow. 5, 10 (June 2012), 1004--1015.
  168. Hiroshi Wada, Alan Fekete, Liang Zhao, Kevin Lee, and Anna Liu. 2011. Data consistency properties and the trade-offs in commercial cloud storages: The consumers’ perspective. In Proceedings of the 5th Biennial Conference on Innovative Data Systems Research (CIDR’11). 134--143.
  169. Ting Wang, Zhiyang Su, Yu Xia, and M. Hamdi. 2014. Rethinking the data center networking: Architecture, network protocols, and resource sharing. IEEE Access (2014).
  170. Wei Wang, Baochun Li, and Ben Liang. 2013. To reserve or not to reserve: Optimal online multi-instance acquisition in IaaS clouds. In Proceedings of the 10th International Conference on Autonomic Computing (ICAC’13). 13--22.
  171. Yunnan Wu, Alexandros G. Dimakis, and Kannan Ramchandran. 2007. Deterministic regenerating codes for distributed storage. In Proceedings of the Allerton Conference on Control, Computing, and Communication. 1--5.
  172. Yu Wu, Chuan Wu, Bo Li, Linquan Zhang, Zongpeng Li, and Francis C. M. Lau. 2015. Scaling social media applications into geo-distributed clouds. IEEE/ACM Trans. Netw. 23, 3 (June 2015), 689--702.
  173. Yu Wu, Zhizhong Zhang, Chuan Wu, Chuanxiong Guo, Zongpeng Li, and Francis C. M. Lau. 2017. Orchestrating bulk data transfers across geo-distributed datacenters. IEEE Trans. Cloud Comput. 5, 1 (2017), 112--125.
  174. Zhe Wu, Michael Butkiewicz, Dorian Perkins, Ethan Katz-Bassett, and Harsha V. Madhyastha. 2013. SPANStore: Cost-effective geo-replicated storage spanning multiple cloud services. In Proceedings of the 24th ACM Symposium on Operating Systems Principles (SOSP’13). ACM, New York, NY, 292--308.
  175. Mingyuan Xia, Mohit Saxena, Mario Blaum, and David A. Pease. 2015. A tale of two erasure codes in HDFS. In Proceedings of the 13th USENIX Conference on File and Storage Technologies (FAST’15). USENIX Association, Berkeley, CA, 213--226.
  176. Yu Xiang, Tian Lan, Vaneet Aggarwal, and Yih Farn R. Chen. 2014. Joint latency and cost optimization for erasure-coded data center storage. SIGMETRICS Perform. Eval. Rev. 2 (Sept. 2014), 3--14.
  177. Di Xie, Ning Ding, Y. Charlie Hu, and Ramana Kompella. 2012. The only constant is change: Incorporating time-varying network reservations in data centers. In Proceedings of the ACM SIGCOMM Conference on Applications, Technologies, Architectures, and Protocols for Computer Communication (SIGCOMM’12). ACM, New York, NY, 199--210.
  178. Boyang Yu and Jianping Pan. 2015. Location-aware associated data placement for geo-distributed data-intensive applications. In Proceedings of the IEEE Conference on Computer Communications (INFOCOM’15). 603--611.
  179. Haifeng Yu and Amin Vahdat. 2002. Design and evaluation of a conit-based continuous consistency model for replicated services. ACM Trans. Comput. Syst. 20, 3 (Aug. 2002), 239--282.
  180. Wenying Zeng, Yuelong Zhao, Kairi Ou, and Wei Song. 2009. Research on cloud storage architecture and key technologies. In Proceedings of the 2nd International Conference on Interaction Sciences: Information Technology, Culture and Human (ICIS’09). ACM, New York, NY, 1044--1048.
  181. Hong Zhang, Kai Chen, Wei Bai, Dongsu Han, Chen Tian, Hao Wang, Haibing Guan, and Ming Zhang. 2015. Guaranteeing deadlines for inter-datacenter transfers. In Proceedings of the 10th European Conference on Computer Systems (EuroSys’15). ACM, New York, NY, Article 20, 14 pages.
  182. Irene Zhang, Naveen Kr. Sharma, Adriana Szekeres, Arvind Krishnamurthy, and Dan R. K. Ports. 2015. Building consistent transactions with inconsistent replication. In Proceedings of the 25th Symposium on Operating Systems Principles (SOSP’15). ACM, New York, NY, 263--278.
  183. Q. Zhang, S. Li, Z. Li, Y. Xing, Z. Yang, and Y. Dai. 2015. CHARM: A cost-efficient multi-cloud data hosting scheme with high availability. IEEE Trans. Cloud Comput. 3, 3 (July 2015), 372--386.
  184. X. Zhang, C. Liu, S. Nepal, S. Pandey, and J. Chen. 2013a. A privacy leakage upper bound constraint-based approach for cost-effective privacy preserving of intermediate data sets in cloud. IEEE Trans. Parall. Distrib. Syst. 24, 6 (June 2013), 1192--1202.
  185. Yang Zhang, Russell Power, Siyuan Zhou, Yair Sovran, Marcos K. Aguilera, and Jinyang Li. 2013b. Transaction chains: Achieving serializability with low latency in geo-distributed storage systems. In Proceedings of ACM SIGOPS 24th Symposium on Operating Systems Principles (SOSP’13). 276--291.
  186. Zhe Zhang, Amey Deshpande, Xiaosong Ma, Eno Thereska, and Dushyanth Narayanan. 2010. Does erasure coding have a role to play in my data center? Microsoft Res. MSR-TR-2010-52 (2010).
  187. L. Zhao, S. Sakr, and A. Liu. 2015. A framework for consumer-centric SLA management of cloud-hosted databases. IEEE Trans. Services Comput. 8, 4 (July 2015), 534--549.
  188. Jingya Zhou, Jianxi Fan, Jin Wang, Baolei Cheng, and Juncheng Jia. 2017. Towards traffic minimization for data placement in online social networks. Concurr. Comput.: Practice Exp. 29, 6 (2017).

              Published in

              ACM Computing Surveys, Volume 50, Issue 6
              November 2018
              752 pages
              ISSN: 0360-0300
              EISSN: 1557-7341
              DOI: 10.1145/3161158
              Editor: Sartaj Sahni

              Copyright © 2017 ACM

              Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

              Publisher

              Association for Computing Machinery

              New York, NY, United States

              Publication History

              • Published: 11 December 2017
              • Revised: 1 August 2017
              • Accepted: 1 August 2017
              • Received: 1 October 2016
              Qualifiers

              • survey
              • Research
              • Refereed
