Skip to main content
Log in

Availability-Aware Virtual Resource Provisioning for Infrastructure Service Agreements in the Cloud

  • Published:
Information Systems Frontiers Aims and scope Submit manuscript

Abstract

Service availability is a key construct in Service Level Agreements (SLA) between a cloud service provider and a client. The provider typically allocates backup resources to mitigate the risk of violating the SLA-specified uptime guarantee. However, initial backups may need to be adjusted in response to real-time failure and recovery events. In this study, we first develop a recurrent intervention at fixed intervals (RIFI) strategy that allows the provider to adjust the allocation of backup resources such that the expected total cost is minimized. Next, we focus on the limit to number of interventions, starting from single intervention strategy, as frequent reallocations may be operationally disruptive. Particularly, we provide a cost minimization approach to guide the service providers in their virtual resources management, and a specific downtime minimization approach for more mission-critical applications as a more aggressive alternative. We present computational results exploring the impact of intervention on the likelihood of SLA violation for the rest of the contract period, and evaluate parameters such as time and quantum of resource level adjustment, penalty levels desired by clients, and their influences on the backup resource provisioning strategies. We also validate our models through the analysis of use cases from Amazon Elastic Compute Cloud. Finally, we summarize this study by providing key practical managerial implications for resource deployment in the availability-aware cloud.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Fig. 1
Fig. 2
Fig. 3
Fig. 4
Fig. 5

Similar content being viewed by others

Notes

  1. https://aws.amazon.com/ec2/dedicated-hosts/pricing/

  2. http://technet.microsoft.com/en-us/library/bb740891.aspx

References

  • Bruneo, D. (2014). A stochastic model to investigate data center performance and QoS in IaaS cloud computing systems. IEEE Transactions on Parallel and Distributed Systems, 25(3), 560–569.

    Article  Google Scholar 

  • Chase, J., & Niyato, D. (2015). Joint optimization of resource provisioning in cloud computing. IEEE Transactions on Services Computing, 10(3), 396–409.

    Article  Google Scholar 

  • Cisco. (2012). Cisco Global Cloud Networking Survey.

  • Dean, J. (2009) Designs, lessons and advice from building large distributed systems. https://www.cs.cornell.edu/projects/ladis2009/talks/deankeynote-ladis2009.pdf

  • Du, A. Y., Das, S., Yang, Z., Qiao, C., & Ramesh, R. (2015). Predicting transient downtime in virtual server systems: An efficient sample path randomization approach. IEEE Transactions on Computers, 64(12), 3541–3554.

    Article  Google Scholar 

  • Ghosh, R., Longo, F., Frattini, F., Russo, S., & Trivedi, K. S. (2014a). Scalable analytics for IaaS cloud availability. IEEE Transactions on Cloud Computing, 2(1), 57–70.

    Article  Google Scholar 

  • Ghosh, R., Longo, F., Xia, R., Naik, V. K., & Trivedi, K. S. (2014b). Stochastic model driven capacity planning for an infrastructure-as-a-service cloud. IEEE Transactions on Services Computing, 7(4), 667–680.

    Article  Google Scholar 

  • Goudarzi, H., Ghasemazar, M., & Pedram, M. (2012). SLA-based optimization of power and migration cost in cloud computing 2012 12th IEEE/ACM international symposium on cluster, Cloud and Grid Computing, Ottawa, ON, Canada.

  • Guo, Z., Li, J., & Ramesh, R. (2019). Optimal Management of Virtual Infrastructures under flexible cloud service agreements. Information Systems Research, 30(4), 1424–1446.

    Article  Google Scholar 

  • Gutierrez-Garcia, J. O., & Sim, K. M. (2012). GA-based cloud resource estimation for agent-based execution of bag-of-tasks applications. Information Systems Frontiers, 14(4), 925–951.

    Article  Google Scholar 

  • Hassan, M. M., Hossain, M. S., Sarkar, A., & Huh, E.-N. (2014). Cooperative game-based distributed resource allocation in horizontal dynamic cloud federation platform. Information Systems Frontiers, 16(4), 523–542.

    Article  Google Scholar 

  • ITIC (2017) ITIC 2017–2018 global server hardware, server OS reliability report. https://www.ibm.com/downloads/cas/DV0XZV6R#:~:text=ITIC's%202020%20Reliability%20poll%20finds,mission%20critical%20servers%20and%20applications

  • Jammal, M., Kanso, A., Heidari, P., & Shami, A. (2016). A formal model for the availability analysis of cloud deployed multi-tiered applications 2016 IEEE international conference on cloud engineering workshop Berlin, Germany.

  • Jammal, M., Hawilo, H., Kanso, A., & Shami, A. (2018). ACE: Availability-aware CloudSim extension. IEEE Transactions on Network and Service Management, 15(4), 1586–1599.

    Article  Google Scholar 

  • Kauffman, R. J., Ma, D., Shang, R., Huang, J., & Yang, Y. (2015). On the Financification of cloud computing: An agenda for pricing and service delivery mechanism design research. International Journal of Cloud Computing.

  • Mansouri, Y., Toosi, A. N., & Buyya, R. (2019). Cost optimization for dynamic replication and migration of data in cloud data centers. IEEE Transactions on Cloud Computing, 7(3), 705–718.

    Article  Google Scholar 

  • Marques, D., Bronevetsky, G., Fernandes, R., Pingali, K., & Stodghill, P. (2005). Optimizing checkpoint sizes in the C3 system 19th IEEE international parallel and distributed processing symposium, Denver, CO, USA.

  • Martens, B., & Teuteberg, F. (2012). Decision-making in cloud computing environments: A cost and risk based approach. Information Systems Frontiers, 14(4), 871–893.

    Article  Google Scholar 

  • Mateo-Fornés, J., Solsona-Tehàs, F., Vilaplana-Mayoral, J., Teixidó-Torrelles, I., & Rius-Torrentó, J. (2019). CART, a decision SLA model for SaaS providers to keep QoS regarding availability and performance. IEEE Access, 7, 38195–38204.

    Article  Google Scholar 

  • Mell, P., & Grance, T. (2011). The NIST definition of cloud computing.

  • Mistry, S., Bouguettaya, A., Dong, H., & Qin, A. K. (2018). Metaheuristic optimization for long-term IaaS service composition. IEEE Transactions on Services Computing, 11(1), 131–143.

    Article  Google Scholar 

  • Panda, S. K., Gupta, I., & Jana, P. K. (2019). Task scheduling algorithms for multi-cloud systems: Allocation-aware approach. Information Systems Frontiers, 21(2), 241–259.

    Article  Google Scholar 

  • Ran, Y., Yang, J., Zhang, S., & Xi, H. (2017). Dynamic IaaS computing resource provisioning strategy with QoS constraint. IEEE Transactions on Services Computing, 10(2), 190–202.

    Article  Google Scholar 

  • Randal, A. (2020). The ideal versus the real: Revisiting the history of virtual machines and containers. ACM Computing Surveys (CSUR), 53(1), 1–31.

    Article  Google Scholar 

  • Silic, M., Delac, G., Krka, I., Srbljic, S., & J. I. T. o. S. C. (2014). Scalable and accurate prediction of availability of atomic web services. IEEE Transactions on Services Computing, 7(2), 252–264.

    Article  Google Scholar 

  • Singh, S., Chana, I., & Buyya, R. (2017). STAR: SLA-aware autonomic Management of Cloud Resources. IEEE Transactions on Cloud Computing.

  • Smith, J. E., & Nair, R. (2005). The architecture of virtual machines. Computer, 38(5), 32–38.

    Article  Google Scholar 

  • Toosi, A. N., Vanmechelen, K., Ramamohanarao, K., & Buyya, R. (2015). Revenue maximization with optimal capacity control in infrastructure as a service cloud markets. IEEE Transactions on Cloud Computing, 3(3), 261–274.

    Article  Google Scholar 

  • Van, H. N., Tran, F. D., & Menaud, J.-M. (2009). SLA-aware virtual resource Management for Cloud Infrastructures 2009 ninth IEEE international conference on computer and information technology, Xiamen, China.

  • Wang, X., Du, Z., Chen, Y., & Li, S. (2008). Virtualization-based autonomic resource Management for Multi-tier web Applications in shared data center. Journal of Systems and Software, 81(9), 1591–1608.

    Article  Google Scholar 

  • Wu, L., Garg, S. K., Versteeg, S., & Buyya, R. (2014). SLA-based resource provisioning for hosted software-as-a-service applications in cloud computing environments. IEEE Transactions on Services Computing, 7(3), 465–485.

    Article  Google Scholar 

  • Yala, L., Frangoudis, P. A., Lucarelli, G., & Ksentini, A. (2018). Cost and availability aware resource allocation and virtual function placement for CDNaaS provision. IEEE Transactions on Network and Service Management, 15(4), 1334–1348.

    Article  Google Scholar 

  • Yang, J., Liu, C., Shang, Y., Cheng, B., Mao, Z., Liu, C., Niu, L., & Chen, J. (2014). A cost-aware auto-scaling approach using the workload prediction in service clouds. Information Systems Frontiers, 16(1), 7–18.

    Article  Google Scholar 

  • Yuan, S., Das, S., Ramesh, R., & Qiao, C. (2018). Service agreement trifecta: Backup resources, Price and penalty in the availability-aware cloud. Information Systems Research.

Download references

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Shuai Yuan.

Ethics declarations

Conflict of Interest

The authors have no conflicts of interest to declare that are relevant to the content of this article.

Additional information

Publisher’s Note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Rights and permissions

Reprints and permissions

About this article

Check for updates. Verify currency and authenticity via CrossMark

Cite this article

Yuan, S., Das, S., Ramesh, R. et al. Availability-Aware Virtual Resource Provisioning for Infrastructure Service Agreements in the Cloud. Inf Syst Front 25, 1495–1512 (2023). https://doi.org/10.1007/s10796-022-10302-4

Download citation

  • Accepted:

  • Published:

  • Issue Date:

  • DOI: https://doi.org/10.1007/s10796-022-10302-4

Keywords

Navigation