Abstract
Cloud computing has been regarded as an emerging approach to provisioning resources and managing applications. It provides attractive features, such as an on-demand model, scalability enhancement, and management cost reduction. However, cloud computing systems continue to face problems such as hardware failures, overloads caused by unexpected workloads, or the waste of energy due to inefficient resource utilization, which all result in resource shortages and application issues such as delays or saturation. A paradigm, the brownout, has been applied to handle these issues by adaptively activating or deactivating optional parts of applications or services to manage resource usage in cloud computing system. Brownout has successfully shown that it can avoid overloads due to changes in workload and achieve better load balancing and energy saving effects. This article proposes a taxonomy of the brownout approach for managing resources and applications adaptively in cloud computing systems and carries out a comprehensive survey. It identifies open challenges and offers future research directions.
- Frederico Alvares, Gwenaël Delaval, Eric Rutten, and Lionel Seinturier. 2017. Language support for modular autonomic managers in reconfigurable software components. In Proceedings of the 2017 IEEE International Conference on Autonomic Computing. IEEE, 271--278.Google ScholarCross Ref
- Paolo Arcaini, Elvinia Riccobene, and Patrizia Scandurra. 2015. Modeling and analyzing MAPE-K feedback loops for self-adaptation. In Proceedings of the 10th International Symposium on Software Engineering for Adaptive and Self-Managing Systems. 13--23. Google ScholarDigital Library
- Mohammad Sadegh Aslanpour, Mostafa Ghobaei-Arani, and Adel Nadjaran Toosi. 2017. Auto-scaling web applications in clouds. Journal of Network Computer Applications 95 (2017), 26--41. Google ScholarDigital Library
- Anton Beloglazov, Jemal Abawajy, and Rajkumar Buyya. 2012. Energy-aware resource allocation heuristics for efficient management of data centers for cloud computing. Future Generation Computer Systems 28, 5 (2012), 755--768. Google ScholarDigital Library
- Anton Beloglazov and Rajkumar Buyya. 2013. Managing overloaded hosts for dynamic consolidation of virtual machines in cloud data centers under quality of service constraints. IEEE Transactions on Parallel and Distributed Systems 24, 7 (2013), 1366--1379. Google ScholarDigital Library
- Rajkumar Buyya, Rodrigo N Calheiros, Jungmin Son, Amir Vahid Dastjerdi, and Young Yoon. 2014. Software-defined cloud computing: Architectural elements and open challenges. In Proceedings of the 2014 International Conference on Advances in Computing, Communications and Informatics. 1--12.Google ScholarCross Ref
- Rajkumar Buyya, Rajiv Ranjan, and Rodrigo N. Calheiros. 2009. Modeling and simulation of scalable Cloud computing environments and the CloudSim toolkit: Challenges and opportunities. In Proceedings of the International Conference on High Performance Computing and Simulation. 1--11.Google Scholar
- Dazhao Cheng, Jia Rao, Changjun Jiang, and Xiaobo Zhou. 2016. Elastic power-aware resource provisioning of heterogeneous workloads in self-sustainable datacenters. IEEE Transactions on Computer 65, 2 (2016), 508--521. Google ScholarDigital Library
- Rogério De Lemos, Holger Giese, Hausi A. Müller, Mary Shaw, Jesper Andersson, Marin Litoiu, Bradley Schmerl, Gabriel Tamura, Norha M Villegas, Thomas Vogel, et al. 2013. Software engineering for self-adaptive systems: A second research roadmap. In Software Engineering for Self-Adaptive Systems II, Rogério De Lemos, Holger Giese, Hausi A. Müller, and Mary Shaw (Eds.). 1--32.Google Scholar
- David Desmeurs. 2015. Algorithms for Event-Driven Application Brownout. Master Thesis, Umea University.Google Scholar
- David Desmeurs, Cristian Klein, Alessandro Vittorio Papadopoulos, and Johan Tordsson. 2015. Event-driven application brownout: Reconciling high utilization and low tail response times. In Proceedings of the 2015 International Conference on Cloud and Autonomic Computing. 1--12. Google ScholarDigital Library
- Docker. 2017a. Docker Compose file version 3 reference. Retrieved March 27, 2018 from https://docs.docker.com/compose/compose-file/.Google Scholar
- Docker. 2017b. Docker Documentation | Docker Documentation. Retrieved March 27, 2018 from https://docs.docker.com/.Google Scholar
- Docker. 2018. Swarm mode overview | Docker Documentation. Retrieved March 27, 2018 from https://docs.docker.com/engine/swarm/.Google Scholar
- Wanchun Dou, Xiaolong Xu, Shunmei Meng, Xuyun Zhang, Chunhua Hu, Shui Yu, and Jian Yang. 2017. An energy-aware virtual machine scheduling method for service QoS enhancement in clouds over big data. Concurrency and Computation: Practice and Experience 29, 14 (2017), 1--20.Google ScholarCross Ref
- Simon Dupont, Jonathan Lejeune, Frederico Alvares, and Thomas Ledoux. 2015. Experimental analysis on autonomic strategies for cloud elasticity. In Proceedings of the 2015 IEEE International Conference on Cloud and Autonomic Computing. 81--92. Google ScholarDigital Library
- Jonas Dürango, Manfred Dellkrantz, Martina Maggio, Cristian Klein, Alessandro Vittorio Papadopoulos, Francisco Hernández-Rodriguez, Erik Elmroth, and Karl-Erik Årzén. 2014. Control-theoretical load-balancing for cloud applications with brownout. In Proceedings of the 2014 IEEE 53rd Annual Conference on Decision and Control. 5320--5327.Google ScholarCross Ref
- Amazon EC2. 2018. Amazon Web Services. Retrieved March 27, 2018 from https://aws.amazon.com/ec2/.Google Scholar
- Tammy Everts. 2016. Google: 53 longer than 3 seconds to load. Retrieved March 27, 2018 from https://www.soasta.com/blog/google-mobile-web-performance-study/.Google Scholar
- FIFA. 2014. 1998 World Cup Web Site Access Logs - The Internet Traffic Archive. Retrieved March 27, 2018 from http://ita.ee.lbl.gov/html/contrib/WorldCup.html.Google Scholar
- Grid5000. 2017. Grid5000:Home. Retrieved Aril 10, 2018 from https://www.grid5000.fr/mediawiki/index.php/Grid5000:Home.Google Scholar
- M. D. Sabbir Hasan, Frederico Alvares, and Thomas Ledoux. 2017a. GPaaScaler: Green energy aware platform scaler for interactive cloud application. In Proceedings of the 10th ACM International Conference on Utility and Cloud Computing. 79--89. Google ScholarDigital Library
- M. D. Sabbir Hasan, Frederico Alvares, Thomas Ledoux, and Jean-Louis Pazat. 2017b. Investigating energy consumption and performance trade-off for interactive cloud application. IEEE Transactions on Sustainable Computing 2, 2 (2017), 113--126.Google ScholarCross Ref
- M. D. Sabbir Hasan, Frederico Alvares de Oliveira, Thomas Ledoux, and Jean-Louis Pazat. 2016. Enabling green energy awareness in interactive cloud application. In Proceedings of the 2016 IEEE International Conference on Cloud Computing Technology and Science. 414--422.Google Scholar
- Soamar Homsi, Shuo Liu, Gustavo A Chaparro-Baquero, Ou Bai, Shaolei Ren, and Gang Quan. 2017. Workload consolidation for cloud data centers with guaranteed QoS using request reneging. IEEE Transactions on Parallel and Distributed Systems 28, 7 (2017), 2103--2116.Google ScholarDigital Library
- Didac Gil De La Iglesia and Danny Weyns. 2015. MAPE-K formal templates to rigorously design behaviors for self-adaptive systems. ACM Transactions on Autonomous and Adaptive Systems 10, 3 (2015), 1--31. Google ScholarDigital Library
- Neil Irwin. 2013. These 12 technologies will drive our economic future. Retrieved March 27, 2018 from https://www.washingtonpost.com/news/wonk/wp/2013/05/24/these-12-technologies-will-drive-our-economic-future/?utm_term=.e5ad3815c7eb.Google Scholar
- Tarandeep Kaur and Inderveer Chana. 2015. Energy efficiency techniques in cloud computing: A survey and taxonomy. ACM Computing Surveys (CSUR) 48, 2 (2015), 1--46. Google ScholarDigital Library
- Cristian Klein, Martina Maggio, Karl-Erik Årzén, and Francisco Hernández-Rodriguez. 2014a. Brownout: Building more robust cloud applications. In Proceedings of the 36th International Conference on Software Engineering. 700--711. Google ScholarDigital Library
- Cristian Klein, Alessandro Vittorio Papadopoulos, Manfred Dellkrantz, Jonas Dürango, Martina Maggio, Karl-Erik Årzén, Francisco Hernández-Rodriguez, and Erik Elmroth. 2014b. Improving cloud service resilience using brownout-aware load-balancing. In Proceedings of the 2014 IEEE 33rd International Symposium on Reliable Distributed Systems. 31--40. Google ScholarDigital Library
- Manya Koetse. 2017. Weibo servers down after Lu Han announces new relationship. Retrieved March 27, 2018 from https://www.whatsonweibo.com/weibo-servers-lu-han-announces-new-relationship/.Google Scholar
- Kubernetes. 2018. Production-grade container orchestration - Kubernetes. Retrieved June 12, 2018 from https://kubernetes.io/.Google Scholar
- Hongjian Li, Guofeng Zhu, Yuyan Zhao, Yu Dai, and Wenhong Tian. 2017. Energy-efficient and QoS-aware model based resource consolidation in cloud data centers. Cluster Computing (2017), 1--11. Google ScholarDigital Library
- Zhenhua Liu, Minghong Lin, Adam Wierman, Steven Low, and Lachlan LH Andrew. 2015. Greening geographical load balancing. IEEE/ACM Transactions on Networking 23, 2 (2015), 657--671. Google ScholarDigital Library
- Tania Lorido-Botran, Jose Miguel-Alonso, and Jose A. Lozano. 2014. A review of auto-scaling techniques for elastic applications in cloud environments. Journal of Grid Computing 12, 4 (2014), 559--592. Google ScholarDigital Library
- Martina Maggio, Cristian Klein, and Karl-Erik Årzén. 2014. Control strategies for predictable brownouts in cloud computing. IFAC Proceedings Volumes 47, 3 (2014), 689--694.Google ScholarCross Ref
- Yaser Mansouri, Adel Nadjaran Toosi, and Rajkumar Buyya. 2017. Data storage management in cloud environments: Taxonomy, survey, and future directions. ACM Computing Surveys (CSUR) 50, 6 (2017), 1--51. Google ScholarDigital Library
- Toni Mastelic, Ariel Oleksiak, Holger Claussen, Ivona Brandic, Jean-Marc Pierson, and Athanasios V. Vasilakos. 2015. Cloud computing: Survey on energy efficiency. ACM Computing Surveys (CSUR) 47, 2 (2015), 1--36. Google ScholarDigital Library
- Peter Mell, Tim Grance, et al. 2011. The NIST definition of cloud computing. (2011).Google Scholar
- Mesos. 2018. Apache Mesos. Retrieved June 12, 2018 from http://mesos.apache.org/.Google Scholar
- Alireza Sadeghi Milani and Nima Jafari Navimipour. 2016a. Load balancing mechanisms and techniques in the cloud environments: Systematic literature review and future trends. Journal of Network and Computer Applications 71 (2016), 86--98. Google ScholarDigital Library
- Bahareh Alami Milani and Nima Jafari Navimipour. 2016b. A comprehensive review of the data replication techniques in the cloud environments: Major trends and future directions. Journal of Network and Computer Applications 64 (2016), 229--238. Google ScholarDigital Library
- Gabriel A. Moreno. 2017. Adaptation Timing in Self-Adaptive Systems. PhD Thesis, Carnegie Mellon University.Google Scholar
- Gabriel A. Moreno, Javier Cámara, David Garlan, and Bradley Schmerl. 2015. Proactive self-adaptation under uncertainty: A probabilistic model checking approach. In Proceedings of the 2015 10th Joint Meeting on Foundations of Software Engineering. 1--12. Google ScholarDigital Library
- Gabriel A. Moreno, Javier Cámara, David Garlan, and Bradley Schmerl. 2016. Efficient decision-making under uncertainty for proactive self-adaptation. In Proceedings of the 2016 IEEE International Conference on Autonomic Computing. 147--156.Google ScholarCross Ref
- Vladimir Nikolov, Steffen Kächele, Franz J. Hauck, and Dieter Rautenbach. 2014. Cloudfarm: An elastic cloud platform with flexible and adaptive resource management. In Proceedings of the 2014 IEEE/ACM 7th International Conference on Utility and Cloud Computing. 547--553. Google ScholarDigital Library
- OpenStack. 2018. Open source software for creating private and public clouds. Retrieved March 27, 2018 from https://www.openstack.org/.Google Scholar
- Ashutosh Pandey, Gabriel A. Moreno, Javier Cámara, and David Garlan. 2016. Hybrid planning for decision making in self-adaptive systems. In Proceedings of the 2016 IEEE 10th International Conference on Self-Adaptive and Self-Organizing Systems. 130--139.Google ScholarCross Ref
- Reena Panwar and Bhawna Mallick. 2015. Load balancing in cloud computing using dynamic load management algorithm. In Proceedings of the 2015 IEEE International Conference on Green Computing and Internet of Things. 773--778. Google ScholarDigital Library
- KyoungSoo Park and Vivek S. Pai. 2006. CoMon: A mostly-scalable monitoring system for PlanetLab. ACM SIGOPS Operating Systems Review 40, 1 (2006), 65--74. Google ScholarDigital Library
- Tim Biggs and Patrick Hatch. 2016. Banks, websites down as wild weather knocks out Amazon Web Services. Retrieved March 27, 2018 from http://www.afr.com/technology/banks-websites-down-as-wild-weather-knocks-out-amazon-web-services-20160605-gpc8ob.Google Scholar
- Ashikur Rahman, Xue Liu, and Fanxin Kong. 2014. A survey on geographic load balancing based data center power management in the smart grid environment. IEEE Communications Surveys and Tutorials 16, 1 (2014), 214--233.Google ScholarCross Ref
- Martin Randles, David Lamb, and A. Taleb-Bendiab. 2010. A comparative study into distributed load balancing algorithms for cloud computing. In Proceedings of the 2010 IEEE 24th International Conference on Advanced Information Networking and Applications Workshops. 551--556. Google ScholarDigital Library
- RUBBoS. 2005. RUBBoS: Bulletin Board Benchmark. Retrieved March 27, 2018 from http://jmob.ow2.org/rubbos.html.Google Scholar
- RUBiS. 2009. RUBiS. RUBiS: Rice University bidding system. Retrieved March 27, 2018 from http://rubis.ow2.org/.Google Scholar
- Altino M. Sampaio, Jorge G. Barbosa, and Radu Prodan. 2015. PIASA: A power and interference aware resource management strategy for heterogeneous workloads in cloud data centers. Simulation Modelling Practice and Theory 57 (2015), 142--160.Google ScholarCross Ref
- Sukhpal Singh and Inderveer Chana. 2016a. EARTH: Energy-aware autonomic resource scheduling in cloud computing. Journal of Intelligent and Fuzzy Systems 30, 3 (2016), 1581--1600.Google ScholarCross Ref
- Sukhpal Singh and Inderveer Chana. 2016b. QoS-aware autonomic resource management in cloud computing: A systematic review. ACM Computing Surveys (CSUR) 48, 3 (2016), 1--46. Google ScholarDigital Library
- Jungmin Son, Amir Vahid Dastjerdi, Rodrigo N. Calheiros, and Rajkumar Buyya. 2017. SLA-aware and energy-efficient dynamic overbooking in sdn-based cloud data centers. IEEE Transactions on Sustainable Computing 2, 2 (2017), 76--89.Google ScholarCross Ref
- El-Ghazali Talbi. 2009. Metaheuristics: From Design to Implementation. Vol. 74. John Wiley and Sons. Google ScholarDigital Library
- Luis Tomás, Cristian Klein, Johan Tordsson, and Francisco Hernández-Rodríguez. 2014. The straw that broke the camel’s back: Safe cloud overbooking with application brownout. In Proceedings of the 2014 IEEE International Conference on Cloud and Autonomic Computing. 151--160. Google ScholarDigital Library
- Tomcat. 2018. Apache Tomcat - Welcome! Retrieved March 13, 2018 from http://tomcat.apache.org/.Google Scholar
- VMware. 2018. VMware - Official Site. Retrieved March 27, 2018 from https://www.vmware.com/.Google Scholar
- Denis Weerasiri, Moshe Chai Barukh, Boualem Benatallah, Quan Z. Sheng, and Rajiv Ranjan. 2017. A taxonomy and survey of cloud resource orchestration techniques. ACM Computing Surveys (CSUR) 50, 2 (2017), 1--41. Google ScholarDigital Library
- Wikibench. 2017. Retrieved March 15, 2018 from http://www.wikibench.eu/wiki/2007-10/.Google Scholar
- Minxian Xu and Rajkumar Buyya. 2017. Energy efficient scheduling of application components via brownout and approximate Markov decision process. In Proceedings of the 15th International Conference on Service-Oriented Computing. 206--220.Google ScholarDigital Library
- Minxian Xu, Amir Vahid Dastjerdi, and Rajkumar Buyya. 2016. Energy efficient scheduling of cloud application components with brownout. IEEE Transactions on Sustainable Computing 1, 2 (2016), 40--53.Google ScholarCross Ref
- Minxian Xu, Wenhong Tian, and Rajkumar Buyya. 2017. A survey on load balancing algorithms for virtual machines placement in cloud computing. Concurrency and Computation: Practice and Experience 29, 12 (2017), 4123--4138.Google ScholarCross Ref
- Zhi-Hui Zhan, Xiao-Fang Liu, Yue-Jiao Gong, Jun Zhang, Henry Shu-Hung Chung, and Yun Li. 2015. Cloud computing resource scheduling and a survey of its evolutionary approaches. ACM Computing Surveys (CSUR) 47, 4 (2015), 1--33. Google ScholarDigital Library
- Yunqi Zhang, Michael A. Laurenzano, Jason Mars, and Lingjia Tang. 2014. Smite: Precise QoS prediction on real-system smt processors to improve utilization in warehouse scale computers. In Proceedings of the 47th Annual IEEE/ACM International Symposium on Microarchitecture. 406--418. Google ScholarDigital Library
- Tianqi Zhao, Wei Zhang, Haiyan Zhao, and Zhi Jin. 2017. A reinforcement learning-based framework for the generation and evolution of adaptation rules. In Proceedings of the 2017 IEEE International Conference on Autonomic Computing. 103--112.Google ScholarCross Ref
Index Terms
- Brownout Approach for Adaptive Management of Resources and Applications in Cloud Computing Systems: A Taxonomy and Future Directions
Recommendations
Challenges in real-time virtualization and predictable cloud computing
Cloud computing and virtualization technology have revolutionized general-purpose computing applications in the past decade. The cloud paradigm offers advantages through reduction of operation costs, server consolidation, flexible system configuration ...
Optimal Resource Usage in Multi-Cloud Computing Environment
Cloud computing has emerged as a new paradigm for accessing distributed computing resources such as infrastructure, hardware platform, and software applications on-demand over the internet as services. This paper presents an optimal resource management ...
Key Challenges in Cloud Computing: Enabling the Future Internet of Services
Cloud computing will play a major role in the future Internet of Services, enabling on-demand provisioning of applications, platforms, and computing infrastructures. However, the cloud community must address several technology challenges to turn this ...
Comments