Abstract
The increasing use of server clusters has made their energy consumption an important issue. To address it, several power management techniques are being developed. In order to be useful, these techniques must address the performance and availability implications of reducing energy consumption. This paper presents a power management technique that maintains the quality of service (QoS) levels specified with service level agreements expressed as a threshold for a percentile of the response time. In addition, it provides self-healing by identifying when servers fail and automatically provisioning new servers. The technique is based on balancing the load so that it is concentrated in a small number of servers. For this, it only requires two utilization thresholds and models of performance and power consumption for the application executed in the server. It works in heterogeneous servers and provides overload protection. Several experiments carried out on a prototype show that the technique reduces energy consumption (up to 57.59 % compared to an always-on policy) while providing self-healing and maintaining the QoS.
Similar content being viewed by others
References
Anagnostopoulou V, Biswas S, Saadeldeen H, Savage A, Bianchini R, Yang T, Franklin D, Chong FT (2012) Barely alive memory servers: keeping data active in a low-power state. J Emerg Technol Comput Syst 8(4):31:1–31:20
Andersen D, Franklin J, Kaminsky M, Phanishayee A, Tan L, Vasudevan V (2011) FAWN: a fast array of wimpy nodes. Commun ACM 54(7):101–109
Barroso LA, Hölzle U (2007) The case for energy-proportional computing. Computer 40(12):33–37
Beloglazov A, Buyya R, Lee YC, Zomaya A (2011) A taxonomy and survey of energy-efficient data centers and cloud computing systems. Adv Comput 82:47–111
Bobroff N, Kochut A, Beaty K (2007) Dynamic placement of virtual machines for managing sla violations. In: Integrated network management. IEEE, pp 119–128
Buyya R, Beloglazov A, Abawajy J (2010) Energy-efficient management of data center resources for cloud computing: a vision, architectural elements, and open challenges. In: Proceedings of the 2010 international conference on parallel and distributed processing techniques and applications (PDPTA’10)
Cardosa M, Korupolu MR, Singh A (2009) Shares and utilities based power consolidation in virtualized server environments. In: Proceedings of the 11th IFIP/IEEE international conference on symposium on integrated network management (IM’09). IEEE Press, Piscataway, pp 327–334
Caulfield AM, Grupp LM, Swanson S (2009) Gordon: using flash memory to build fast, power-efficient clusters for data-intensive applications. SIGARCH Comput Archit News 37(1):217–228
Chase JS, Anderson DC, Thakar PN, Vahdat AM, Doyle RP (2001) Managing energy and server resources in hosting centers. In: Proceedings of the eighteenth ACM symposium on operating systems principles (SOSP’01). ACM, New York, pp 103–116
Chen G, Malkowski K, Kandemir M, Raghavan P (2005a) Reducing power with performance constraints for parallel sparse applications. In: Proceedings of the 19th IEEE international parallel and distributed processing symposium (IPDPS’05)—workshop 11, vol 12. IEEE Computer Society, Washington, DC, p 231.1
Chen Y, Das A, Qin W, Sivasubramaniam A, Wang Q, Gautam N (2005b) Managing server energy and operational costs in hosting centers. SIGMETRICS Perform Eval Rev 33(1):303–314
Cheng Y, Zeng Y (2011) Automatic energy status controlling with dynamic voltage scaling in power-aware high performance computing cluster. In: 12th international conference on parallel and distributed computing, applications and technologies (PDCAT’11), pp 412–416
Chetsa GLT, Lefrvre L, Pierson JM, Stolf P, Da Costa G (2012) A runtime framework for energy efficient hpc systems without a priori knowledge of applications. In: Proceedings of the 2012 IEEE 18th international conference on parallel and distributed systems (ICPADS’12). IEEE Computer Society, Washington, DC, pp 660–667
Cocaña-Fernández A, Ranilla J, Sánchez L (2014) Energy-efficient allocation of computing node slots in HPC clusters through parameter learning and hybrid genetic fuzzy system modeling. J Supercomput 71(3):1163–1174
Cox SJ, Cox JT, Boardman RP, Johnston SJ, Scott M, O’brien NS (2014) Iridis-pi: a low-cost, compact demonstration cluster. Clust Comput 17(2):349–358
Deng Q, Ramos L, Bianchini R, Meisner D, Wenisch T (2012) Active low-power modes for main memory with memscale. Micro IEEE 32(3):60–69
Elnozahy EN, Kistler M, Rajamony R (2003) Energy-efficient server clusters. In: Proceedings of the 2nd international conference on power-aware computer systems (PACS’02). Springer, Heidelberg, pp 179–197
Entrialgo J, García DF, García J, García M, Valledor P, Obaidat MS (2011) Dynamic adaptation of response-time models for QoS management in autonomic systems. J Syst Softw 84(5):810–820
Freeh VW, Lowenthal DK, Pan F, Kappiah N, Springer R, Rountree BL, Femal ME (2007) Analyzing the energy-time trade-off in high-performance computing applications. IEEE Trans Parallel Distrib Syst 18(6):835–848
Gandhi A, Harchol-Balter M, Das R, Lefurgy C (2009) Optimal power allocation in server farms. SIGMETRICS Perform Eval Rev 37(1):157–168
Gandhi A, Harchol-Balter M, Raghunathan R, Kozuch MA (2012) AutoScale: dynamic, robust capacity management for multi-tier data centers. ACM Trans Comput Syst 30(4):14:1–14:26
Ge R, Feng X, Cameron KW (2005) Improvement of power-performance efficiency for high-end computing. In: Proceedings of the 19th IEEE international parallel and distributed processing symposium (IPDPS’05)—workshop 11, vol 12. IEEE Computer Society, Washington, DC, p 233.2
Ge R, Feng X, Feng Wc, Cameron KW (2007) CPU MISER: a performance-directed, run-time system for power-aware clusters. In: Proceedings of the 2007 international conference on parallel processing (ICPP’07). IEEE Computer Society, Washington, DC, p 18
Gmach D, Rolia J, Cherkasova L, Kemper A (2009) Resource pool management: reactive versus proactive or let’s be friends. Comput Netw 53(17):2905–2922
Heath T, Diniz B, Carrera EV, Meira W Jr, Bianchini R (2005) Energy conservation in heterogeneous server clusters. In: Proceedings of the tenth ACM SIGPLAN symposium on principles and practice of parallel programming (PPoPP’05). ACM, New York, pp 186–195
Horvath T, Skadron K (2008) Multi-mode energy management for multi-tier server clusters. In: Proceedings of the 17th international conference on parallel architectures and compilation techniques (PACT’08). ACM, New York, pp 270–279
Huang S, Feng W (2009) Energy-efficient cluster computing via accurate workload characterization. In: Proceedings of the 2009 9th IEEE/ACM international symposium on cluster computing and the grid (CCGRID’09). IEEE Computer Society, Washington, DC, pp 68–75
Ita (1998) The internet traffic archives: worldcup98. http://ita.ee.lbl.gov/html/contrib/WorldCup.html. Accessed 18 Nov 2015
Krioukov A, Mohan P, Alspaugh S, Keys L, Culler D, Katz RH (2010) Napsac: design and implementation of a power-proportional web cluster. In: Proceedings of the first ACM SIGCOMM workshop on green networking (Green Networking’10). ACM, New York, pp 15–22
Kusic D, Kephart J, Hanson J, Kandasamy N, Jiang G (2009) Power and performance management of virtualized computing environments via lookahead control. Clust Comput 12(1):1–15
Lawson C, Hanson R (1995) Solving least squares problems. In: Classics in applied mathematics. Society for Industrial and Applied Mathematics, Philadelphia
Leite JC, Kusic DM, Mossé D, Bertini L (2010) Stochastic approximation control of power and tardiness in a three-tier web-hosting cluster. In: Proceedings of the 7th international conference on autonomic computing (ICAC’10). ACM, New York, pp 41–50
Lim M, Freeh V, Lowenthal D (2011) Adaptive, transparent CPU scaling algorithms leveraging inter-node mpi communication regions. Parallel Comput 37(10–11):667–683
Meisner D, Wenisch TF (2012) DreamWeaver: architectural support for deep sleep. In: Proceedings of the seventeenth international conference on architectural support for programming languages and operating systems (ASPLOS XVII). ACM, New York, pp 313–324
Meisner D, Gold B, Wenisch T (2009) Powernap: eliminating server idle power. ACM SIGPLAN Not 44(3):205–216
Nathuji R, Schwan K (2007) Virtualpower: coordinated power management in virtualized enterprise systems. SIGOPS Oper Syst Rev 41(6):265–278
Pinheiro E, Bianchini R, Carrera EV, Heath T (2001) Load balancing and unbalancing for power and performance in cluster-based systems. Workshop Compil Oper Syst Low Power 180:182–195
Srikantaiah S, Kansal A, Zhao F (2008) Energy aware consolidation for cloud computing. In: Proceedings of the 2008 conference on power aware computing and systems. USENIX Association, Berkeley
Tolentino ME, Turner J, Cameron KW (2007) Memory-MISER: a performance-constrained runtime system for power-scalable clusters. In: Proceedings of the 4th international conference on computing frontiers (CF’07). ACM, New York, pp 237–246
Urgaonkar B, Shenoy P, Chandra A, Goyal P (2005) Dynamic provisioning of multi-tier internet applications. In: Proceedings of second international conference on autonomic computing (ICAC’05), pp 217–228
Valentini GL, Lassonde W, Khan SU, Min-Allah N, Madani SA, Li J, Zhang L, Wang L, Ghani N, Kolodziej J, Li H, Zomaya AY, Xu CZ, Balaji P, Vishnu A, Pinel F, Pecero JE, Kliazovich D, Bouvry P (2013) An overview of energy efficiency techniques in cluster computing systems. Clust Comput 16(1):3–15
Whitney J, Delforge P (2014) Data center efficiency assessment. Tech. rep, Natural Resources Defense Council (NRDC)
Wong D, Annavaram M (2012) Knightshift: scaling the energy proportionality wall through server-level heterogeneity. In: Proceedings of the 2012 45th annual IEEE/ACM international symposium on microarchitecture (MICRO-45). IEEE Computer Society, Washington, DC, pp 119–130
Wong D, Annavaram M (2014) Implications of high energy proportional servers on cluster-wide energy proportionality. In: 20th IEEE international symposium on high performance computer architecture (HPCA’14), Orlando, pp 142–153
Author information
Authors and Affiliations
Corresponding author
Additional information
This research has been partially supported by the Project TIN2008-06045-C02-01 of the Spanish National Plan for Research, Development and Innovation.
Rights and permissions
About this article
Cite this article
Entrialgo, J., Medrano, R., García, D.F. et al. Autonomic power management with self-healing in server clusters under QoS constraints. Computing 98, 871–894 (2016). https://doi.org/10.1007/s00607-015-0477-2
Received:
Accepted:
Published:
Issue Date:
DOI: https://doi.org/10.1007/s00607-015-0477-2