Skip to main content

Advertisement

Log in

Autonomic power management with self-healing in server clusters under QoS constraints

  • Published:
Computing Aims and scope Submit manuscript

Abstract

The increasing use of server clusters has made their energy consumption an important issue. To address it, several power management techniques are being developed. In order to be useful, these techniques must address the performance and availability implications of reducing energy consumption. This paper presents a power management technique that maintains the quality of service (QoS) levels specified with service level agreements expressed as a threshold for a percentile of the response time. In addition, it provides self-healing by identifying when servers fail and automatically provisioning new servers. The technique is based on balancing the load so that it is concentrated in a small number of servers. For this, it only requires two utilization thresholds and models of performance and power consumption for the application executed in the server. It works in heterogeneous servers and provides overload protection. Several experiments carried out on a prototype show that the technique reduces energy consumption (up to 57.59 % compared to an always-on policy) while providing self-healing and maintaining the QoS.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Fig. 1
Fig. 2
Fig. 3
Fig. 4
Fig. 5
Fig. 6
Fig. 7
Fig. 8
Fig. 9
Fig. 10
Fig. 11

Similar content being viewed by others

References

  1. Anagnostopoulou V, Biswas S, Saadeldeen H, Savage A, Bianchini R, Yang T, Franklin D, Chong FT (2012) Barely alive memory servers: keeping data active in a low-power state. J Emerg Technol Comput Syst 8(4):31:1–31:20

  2. Andersen D, Franklin J, Kaminsky M, Phanishayee A, Tan L, Vasudevan V (2011) FAWN: a fast array of wimpy nodes. Commun ACM 54(7):101–109

    Article  Google Scholar 

  3. Barroso LA, Hölzle U (2007) The case for energy-proportional computing. Computer 40(12):33–37

    Article  Google Scholar 

  4. Beloglazov A, Buyya R, Lee YC, Zomaya A (2011) A taxonomy and survey of energy-efficient data centers and cloud computing systems. Adv Comput 82:47–111

    Article  Google Scholar 

  5. Bobroff N, Kochut A, Beaty K (2007) Dynamic placement of virtual machines for managing sla violations. In: Integrated network management. IEEE, pp 119–128

  6. Buyya R, Beloglazov A, Abawajy J (2010) Energy-efficient management of data center resources for cloud computing: a vision, architectural elements, and open challenges. In: Proceedings of the 2010 international conference on parallel and distributed processing techniques and applications (PDPTA’10)

  7. Cardosa M, Korupolu MR, Singh A (2009) Shares and utilities based power consolidation in virtualized server environments. In: Proceedings of the 11th IFIP/IEEE international conference on symposium on integrated network management (IM’09). IEEE Press, Piscataway, pp 327–334

  8. Caulfield AM, Grupp LM, Swanson S (2009) Gordon: using flash memory to build fast, power-efficient clusters for data-intensive applications. SIGARCH Comput Archit News 37(1):217–228

    Article  Google Scholar 

  9. Chase JS, Anderson DC, Thakar PN, Vahdat AM, Doyle RP (2001) Managing energy and server resources in hosting centers. In: Proceedings of the eighteenth ACM symposium on operating systems principles (SOSP’01). ACM, New York, pp 103–116

  10. Chen G, Malkowski K, Kandemir M, Raghavan P (2005a) Reducing power with performance constraints for parallel sparse applications. In: Proceedings of the 19th IEEE international parallel and distributed processing symposium (IPDPS’05)—workshop 11, vol 12. IEEE Computer Society, Washington, DC, p 231.1

  11. Chen Y, Das A, Qin W, Sivasubramaniam A, Wang Q, Gautam N (2005b) Managing server energy and operational costs in hosting centers. SIGMETRICS Perform Eval Rev 33(1):303–314

    Article  Google Scholar 

  12. Cheng Y, Zeng Y (2011) Automatic energy status controlling with dynamic voltage scaling in power-aware high performance computing cluster. In: 12th international conference on parallel and distributed computing, applications and technologies (PDCAT’11), pp 412–416

  13. Chetsa GLT, Lefrvre L, Pierson JM, Stolf P, Da Costa G (2012) A runtime framework for energy efficient hpc systems without a priori knowledge of applications. In: Proceedings of the 2012 IEEE 18th international conference on parallel and distributed systems (ICPADS’12). IEEE Computer Society, Washington, DC, pp 660–667

  14. Cocaña-Fernández A, Ranilla J, Sánchez L (2014) Energy-efficient allocation of computing node slots in HPC clusters through parameter learning and hybrid genetic fuzzy system modeling. J Supercomput 71(3):1163–1174

  15. Cox SJ, Cox JT, Boardman RP, Johnston SJ, Scott M, O’brien NS (2014) Iridis-pi: a low-cost, compact demonstration cluster. Clust Comput 17(2):349–358

  16. Deng Q, Ramos L, Bianchini R, Meisner D, Wenisch T (2012) Active low-power modes for main memory with memscale. Micro IEEE 32(3):60–69

    Article  Google Scholar 

  17. Elnozahy EN, Kistler M, Rajamony R (2003) Energy-efficient server clusters. In: Proceedings of the 2nd international conference on power-aware computer systems (PACS’02). Springer, Heidelberg, pp 179–197

  18. Entrialgo J, García DF, García J, García M, Valledor P, Obaidat MS (2011) Dynamic adaptation of response-time models for QoS management in autonomic systems. J Syst Softw 84(5):810–820

    Article  Google Scholar 

  19. Freeh VW, Lowenthal DK, Pan F, Kappiah N, Springer R, Rountree BL, Femal ME (2007) Analyzing the energy-time trade-off in high-performance computing applications. IEEE Trans Parallel Distrib Syst 18(6):835–848

    Article  Google Scholar 

  20. Gandhi A, Harchol-Balter M, Das R, Lefurgy C (2009) Optimal power allocation in server farms. SIGMETRICS Perform Eval Rev 37(1):157–168

    Google Scholar 

  21. Gandhi A, Harchol-Balter M, Raghunathan R, Kozuch MA (2012) AutoScale: dynamic, robust capacity management for multi-tier data centers. ACM Trans Comput Syst 30(4):14:1–14:26

  22. Ge R, Feng X, Cameron KW (2005) Improvement of power-performance efficiency for high-end computing. In: Proceedings of the 19th IEEE international parallel and distributed processing symposium (IPDPS’05)—workshop 11, vol 12. IEEE Computer Society, Washington, DC, p 233.2

  23. Ge R, Feng X, Feng Wc, Cameron KW (2007) CPU MISER: a performance-directed, run-time system for power-aware clusters. In: Proceedings of the 2007 international conference on parallel processing (ICPP’07). IEEE Computer Society, Washington, DC, p 18

  24. Gmach D, Rolia J, Cherkasova L, Kemper A (2009) Resource pool management: reactive versus proactive or let’s be friends. Comput Netw 53(17):2905–2922

    Article  Google Scholar 

  25. Heath T, Diniz B, Carrera EV, Meira W Jr, Bianchini R (2005) Energy conservation in heterogeneous server clusters. In: Proceedings of the tenth ACM SIGPLAN symposium on principles and practice of parallel programming (PPoPP’05). ACM, New York, pp 186–195

  26. Horvath T, Skadron K (2008) Multi-mode energy management for multi-tier server clusters. In: Proceedings of the 17th international conference on parallel architectures and compilation techniques (PACT’08). ACM, New York, pp 270–279

  27. Huang S, Feng W (2009) Energy-efficient cluster computing via accurate workload characterization. In: Proceedings of the 2009 9th IEEE/ACM international symposium on cluster computing and the grid (CCGRID’09). IEEE Computer Society, Washington, DC, pp 68–75

  28. Ita (1998) The internet traffic archives: worldcup98. http://ita.ee.lbl.gov/html/contrib/WorldCup.html. Accessed 18 Nov 2015

  29. Krioukov A, Mohan P, Alspaugh S, Keys L, Culler D, Katz RH (2010) Napsac: design and implementation of a power-proportional web cluster. In: Proceedings of the first ACM SIGCOMM workshop on green networking (Green Networking’10). ACM, New York, pp 15–22

  30. Kusic D, Kephart J, Hanson J, Kandasamy N, Jiang G (2009) Power and performance management of virtualized computing environments via lookahead control. Clust Comput 12(1):1–15

    Article  Google Scholar 

  31. Lawson C, Hanson R (1995) Solving least squares problems. In: Classics in applied mathematics. Society for Industrial and Applied Mathematics, Philadelphia

  32. Leite JC, Kusic DM, Mossé D, Bertini L (2010) Stochastic approximation control of power and tardiness in a three-tier web-hosting cluster. In: Proceedings of the 7th international conference on autonomic computing (ICAC’10). ACM, New York, pp 41–50

  33. Lim M, Freeh V, Lowenthal D (2011) Adaptive, transparent CPU scaling algorithms leveraging inter-node mpi communication regions. Parallel Comput 37(10–11):667–683

    Article  Google Scholar 

  34. Meisner D, Wenisch TF (2012) DreamWeaver: architectural support for deep sleep. In: Proceedings of the seventeenth international conference on architectural support for programming languages and operating systems (ASPLOS XVII). ACM, New York, pp 313–324

  35. Meisner D, Gold B, Wenisch T (2009) Powernap: eliminating server idle power. ACM SIGPLAN Not 44(3):205–216

    Article  Google Scholar 

  36. Nathuji R, Schwan K (2007) Virtualpower: coordinated power management in virtualized enterprise systems. SIGOPS Oper Syst Rev 41(6):265–278

    Article  Google Scholar 

  37. Pinheiro E, Bianchini R, Carrera EV, Heath T (2001) Load balancing and unbalancing for power and performance in cluster-based systems. Workshop Compil Oper Syst Low Power 180:182–195

    Google Scholar 

  38. Srikantaiah S, Kansal A, Zhao F (2008) Energy aware consolidation for cloud computing. In: Proceedings of the 2008 conference on power aware computing and systems. USENIX Association, Berkeley

  39. Tolentino ME, Turner J, Cameron KW (2007) Memory-MISER: a performance-constrained runtime system for power-scalable clusters. In: Proceedings of the 4th international conference on computing frontiers (CF’07). ACM, New York, pp 237–246

  40. Urgaonkar B, Shenoy P, Chandra A, Goyal P (2005) Dynamic provisioning of multi-tier internet applications. In: Proceedings of second international conference on autonomic computing (ICAC’05), pp 217–228

  41. Valentini GL, Lassonde W, Khan SU, Min-Allah N, Madani SA, Li J, Zhang L, Wang L, Ghani N, Kolodziej J, Li H, Zomaya AY, Xu CZ, Balaji P, Vishnu A, Pinel F, Pecero JE, Kliazovich D, Bouvry P (2013) An overview of energy efficiency techniques in cluster computing systems. Clust Comput 16(1):3–15

    Article  Google Scholar 

  42. Whitney J, Delforge P (2014) Data center efficiency assessment. Tech. rep, Natural Resources Defense Council (NRDC)

  43. Wong D, Annavaram M (2012) Knightshift: scaling the energy proportionality wall through server-level heterogeneity. In: Proceedings of the 2012 45th annual IEEE/ACM international symposium on microarchitecture (MICRO-45). IEEE Computer Society, Washington, DC, pp 119–130

  44. Wong D, Annavaram M (2014) Implications of high energy proportional servers on cluster-wide energy proportionality. In: 20th IEEE international symposium on high performance computer architecture (HPCA’14), Orlando, pp 142–153

Download references

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Joaquín Entrialgo.

Additional information

This research has been partially supported by the Project TIN2008-06045-C02-01 of the Spanish National Plan for Research, Development and Innovation.

Rights and permissions

Reprints and permissions

About this article

Check for updates. Verify currency and authenticity via CrossMark

Cite this article

Entrialgo, J., Medrano, R., García, D.F. et al. Autonomic power management with self-healing in server clusters under QoS constraints. Computing 98, 871–894 (2016). https://doi.org/10.1007/s00607-015-0477-2

Download citation

  • Received:

  • Accepted:

  • Published:

  • Issue Date:

  • DOI: https://doi.org/10.1007/s00607-015-0477-2

Keywords

Mathematics Subject Classification

Navigation