ABSTRACT
Recently, the high-performance computing community has realized that power is a performance-limiting factor. One reason for this is that supercomputing centers have limited power capacity and machines are starting to hit that limit. In addition, the cost of energy has become increasingly significant, and the heat produced by higher-energy components tends to reduce their reliability. One way to reduce power (and therefore energy) requirements is to use high-performance cluster nodes that are frequency- and voltage-scalable (e.g., AMD-64 processors).The problem we address in this paper is: given a target program, a power-scalable cluster, and an upper limit for energy consumption, choose a schedule (number of nodes and CPU frequency) that simultaneously (1) satisfies an external upper limit for energy consumption and (2) minimizes execution time. There are too many schedules for an exhaustive search. Therefore, we find a schedule through a novel combination of performance modeling, performance prediction, and program execution. Using our technique, we are able to find a near-optimal schedule for all of our benchmarks in just a handful of partial program executions.
- N.D. Adiga et al. An overview of the BlueGene/L supercomputer. In Supercomputing, November 2002.]]Google ScholarDigital Library
- D. Bailey, J. Barton, T. Lasinski, and H. Simon. The NAS parallel benchmarks. RNR-91-002, NASA Ames Research Center, August 1991.]]Google Scholar
- K.W. Cameron, X. Feng, and R. Ge. Performance-constrained, distributed dvs scheduling for scientific applications on power-aware clusters. In Supercomputing (to appear), November 2005.]] Google ScholarDigital Library
- Enrique V. Carrera, Eduardo Pinheiro, and Ricardo Bianchini. Conserving disk energy in network servers. In Intl. Conference on Supercomputing, June 2003.]] Google ScholarDigital Library
- Jeffrey S. Chase, Darrell C. Anderson, Prachi N. Thakar, Amin Vahdat, and Ronald P. Doyle. Managing energy and server resources in hosting centres. In Symposium on Operating Systems Principles, 2001.]] Google ScholarDigital Library
- Guilin Chen, Konrad Malkowski, Mahmut Kandemir, and Padma Raghavan. Reducing power with performance contraints for parallel sparse applications. In Workshop on High-Performance, Power-Aware Computing, April 2005.]] Google ScholarDigital Library
- Compaq Computer Corporation, Intel Corporation, Microsoft Corporation, Phoenix Technologies Ltd., and Toshiba Corporation. Advanced configuration and power interface specification, revision 2.0. July 2000.]]Google Scholar
- Carla S. Ellis. The case for higher-level power management. In Workshop on Hot Topics in Operating Systems, Mar 1999.]] Google ScholarDigital Library
- Elmootazbellah Elnozahy, Michael Kistler, and Ramakrishnan Rajamony. Energy conservation policies for web servers. In Usenix Symposium on Internet Technologies and Systems, 2003.]] Google ScholarDigital Library
- E.N. (Mootaz) Elnozahy, Michael Kistler, and Ramakrishnan Rajamony. Energy-efficient server clusters. In Workshop on Mobile Computing Systems and Applications, Feb 2002.]]Google Scholar
- Ahmad Faraj and Xin Yuan. Automatic generation and tuning of MPI collective communication routines. In Intl. Conference on Supercomputing, June 2005.]] Google ScholarDigital Library
- J. Flinn and M. Satyanarayanan. Powerscope: A tool for profiling the energy usage of mobile applications. In IEEE Workshop on Mobile Computing Systems and Applications, February 1999.]] Google ScholarDigital Library
- Vincent W. Freeh, David K. Lowenthal, Feng Pan, and Nandani Kappiah. Using multiple energy gears in MPI programs on a power-scalable cluster. In Principles and Practices of Parallel Programming, June 2005.]] Google ScholarDigital Library
- Vincent W. Freeh, David K. Lowenthal, Rob Springer, Feng Pan, and Nandani Kappiah. Exploring the energy-time tradeoff in MPI programs on a power-scalable cluster. In International Parallel and Distributed Processing Symposium, April 2005.]] Google ScholarDigital Library
- Vincent W. Freeh, Feng Pan, David K. Lowenthal, Nandini Kappiah, Rob Springer, Barry Rountree, and Mark E. Femal. Analyzing the energy-time tradeoff in high-performance computing applications. phSubmitted to Transactions on Parallel and Distributed Systems, 2005.]] Google ScholarDigital Library
- Chris Gniady, Y. Charlie Hu, and Yung-Hsiang Lu. Program counter based techniques for dynamic power management. In High-Performance Computer Architecture, February 2004.]] Google ScholarDigital Library
- C-H. Hsu and U. Kremer. The design, implementation, and evaluation of a compiler algorithm for CPU energy reduction. In Programming Languages, Design, and Implementation, June 2003.]] Google ScholarDigital Library
- Chung-Hsing Hsu and Wu-chun Feng. Towards efficient supercomputing: Choosing the right efficiency metric. In Workshop on High-Performance, Power-Aware Computing, April 2005.]]Google Scholar
- M. Huang, J. Renau, and J. Torellas. Positional adaptation of processors: Application to energy reduction. In International Symposium on Computer Architecture, June 2003.]] Google ScholarDigital Library
- Nandani Kappiah, Vincent W. Freeh, and David K. Lowenthal. Just in time dynamic voltage scaling: Exploiting inter-node slack to save energy in MPI programs. In Supercomputing (to appear), November 2005.]] Google ScholarDigital Library
- Ken Kennedy and Ulrich Kremer. Automatic data layout for distributed-memory machines. ACM Transactions on Programming Languages and Systems, 20(4):869--916, 1998.]] Google ScholarDigital Library
- Charles Lefurgy, Karthick Rajamani, Freeman Rawson, Wes Felter, Michael Kistler, and Tom W. Keller. Energy management for commerical servers. IEEE Computer, pages 39--48, December 2003.]] Google ScholarDigital Library
- Robert J. Minerick, Vincent W. Freeh, and Peter M. Kogge. Dynamic power management using feedback. In Workshop on Compilers and Operating Systems for Low Power, Charlottesville, Va, September 2002.]]Google Scholar
- Orion Multisystems. http://www.orionmulti.com/.]]Google Scholar
- Athanasios E. Papathanasiou and Michael L. Scott. Energy efficiency through burstiness. In Workshop on Mobile Computing Systems and Applications, October 2003.]]Google Scholar
- Eduardo Pinheiro, Ricardo Bianchini, Enrique V. Carrera, and Taliver Heath. Load balancing and unbalancing for power and performance in cluster-based systems. In Workshop on Compilers and Operating Systems for Low Power, September 2001.]]Google Scholar
- Thermal management and server density: Critical issues for today's data center. http://www.-rackable.com\Rackable\WPaper.pdf, 2004.]]Google Scholar
- Umit Rencuzogullari and Sandhya Dwarkadas. Dynamic adaptation to available resources for parallel computing in an autonomous network of workstations. In Principles and Practice of Parallel Programming, June 2001.]] Google ScholarDigital Library
- Vivek Sharma, Arun Thomas, Tarek Abdelzaher, and Kevin Skadron. Power-aware QoS management in web servers. In IEEE Real-Time Systems Symposium, Cancun, Mexico, December 2003.]] Google ScholarDigital Library
- Timothy Sherwood, Erez Perelman, Greg Hamerly, and Brad Calder. Automatically characterizing large scale program behavior. In Architectural Support for Programming Languages and Operating Systems, October 2002.]] Google ScholarDigital Library
- Jaspal Subhlok and Gary Vondran. Optimal mapping of sequences of data parallel tasks. In Principles and Practice of Parallel Programming, July 1995.]] Google ScholarDigital Library
- J. S. Vetter and M.O. McCracken. Statistical scalability analysis of communication operations in distributed applications. In Principles and Practice of Parallel Programming, June 2001.]] Google ScholarDigital Library
- R. Viswanath, W. Vijay, A. Watwe, and V. Lebonheur. Thermal performance challenges from silicon to systems. Intel Technology Journal, Q3 2000.]]Google Scholar
- Michael J. Voss and Rudolf Eigemann. High-level adaptive program optimization with adapt. In Principles and Practices of Parallel Programming, 2001.]] Google ScholarDigital Library
- M. Warren, E. Weigle, and W. Feng. High-density computing: A 240-node beowulf in one cubic meter. In Supercomputing, November 2002.]] Google ScholarDigital Library
- M. Weiser, B. Welch, A. J. Demers, and S. Shenker. Scheduling for reduced CPU energy. In Operating Systems Design and Implementation, November 1994.]] Google ScholarDigital Library
- R. Clint Whaley, Antoine Petitet, and Jack J. Dongarra. Automated empirical optimizations of software and the ATLAS project. Parallel Computing, 27(1--2):3--35, 2001.]]Google Scholar
- R. Clint Whaley and David B. Whalley. Tuning high performance kernels through empirical compilation. In International Conference on Parallel Processing, June 2005.]] Google ScholarDigital Library
- Tao Yang, Xiaosong Ma, and Frank Mueller. Predicting parallel applications' performance across platforms using partial execution. In Supercomputing (to appear), November 2005.]]Google Scholar
- Heng Zeng, Carla S. Ellis, Alvin R. Lebeck, and Amin Vahdat. Currentcy: Unifying policies for resource management. In USENIX Annual Technical Conference, June 2003.]] Google ScholarDigital Library
- Qingbo Zhu, Francis M. David, Christo Devaraj, Zhenmin Li, Yuanyuan Zhou, and Pei Cao. Reducing energy consumption of disk storage using power-aware cache management. In High-Performance Computer Architecture, February 2004.]] Google ScholarDigital Library
Index Terms
- Minimizing execution time in MPI programs on an energy-constrained, power-scalable cluster
Recommendations
Exploiting Energy Saving Opportunity of Barrier Operation in MPI Programs
AMS '08: Proceedings of the 2008 Second Asia International Conference on Modelling & Simulation (AMS)In high performance parallel computing, energy consumption has become a more and more important problem. Under the support of the voltage scaling-available processors, reducing CPU voltage/frequency in the non-critical path is an efficient energy ...
Analyzing Energy-Time Tradeoff in Power Overprovisioned HPC Data Centers
IPDPSW '15: Proceedings of the 2015 IEEE International Parallel and Distributed Processing Symposium WorkshopMinimizing energy and power consumption of large scale data centers is one of the biggest challenges faced by the high performance computing community. In an over provisioned data center, nodes are power capped to run below their Thermal Design Power (...
Power and energy footprint of openMP programs using OpenMP runtime API
E2SC '14: Proceedings of the 2nd International Workshop on Energy Efficient SupercomputingPower and energy have become dominant aspects of hardware and software design in the High Performance Computing (HPC). Recently, the Department of Defense (DOD) has put a constraint that applications and architectures need to attain 75 GFLOPS/Watt in ...
Comments