skip to main content
10.1145/1122971.1123006acmconferencesArticle/Chapter ViewAbstractPublication PagesppoppConference Proceedingsconference-collections
Article

Minimizing execution time in MPI programs on an energy-constrained, power-scalable cluster

Published:29 March 2006Publication History

ABSTRACT

Recently, the high-performance computing community has realized that power is a performance-limiting factor. One reason for this is that supercomputing centers have limited power capacity and machines are starting to hit that limit. In addition, the cost of energy has become increasingly significant, and the heat produced by higher-energy components tends to reduce their reliability. One way to reduce power (and therefore energy) requirements is to use high-performance cluster nodes that are frequency- and voltage-scalable (e.g., AMD-64 processors).The problem we address in this paper is: given a target program, a power-scalable cluster, and an upper limit for energy consumption, choose a schedule (number of nodes and CPU frequency) that simultaneously (1) satisfies an external upper limit for energy consumption and (2) minimizes execution time. There are too many schedules for an exhaustive search. Therefore, we find a schedule through a novel combination of performance modeling, performance prediction, and program execution. Using our technique, we are able to find a near-optimal schedule for all of our benchmarks in just a handful of partial program executions.

References

  1. N.D. Adiga et al. An overview of the BlueGene/L supercomputer. In Supercomputing, November 2002.]]Google ScholarGoogle ScholarDigital LibraryDigital Library
  2. D. Bailey, J. Barton, T. Lasinski, and H. Simon. The NAS parallel benchmarks. RNR-91-002, NASA Ames Research Center, August 1991.]]Google ScholarGoogle Scholar
  3. K.W. Cameron, X. Feng, and R. Ge. Performance-constrained, distributed dvs scheduling for scientific applications on power-aware clusters. In Supercomputing (to appear), November 2005.]] Google ScholarGoogle ScholarDigital LibraryDigital Library
  4. Enrique V. Carrera, Eduardo Pinheiro, and Ricardo Bianchini. Conserving disk energy in network servers. In Intl. Conference on Supercomputing, June 2003.]] Google ScholarGoogle ScholarDigital LibraryDigital Library
  5. Jeffrey S. Chase, Darrell C. Anderson, Prachi N. Thakar, Amin Vahdat, and Ronald P. Doyle. Managing energy and server resources in hosting centres. In Symposium on Operating Systems Principles, 2001.]] Google ScholarGoogle ScholarDigital LibraryDigital Library
  6. Guilin Chen, Konrad Malkowski, Mahmut Kandemir, and Padma Raghavan. Reducing power with performance contraints for parallel sparse applications. In Workshop on High-Performance, Power-Aware Computing, April 2005.]] Google ScholarGoogle ScholarDigital LibraryDigital Library
  7. Compaq Computer Corporation, Intel Corporation, Microsoft Corporation, Phoenix Technologies Ltd., and Toshiba Corporation. Advanced configuration and power interface specification, revision 2.0. July 2000.]]Google ScholarGoogle Scholar
  8. Carla S. Ellis. The case for higher-level power management. In Workshop on Hot Topics in Operating Systems, Mar 1999.]] Google ScholarGoogle ScholarDigital LibraryDigital Library
  9. Elmootazbellah Elnozahy, Michael Kistler, and Ramakrishnan Rajamony. Energy conservation policies for web servers. In Usenix Symposium on Internet Technologies and Systems, 2003.]] Google ScholarGoogle ScholarDigital LibraryDigital Library
  10. E.N. (Mootaz) Elnozahy, Michael Kistler, and Ramakrishnan Rajamony. Energy-efficient server clusters. In Workshop on Mobile Computing Systems and Applications, Feb 2002.]]Google ScholarGoogle Scholar
  11. Ahmad Faraj and Xin Yuan. Automatic generation and tuning of MPI collective communication routines. In Intl. Conference on Supercomputing, June 2005.]] Google ScholarGoogle ScholarDigital LibraryDigital Library
  12. J. Flinn and M. Satyanarayanan. Powerscope: A tool for profiling the energy usage of mobile applications. In IEEE Workshop on Mobile Computing Systems and Applications, February 1999.]] Google ScholarGoogle ScholarDigital LibraryDigital Library
  13. Vincent W. Freeh, David K. Lowenthal, Feng Pan, and Nandani Kappiah. Using multiple energy gears in MPI programs on a power-scalable cluster. In Principles and Practices of Parallel Programming, June 2005.]] Google ScholarGoogle ScholarDigital LibraryDigital Library
  14. Vincent W. Freeh, David K. Lowenthal, Rob Springer, Feng Pan, and Nandani Kappiah. Exploring the energy-time tradeoff in MPI programs on a power-scalable cluster. In International Parallel and Distributed Processing Symposium, April 2005.]] Google ScholarGoogle ScholarDigital LibraryDigital Library
  15. Vincent W. Freeh, Feng Pan, David K. Lowenthal, Nandini Kappiah, Rob Springer, Barry Rountree, and Mark E. Femal. Analyzing the energy-time tradeoff in high-performance computing applications. phSubmitted to Transactions on Parallel and Distributed Systems, 2005.]] Google ScholarGoogle ScholarDigital LibraryDigital Library
  16. Chris Gniady, Y. Charlie Hu, and Yung-Hsiang Lu. Program counter based techniques for dynamic power management. In High-Performance Computer Architecture, February 2004.]] Google ScholarGoogle ScholarDigital LibraryDigital Library
  17. C-H. Hsu and U. Kremer. The design, implementation, and evaluation of a compiler algorithm for CPU energy reduction. In Programming Languages, Design, and Implementation, June 2003.]] Google ScholarGoogle ScholarDigital LibraryDigital Library
  18. Chung-Hsing Hsu and Wu-chun Feng. Towards efficient supercomputing: Choosing the right efficiency metric. In Workshop on High-Performance, Power-Aware Computing, April 2005.]]Google ScholarGoogle Scholar
  19. M. Huang, J. Renau, and J. Torellas. Positional adaptation of processors: Application to energy reduction. In International Symposium on Computer Architecture, June 2003.]] Google ScholarGoogle ScholarDigital LibraryDigital Library
  20. Nandani Kappiah, Vincent W. Freeh, and David K. Lowenthal. Just in time dynamic voltage scaling: Exploiting inter-node slack to save energy in MPI programs. In Supercomputing (to appear), November 2005.]] Google ScholarGoogle ScholarDigital LibraryDigital Library
  21. Ken Kennedy and Ulrich Kremer. Automatic data layout for distributed-memory machines. ACM Transactions on Programming Languages and Systems, 20(4):869--916, 1998.]] Google ScholarGoogle ScholarDigital LibraryDigital Library
  22. Charles Lefurgy, Karthick Rajamani, Freeman Rawson, Wes Felter, Michael Kistler, and Tom W. Keller. Energy management for commerical servers. IEEE Computer, pages 39--48, December 2003.]] Google ScholarGoogle ScholarDigital LibraryDigital Library
  23. Robert J. Minerick, Vincent W. Freeh, and Peter M. Kogge. Dynamic power management using feedback. In Workshop on Compilers and Operating Systems for Low Power, Charlottesville, Va, September 2002.]]Google ScholarGoogle Scholar
  24. Orion Multisystems. http://www.orionmulti.com/.]]Google ScholarGoogle Scholar
  25. Athanasios E. Papathanasiou and Michael L. Scott. Energy efficiency through burstiness. In Workshop on Mobile Computing Systems and Applications, October 2003.]]Google ScholarGoogle Scholar
  26. Eduardo Pinheiro, Ricardo Bianchini, Enrique V. Carrera, and Taliver Heath. Load balancing and unbalancing for power and performance in cluster-based systems. In Workshop on Compilers and Operating Systems for Low Power, September 2001.]]Google ScholarGoogle Scholar
  27. Thermal management and server density: Critical issues for today's data center. http://www.-rackable.com\Rackable\WPaper.pdf, 2004.]]Google ScholarGoogle Scholar
  28. Umit Rencuzogullari and Sandhya Dwarkadas. Dynamic adaptation to available resources for parallel computing in an autonomous network of workstations. In Principles and Practice of Parallel Programming, June 2001.]] Google ScholarGoogle ScholarDigital LibraryDigital Library
  29. Vivek Sharma, Arun Thomas, Tarek Abdelzaher, and Kevin Skadron. Power-aware QoS management in web servers. In IEEE Real-Time Systems Symposium, Cancun, Mexico, December 2003.]] Google ScholarGoogle ScholarDigital LibraryDigital Library
  30. Timothy Sherwood, Erez Perelman, Greg Hamerly, and Brad Calder. Automatically characterizing large scale program behavior. In Architectural Support for Programming Languages and Operating Systems, October 2002.]] Google ScholarGoogle ScholarDigital LibraryDigital Library
  31. Jaspal Subhlok and Gary Vondran. Optimal mapping of sequences of data parallel tasks. In Principles and Practice of Parallel Programming, July 1995.]] Google ScholarGoogle ScholarDigital LibraryDigital Library
  32. J. S. Vetter and M.O. McCracken. Statistical scalability analysis of communication operations in distributed applications. In Principles and Practice of Parallel Programming, June 2001.]] Google ScholarGoogle ScholarDigital LibraryDigital Library
  33. R. Viswanath, W. Vijay, A. Watwe, and V. Lebonheur. Thermal performance challenges from silicon to systems. Intel Technology Journal, Q3 2000.]]Google ScholarGoogle Scholar
  34. Michael J. Voss and Rudolf Eigemann. High-level adaptive program optimization with adapt. In Principles and Practices of Parallel Programming, 2001.]] Google ScholarGoogle ScholarDigital LibraryDigital Library
  35. M. Warren, E. Weigle, and W. Feng. High-density computing: A 240-node beowulf in one cubic meter. In Supercomputing, November 2002.]] Google ScholarGoogle ScholarDigital LibraryDigital Library
  36. M. Weiser, B. Welch, A. J. Demers, and S. Shenker. Scheduling for reduced CPU energy. In Operating Systems Design and Implementation, November 1994.]] Google ScholarGoogle ScholarDigital LibraryDigital Library
  37. R. Clint Whaley, Antoine Petitet, and Jack J. Dongarra. Automated empirical optimizations of software and the ATLAS project. Parallel Computing, 27(1--2):3--35, 2001.]]Google ScholarGoogle Scholar
  38. R. Clint Whaley and David B. Whalley. Tuning high performance kernels through empirical compilation. In International Conference on Parallel Processing, June 2005.]] Google ScholarGoogle ScholarDigital LibraryDigital Library
  39. Tao Yang, Xiaosong Ma, and Frank Mueller. Predicting parallel applications' performance across platforms using partial execution. In Supercomputing (to appear), November 2005.]]Google ScholarGoogle Scholar
  40. Heng Zeng, Carla S. Ellis, Alvin R. Lebeck, and Amin Vahdat. Currentcy: Unifying policies for resource management. In USENIX Annual Technical Conference, June 2003.]] Google ScholarGoogle ScholarDigital LibraryDigital Library
  41. Qingbo Zhu, Francis M. David, Christo Devaraj, Zhenmin Li, Yuanyuan Zhou, and Pei Cao. Reducing energy consumption of disk storage using power-aware cache management. In High-Performance Computer Architecture, February 2004.]] Google ScholarGoogle ScholarDigital LibraryDigital Library

Index Terms

  1. Minimizing execution time in MPI programs on an energy-constrained, power-scalable cluster

    Recommendations

    Comments

    Login options

    Check if you have access through your login credentials or your institution to get full access on this article.

    Sign in
    • Published in

      cover image ACM Conferences
      PPoPP '06: Proceedings of the eleventh ACM SIGPLAN symposium on Principles and practice of parallel programming
      March 2006
      258 pages
      ISBN:1595931899
      DOI:10.1145/1122971

      Copyright © 2006 ACM

      Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

      Publisher

      Association for Computing Machinery

      New York, NY, United States

      Publication History

      • Published: 29 March 2006

      Permissions

      Request permissions about this article.

      Request Permissions

      Check for updates

      Qualifiers

      • Article

      Acceptance Rates

      Overall Acceptance Rate230of1,014submissions,23%

    PDF Format

    View or Download as a PDF file.

    PDF

    eReader

    View online with eReader.

    eReader