skip to main content
column

On the energy (in)efficiency of Hadoop clusters

Published:12 March 2010Publication History
Skip Abstract Section

Abstract

Distributed processing frameworks, such as Yahoo!'s Hadoop and Google's MapReduce, have been successful at harnessing expansive datacenter resources for large-scale data analysis. However, their effect on datacenter energy efficiency has not been scrutinized. Moreover, the filesystem component of these frameworks effectively precludes scale-down of clusters deploying these frameworks (i.e. operating at reduced capacity). This paper presents our early work on modifying Hadoop to allow scale-down of operational clusters. We find that running Hadoop clusters in fractional configurations can save between 9% and 50% of energy consumption, and that there is a tradeoff between performance energy consumption. We also outline further research into the energy-efficiency of these frameworks.

References

  1. Lustre: A Scalable, High Performance File System. http://lustre.org/.Google ScholarGoogle Scholar
  2. Apache. Hadoop. http://hadoop.apache.org/.Google ScholarGoogle Scholar
  3. Luiz André Barroso and Urs Hölzle. The Case for Energy-Proportional Computing. Computer, 40(12), 2007. Google ScholarGoogle ScholarDigital LibraryDigital Library
  4. Standard Performance Evaluation Corporation. Specpower_ssj2008. http://www.spec.org/power_ssj2008/.Google ScholarGoogle Scholar
  5. Jeffrey Dean and Sanjay Ghemawat. MapReduce: Simplified Data Processing on Large Clusters. Commun. ACM, 2008. Google ScholarGoogle ScholarDigital LibraryDigital Library
  6. Chang Fay et al. Bigtable: A Distributed Storage System for Structured Data. OSDI, USENIX, 2006. Google ScholarGoogle ScholarDigital LibraryDigital Library
  7. Xiaobo Fan, W. Weber, and L.A. Barroso. Power Provisioning for a Warehouse-sized Computer. ISCA, ACM, 2007. Google ScholarGoogle ScholarDigital LibraryDigital Library
  8. Sanjay Ghemawat, Howard Gobioff, and Shun-Tak Leung. The Google File System. SIGOPS Oper. Syst. Rev., 2003. Google ScholarGoogle ScholarDigital LibraryDigital Library
  9. Intelligent Platform Management Interface. http://www.intel.com/design/servers/ipmi/.Google ScholarGoogle Scholar
  10. David Meisner, B.T. Gold, and T.F. Wenisch. PowerNap: Eliminating Server Idle Power. ASPLOS, ACM, 2009. Google ScholarGoogle ScholarDigital LibraryDigital Library
  11. Suzanne Rivoire, Parthasarathy Ranganathan, and Christos Kozyrakis. A Comparison of High-Level Full-System Power Models. HotPower, 2008. Google ScholarGoogle ScholarDigital LibraryDigital Library
  12. Amazon Web Services. http://aws.amazon.com/.Google ScholarGoogle Scholar
  13. VMotion. http://vmware.com/products/vi/vc/vmotion.html.Google ScholarGoogle Scholar

Index Terms

  1. On the energy (in)efficiency of Hadoop clusters
            Index terms have been assigned to the content through auto-classification.

            Recommendations

            Comments

            Login options

            Check if you have access through your login credentials or your institution to get full access on this article.

            Sign in

            Full Access

            • Published in

              cover image ACM SIGOPS Operating Systems Review
              ACM SIGOPS Operating Systems Review  Volume 44, Issue 1
              January 2010
              115 pages
              ISSN:0163-5980
              DOI:10.1145/1740390
              Issue’s Table of Contents

              Copyright © 2010 Copyright is held by the owner/author(s)

              Permission to make digital or hard copies of part or all of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for third-party components of this work must be honored. For all other uses, contact the Owner/Author.

              Publisher

              Association for Computing Machinery

              New York, NY, United States

              Publication History

              • Published: 12 March 2010

              Check for updates

              Qualifiers

              • column

            PDF Format

            View or Download as a PDF file.

            PDF

            eReader

            View online with eReader.

            eReader