DOI: 10.1145/2619239.2626334
Research article, free access

Multi-resource packing for cluster schedulers

Published: 17 August 2014

ABSTRACT

Tasks in modern data-parallel clusters have highly diverse resource requirements along CPU, memory, disk, and network. Any of these resources may become a bottleneck, so the likelihood of wasting resources due to fragmentation is now larger. Today's schedulers do not explicitly reduce fragmentation. Worse, since they allocate only cores and memory, the resources they ignore (disk and network) can be over-allocated, leading to interference, failures, and hogging of cores or memory that could have been used by other tasks. We present Tetris, a cluster scheduler that packs, i.e., matches multi-resource task requirements with the resource availabilities of machines, so as to increase cluster efficiency (makespan). Further, Tetris uses an analog of shortest-running-time-first to trade off cluster efficiency for speeding up individual jobs. Tetris' packing heuristics seamlessly work alongside a large class of fairness policies. Trace-driven simulations and deployment of our prototype on a 250-node cluster show median gains of 30% in job completion time while achieving nearly perfect fairness.
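The core packing idea in the abstract — matching a task's multi-resource demand vector against a machine's free-resource vector — can be illustrated with a small sketch. This is not the paper's implementation; the names, resource order, and the dot-product scoring rule below are illustrative assumptions about one way such an alignment heuristic could work.

```python
# Illustrative sketch of a multi-resource packing step (names and the
# exact scoring rule are assumptions, not the paper's code). Tasks and
# machines are vectors over resources: (cores, memory, disk, network).

def fits(demand, free):
    """A task fits on a machine if no resource demand exceeds availability."""
    return all(d <= f for d, f in zip(demand, free))

def alignment(demand, free):
    """Score a task by the dot product of its demand and the machine's
    free resources; high scores favor tasks that use what is abundant,
    which tends to reduce fragmentation."""
    return sum(d * f for d, f in zip(demand, free))

def pick_task(pending, free):
    """Among pending tasks that fit, pick the one with the best alignment."""
    feasible = [t for t in pending if fits(t["demand"], free)]
    if not feasible:
        return None
    return max(feasible, key=lambda t: alignment(t["demand"], free))

# One machine with 4 cores, 16 GB RAM, 100 MB/s disk, 50 MB/s network.
free = (4, 16, 100, 50)
pending = [
    {"id": "a", "demand": (2, 8, 10, 5)},   # CPU/memory heavy
    {"id": "b", "demand": (1, 2, 90, 5)},   # disk heavy
    {"id": "c", "demand": (8, 4, 10, 5)},   # needs 8 cores: does not fit
]
best = pick_task(pending, free)  # task "b" aligns best with abundant disk
```

The abstract also mentions trading off this packing score against a shortest-running-time-first analog to speed up individual jobs; one natural (assumed) way to do that is to rank tasks by a weighted sum of the alignment score and the job's remaining work.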


Published in
SIGCOMM '14: Proceedings of the 2014 ACM Conference on SIGCOMM
August 2014, 662 pages
ISBN: 9781450328364
DOI: 10.1145/2619239

          Copyright © 2014 ACM


Publisher
Association for Computing Machinery, New York, NY, United States




Acceptance Rates
SIGCOMM '14 paper acceptance rate: 45 of 242 submissions (19%). Overall acceptance rate: 554 of 3,547 submissions (16%).
