research-article

Q-clouds: managing performance interference effects for QoS-aware clouds

Authors:
Ripal Nathuji

Microsoft Research, Redmond, WA, USA

Microsoft Research, Redmond, WA, USA
View Profile

,
Aman Kansal

Microsoft Research, Redmond, WA, USA

Microsoft Research, Redmond, WA, USA
View Profile

,
Alireza Ghaffarkhah

University of New Mexico, Albuquerque, NM, USA

University of New Mexico, Albuquerque, NM, USA
View Profile

EuroSys '10: Proceedings of the 5th European conference on Computer systemsApril 2010Pages 237–250https://doi.org/10.1145/1755913.1755938

Published:13 April 2010Publication History

EuroSys '10: Proceedings of the 5th European conference on Computer systems

Pages 237–250

ABSTRACT

Cloud computing offers users the ability to access large pools of computational and storage resources on demand. Multiple commercial clouds already allow businesses to replace, or supplement, privately owned IT assets, alleviating them from the burden of managing and maintaining these facilities. However, there are issues that must be addressed before this vision of utility computing can be fully realized. In existing systems, customers are charged based upon the amount of resources used or reserved, but no guarantees are made regarding the application level performance or quality-of-service (QoS) that the given resources will provide. As cloud providers continue to utilize virtualization technologies in their systems, this can become problematic. In particular, the consolidation of multiple customer applications onto multicore servers introduces performance interference between collocated workloads, significantly impacting application QoS. To address this challenge, we advocate that the cloud should transparently provision additional resources as necessary to achieve the performance that customers would have realized if they were running in isolation. Accordingly, we have developed Q-Clouds, a QoS-aware control framework that tunes resource allocations to mitigate performance interference effects. Q-Clouds uses online feedback to build a multi-input multi-output (MIMO) model that captures performance interference interactions, and uses it to perform closed loop resource management. In addition, we utilize this functionality to allow applications to specify multiple levels of QoS as application Q-states. For such applications, Q-Clouds dynamically provisions underutilized resources to enable elevated QoS levels, thereby improving system efficiency. Experimental evaluations of our solution using benchmark applications illustrate the benefits: performance interference is mitigated completely when feasible, and system utilization is improved by up to 35% using Q-states.

References

T. Abdelzaher, K. Shin, and N. Bhatti. Performance guarantees for web server end--systems: A control-theoretical approach. IEEE Transactions on Parallel and Distributed Systems, 13(1), 2002. Google ScholarDigital Library
Amazon Elastic Compute Cloud. http://aws.amazon.com/ec2.Google Scholar
P. Barham, B. Dragovic, K. Fraser, S. Hand, T. Harris, A. Ho, R. Neugebauer, I. Pratt, and A. Warfield. Xen and the art of virtualization. In Proceedings of the ACM Symposium on Operating Systems Principles (SOSP), 2003. Google ScholarDigital Library
N. Bobroff, A. Kochut, and K. Beaty. Dynamic placement of virtual machines for managing sla violations. In Proceedings of the 10th IFIP/IEEE International Symposium on Integrated Network Management (IM), 2007.Google ScholarCross Ref
A. Byde, M. Salle, and C. Bartolini. Market-based resource allocation for utility data centers. Technical Report HPL-2003-188, HP Laboratories, September 2003.Google Scholar
C. Clark, K. Fraser, S. Hand, J. G. Hansen, E. Jul, C. Limpach, I. Pratt, and A. Warfield. Live migration of virtual machines. In Proceedings of the 2nd ACM/USENIX Symposium on Networked Systems Design and Implementation (NSDI), May 2005. Google ScholarDigital Library
Y. Dong, Z. Yu, and G. Rose. Sr-iov networking in xen: Architecture, design and implementation. In Proceedings of the First Workshop on I/O Virtualization (WIOV), December 2008. Google ScholarDigital Library
A. Fedorova, M. Seltzer, and M. Smith. Improving performance isolation on chip multiprocessors via an operating system scheduler. In Proceedings of the International Conference on Parallel Architectures and Compilation Techniques (PACT), September 2007. Google ScholarDigital Library
Google App Engine. http://code.google.com/appengine.Google Scholar
A. Gulati, I. Ahmad, and C. A. Waldspurger. Parda: Proportional allocation of resources for distributed storage access. In Proceedings of the USENIX Conference on File and Storage Technologies (FAST), February 2009. Google ScholarDigital Library
F. Hermenier, X. Lorca, J.-M. Menaud, G. Muller, and J. Lawall. Entropy: a consolidation manager for clusters. In Proceedings of the International Conference on Virtual Execution Environments (VEE), 2009. Google ScholarDigital Library
C. Isci, G. Contreras, and M. Martonosi. Live, runtime phase monitoring and prediction on real systems with application to dynamic power management. In Proceedings of the International Symposium on Microarchitecture (MICRO), December 2006. Google ScholarDigital Library
R. Iyer, R. Illikkal, O. Tickoo, L. Zhao, P. Apparao, and D. Newell. Vm3: Measuring, modeling and managing vm shared resources. Journal of Computer Networks, 53(17), 2009. Google ScholarDigital Library
A. Kansal, J. Liu, A. Singh, R. Nathuji, and T. Abdelzaher. Semantic-less coordination of power management and application performance. In Workshop on Power Aware Computing and Systems (HotPower), October 2009.Google Scholar
G. Khanna, K. Beaty, G. Kar, and A. Kochut. Application performance management in virtualized server environments. In Proceedings of the 10th IEEE/IFIP Network Operations and Management Symposium (NOMS), 2006.Google ScholarCross Ref
Y. Koh, R. Knauerhase, P. Brett, M. Bowman, Z. Wen, and C. Pu. An analysis of performance interference effects in virtual environments. In IEEE International Symposium on Performance Analysis of Systems and Software (ISPASS), pages 200--209, April 2007.Google ScholarCross Ref
J. Lin, Q. Lu, X. Ding, Z. Zhang, X. Zhang, and P. Sadayappan. Gaining insights into multicore cache partitioning: Bridging the gap between simulation and real systems. In Proceedings of the International Symposium on High-Performance Computer Architecture (HPCA), 2008.Google Scholar
Microsoft Azure Services Platform. http://www.microsoft.com/azure.Google Scholar
T. Moscibroda and O. Mutlu. Memory performance attacks: Denial of memory service in multi-core systems. In Proceedings of the 16th USENIX Security Symposium, 2007. Google ScholarDigital Library
D. G. Murray, G. Milos, and S. Hand. Improving xen security through disaggregation. In Proceedings of the International Conference on Virtual Execution Environments (VEE), 2008. Google ScholarDigital Library
R. Nathuji and K. Schwan. Virtualpower: Coordinated power management in virtualized enterprise systems. In Proceedings of the 21st ACM Symposium on Operating Systems Principles (SOSP), October 2007. Google ScholarDigital Library
R. Nathuji, C. Isci, and E. Gorbatov. Exploiting platform heterogeneity for power efficient data centers. In Proceedings of the IEEE International Conference on Autonomic Computing (ICAC), June 2007. Google ScholarDigital Library
R. Nathuji, P. England, P. Sharma, and A. Singh. Feedback driven qos-aware power budgeting for virtualized servers. In Proceedings of the Workshop on Feedback Control Implementation and Design in Computing Systems and Networks (FeBID), April 2009.Google Scholar
M. Norgaard, O. Ravn, N. K. Poulsen, and L. K. Hansen. Neural Networks for Modelling and Control of Dynamic Systems. Springer, April 2003. Google ScholarDigital Library
P. Padala, K. Shin, X. Zhu, M. Uysal, Z. Wang, S. Singhal, A. Merchant, and K. Salem. Adaptive control of virtualized resources in utility computing environments. In Proceedings of the EuroSys Conference, 2007. Google ScholarDigital Library
P. Padala, K.-Y. Hou, K. Shin, X. Zhu, M. Uysal, Z. Wang, S. Singhal, and A. Merchant. Automated control of multiple virtualized resources. In Proceedings of the EuroSys Conference, 2009. Google ScholarDigital Library
M. Qureshi and Y. Patt. Utility-based cache partitioning: A low-overhead, high-performance, runtime mechanism to partition shared caches. In Proceedings of the International Symposium on Microarchitecture (MICRO), December 2006. Google ScholarDigital Library
R. Raghavendra, P. Ranganathan, V. Talwar, Z. Wang, and X. Zhu. No "power" struggles: Coordinated multi-level power management for the data center. In Proceedings of the International Conference on Architectural Support for Programming Languages and Operating Systems (ASPLOS), March 2008. Google ScholarDigital Library
H. Raj, R. Nathuji, A. Singh, and P. England. Resource management for isolation enhanced cloud services. In Proceedings of the Cloud Computing Security Workshop (CCSW), 2009. Google ScholarDigital Library
D. Tullsen, S. Eggers, and H. Levy. Simultaneous multithreading: Maximizing on-chip parallelism. In Proceedings of the International Symposium on Computer Architecture (ISCA), 1995. Google ScholarDigital Library
A. Verma, P. Ahuja, and A. Neogi. Power-aware dynamic placement of hpc applications. In Proceedings of the International Conference on Supercomputing (ICS), 2008. Google ScholarDigital Library
VMware ESX. http://www.vmware.com/products/esx.Google Scholar
M.Wachs, M. Abd-El-Malek, E. Thereska, and G. R. Ganger. Argon: performance insulation for shared storage servers. In Proceedings of the USENIX Conference on File and Storage Technologies (FAST), February 2007. Google ScholarDigital Library
X.Wang and Y.Wang. Co-con: Coordinated control of power and application performance for virtualized server clusters. In Proceedings of the 17th IEEE International Workshop on Quality of Service (IWQoS), Charleston, South Carolina, July 2009.Google Scholar
Windows Server 2008 R2 Hyper-V. http://www.microsoft.com/hyperv.Google Scholar
Y. Xie and G. H. Loh. Pipp: Promotion/insertion pseudopartitioning of multi-core shared caches. In Proceedings of the International Symposium on Computer Architecture (ISCA),June 2009. Google ScholarDigital Library
X. Zhang, S. Dwarkadas, and K. Shen. Towards practical page coloring-based multicore cache management. In Proceedings of the EuroSys Conference, March 2009. Google ScholarDigital Library
L. Zhao, R. Iyer, R. Illikkal, J. Moses, S. Makineni, and D. Newell. Cachescouts: Fine-rain monitoring of shared caches in cmp platforms. In Proceedings of the International Conference on Parallel Architectures and Compilation Techniques (PACT), September 2007. Google ScholarDigital Library
S. Zhuravlev, S. Blagodurov, and A. Fedorova. Addressing shared resource contention in multicore processors via scheduling. In Proceedings of the International Conference on Architectural Support for Programming Languages and Operating Systems (ASPLOS), 2010. Google ScholarDigital Library

Index Terms

Q-clouds: managing performance interference effects for QoS-aware clouds

Recommendations

Challenges in real-time virtualization and predictable cloud computing

Cloud computing and virtualization technology have revolutionized general-purpose computing applications in the past decade. The cloud paradigm offers advantages through reduction of operation costs, server consolidation, flexible system configuration ...
Read More
Guest Editors' introduction: Special section on optimizing the cloud

Cloud computing is defined as a pool of virtualized computer resources. Based on this virtualization, the cloud computing paradigm allows workloads to be deployed and scaled-out quickly through the rapid provisioning of virtual machines or physical ...
Read More
Performance Analysis of Network I/O Workloads in Virtualized Data Centers

Server consolidation and application consolidation through virtualization are key performance optimizations in cloud-based service delivery industry. In this paper, we argue that it is important for both cloud consumers and cloud providers to understand ...
Read More

Comments

Login options

Check if you have access through your login credentials or your institution to get full access on this article.

Full Access

Get this Publication

Published in
EuroSys '10: Proceedings of the 5th European conference on Computer systems
April 2010
388 pages
ISBN:9781605585772
DOI:10.1145/1755913
General Chair:
Christine Morin
INRIA Rennes, France
,
Program Chair:
Gilles Muller
INRIA/LIP6, France
Copyright © 2010 ACM
Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]
Sponsors
In-Cooperation
Publisher
Association for Computing Machinery
New York, NY, United States
Publication History
- Published: 13 April 2010
Permissions
Request permissions about this article.
Request Permissions

Check for updates
Author Tags
cloud computing
resource management
virtualization
Qualifiers
- research-article
Conference

Acceptance Rates
Overall Acceptance Rate241of1,308submissions,18%
Funding Sources
Other Metrics
View Article Metrics

Article Metrics
- 438
  Total Citations
  View Citations
- 2,695
  Total Downloads
- Downloads (Last 12 months)68
- Downloads (Last 6 weeks)6
Other Metrics
View Author Metrics
Cited By
View all

PDF Format

View or Download as a PDF file.

PDF

eReader

View online with eReader.

eReader

Q-clouds: managing performance interference effects for QoS-aware clouds

EuroSys '10: Proceedings of the 5th European conference on Computer systems

ABSTRACT

References

Cited By

Index Terms

Recommendations

Challenges in real-time virtualization and predictable cloud computing

Guest Editors' introduction: Special section on optimizing the cloud

Performance Analysis of Network I/O Workloads in Virtualized Data Centers