ABSTRACT
Cloud computing offers users the ability to access large pools of computational and storage resources on demand. Multiple commercial clouds already allow businesses to replace, or supplement, privately owned IT assets, alleviating them from the burden of managing and maintaining these facilities. However, there are issues that must be addressed before this vision of utility computing can be fully realized. In existing systems, customers are charged based upon the amount of resources used or reserved, but no guarantees are made regarding the application level performance or quality-of-service (QoS) that the given resources will provide. As cloud providers continue to utilize virtualization technologies in their systems, this can become problematic. In particular, the consolidation of multiple customer applications onto multicore servers introduces performance interference between collocated workloads, significantly impacting application QoS. To address this challenge, we advocate that the cloud should transparently provision additional resources as necessary to achieve the performance that customers would have realized if they were running in isolation. Accordingly, we have developed Q-Clouds, a QoS-aware control framework that tunes resource allocations to mitigate performance interference effects. Q-Clouds uses online feedback to build a multi-input multi-output (MIMO) model that captures performance interference interactions, and uses it to perform closed loop resource management. In addition, we utilize this functionality to allow applications to specify multiple levels of QoS as application Q-states. For such applications, Q-Clouds dynamically provisions underutilized resources to enable elevated QoS levels, thereby improving system efficiency. Experimental evaluations of our solution using benchmark applications illustrate the benefits: performance interference is mitigated completely when feasible, and system utilization is improved by up to 35% using Q-states.
- T. Abdelzaher, K. Shin, and N. Bhatti. Performance guarantees for web server end--systems: A control-theoretical approach. IEEE Transactions on Parallel and Distributed Systems, 13(1), 2002. Google ScholarDigital Library
- Amazon Elastic Compute Cloud. http://aws.amazon.com/ec2.Google Scholar
- P. Barham, B. Dragovic, K. Fraser, S. Hand, T. Harris, A. Ho, R. Neugebauer, I. Pratt, and A. Warfield. Xen and the art of virtualization. In Proceedings of the ACM Symposium on Operating Systems Principles (SOSP), 2003. Google ScholarDigital Library
- N. Bobroff, A. Kochut, and K. Beaty. Dynamic placement of virtual machines for managing sla violations. In Proceedings of the 10th IFIP/IEEE International Symposium on Integrated Network Management (IM), 2007.Google ScholarCross Ref
- A. Byde, M. Salle, and C. Bartolini. Market-based resource allocation for utility data centers. Technical Report HPL-2003-188, HP Laboratories, September 2003.Google Scholar
- C. Clark, K. Fraser, S. Hand, J. G. Hansen, E. Jul, C. Limpach, I. Pratt, and A. Warfield. Live migration of virtual machines. In Proceedings of the 2nd ACM/USENIX Symposium on Networked Systems Design and Implementation (NSDI), May 2005. Google ScholarDigital Library
- Y. Dong, Z. Yu, and G. Rose. Sr-iov networking in xen: Architecture, design and implementation. In Proceedings of the First Workshop on I/O Virtualization (WIOV), December 2008. Google ScholarDigital Library
- A. Fedorova, M. Seltzer, and M. Smith. Improving performance isolation on chip multiprocessors via an operating system scheduler. In Proceedings of the International Conference on Parallel Architectures and Compilation Techniques (PACT), September 2007. Google ScholarDigital Library
- Google App Engine. http://code.google.com/appengine.Google Scholar
- A. Gulati, I. Ahmad, and C. A. Waldspurger. Parda: Proportional allocation of resources for distributed storage access. In Proceedings of the USENIX Conference on File and Storage Technologies (FAST), February 2009. Google ScholarDigital Library
- F. Hermenier, X. Lorca, J.-M. Menaud, G. Muller, and J. Lawall. Entropy: a consolidation manager for clusters. In Proceedings of the International Conference on Virtual Execution Environments (VEE), 2009. Google ScholarDigital Library
- C. Isci, G. Contreras, and M. Martonosi. Live, runtime phase monitoring and prediction on real systems with application to dynamic power management. In Proceedings of the International Symposium on Microarchitecture (MICRO), December 2006. Google ScholarDigital Library
- R. Iyer, R. Illikkal, O. Tickoo, L. Zhao, P. Apparao, and D. Newell. Vm3: Measuring, modeling and managing vm shared resources. Journal of Computer Networks, 53(17), 2009. Google ScholarDigital Library
- A. Kansal, J. Liu, A. Singh, R. Nathuji, and T. Abdelzaher. Semantic-less coordination of power management and application performance. In Workshop on Power Aware Computing and Systems (HotPower), October 2009.Google Scholar
- G. Khanna, K. Beaty, G. Kar, and A. Kochut. Application performance management in virtualized server environments. In Proceedings of the 10th IEEE/IFIP Network Operations and Management Symposium (NOMS), 2006.Google ScholarCross Ref
- Y. Koh, R. Knauerhase, P. Brett, M. Bowman, Z. Wen, and C. Pu. An analysis of performance interference effects in virtual environments. In IEEE International Symposium on Performance Analysis of Systems and Software (ISPASS), pages 200--209, April 2007.Google ScholarCross Ref
- J. Lin, Q. Lu, X. Ding, Z. Zhang, X. Zhang, and P. Sadayappan. Gaining insights into multicore cache partitioning: Bridging the gap between simulation and real systems. In Proceedings of the International Symposium on High-Performance Computer Architecture (HPCA), 2008.Google Scholar
- Microsoft Azure Services Platform. http://www.microsoft.com/azure.Google Scholar
- T. Moscibroda and O. Mutlu. Memory performance attacks: Denial of memory service in multi-core systems. In Proceedings of the 16th USENIX Security Symposium, 2007. Google ScholarDigital Library
- D. G. Murray, G. Milos, and S. Hand. Improving xen security through disaggregation. In Proceedings of the International Conference on Virtual Execution Environments (VEE), 2008. Google ScholarDigital Library
- R. Nathuji and K. Schwan. Virtualpower: Coordinated power management in virtualized enterprise systems. In Proceedings of the 21st ACM Symposium on Operating Systems Principles (SOSP), October 2007. Google ScholarDigital Library
- R. Nathuji, C. Isci, and E. Gorbatov. Exploiting platform heterogeneity for power efficient data centers. In Proceedings of the IEEE International Conference on Autonomic Computing (ICAC), June 2007. Google ScholarDigital Library
- R. Nathuji, P. England, P. Sharma, and A. Singh. Feedback driven qos-aware power budgeting for virtualized servers. In Proceedings of the Workshop on Feedback Control Implementation and Design in Computing Systems and Networks (FeBID), April 2009.Google Scholar
- M. Norgaard, O. Ravn, N. K. Poulsen, and L. K. Hansen. Neural Networks for Modelling and Control of Dynamic Systems. Springer, April 2003. Google ScholarDigital Library
- P. Padala, K. Shin, X. Zhu, M. Uysal, Z. Wang, S. Singhal, A. Merchant, and K. Salem. Adaptive control of virtualized resources in utility computing environments. In Proceedings of the EuroSys Conference, 2007. Google ScholarDigital Library
- P. Padala, K.-Y. Hou, K. Shin, X. Zhu, M. Uysal, Z. Wang, S. Singhal, and A. Merchant. Automated control of multiple virtualized resources. In Proceedings of the EuroSys Conference, 2009. Google ScholarDigital Library
- M. Qureshi and Y. Patt. Utility-based cache partitioning: A low-overhead, high-performance, runtime mechanism to partition shared caches. In Proceedings of the International Symposium on Microarchitecture (MICRO), December 2006. Google ScholarDigital Library
- R. Raghavendra, P. Ranganathan, V. Talwar, Z. Wang, and X. Zhu. No "power" struggles: Coordinated multi-level power management for the data center. In Proceedings of the International Conference on Architectural Support for Programming Languages and Operating Systems (ASPLOS), March 2008. Google ScholarDigital Library
- H. Raj, R. Nathuji, A. Singh, and P. England. Resource management for isolation enhanced cloud services. In Proceedings of the Cloud Computing Security Workshop (CCSW), 2009. Google ScholarDigital Library
- D. Tullsen, S. Eggers, and H. Levy. Simultaneous multithreading: Maximizing on-chip parallelism. In Proceedings of the International Symposium on Computer Architecture (ISCA), 1995. Google ScholarDigital Library
- A. Verma, P. Ahuja, and A. Neogi. Power-aware dynamic placement of hpc applications. In Proceedings of the International Conference on Supercomputing (ICS), 2008. Google ScholarDigital Library
- VMware ESX. http://www.vmware.com/products/esx.Google Scholar
- M.Wachs, M. Abd-El-Malek, E. Thereska, and G. R. Ganger. Argon: performance insulation for shared storage servers. In Proceedings of the USENIX Conference on File and Storage Technologies (FAST), February 2007. Google ScholarDigital Library
- X.Wang and Y.Wang. Co-con: Coordinated control of power and application performance for virtualized server clusters. In Proceedings of the 17th IEEE International Workshop on Quality of Service (IWQoS), Charleston, South Carolina, July 2009.Google Scholar
- Windows Server 2008 R2 Hyper-V. http://www.microsoft.com/hyperv.Google Scholar
- Y. Xie and G. H. Loh. Pipp: Promotion/insertion pseudopartitioning of multi-core shared caches. In Proceedings of the International Symposium on Computer Architecture (ISCA),June 2009. Google ScholarDigital Library
- X. Zhang, S. Dwarkadas, and K. Shen. Towards practical page coloring-based multicore cache management. In Proceedings of the EuroSys Conference, March 2009. Google ScholarDigital Library
- L. Zhao, R. Iyer, R. Illikkal, J. Moses, S. Makineni, and D. Newell. Cachescouts: Fine-rain monitoring of shared caches in cmp platforms. In Proceedings of the International Conference on Parallel Architectures and Compilation Techniques (PACT), September 2007. Google ScholarDigital Library
- S. Zhuravlev, S. Blagodurov, and A. Fedorova. Addressing shared resource contention in multicore processors via scheduling. In Proceedings of the International Conference on Architectural Support for Programming Languages and Operating Systems (ASPLOS), 2010. Google ScholarDigital Library
Index Terms
- Q-clouds: managing performance interference effects for QoS-aware clouds
Recommendations
Challenges in real-time virtualization and predictable cloud computing
Cloud computing and virtualization technology have revolutionized general-purpose computing applications in the past decade. The cloud paradigm offers advantages through reduction of operation costs, server consolidation, flexible system configuration ...
Guest Editors' introduction: Special section on optimizing the cloud
Cloud computing is defined as a pool of virtualized computer resources. Based on this virtualization, the cloud computing paradigm allows workloads to be deployed and scaled-out quickly through the rapid provisioning of virtual machines or physical ...
Performance Analysis of Network I/O Workloads in Virtualized Data Centers
Server consolidation and application consolidation through virtualization are key performance optimizations in cloud-based service delivery industry. In this paper, we argue that it is important for both cloud consumers and cloud providers to understand ...
Comments