skip to main content
10.1145/1755913.1755938acmconferencesArticle/Chapter ViewAbstractPublication PageseurosysConference Proceedingsconference-collections
research-article

Q-clouds: managing performance interference effects for QoS-aware clouds

Published:13 April 2010Publication History

ABSTRACT

Cloud computing offers users the ability to access large pools of computational and storage resources on demand. Multiple commercial clouds already allow businesses to replace, or supplement, privately owned IT assets, alleviating them from the burden of managing and maintaining these facilities. However, there are issues that must be addressed before this vision of utility computing can be fully realized. In existing systems, customers are charged based upon the amount of resources used or reserved, but no guarantees are made regarding the application level performance or quality-of-service (QoS) that the given resources will provide. As cloud providers continue to utilize virtualization technologies in their systems, this can become problematic. In particular, the consolidation of multiple customer applications onto multicore servers introduces performance interference between collocated workloads, significantly impacting application QoS. To address this challenge, we advocate that the cloud should transparently provision additional resources as necessary to achieve the performance that customers would have realized if they were running in isolation. Accordingly, we have developed Q-Clouds, a QoS-aware control framework that tunes resource allocations to mitigate performance interference effects. Q-Clouds uses online feedback to build a multi-input multi-output (MIMO) model that captures performance interference interactions, and uses it to perform closed loop resource management. In addition, we utilize this functionality to allow applications to specify multiple levels of QoS as application Q-states. For such applications, Q-Clouds dynamically provisions underutilized resources to enable elevated QoS levels, thereby improving system efficiency. Experimental evaluations of our solution using benchmark applications illustrate the benefits: performance interference is mitigated completely when feasible, and system utilization is improved by up to 35% using Q-states.

References

  1. T. Abdelzaher, K. Shin, and N. Bhatti. Performance guarantees for web server end--systems: A control-theoretical approach. IEEE Transactions on Parallel and Distributed Systems, 13(1), 2002. Google ScholarGoogle ScholarDigital LibraryDigital Library
  2. Amazon Elastic Compute Cloud. http://aws.amazon.com/ec2.Google ScholarGoogle Scholar
  3. P. Barham, B. Dragovic, K. Fraser, S. Hand, T. Harris, A. Ho, R. Neugebauer, I. Pratt, and A. Warfield. Xen and the art of virtualization. In Proceedings of the ACM Symposium on Operating Systems Principles (SOSP), 2003. Google ScholarGoogle ScholarDigital LibraryDigital Library
  4. N. Bobroff, A. Kochut, and K. Beaty. Dynamic placement of virtual machines for managing sla violations. In Proceedings of the 10th IFIP/IEEE International Symposium on Integrated Network Management (IM), 2007.Google ScholarGoogle ScholarCross RefCross Ref
  5. A. Byde, M. Salle, and C. Bartolini. Market-based resource allocation for utility data centers. Technical Report HPL-2003-188, HP Laboratories, September 2003.Google ScholarGoogle Scholar
  6. C. Clark, K. Fraser, S. Hand, J. G. Hansen, E. Jul, C. Limpach, I. Pratt, and A. Warfield. Live migration of virtual machines. In Proceedings of the 2nd ACM/USENIX Symposium on Networked Systems Design and Implementation (NSDI), May 2005. Google ScholarGoogle ScholarDigital LibraryDigital Library
  7. Y. Dong, Z. Yu, and G. Rose. Sr-iov networking in xen: Architecture, design and implementation. In Proceedings of the First Workshop on I/O Virtualization (WIOV), December 2008. Google ScholarGoogle ScholarDigital LibraryDigital Library
  8. A. Fedorova, M. Seltzer, and M. Smith. Improving performance isolation on chip multiprocessors via an operating system scheduler. In Proceedings of the International Conference on Parallel Architectures and Compilation Techniques (PACT), September 2007. Google ScholarGoogle ScholarDigital LibraryDigital Library
  9. Google App Engine. http://code.google.com/appengine.Google ScholarGoogle Scholar
  10. A. Gulati, I. Ahmad, and C. A. Waldspurger. Parda: Proportional allocation of resources for distributed storage access. In Proceedings of the USENIX Conference on File and Storage Technologies (FAST), February 2009. Google ScholarGoogle ScholarDigital LibraryDigital Library
  11. F. Hermenier, X. Lorca, J.-M. Menaud, G. Muller, and J. Lawall. Entropy: a consolidation manager for clusters. In Proceedings of the International Conference on Virtual Execution Environments (VEE), 2009. Google ScholarGoogle ScholarDigital LibraryDigital Library
  12. C. Isci, G. Contreras, and M. Martonosi. Live, runtime phase monitoring and prediction on real systems with application to dynamic power management. In Proceedings of the International Symposium on Microarchitecture (MICRO), December 2006. Google ScholarGoogle ScholarDigital LibraryDigital Library
  13. R. Iyer, R. Illikkal, O. Tickoo, L. Zhao, P. Apparao, and D. Newell. Vm3: Measuring, modeling and managing vm shared resources. Journal of Computer Networks, 53(17), 2009. Google ScholarGoogle ScholarDigital LibraryDigital Library
  14. A. Kansal, J. Liu, A. Singh, R. Nathuji, and T. Abdelzaher. Semantic-less coordination of power management and application performance. In Workshop on Power Aware Computing and Systems (HotPower), October 2009.Google ScholarGoogle Scholar
  15. G. Khanna, K. Beaty, G. Kar, and A. Kochut. Application performance management in virtualized server environments. In Proceedings of the 10th IEEE/IFIP Network Operations and Management Symposium (NOMS), 2006.Google ScholarGoogle ScholarCross RefCross Ref
  16. Y. Koh, R. Knauerhase, P. Brett, M. Bowman, Z. Wen, and C. Pu. An analysis of performance interference effects in virtual environments. In IEEE International Symposium on Performance Analysis of Systems and Software (ISPASS), pages 200--209, April 2007.Google ScholarGoogle ScholarCross RefCross Ref
  17. J. Lin, Q. Lu, X. Ding, Z. Zhang, X. Zhang, and P. Sadayappan. Gaining insights into multicore cache partitioning: Bridging the gap between simulation and real systems. In Proceedings of the International Symposium on High-Performance Computer Architecture (HPCA), 2008.Google ScholarGoogle Scholar
  18. Microsoft Azure Services Platform. http://www.microsoft.com/azure.Google ScholarGoogle Scholar
  19. T. Moscibroda and O. Mutlu. Memory performance attacks: Denial of memory service in multi-core systems. In Proceedings of the 16th USENIX Security Symposium, 2007. Google ScholarGoogle ScholarDigital LibraryDigital Library
  20. D. G. Murray, G. Milos, and S. Hand. Improving xen security through disaggregation. In Proceedings of the International Conference on Virtual Execution Environments (VEE), 2008. Google ScholarGoogle ScholarDigital LibraryDigital Library
  21. R. Nathuji and K. Schwan. Virtualpower: Coordinated power management in virtualized enterprise systems. In Proceedings of the 21st ACM Symposium on Operating Systems Principles (SOSP), October 2007. Google ScholarGoogle ScholarDigital LibraryDigital Library
  22. R. Nathuji, C. Isci, and E. Gorbatov. Exploiting platform heterogeneity for power efficient data centers. In Proceedings of the IEEE International Conference on Autonomic Computing (ICAC), June 2007. Google ScholarGoogle ScholarDigital LibraryDigital Library
  23. R. Nathuji, P. England, P. Sharma, and A. Singh. Feedback driven qos-aware power budgeting for virtualized servers. In Proceedings of the Workshop on Feedback Control Implementation and Design in Computing Systems and Networks (FeBID), April 2009.Google ScholarGoogle Scholar
  24. M. Norgaard, O. Ravn, N. K. Poulsen, and L. K. Hansen. Neural Networks for Modelling and Control of Dynamic Systems. Springer, April 2003. Google ScholarGoogle ScholarDigital LibraryDigital Library
  25. P. Padala, K. Shin, X. Zhu, M. Uysal, Z. Wang, S. Singhal, A. Merchant, and K. Salem. Adaptive control of virtualized resources in utility computing environments. In Proceedings of the EuroSys Conference, 2007. Google ScholarGoogle ScholarDigital LibraryDigital Library
  26. P. Padala, K.-Y. Hou, K. Shin, X. Zhu, M. Uysal, Z. Wang, S. Singhal, and A. Merchant. Automated control of multiple virtualized resources. In Proceedings of the EuroSys Conference, 2009. Google ScholarGoogle ScholarDigital LibraryDigital Library
  27. M. Qureshi and Y. Patt. Utility-based cache partitioning: A low-overhead, high-performance, runtime mechanism to partition shared caches. In Proceedings of the International Symposium on Microarchitecture (MICRO), December 2006. Google ScholarGoogle ScholarDigital LibraryDigital Library
  28. R. Raghavendra, P. Ranganathan, V. Talwar, Z. Wang, and X. Zhu. No "power" struggles: Coordinated multi-level power management for the data center. In Proceedings of the International Conference on Architectural Support for Programming Languages and Operating Systems (ASPLOS), March 2008. Google ScholarGoogle ScholarDigital LibraryDigital Library
  29. H. Raj, R. Nathuji, A. Singh, and P. England. Resource management for isolation enhanced cloud services. In Proceedings of the Cloud Computing Security Workshop (CCSW), 2009. Google ScholarGoogle ScholarDigital LibraryDigital Library
  30. D. Tullsen, S. Eggers, and H. Levy. Simultaneous multithreading: Maximizing on-chip parallelism. In Proceedings of the International Symposium on Computer Architecture (ISCA), 1995. Google ScholarGoogle ScholarDigital LibraryDigital Library
  31. A. Verma, P. Ahuja, and A. Neogi. Power-aware dynamic placement of hpc applications. In Proceedings of the International Conference on Supercomputing (ICS), 2008. Google ScholarGoogle ScholarDigital LibraryDigital Library
  32. VMware ESX. http://www.vmware.com/products/esx.Google ScholarGoogle Scholar
  33. M.Wachs, M. Abd-El-Malek, E. Thereska, and G. R. Ganger. Argon: performance insulation for shared storage servers. In Proceedings of the USENIX Conference on File and Storage Technologies (FAST), February 2007. Google ScholarGoogle ScholarDigital LibraryDigital Library
  34. X.Wang and Y.Wang. Co-con: Coordinated control of power and application performance for virtualized server clusters. In Proceedings of the 17th IEEE International Workshop on Quality of Service (IWQoS), Charleston, South Carolina, July 2009.Google ScholarGoogle Scholar
  35. Windows Server 2008 R2 Hyper-V. http://www.microsoft.com/hyperv.Google ScholarGoogle Scholar
  36. Y. Xie and G. H. Loh. Pipp: Promotion/insertion pseudopartitioning of multi-core shared caches. In Proceedings of the International Symposium on Computer Architecture (ISCA),June 2009. Google ScholarGoogle ScholarDigital LibraryDigital Library
  37. X. Zhang, S. Dwarkadas, and K. Shen. Towards practical page coloring-based multicore cache management. In Proceedings of the EuroSys Conference, March 2009. Google ScholarGoogle ScholarDigital LibraryDigital Library
  38. L. Zhao, R. Iyer, R. Illikkal, J. Moses, S. Makineni, and D. Newell. Cachescouts: Fine-rain monitoring of shared caches in cmp platforms. In Proceedings of the International Conference on Parallel Architectures and Compilation Techniques (PACT), September 2007. Google ScholarGoogle ScholarDigital LibraryDigital Library
  39. S. Zhuravlev, S. Blagodurov, and A. Fedorova. Addressing shared resource contention in multicore processors via scheduling. In Proceedings of the International Conference on Architectural Support for Programming Languages and Operating Systems (ASPLOS), 2010. Google ScholarGoogle ScholarDigital LibraryDigital Library

Index Terms

  1. Q-clouds: managing performance interference effects for QoS-aware clouds

            Recommendations

            Comments

            Login options

            Check if you have access through your login credentials or your institution to get full access on this article.

            Sign in
            • Published in

              cover image ACM Conferences
              EuroSys '10: Proceedings of the 5th European conference on Computer systems
              April 2010
              388 pages
              ISBN:9781605585772
              DOI:10.1145/1755913

              Copyright © 2010 ACM

              Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

              Publisher

              Association for Computing Machinery

              New York, NY, United States

              Publication History

              • Published: 13 April 2010

              Permissions

              Request permissions about this article.

              Request Permissions

              Check for updates

              Qualifiers

              • research-article

              Acceptance Rates

              Overall Acceptance Rate241of1,308submissions,18%

            PDF Format

            View or Download as a PDF file.

            PDF

            eReader

            View online with eReader.

            eReader