skip to main content
10.1145/2038916.2038921acmconferencesArticle/Chapter ViewAbstractPublication PagesmodConference Proceedingsconference-collections
research-article

CloudScale: elastic resource scaling for multi-tenant cloud systems

Published:26 October 2011Publication History

ABSTRACT

Elastic resource scaling lets cloud systems meet application service level objectives (SLOs) with minimum resource provisioning costs. In this paper, we present CloudScale, a system that automates fine-grained elastic resource scaling for multi-tenant cloud computing infrastructures. CloudScale employs online resource demand prediction and prediction error handling to achieve adaptive resource allocation without assuming any prior knowledge about the applications running inside the cloud. CloudScale can resolve scaling conflicts between applications using migration, and integrates dynamic CPU voltage/frequency scaling to achieve energy savings with minimal effect on application SLOs. We have implemented CloudScale on top of Xen and conducted extensive experiments using a set of CPU and memory intensive applications (RUBiS, Hadoop, IBM System S). The results show that CloudScale can achieve significantly higher SLO conformance than other alternatives with low resource and energy cost. CloudScale is non-intrusive and light-weight, and imposes negligible overhead (< 2% CPU in Domain 0) to the virtualized computing cluster.

References

  1. Amazon Elastic Compute Cloud. http://aws.amazon.com/ec2/.Google ScholarGoogle Scholar
  2. Apache Hadoop System. http://hadoop.apache.org/core/.Google ScholarGoogle Scholar
  3. KVM (Kernel-based Virtual Machine). http://www.linux-kvm.org/page/Main_Page.Google ScholarGoogle Scholar
  4. RUBiS Online Auction System. http://rubis.ow2.org/.Google ScholarGoogle Scholar
  5. The IRCache Project. http://www.ircache.net/.Google ScholarGoogle Scholar
  6. Virtual Computing Lab. http://vcl.ncsu.edu/.Google ScholarGoogle Scholar
  7. VMware Virtualization Technology. http://www.vmware.com/.Google ScholarGoogle Scholar
  8. Xen Credit Scheduler. http://wiki.xensource.com/xenwiki/CreditScheduler.Google ScholarGoogle Scholar
  9. M. Armbrust, A. Fox, D. A. Patterson, N. Lanham, B. Trushkowsky, J. Trutna, and H. Oh. Scads: Scale-independent storage for social computing applications. In Proc. CIDR, 2009.Google ScholarGoogle Scholar
  10. P. Barham and et al. Xen and the art of virtualization. In Proc. SOSP, 2003. Google ScholarGoogle ScholarDigital LibraryDigital Library
  11. D. Breitgand, M. B.-Yehuda, M. Factor, H. Kolodner, V. Kravtsov, and D. Pelleg. NAP: a building block for remediating performance bottlenecks via black box network analysis. In Proc. ICAC, 2009. Google ScholarGoogle ScholarDigital LibraryDigital Library
  12. A. Chandra, W. Gong, and P. Shenoy. Dynamic resource allocation for shared data centers using online measurements. In Proc. IWQoS, 2004. Google ScholarGoogle ScholarDigital LibraryDigital Library
  13. J. Chase, D. Anderson, P. N. Thakar, and A. M. Vahdat. Managing energy and server resources in hosting centers. In Proc. SOSP, 2001. Google ScholarGoogle ScholarDigital LibraryDigital Library
  14. C. Clark, K. Fraser, S. Hand, J. G. Hansen, E. Jul, C. Limpach, I. Pratt, and A. Warfield. Live migration of virtual machines. In Proc. NSDI, 2005. Google ScholarGoogle ScholarDigital LibraryDigital Library
  15. J. Dean and S. Ghemawat. MapReduce: Simplified data processing on large clusters. Dec. 2004.Google ScholarGoogle Scholar
  16. G. Dhiman and T. S. Rosing. Dynamic voltage frequency scaling for multi-tasking systems using online learning. In Proc. ISLPED, 2007. Google ScholarGoogle ScholarDigital LibraryDigital Library
  17. R. Doyle, J. Chase, O. Asad, W. Jin, and A. Vahdat. Model-based resource provisioning in a web service utility. In Proc. USITS, 2003. Google ScholarGoogle ScholarDigital LibraryDigital Library
  18. X. Fan, W.-D. Weber, and L. A. Barroso. Power provisioning for a warehouse-sized computer. In Proc. ISCA, 2007. Google ScholarGoogle ScholarDigital LibraryDigital Library
  19. J. Flinn and M. Satyanarayanan. Energy-aware adaptation for mobile applications. In Proc. SOSP, 1999. Google ScholarGoogle ScholarDigital LibraryDigital Library
  20. A. Ganapathi, H. Kuno, and et al. Predicting multiple metrics for queries: Better decisions enabled by machine learning. In Proc. ICDE, 2009. Google ScholarGoogle ScholarDigital LibraryDigital Library
  21. B. Gedik, H. Andrade, K.-L. Wu, P. S. Yu, and M. Doo. SPADE: the System S declarative stream processing engine. Proc. SIGMOD, 2008. Google ScholarGoogle ScholarDigital LibraryDigital Library
  22. D. Gmach, J. Rolia, L. Cherkasova, and A. Kemper. Capacity management and demand prediction for next generation data centers. In Proc. ICWS, 2007.Google ScholarGoogle ScholarCross RefCross Ref
  23. Z. Gong and X. Gu. PAC: Pattern-driven Application Consolidation for Efficient Cloud Computing. In Proc. MASCOTS, 2010. Google ScholarGoogle ScholarDigital LibraryDigital Library
  24. Z. Gong, X. Gu, and J. Wilkes. PRESS: PRedictive Elastic ReSource Scaling for Cloud Systems. In Proc. CNSM, 2010.Google ScholarGoogle Scholar
  25. S. Govindan, J. Choi, and et al. Statistical profiling-based techniques for effective power provisioning in data centers. In Proc. Eurosys, 2009. Google ScholarGoogle ScholarDigital LibraryDigital Library
  26. B. Guenter, N. Jain, and C. Williams. Managing cost, performance, and reliability tradeoffs for energy-aware server provisioning. In Proc. INFOCOM, 2011.Google ScholarGoogle ScholarCross RefCross Ref
  27. F. Hermenier, X. Lorca, J.-M. Menaud, G. Muller, and J. Lawall. Entropy: a consolidation manager for clusters. In Proc. VEE, 2009. Google ScholarGoogle ScholarDigital LibraryDigital Library
  28. C.-H. Hsu and W.-C. Feng. A power-aware run-time system for high-performance computing. In Proc. SC, 2005. Google ScholarGoogle ScholarDigital LibraryDigital Library
  29. E. Kalyvianaki, T. Charalambous, and S. Hand. Self-adaptive and self-configured CPU resource provisioning for virtualized servers using Kalman filters. In Proc. ICAC, 2009. Google ScholarGoogle ScholarDigital LibraryDigital Library
  30. H. Lim, S. Babu, and J. Chase. Automated control for elastic storage. In Proc. ICAC, 2010. Google ScholarGoogle ScholarDigital LibraryDigital Library
  31. P. Padala and et al. Adaptive control of virtualized resources in utility computing environments. In Proc. Eurosys, 2007. Google ScholarGoogle ScholarDigital LibraryDigital Library
  32. P. Padala, K.-Y. Hou, K. G. Shin, X. Zhu, M. Uysal, Z. Wang, S. Singhal, and A. Merchant. Automated control of multiple virtualized resources. In Proc. Eurosys, 2009. Google ScholarGoogle ScholarDigital LibraryDigital Library
  33. J. Rolia, L. Cherkasova, M. Arlitt, and V. Machiraju. Supporting application QoS in shared resource pools. Communications of the ACM, 2006. Google ScholarGoogle ScholarDigital LibraryDigital Library
  34. P. Shivam, S. Babu, and J. Chase. Learning application models for utility resource planning. In Proc. USITS, 2003.Google ScholarGoogle Scholar
  35. P. Shivam, S. Babu, and J. Chase. Active and accelerated learning of cost models for optimizing scientific applications. In Proc. VLDB, 2006. Google ScholarGoogle ScholarDigital LibraryDigital Library
  36. S. S. Parekh, N. Gandhi, J. L. Hellerstein, D. M. Tilbury, T. Jayram, and J. P. Bigus. Using control theory to achieve service level objectives in performance management. In Real Time Systems, 2002. Google ScholarGoogle ScholarDigital LibraryDigital Library
  37. C. Stewart, T. Kelly, A. Zhang, and K. Shen. A dollar from 15 cents: cross-platform management for internet services. In Proc. USENIX Annual Technical Conference, 2008. Google ScholarGoogle ScholarDigital LibraryDigital Library
  38. B. Urgaonkar, M. S. G. Pacifici, P. J. Shenoy, and A. N. Tantawi. An analytical model for multi-tier internet services and its applications. In Proc. SIGMETRICS, 2005. Google ScholarGoogle ScholarDigital LibraryDigital Library
  39. B. Urgaonkar, P. Shenoy, and et al. Resource overbooking and application profiling in shared hosting platforms. In Proc. OSDI, 2002. Google ScholarGoogle ScholarDigital LibraryDigital Library
  40. T. Wood, L. Cherkasova, and et al. Profiling and modeling resource usage of virtualized applications. In Proc. Middleware, 2008. Google ScholarGoogle ScholarDigital LibraryDigital Library
  41. T. Wood, P. J. Shenoy, A. Venkataramani, and M. S. Yousif. Black-box and gray-box strategies for virtual machine migration. In Proc. NSDI, 2007. Google ScholarGoogle ScholarDigital LibraryDigital Library
  42. W. Yuan and K. Nahrstedt. Energy-efficient soft real-time CPU scheduling for mobile multimedia systems. In Proc. SOSP, 2003. Google ScholarGoogle ScholarDigital LibraryDigital Library
  43. W. Zheng, R. Bianchini, and et al. JustRunIt: Experiment-based management of virtualized data centers. In Proc. USENIX Annual Technical Conference, 2009. Google ScholarGoogle ScholarDigital LibraryDigital Library
  44. X. Zhu and et al. 1000 Islands: integrated capacity and workload management for the next generation data center. In Proc. ICAC, June 2008. Google ScholarGoogle ScholarDigital LibraryDigital Library
  45. X. Zhu, Z. Wang, and S. Singhal. Utility-driven workload management using nested control design. In Proc. American Control Conference, 2006.Google ScholarGoogle Scholar

Index Terms

  1. CloudScale: elastic resource scaling for multi-tenant cloud systems

        Recommendations

        Comments

        Login options

        Check if you have access through your login credentials or your institution to get full access on this article.

        Sign in
        • Published in

          cover image ACM Conferences
          SOCC '11: Proceedings of the 2nd ACM Symposium on Cloud Computing
          October 2011
          377 pages
          ISBN:9781450309769
          DOI:10.1145/2038916

          Copyright © 2011 ACM

          Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

          Publisher

          Association for Computing Machinery

          New York, NY, United States

          Publication History

          • Published: 26 October 2011

          Permissions

          Request permissions about this article.

          Request Permissions

          Check for updates

          Qualifiers

          • research-article

          Acceptance Rates

          Overall Acceptance Rate169of722submissions,23%

        PDF Format

        View or Download as a PDF file.

        PDF

        eReader

        View online with eReader.

        eReader