skip to main content
10.1145/2494621.2494627acmotherconferencesArticle/Chapter ViewAbstractPublication PagescacConference Proceedingsconference-collections
research-article

Improving cloud infrastructure utilization through overbooking

Published:09 August 2013Publication History

ABSTRACT

Despite the potential given by the combination of multi-tenancy and virtualization, resource utilization in today's data centers is still low. We identify three key characteristics of cloud services and infrastructure as-a-service management practices: burstiness in service workloads, fluctuations in virtual machine resource usage over time, and virtual machines being limited to pre-defined sizes only. Based on these characteristics, we propose scheduling and admission control algorithms that incorporate resource overbooking to improve utilization. A combination of modeling, monitoring, and prediction techniques is used to avoid overpassing the total infrastructure capacity. A performance evaluation using a mixture of workload traces demonstrates the potential for significant improvements in resource utilization while still avoiding overpassing the total capacity.

References

  1. A. Ali-Eldin, J. Tordsson, and E. Elmroth. An adaptive hybrid elasticity controller for cloud infrastructures. In Proc. of Network Operations and Management Symposium (NOMS), pages 204--212. IEEE, 2012.Google ScholarGoogle ScholarCross RefCross Ref
  2. L. A. Barroso and U. Holzle. The case for energy-proportional computing. Computer, 40(12):33--37, 2007. Google ScholarGoogle ScholarDigital LibraryDigital Library
  3. A. Beloglazov and R. Buyya. Managing overloaded hosts for dynamic consolidation of virtual machines in cloud data centers under quality of service constraints. IEEE Transactions on Parallel and Distributed Systems, In press. Google ScholarGoogle ScholarDigital LibraryDigital Library
  4. G. Birkenheuer, A. Brinkmann, and H. Karl. The gain of overbooking. In E. Frachtenberg and U. Schwiegelshohn, editors, Job Scheduling Strategies for Parallel Processing, volume 5798 of LNCS, pages 80--100. 2009. Google ScholarGoogle ScholarDigital LibraryDigital Library
  5. D. Breitgand, Z. Dubitzky, A. Epstein, A. Glikson, and I. Shapira. Sla-aware resource over-commit in an iaas cloud. In Proc. of 8th Intl. Conference on Network and Service Management (CNSM), pages 73--81, 2012. Google ScholarGoogle ScholarDigital LibraryDigital Library
  6. W. Cirne and F. Berman. A comprehensive model of the supercomputer workload. In Proc. of the Intl. Workshop on Workload Characterization, pages 140--148, 2001. Google ScholarGoogle ScholarDigital LibraryDigital Library
  7. A. Corradi, M. Fanelli, and L. Foschini. VM Consolidation: a Real Case Based on OpenStack Cloud. Future Generation Computer Systems, In Press.Google ScholarGoogle Scholar
  8. M. Dobber, R. van der Mei, and G. Koole. A prediction method for job runtimes on shared processors: Survey, statistical analysis and new avenues. Performance Evaluation, 64(7-8):755--781, 2007. Google ScholarGoogle ScholarDigital LibraryDigital Library
  9. R. Ghosh and V. K. Naik. Biting off safely more than you can chew: Predictive analytics for resource over-commit in iaas cloud. In Proc. of 5th Intl. Conference on Cloud Computing, pages 25--32, 2012. Google ScholarGoogle ScholarDigital LibraryDigital Library
  10. D. Gmach, J. Rolia, and L. Cherkasova. Selling t-shirts and time shares in the cloud. In Proc. of Intl. Symposium on Cluster, Cloud and Grid Computing (CCGrid), pages 539--546, 2012. Google ScholarGoogle ScholarDigital LibraryDigital Library
  11. R. Guerin, H. Ahmadi, and M. Naghshineh. Equivalent capacity and its application to bandwidth allocation in high-speed networks. Selected Areas in Communications, 9(7):968--981, 1991. Google ScholarGoogle ScholarDigital LibraryDigital Library
  12. S. He, L. Guo, M. Ghanem, and Y. Guo. Improving resource utilisation in the cloud environment using multivariate probabilistic models. In Proc. of 5th Intl. Conference on Cloud Computing (CLOUD), pages 574--581, 2012. Google ScholarGoogle ScholarDigital LibraryDigital Library
  13. H. Jin, X. Shi, W. Qiang, and D. Zou. An adaptive meta-scheduler for data intensive applications. Intl. Journal of Grid and Utility Computing, 1(1):32--37, 2005. Google ScholarGoogle ScholarDigital LibraryDigital Library
  14. M. Kalantari and M. K. Akbari. Grid performance prediction using state-space model. Concurrency and Computation: Practice and Experience, 21(9):1109--1130, 2009. Google ScholarGoogle ScholarDigital LibraryDigital Library
  15. Y. C. Lee and A. Y. Zomaya. Energy efficient utilization of resources in cloud computing systems. The Journal of Supercomputing, 60(2):268--280, 2012. Google ScholarGoogle ScholarDigital LibraryDigital Library
  16. libvirt: The virtualization API. Web page at http://libvirt.org/, Visited 2013-03-13.Google ScholarGoogle Scholar
  17. LTTng Project. Linux Trace Toolkit - next generation. Web page at http://lttng.org/lttng2.0, Visited 2013-03-13.Google ScholarGoogle Scholar
  18. X. Meng, C. Isci, J. Kephart, L. Zhang, E. Bouillet, and D. Pendarakis. Efficient resource provisioning in compute clouds via VM multiplexing. In Proc. of the Intl. Conference on Autonomic Computing (ICAC), pages 11--20, 2010. Google ScholarGoogle ScholarDigital LibraryDigital Library
  19. Nagios - The Industry Standard in IT infrastructure Monitoring. Web page at http://www.nagios.org/, Visited 2013-03-13.Google ScholarGoogle Scholar
  20. Page view statistics for Wikimedia projects. Web page at http://dumps.wikimedia.org/other/pagecounts-raw/, Visited 2013-03-13.Google ScholarGoogle Scholar
  21. C. Reiss, A. Tumanov, G. R. Ganger, R. H. Katz, and M. A. Kozuch. Towards understanding heterogeneous clouds at scale:Google trace analysis. Technical Report ISTC-CC-TR-12-101, Carnegie Mellon University, Pittsburgh, PA, USA, Apr. 2012. http://www.istc-cc.cmu.edu/publications/papers/2012/ISTC-CC-TR-12-101.pdf.Google ScholarGoogle Scholar
  22. J. Subramanian, S. Stidham, and C. J. Lautenbacher. Airline yield management with overbooking, cancellations, and no-shows. Transportation Science, 33(2):147--167, 1999. Google ScholarGoogle ScholarDigital LibraryDigital Library
  23. A. Sulistio, K. H. Kim, and R. Buyya. Managing cancellations and no-shows of reservations with overbooking to increase resource revenue. In Proc. of 8th Intl. Symposium on Cluster Computing and the Grid (CCGrid), pages 267--276, 2008. Google ScholarGoogle ScholarDigital LibraryDigital Library
  24. L. Tomás, A. Caminero, C. Carrión, and B. Caminero. Exponential Smoothing for Network-aware Meta-scheduler in Advance in Grids. In Proc. of the 6th Intl. Workshop on Scheduling and Resource Management on Parallel and Distributed Systems (SRMPDS), pages 323--330, 2010. Google ScholarGoogle ScholarDigital LibraryDigital Library
  25. L. Tomás, A. C. Caminero, C. Carrión, and B. Caminero. Network-aware meta-scheduling in advance with autonomous self-tuning system. Future Generation Computer Systems, 27(5):486--497, 2011. Google ScholarGoogle ScholarDigital LibraryDigital Library
  26. B. Urgaonkar, P. Shenoy, and T. Roscoe. Resource overbooking and application profiling in shared hosting platforms. In ACM SIGOPS Operating Systems Design and Implementation (OSDI), pages 239--254. ACM, 2002. Google ScholarGoogle ScholarDigital LibraryDigital Library

Index Terms

  1. Improving cloud infrastructure utilization through overbooking

          Recommendations

          Comments

          Login options

          Check if you have access through your login credentials or your institution to get full access on this article.

          Sign in
          • Published in

            cover image ACM Other conferences
            CAC '13: Proceedings of the 2013 ACM Cloud and Autonomic Computing Conference
            August 2013
            247 pages
            ISBN:9781450321723
            DOI:10.1145/2494621

            Copyright © 2013 ACM

            Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

            Publisher

            Association for Computing Machinery

            New York, NY, United States

            Publication History

            • Published: 9 August 2013

            Permissions

            Request permissions about this article.

            Request Permissions

            Check for updates

            Qualifiers

            • research-article

          PDF Format

          View or Download as a PDF file.

          PDF

          eReader

          View online with eReader.

          eReader