skip to main content
10.1145/3225058.3225113acmotherconferencesArticle/Chapter ViewAbstractPublication PagesicppConference Proceedingsconference-collections
research-article

Less Provisioning: A Fine-grained Resource Scaling Engine for Long-running Services with Tail Latency Guarantees

Authors Info & Claims
Published:13 August 2018Publication History

ABSTRACT

Modern resource management frameworks guarantee low tail latency for long-running services using the resource over-provisioning method, resulting in serious waste of resource and increasing the service costs greatly. To reduce the over-provisioning cost, we present EFRA, an elastic and fine-grained resource allocator that enables much more efficient resource provisioning while guaranteeing the tail latency Service Level Objective (SLO). EFRA achieves this through the cooperation of three key components running on a containerized platform: The period detector identifies the period features of the workload through a convolution-based time series analysis. The resource reservation component estimates the just-right amount of resources based on the period analysis through a top-K based collaborative filtering approach. The online reprovisioning component dynamically adjusts the resources for further enforcing the tail latency SLO. Testbed experiments show that EFRA is able to increase the average resource utilization to 43%, and save up to 66% resources while guaranteeing the same tail latency objective.

References

  1. 2017. Alibaba trace. (2017). https://github.com/alibaba/clusterdata.Google ScholarGoogle Scholar
  2. 2017. Docker platform. (2017). https://www.docker.com/docker-engine.Google ScholarGoogle Scholar
  3. 2017. The Internet Traffic Archive. (2017). http://ita.ee.lbl.gov/html/traces.html.Google ScholarGoogle Scholar
  4. Omer Adam, Young Choon Lee, et al. 2017. Stochastic Resource Provisioning for Containerized Multi-Tier Web Services in Clouds. IEEE Transactions on Parallel and Distributed Systems 28, 7 (2017), 2060--2073.Google ScholarGoogle ScholarDigital LibraryDigital Library
  5. Omer Y. Adam, Young Choon Lee, et al. 2016. Constructing Performance-Predictable Clusters with Performance-Varying Resources of Clouds. IEEE Trans. Comput. 65, 9 (Sept. 2016), 2709--2724. Google ScholarGoogle ScholarDigital LibraryDigital Library
  6. M. J. Atallah, F. Chyzak, et al. 2001. A Randomized Algorithm for Approximate String Matching. Algorithmica 29, 3 (2001), 468--486. Google ScholarGoogle ScholarDigital LibraryDigital Library
  7. Berk Atikoglu, Yuehai Xu, et al. 2012. Workload Analysis of a Large-scale Key-value Store. In SIGMETRICS '12. ACM, New York, NY, USA, 53--64. Google ScholarGoogle ScholarDigital LibraryDigital Library
  8. Christian Bliek1ú, Pierre Bonami, et al. 2014. Solving mixed-integer quadratic programming problems with IBM-CPLEX: a progress report. In Proceedings of the Twenty-Sixth RAMP Symposium. 171--180.Google ScholarGoogle Scholar
  9. John S. Breese, David Heckerman, et al. 1998. Empirical Analysis of Predictive Algorithms for Collaborative Filtering. In UAI'98. Morgan Kaufmann Publishers Inc., San Francisco, CA, USA, 43--52. Google ScholarGoogle ScholarDigital LibraryDigital Library
  10. Ludmila Cherkasova. 2011. Performance Modeling in Mapreduce Environments: Challenges and Opportunities. In ICPE '11. ACM, New York, NY, USA, 5--6. Google ScholarGoogle ScholarDigital LibraryDigital Library
  11. Eli Cortez, Anand Bonde, et al. 2017. Resource Central: Understanding and Predicting Workloads for Improved Resource Management in Large Cloud Platforms. In SOSP '17. ACM, New York, NY, USA, 153--167. Google ScholarGoogle ScholarDigital LibraryDigital Library
  12. Carlo Curino, Djellel E. Difallah, et al. 2014. Reservation-based Scheduling: If You'Re Late Don'T Blame Us!. In SOCC '14. ACM, New York, NY, USA, Article 2, 14 pages. Google ScholarGoogle ScholarDigital LibraryDigital Library
  13. Christina Delimitrou and Christos Kozyrakis. 2014. Quasar: Resource-efficient and QoS-aware Cluster Management. In ASPLOS '14. ACM, New York, NY, USA, 127--144. Google ScholarGoogle ScholarDigital LibraryDigital Library
  14. Christina Delimitrou and Christos Kozyrakis. 2016. HCloud: Resource-Efficient Provisioning in Shared Cloud Systems. In ASPLOS '16. ACM, New York, NY, USA, 473--488. Google ScholarGoogle ScholarDigital LibraryDigital Library
  15. Mohamed G. Elfeky, Walid G. Aref, et al. 2005. Periodicity Detection in Time Series Databases. IEEE Trans. on Knowl. and Data Eng. 17, 7 (July 2005), 875--887. Google ScholarGoogle ScholarDigital LibraryDigital Library
  16. Anshul Gandhi, Yuan Chen, et al. 2011. Minimizing data center SLA violations and power consumption via hybrid resource provisioning. In Green Computing Conference and Workshops. 1--8. Google ScholarGoogle ScholarDigital LibraryDigital Library
  17. D Gmach, J Rolia, L Cherkasova, et al. 2009. An integrated approach to resource pool management: Policies, efficiency and quality metrics. In IEEE International Conference on Dependable Systems and Networks with Ftcs and DCC. 326--335.Google ScholarGoogle Scholar
  18. Zhenhuan Gong, Xiaohui Gu, et al. 2010. PRESS: PRedictive Elastic ReSource Scaling for cloud systems. In International Conference on Network and Service Management. 9--16.Google ScholarGoogle Scholar
  19. Benjamin Hindman, Andy Konwinski, et al. 2011. Mesos: A Platform for Finegrained Resource Sharing in the Data Center. In NSDI'11. USENIX Association, Berkeley, CA, USA, 295--308. Google ScholarGoogle ScholarDigital LibraryDigital Library
  20. Sangeetha Abdu Jyothi, Carlo Curino, et al. 2016. Morpheus: Towards Automated SLOs for Enterprise Clusters. In OSDI'16. USENIX Association, Berkeley, CA, USA, 117--134. Google ScholarGoogle ScholarDigital LibraryDigital Library
  21. Palden Lama and Xiaobo Zhou. 2012. AROMA: Automated Resource Allocation and Configuration of Mapreduce Environment in the Cloud. In ICAC '12. ACM, New York, NY, USA, 63--72. Google ScholarGoogle ScholarDigital LibraryDigital Library
  22. Richard J Larsen and Morris L Marx. 1981. An introduction to mathematical statistics and its applications. Prentice-Hall. 2061--2071 pages.Google ScholarGoogle Scholar
  23. Jessica Lin, Eamonn Keogh, et al. 2003. A Symbolic Representation of Time Series, with Implications for Streaming Algorithms. In DMKD '03. ACM, New York, NY, USA, 2--11. Google ScholarGoogle ScholarDigital LibraryDigital Library
  24. Haikun Liu and Bingsheng He. 2014. Reciprocal Resource Fairness: Towards Cooperative Multiple-resource Fair Sharing in IaaS Clouds. In SC '14. IEEE Press, Piscataway, NJ, USA, 970--981. Google ScholarGoogle ScholarDigital LibraryDigital Library
  25. David Lo, Liqun Cheng, et al. 2016. Improving Resource Efficiency at Scale with Heracles. ACM Trans. Comput. Syst. 34, 2, Article 6 (May 2016), 33 pages. Google ScholarGoogle ScholarDigital LibraryDigital Library
  26. Richard Mccreadie, Ian Soboroff, et al. 2012. On building a reusable Twitter corpus. In International ACM SIGIR conference on research and development in Information Retrieval. 1113--1114. Google ScholarGoogle ScholarDigital LibraryDigital Library
  27. Malte Schwarzkopf, Andy Konwinski, et al. 2013. Omega: Flexible, Scalable Schedulers for Large Compute Clusters. In EuroSys '13. ACM, New York, NY, USA, 351--364. Google ScholarGoogle ScholarDigital LibraryDigital Library
  28. Zhiming Shen, Sethuraman Subbiah, et al. 2011. CloudScale: Elastic Resource Scaling for Multi-tenant Cloud Systems. In SOCC '11. ACM, New York, NY, USA, Article 5, 14 pages. Google ScholarGoogle ScholarDigital LibraryDigital Library
  29. Sethuraman Subbiah, John Wilkes, et al. 2013. AGILE: Elastic Distributed Resource Scaling for Infrastructure-as-a-Service. In International Conference on Autonomic Computing.Google ScholarGoogle Scholar
  30. Alexey Tumanov, Timothy Zhu, et al. 2016. TetriSched: Global Rescheduling with Adaptive Plan-ahead in Dynamic Heterogeneous Clusters. In EuroSys '16. ACM, New York, NY, USA, Article 35, 16 pages. Google ScholarGoogle ScholarDigital LibraryDigital Library
  31. Guido Urdaneta, Guillaume Pierre, et al. 2009. Wikipedia Workload Analysis for Decentralized Hosting. Comput. Netw. 53, 11 (July 2009), 1830--1845. Google ScholarGoogle ScholarDigital LibraryDigital Library
  32. Vinod Kumar Vavilapalli, Arun C. Murthy, et al. 2013. Apache Hadoop YARN: Yet Another Resource Negotiator. In SOCC '13. ACM, New York, NY, USA, Article 5, 16 pages. Google ScholarGoogle ScholarDigital LibraryDigital Library
  33. Abhishek Verma, Ludmila Cherkasova, et al. 2011. ARIA: Automatic Resource Inference and Allocation for Mapreduce Environments. In ICAC '11. ACM, New York, NY, USA, 235--244. Google ScholarGoogle ScholarDigital LibraryDigital Library
  34. Abhishek Verma, Luis Pedrosa, et al. 2015. Large-scale Cluster Management at Google with Borg. In EuroSys '15. ACM, New York, NY, USA, Article 18, 17 pages. Google ScholarGoogle ScholarDigital LibraryDigital Library
  35. Yunqi Zhang, George Prekas, et al. 2016. History-based Harvesting of Spare Cycles and Storage in Large-scale Datacenters. In OSDI '16. USENIX Association, Berkeley, CA, USA, 755--770. Google ScholarGoogle ScholarDigital LibraryDigital Library
  36. Timothy Zhu, Daniel S. Berger, et al. 2016. SNC-Meister: Admitting More Tenants with Tail Latency SLOs. In SoCC '16. ACM, New York, NY, USA, 374--387. Google ScholarGoogle ScholarDigital LibraryDigital Library

Recommendations

Comments

Login options

Check if you have access through your login credentials or your institution to get full access on this article.

Sign in
  • Published in

    cover image ACM Other conferences
    ICPP '18: Proceedings of the 47th International Conference on Parallel Processing
    August 2018
    945 pages
    ISBN:9781450365109
    DOI:10.1145/3225058

    Copyright © 2018 ACM

    Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

    Publisher

    Association for Computing Machinery

    New York, NY, United States

    Publication History

    • Published: 13 August 2018

    Permissions

    Request permissions about this article.

    Request Permissions

    Check for updates

    Qualifiers

    • research-article
    • Research
    • Refereed limited

    Acceptance Rates

    ICPP '18 Paper Acceptance Rate91of313submissions,29%Overall Acceptance Rate91of313submissions,29%

PDF Format

View or Download as a PDF file.

PDF

eReader

View online with eReader.

eReader