skip to main content
article

The most common queueing theory questions asked by computer systems practitioners

Published:06 June 2022Publication History
Skip Abstract Section

Abstract

This document examines five performance questions which are repeatedly asked by practitioners in industry: (i) My system utilization is very low, so why are job delays so high? (ii) What should I do to lower job delays? (iii) How can I favor short jobs if I don't know which jobs are short? (iv) If some jobs are more important than others, how do I negotiate importance versus size? (v) How do answers change when dealing with a closed-loop system, rather than an open system? All these questions have simple answers through queueing theory. This short paper elaborates on the questions and their answers. To keep things readable, our tone is purposely informal throughout. For more formal statements of these questions and answers, please see [14].

References

  1. Samuli Aalto and Urtzi Ayesta. Optimal scheduling of jobs with a DHR tail in the M/G/1 queue. In VALUETOOLS 2008, October 2008.Google ScholarGoogle ScholarDigital LibraryDigital Library
  2. Samuli Aalto, Urtzi Ayesta, and Rhonda Righter. On the Gittins index in the M/G/1 queue. Queueing 6 Performance Evaluation Review, Vol. 49, No. 4, March 2022 Systems, 63(1):437--458, 2009.Google ScholarGoogle Scholar
  3. Samuli Aalto, Urtzi Ayesta, and Rhonda Righter. Properties of the Gittins index with application to optimal scheduling. Probability in the Engineering and Informational Sciences, 25(3):269--288, 2011.Google ScholarGoogle ScholarDigital LibraryDigital Library
  4. Eitan Bachmat, Josu Doncel, and Hagit Sarfati. Analysis of the task assignment based on guessing size policy. Performance Evaluation, 142, 2020.Google ScholarGoogle Scholar
  5. Nikhil Bansal and Mor Harchol-Balter. Analysis of SRPT scheduling: Investigating unfairness. In Proceedings of ACM SIGMETRICS, pages 279--290, Cambridge, MA, June 2001.Google ScholarGoogle ScholarDigital LibraryDigital Library
  6. D.R. Cox and W.L. Smith. Queues. Kluwer Academic Publishers, 1971.Google ScholarGoogle ScholarCross RefCross Ref
  7. Mark Crovella, Mor Harchol-Balter, and Cristina Murta. Task assignment in a distributed system: Improving performance by unbalancing load. In Proceedings of the ACM SIGMETRICS Joint International Conference on Measurement and Modeling of Computer Systems, pages 268--269, June 1998. Poster Session.Google ScholarGoogle Scholar
  8. Anshul Gandhi, Sherwin Doroudi, Mor Harchol-Balter, and Alan Scheller-Wolf. Exact analysis of the M/M/k/setup class of Markov chains via Recursive Renewal Reward. In ACM SIGMETRICS 2013 Conference on Measurement and Modeling of Computer Systems, pages 153--166, 2013.Google ScholarGoogle ScholarDigital LibraryDigital Library
  9. Anshul Gandhi and Mor Harchol-Balter. How data center size impacts the effectiveness of dynamic power management. In 49th Annual Allerton Conference on Communication, Control, and Computing, pages 1164--1169, Urbana-Champaign, IL, September 2011.Google ScholarGoogle ScholarCross RefCross Ref
  10. John C. Gittins, Kevin D. Glazebrook, and Richard Weber. Multi-armed Bandit Allocation Indices. John Wiley & Sons, 2011.Google ScholarGoogle ScholarCross RefCross Ref
  11. Isaac Grosof, Ziv Scully, and Mor Harchol-Balter. SRPT for multiserver systems. Performance Evaluation, 127:154--175, November 2018.Google ScholarGoogle ScholarCross RefCross Ref
  12. Mor Harchol-Balter. Task assignment with unknown duration. Journal of the ACM, 49(2):260--288, March 2002.Google ScholarGoogle ScholarDigital LibraryDigital Library
  13. Mor Harchol-Balter. Queueing disciplines. Wiley Encyclopedia of Operations Research and Management Science, 2011.Google ScholarGoogle ScholarCross RefCross Ref
  14. Mor Harchol-Balter. Performance Modeling and Design of Computer Systems: Queueing Theory in Action. Cambridge University Press, 2013.Google ScholarGoogle ScholarCross RefCross Ref
  15. Mor Harchol-Balter, Mark Crovella, and Cristina Murta. On choosing a task assignment policy for a distributed server system. In Lecture Notes in Computer Science, No. 1469: 10th International Conference on Modeling Techniques and Tools for Computer Performance Evaluation, pages 231--242, September 1998.Google ScholarGoogle Scholar
  16. Mor Harchol-Balter, Alan Scheller-Wolf, and Andrew Young. Surprising results on task assignment in server farms with high-variability workloads. In ACM SIGMETRICS 2009 Conference on Measurement and Modeling of Computer Systems, pages 287--0298, 2009.Google ScholarGoogle ScholarDigital LibraryDigital Library
  17. Mor Harchol-Balter, Alan Scheller-Wolf, and Andrew Young. Why segregating short jobs from long jobs under high variability is not always a win. In Forty-seventh Annual Allerton Conference on Communication, Control, and Computing, pages 121--127, University of Illinois at Urbana-Champaign, October 2009.Google ScholarGoogle ScholarDigital LibraryDigital Library
  18. Mor Harchol-Balter, Bianca Schroeder, Nikhil Bansal, and Mukesh Agrawal. Size-based scheduling to improve web performance. ACM Transactions on Computer Systems, 21(2):207--233, May 2003.Google ScholarGoogle ScholarDigital LibraryDigital Library
  19. Mor Harchol-Balter and Rein Vesilo. To balance or unbalance load in size-interval task allocation. Probability in the Engineering and Informational Sciences, 24:219--244, 2010.Google ScholarGoogle ScholarDigital LibraryDigital Library
  20. J.F.C. Kingman. Two similar queues in parallel. Biometrika, 48:1316--1323, 1961.Google ScholarGoogle Scholar
  21. Behnam Montazeri, Yilong Li, Mohammad Alizadeh, and John K. Ousterhout. Homa: A receiver-driven low-latency transport protocol using network priorities. In Proceedings of SIGCOMM 2018, pages 221--235. ACM SIGCOMM, 2018.Google ScholarGoogle ScholarDigital LibraryDigital Library
  22. Alan Scheller-Wolf and Rein Vesilo. Structural interpretation and derivation of necessary and sufficient conditions for delay moments in fifo multiserver queues. Queueing Systems, 54(3):221--232, 2006.Google ScholarGoogle ScholarDigital LibraryDigital Library
  23. Bianca Schroeder, Adam Wierman, and Mor Harchol-Balter. Open versus closed: a cautionary tale. In Proceedings of Networked Systems Design and Implementation (NSDI), 2006.Google ScholarGoogle Scholar
  24. Ziv Scully, Isaac Grosof, and Mor Harchol-Balter. The Gittins policy is nearly optimal in the M/G/k under extremely general conditions. Proceedings of ACM on Measurement and Analysis of Computer Systems (POMACS/SIGMETRICS), 4(3):1--29, 2020. Article 43.Google ScholarGoogle ScholarDigital LibraryDigital Library
  25. Ziv Scully, Isaac Grosof, and Mor Harchol-Balter. Optimal multiserver scheduling with unknown job sizes in heavy traffic. In 38th International Symposium on Computer Performance, Modeling, Measurement, and Evaluation (IFIP PERFORMANCE 2020), Milan, Italy, November 2020.Google ScholarGoogle ScholarDigital LibraryDigital Library
  26. Ziv Scully and Mor Harchol-Balter. The Gittins policy in the M/G/1 queue. In 19th International Symposium on Modeling and Optimization in Mobile, Ad hoc, and Wireless Networks (WiOpt '21), Philadelphia, PA, October 2021.Google ScholarGoogle ScholarCross RefCross Ref
  27. Ziv Scully and Mor Harchol-Balter. How to schedule near-optimally under real-world constraints. arXiv, 2021.Google ScholarGoogle Scholar
  28. Ziv Scully, Mor Harchol-Balter, and Alan Scheller-Wolf. SOAP: One clean analysis of all age-based scheduling policies. Proceedings of ACM on Measurement and Analysis of Computer Systems (POMACS/SIGMETRICS), 2(1):1--30, 2018. Article 16.Google ScholarGoogle Scholar
  29. Muhammad Tirmazi, Adam Barker, Nan Deng, MD E. Haque, Zhijing Gene Qin, Steven Hand, Mor Harchol-Balter, and John Wilkes. Borg: The next generation. In Proceedings of the Fifteenth European Conference on Computer Systems (EuroSys '20), pages 1--14, Greece, April 2020.Google ScholarGoogle ScholarDigital LibraryDigital Library
  30. Adam Wierman and Mor Harchol-Balter. Classifying scheduling policies with respect to unfairness in an M/GI/1. In Proceedings of ACM SIGMETRICS, pages 238--249, San Diego, CA, June 2003.Google ScholarGoogle ScholarDigital LibraryDigital Library

Index Terms

  1. The most common queueing theory questions asked by computer systems practitioners
            Index terms have been assigned to the content through auto-classification.

            Recommendations

            Comments

            Login options

            Check if you have access through your login credentials or your institution to get full access on this article.

            Sign in

            Full Access

            • Published in

              cover image ACM SIGMETRICS Performance Evaluation Review
              ACM SIGMETRICS Performance Evaluation Review  Volume 49, Issue 4
              March 2022
              130 pages
              ISSN:0163-5999
              DOI:10.1145/3543146
              Issue’s Table of Contents

              Copyright © 2022 Copyright is held by the owner/author(s)

              Permission to make digital or hard copies of part or all of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for third-party components of this work must be honored. For all other uses, contact the Owner/Author.

              Publisher

              Association for Computing Machinery

              New York, NY, United States

              Publication History

              • Published: 6 June 2022

              Check for updates

              Qualifiers

              • article
            • Article Metrics

              • Downloads (Last 12 months)39
              • Downloads (Last 6 weeks)11

              Other Metrics

            PDF Format

            View or Download as a PDF file.

            PDF

            eReader

            View online with eReader.

            eReader