Skip to main content

Unveiling User Behavior on Summit Login Nodes as a User

  • Conference paper
  • First Online:
Computational Science – ICCS 2022 (ICCS 2022)

Abstract

We observe and analyze usage of the login nodes of the leadership class Summit supercomputer from the perspective of an ordinary user—not a system administrator—by periodically sampling user activities (job queues, running processes, etc.) for two full years (2020–2021). Our findings unveil key usage patterns that evidence misuse of the system, including gaming the policies, impairing I/O performance, and using login nodes as a sole computing resource. Our analysis highlights observed patterns for the execution of complex computations (workflows), which are key for processing large-scale applications.

This manuscript has been authored by UT-Battelle, LLC, under contract DE-AC05-00OR22725 with the US Department of Energy (DOE). The publisher acknowledges the US government license to provide public access under the DOE Public Access Plan (http://energy.gov/downloads/doe-public-access-plan).

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 89.00
Price excludes VAT (USA)
  • Available as EPUB and PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 119.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

References

  1. IP Geolocation API (2022). https://www.abstractapi.com/ip-geolocation-api

  2. Ananthraj, V., et al.: Towards exascale computing for high energy physics: the atlas experience at ornl. In: 2018 IEEE 14th International Conference on e-Science (e-Science), pp. 341–342 (2018). https://doi.org/10.1109/eScience.2018.00086

  3. Badia Sala, R.M., Ayguadé Parra, E., Labarta Mancho, J.J.: Workflows for science: a challenge when facing the convergence of HPC and big data. Supercomput. Front. Innov. 4(1), 27–47 (2017). https://doi.org/10.14529/jsfi170102

  4. Bang, J., et al.: HPC workload characterization using feature selection and clustering. In: Proceedings of the 3rd International Workshop on Systems and Network Telemetry and Analytics, pp. 33–40 (2020). https://doi.org/10.1145/3391812.3396270

  5. Casalino, L., et al.: AI-driven multiscale simulations illuminate mechanisms of SARS-CoV-2 spike dynamics. Int. J. High Perform. Comput. Appl. (2021). https://doi.org/10.1177/10943420211006452

    Article  Google Scholar 

  6. Dongarra, J., Gottlieb, S., Kramer, W.T.: Race to exascale. Comput. Sci. Eng. 21(1) (2019). https://doi.org/10.1109/MCSE.2018.2882574

  7. Feitelson, D.G.: Looking at data. In: 2008 IEEE International Symposium on Parallel and Distributed Processing, pp. 1–9 (2008). https://doi.org/10.1109/IPDPS.2008.4536092

  8. Feng, J., Liu, G., Zhang, J., Zhang, Z., Yu, J., Zhang, Z.: Workload characterization and evolutionary analyses of Tianhe-1A supercomputer. In: Shi, Y., et al. (eds.) ICCS 2018. LNCS, vol. 10860, pp. 578–585. Springer, Cham (2018). https://doi.org/10.1007/978-3-319-93698-7_44

    Chapter  Google Scholar 

  9. Liu, Z., et al.: Characterization and identification of HPC applications at leadership computing facility. In: Proceedings of the 34th ACM International Conference on Supercomputing, pp. 1–12 (2020). https://doi.org/10.1145/3392717.3392774

  10. Lockwood, G.K., Snyder, S., Wang, T., Byna, S., Carns, P., Wright, N.J.: A year in the life of a parallel file system. In: Proceedings of the International Conference for High Performance Computing, Networking, Storage, and Analysis, pp. 931–943. IEEE Press (2018). https://doi.org/10.1109/SC.2018.00077

  11. Nersc benchmarking and workload characterization (2021). https://www.nersc.gov/research-and-development/benchmarking-and-workload-characterization

  12. Rodrigo, G.P., Östberg, P.O., Elmroth, E., Antypas, K., Gerber, R., Ramakrishnan, L.: Towards understanding HPC users and systems: a NERSC case study. J. Parallel Distrub. Comput. 111, 206–221 (2018). https://doi.org/10.1016/j.jpdc.2017.09.002

  13. Schlagkamp, S., Ferreira da Silva, R., Deelman, E., Schwiegelshohn, U.: Understanding user behavior: from HPC to HTC. Procedia Comput. Sci. 80, 2241–2245 (2016). https://doi.org/10.1016/j.procs.2016.05.397. International Conference on Computational Science 2016, ICCS 2016

  14. Ferreira da Silva, R., Filgueira, R., Pietri, I., Jiang, M., Sakellariou, R., Deelman, E.: A characterization of workflow management systems for extreme-scale applications. Fut. Gen. Comput. Syst. 75, 228–238 (2017). https://doi.org/10.1016/j.future.2017.02.026

  15. Ferreira da Silva, R., et al.: Characterizing a high throughput computing workload: the compact muon solenoid (CMS) experiment at LHC. Procedia Comput. Sci. 51, 39–48 (2015). https://doi.org/10.1016/j.procs.2015.05.190, International Conference On Computational Science, ICCS 2015 Computational Science at the Gates of Nature

  16. Top 500 (2021). https://www.top500.org

  17. Van Der Spoel, D., Lindahl, E., Hess, B., Groenhof, G., Mark, A.E., Berendsen, H.J.: Gromacs: fast, flexible, and free. J. Comput. Chem. 26(16) (2005). GROMACS: fast, flexible, and free

    Google Scholar 

  18. Vazhkudai, S.S., et al.: The design, deployment, and evaluation of the coral pre-exascale systems. In: SC18: International Conference for High Performance Computing, Networking, Storage and Analysis, pp. 661–672. IEEE (2018)

    Google Scholar 

  19. Wolter, N., McCracken, M.O., Snavely, A., Hochstein, L., Nakamura, T., Basili, V.: What’s working in HPC: investigating HPC user behavior and productivity. CTWatch Q. 2(4A), 9–17 (2006)

    Google Scholar 

  20. Zhang, S., Zhang, C., Yang, Q.: Data preparation for data mining. Appl. Artif. Intell. 17(5–6) (2003). https://doi.org/10.1080/713827180

Download references

Acknowledgments

This research used resources of the Oak Ridge Leadership Computing Facility at the Oak Ridge National Laboratory, which is supported by the Office of Science of the U.S. Department of Energy under Contract No. DE-AC05-00OR22725. We acknowledge Suzanne Parete-Koon for early brainstorming of some of the ideas presented here. We thank Scott Atchley, Bronson Messer, and Sarp Oral for their thorough revision of this paper.

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Sean R. Wilkinson .

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2022 The Author(s), under exclusive license to Springer Nature Switzerland AG

About this paper

Check for updates. Verify currency and authenticity via CrossMark

Cite this paper

Wilkinson, S.R., Maheshwari, K., Silva, R.F.d. (2022). Unveiling User Behavior on Summit Login Nodes as a User. In: Groen, D., de Mulatier, C., Paszynski, M., Krzhizhanovskaya, V.V., Dongarra, J.J., Sloot, P.M.A. (eds) Computational Science – ICCS 2022. ICCS 2022. Lecture Notes in Computer Science, vol 13350. Springer, Cham. https://doi.org/10.1007/978-3-031-08751-6_37

Download citation

  • DOI: https://doi.org/10.1007/978-3-031-08751-6_37

  • Published:

  • Publisher Name: Springer, Cham

  • Print ISBN: 978-3-031-08750-9

  • Online ISBN: 978-3-031-08751-6

  • eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics