Abstract
We observe and analyze usage of the login nodes of the leadership class Summit supercomputer from the perspective of an ordinary user—not a system administrator—by periodically sampling user activities (job queues, running processes, etc.) for two full years (2020–2021). Our findings unveil key usage patterns that evidence misuse of the system, including gaming the policies, impairing I/O performance, and using login nodes as a sole computing resource. Our analysis highlights observed patterns for the execution of complex computations (workflows), which are key for processing large-scale applications.
This manuscript has been authored by UT-Battelle, LLC, under contract DE-AC05-00OR22725 with the US Department of Energy (DOE). The publisher acknowledges the US government license to provide public access under the DOE Public Access Plan (http://energy.gov/downloads/doe-public-access-plan).
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
References
IP Geolocation API (2022). https://www.abstractapi.com/ip-geolocation-api
Ananthraj, V., et al.: Towards exascale computing for high energy physics: the atlas experience at ornl. In: 2018 IEEE 14th International Conference on e-Science (e-Science), pp. 341–342 (2018). https://doi.org/10.1109/eScience.2018.00086
Badia Sala, R.M., Ayguadé Parra, E., Labarta Mancho, J.J.: Workflows for science: a challenge when facing the convergence of HPC and big data. Supercomput. Front. Innov. 4(1), 27–47 (2017). https://doi.org/10.14529/jsfi170102
Bang, J., et al.: HPC workload characterization using feature selection and clustering. In: Proceedings of the 3rd International Workshop on Systems and Network Telemetry and Analytics, pp. 33–40 (2020). https://doi.org/10.1145/3391812.3396270
Casalino, L., et al.: AI-driven multiscale simulations illuminate mechanisms of SARS-CoV-2 spike dynamics. Int. J. High Perform. Comput. Appl. (2021). https://doi.org/10.1177/10943420211006452
Dongarra, J., Gottlieb, S., Kramer, W.T.: Race to exascale. Comput. Sci. Eng. 21(1) (2019). https://doi.org/10.1109/MCSE.2018.2882574
Feitelson, D.G.: Looking at data. In: 2008 IEEE International Symposium on Parallel and Distributed Processing, pp. 1–9 (2008). https://doi.org/10.1109/IPDPS.2008.4536092
Feng, J., Liu, G., Zhang, J., Zhang, Z., Yu, J., Zhang, Z.: Workload characterization and evolutionary analyses of Tianhe-1A supercomputer. In: Shi, Y., et al. (eds.) ICCS 2018. LNCS, vol. 10860, pp. 578–585. Springer, Cham (2018). https://doi.org/10.1007/978-3-319-93698-7_44
Liu, Z., et al.: Characterization and identification of HPC applications at leadership computing facility. In: Proceedings of the 34th ACM International Conference on Supercomputing, pp. 1–12 (2020). https://doi.org/10.1145/3392717.3392774
Lockwood, G.K., Snyder, S., Wang, T., Byna, S., Carns, P., Wright, N.J.: A year in the life of a parallel file system. In: Proceedings of the International Conference for High Performance Computing, Networking, Storage, and Analysis, pp. 931–943. IEEE Press (2018). https://doi.org/10.1109/SC.2018.00077
Nersc benchmarking and workload characterization (2021). https://www.nersc.gov/research-and-development/benchmarking-and-workload-characterization
Rodrigo, G.P., Östberg, P.O., Elmroth, E., Antypas, K., Gerber, R., Ramakrishnan, L.: Towards understanding HPC users and systems: a NERSC case study. J. Parallel Distrub. Comput. 111, 206–221 (2018). https://doi.org/10.1016/j.jpdc.2017.09.002
Schlagkamp, S., Ferreira da Silva, R., Deelman, E., Schwiegelshohn, U.: Understanding user behavior: from HPC to HTC. Procedia Comput. Sci. 80, 2241–2245 (2016). https://doi.org/10.1016/j.procs.2016.05.397. International Conference on Computational Science 2016, ICCS 2016
Ferreira da Silva, R., Filgueira, R., Pietri, I., Jiang, M., Sakellariou, R., Deelman, E.: A characterization of workflow management systems for extreme-scale applications. Fut. Gen. Comput. Syst. 75, 228–238 (2017). https://doi.org/10.1016/j.future.2017.02.026
Ferreira da Silva, R., et al.: Characterizing a high throughput computing workload: the compact muon solenoid (CMS) experiment at LHC. Procedia Comput. Sci. 51, 39–48 (2015). https://doi.org/10.1016/j.procs.2015.05.190, International Conference On Computational Science, ICCS 2015 Computational Science at the Gates of Nature
Top 500 (2021). https://www.top500.org
Van Der Spoel, D., Lindahl, E., Hess, B., Groenhof, G., Mark, A.E., Berendsen, H.J.: Gromacs: fast, flexible, and free. J. Comput. Chem. 26(16) (2005). GROMACS: fast, flexible, and free
Vazhkudai, S.S., et al.: The design, deployment, and evaluation of the coral pre-exascale systems. In: SC18: International Conference for High Performance Computing, Networking, Storage and Analysis, pp. 661–672. IEEE (2018)
Wolter, N., McCracken, M.O., Snavely, A., Hochstein, L., Nakamura, T., Basili, V.: What’s working in HPC: investigating HPC user behavior and productivity. CTWatch Q. 2(4A), 9–17 (2006)
Zhang, S., Zhang, C., Yang, Q.: Data preparation for data mining. Appl. Artif. Intell. 17(5–6) (2003). https://doi.org/10.1080/713827180
Acknowledgments
This research used resources of the Oak Ridge Leadership Computing Facility at the Oak Ridge National Laboratory, which is supported by the Office of Science of the U.S. Department of Energy under Contract No. DE-AC05-00OR22725. We acknowledge Suzanne Parete-Koon for early brainstorming of some of the ideas presented here. We thank Scott Atchley, Bronson Messer, and Sarp Oral for their thorough revision of this paper.
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2022 The Author(s), under exclusive license to Springer Nature Switzerland AG
About this paper
Cite this paper
Wilkinson, S.R., Maheshwari, K., Silva, R.F.d. (2022). Unveiling User Behavior on Summit Login Nodes as a User. In: Groen, D., de Mulatier, C., Paszynski, M., Krzhizhanovskaya, V.V., Dongarra, J.J., Sloot, P.M.A. (eds) Computational Science – ICCS 2022. ICCS 2022. Lecture Notes in Computer Science, vol 13350. Springer, Cham. https://doi.org/10.1007/978-3-031-08751-6_37
Download citation
DOI: https://doi.org/10.1007/978-3-031-08751-6_37
Published:
Publisher Name: Springer, Cham
Print ISBN: 978-3-031-08750-9
Online ISBN: 978-3-031-08751-6
eBook Packages: Computer ScienceComputer Science (R0)