Abstract
Performance of data reduction workflows at the High Flux Isotope Reactor (HFIR) and the Spallation Neutron Source (SNS) at Oak Ridge National Laboratory (ORNL) is mainly determined by the time spent loading raw measurement events stored in large and sparse datasets. This paper describes: (1) our long-term view to leverage SNS and HFIR data management needs with our experience at ORNL’s world-class high performance computing (HPC) facilities, and (2) our short-term efforts to speed up current workflows using Mantid, a data analysis and reduction community framework used across several neutron scattering facilities. We show that minimally invasive short-term improvements in metadata management have a moderate impact in speeding up current production workflows. We propose a more disruptive domain-specific solution: the No Cost Input Output (NCIO) framework, we provide an overview, the risks and challenges in NCIO’s adoption by HFIR and SNS stakeholders.
W. F. Godoy et al.—Contributed Equally.
This manuscript has been co-authored by UT-Battelle, LLC, under contract DE-AC05-00OR22725 with the US Department of Energy (DOE). The US government retains and the publisher, by accepting the article for publication, acknowledges that the US government retains a nonexclusive, paid-up, irrevocable, worldwide license to publish or reproduce the published form of this manuscript, or allow others to do so, for US government purposes. DOE will provide public access to these results of federally sponsored research in accordance with the DOE Public Access Plan (http://energy.gov/downloads/doe-public-access-plan).
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Change history
18 December 2020
In the originally published version of the chapter 12, the reference 21 contained a mistake in the name of the author. The author’s name in the reference was changed to S. Hahn.
References
Oak Ridge National Laboratory, Neutron Sciences. https://neutrons.ornl.gov/
Oak Ridge National Laboratory, Neutron Sciences Data Management. https://neutrons.ornl.gov/users/data-management
Donaldson, D.R., Martin, S., Proffen, T.: Understanding perspectives on sharing neutron data at Oak Ridge National Laboratory. Data Sci. J. 16, 35 (2017). https://doi.org/10.5334/dsj-2017-035
Campbell, S., Miller, S., Bilheux, J., Reuter, M., Peterson, P., Kohl, J., Trater, J., Vazhkudai, S., Lynch, V., Green, M.: The SNS and HFIR web portal system for SANS. J. Phys. Conf. Ser. 247, 012013 (2010). https://doi.org/10.1088/1742-6596/247/1/012013
Shipman, G., et al.: Accelerating data acquisition, reduction, and analysis at the spallation neutron source. In: 2014 IEEE 10th International Conference on e-Science, Sao Paulo, pp. 223–230 (2014). https://doi.org/10.1109/eScience.2014.31
Granroth, G.E., An, K., Smith, H.L., Whitfield, P., Neuefeind, J.C., Lee, J., Zhou, W., Sedov, V.N., Peterson, P.F., Parizzi, A., Skorpenske, H., Hartman, S.M., Huq, A., Abernathy, D.L.: Event-based processing of neutron scattering data at the Spallation Neutron Source. J. Appl. Cryst. 51, 616–62 (2018). https://doi.org/10.1107/S1600576718004727
Peterson, P.F., Campbell, S.I., Reuter, M.A., Taylor, R.J., Zikovsky, J.: Event-based processing of neutron scattering data. Nucl. Instrum. Methods Phys. Res. Sect. A: 803, 24–28 (2015)
Childs, H., Ahern, S.D., Ahrens, J., et al.: A terminology for in situ visualization and analysis systems. Int. J. High Perform. Comput. Appl. (2020). https://doi.org/10.1177/1094342020935991
Konnecke, M., Akeroyd, F.A., Bernstein, H.J., Brewster, A.S., Campbell, S.I., Clausen, B., Cottrell, S., Hoffmann, J.U., Jemian, P.R., Mannicke, D., Osborn, R., Peterson, P.F., Richter, T., Suzuki, J., Watts, B., Wintersberger, E., Wuttke, J.: J. Appl. Cryst. 48, 301–305 (2015)
The HDF Group. Hierarchical Data Format, version 5, 1997–2020. http://www.hdfgroup.org/HDF5/
Arnold, O., et al.: Mantid-Data analysis and visualization package for neutron scattering and \(\mu \) SR experiments, Nucl. Instrum. Methods Phys. Res. Sect. A: vol. 764, 156–166, ISSN 0168–9002 (2014). https://doi.org/10.1016/j.nima.2014.07.029
Oak Ridge Leadership Computing Facility, Summit supercomputer. https://www.olcf.ornl.gov/summit/
The Compute and Data Environment for Science (CADES). https://cades.ornl.gov/
Vetter, J.S., et al.: Extreme heterogeneity 2018 - productive computational science in the era of extreme heterogeneity. In: Report for DOE ASCR Workshop on Extreme Heterogeneity, United States. https://doi.org/10.2172/1473756
Sprinkle, J., Mernik, M., Tolvanen, J.P., Spinellis, D.: Guest editors’ introduction: What kinds of nails need a domain-specific hammer? IEEE Software 26(4), 15–18 (2009). https://doi.org/10.1109/MS.2009.92
Fowler, M.: A pedagogical framework for domain-specific languages. IEEE Softw. 26, 13–14 (2009). https://doi.org/10.1109/MS.2009.85
Stansberry, D., Somnath, S., Breet, J., Shutt, G., Shankar, M.: DataFed: towards reproducible research via federated data management. In: 2019 International Conference on Computational Science and Computational Intelligence (CSCI), Las Vegas, NV, USA, 2019, pp. 1312–1317 (2019). https://doi.org/10.1109/CSCI49370.2019.00245
Stevens, R., Taylor, V., Nichols, J., Maccabe, A.B., Yelick, K., Brown, D.: AI for Science. Technical report. https://doi.org/10.2172/1604756
Garcia-Cardona, C., Kannan, R., Johnston, T., Proffen, T., Page, K., Seal, S.K.: Learning to predict material structure from neutron scattering data. IEEE Int. Conf. Big Data (Big Data) 2019, 4490–4497 (2019)
Zhang, W., Byna, S., Niu, C., Chen, Y.: Exploring metadata search essentials for scientific data management. In: 2019 IEEE 26th International Conference on High Performance Computing, Data, and Analytics (HiPC), Hyderabad, India, 2019, pp. 83–92 (2019). https://doi.org/10.1109/HiPC.2019.00021
Godoy, W.F., Peterson, P., Hahn, S., Billings, J.J.: Efficient Data Management in Neutron Scattering Data Reduction Workflows at ORNL. In: International Workshop on Big Data Reduction held with 2020 IEEE International Conference on Big Data (accepted)
Godoy, W.F., Podhorszki, N., et al.: ADIOS 2: The Adaptable Input Output System. A framework for high-performance data management, SoftwareX, Volume 12, 2020, 100561, ISSN 2352-7110, https://doi.org/10.1016/j.softx.2020.100561
Gregg, B.: The Flame Graph. Queue 14, 2 (March-April 2016), 91–110 (2016). https://doi.org/10.1145/2927299.2927301
Zhao, J.K., Gao, C.Y., Liu, D.: The extended Q-range small-angle neutron scattering diffractometer at the SNS. J. Appl. Cryst. 43, 1068–1077 (2010)
Berry, K.D., et al.: Characterization of the neutron detector upgrade to the GP-SANS and Bio-SANS instruments at HFIR”, Nuclear Instruments and Methods in Physics Research Section A: Accelerators, Spectrometers, Detectors and Associated Equipment, Volume 693, 2012, Pages 179–185, ISSN 0168–9002, https://doi.org/10.1016/j.nima.2012.06.052
Huebl, A., et al.: openPMD 1.0.0: A meta data standard for particle and mesh based data (2015). https://doi.org/10.5281/zenodo.33624
Liu, J., et al.: Evaluation of HPC application i/o on object storage systems. In: 2018 IEEE/ACM 3rd International Workshop on Parallel Data Storage & Data Intensive Scalable Computing Systems (PDSW-DISCS), Dallas, TX, USA, pp. 24–34 (2018). https://doi.org/10.1109/PDSW-DISCS.2018.00005
Lofstead, J., Jimenez, I., Maltzahn, C., Koziol, Q., Bent, J., Barton, E.: DAOS and Friends: A Proposal for an Exascale Storage System. In: SC 2016: Proceedings of the International Conference for High Performance Computing, Networking, Storage and Analysis, Salt Lake City, UT, pp. 585–596 (2016). https://doi.org/10.1109/SC.2016.49
Acknowledgements
A portion of this research used resources at the SNS, a Department of Energy (DOE) Office of Science User Facility operated by ORNL. ORNL is managed by UT-Battelle LLC for DOE under Contract DE-AC05-00OR22725.
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2020 This is a U.S. government work and not under copyright protection in the U.S.; foreign copyright protection may apply
About this paper
Cite this paper
Godoy, W.F., Peterson, P.F., Hahn, S.E., Hetrick, J., Doucet, M., Billings, J.J. (2020). Performance Improvements on SNS and HFIR Instrument Data Reduction Workflows Using Mantid. In: Nichols, J., Verastegui, B., Maccabe, A.‘., Hernandez, O., Parete-Koon, S., Ahearn, T. (eds) Driving Scientific and Engineering Discoveries Through the Convergence of HPC, Big Data and AI. SMC 2020. Communications in Computer and Information Science, vol 1315. Springer, Cham. https://doi.org/10.1007/978-3-030-63393-6_12
Download citation
DOI: https://doi.org/10.1007/978-3-030-63393-6_12
Published:
Publisher Name: Springer, Cham
Print ISBN: 978-3-030-63392-9
Online ISBN: 978-3-030-63393-6
eBook Packages: Computer ScienceComputer Science (R0)