skip to main content
10.1145/3205289.3205322acmconferencesArticle/Chapter ViewAbstractPublication PagesicsConference Proceedingsconference-collections
research-article
Public Access

IRIS: I/O Redirection via Integrated Storage

Authors Info & Claims
Published:12 June 2018Publication History

ABSTRACT

There is an ocean of available storage solutions in modern high-performance and distributed systems. These solutions consist of Parallel File Systems (PFS) for the more traditional high-performance computing (HPC) systems and of Object Stores for emerging cloud environments. More of ten than not, these storage solutions are tied to specific APIs and data models and thus, bind developers, applications, and entire computing facilities to using certain interfaces. Each storage system is designed and optimized for certain applications but does not perform well for others. Furthermore, modern applications have become more and more complex consisting of a collection of phases with different computation and I/O requirements. In this paper, we propose a unified storage access system, called IRIS (i.e., I/O Redirection via Integrated Storage). IRIS enables unified data access and seamlessly bridges the semantic gap between file systems and object stores. With IRIS, emerging High-Performance Data Analytics software has capable and diverse I/O support. IRIS can bring us closer to the convergence of HPC and Cloud environments by combining the best storage subsystems from both worlds. Experimental results show that IRIS can grant more than 7x improvement in performance than existing solutions.

References

  1. Amazon Inc. 2017. Amazon S3. (2017). http://docs.aws.amazon.com/AmazonS3/latest/API/Welcome.html.Google ScholarGoogle Scholar
  2. Apache Software Foundation. 2017. Bigtop software collection. (2017). http://bigtop.apache.org/.Google ScholarGoogle Scholar
  3. David John Bonnie. 2015. MarFS-Scalable POSIX on Object File System. Technical Report. Los Alamos National Lab.(LANL), Los Alamos, NM (United States).Google ScholarGoogle Scholar
  4. Peter J Braam et al. 2014. The Lustre storage architecture. (2014). ftp://ftp.uni-duisburg.de/linux/filesys/Lustre/lustre.pdf.Google ScholarGoogle Scholar
  5. Gorda Brent. 2015. DAOS: An Architecture for Exascale Storage. (2015). http://storageconference.us/2015/Presentations/Gorda.pdf.Google ScholarGoogle Scholar
  6. George H Bryan and J Michael Fritsch. 2002. A benchmark simulation for moist nonhydrostatic numerical models. Monthly Weather Review 130, 12 (2002), 2917--2928.Google ScholarGoogle ScholarCross RefCross Ref
  7. Lawrence Bryan. 2017. The UK JASMIN Environmental Data Commons. (2017). https://wr.informatik.uni-hamburg.de/_media/events/2017/iodc-17-lawerence.pdf.Google ScholarGoogle Scholar
  8. Philip Carns, Sam Lang, Robert Ross, Murali Vilayannur, Julian Kunkel, and Thomas Ludwig. 2009. Small-file access in parallel file systems. In Parallel & Distributed Processing, 2009. IPDPS 2009. IEEE Symposium on. IEEE, Rome, Italy, 1--11. Google ScholarGoogle ScholarDigital LibraryDigital Library
  9. Chameleon.org. 2017. Chameleon system. (2017). htps://www.chameleoncloud.org/about/chameleon/.Google ScholarGoogle Scholar
  10. Steve Conway and Chirag Dekate. 2014. High-Performance Data Analysis: HPC Meets Big Data. Technical Report. IDC.Google ScholarGoogle Scholar
  11. Jeffrey Dean and Sanjay Ghemawat. 2008. MapReduce: simplified data processing on large clusters. Commun. ACM 51, 1 (2008), 107--113. Google ScholarGoogle ScholarDigital LibraryDigital Library
  12. Jack Dongarra et al. 2011. The International Exascale Software Project roadmap. International Journal of High Performance Computing Applications 25, 1 (2011), 3--60. Google ScholarGoogle ScholarDigital LibraryDigital Library
  13. Robert Escriva, Bernard Wong, and Emin Gün Sirer. 2012. HyperDex: A distributed, searchable key-value store. Acm sigcomm computer communication review 42, 4 (2012), 25--36. Google ScholarGoogle ScholarDigital LibraryDigital Library
  14. Mike Folk, Albert Cheng, and Kim Yates. 1999. HDF5: A file format and I/O library for high performance computing applications. In Proceedings of the International Conference for High Performance Computing, Networks, Storage and Analysis (Supercomputing), Vol. 99. ACM, Portland, OR, 5--33.Google ScholarGoogle Scholar
  15. Bryan George. 2017. UCAR CM1 atmospheric simulation. (2017). http://www2.mmm.ucar.edu/people/bryan/cm1/.Google ScholarGoogle Scholar
  16. Google Inc. 2017. LevelDB. (2017). htps://github.com/google/leveldb.Google ScholarGoogle Scholar
  17. John L Hennessy and David A Patterson. 2011. Computer architecture: a quantitative approach. Elsevier, New York, NY. Google ScholarGoogle ScholarDigital LibraryDigital Library
  18. Tony Hey, Stewart Tansley, Kristin M Tolle, et al. 2009. The fourth paradigm: data-intensive scientific discovery. Vol. 1. Microsoft Research, Redmond, WA.Google ScholarGoogle Scholar
  19. High Performance Data Division Intel® Enterprise Edition for Lustre Software. 2014. WHITE PAPER Big Data Meets High Performance Computing. Technical Report. Intel. https://goo.gl/GLZrRH.Google ScholarGoogle Scholar
  20. Earl Joseph and Steve Conway. 2014. IDC Update on How Big Data Is Redefining High Performance Computing. Technical Report. IDC.Google ScholarGoogle Scholar
  21. Anthony Kougkas, Hariharan Devarajan, and Xian-He Sun. 2017. Enosis: Bridging the Semantic Gap between File-based and Object-based Data Models. In Data-Intensive Computing in the Clouds(Datacloud'17), 8th International Workshop on. ACM SIGHPC, Denver,CO.Google ScholarGoogle Scholar
  22. Anthony Kougkas, Hariharan Devarajan, and Xian-He Sun. 2017. Syndesis: Mapping Objects to Files for a Unified Data Access System. In Many-Task Computing on Clouds, Grids, and Supercomputers(MTAGS'17), 9th International Workshop on. ACM SIGHPC, Denver, CO.Google ScholarGoogle Scholar
  23. Anthony Kougkas, Hassan Eslami, Xian-He Sun, Rajeev Thakur, and William Gropp. 2017. Rethinking key--value store for parallel I/O optimization. The International Journal of High Performance Computing Applications 31, 4 (2017), 335--356. Google ScholarGoogle ScholarDigital LibraryDigital Library
  24. Avinash Lakshman and Prashant Malik. 2010. Cassandra. ACM SIGOPS Operating Systems Review 44, 2 (2010), 35. Google ScholarGoogle ScholarDigital LibraryDigital Library
  25. Haoyuan Li, Ali Ghodsi, Matei Zaharia, Scott Shenker, and Ion Stoica. 2014. Tachyon: Reliable, memory speed storage for cluster computing frameworks. In Proceedings of the ACM Symposium on Cloud Computing. ACM, Seattle, WA, 1--15. Google ScholarGoogle ScholarDigital LibraryDigital Library
  26. Jianwei Li, Wei-keng Liao, Alok Choudhary, Robert Ross, Rajeev Thakur, William Gropp, Robert Latham, Andrew Siegel, Brad Gallagher, and Michael Zingale. 2003. Parallel netCDF: A high-performance scientific I/O interface. In Supercomputing, 2003 ACM/IEEE Conference. ACM/IEEE, Phoenix, AZ, 39--39. Google ScholarGoogle ScholarDigital LibraryDigital Library
  27. Los Alamos National Laboratory. 2017. Anonymous Scientiic Application. (2017). htp://institute.lanl.gov/data/tdata/.Google ScholarGoogle Scholar
  28. MongoDB Inc. 2017. MongoDB. (2017). htps://www.mongodb.com/white-papers.Google ScholarGoogle Scholar
  29. Montage. 2017. An Astronomical Image Mosaic Engine. (2017). htp://montage.ipac.caltech.edu/docs/m101tutorial.html.Google ScholarGoogle Scholar
  30. Monty, Taylor. 2017. OpenStack Object Storage (swit). (2017). htps://launchpad.net/swit.Google ScholarGoogle Scholar
  31. David Nagle, Denis Serenyi, and Abbie Matthews. 2004. The panasas activescale storage cluster: Delivering scalable high bandwidth storage. In Proceedings of the 2004 ACM/IEEE conference on Supercomputing. IEEE Computer Society, Pittsburgh, PA, 53. Google ScholarGoogle ScholarDigital LibraryDigital Library
  32. Bogdan Nicolae, Diana Moise, Gabriel Antoniu, Luc Bougé, and Matthieu Dorier. 2010. BlobSeer: Bringing high throughput under heavy concurrency to Hadoop MapReduce applications. In Parallel & Distributed Processing (IPDPS), 2010 IEEE International Symposium on. IEEE, Atlanta, GA, 1--11.Google ScholarGoogle ScholarCross RefCross Ref
  33. Swapnil Patil and Garth A Gibson. 2011. Scale and Concurrency of GIGA+: File System Directories with Millions of Files.. In FAST, Vol. 11. ACM/Usenix, San Jose, CA, 13. Google ScholarGoogle ScholarDigital LibraryDigital Library
  34. Steve Plimpton. 1995. Fast parallel algorithms for short-range molecular dynamics. Journal of computational physics 117, 1 (1995), 1--19. Google ScholarGoogle ScholarDigital LibraryDigital Library
  35. Daniel A Reed and Jack Dongarra. 2015. Exascale computing and big data. Commun. ACM 58, 7 (2015), 56--68. Google ScholarGoogle ScholarDigital LibraryDigital Library
  36. Kai Ren, Qing Zheng, Swapnil Patil, and Garth Gibson. 2014. IndexFS: scaling file system metadata performance with stateless caching and bulk insertion. In SC14: International Conference for High Performance Computing, Networking, Storage and Analysis. IEEE,New Orleans, LA, 237--248. Google ScholarGoogle ScholarDigital LibraryDigital Library
  37. Robert B Ross, Rajeev Thakur, et al. 2000. PVFS: A parallel file system for Linux clusters. In Proceedings of the 4th annual Linux Showcase and Conference. Usenix, Atlanta, GA, 391--430. Google ScholarGoogle ScholarDigital LibraryDigital Library
  38. Sandia National Laboratories. 2017. LAMMPS. (2017). htp://lammps.sandia.gov/.Google ScholarGoogle Scholar
  39. Frank B Schmuck and Roger L Haskin. 2002. GPFS: A Shared-Disk File System for Large Computing Clusters. In Proceedings of the 1st USENIX Conference on File and Storage Technologies, Vol. 2. Usenix, Monterey, CA, 231--244. Google ScholarGoogle ScholarDigital LibraryDigital Library
  40. Wittawat Tantisiriroj, Seung Woo Son, Swapnil Patil, Samuel J Lang, Garth Gibson, and Robert B Ross. 2011. On the duality of data-intensive file system design: reconciling HDFS and PVFS. In Proceedings of 2011 International Conference for High Performance Computing, Networking, Storage and Analysis. ACM, Seattle, WA, 67. Google ScholarGoogle ScholarDigital LibraryDigital Library
  41. Timothy James Tautges, Corey Ernst, Clint Stimpson, Ray J Meyers, and Karl Merkley. 2004. MOAB: a mesh-oriented database. Technical Report. Sandia National Laboratories.Google ScholarGoogle Scholar
  42. Rajeev Thakur, William Gropp, and Ewing Lusk. 1999. Data sieving and collective I/O in ROMIO. In Frontiers of Massively Parallel Computation, 1999. Frontiers' 99. The Seventh Symposium on the. IEEE, Annapolis, Maryland, 182--189. Google ScholarGoogle ScholarDigital LibraryDigital Library
  43. Devesh Tiwari, Simona Boboila, Sudharshan S Vazhkudai, Youngjae Kim, Xiaosong Ma, Peter Desnoyers, and Yan Solihin. 2013. Active flash: towards energy-efficient, in-situ data analytics on extreme-scale machines.. In FAST. Usenix, San Jose, CA, 119--132. Google ScholarGoogle ScholarDigital LibraryDigital Library
  44. Murali Vilayannur, Partho Nath, and Anand Sivasubramaniam. 2005. Providing Tunable Consistency for a Parallel File Store.. In FAST, Vol. 5. Usenix, San Francisco, CA, 2--2. Google ScholarGoogle ScholarDigital LibraryDigital Library
  45. Feng Wang, Scott A Brandt, Ethan L Miller, and Darrell DE Long. 2004. OBFS: A File System for Object-Based Storage Devices.. In MSST, Vol. 4. IEEE Computer Society, Adelphi, MD, 283--300.Google ScholarGoogle Scholar
  46. Weather Research and Forecasting Model. 2017. WRF. (2017). htp://www.wrf-model.org/index.php.Google ScholarGoogle Scholar
  47. Sage A Weil, Scot A Brandt, Ethan L Miller, Darrell DE Long, and Carlos Maltzahn. 2006. Ceph: A scalable, high-performance distributed ile system. In Proceedings of the 7th symposium on Operating systems design and implementation. USENIX Association, Seatle, WA, United States, 307--320. Google ScholarGoogle ScholarDigital LibraryDigital Library
  48. Brent Welch and Garth A Gibson. 2004. Managing Scalability in Object Storage Systems for HPC Linux Clusters.. In MSST. IEEE, Adelphi, Maryland, USA, 433--445.Google ScholarGoogle Scholar
  49. Brent Welch, Marc Unangst, Zainul Abbasi, Garth A Gibson, Brian Mueller, Jason Small, Jim Zelenka, and Bin Zhou. 2008. Scalable Performance of the Panasas Parallel File System.. In FAST, Vol. 8. USENIX Association, San Jose, CA, 1--17. Google ScholarGoogle ScholarDigital LibraryDigital Library
  50. Yanlong Yin, Surendra Byna, Huaiming Song, Xian-He Sun, and Rajeev Thakur. 2012. Boosting application-specific parallel i/o optimization using IOSIG. In Cluster, Cloud and Grid Computing (CCGrid), 2012 12th IEEE/ACM International Symposium on. IEEE, Otawa, Canada, 196--203. Google ScholarGoogle ScholarDigital LibraryDigital Library
  51. Matei Zaharia, Mosharaf Chowdhury, Tathagata Das, Ankur Dave, Justin Ma, Murphy McCauley, Michael J Franklin, Scot Shenker, and Ion Stoica. 2012. Resilient distributed datasets: A fault-tolerant abstraction for in-memory cluster computing. In Proceedings of the 9th USENIX conference on Networked Systems Design and Implementation. USENIX Association, San Jose, CA, 2--2. Google ScholarGoogle ScholarDigital LibraryDigital Library
  52. Dongfang Zhao, Zhao Zhang, Xiaobing Zhou, Tonglin Li, Ke Wang, Dries Kimpe, Philip Carns, Robert Ross, and Ioan Raicu. 2014. Fusionfs: Toward supporting data-intensive scientific applications on extreme-scale high-performance computing systems. In Big Data (Big Data), 2014 IEEE International Conference on. IEEE, Washington DC, 61--70.Google ScholarGoogle ScholarCross RefCross Ref
  53. Qing Zheng, Kai Ren, and Garth Gibson. 2014. BatchFS: scaling the file system control plane with client-funded metadata servers. In Proceedings of the 9th Parallel Data Storage Workshop. IEEE Press, New Orleans, LA, 1--6. Google ScholarGoogle ScholarDigital LibraryDigital Library
  54. Shujia Zhou, Bruce H Van Aartsen, and Thomas L Clune. 2008. A lightweight scalable I/O utility for optimizing High-End Computing applications. In Parallel and Distributed Processing, 2008. IPDPS 2008. IEEE International Symposium on. IEEE, Miami, FL, USA, 1--7.Google ScholarGoogle ScholarCross RefCross Ref

Index Terms

  1. IRIS: I/O Redirection via Integrated Storage

                Recommendations

                Comments

                Login options

                Check if you have access through your login credentials or your institution to get full access on this article.

                Sign in
                • Published in

                  cover image ACM Conferences
                  ICS '18: Proceedings of the 2018 International Conference on Supercomputing
                  June 2018
                  407 pages
                  ISBN:9781450357838
                  DOI:10.1145/3205289

                  Copyright © 2018 ACM

                  Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

                  Publisher

                  Association for Computing Machinery

                  New York, NY, United States

                  Publication History

                  • Published: 12 June 2018

                  Permissions

                  Request permissions about this article.

                  Request Permissions

                  Check for updates

                  Qualifiers

                  • research-article
                  • Research
                  • Refereed limited

                  Acceptance Rates

                  Overall Acceptance Rate584of2,055submissions,28%

                PDF Format

                View or Download as a PDF file.

                PDF

                eReader

                View online with eReader.

                eReader