ABSTRACT
There is an ocean of available storage solutions in modern high-performance and distributed systems. These solutions consist of Parallel File Systems (PFS) for the more traditional high-performance computing (HPC) systems and of Object Stores for emerging cloud environments. More of ten than not, these storage solutions are tied to specific APIs and data models and thus, bind developers, applications, and entire computing facilities to using certain interfaces. Each storage system is designed and optimized for certain applications but does not perform well for others. Furthermore, modern applications have become more and more complex consisting of a collection of phases with different computation and I/O requirements. In this paper, we propose a unified storage access system, called IRIS (i.e., I/O Redirection via Integrated Storage). IRIS enables unified data access and seamlessly bridges the semantic gap between file systems and object stores. With IRIS, emerging High-Performance Data Analytics software has capable and diverse I/O support. IRIS can bring us closer to the convergence of HPC and Cloud environments by combining the best storage subsystems from both worlds. Experimental results show that IRIS can grant more than 7x improvement in performance than existing solutions.
- Amazon Inc. 2017. Amazon S3. (2017). http://docs.aws.amazon.com/AmazonS3/latest/API/Welcome.html.Google Scholar
- Apache Software Foundation. 2017. Bigtop software collection. (2017). http://bigtop.apache.org/.Google Scholar
- David John Bonnie. 2015. MarFS-Scalable POSIX on Object File System. Technical Report. Los Alamos National Lab.(LANL), Los Alamos, NM (United States).Google Scholar
- Peter J Braam et al. 2014. The Lustre storage architecture. (2014). ftp://ftp.uni-duisburg.de/linux/filesys/Lustre/lustre.pdf.Google Scholar
- Gorda Brent. 2015. DAOS: An Architecture for Exascale Storage. (2015). http://storageconference.us/2015/Presentations/Gorda.pdf.Google Scholar
- George H Bryan and J Michael Fritsch. 2002. A benchmark simulation for moist nonhydrostatic numerical models. Monthly Weather Review 130, 12 (2002), 2917--2928.Google ScholarCross Ref
- Lawrence Bryan. 2017. The UK JASMIN Environmental Data Commons. (2017). https://wr.informatik.uni-hamburg.de/_media/events/2017/iodc-17-lawerence.pdf.Google Scholar
- Philip Carns, Sam Lang, Robert Ross, Murali Vilayannur, Julian Kunkel, and Thomas Ludwig. 2009. Small-file access in parallel file systems. In Parallel & Distributed Processing, 2009. IPDPS 2009. IEEE Symposium on. IEEE, Rome, Italy, 1--11. Google ScholarDigital Library
- Chameleon.org. 2017. Chameleon system. (2017). htps://www.chameleoncloud.org/about/chameleon/.Google Scholar
- Steve Conway and Chirag Dekate. 2014. High-Performance Data Analysis: HPC Meets Big Data. Technical Report. IDC.Google Scholar
- Jeffrey Dean and Sanjay Ghemawat. 2008. MapReduce: simplified data processing on large clusters. Commun. ACM 51, 1 (2008), 107--113. Google ScholarDigital Library
- Jack Dongarra et al. 2011. The International Exascale Software Project roadmap. International Journal of High Performance Computing Applications 25, 1 (2011), 3--60. Google ScholarDigital Library
- Robert Escriva, Bernard Wong, and Emin Gün Sirer. 2012. HyperDex: A distributed, searchable key-value store. Acm sigcomm computer communication review 42, 4 (2012), 25--36. Google ScholarDigital Library
- Mike Folk, Albert Cheng, and Kim Yates. 1999. HDF5: A file format and I/O library for high performance computing applications. In Proceedings of the International Conference for High Performance Computing, Networks, Storage and Analysis (Supercomputing), Vol. 99. ACM, Portland, OR, 5--33.Google Scholar
- Bryan George. 2017. UCAR CM1 atmospheric simulation. (2017). http://www2.mmm.ucar.edu/people/bryan/cm1/.Google Scholar
- Google Inc. 2017. LevelDB. (2017). htps://github.com/google/leveldb.Google Scholar
- John L Hennessy and David A Patterson. 2011. Computer architecture: a quantitative approach. Elsevier, New York, NY. Google ScholarDigital Library
- Tony Hey, Stewart Tansley, Kristin M Tolle, et al. 2009. The fourth paradigm: data-intensive scientific discovery. Vol. 1. Microsoft Research, Redmond, WA.Google Scholar
- High Performance Data Division Intel® Enterprise Edition for Lustre Software. 2014. WHITE PAPER Big Data Meets High Performance Computing. Technical Report. Intel. https://goo.gl/GLZrRH.Google Scholar
- Earl Joseph and Steve Conway. 2014. IDC Update on How Big Data Is Redefining High Performance Computing. Technical Report. IDC.Google Scholar
- Anthony Kougkas, Hariharan Devarajan, and Xian-He Sun. 2017. Enosis: Bridging the Semantic Gap between File-based and Object-based Data Models. In Data-Intensive Computing in the Clouds(Datacloud'17), 8th International Workshop on. ACM SIGHPC, Denver,CO.Google Scholar
- Anthony Kougkas, Hariharan Devarajan, and Xian-He Sun. 2017. Syndesis: Mapping Objects to Files for a Unified Data Access System. In Many-Task Computing on Clouds, Grids, and Supercomputers(MTAGS'17), 9th International Workshop on. ACM SIGHPC, Denver, CO.Google Scholar
- Anthony Kougkas, Hassan Eslami, Xian-He Sun, Rajeev Thakur, and William Gropp. 2017. Rethinking key--value store for parallel I/O optimization. The International Journal of High Performance Computing Applications 31, 4 (2017), 335--356. Google ScholarDigital Library
- Avinash Lakshman and Prashant Malik. 2010. Cassandra. ACM SIGOPS Operating Systems Review 44, 2 (2010), 35. Google ScholarDigital Library
- Haoyuan Li, Ali Ghodsi, Matei Zaharia, Scott Shenker, and Ion Stoica. 2014. Tachyon: Reliable, memory speed storage for cluster computing frameworks. In Proceedings of the ACM Symposium on Cloud Computing. ACM, Seattle, WA, 1--15. Google ScholarDigital Library
- Jianwei Li, Wei-keng Liao, Alok Choudhary, Robert Ross, Rajeev Thakur, William Gropp, Robert Latham, Andrew Siegel, Brad Gallagher, and Michael Zingale. 2003. Parallel netCDF: A high-performance scientific I/O interface. In Supercomputing, 2003 ACM/IEEE Conference. ACM/IEEE, Phoenix, AZ, 39--39. Google ScholarDigital Library
- Los Alamos National Laboratory. 2017. Anonymous Scientiic Application. (2017). htp://institute.lanl.gov/data/tdata/.Google Scholar
- MongoDB Inc. 2017. MongoDB. (2017). htps://www.mongodb.com/white-papers.Google Scholar
- Montage. 2017. An Astronomical Image Mosaic Engine. (2017). htp://montage.ipac.caltech.edu/docs/m101tutorial.html.Google Scholar
- Monty, Taylor. 2017. OpenStack Object Storage (swit). (2017). htps://launchpad.net/swit.Google Scholar
- David Nagle, Denis Serenyi, and Abbie Matthews. 2004. The panasas activescale storage cluster: Delivering scalable high bandwidth storage. In Proceedings of the 2004 ACM/IEEE conference on Supercomputing. IEEE Computer Society, Pittsburgh, PA, 53. Google ScholarDigital Library
- Bogdan Nicolae, Diana Moise, Gabriel Antoniu, Luc Bougé, and Matthieu Dorier. 2010. BlobSeer: Bringing high throughput under heavy concurrency to Hadoop MapReduce applications. In Parallel & Distributed Processing (IPDPS), 2010 IEEE International Symposium on. IEEE, Atlanta, GA, 1--11.Google ScholarCross Ref
- Swapnil Patil and Garth A Gibson. 2011. Scale and Concurrency of GIGA+: File System Directories with Millions of Files.. In FAST, Vol. 11. ACM/Usenix, San Jose, CA, 13. Google ScholarDigital Library
- Steve Plimpton. 1995. Fast parallel algorithms for short-range molecular dynamics. Journal of computational physics 117, 1 (1995), 1--19. Google ScholarDigital Library
- Daniel A Reed and Jack Dongarra. 2015. Exascale computing and big data. Commun. ACM 58, 7 (2015), 56--68. Google ScholarDigital Library
- Kai Ren, Qing Zheng, Swapnil Patil, and Garth Gibson. 2014. IndexFS: scaling file system metadata performance with stateless caching and bulk insertion. In SC14: International Conference for High Performance Computing, Networking, Storage and Analysis. IEEE,New Orleans, LA, 237--248. Google ScholarDigital Library
- Robert B Ross, Rajeev Thakur, et al. 2000. PVFS: A parallel file system for Linux clusters. In Proceedings of the 4th annual Linux Showcase and Conference. Usenix, Atlanta, GA, 391--430. Google ScholarDigital Library
- Sandia National Laboratories. 2017. LAMMPS. (2017). htp://lammps.sandia.gov/.Google Scholar
- Frank B Schmuck and Roger L Haskin. 2002. GPFS: A Shared-Disk File System for Large Computing Clusters. In Proceedings of the 1st USENIX Conference on File and Storage Technologies, Vol. 2. Usenix, Monterey, CA, 231--244. Google ScholarDigital Library
- Wittawat Tantisiriroj, Seung Woo Son, Swapnil Patil, Samuel J Lang, Garth Gibson, and Robert B Ross. 2011. On the duality of data-intensive file system design: reconciling HDFS and PVFS. In Proceedings of 2011 International Conference for High Performance Computing, Networking, Storage and Analysis. ACM, Seattle, WA, 67. Google ScholarDigital Library
- Timothy James Tautges, Corey Ernst, Clint Stimpson, Ray J Meyers, and Karl Merkley. 2004. MOAB: a mesh-oriented database. Technical Report. Sandia National Laboratories.Google Scholar
- Rajeev Thakur, William Gropp, and Ewing Lusk. 1999. Data sieving and collective I/O in ROMIO. In Frontiers of Massively Parallel Computation, 1999. Frontiers' 99. The Seventh Symposium on the. IEEE, Annapolis, Maryland, 182--189. Google ScholarDigital Library
- Devesh Tiwari, Simona Boboila, Sudharshan S Vazhkudai, Youngjae Kim, Xiaosong Ma, Peter Desnoyers, and Yan Solihin. 2013. Active flash: towards energy-efficient, in-situ data analytics on extreme-scale machines.. In FAST. Usenix, San Jose, CA, 119--132. Google ScholarDigital Library
- Murali Vilayannur, Partho Nath, and Anand Sivasubramaniam. 2005. Providing Tunable Consistency for a Parallel File Store.. In FAST, Vol. 5. Usenix, San Francisco, CA, 2--2. Google ScholarDigital Library
- Feng Wang, Scott A Brandt, Ethan L Miller, and Darrell DE Long. 2004. OBFS: A File System for Object-Based Storage Devices.. In MSST, Vol. 4. IEEE Computer Society, Adelphi, MD, 283--300.Google Scholar
- Weather Research and Forecasting Model. 2017. WRF. (2017). htp://www.wrf-model.org/index.php.Google Scholar
- Sage A Weil, Scot A Brandt, Ethan L Miller, Darrell DE Long, and Carlos Maltzahn. 2006. Ceph: A scalable, high-performance distributed ile system. In Proceedings of the 7th symposium on Operating systems design and implementation. USENIX Association, Seatle, WA, United States, 307--320. Google ScholarDigital Library
- Brent Welch and Garth A Gibson. 2004. Managing Scalability in Object Storage Systems for HPC Linux Clusters.. In MSST. IEEE, Adelphi, Maryland, USA, 433--445.Google Scholar
- Brent Welch, Marc Unangst, Zainul Abbasi, Garth A Gibson, Brian Mueller, Jason Small, Jim Zelenka, and Bin Zhou. 2008. Scalable Performance of the Panasas Parallel File System.. In FAST, Vol. 8. USENIX Association, San Jose, CA, 1--17. Google ScholarDigital Library
- Yanlong Yin, Surendra Byna, Huaiming Song, Xian-He Sun, and Rajeev Thakur. 2012. Boosting application-specific parallel i/o optimization using IOSIG. In Cluster, Cloud and Grid Computing (CCGrid), 2012 12th IEEE/ACM International Symposium on. IEEE, Otawa, Canada, 196--203. Google ScholarDigital Library
- Matei Zaharia, Mosharaf Chowdhury, Tathagata Das, Ankur Dave, Justin Ma, Murphy McCauley, Michael J Franklin, Scot Shenker, and Ion Stoica. 2012. Resilient distributed datasets: A fault-tolerant abstraction for in-memory cluster computing. In Proceedings of the 9th USENIX conference on Networked Systems Design and Implementation. USENIX Association, San Jose, CA, 2--2. Google ScholarDigital Library
- Dongfang Zhao, Zhao Zhang, Xiaobing Zhou, Tonglin Li, Ke Wang, Dries Kimpe, Philip Carns, Robert Ross, and Ioan Raicu. 2014. Fusionfs: Toward supporting data-intensive scientific applications on extreme-scale high-performance computing systems. In Big Data (Big Data), 2014 IEEE International Conference on. IEEE, Washington DC, 61--70.Google ScholarCross Ref
- Qing Zheng, Kai Ren, and Garth Gibson. 2014. BatchFS: scaling the file system control plane with client-funded metadata servers. In Proceedings of the 9th Parallel Data Storage Workshop. IEEE Press, New Orleans, LA, 1--6. Google ScholarDigital Library
- Shujia Zhou, Bruce H Van Aartsen, and Thomas L Clune. 2008. A lightweight scalable I/O utility for optimizing High-End Computing applications. In Parallel and Distributed Processing, 2008. IPDPS 2008. IEEE International Symposium on. IEEE, Miami, FL, USA, 1--7.Google ScholarCross Ref
Index Terms
- IRIS: I/O Redirection via Integrated Storage
Recommendations
Using Working Set Reorganization to Manage Storage Systems with Hard and Solid State Disks
ICPPW '14: Proceedings of the 2014 43rd International Conference on Parallel Processing WorkshopsScientific applications from many problem domains produce and/or access large volumes of data. To support these applications, designers of high-end computing (HEC) systems have greatly increased the capacity of storage systems in recent years. However, ...
Hierarchical Storage from NVMe to Tapes
EMOSS '22: Proceedings of the 2022 Workshop on Emerging Open Storage Systems and Solutions for Data Intensive ComputingHierarchical storage allows to build both high-performance and high-capacity storage systems by combining various technologies like NVMe, SSDs, HDDs, or tapes.
While most existing systems only manage a limited number of levels and technologies, the IO-...
Storage utilization in the long tail of science
XSEDE '15: Proceedings of the 2015 XSEDE Conference: Scientific Advancements Enabled by Enhanced CyberinfrastructureThe increasing expansion of computations in non-traditional domain sciences has resulted in an increasing demand for research cyberinfrastructure that is suitable for small- and mid-scale job sizes. The computational aspects of these emerging ...
Comments