skip to main content
article
Free Access

Overview of the Vesta parallel file system

Published:01 December 1993Publication History
Skip Abstract Section

Abstract

The Vesta parallel file system provides parallel access from compute nodes to files distributed across I/O nodes in a massively parallel computer. Vesta is intended to solve the I/O problems of massively parallel computers executing numerically intensive scientific applications. Vesta has three interesting characteristics: First, it provides a user defined parallel view of file data, and allows user defined partitioning and repartitioning of files without moving data among I/O nodes. The parallel file access semantics of Vesta directly support the operations required by parallel language I/O libraries. Second, Vesta is scalable to a very large number (many hundreds) of I/O and compute nodes and does not contain any sequential bottlenecks in the data-access path. Third, it provides user-directed checkpointing of files during continuing program execution with very little processing overhead.

References

  1. [1] P.M. Chen, G.A. Gibson, R.H. Katz, and D.A. Patterson, "An evaluation of redundant arrays of disks using an Amdahl 5890". In SIGMETRICS Conf. Measurement & Modeling of Comput. Syst., pp. 74-85, May 1990. Google ScholarGoogle ScholarDigital LibraryDigital Library
  2. [2] P. M. Chen and D. A. Patterson, "Maximizing performance in a striped disk array". In 17th Ann. Intl. Symp. Computer Architecture Conf. Proc., pp. 322-331, May 1990. Google ScholarGoogle ScholarDigital LibraryDigital Library
  3. [3] A. L. Chervenak and R. H. Katz, "Performance of a disk array prototype". In SIGMETRICS Conf. Measurement & Modeling of Comput. Syst., pp. 188-197, May 1991. Google ScholarGoogle ScholarDigital LibraryDigital Library
  4. [4] E. DeBenedictis and J. M. del Rosario, "nCUBE parallel I/O software". In 11th Intl. Phoenix Conf. Computers & Communications , pp. 117-124, Apr 1992.Google ScholarGoogle Scholar
  5. [5] P. C. Dibble, M. L. Scott, and C. S. Ellis, "Bridge: a high-performance file system [or parallel processors". In 8th Intl. Conf. Distributed Comput. Syst., pp. 154-161, 1988.Google ScholarGoogle ScholarCross RefCross Ref
  6. [6] J. Edler, J. Lipkis, and E. Schonberg, "Memory management in Symunix II: a design for large-scale shared memory multiprocessors ". In Proc. Workshop on UNIX and Supercomputers, pp. 151-168, USENIX, Sep 1988.Google ScholarGoogle Scholar
  7. [7] M. P. Herlihy and J. M. Wing, "Linearizabiliy: a correctness condition for concurrent objects". ACM Trans. Prog. Lang. & Syst. 12(3), pp. 463-492, Jul 1990. Google ScholarGoogle ScholarDigital LibraryDigital Library
  8. [8] R. H. Katz, G. A. Gibson, and D. A. Patterson, "Disk system architectures for high performance computing". Proc. IEEE 77(12), pp. 1842-1858, Dec 1989.Google ScholarGoogle ScholarCross RefCross Ref
  9. [9] M. Y. Kim, "Synchronized disk interleaving". IEEE Trans. Comput. C-35(11), pp. 978-988, Nov 1986. Google ScholarGoogle ScholarDigital LibraryDigital Library
  10. [10] J. J. Kistler and M. Satyanarayanan, "Disconnected operation in the Coda file system". ACM Trans. Comput. Syst. 10(1), pp. 3-25, Feb 1992. Google ScholarGoogle ScholarDigital LibraryDigital Library
  11. [11] C. Koelbel and P. Mehrotra, "Compiling global name-space parallel loops for distributed execution". IEEE Trans. Parallel & Distributed Syst. 2(4), pp. 440-451, Oct 1991. Google ScholarGoogle ScholarDigital LibraryDigital Library
  12. [12] D. F. Kotz and C. S. Ellis, "Prefetching in file systems for MIMD multiprocessors". IEEE Trans. Parallel & Distributed Syst. 1(2), pp. 218-230, Apr 1990. Google ScholarGoogle ScholarDigital LibraryDigital Library
  13. [13] L. Lamport, "How to make a multiprocessor computer that correctly executes multiprocess programs". IEEE Trans. Comput. C-28(9), pp. 690-691, Sep 1979.Google ScholarGoogle ScholarDigital LibraryDigital Library
  14. [14] E.K. Lee and R.H. Katz, "Performance consequences of parity placement in disk arrays". In 4th Intl. Conf. Architect. Support for Prog. Lang. & Operating Syst., pp. 190-199, Apr 1991. Google ScholarGoogle ScholarDigital LibraryDigital Library
  15. [15] M.N. Nelson, B.B. Welch, and J.K. Ousterhout, "Caching in the Sprite network file system". ACM Trans. Comput. Syst. 6(1), pp. 134-154, Feb 1988. Google ScholarGoogle ScholarDigital LibraryDigital Library
  16. [16] S. Ng, D. Lang, and R. Selinger, "Trade-offs between devices and paths in achieving disk interleaving". In 15th Ann. Intl. Symp. Computer Architecture Conf. Proc., pp. 196-201, 1988. Google ScholarGoogle ScholarDigital LibraryDigital Library
  17. [17] D.A. Patterson, G. Gibson, and R.H. Katz, "A case for redundant arrays of inexpensive disks (RAID)". In SIGMOD Intl. Conf. Management of Data, pp. 109-116, Jun 1988. Google ScholarGoogle ScholarDigital LibraryDigital Library
  18. [18] P. Pierce, "A concurrent file system for a highly parallel mass storage subsystem". In 4th Conf. Hypercubes, Concurrent Comput., & Appl., vol. I, pp. 155-160, Mar 1989.Google ScholarGoogle Scholar
  19. [19] R. Rashid, A. Tevanian, Jr., M. Young, D. Golub, R. Baron, D. Black, W. J. Bolosky, and J. Chew, "Machine-independent virtual memory management for paged uniprocessor and multiprocessor architectures". IEEE Trans. Comput. 37(8), pp. 896- 908, Aug 1988. Google ScholarGoogle ScholarDigital LibraryDigital Library
  20. [20] A.L.N. Reddy and P. Banerjee, "An evaluation of multiple-disk I/O systems". IEEE Trans. Comput. 38(12), pp. 1680-1690, Dec 1989. Google ScholarGoogle ScholarDigital LibraryDigital Library
  21. [21] K. Salem and H. Garcia-Molina, "Disk striping". In Proc. Intl. Conf. Data Engineering, pp. 336-342, 1986. Google ScholarGoogle ScholarDigital LibraryDigital Library
  22. [22] M. Satyanarayanan, "Scalable, secure, and highly available distributed file access". Computer 23(5), pp. 9-21, May 1990. Google ScholarGoogle ScholarDigital LibraryDigital Library
  23. [23] M. Satyanarayanan, J. J. Kistler, P. Kumar, M. E. Okasaki, E.H. Siegel, and D.C. Steere, "Coda: a highly available file system for a distributed workstation environment". IEEE Trans. Comput. 39(4), pp. 447-459, Apr 1990. Google ScholarGoogle ScholarDigital LibraryDigital Library
  24. [24] D.C. Stokes, "DCE DIS vs. AFS: a disrtributed file system comparison". In UNITE! 92, pp. 1-12, 1992.Google ScholarGoogle Scholar

Index Terms

  1. Overview of the Vesta parallel file system

                Recommendations

                Comments

                Login options

                Check if you have access through your login credentials or your institution to get full access on this article.

                Sign in

                Full Access

                • Published in

                  cover image ACM SIGARCH Computer Architecture News
                  ACM SIGARCH Computer Architecture News  Volume 21, Issue 5
                  Special issue on input/output in parallel computer systems
                  Dec. 1993
                  83 pages
                  ISSN:0163-5964
                  DOI:10.1145/165660
                  Issue’s Table of Contents

                  Copyright © 1993 Authors

                  Publisher

                  Association for Computing Machinery

                  New York, NY, United States

                  Publication History

                  • Published: 1 December 1993

                  Check for updates

                  Qualifiers

                  • article

                PDF Format

                View or Download as a PDF file.

                PDF

                eReader

                View online with eReader.

                eReader