ABSTRACT
A semantic file system is an information storage system that provides flexible associative access to the system's contents by automatically extracting attributes from files with file type specific transducers. Associative access is provided by a conservative extension to existing tree-structured file system protocols, and by protocols that are designed specifically for content based access. Compatiblity with existing file system protocols is provided by introducing the concept of a virtual directory. Virtual directory names are interpreted as queries, and thus provide flexible associative access to files and directories in a manner compatible with existing software. Rapid attribute-based access to file system contents is implemented by automatic extraction and indexing of key properties of file system objects. The automatic indexing of files and directories is called "semantic" because user programmable transducers use information about the semantics of updated file system objects to extract the properties for indexing. Experimental results from a semantic file system implementation support the thesis that semantic file systems present a more effective storage abstraction than do traditional tree structured file systems for information sharing and command level programming.
- BP88.Brian N. Bershad and C. Brian Pinkerton. Watchdogs: extending the UNIX file system. In USENIX Association 1988 Winter Conference Proceedings, Dallas, Texas, February 1988.Google Scholar
- Cat83.R.G.G. Cattell. Design and Implementation of a Relationship-Entity-Datum Data Model. Technical Report CSL-83-4, Xerox PARC, Palo Alto, California, May 1983.Google Scholar
- CCI88.CCITT. The Directory- Overview of Concepts, Models and Services. Recommendation X.500, 1988.Google Scholar
- CG91.Vincent Care and Thomas Gross. Combining the concepts of compression and caching for a twolevel filesystem. In Fourth International Con/erence on Architectural Support/or Programming Languages and Operating Systems, pages 200- 211, ACM, Santa Clara, California, April 1991. Google ScholarDigital Library
- CL89.Brent Callaghan and Tom Lyon. The automounter. In USENIX Association 1989 Winter Con/erence Proceedings, 1989.Google Scholar
- Cla90.Claris Corporation, Santa Clara, California, January 1990. News Release.Google Scholar
- Cor.Lotus Corporation. Lotus Magellan: Quick Launch. Product tutorial, Lotus Corporation, Cambridge, Massachusetts. Part number 35115.Google Scholar
- DANO91.Peter B. Danzig, Jongsuk Ahn, John Noll, and Katia Obraczka. Distributed Indexing: A Scalable Mechanism/or Distributed In/ormation Retrieval. Technical Report USC-TR 91-06, University of Souther California, Computer Science Department, 1991.Google ScholarDigital Library
- GCS87.David K. Gifford, Robert G. Cote, and David A. Segal. Walter User's Manual. Technical Report MIT/LCS/TR-399, M.I.T. Laboratory for Computer Science, September 1987. Google ScholarDigital Library
- GLB85.David K. Gifford, John M. Lucassen, and Stephen T. Berlin. An architecture for large scale information systems. In l Oth Symposium on Operating System Principles, pages 161-170, ACM, December 1985. Google ScholarDigital Library
- GMT86.Ferdinando Gallo, Regis Minot, and Ian Thomas. he object management system of PCTE as a software engineering database management system. In Second A CM SIG- SOFT/SIGPLAN Software Engineering Symposium on Practical Software Development Environments, pages 12-15, ACM, December 1986. Google ScholarDigital Library
- GO91.David K. Gifford and James W. O'Toole. Intelligent file systems for object repositories. In Operating Systems o/ the 90s and Beyond, Internationales Begegnales- und Forschungszentrum fiir Informatik, Schloss Dagstuhl- GeschSftsstelle, Saarbriicken, Germany, July 1991. To be published by Springer-Verlag. Google ScholarDigital Library
- Gro86.Computer Systems Research Group. UNIX User's Reference Manual. 4.3 Berkeley Software Distribution, Berkeley, California, April 1986. Virtu al VAX- 11 Version.Google Scholar
- Inf90.Information Dimensions, Inc. BASISplus. The Key To Managing The World Of Information. Information Dimensions, Inc., Dublin, Ohio, 1990. Product description.Google Scholar
- Kaz88.Michael Leon Kazar. Synchronization and caching issues in the Andrew File System. In USENIX Association 1988 Winter Con/erence Proceedings, pages 31-43, 1988.Google Scholar
- Ki184.T.J. Killian. Processes as files. In USENIX Association 1984{ Summer Con/erence Proceedings, Salt Lake City, Utah, 1984.Google Scholar
- Kle86.S.R. Kleiman. Vnodes: an architecture for multiple file system types in Sun UNIX. in USENIX Association 1986 Winter Conference Proceedings, pages 238-247, 1986.Google Scholar
- KM91.Brewster Kahle and Art Medlar. An In/ormation System/or Corporate Users: Wide Area In- /ormation Servers. Technical Report TMC-199, Thinking Machines, Inc., April 1991. Version 3.Google Scholar
- Leg89.Legato Systems, Inc. Nhfsstone. Software package. Legato Systems, Inc., Palo Alto, California, 1989.Google Scholar
- Les.M.E. Lesk. Some applications of inverted indexes on the UNIX system. UNIX Supplementary Document, Section 30.Google Scholar
- Log91.Boss Logic, Inc. Boss DMS development specification. Technical documentation, Boss Logic, Inc., Fairfield, IA, February 1991.Google Scholar
- Mog86.Jeffrey C. Mogul. Representing In/ormation about F,les. Technical Report 86-1103, Stanford Univ. Department of CS, March 1986. Ph.D. Thesis. Google ScholarDigital Library
- NC89a.NeXT Corporation. 1.0 release notes: indexing. NeXT Corporation, Palo Alto, California, 1989.Google Scholar
- NC89b.NeXT Corporation. Text indexing facilities on the NeXT computer. NeXT Corporation, Palo Alto, California, 1989. from 1.0 Release Notes.Google Scholar
- Nee91.Roger Needham. 1991. Personal communication.Google Scholar
- Neu90.B. Clifford Neuman. The Virtual System Model: A Scalable Approach to Organizing Large Systems. Technical Report 90-05-01, Univ. of Washington CS Department, May 1990. Thesis Proposal. Google ScholarDigital Library
- NIS91.Ansi z39.50 version 2. National Information Standards Organization, Bethesda, Maryland, January 1991. Second Draft.Google Scholar
- OCH*85.John K. Ousterhout, Herv~ Da Costa, David Harrison, John A. Kunze, Mike Kupfer, and James G. Thompson. A trace-driven analysis of the unix 4.2bsd file system. In Symposium on Operating System Principles, pages 15-24, ACM, December 1985. Google ScholarDigital Library
- Pen90.Jan-Simon Pendry. Amd an automounter. Department of Computing, Imperial College, London, May 1990.Google Scholar
- Pet88.Larry Peterson. The Profile Naming Service. A CM Transactions on Computer Systems, 6(4):341-364, November 1988. Google ScholarDigital Library
- PPTT90.Rob Pike, Dave Presotto, Ken Thompson, and Howard Trickey. Plan 9 from Bell Labs. UK UUG proceedings, 1990.Google Scholar
- PW90.Jan-Simon Pendry and Nick Williams. Amd: the 4.4 BSD automounter reference manual. December 1990. Documentation for software revision 5.:t Alpha.Google Scholar
- Roc85.Marc J. Rochkind. Advanced UNIX Programming. Prentice-Hall, Inc., Englewood Cliffs, New Jersey, 1985. Google ScholarDigital Library
- RT74.D.M. Ritchie and K. Thompson. The UNIX Time-Sharing System. Comm. A CM, 17(7):365- 375, July 1974. Google ScholarDigital Library
- Sal83.Gerard Salton. Introduction to Modern Information Retrieval. McGraw-Hill, New York, 1983. Google ScholarDigital Library
- SC88.Sun Corporation. The Network Software Environment. Technical Report, Sun Computer Corporation, Mountain View, California, 1988.Google Scholar
- Sch89.Michael F. Schwartz. The Networked Resource Discovery Project. In Proceedings o/the IFIP XI World Congress, pages 827-832, IFIP, August 1989.Google Scholar
- SGK*85.R. Sandberg, D. Goldberg, S. Kleiman, D. Walsh, and B. Lyon. Design and implementation of the Sun Network Filesystem. In USENIX Association 1985 Summer Conference Proceedings, pages 119-130, 1985.Google Scholar
- SK86.C. Stanfill and B. Kahle. Parallel Free-Text Search on the Connection Machine System. Comm. ACM, 1229-1239, December 1986. Google ScholarDigital Library
- Sta87.Richard Stallman. GNU Emacs Manual. Free Software Foundation. Cambridge, MA, March 1987. Sixth Edition, Version 18. Google ScholarDigital Library
- Ste91.Richard Marion Stein. Browsing through terabytes: wide-area information servers open a new frontier in personal and corporate information services. Byte, 157-164, May 1991. Google ScholarDigital Library
- Sun88.Network Pro#ramming. Sun Microsystems, Sunnyvale, California, May 1988. Part Number 800- 1779-10.Google Scholar
- Sun89.NFS: network file system protocol specification. Sun Microsystems, Network Working Group, Request for Comments (RFC 1094), March 1989. Version 2.Google Scholar
- Tec90.ON Technology. ON Technology, Inc. announces On Location for the Apple Macintosh computer. News Release ON Technology, inc., Cambridge, Massachusetts, January 1990.Google Scholar
- Ver90.Verity. Topic. Product description, Verity, Mountain View, California, 1990.Google Scholar
- Wei.Peter Weinberger. CBT Program documentation. Bell Laboratories.Google Scholar
- WO88.Brent B. Welch and John K. Ousterhout. Pseudo devices: user-level extensions to the Sprite file system. In USENIX Association 1988 Summer Conference Proceedings, San Francisco, California, June 1988.Google Scholar
Index Terms
- Semantic file systems
Recommendations
Semantic file systems
A semantic file system is an information storage system that provides flexible associative access to the system's contents by automatically extracting attributes from files with file type specific transducers. Associative access is provided by a ...
Serverless network file systems
Special issue on operating system principlesWe propose a new paradigm for network file system design: serverless network file systems. While traditional network file systems rely on a central server machine, a serverless system utilizes workstations cooperating as peers to provide all file system ...
A multiple-file write scheme for improving write performance of small files in Fast File System
Fast File System (FFS) stores files to disk in separate disk writes, each of which incurs a disk positioning (seek + rotation) limiting the write performance for small files. We propose a new scheme called co-writing to accelerate small file writes in ...
Comments