Abstract
As language data and associatedtechnologies proliferate and as the languageresources community expands, it is becomingincreasingly difficult to locate and reuse existingresources. Are there any lexical resources forsuch-and-such a language? What tool workswith transcripts in this particular format?What is a good format to use for linguisticdata of this type? Questions like these dominate manymailing lists, since web search engines are anunreliable way to find language resources. Thispaper reports on a new digital infrastructurefor discovering language resources beingdeveloped by the Open Language Archives Community(OLAC). At the core of OLAC is its metadataformat, which is designed to facilitatedescription and discovery of all kinds oflanguage resources, including data, tools, oradvice. The paper describes OLAC metadata, itsrelationship to Dublin Core metadata, and itsdissemination using the metadata harvesting protocol of the Open Archives Initiative.
Similar content being viewed by others
References
Aristar Dry H., Appleby M. (2003). OLAC Linguistic Subject Vocabulary. [http://www.languagearchives.org/REC/field.html]
Aristar Dry H., Johnson H. (2002). OLAC Linguistic Data Type Vocabulary. [http://www.languagearchives.org/REC/type.html]
Bird S., Simons G. (eds.) (2000). Proceedings of the Workshop on Web-Based Language Documentation and Description. [http://www.ldc.upenn.edu/exploration/expl2000/]
Bird S., Simons G. (2001). The OLAC Metadata Set and Controlled Vocabularies. Proceedings of ACL/EACLWorkshop on Sharing Tools and Resources for Research and Education. [http://arXiv.org/abs/cs/0105030]
Bird S., Simons G. (eds.) (2002). Proceedings of the IRCS Workshop on Open Language Archives. [http://www.language-archives.org/events/olac02/]
Bird S., Simons G. (2003). Seven Dimensions of Portability for Language Documentation and Description. Language, 79, pp. 557–582.
DCMI (2000). Dublin Core Qualifiers. [http://dublincore.org/documents/2000/07/11/dcmesqualifiers/]
DCMI (2002). DCMI Elements and Element Refinements–a current list. [http://dublincore.org/ usage/terms/dc/current-elements/]
Grimes B.F. (ed.) (2000). Ethnologue: Languages of the World. Dallas: Summer Institute of Linguistics, 14th edition. [http//www.ethnologue.com/]
Heery R., Patel M. (2000). Application Profiles: Mixing and Matching Metadata Schemas. Ariadne, Vol. 25, UK Office for Library and Information networking (UKOLN), University of Bath. [http://www.ariadne.ac.uk/issue25/app-profiles/]
ISO (1998). ISO 639: Codes for the Representation of Names of Languages-Part 2: Alpha-3 Code. [http://lcweb.loc.gov/standards/iso639–2/langhome.html]
Johnson H. (2002). OLAC Role Vocabulary. [http://www.language-archives.org/REC/role.html]
Johnson H., Aristar Dry H. (2002). OLAC Discourse Type Vocabulary. [http://www.languagearchives.org/REC/discourse.html]
Lagoze C., Van de Sompel H. (2001). The Open Archives Initiative: Building a Low-barrier Interoperability Framework. Proceedings of the First ACM/IEEE-CS Joint Conference on Digital Libraries, pp. 54–62. [http://www.cs.cornell.edu/lagoze/papers/oai-jcdl.pdf]
Powell A., Johnston P. (2002). Guidelines for Implementing Dublin Core in XML. [http://dublincore.org/documents/2002/09/09/dc-xml-guidelines]
Simons G. (2000). Language Identification inMetadata Descriptions of Language Archive Holdings. In Bird S. and Simons G. (eds.), Proceedings of the Workshop on Web-Based Language Documentation and Description. [http://www.ldc.upenn.edu/exploration/expl2000/papers/simons/]
Simons G. (2003). Specifications for an OLAC Metadata Display Format and an OLAC-to-OAI DC Crosswalk. [http://www.language-archives.org/NOTE/olac_display.html]
Simons G., Bird S. (2002a). OLAC Metadata. [http://www.language-archives.org/OLAC/metadata.html]
Simons G., Bird S. (2002b). OLAC Process. [http://www.language-archives.org/OLAC/process.html]
Simons G., Bird S. (2003). Building an Open Language Archives Community on the OAI Foundation. Library Hi Tech, 21/2. [http://www.arxiv.org/abs/cs.CL/0302021]
Svenonius E. (2000). The Intellectual Foundation of Information Organization. The MIT Press.
Van de Sompel H., Lagoze C. (2002). Notes from the Interoperability Front: A Progress Report on the Open Archives Initiative. Proceedings of the European Conference on Digital Libraries, pp. 144–57. [http://www.openarchives.org/documents/ecdl-oai.pdf]
Wolf M., Wicksteed C. (1997). Date and Time Formats. [http://www.w3.org/TR/NOTE-datetime]
Author information
Authors and Affiliations
Corresponding author
Rights and permissions
About this article
Cite this article
Bird, S., Simons, G. Extending Dublin Core Metadata to Support the Description and Discovery of Language Resources. Computers and the Humanities 37, 375–388 (2003). https://doi.org/10.1023/A:1025720518994
Issue Date:
DOI: https://doi.org/10.1023/A:1025720518994