Skip to main content
Log in

Extending Dublin Core Metadata to Support the Description and Discovery of Language Resources

  • Published:
Computers and the Humanities Aims and scope Submit manuscript

Abstract

As language data and associatedtechnologies proliferate and as the languageresources community expands, it is becomingincreasingly difficult to locate and reuse existingresources. Are there any lexical resources forsuch-and-such a language? What tool workswith transcripts in this particular format?What is a good format to use for linguisticdata of this type? Questions like these dominate manymailing lists, since web search engines are anunreliable way to find language resources. Thispaper reports on a new digital infrastructurefor discovering language resources beingdeveloped by the Open Language Archives Community(OLAC). At the core of OLAC is its metadataformat, which is designed to facilitatedescription and discovery of all kinds oflanguage resources, including data, tools, oradvice. The paper describes OLAC metadata, itsrelationship to Dublin Core metadata, and itsdissemination using the metadata harvesting protocol of the Open Archives Initiative.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Similar content being viewed by others

References

  • Aristar Dry H., Appleby M. (2003). OLAC Linguistic Subject Vocabulary. [http://www.languagearchives.org/REC/field.html]

  • Aristar Dry H., Johnson H. (2002). OLAC Linguistic Data Type Vocabulary. [http://www.languagearchives.org/REC/type.html]

  • Bird S., Simons G. (eds.) (2000). Proceedings of the Workshop on Web-Based Language Documentation and Description. [http://www.ldc.upenn.edu/exploration/expl2000/]

  • Bird S., Simons G. (2001). The OLAC Metadata Set and Controlled Vocabularies. Proceedings of ACL/EACLWorkshop on Sharing Tools and Resources for Research and Education. [http://arXiv.org/abs/cs/0105030]

  • Bird S., Simons G. (eds.) (2002). Proceedings of the IRCS Workshop on Open Language Archives. [http://www.language-archives.org/events/olac02/]

  • Bird S., Simons G. (2003). Seven Dimensions of Portability for Language Documentation and Description. Language, 79, pp. 557–582.

    Google Scholar 

  • DCMI (2000). Dublin Core Qualifiers. [http://dublincore.org/documents/2000/07/11/dcmesqualifiers/]

  • DCMI (2002). DCMI Elements and Element Refinements–a current list. [http://dublincore.org/ usage/terms/dc/current-elements/]

  • Grimes B.F. (ed.) (2000). Ethnologue: Languages of the World. Dallas: Summer Institute of Linguistics, 14th edition. [http//www.ethnologue.com/]

    Google Scholar 

  • Heery R., Patel M. (2000). Application Profiles: Mixing and Matching Metadata Schemas. Ariadne, Vol. 25, UK Office for Library and Information networking (UKOLN), University of Bath. [http://www.ariadne.ac.uk/issue25/app-profiles/]

  • ISO (1998). ISO 639: Codes for the Representation of Names of Languages-Part 2: Alpha-3 Code. [http://lcweb.loc.gov/standards/iso639–2/langhome.html]

  • Johnson H. (2002). OLAC Role Vocabulary. [http://www.language-archives.org/REC/role.html]

  • Johnson H., Aristar Dry H. (2002). OLAC Discourse Type Vocabulary. [http://www.languagearchives.org/REC/discourse.html]

  • Lagoze C., Van de Sompel H. (2001). The Open Archives Initiative: Building a Low-barrier Interoperability Framework. Proceedings of the First ACM/IEEE-CS Joint Conference on Digital Libraries, pp. 54–62. [http://www.cs.cornell.edu/lagoze/papers/oai-jcdl.pdf]

  • Powell A., Johnston P. (2002). Guidelines for Implementing Dublin Core in XML. [http://dublincore.org/documents/2002/09/09/dc-xml-guidelines]

  • Simons G. (2000). Language Identification inMetadata Descriptions of Language Archive Holdings. In Bird S. and Simons G. (eds.), Proceedings of the Workshop on Web-Based Language Documentation and Description. [http://www.ldc.upenn.edu/exploration/expl2000/papers/simons/]

  • Simons G. (2003). Specifications for an OLAC Metadata Display Format and an OLAC-to-OAI DC Crosswalk. [http://www.language-archives.org/NOTE/olac_display.html]

  • Simons G., Bird S. (2002a). OLAC Metadata. [http://www.language-archives.org/OLAC/metadata.html]

  • Simons G., Bird S. (2002b). OLAC Process. [http://www.language-archives.org/OLAC/process.html]

  • Simons G., Bird S. (2003). Building an Open Language Archives Community on the OAI Foundation. Library Hi Tech, 21/2. [http://www.arxiv.org/abs/cs.CL/0302021]

  • Svenonius E. (2000). The Intellectual Foundation of Information Organization. The MIT Press.

  • Van de Sompel H., Lagoze C. (2002). Notes from the Interoperability Front: A Progress Report on the Open Archives Initiative. Proceedings of the European Conference on Digital Libraries, pp. 144–57. [http://www.openarchives.org/documents/ecdl-oai.pdf]

  • Wolf M., Wicksteed C. (1997). Date and Time Formats. [http://www.w3.org/TR/NOTE-datetime]

Download references

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Steven Bird.

Rights and permissions

Reprints and permissions

About this article

Cite this article

Bird, S., Simons, G. Extending Dublin Core Metadata to Support the Description and Discovery of Language Resources. Computers and the Humanities 37, 375–388 (2003). https://doi.org/10.1023/A:1025720518994

Download citation

  • Issue Date:

  • DOI: https://doi.org/10.1023/A:1025720518994

Navigation