Skip to content
Licensed Unlicensed Requires Authentication Published by De Gruyter Mouton October 17, 2015

Predicting the gender of Welsh nouns

  • Michael Hammond EMAIL logo

Abstract

Welsh grammatical gender exhibits several unusual properties. This paper argues that these properties are necessarily connected. The argument is based on a series of corpus investigations using techniques from statistical natural language processing, specifically distinguishing properties that exhibit significant statistical patterns from those which can be used to make useable predictions. Specifically, it’s shown that the grammatical properties of Welsh gender are such that its unusual statistical properties follow.

Acknowledgements

Thanks to audiences in Tucson and Aberystwyth. Thanks also to Diane Ohala, Dalila Ayoun, Lionel Mathieu, several anonymous reviewers, and the editor for helpful feedback at various stages. All errors are my own.

References

Ayoun, Dalila. 2010. Corpus data: Shedding the light on French grammatical gender or not. Eurosla Yearbook 10. 119–141.10.1075/eurosla.10.08ayoSearch in Google Scholar

Cavnar, William B. & John M. Trenkle. 1994. N-gram-based text categorization. In Proceedings of sdair-94, third annual symposium on document analysis and information retrieval, 161–175.Search in Google Scholar

Cucerzan, Silviu & David Yarowsky. 2003. Minimally supervised induction of grammatical gender. In Proceedings of the 2003 conference of the north American chapter of the association for computational linguistics on human language technology-volume 1, 40–47. Association for Computational Linguistics.10.3115/1073445.1073451Search in Google Scholar

Daland, Robert, Andrea D. Sims & Janet Pierrehumbert. 2007. Much ado about nothing: a social network model of Russian paradigmatic gaps. Proceedings of the Annual Meeting of the Association of Computational Linguistics 45. 936–943.Search in Google Scholar

Dorian, Nancy C. 1976. Gender in a terminal Gaelic dialect. Scottish Gaelic Studies 12. 279–282.Search in Google Scholar

Ellis, N. C., C. O’Dochartaigh, W. Hicks, M. Morgan & N. Laporte. 2001. Cronfa electroneg o Gymraeg (CEG): A 1 million word lexical database and frequency count for Welsh. http://www.bangor.ac.uk/canolfanbedwyr/ceg.php.en (accessed 5 October 2015).Search in Google Scholar

Gathercole, Virginia C. Mueller & Enlli Môn Thomas. 2001. The acquisition of grammatical gender in Welsh. Journal of Celtic Language Learning 6. 53–87.Search in Google Scholar

Hammond, Michael. 2011. Welsh mutations and statistical phonotactics. In Andrew Carnie (ed.), Formal approaches to Celtic linguistics, 337–358. Newcastle upon Tyne: Cambridge Scholars Publishing.Search in Google Scholar

Hammond, Michael. 2014. Phonological complexity and input optimization. Phonological Studies 17. 85–94.Search in Google Scholar

Hannahs, S. J. 2013. The phonology of Welsh. Oxford: Oxford University Press.10.1093/acprof:oso/9780199601233.001.0001Search in Google Scholar

King, Gareth. 2003. Modern Welsh: a comprehensive grammar. London: Routledge.Search in Google Scholar

Manning, Christopher D. & Hinrich Schütze. 1999. Foundations of statistical natural language processing. Cambridge: MIT Press.Search in Google Scholar

Morgan, T.J. 1952. Y treigladau a’u cystrawen. Caerdydd (Cardiff): Gwasg Prifysgol Cymru.Search in Google Scholar

Nodine, Mark H. 2003. Welsh to English lexicon. http://www.cs.cf.ac.uk/fun/welsh/LexiconWE.html (accessed 31 December 2013).Search in Google Scholar

Sims, Andrea D. in press. Inflectional defectiveness. Cambridge: Cambridge University Press.10.1017/CBO9781107053854Search in Google Scholar

Thomas, Enlli Môn & Virginia C. Mueller Gathercole. 2005a. Minority language survival: Input factors influencing the acquisition of Welsh. In James Cohen, Kara T. McAlister, Kellie Rolstad & Jeff MacSwan (eds.), Isb4: Proceedings of the 4th International Symposium on Bilingualism, 852–874. Somerville, MA: Cascadilla Press.Search in Google Scholar

Thomas, Enlli Môn & Virginia C. Mueller Gathercole. 2005b. Minority language survival: Obsolescence or survival for Welsh in the face of English dominance. In James Cohen, Kara T. McAlister, Kellie Rolstad & Jeff MacSwan (eds.), Isb4: Proceedings of the 4th International Symposium on Bilingualism, 2233–2257. Somerville, MA: Cascadilla Press.Search in Google Scholar

Watkins, T. Arwyn. 1961. Ieithyddiaeth: Agweddau ar astudio iaith. Caerdydd (Cardiff): Gwasg Prifysgol Cymru.Search in Google Scholar

Published Online: 2015-10-17
Published in Print: 2016-10-1

©2016 by De Gruyter Mouton

Downloaded on 14.5.2024 from https://www.degruyter.com/document/doi/10.1515/cllt-2015-0001/html
Scroll to top button