Abstract
Welsh grammatical gender exhibits several unusual properties. This paper argues that these properties are necessarily connected. The argument is based on a series of corpus investigations using techniques from statistical natural language processing, specifically distinguishing properties that exhibit significant statistical patterns from those which can be used to make useable predictions. Specifically, it’s shown that the grammatical properties of Welsh gender are such that its unusual statistical properties follow.
Acknowledgements
Thanks to audiences in Tucson and Aberystwyth. Thanks also to Diane Ohala, Dalila Ayoun, Lionel Mathieu, several anonymous reviewers, and the editor for helpful feedback at various stages. All errors are my own.
References
Ayoun, Dalila. 2010. Corpus data: Shedding the light on French grammatical gender or not. Eurosla Yearbook 10. 119–141.10.1075/eurosla.10.08ayoSearch in Google Scholar
Cavnar, William B. & John M. Trenkle. 1994. N-gram-based text categorization. In Proceedings of sdair-94, third annual symposium on document analysis and information retrieval, 161–175.Search in Google Scholar
Cucerzan, Silviu & David Yarowsky. 2003. Minimally supervised induction of grammatical gender. In Proceedings of the 2003 conference of the north American chapter of the association for computational linguistics on human language technology-volume 1, 40–47. Association for Computational Linguistics.10.3115/1073445.1073451Search in Google Scholar
Daland, Robert, Andrea D. Sims & Janet Pierrehumbert. 2007. Much ado about nothing: a social network model of Russian paradigmatic gaps. Proceedings of the Annual Meeting of the Association of Computational Linguistics 45. 936–943.Search in Google Scholar
Dorian, Nancy C. 1976. Gender in a terminal Gaelic dialect. Scottish Gaelic Studies 12. 279–282.Search in Google Scholar
Ellis, N. C., C. O’Dochartaigh, W. Hicks, M. Morgan & N. Laporte. 2001. Cronfa electroneg o Gymraeg (CEG): A 1 million word lexical database and frequency count for Welsh. http://www.bangor.ac.uk/canolfanbedwyr/ceg.php.en (accessed 5 October 2015).Search in Google Scholar
Gathercole, Virginia C. Mueller & Enlli Môn Thomas. 2001. The acquisition of grammatical gender in Welsh. Journal of Celtic Language Learning 6. 53–87.Search in Google Scholar
Hammond, Michael. 2011. Welsh mutations and statistical phonotactics. In Andrew Carnie (ed.), Formal approaches to Celtic linguistics, 337–358. Newcastle upon Tyne: Cambridge Scholars Publishing.Search in Google Scholar
Hammond, Michael. 2014. Phonological complexity and input optimization. Phonological Studies 17. 85–94.Search in Google Scholar
Hannahs, S. J. 2013. The phonology of Welsh. Oxford: Oxford University Press.10.1093/acprof:oso/9780199601233.001.0001Search in Google Scholar
King, Gareth. 2003. Modern Welsh: a comprehensive grammar. London: Routledge.Search in Google Scholar
Manning, Christopher D. & Hinrich Schütze. 1999. Foundations of statistical natural language processing. Cambridge: MIT Press.Search in Google Scholar
Morgan, T.J. 1952. Y treigladau a’u cystrawen. Caerdydd (Cardiff): Gwasg Prifysgol Cymru.Search in Google Scholar
Nodine, Mark H. 2003. Welsh to English lexicon. http://www.cs.cf.ac.uk/fun/welsh/LexiconWE.html (accessed 31 December 2013).Search in Google Scholar
Sims, Andrea D. in press. Inflectional defectiveness. Cambridge: Cambridge University Press.10.1017/CBO9781107053854Search in Google Scholar
Thomas, Enlli Môn & Virginia C. Mueller Gathercole. 2005a. Minority language survival: Input factors influencing the acquisition of Welsh. In James Cohen, Kara T. McAlister, Kellie Rolstad & Jeff MacSwan (eds.), Isb4: Proceedings of the 4th International Symposium on Bilingualism, 852–874. Somerville, MA: Cascadilla Press.Search in Google Scholar
Thomas, Enlli Môn & Virginia C. Mueller Gathercole. 2005b. Minority language survival: Obsolescence or survival for Welsh in the face of English dominance. In James Cohen, Kara T. McAlister, Kellie Rolstad & Jeff MacSwan (eds.), Isb4: Proceedings of the 4th International Symposium on Bilingualism, 2233–2257. Somerville, MA: Cascadilla Press.Search in Google Scholar
Watkins, T. Arwyn. 1961. Ieithyddiaeth: Agweddau ar astudio iaith. Caerdydd (Cardiff): Gwasg Prifysgol Cymru.Search in Google Scholar
©2016 by De Gruyter Mouton