Skip to content
Licensed Unlicensed Requires Authentication Published by De Gruyter Mouton September 25, 2006

Ways of trying in Russian: clustering behavioral profiles

  • Dagmar Divjak and Stefan Th Gries

Abstract

This article proposes a methodology for addressing three long-standing problems of near synonym research. First, we show how the internal structure of a group of near synonyms can be revealed. Second, we deal with the problem of distinguishing the subclusters and the words in those subclusters from each other. Finally, we illustrate how these results identify the semantic properties that should be mentioned in lexicographic entries. We illustrate our methodology with a case study on nine near synonymous Russian verbs that, in combination with an infinitive, express TRY.

Our approach is corpus-linguistic and quantitative: assuming a strong correlation between semantic and distributional properties, we analyze 1,585 occurrences of these verbs taken from the Amsterdam Corpus and the Russian National Corpus, supplemented where necessary with data from the Web. We code each particular instance in terms of 87 variables (a.k.a. ID tags), i. e., morphosyntactic, syntactic and semantic characteristics that form a verb's behavioral profile. The resulting co-occurrence table is evaluated by means of a hierarchical agglomerative cluster analysis and additional quantitative methods. The results show that this behavioral profile approach can be used (i) to elucidate the internal structure of the group of near synonymous verbs and present it as a radial network structured around a prototypical member and (ii) to make explicit the scales of variation along which the near synonymous verbs vary.

Received: 2005-01
Revised: 2005-11
Accepted: 2005-11
Published Online: 2006-09-25
Published in Print: 2006-June-01

© Walter de Gruyter

Downloaded on 27.4.2024 from https://www.degruyter.com/document/doi/10.1515/CLLT.2006.002/html
Scroll to top button