Рупрехт фон Вальденфельс из Польской академии наук (Краков) рассказал о параллельных славянских корпусах
14 ноября на научном семинаре направления "Лингвистика" выступил Рупрехт фон Вальденфельс из Польской академии наук (Краков) с докладом на тему "Slavic derivational morphology between contact and inheritance: a corpus driven approach". Участник совместных с нашим факультетом экспедиций и других проектов, Рупрехт также читает курс лекций для студентов направления "Лингвистика".
Краткая аннотация доклада:
In a closely related set of languages such as that of the Slavic genus, differences in the use of derivational morphology are difficult to analyze because of the multitude of functions and factors which are rarely clear cut. The talk presents a method to investigate divergence and convergence in the use of derivational morphology on the basis of translationally equivalent texts. This investigation is part of a more general system which allows for the simple investigation of many, heterogenous linguistic variables.
Using a word aligned, morphologically tagged and lemmatized parallel corpus of prose in all major Slavic languages (ParaSol), I compare the use of nominal suffixes and verbal prefixes in translationally equivalent verb forms across languages. This gives me an extensionally defined handle on the domain of use for each derivational type and makes a comparison of interlanguage differences as well as differences between types possible. The data are evaluated both using clustering algorithms as well as qualitatively using a web interface that visualizes the contrasting prefixed forms in context.