Olga Lyashevskaya
- Professor: Faculty of Humanities / School of Linguistics
- Olga Lyashevskaya has been at HSE University since 2011.
Education and Degrees
All-Russian Insitute for Scientific and Technical Information
Thesis Title: Non-standard semantics of Russian nominal number
All-Russian Insitute for Scientific and Technical Information
Russian State University for the Humanities, Theoretical and applied linguistics
According to the International Standard Classification of Education (ISCED) 2011, Candidate of Sciences belongs to ISCED level 8 - "doctoral or equivalent", together with PhD, DPhil, D.Lit, D.Sc, LL.D, Doctorate or similar. Candidate of Sciences allows its holders to reach the level of the Associate Professor.
Awards and Accomplishments
Best Teacher — 2019, 2017, 2013
Courses (2024/2025)
- Computer Tools for Linguistic Research (Bachelor’s programme; Faculty of Humanities (Nizhny Novgorod); 2 year, 3, 4 module)Eng
- Corpus Linguistics (Master’s programme; Faculty of Humanities; 1 year, 3, 4 module)Rus
- Corpus Linguistics (Mago-Lego; 3, 4 module)Rus
- Programming and Linguistic Data (Bachelor’s programme; Faculty of Humanities; 1 year, 1, 2, 4 module)Rus
- Research Seminar "Analysis and visualization of text data" (Master’s programme; Faculty of Humanities; 1 year, 3, 4 module)Rus
- Research Seminar "Neural Network Modeling of Long Sequences in NLP” (Bachelor’s programme; Faculty of Humanities; 4 year, 1, 2 module)Rus
- Theoretical and Applied Lexicography (Bachelor’s programme; Faculty of Humanities; 4 year, 3 module)Rus
- Theoretical and Applied Lexicography (Bachelor’s programme; Faculty of Humanities; 3 year, 3 module)Rus
- Workshops (Master’s programme; Faculty of Humanities; 1 year, 1-4 module)Rus
- Past Courses
Courses (2023/2024)
- Computer Tools for Linguistic Research (Bachelor’s programme; Faculty of Humanities (Nizhny Novgorod); 2 year, 3, 4 module)Eng
- Corpus Linguistics (Mago-Lego; 3, 4 module)Rus
- Corpus Linguistics (Master’s programme; Faculty of Humanities; 1 year, 3, 4 module)Rus
- Corpus Linguistics II (Master’s programme; Faculty of Humanities; 2 year, 1, 2 module)Rus
- Corpus Linguistics II (Mago-Lego; 1, 2 module)Rus
- Programming and Linguistic Data (Bachelor’s programme; Faculty of Humanities; 1 year, 1, 2, 4 module)Rus
- Research Seminar "Analysis and visualization of text data" (Master’s programme; Faculty of Humanities; 1 year, 3, 4 module)Rus
- Theoretical and Applied Lexicography (Bachelor’s programme; Faculty of Humanities; 3 year, 3 module)Rus
- Theoretical and Applied Lexicography (Bachelor’s programme; Faculty of Humanities; 4 year, 3 module)Rus
Courses (2022/2023)
- Analysis and visualization of text data (Master’s programme; Faculty of Humanities; 1 year, 3, 4 module)Rus
- Computer Tools for Linguistic Research (Bachelor’s programme; Faculty of Humanities (Nizhny Novgorod); 2 year, 3, 4 module)Eng
- Corpus Linguistics (Mago-Lego; 3, 4 module)Rus
- Corpus Linguistics (Master’s programme; Faculty of Humanities; 1 year, 3, 4 module)Rus
- Corpus Linguistics (Master’s programme; HSE Tikhonov Moscow Institute of Electronics and Mathematics (MIEM HSE); 1 year, 3, 4 module)Rus
- Corpus Linguistics and Learning a Foreign Language (Bachelor’s programme; Faculty of Humanities (Nizhny Novgorod); 3 year, 3, 4 module)Rus
- Programming and Linguistic Data (Bachelor’s programme; Faculty of Humanities; 1 year, 1-4 module)Rus
- Theoretical and Applied Lexicography (Bachelor’s programme; Faculty of Humanities; 4 year, 3 module)Rus
- Theoretical and Applied Lexicography (Bachelor’s programme; Faculty of Humanities; 3 year, 3 module)Rus
Courses (2021/2022)
- Computer Tools for Linguistic Research (Bachelor’s programme; Faculty of Humanities (Nizhny Novgorod); 2 year, 3, 4 module)Eng
- Programming and Linguistic Data (Bachelor’s programme; Faculty of Humanities; 1 year, 1-4 module)Rus
- Theoretical and Applied Lexicography (Bachelor’s programme; Faculty of Humanities; 4 year, 3 module)Rus
- Theoretical and Applied Lexicography (Bachelor’s programme; Faculty of Humanities; 3 year, 3 module)Rus
Courses (2020/2021)
- Computer Tools for Linguistic Research (Bachelor’s programme; Faculty of Humanities (Nizhny Novgorod); 2 year, 3, 4 module)Eng
- Functional Approaches to Natural Language (Master’s programme; Faculty of Humanities; 1 year, 1, 2 module)Rus
- Linguistic Data: Quantitative Analysis and Visualisation (Master’s programme; Faculty of Humanities field of study Fundamental and Applied Linguistics, field of study Fundamental and Applied Linguistics; 1 year, 3, 4 module)Eng
- Programming and Linguistic Data (Bachelor’s programme; Faculty of Humanities; 1 year, 1-4 module)Rus
- Theoretical and Applied Lexicography (Bachelor’s programme; Faculty of Humanities; 4 year, 3 module)Rus
- Theoretical and Applied Lexicography (Bachelor’s programme; Faculty of Humanities; 3 year, 3 module)Rus
WordNet: Verbs (BA and MA students, 2014-2015)
Zaliznjak's dictionary (MA students, 2014-2015)
Russian word-formation (BA students, 2014-2015)
Annotation of FrameBank (summer internship, 2012-2015)
Conferences
- 2023
29-я Международная конференция по компьютерной лингвистике и интеллектуальным технологиям "Диалог" (Москва). Presentation: Disambiguation in context in the Russian National Corpus: 20 years later
CLASP Conference on Learning with Small Data (LSD) (Gothenburg). Presentation: From web to dialects: how to enhance non-standard Russian lects lemmatisation?
- 2022
6-й Колмогоровский семинар по компьютерной лингвистике и наукам о языке (Москва). Presentation: К задаче разработки версии корпусов НКРЯ с разрешенной неоднозначностью морфологической и синтаксической разметки
46-я школа-конференция ИППИ РАН «Информационные технологии и системы» (ИТиС-2022) (Огниково Московской области). Presentation: Опыт применения моделей-трансформеров для лемматизации современных и исторических текстов русского языка
International Conference on Historical Lexicography and Lexicology (ICHLL 2022) (Lorient). Presentation: Automatic collection of parallel thesauri in dictionary/corpus joint system
25th International Conference on Text, Speech, and Dialogue (TSD 2022) (Брно). Presentation: Review of Practices of Collecting and Annotating Texts in the Learner Corpus REALEC
Гаспаровские чтения - 2022 (Москва). Presentation: В стенах кипучих городов: О семантических границах эпитета в свете корпусных данных
- 2021
27-ая Международная конференция по компьютерной лингвистике и интеллектуальным технологиям «Диалог-2021» (Москва). Presentation: Adjunct role labeling for Russian
XIX EURALEX Congress (Александруполис). Presentation: Revised entries in the multi-volume edition and TEI encoding: a case of the historical dictionary of Russian
11th International Conference on Historical Lexicography and Lexicology (ICHLL 11) (Logroño, La Rioja). Presentation: Example, usage variant, and linking between dictionary and corpus data
11th International Conference on Historical Lexicography and Lexicology (ICHLL 11) (Logroño, La Rioja). Presentation: Lemmatization in corpus-to-dictionary systems: The case study for Old Church Slavonic
18th International Conference on Distributed Computing and Artificial Intelligence (DCAI) (Саламанка). Presentation: Automated Metaphor Identification in Russian and its Implications for Metaphor Studies
11th International Conference SLOVKO 2021: NLP, Corpus Linguistics and Interdisciplinarity (Братислава). Presentation: An HMM-based PoS Tagger for Old Church Slavonic
SCLC-2020/2021: The Slavic Cognitive Linguistics Conference (June 3-6, 2021) (Тромсё). Presentation: On syntactic structures in the Russian Constructicon entries and beyond
El’Manuscript 2021. Textual heritage and information technologies (Фрайбург). Presentation: Lemmatization of the Middle Russian Corpus within the RNC: Choice of Solutions
El’Manuscript 2021. Textual heritage and information technologies (Фрайбург). Presentation: Universal Dependencies for PreModern Russian: Morphology
Slavic aspect and (diachronic) corpora. International workshop (Майнц). Presentation: Profiling the behavior of verbs in the Middle Russian Corpus
The 10th International Conference on Analysis of Images, Social Networks and Texts (Тбилиси). Presentation: Sculpting enhanced dependencies for Belarusian
- 2020
26-я международная конференция по компьютерной лингвистике и интеллектуальным технологиям (Москва). Presentation: Русский конструктикон: новый лингвистический ресурс, его устройство и специфика
26-я международная конференция по компьютерной лингвистике и интеллектуальным технологиям (Москва). Presentation: GRAMEVAL 2020 Shared Task: Russian Full Morphology and Universal Dependencies Parsing
26-я международная конференция по компьютерной лингвистике и интеллектуальным технологиям (Москва). Presentation: Dialogue Evaluation: GramEval 2020: русский морфосинтаксис – сто2 лет спустя
- 2019
Digital Transformations & Global Society 2019 (DTGS’2019) (Санкт-Петербург). Presentation: A cross-genre morphological tagging and lemmatization of the Russian poetry: distinctive test sets and evaluation
Диалог (25-я международная конференция по компьютерной лингвистике и интеллектуальным технологиям) (Москва). Presentation: A Simple Fingerprint Approach to Extracting the Global Prosodic Properties from Field Data
Диалог (25-я международная конференция по компьютерной лингвистике и интеллектуальным технологиям) (Москва). Presentation: Многоцелевой морфологический стандарт разметки для языка с меняющейся грамматической структурой: случай старорусского корпуса
The Quantitative Approaches to Versification (Prague). Presentation: Lexical diversity and colour hues in Russian poetry: a corpus-based study of adjectives
Historical Corpora and Variation (Кальяри). Presentation: Variation in pre-modern Slavic corpus data and accuracy of neural tagging
Historical Corpora and Variation (Кальяри). Presentation: Spelling variation and word clusters in the Middle Russian Corpus
Межкампусная конференция по Digital Humanities «DH Meet-Up HSE» (Москва). Presentation: Данные поэтического корпуса НКРЯ как объект цифровой культуры
Towards a multilingual constructicon: issues, approaches, perspectives (Дюссельдорф). Presentation: Russian Constructicon: clusters, families, and usage scenarios
Международная конференция «И.А.Бодуэн де Куртенэ и мировая лингвистика» (VII Международные Бодуэновские чтения) (Казань). Presentation: В генеральских руках Ерофея: О синтаксическом представлении именованных сущностей в поэтическом и исторических корпусах
- 2018
Constructionist Approaches to Language Pedagogy (Остин, Техас). Presentation: A Constructicon for Learners of Russian
16th International Workshop on Treebanks and Linguistic Theories (Прага). Presentation: REALEC learner treebank: annotation principles and evaluation of automatic parsing
- 2017
Седьмой междисциплинарный семинар «Анализ разговорной русской речи» (АР3-2017) (Санкт-Петербург). Presentation: Устная разговорная речь и способы ее представления в Национальном корпусе русского языка
Компьютерная лингвистика и интеллектуальные технологии (Диалог 23) (Москва). Presentation: Multi-level student essay feedback in a learner corpus
Компьютерная лингвистика и интеллектуальные технологии (Диалог 23) (Москва). Presentation: Evaluation Tracks on Plagiarism Detection Algorithms for the Russian Language
Компьютерная лингвистика и интеллектуальные технологии (Диалог 23) (Москва). Presentation: MorphoRuEval-2017: an Evaluation Track for the Automatic Morphological Analysis Methods for Russian
9th International Conference SLOVKO: NLP, Corpus Linguistics, Terminology, e-Terminology (Братислава). Presentation: Text collections for evaluation of Russian morphological taggers
Международная научная конференция "Русский глагол" (к 50-летию выхода в свет книги А. В. Бондарко и Л. Л. Буланина) (Санкт-Петербург). Presentation: Словоизменение против словообразования: как предсказать вид русского глагола?
Русский язык: конструкционные и лексико-семантические подходы (V) (Санкт-Петербург). Presentation: Конструкционные свойства глаголов совершенного и несовершенного вида: от частного к общему
Slavic Cognitive Linguistics Conference 2017 (Санкт-Петербург). Presentation: А мы возьми и начни его строить: The Russian Constructicon
ALT-12 (Канберра). Presentation: A multilingual Constructicon: Bottom-up approach, compiling strategies, and construction types
The 4th Learner Corpus Research Conference (Больцано). Presentation: Automated English Essay Evaluation in Russian Students’ Learner Corpus
Гуманитарное образование и наука в техническом вузе (Ижевск). Presentation: Наречие в функции распространителя адъективированных причастий в современном русском языке
- 2016
Компьютерная лингвистика и интеллектуальные технологии (Диалог 22) (Москва). Presentation: Welcome to the club: Designing the inventory of semantic roles for adjectives
Письменное наследие и информационные технологии - El'Manuscript-2016 (Vilnius). Presentation: Создание лексико-грамматической базы для старорусского корпуса НКРЯ
Грамматические процессы и системы в синхронии и диахронии (Москва). Presentation: Орфографическая вариативность в древнерусском словоизменении: Квантитативное корпусное исследование
9th International Conference on Construction Grammar (ICCG 9) (Juiz de Fora). Presentation: Vector Space for Semantic Roles: Distances and Semantic Maps
2nd International FrameNet Workshop (Juiz de Fora). Presentation: Russian FrameBank now linked to FrameNet
Sky symposium Time and Language (Турку). Presentation: How learnable is Russian aspect?
- 2015
ICAI'15 - The 17th International Conference on Artificial Intelligence July 27-30, 2015 (Las Vegas). Presentation: Evaluation for morphologically rich language: Russian NLP
The 13th International Cognitive Linguistics Conference (ICLC-13) (Ньюкасл). Presentation: Whose mind do classifications of modality mirror?
Шестая международная конференция "Корпусная лингвистика-2015" (Санкт-Петербург). Presentation: Параллельный корпус автоматических и ручных расшифровок устной русской речи
- 2013
19-я Международная конференция по компьютерной лингвистике "Диалог". Presentation: Частотный лексико-грамматический словарь: проспект проект
19-я Международная конференция по компьютерной лингвистике "Диалог". Presentation: Семантические роли и сеть конструкций в системе FrameBank
19-я Международная конференция по компьютерной лингвистике "Диалог". Presentation: Визуализация данных для каталога русских лексических конструкций (на материале НКРЯ)
Корпусные технологии. Digital Humanities и современное знание (Нижний Новгород). Presentation: Квантитативный анализ корпуса для "чайников"
Supervisor of the following Doctoral theses
- 1N. Builova Verb constructions as a marker of literary formulas, 2023
- 2T. Shavrina Linguistic interpretation and evaluation of the wordvector models for Russian, 2022
- 3Y. Badryzlova Automated metaphor identification in Russian texts, 2019
- 4N. Login Machine learning on big corpora for e-learning tools development
Employment history
2011- senior researcher, Dept. of corpus linguistics and linguistic poetics, Vinogradov Institute of the Russian Language RAS (Moscow)
2011-2012 manager, Dept. of linguistics ontologies, Yandex (Moscow)
2010-2011 førsteamanuensis (Associate Professor),
2008-2010 post-doc, Institute of Linguistics, University of Tromso, Norway
2008-2011 doctoral researcher, Vinogradov Institute of the Russian Language RAS (Moscow)
2002-2008 senior researcher, Dept. of linguistic research,
2000-2002 senior researcher, Dept. of theoretical and applied informatics, Russian Institute for Scientific and Technical Information (VINITI, Moscow)
1997-2001 teacher of Russian as a foreign language,
1996-1998 teaching assistant, Philological faculty, Lomonosov Moscow State University (Moscow)
1995-1996 chief manager, dean's office, Faculty of theoretical and applied linguistics, Russian State University for the Humanities (RSUH, Moscow)
(Ab)normal Language: HSE Researchers Present Digital Tools for Assessing Mental Health Problems
Often, individuals with neurological or mental disorders exhibit distinctive language patterns. In modern clinical practice, digital tools can play a significant role in supporting language therapy and rehabilitation for persons with language disorders. Additionally, in the future, digital tools could assist healthcare specialists in assessing the severity of symptoms associated with such disorders.
'While it May Sound Futuristic, It Holds Great Promise': Olga Dragoy Shares Her Thoughts on Language Function Restoration and the Future of Neurotechnology
In the spring of 2023, the fifth strategic project of the Priority 2030 programme, 'Human Brain Resilience: Neurocognitive Technologies for Adaptation, Learning, Development and Rehabilitation in a Changing Environment,' was launched at HSE University. The strategic project brings together researchers from all campuses of HSE University. In her interview with the HSE News Service, Olga Dragoy, head of the strategic project and Director of the HSE Centre for Language and Brain, shares an overview of the advanced technologies neuroscientists are creating today, the underlying inspiration driving these efforts, and the operational dynamics of interdisciplinary applied projects.
New Technologies for Preserving Brain Functions: ‘Not Magic, but Normal Engineering’
New methods of brain mapping will make it easier to identify the cortex areas responsible for speech functions and to perform operations on the brain, as well as reduce the likelihood of damage to important areas. In addition, this will allow for more frequent use of non-invasive methods for restoring speech and other functions lost due to injuries and illnesses.
HSE Scholars to Participate in Creating a New Platform for Russian National Corpus
The Russian Ministry of Education and Science has announced the results of a grant competition for big research projects. One of the winners is a project with HSE University participation: the creation of a next generation computational linguistic platform to digitally record the Russian language.