Graphs and networks offer a convenient way to study systems around us, including such complex ones as human language. Graph-based representations are proven to be an effective approach for a wide variety of Natural Language Processing (NLP) tasks.
In this course we will seek answers to three questions: (1) how to express the linguistic phenomena as graphs, (2) how to gain knowledge based on them, and (3) how to assess the quality of this knowledge. We will start with such traditional graph-based NLP and Information Retrieval (IR) methods as TextRank and Markov Clustering, and finish with such contemporary Machine Learning approaches as StarSpace and Graph Convolutions. Since most methods described in this course are unsupervised, a special attention is paid to their thorough assessment using both automatic metrics and human judgements, including crowdsourcing.
The course has four lectures on Language Graphs, Graph Clustering, Machine Learning with Graphs, Evaluation, and Crowdsourcing, that elaborately go through the corresponding algorithms step-by-step and suggest useful linguistic datasets. The target audience of this course are advanced graduate students, data analysts, and researchers in NLP and IR (but it is not limited to them).
Lectures are in Russian, but the slides are in English.
Date and time | Class|Name | Venue|short | Materials |
---|---|---|---|
17 April 14:00–15:30 |
Язык и графы, Lecture | Конференция в zoom, Онлайн | No |
24 April 14:00–15:30 |
Кластеризация графов, Lecture | Конференция в zoom, Онлайн | No |
08 May 14:00–15:30 |
Машинное обучение с графами, Lecture | Конференция в zoom, Онлайн | No |
15 May 14:00–15:30 |
Оценка качества, Lecture | Конференция в zoom, Онлайн | No |
22 May 14:00–15:30 |
Краудсорсинг, Lecture | Конференция в zoom, Онлайн | No |