Methods for Rhetorical Structure Parsing in Russian

Authors

Chistova E. V.

Abstract

The paper examines methods for discourse parsing of Russian within the framework of Rhetorical Structure Theory. It describes the development of a new corpus for full-text parsing of Russian-language texts of various genres, and analyzes how well various pretrained language model encoders support rhetorical analysis on two Russian-language corpora. We propose a method for training neural network models for rhetorical parsing on a mix of expert-annotated data. This approach allows the models to parse texts effectively regardless of variations in the rhetorical relation sets used in different corpora. The method is evaluated on the two large multigenre corpora with rhetorical annotation for Russian.

External links

DOI: 10.3103/S0147688225700601

Download the article from the Springer Nature publisher's website (in English): https://link.springer.com/content/pdf/10.3103/S0147688225700601.pdf

ResearchGate: https://www.researchgate.net/publication/401024768_Methods_for_Rhetorical_Structure_Parsing_in_Russian

How to cite

Chistova, E. V. Methods for Rhetorical Structure Parsing in Russian // Scientific and Technical Information Processing, 2025, Vol. 52, pp. 727–737.