Methods for Rhetorical Structure Parsing in Russian

Authors

Chistova E.

Abstract

The paper examines methods for discourse parsing of Russian within the framework of Rhetorical Structure Theory (RST). It describes the development of a new corpus for full-text parsing of Russian texts of various genres and analyzes the applicability of various pretrained encoder language models to rhetorical analysis on two Russian-language corpora. We propose a method for training neural network models on a mix of expert-annotated data for rhetorical parsing. This approach allows the models to parse texts effectively regardless of variations in the rhetorical relation sets used in different corpora. The method is evaluated on two large multigenre Russian corpora with rhetorical annotation.

External links

DOI: 10.3103/S0147688225700601

Download the article from Springer Nature: https://link.springer.com/content/pdf/10.3103/S0147688225700601.pdf

ResearchGate: https://www.researchgate.net/publication/401024768_Methods_for_Rhetorical_Structure_Parsing_in_Russian

Reference link

Chistova, E. V. Methods for Rhetorical Structure Parsing in Russian // Scientific and Technical Information Processing, 2025, Vol. 52, pp. 727–737.