The paper examines the methods for discourse parsing for the Russian language within the framework of rhetorical structure theory. The development of a new corpus for full-text parsing of Russian-language texts of various genres is described. The applicability of various pretrained encoding language models for rhetorical analysis using two Russian-language corpora is analyzed. We propose a method for training neural network models on a mix of expert-annotated data for rhetorical parsing. This approach allows the models to parse the texts effectively regardless of variations in rhetorical relation sets used in different corpora. It is evaluated on the two large multigenre corpora of rhetorical annotation for the Russian language.
DOI: 10.3103/S0147688225700601
Download the article from Springer Nature: https://link.springer.com/content/pdf/10.3103/S0147688225700601.pdf
ResearchGate: https://www.researchgate.net/publication/401024768_Methods_for_Rhetorical_Structure_Parsing_in_Russian
Chistova, E. V. Methods for Rhetorical Structure Parsing in Russian // Scientific and Technical Information Processing, 2025, Vol. 52, pp. 727–737.