The paper deals with the pilot version of the first RST discourse treebank for Russian. The project started in 2016. At present, the treebank consists of sixty news texts annotated for rhetorical relations according to RST scheme. However, this scheme was slightly modified in order to achieve higher inter-annotator agreement score. During the annotation procedure, we also registered the discourse connectives of different types and mapped them onto the corresponding rhetoric relations. In present paper, we discuss our experience of RST scheme adaptation for Russian news texts. Besides, we report on the distribution of the most frequent discourse connectives in our corpus.
DOI: http://dx.doi.org/10.18653/v1/W17-3604
PDF at Google Drive: https://drive.google.com/file/d/0B2w9OsTtG6QMYkYtd21XYjZVSGM/
Publications of Higher School of Economics: https://publications.hse.ru/en/chapters/211523840
Read at ResearchGate: https://www.researchgate.net/publication/320084029_Rhetorical_Relations_Markers_in_Russian_RST_Treebank
Semantic Scholar: https://api.semanticscholar.org/CorpusID:40676415
Toldova S., Pisarevskaya D., Ananyeva M., Kobozeva M., Nasedkin A., Nikiforova S., Pavlova I., Shelepov A. Rhetorical relations markers in Russian RST Treebank // Proceedings of the 6th Workshop on Recent Advances in RST and Related Formalisms. 2017. Pp. 29-33.