Analyse of the quality and expansion of tools for morphological analysis of texts in the russian language

Authors

DOI:

https://doi.org/10.17308/sait/1995-5499/2023/2/171-180

Keywords:

morphological analysis, automatic text processing, computer linguistics, text processing tools

Abstract

The increase of the amount of processed information leads to the necessity of its analysis automation and the development of corresponding software tools. The paper describes a comparative analysis of the existing tools of morphological text processing for the Russian language. For this task, a morphologically marked corpus of texts from the National Corpus of the Russian Language project was used. One of the compared tools is JMorfSdk developed by the authors. Based on the analysis results, features were proposed and implemented to eliminate the identified shortcomings, which allowed improving the quality of morphological analysis and expand the set of features of the developed tools for automatic analysis of texts in the Russian language.

Author Biographies

  • Ekaterina V. Politsyna, Moscow Aviation Institute (National Research University)

    PhD in Technical Sciences, Associate professor, department 319, Moscow Aviation Institute (National Research University)

  • Sergey A. Politsyn, Moscow Aviation Institute (National Research University)

    PhD in Technical Sciences, Associate professor, department 319, Moscow Aviation Institute (National Research University)

  • Alexander S. Porechny, Moscow Aviation Institute (National Research University)

    post-graduate student, department 319, Moscow Aviation Institute (National Research University)

  • Alexander N. Rykunov, Moscow Aviation Institute (National Research University)

    student, department 319, Moscow Aviation Institute (National Research University)

References

Downloads

Published

2023-09-29

Issue

Section

Computer Linguistics and Natural Language Processing

How to Cite

Analyse of the quality and expansion of tools for morphological analysis of texts in the russian language. (2023). Proceedings of Voronezh State University. Series: Systems Analysis and Information Technologies, 2, 171-180. https://doi.org/10.17308/sait/1995-5499/2023/2/171-180

Most read articles by the same author(s)