The complex of text corpus management tools usage in solving computer linguistics tasks

Authors

  • Сергей Александрович Полицын, Moscow Aviation Institute (National Research University)
  • Екатерина Валерьевна Полицына, Moscow Aviation Institute (National Research University)

DOI:

https://doi.org/10.17308/sait.2019.2/1300

Keywords:

automated text analysis tools, corpus of texts, linguistic markup, crawler, managing text corpuses

Abstract

Creating, annotating, and keeping linguistic corpora up to date is an urgent task today, driven in part by the needs of machine learning and the testing of new algorithms. The paper describes the development of a set of programs for creating and managing text corpora, along with some applications of these programs, which allow sub-corpora to be created based on a flexible set of parameters.

Author Biographies

  • Сергей Александрович Полицын, Moscow Aviation Institute (National Research University)

    Candidate of Technical Sciences, Associate Professor, Department 319, Moscow Aviation Institute (National Research University)

  • Екатерина Валерьевна Полицына, Moscow Aviation Institute (National Research University)

    Candidate of Technical Sciences, Associate Professor, Department 319, Moscow Aviation Institute (National Research University)

Published

2019-04-24

Section

Computer Linguistics and Natural Language Processing

How to Cite

The complex of text corpus management tools usage in solving computer linguistics tasks. (2019). Proceedings of Voronezh State University. Series: Systems Analysis and Information Technologies, 2, 134-142. https://doi.org/10.17308/sait.2019.2/1300
