Сравнительный анализ методов машинного обучения для решения задачи классификации документов научно-образовательного учреждения

Authors

  • Михаил Николаевич Краснянский Tambov State Technical University image/svg+xml
  • Артем Дмитриевич Обухов Tambov State Technical University image/svg+xml
  • Александра Алексеевна Воякина Tambov State Technical University image/svg+xml
  • Екатерина Михайловна Соломатина Tambov State Technical University image/svg+xml

DOI:

https://doi.org/10.17308/sait.2018.3/1245

Keywords:

machine learning, classification of documents, electronic document management system, data preprocessing algorithm

Abstract

This article discusses the actual problem of classification of documents using machine learning methods in the subject area of research and educational institutions. Analysis of developments in this area showed that there is no sufficient theoretical basis for the integration of existing classification methods for the analysis of documents of research and educational institutions. Therefore, to solve this problem, an algorithm of classification of documents, taking into account the specifics of the documents of the subject area of scientific and educational institutions. The article deals with the system of features used to solve the problem of combined classification. The paper considers the approach of preprocessing of the text, which allows using the known methods of machine learning to improve the accuracy and speed of document clas-sification.

Author Biographies

  • Михаил Николаевич Краснянский, Tambov State Technical University

    Professor, doctor of technical Sciences, rector of Tambov State Technical University

  • Артем Дмитриевич Обухов, Tambov State Technical University

    Candidate of technical Sciences, senior lecturer of the Department «Computer-integrated systems in mechanical engineering» of Tambov State Technical University

  • Александра Алексеевна Воякина, Tambov State Technical University

    student of the Department of «Automated decision support systems» of Tambov State Technical University

  • Екатерина Михайловна Соломатина, Tambov State Technical University

    student of the Department of «Automated decision support systems» Tambov State Technical University

References

Downloads

Published

2018-08-03

Issue

Section

Computer Linguistics and Natural Language Processing

How to Cite

Сравнительный анализ методов машинного обучения для решения задачи классификации документов научно-образовательного учреждения. (2018). Proceedings of Voronezh State University. Series: Systems Analysis and Information Technologies, 3, 173-182. https://doi.org/10.17308/sait.2018.3/1245

Most read articles by the same author(s)