Использование методов обучения с подкреплением в задачах медицинской практики

Мария Владиславовна Демченко; Ирина Леонидовна Каширина; Мария Андреевна Фирюлина

doi:10.17308/sait.2022.1/9206

Authors

Maria V. Demchenko Voronezh State University https://orcid.org/0000-0002-6439-8957 (unauthenticated)
Irina L. Kashirina Voronezh State University https://orcid.org/0000-0002-8664-9817 (unauthenticated)
Maria A. Firyulina Voronezh State University https://orcid.org/0000-0003-3468-5514 (unauthenticated)

DOI:

https://doi.org/10.17308/sait.2022.1/9206

Keywords:

Reinforcement learning, Markov decision process, dynamic programming, Bellman’s equation, iteration over strategies, iteration over values, Monte Carlo, time difference method, SARSA, Q-Learning

Abstract

In this article the features of the modern reinforcement learning methods development for the medical tasks are discussed. Reinforcement learning methods are a popular machine learning tool used in the problems of finding optimal patient treatment strategies, personalized medicine, as well as interactive patient monitoring systems. One of the important task is to choose the optimal reinforcement learning algorithm from a variety of currently existing methods that have their own application specifics, advantages and disadvantages. This article is devoted to the analysis of the algorithmic apparatus of the most popular reinforcement learning methods and contains examples of models and results of the methods under consideration in the context of the problem of finding optimal treatment regimens for cardiac patients.

Author Biographies

Maria V. Demchenko, Voronezh State University

post-graduate student at Applied Mathematics and Mechanics faculty, Voronezh State University
Irina L. Kashirina, Voronezh State University

DSc in Technical Sciences, Professor of the Department of Mathematical Methods of Operations Research at Applied Mathematics and Mechanics faculty, Voronezh State University
Maria A. Firyulina, Voronezh State University

post-graduate student at Applied Mathematics and Mechanics faculty, Voronezh State University