The use of reinforcement learning methods in medical problems

Authors

DOI:

https://doi.org/10.17308/sait.2022.1/9206

Keywords:

Reinforcement learning, Markov decision process, dynamic programming, Bellman’s equation, iteration over strategies, iteration over values, Monte Carlo, time difference method, SARSA, Q-Learning

Abstract

In this article the features of the modern reinforcement learning methods development for the medical tasks are discussed. Reinforcement learning methods are a popular machine learning tool used in the problems of finding optimal patient treatment strategies, personalized medicine, as well as interactive patient monitoring systems. One of the important task is to choose the optimal reinforcement learning algorithm from a variety of currently existing methods that have their own application specifics, advantages and disadvantages. This article is devoted to the analysis of the algorithmic apparatus of the most popular reinforcement learning methods and contains examples of models and results of the methods under consideration in the context of the problem of finding optimal treatment regimens for cardiac patients.

Author Biographies

  • Maria V. Demchenko, Voronezh State University

    post-graduate student at Applied Mathematics and Mechanics faculty, Voronezh State University

  • Irina L. Kashirina, Voronezh State University

    DSc in Technical Sciences, Professor of the Department of Mathematical Methods of Operations Research at Applied Mathematics and Mechanics faculty, Voronezh State University

  • Maria A. Firyulina, Voronezh State University

    post-graduate student at Applied Mathematics and Mechanics faculty, Voronezh State University

References

Downloads

Published

2022-04-26

Issue

Section

Intelligent Information Systems, Data Analysis and Machine Learning

How to Cite

The use of reinforcement learning methods in medical problems. (2022). Proceedings of Voronezh State University. Series: Systems Analysis and Information Technologies, 1, 111-124. https://doi.org/10.17308/sait.2022.1/9206

Most read articles by the same author(s)