Research and Comparative Analysis of Optimization Methods Used in Training Neural Networks
DOI: https://doi.org/10.17308/sait.2018.4/1262

Keywords: optimization methods, neural networks, gradient descent, stochastic gradient, quasi-Newton methods, global and local minimum, objective error function

Abstract
Modern deep learning methods train neural networks by searching for the minimum of a continuous error function. In recent years, a variety of optimization algorithms have been proposed that take different approaches to updating the model parameters. This article analyses the optimization methods most commonly used for training neural networks and, based on the properties identified, offers recommendations for choosing an algorithm to tune neural networks on different data sets. The analysis covers several implementations of gradient descent, momentum methods, adaptive methods, and quasi-Newton methods; it summarizes the problems that arise in their use and identifies the main advantages of each method.
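As an illustration of the optimizer families the abstract compares, here is a minimal sketch (not taken from the paper; all hyperparameter values are illustrative) of one parameter-update step for plain gradient descent, momentum, and an adaptive method (Adam), applied to the one-dimensional objective f(w) = 0.5·w², whose gradient is simply w:

```python
import math

def grad(w):
    return w  # gradient of f(w) = 0.5 * w**2

def run(update, steps=100):
    # Iterate a given update rule from w = 1.0 and return the final parameter.
    w, state = 1.0, {}
    for _ in range(steps):
        w = update(w, grad(w), state)
    return w

def sgd(w, g, state, lr=0.1):
    # Plain gradient descent: step against the gradient.
    return w - lr * g

def momentum(w, g, state, lr=0.1, beta=0.9):
    # Momentum: accumulate an exponentially decaying velocity, then step.
    v = beta * state.get("v", 0.0) + g
    state["v"] = v
    return w - lr * v

def adam(w, g, state, lr=0.1, b1=0.9, b2=0.999, eps=1e-8):
    # Adam: per-parameter step from bias-corrected moment estimates.
    t = state.get("t", 0) + 1
    m = b1 * state.get("m", 0.0) + (1 - b1) * g
    v = b2 * state.get("s", 0.0) + (1 - b2) * g * g
    state.update(t=t, m=m, s=v)
    m_hat, v_hat = m / (1 - b1 ** t), v / (1 - b2 ** t)
    return w - lr * m_hat / (math.sqrt(v_hat) + eps)

for name, upd in [("SGD", sgd), ("Momentum", momentum), ("Adam", adam)]:
    # All three drive w toward the minimum at w = 0.
    print(f"{name:8s} |w| after 100 steps = {abs(run(upd)):.2e}")
```

On this convex quadratic all three methods approach the minimum; the differences the paper studies (speed, sensitivity to step size, behaviour near flat regions and saddle points) only become visible on harder, non-convex error surfaces.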