Excerpt: A simple neural network model consists of one hidden layer with a different number of neurons: n_neuron = 4 (see Fig. 1a) and n_neuron = 1 (see Fig. 1b). Fig. 1a shows the advantage of Adagrad with the adaptive learning rate over other methods. Fig. 1b, in turn, reveals Adagrad's...
Full metadata record
dc.contributor.author: Kulikovskikh, I.
dc.contributor.author: Prokhorov, S.
dc.contributor.author: Legović, T.
dc.contributor.author: Šmuc, T.
dc.date.accessioned: 2019-04-22 11:10:15
dc.date.available: 2019-04-22 11:10:15
dc.date.issued: 2019-05
dc.identifier: Dspace\SGAU\20190421\75664
dc.identifier.citation: Kulikovskikh I. Growing descent of stochastic gradient with the generalized logistic map / Kulikovskikh I., Prokhorov S., Legović T., Šmuc T. // Proceedings of ITNT-2019 [Text]: V International Conference and Youth School "Information Technology and Nanotechnology": 21-24 May: in 4 vols. / Samara National Research University named after S. P. Korolev (Samara University), Image Processing Systems Institute of the RAS, a branch of the FSRC "Crystallography and Photonics" of the RAS; [ed. by V. A. Fursov]. - Samara: Новая техника, 2019. - Vol. 4: Data Science. - 2019. - P. 338-344.
dc.identifier.uri: http://repo.ssau.ru/handle/Informacionnye-tehnologii-i-nanotehnologii/Growing-descent-of-stochastic-gradient-with-the-generalized-logistic-map-75664
dc.description.abstract: The paper considers the problem of accelerating the convergence of stochastic gradient descent (SGD) in an automatic way. Previous research puts forward such algorithms as Adagrad, Adadelta, RMSprop, Adam, etc. to adapt both the updates and the learning rates to the slope of a loss function. However, these adaptive methods do not share the same regret bound as the gradient descent method. Adagrad provably achieves the optimal regret bound under the assumption of convexity, but it accumulates the squared gradients in the denominator, which dramatically shrinks the learning rate. This research is aimed at introducing a generalized logistic map directly into the SGD method in order to automatically set its parameters to the slope of the logistic loss function. The optimizer based on the logistic map may be considered as a meta-learner that learns how to tune both the learning rate and the gradient updates with respect to the rate of population growth. The present study yields the "growing" descent method and a series of computational experiments that point out the benefits of injecting the logistic map. (An illustrative sketch contrasting the two update rules follows the metadata record below.)
dc.description.sponsorship: This work was supported by the Russian Federation President grant No. MK-6218.2018.9 and the Ministry of Education and Science of the Russian Federation grant No. 074-U01. The research in Section 3 was partly supported by RFBR (project No. 18-37-00219). The authors acknowledge the support by the Centre of Excellence project "DATACROSS", co-financed by the Croatian Government and the European Union through the European Regional Development Fund - the Competitiveness and Cohesion Operational Programme (KK.01.1.1.01.0009).
dc.language.iso: en_US
dc.publisher: Новая техника
dc.title: Growing descent of stochastic gradient with the generalized logistic map
dc.type: Article
dc.textpart: A simple neural network model consists of one hidden layer with a different number of neurons: n_neuron = 4 (see Fig. 1a) and n_neuron = 1 (see Fig. 1b). Fig. 1a shows the advantage of Adagrad with the adaptive learning rate over other methods. Fig. 1b, in turn, reveals Adagrad's...
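The abstract contrasts Adagrad, whose accumulated squared gradients can only shrink the step size, with a "growing" descent whose step size is retuned by a generalized logistic map. The paper's exact update rule is not reproduced in this record, so the sketch below is only a hedged illustration: adagrad_step shows the standard Adagrad update for contrast, while growing_sgd_step is a hypothetical variant in which the learning rate evolves under the classical logistic map x <- r * x * (1 - x) as a stand-in for the generalized map. The function names, the growth rate r, the cap lr_max, and the toy logistic-regression data are all assumptions made purely for illustration.

```python
import numpy as np

def adagrad_step(theta, grad, accum, lr=0.1, eps=1e-8):
    # Standard Adagrad: squared gradients accumulate in the denominator,
    # so the effective step lr / sqrt(accum) shrinks monotonically.
    accum += grad ** 2
    theta -= lr * grad / (np.sqrt(accum) + eps)
    return theta, accum

def growing_sgd_step(theta, grad, lr, r=2.8, lr_max=0.2):
    # Hypothetical "growing" step (illustration only): the learning rate is
    # treated as a normalized population x = lr / lr_max and re-tuned by the
    # classical logistic map x <- r * x * (1 - x); with r = 2.8 the rate grows
    # from a small seed toward the stable value lr_max * (1 - 1/r).
    theta -= lr * grad
    x = lr / lr_max
    lr = lr_max * r * x * (1.0 - x)
    return theta, lr

# Toy logistic-regression problem (assumed data, for demonstration only).
rng = np.random.default_rng(0)
X = rng.normal(size=(200, 4))
y = (X @ np.array([1.0, -2.0, 0.5, 0.0]) > 0).astype(float)

theta, lr = np.zeros(4), 0.01
for epoch in range(200):
    p = 1.0 / (1.0 + np.exp(-(X @ theta)))   # sigmoid predictions
    grad = X.T @ (p - y) / len(y)            # gradient of the logistic loss
    theta, lr = growing_sgd_step(theta, grad, lr)
```

With a growth rate below the chaotic regime of the map, the step size in this sketch increases from a small seed toward a stable plateau rather than decaying, which is the qualitative behaviour the abstract attributes to the "growing" descent; the published method, by contrast, sets the map's parameters according to the slope of the logistic loss rather than using a fixed r.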
Appears in collections: Information Technology and Nanotechnology

Files in this item:
File: paper42.pdf (Main article, 819.57 kB, Adobe PDF)



All items in the digital repository are protected by copyright; all rights reserved.