This paper focuses on regularisation methods using models up to the third order to search for up to second-order critical points of a finite-sum minimisation problem. The variant presented belongs to the framework of [1]: it employs random models with accuracy guaranteed with a sufficiently large prefixed probability and deterministic inexact function evaluations within a prescribed level of accuracy. Without assuming unbiased estimators, the expected number of iterations is O( _1^ - 2 ) or O( _1^ - 3/2 ) when searching for a first-order critical point using a second or third order model, respectively, and of O( max [ _1^ - 3/2, _2^ - 3 ] ) when seeking for second-order critical points with a third order model, in which _j,j 1,2, is the j th-order tolerance. These results match the worst-case optimal complexity for the deterministic counterpart of the method. Preliminary numerical tests for first-order optimality in the context of nonconvex binary classification in imaging, with and without Artifical Neural Networks (ANNs), are presented and discussed.
Quadratic and Cubic Regularisation Methods with Inexact function and Random Derivatives for Finite-Sum Minimisation / Bellavia S.; Gurioli G.; Morini B.; Toint P.L.. - ELETTRONICO. - (2021), pp. 258-267. (Intervento presentato al convegno 21st International Conference on Computational Science and Its Applications, ICCSA 2021 tenutosi a ita nel 2021) [10.1109/ICCSA54496.2021.00043].
Quadratic and Cubic Regularisation Methods with Inexact function and Random Derivatives for Finite-Sum Minimisation
Bellavia S.;Gurioli G.;Morini B.;
2021
Abstract
This paper focuses on regularisation methods using models up to the third order to search for up to second-order critical points of a finite-sum minimisation problem. The variant presented belongs to the framework of [1]: it employs random models with accuracy guaranteed with a sufficiently large prefixed probability and deterministic inexact function evaluations within a prescribed level of accuracy. Without assuming unbiased estimators, the expected number of iterations is O( _1^ - 2 ) or O( _1^ - 3/2 ) when searching for a first-order critical point using a second or third order model, respectively, and of O( max [ _1^ - 3/2, _2^ - 3 ] ) when seeking for second-order critical points with a third order model, in which _j,j 1,2, is the j th-order tolerance. These results match the worst-case optimal complexity for the deterministic counterpart of the method. Preliminary numerical tests for first-order optimality in the context of nonconvex binary classification in imaging, with and without Artifical Neural Networks (ANNs), are presented and discussed.I documenti in FLORE sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.