This paper focuses on regularisation methods using models up to the third order to search for up to second-order critical points of a finite-sum minimisation problem. The variant presented belongs to the framework of [1]: it employs random models with accuracy guaranteed with a sufficiently large prefixed probability and deterministic inexact function evaluations within a prescribed level of accuracy. Without assuming unbiased estimators, the expected number of iterations is O( _1^ - 2 ) or O( _1^ - 3/2 ) when searching for a first-order critical point using a second or third order model, respectively, and of O( max [ _1^ - 3/2, _2^ - 3 ] ) when seeking for second-order critical points with a third order model, in which _j,j 1,2, is the j th-order tolerance. These results match the worst-case optimal complexity for the deterministic counterpart of the method. Preliminary numerical tests for first-order optimality in the context of nonconvex binary classification in imaging, with and without Artifical Neural Networks (ANNs), are presented and discussed.

Quadratic and Cubic Regularisation Methods with Inexact function and Random Derivatives for Finite-Sum Minimisation / Bellavia S.; Gurioli G.; Morini B.; Toint P.L.. - ELETTRONICO. - (2021), pp. 258-267. (Intervento presentato al convegno 21st International Conference on Computational Science and Its Applications, ICCSA 2021 tenutosi a ita nel 2021) [10.1109/ICCSA54496.2021.00043].

Quadratic and Cubic Regularisation Methods with Inexact function and Random Derivatives for Finite-Sum Minimisation

Bellavia S.;Gurioli G.;Morini B.;
2021

Abstract

This paper focuses on regularisation methods using models up to the third order to search for up to second-order critical points of a finite-sum minimisation problem. The variant presented belongs to the framework of [1]: it employs random models with accuracy guaranteed with a sufficiently large prefixed probability and deterministic inexact function evaluations within a prescribed level of accuracy. Without assuming unbiased estimators, the expected number of iterations is O( _1^ - 2 ) or O( _1^ - 3/2 ) when searching for a first-order critical point using a second or third order model, respectively, and of O( max [ _1^ - 3/2, _2^ - 3 ] ) when seeking for second-order critical points with a third order model, in which _j,j 1,2, is the j th-order tolerance. These results match the worst-case optimal complexity for the deterministic counterpart of the method. Preliminary numerical tests for first-order optimality in the context of nonconvex binary classification in imaging, with and without Artifical Neural Networks (ANNs), are presented and discussed.
2021
Proceedings - 2021 21st International Conference on Computational Science and Its Applications, ICCSA 2021
21st International Conference on Computational Science and Its Applications, ICCSA 2021
ita
2021
Bellavia S.; Gurioli G.; Morini B.; Toint P.L.
File in questo prodotto:
Non ci sono file associati a questo prodotto.

I documenti in FLORE sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.

Utilizza questo identificatore per citare o creare un link a questa risorsa: https://hdl.handle.net/2158/1266215
Citazioni
  • ???jsp.display-item.citation.pmc??? ND
  • Scopus 2
  • ???jsp.display-item.citation.isi??? 1
social impact