We investigate stochastic gradient methods and stochastic counterparts of the Barzilai–Borwein steplengths and their application to finite-sum minimization problems. Our proposal is based on the Trust-Region-ish (TRish) framework introduced in [Curtis et al. (2019)]. The new framework, named TRishBB, aims to enhance the performance of TRish and at reducing the computational cost of the second-order TRish variant. We propose three different methods belonging to the TRishBB framework and present the convergence analysis for possibly nonconvex objective functions, considering biased and unbiased gradient approximations. Our analysis requires neither diminishing step-sizes nor full gradient evaluation. The numerical experiments in machine learning applications demonstrate the effectiveness of applying the Barzilai–Borwein steplength with stochastic gradients and show improved testing accuracy compared to the TRish method.

Fully stochastic trust-region methods with Barzilai–Borwein steplengths / Bellavia, Stefania; Morini, Benedetta; Yousefi, Mahsa. - In: JOURNAL OF COMPUTATIONAL AND APPLIED MATHEMATICS. - ISSN 0377-0427. - STAMPA. - 476:(2026), pp. 0-0. [10.1016/j.cam.2025.117059]

Fully stochastic trust-region methods with Barzilai–Borwein steplengths

Bellavia, Stefania;Morini, Benedetta
;
Yousefi, Mahsa
2026

Abstract

We investigate stochastic gradient methods and stochastic counterparts of the Barzilai–Borwein steplengths and their application to finite-sum minimization problems. Our proposal is based on the Trust-Region-ish (TRish) framework introduced in [Curtis et al. (2019)]. The new framework, named TRishBB, aims to enhance the performance of TRish and at reducing the computational cost of the second-order TRish variant. We propose three different methods belonging to the TRishBB framework and present the convergence analysis for possibly nonconvex objective functions, considering biased and unbiased gradient approximations. Our analysis requires neither diminishing step-sizes nor full gradient evaluation. The numerical experiments in machine learning applications demonstrate the effectiveness of applying the Barzilai–Borwein steplength with stochastic gradients and show improved testing accuracy compared to the TRish method.
2026
476
0
0
Bellavia, Stefania; Morini, Benedetta; Yousefi, Mahsa
File in questo prodotto:
File Dimensione Formato  
1-s2.0-S0377042725005734-main.pdf

accesso aperto

Tipologia: Pdf editoriale (Version of record)
Licenza: Open Access
Dimensione 2.57 MB
Formato Adobe PDF
2.57 MB Adobe PDF

I documenti in FLORE sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.

Utilizza questo identificatore per citare o creare un link a questa risorsa: https://hdl.handle.net/2158/1436275
Citazioni
  • ???jsp.display-item.citation.pmc??? ND
  • Scopus ND
  • ???jsp.display-item.citation.isi??? ND
social impact