In this paper, we study the following nonlinear matrix decomposition (NMD) problem: given a sparse nonnegative matrix X, find a low-rank matrix \Theta such that X \approx f(\Theta), where f is an element-wise nonlinear function. We focus on the case where f(\cdot) = \max(0, \cdot), the rectified unit (ReLU) non-linear activation. We refer to the corresponding problem as ReLU-NMD. We first provide a brief overview of the existing approaches that were developed to tackle ReLU-NMD. Then we introduce two new algorithms: (1) aggressive accelerated NMD (A-NMD) which uses an adaptive Nesterov extrapolation to accelerate an existing algorithm, and (2) three-block NMD (3B-NMD) which parametrizes \Theta = WH and leads to a significant reduction in the computational cost. We also propose an effective initialization strategy based on the nuclear norm as a proxy for the rank function. We illustrate the effectiveness of the proposed algorithms (available on gitlab) on synthetic and real-world data sets.

Accelerated Algorithms For Nonlinear Matrix Decomposition With The Relu Function / Seraghiti G.; Awari A.; Vandaele A.; Porcelli M.; Gillis N.. - ELETTRONICO. - (2023), pp. 1-6. ( 33rd IEEE International Workshop on Machine Learning for Signal Processing Roma 17-20 Settembre 2023) [10.1109/MLSP55844.2023.10285984].

Accelerated Algorithms For Nonlinear Matrix Decomposition With The Relu Function

Porcelli M.;
2023

Abstract

In this paper, we study the following nonlinear matrix decomposition (NMD) problem: given a sparse nonnegative matrix X, find a low-rank matrix \Theta such that X \approx f(\Theta), where f is an element-wise nonlinear function. We focus on the case where f(\cdot) = \max(0, \cdot), the rectified unit (ReLU) non-linear activation. We refer to the corresponding problem as ReLU-NMD. We first provide a brief overview of the existing approaches that were developed to tackle ReLU-NMD. Then we introduce two new algorithms: (1) aggressive accelerated NMD (A-NMD) which uses an adaptive Nesterov extrapolation to accelerate an existing algorithm, and (2) three-block NMD (3B-NMD) which parametrizes \Theta = WH and leads to a significant reduction in the computational cost. We also propose an effective initialization strategy based on the nuclear norm as a proxy for the rank function. We illustrate the effectiveness of the proposed algorithms (available on gitlab) on synthetic and real-world data sets.
2023
33rd IEEE International Workshop on Machine Learning for Signal Processing, MLSP 2023
33rd IEEE International Workshop on Machine Learning for Signal Processing
Roma
17-20 Settembre 2023
Seraghiti G.; Awari A.; Vandaele A.; Porcelli M.; Gillis N.
File in questo prodotto:
File Dimensione Formato  
ReLu_seraghiti_23.pdf

Accesso chiuso

Licenza: Tutti i diritti riservati
Dimensione 555.43 kB
Formato Adobe PDF
555.43 kB Adobe PDF   Richiedi una copia

I documenti in FLORE sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.

Utilizza questo identificatore per citare o creare un link a questa risorsa: https://hdl.handle.net/2158/1351292
Citazioni
  • ???jsp.display-item.citation.pmc??? ND
  • Scopus 5
  • ???jsp.display-item.citation.isi??? ND
social impact