In this paper, we study the following nonlinear matrix decomposition (NMD) problem: given a sparse nonnegative matrix X, find a low-rank matrix \Theta such that X \approx f(\Theta), where f is an element-wise nonlinear function. We focus on the case where f(\cdot) = \max(0, \cdot), the rectified unit (ReLU) non-linear activation. We refer to the corresponding problem as ReLU-NMD. We first provide a brief overview of the existing approaches that were developed to tackle ReLU-NMD. Then we introduce two new algorithms: (1) aggressive accelerated NMD (A-NMD) which uses an adaptive Nesterov extrapolation to accelerate an existing algorithm, and (2) three-block NMD (3B-NMD) which parametrizes \Theta = WH and leads to a significant reduction in the computational cost. We also propose an effective initialization strategy based on the nuclear norm as a proxy for the rank function. We illustrate the effectiveness of the proposed algorithms (available on gitlab) on synthetic and real-world data sets.
Accelerated Algorithms For Nonlinear Matrix Decomposition With The Relu Function / Seraghiti G.; Awari A.; Vandaele A.; Porcelli M.; Gillis N.. - ELETTRONICO. - (2023), pp. 1-6. ( 33rd IEEE International Workshop on Machine Learning for Signal Processing Roma 17-20 Settembre 2023) [10.1109/MLSP55844.2023.10285984].
Accelerated Algorithms For Nonlinear Matrix Decomposition With The Relu Function
Porcelli M.;
2023
Abstract
In this paper, we study the following nonlinear matrix decomposition (NMD) problem: given a sparse nonnegative matrix X, find a low-rank matrix \Theta such that X \approx f(\Theta), where f is an element-wise nonlinear function. We focus on the case where f(\cdot) = \max(0, \cdot), the rectified unit (ReLU) non-linear activation. We refer to the corresponding problem as ReLU-NMD. We first provide a brief overview of the existing approaches that were developed to tackle ReLU-NMD. Then we introduce two new algorithms: (1) aggressive accelerated NMD (A-NMD) which uses an adaptive Nesterov extrapolation to accelerate an existing algorithm, and (2) three-block NMD (3B-NMD) which parametrizes \Theta = WH and leads to a significant reduction in the computational cost. We also propose an effective initialization strategy based on the nuclear norm as a proxy for the rank function. We illustrate the effectiveness of the proposed algorithms (available on gitlab) on synthetic and real-world data sets.| File | Dimensione | Formato | |
|---|---|---|---|
|
ReLu_seraghiti_23.pdf
Accesso chiuso
Licenza:
Tutti i diritti riservati
Dimensione
555.43 kB
Formato
Adobe PDF
|
555.43 kB | Adobe PDF | Richiedi una copia |
I documenti in FLORE sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.



