In this paper we propose an approach to avoiding catastrophic forgetting in sequential task learning scenarios. Our technique is based on a network reparameterization that approximately diagonalizes the Fisher Information Matrix of the network parameters. This reparameterization takes the form of a factorized rotation of parameter space which, when used in conjunction with Elastic Weight Consolidation (which assumes a diagonal Fisher Information Matrix), leads to significantly better performance on lifelong learning of sequential tasks. Experimental results on the MNIST, CIFAR-100, CUB-200 and Stanford-40 datasets demonstrate that we significantly improve the results of standard elastic weight consolidation, and that we obtain competitive results when compared to the state-of-the-art in lifelong learning without forgetting.

Rotate your Networks: Better Weight Consolidation and Less Catastrophic Forgetting / Liu, XL; Masana, M; Herranz, L; Van de Weijer, J; Lopez, AM; Bagdanov, AD. - ELETTRONICO. - (2018), pp. 2262-2268. (Intervento presentato al convegno INTERNATIONAL CONFERENCE ON PATTERN RECOGNITION) [10.1109/ICPR.2018.8545895].

Rotate your Networks: Better Weight Consolidation and Less Catastrophic Forgetting

Bagdanov, AD
2018

Abstract

In this paper we propose an approach to avoiding catastrophic forgetting in sequential task learning scenarios. Our technique is based on a network reparameterization that approximately diagonalizes the Fisher Information Matrix of the network parameters. This reparameterization takes the form of a factorized rotation of parameter space which, when used in conjunction with Elastic Weight Consolidation (which assumes a diagonal Fisher Information Matrix), leads to significantly better performance on lifelong learning of sequential tasks. Experimental results on the MNIST, CIFAR-100, CUB-200 and Stanford-40 datasets demonstrate that we significantly improve the results of standard elastic weight consolidation, and that we obtain competitive results when compared to the state-of-the-art in lifelong learning without forgetting.
2018
Proceedings of ICPR 2018
INTERNATIONAL CONFERENCE ON PATTERN RECOGNITION
Liu, XL; Masana, M; Herranz, L; Van de Weijer, J; Lopez, AM; Bagdanov, AD
File in questo prodotto:
Non ci sono file associati a questo prodotto.

I documenti in FLORE sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.

Utilizza questo identificatore per citare o creare un link a questa risorsa: https://hdl.handle.net/2158/1151114
Citazioni
  • ???jsp.display-item.citation.pmc??? ND
  • Scopus 181
  • ???jsp.display-item.citation.isi??? 124
social impact