Do textual descriptions help action recognition?

In this paper we present a novel method to improve action recognition by leveraging a set of captioned videos. By learning linear projections to map videos and text onto a common space, our approach shows that improved results on unseen videos can be obtained. We also propose a novel structure preserving loss that further ameliorates the quality of the projections. We tested our method on the challenging, realistic, Hollywood2 action recognition dataset where a considerable gain in performance is obtained. We show that the gain is proportional to the number of training samples used to learn the projections.

Do textual descriptions help action recognition? / Bruni, M., Uricchio, T., Seidenari, L., Del Bimbo, A.. - ELETTRONICO. - (2016), pp. 645-649. (ACM Multimedia gbr 2016) [10.1145/2964284.2967301].

Do textual descriptions help action recognition?

BRUNI, MATTEO;URICCHIO, TIBERIO;SEIDENARI, LORENZO;DEL BIMBO, ALBERTO

2016

Abstract

In this paper we present a novel method to improve action recognition by leveraging a set of captioned videos. By learning linear projections to map videos and text onto a common space, our approach shows that improved results on unseen videos can be obtained. We also propose a novel structure preserving loss that further ameliorates the quality of the projections. We tested our method on the challenging, realistic, Hollywood2 action recognition dataset where a considerable gain in performance is obtained. We show that the gain is proportional to the number of training samples used to learn the projections.

Scheda breve

Scheda completa

Scheda completa (DC)

	Data di pubblicazione
	
				2016
			
	Titolo del volume degli atti
	
				MM 2016 - Proceedings of the 2016 ACM Multimedia Conference
			
	Titolo del congresso
	
				ACM Multimedia
			
	Luogo del congresso
	
				gbr
			
	Data del congresso
	
				2016
			
	Tutti gli autori
	
						Bruni, Matteo; Uricchio, Tiberio; Seidenari, Lorenzo; Del Bimbo, Alberto
					
	Appare nelle tipologie:
	
				4a - Articolo in atti di congresso

File in questo prodotto:

File	Dimensione	Formato
p645-bruni.pdf Accesso chiuso Tipologia: Pdf editoriale (Version of record) Licenza: Tutti i diritti riservati Dimensione 798.62 kB Formato Adobe PDF Richiedi una copia all'autore	798.62 kB	Adobe PDF	Richiedi una copia all'autore

I documenti in FLORE sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.

Utilizza questo identificatore per citare o creare un link a questa risorsa: https://hdl.handle.net/2158/1065602

Citazioni

ND

4

ND

social impact