Spatio-Temporal Closed-Loop Object Detection

Object detection is one of the most important tasks of computer vision. It is usually performed by evaluating a subset of the possible locations of an image that are more likely to contain the object of interest. Exhaustive approaches have now been superseded by object proposal methods. The interplay of detectors and proposal algorithms has not been fully analyzed and exploited up to now, although this is a very relevant problem for object detection in video sequences. We propose to connect, in a closed-loop, detectors and object proposal generator functions exploiting the ordered and continuous nature of video sequences. Different from tracking we only require a previous frame to improve both proposal and detection: no prediction based on local motion is performed, thus avoiding tracking errors. We obtain 3 to 4 points of improvement in mAP and a detection time that is lower than Faster R-CNN, which is the fastest CNN based generic object detector known at the moment.

Spatio-Temporal Closed-Loop Object Detection / Galteri, Leonardo; Seidenari, Lorenzo; Bertini, Marco; Del Bimbo, Alberto. - In: IEEE TRANSACTIONS ON IMAGE PROCESSING. - ISSN 1057-7149. - ELETTRONICO. - (2017), pp. 0-0. [10.1109/TIP.2017.2651367]

Spatio-Temporal Closed-Loop Object Detection

GALTERI, LEONARDO;SEIDENARI, LORENZO;BERTINI, MARCO;DEL BIMBO, ALBERTO

2017

Abstract

Object detection is one of the most important tasks of computer vision. It is usually performed by evaluating a subset of the possible locations of an image that are more likely to contain the object of interest. Exhaustive approaches have now been superseded by object proposal methods. The interplay of detectors and proposal algorithms has not been fully analyzed and exploited up to now, although this is a very relevant problem for object detection in video sequences. We propose to connect, in a closed-loop, detectors and object proposal generator functions exploiting the ordered and continuous nature of video sequences. Different from tracking we only require a previous frame to improve both proposal and detection: no prediction based on local motion is performed, thus avoiding tracking errors. We obtain 3 to 4 points of improvement in mAP and a detection time that is lower than Faster R-CNN, which is the fastest CNN based generic object detector known at the moment.

Scheda breve

Scheda completa

Scheda completa (DC)

	Data di pubblicazione
	
				2017
			
	Rivista
	
				IEEE TRANSACTIONS ON IMAGE PROCESSING
			
	Pagina iniziale
	
				0
			
	Pagina finale
	
				0
			
	Tutti gli autori
	
						Galteri, Leonardo; Seidenari, Lorenzo; Bertini, Marco; Del Bimbo, Alberto
					
	Appare nelle tipologie:
	
				1a - Articolo su rivista

File in questo prodotto:

File	Dimensione	Formato
J1-TIP17.pdf accesso aperto Tipologia: Versione finale referata (Postprint, Accepted manuscript) Licenza: Tutti i diritti riservati Dimensione 1.96 MB Formato Adobe PDF	1.96 MB	Adobe PDF
Spatio-Temporal_Closed-Loop_Object_Detection.pdf Accesso chiuso Tipologia: Pdf editoriale (Version of record) Licenza: Tutti i diritti riservati Dimensione 4.33 MB Formato Adobe PDF Richiedi una copia	4.33 MB	Adobe PDF	Richiedi una copia

I documenti in FLORE sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.

Utilizza questo identificatore per citare o creare un link a questa risorsa: https://hdl.handle.net/2158/1071440

Citazioni

ND

32

27

social impact