In this paper we report on techniques for automatically learning foveal sensing strategies for an active pan-tilt-zoom camera. The approach uses reinforcement learning to discover foveal actions maximizing the performance of visual detectors, that are in turn assumed to be highly correlated with the task at hand. In our case,the main goal is to recognize people, hence a frontal face detection module is employed. The system uses reinforcement learning to learn if when and how to foveate on a subject, basedonits previous experience in terms or successful actions in similar situations. An action is successful if it leads to a correct face detection in the high resolution images obtained when the subject is zoomed in. In contrast with existing methods,the proposed approach obviates the need for camera calibration and camera performance modeling. Also, the method does not rely on active tracking of targets. Experimental results show how the system can be deployed in unconstrained surveillance environments, and is capable of learning foveation strategies without requiring extensive a priori information or environmental models. Results also illustrate how the system effectively learns a strategy that allows the camera to foveate only in situations where successful detection is highly likely.

A reinforcement learning approach to active camera foveation / Bagdanov, Andrew D; Del Bimbo, Alberto; Nunziati, Walter; Pernici, Federico. - STAMPA. - (2006), pp. 179-186. (Intervento presentato al convegno 4th ACM International Workshop on Video Surveillance and Sensor Networks, VSSN'06, co-located with the 2006 ACM International Multimedia Conference tenutosi a Santa Barbara, CA, usa nel 2007) [10.1145/1178782.1178809].

A reinforcement learning approach to active camera foveation

BAGDANOV, ANDREW DAVID;DEL BIMBO, ALBERTO;NUNZIATI, WALTER;PERNICI, FEDERICO
2006

Abstract

In this paper we report on techniques for automatically learning foveal sensing strategies for an active pan-tilt-zoom camera. The approach uses reinforcement learning to discover foveal actions maximizing the performance of visual detectors, that are in turn assumed to be highly correlated with the task at hand. In our case,the main goal is to recognize people, hence a frontal face detection module is employed. The system uses reinforcement learning to learn if when and how to foveate on a subject, basedonits previous experience in terms or successful actions in similar situations. An action is successful if it leads to a correct face detection in the high resolution images obtained when the subject is zoomed in. In contrast with existing methods,the proposed approach obviates the need for camera calibration and camera performance modeling. Also, the method does not rely on active tracking of targets. Experimental results show how the system can be deployed in unconstrained surveillance environments, and is capable of learning foveation strategies without requiring extensive a priori information or environmental models. Results also illustrate how the system effectively learns a strategy that allows the camera to foveate only in situations where successful detection is highly likely.
2006
Proceedings of the ACM International Multimedia Conference and Exhibition
4th ACM International Workshop on Video Surveillance and Sensor Networks, VSSN'06, co-located with the 2006 ACM International Multimedia Conference
Santa Barbara, CA, usa
2007
Bagdanov, Andrew D; Del Bimbo, Alberto; Nunziati, Walter; Pernici, Federico
File in questo prodotto:
Non ci sono file associati a questo prodotto.

I documenti in FLORE sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.

Utilizza questo identificatore per citare o creare un link a questa risorsa: https://hdl.handle.net/2158/1020691
Citazioni
  • ???jsp.display-item.citation.pmc??? ND
  • Scopus 6
  • ???jsp.display-item.citation.isi??? ND
social impact