We present a vision system for real-time 3D tracking of multiple people moving over an extended area, as seen from a rotating and zooming camera. Despite the general problems of multiple target tracking (MTT), the use of a pan-tilt-zoom (PTZ) camera adds several difficulties for the multiplicity of connected problems, Our approach exploits multi-view image matching techniques to index and refine, at runtime, the closest world to image homography for the current view. This is made possible by applying (in a batch phase) bundle adjustment method over a set of distinctive visual landmarks extracted from the field of regard of the zooming camera sensor. The approach is experimentally evaluated on several difficult video sequences. Quantitative results show that the proposed approach makes it possible to deliver stable tracking performance in scenes of previously infeasible complexity. We achieve an almost constant standard deviation error of less than 0.3 meters in recovering 3D trajectories of multiple moving targets in an area of 70x15 meters. ©2009 IEEE.
Scale Invariant 3D Multi-Person Tracking using a Base Set of Bundle Adjusted Visual Landmarks / Alberto Del Bimbo; Giuseppe Lisanti; Federico Pernici. - ELETTRONICO. - (2009), pp. 1121-1128. (Intervento presentato al convegno Ninth IEEE International Workshop on Visual Surveillance 2009 - 12th International Conference on Computer Vision tenutosi a Kyoto (Japan) nel 27/09/2009-04/10/2009) [10.1109/ICCVW.2009.5457579].
Scale Invariant 3D Multi-Person Tracking using a Base Set of Bundle Adjusted Visual Landmarks
DEL BIMBO, ALBERTO;LISANTI, GIUSEPPE;PERNICI, FEDERICO
2009
Abstract
We present a vision system for real-time 3D tracking of multiple people moving over an extended area, as seen from a rotating and zooming camera. Despite the general problems of multiple target tracking (MTT), the use of a pan-tilt-zoom (PTZ) camera adds several difficulties for the multiplicity of connected problems, Our approach exploits multi-view image matching techniques to index and refine, at runtime, the closest world to image homography for the current view. This is made possible by applying (in a batch phase) bundle adjustment method over a set of distinctive visual landmarks extracted from the field of regard of the zooming camera sensor. The approach is experimentally evaluated on several difficult video sequences. Quantitative results show that the proposed approach makes it possible to deliver stable tracking performance in scenes of previously infeasible complexity. We achieve an almost constant standard deviation error of less than 0.3 meters in recovering 3D trajectories of multiple moving targets in an area of 70x15 meters. ©2009 IEEE.File | Dimensione | Formato | |
---|---|---|---|
DLP09.pdf
Accesso chiuso
Tipologia:
Versione finale referata (Postprint, Accepted manuscript)
Licenza:
Tutti i diritti riservati
Dimensione
2.29 MB
Formato
Adobe PDF
|
2.29 MB | Adobe PDF | Richiedi una copia |
I documenti in FLORE sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.