Speaker Tracking in Seminars by Human Body Detection
This paper presents evaluation results of a method for tracking speakers in seminars from multiple cameras. First, 2D human tracking and detection is done for each view. Then, 2D locations are converted to 3D based on the calibration parameters. Finally, cues from multiple cameras are integrated in a incremental way to refine the trajectories. We have developed two multi-view integration methods, which are evaluated and compared on the CHIL speaker tracking test set.
Unable to display preview. Download preview PDF.
- 1.Wu, B., Nevatia, R.: Detection of Multiple, Partially Occluded Humans in a Single Image by Bayesian Combination of Edgelet Part Detectors. In: ICCV’05, vol. 1, pp. 90–97 (2005)Google Scholar
- 2.Papageorgiou, C., Evgeniou, T., Poggio, T.: A Trainable Pedestrian Detection System. In: Proc. of Intelligent Vehicles, pp. 241–246 (1998)Google Scholar
- 3.Wu, B., Nevatia, R.: Tracking of Multiple, Partially Occluded Humans based on Static Body Part Detection. In: CVPR’06 (to appear, 2006)Google Scholar
- 4.Comaniciu, D., Ramesh, V., Meer, P.: The Variable Bandwidth Mean Shift and Data-Driven Scale Selection. In: ICCV’01, vol. 1, pp. 438–445 (2001)Google Scholar