Abstract
A fundamental task for a distributed multi-camera system is to associate people across camera views at different locations and times. In a crowded and uncontrolled environment observed by cameras from a distance, person re-identification by biometrics such as face and gait is infeasible due to insufficient image details and arbitrary viewing conditions. Visual appearance features, extracted mainly from clothing, are intrinsically weak for matching people. For instance, most people in public spaces wear dark clothes in winter. A person’s appearance can also change significantly between different camera views if large changes occur in view angle, lighting, background clutter and occlusion. This results in different people appearing more alike than that of the same person across different camera views. In this chapter, we describe a method for learning the optimal matching distance criterion, regardless feature representation. This approach to person re-identification shifts the burden of computation from finding some universally optimal imagery features to discovering a matching mechanism for selecting adaptively different features that are locally optimal for each and every pairs of matches. Moreover, behaviour correlations hold useful spatio-temporal contextual information about expectations on where and when a person may re-appear in a networked visible space. This information is utilised for improving matching accuracy through context-aware search.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Notes
- 1.
A probe image is an image of unknown identity.
- 2.
A gallery image is an image of known identity.
- 3.
A ‘probe’ is sometimes also referred to as a ‘query’.
- 4.
There were around 10% correct matches obtained from a gallery size of 532 people in the experiments carried out by Prosser et al. (2010).
References
Chapelle, O., Keerthi, S.: Efficient algorithms for ranking with SVMs. Inf. Retr. 13(3), 201–215 (2010)
Farenzena, M., Bazzani, L., Perina, A., Murino, V., Cristani, M.: Person re-identification by symmetry-driven accumulation of local features. In: IEEE Conference on Computer Vision and Pattern Recognition, San Francisco, USA, June 2010, pp. 2360–2367 (2010)
Freund, Y., Iyer, R., Schapire, R.E., Singer, Y.: An efficient boosting algorithm for combining preferences. J. Mach. Learn. Res. 4, 933–969 (2003)
Gheissari, N., Sebastian, T.B., Rittscher, J., Hartley, R.: Person reidentification using spatiotemporal appearance. In: IEEE Conference on Computer Vision and Pattern Recognition, New York, USA, June 2006, pp. 1528–1535 (2006)
Gilbert, A., Bowden, R.: Tracking objects across cameras by incrementally learning inter-camera colour calibration and patterns of activity. In: European Conference on Computer Vision, Graz, Austria, pp. 125–136 (2006)
Gray, D., Tao, H.: Viewpoint invariant pedestrian recognition with an ensemble of localized features. In: European Conference on Computer Vision, Marseille, France, October 2008, pp. 262–275 (2008)
Hahnel, M., Klunder, D., Kraiss, K.F.: Color and texture features for person recognition. In: IEEE International Joint Conference on Neural Networks, pp. 647–652 (2004)
Javed, O., Shafique, K., Shah, M.: Appearance modeling for tracking in multiple non-overlapping cameras. In: IEEE Conference on Computer Vision and Pattern Recognition, pp. 26–33 (2005)
Joachims, T.: Optimizing search engines using clickthrough data. In: ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, pp. 133–142 (2002)
Loy, C.C., Xiang, T., Gong, S.: Time-delayed correlation analysis for multi-camera activity understanding. Int. J. Comput. Vis. 90(1), 106–129 (2010)
Madden, C., Cheng, E.D., Piccardi, M.: Tracking people across disjoint camera views by an illumination-tolerant appearance representation. Mach. Vis. Appl. 18(3), 233–247 (2007)
Prosser, B., Gong, S., Xiang, T.: Multi-camera matching using bi-directional cumulative brightness transfer functions. In: British Machine Vision Conference, Leeds, UK, September 2008a
Prosser, B., Gong, S., Xiang, T.: Multi-camera matching under illumination change over time. In: ECCV Workshop on Multi-camera and Multi-modal Sensor Fusion Algorithms and Applications, Marseille, France, October 2008b
Prosser, B., Zheng, W., Gong, S., Xiang, T.: Person re-identification by support vector ranking. In: British Machine Vision Conference, Aberystwyth, UK, September 2010
Schapire, R.E., Singer, Y.: Improved boosting algorithms using confidence-rated predictions. Mach. Learn. 37(3), 297–336 (1999)
Wang, H., Suter, D., Schindler, K.: Effective appearance model and similarity measure for particle filtering and visual tracking. In: European Conference on Computer Vision, Graz, Austria, pp. 606–618 (2006)
Wang, X., Doretto, G., Sebastian, T., Rittscher, J., Tu, P.: Shape and appearance context modeling. In: IEEE International Conference on Computer Vision, Rio de Janeiro, Brazil, October 2007, pp. 1–8 (2007)
Zheng, W., Gong, S., Xiang, T.: Associating groups of people. In: British Machine Vision Conference, London, UK, September 2009
Zheng, W., Gong, S., Xiang, T.: Person re-identification by probabilistic relative distance comparison. In: IEEE Conference on Computer Vision and Pattern Recognition, Colorado Springs, USA, June 2011
Author information
Authors and Affiliations
Corresponding author
Rights and permissions
Copyright information
© 2011 Springer-Verlag London Limited
About this chapter
Cite this chapter
Gong, S., Xiang, T. (2011). Person Re-identification. In: Visual Analysis of Behaviour. Springer, London. https://doi.org/10.1007/978-0-85729-670-2_14
Download citation
DOI: https://doi.org/10.1007/978-0-85729-670-2_14
Publisher Name: Springer, London
Print ISBN: 978-0-85729-669-6
Online ISBN: 978-0-85729-670-2
eBook Packages: Computer ScienceComputer Science (R0)