Fusion of Motion and Appearance for Robust People Detection in Cluttered Scenes

Zhang, Jianguo; Gong, Shaogang

doi:10.1007/978-3-642-17554-1_6

Fusion of Motion and Appearance for Robust People Detection in Cluttered Scenes

Jianguo Zhang⁶ &
Shaogang Gong⁷

Chapter

865 Accesses

Part of the book series: Studies in Computational Intelligence ((SCI,volume 332))

Abstract

Robust detection of people in video is critical in visual surveillance. In this work we present a framework for robust people detection in highly cluttered scenes with low resolution image sequences. Our model utilises both human appearance and their long-term motion information through a fusion formulated in a Bayesian framework. In particular, we introduce a spatial pyramid Gaussian Mixture approach to model variations of long-term human motion information, which is computed via an improved background modeling using spatial motion constrains. Simultaneously, people appearance is modeled by histograms of oriented gradients. Experiments demonstrate that our method reduces significantly false positive rate compared to that of a state of the art human detector under very challenging lighting condition, occlusion and background clutter.

This is a preview of subscription content, log in via an institution.

Chapter: USD 29.95; Price excludes VAT (USA)

eBook: USD 84.99; Price excludes VAT (USA)

Softcover Book: USD 109.99; Price excludes VAT (USA)

Hardcover Book: USD 109.99; Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Learn about institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

References

Boiman, O., Irani, M.: Detecting irregularities in images and in video. In: International Conference on Computer Vision, pp. 462–469 (2005)
Google Scholar
Cutler, R., Davis, L.: Robust real-time periodic motion detection: Analysis and applications. IEEE Transactions on Pattern Analysis and Machine Intelligence 22(8), 781–796 (2000)
Article Google Scholar
Dalal, N., Triggs, B.: Histograms of oriented gradients for human detection. In: International Conference on Computer Vision & Pattern Recognition, vol. 2, pp. 886–893 (2005)
Google Scholar
Dalal, N., Triggs, B., Schmid, C.: Human detection using oriented histograms of flow and appearance. In: Leonardis, A., Bischof, H., Pinz, A. (eds.) ECCV 2006. LNCS, vol. 3952, pp. 428–441. Springer, Heidelberg (2006)
Chapter Google Scholar
Gautama, T., Van Hulle, M.: A phase-based approach to the estimation of the optical flow field using spatial filtering. IEEE Transactions on Neural Networks 13, 1127–1136 (2002)
Article Google Scholar
Gavrila, D., Philomin, V.: Real-time object detection for “smart“ vehicles. In: IEEE Conference on Computer Vision and Pattern Recognition, pp. 87–93 (1999)
Google Scholar
Gavrila, D.M.: The visual analysis of human movement: A survey. Computer Vision and Image Understanding 73(1), 82–98 (1999)
Article MATH Google Scholar
Grauman, K., Darrell, T.: Pyramid match kernel: Discriminative classification with sets of image features. In: International Conference on Computer Vision, pp. 1458–1465 (2005)
Google Scholar
Hoffman, D.D., Flinchbaugh, B.E.: The interpretation of biological motion. Biological Cybernetics 42, 195–204 (1982)
MATH Google Scholar
Horn, B.K.P., Schunck, B.G.: ”determining optical flow”: A retrospective. Artifical Intelligence 59(1-2), 81–87 (1993)
Article Google Scholar
Johansson, G.: Visual perception of biological motion and a model for its analysis. Perception and Psychophysics 14, 201–211 (1973)
Article Google Scholar
Ke, Y., Sukthankar, R., Hebert, M.: Efficient visual event detection using volumetric features. In: International Conference on Computer Vision, pp. 166–173 (2005)
Google Scholar
Laptev, I.: On space-time interest points. International Journal of Computer Vision 64(2), 107–123 (2005)
Article MathSciNet Google Scholar
Lazebnik, S., Schmid, C., Ponce, J.: Beyond bags of features: Spatial pyramid matching for recognizing natural scene categories. In: IEEE Conference on Computer Vision and Pattern Recognition, pp. 2169–2178 (2006)
Google Scholar
de Lima, C., Alcaim, A., Apolinario, J.J.: On the use of pca in gmm and ar-vector models for text independent speaker verification. In: International Conference on Digital Signal Processing, vol. 2 (2002)
Google Scholar
Lowe, D.G.: Distinctive image features from scale-invariant keypoints. International Journal of Computer Vision 60(2), 91–110 (2004)
Article Google Scholar
Papageorgiou, C., Poggio, T.: A trainable system for object detection. International Journal of Computer Vision 38(1), 15–33 (2000)
Article MATH Google Scholar
Proesmans, M., Gool, L.J.V., Pauwels, E.J., Oosterlinck, A.: Determination of optical flow and its discontinuities using non-linear diffusion. In: Eklundh, J.-O. (ed.) ECCV 1994. LNCS, vol. 801, pp. 295–304. Springer, Heidelberg (1994)
Google Scholar
Rissanen, J.: A universal prior for integers and estimation by minimum description length. The Annals of Statistics, 416–431 (1983)
Google Scholar
Sankaranarayanan, A., Chellappa, R., Zheng, Q.: Tracking objects in video using motion and appearance models. In: IEEE International Conference on Image Processing, vol. 2, pp. 394–397 (2005)
Google Scholar
Schiele, B., Crowley, J.: Recognition without correspondence using multidimensional receptive field histograms. International Journal of Computer Vision 36(1), 31–50 (2000)
Article Google Scholar
Schuldt, C., Laptev, I., Caputo, B.: Recognizing human actions: A local svm approach. In: International Conference on Pattern Recognition, Cambridge, UK, pp. 32–36 (2004)
Google Scholar
Viola, P., Jones, M.J., Snow, D.: Detecting pedestrians using patterns of motion and appearance. International Journal of Computer Vision 63(2), 153–161 (2005)
Article Google Scholar
Xiang, T., Gong, S.: Beyond tracking: Modelling activity and understanding behaviour. International Journal of Computer Vision 67(1), 21–51 (2006)
Article Google Scholar
Zhang, J., Gong, S.: Beyond static detectors: A bayesian approach to fusing long-term motion with appearance for robust people detection in highly cluttered scenes. In: IEEE Workshop on Visual Surveillance in conjunction with ECCV 2006, Graz, pp. 121–128 (2006)
Google Scholar

Download references

Author information

Authors and Affiliations

School of Electronics, Electrical Engineering and Computer Science, Queen’s University Belfast, BT7 1NN, UK
Jianguo Zhang
School of Electronic Engineering and Computer Science, Queen Mary University of London, London, E1 4NS, UK
Shaogang Gong

Authors

Jianguo Zhang
View author publications
You can also search for this author in PubMed Google Scholar
Shaogang Gong
View author publications
You can also search for this author in PubMed Google Scholar

Editor information

Editors and Affiliations

School of Computing, University of Dundee , DD1 4HN, Dundee, Scotland, UK
Jianguo Zhang
Department of Electronic & Electrical Engineering, The University of Sheffield, S1 3JD, Sheffield, UK
Ling Shao
Microsoft Research Asia , 49 Zhichun Road, 100190, Beijing, P.R. China
Lei Zhang
Digital Imaging Research Centre, Faculty of Computing, Information Systems and Mathematics, Kingston University, Penrhyn Road, Kingston upon Thames, KT1 2EE, Surrey, UK
Graeme A. Jones

Rights and permissions

Reprints and permissions

Copyright information

About this chapter

Cite this chapter

Zhang, J., Gong, S. (2011). Fusion of Motion and Appearance for Robust People Detection in Cluttered Scenes. In: Zhang, J., Shao, L., Zhang, L., Jones, G.A. (eds) Intelligent Video Event Analysis and Understanding. Studies in Computational Intelligence, vol 332. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-17554-1_6

Download citation

DOI: https://doi.org/10.1007/978-3-642-17554-1_6
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-642-17553-4
Online ISBN: 978-3-642-17554-1
eBook Packages: EngineeringEngineering (R0)

Publish with us

Policies and ethics