Tracking Complex Objects Using Graphical Object Models

Sigal, Leonid; Zhu, Ying; Comaniciu, Dorin; Black, Michael

doi:10.1007/978-3-540-69866-1_17

Leonid Sigal¹,
Ying Zhu³,
Dorin Comaniciu² &
…
Michael Black¹

Part of the book series: Lecture Notes in Computer Science ((LNIP,volume 3417))

Included in the following conference series:

International Workshop on Complex Motion

649 Accesses
4 Citations

Abstract

We present a probabilistic framework for component-based automatic detection and tracking of objects in video. We represent objects as spatio-temporal two-layer graphical models, where each node corresponds to an object or component of an object at a given time, and the edges correspond to learned spatial and temporal constraints. Object detection and tracking is formulated as inference over a directed loopy graph, and is solved with non-parametric belief propagation. This type of object model allows object-detection to make use of temporal consistency (over an arbitrarily sized temporal window), and facilitates robust tracking of the object. The two layer structure of the graphical model allows inference over the entire object as well as individual components. AdaBoost detectors are used to define the likelihood and form proposal distributions for components. Proposal distributions provide ‘bottom-up’ information that is incorporated into the inference process, enabling automatic object detection and tracking. We illustrate our method by detecting and tracking two classes of objects, vehicles and pedestrians, in video sequences collected using a single grayscale uncalibrated car-mounted moving camera.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Log in via an institution

Chapter: USD 29.95; Price excludes VAT (USA)

eBook: USD 39.99; Price excludes VAT (USA)

Softcover Book: USD 54.99; Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

References

Black, M.J., Yacoob, Y.: Recognizing facial expressions in image sequences using local parameterized models of image motion. International Journal of Computer Vision 25(1), 23–48 (1997)
Article Google Scholar
Burl, M., Weber, M., Perona, P.: A probabilistic approach to object recognition using local photometry and global geometry. In: ECCV, pp. 628–641 (1998)
Google Scholar
Coughlan, J., Ferreira, S.: Finding deformable shapes using loopy belief propagation. In: ECCV, vol. 3, pp. 453–468 (2002)
Google Scholar
Douce, A., de Freitas, N., Gordon, N.: Sequential Monte Carlo methods in practice. Statistics for Engineering and Information Sciences, pp. 3–14. Springer, Heidelberg (2001)
Google Scholar
Felzenszwalb, P., Huttenlocher, D.: Efficient matching of pictorial structures. CVPR 2, 66–73 (2000)
Google Scholar
Fergus, R., Perona, P., Zisserman, A.: Object class recognition by unsupervised scale-invariant learning. CVPR (2003)
Google Scholar
Ihler, A., Sudderth, E., Freeman, W., Willsky, A.: Efficient multiscale sampling from products of Gaussian mixtures. Advances in Neural Info. Proc. Sys. 16 (2003)
Google Scholar
Isard, M.: Pampas: Real-valued graphical models for computer vision. In: CVPR, vol. 1, pp. 613–620 (2003)
Google Scholar
Jordan, M., Sejnowski, T., Poggio, T.: Graphical models: Foundations of neural computation. MIT Press, Cambridge (2001)
MATH Google Scholar
Mikolajczyk, K., Schmid, C., Zisserman, A.: Human detection based on a probabilistic assembly of robust part detectors. ECCV (2004)
Google Scholar
Mohan, A., Papageorgiou, C., Poggio, T.: Example-based object detection in images by components. IEEE PAMI 23(4), 349–361 (2001)
Google Scholar
Murphy, K., Torralba, A., Freeman, W.: Using the forest to see the trees: A graphical model relating features, objects, and scenes. Advances in Neural Info. Proc. Sys. 16 (2003)
Google Scholar
Okuma, K., Teleghani, A., de Freitas, N., Little, J., Lowe, D.: A boosted particle filter: Multitarget detection and tracking. ECCV (2004)
Google Scholar
Sigal, L., Bhatia, S., Roth, S., Black, M., Isard, M.: Tracking loose-limbed people. CVPR (2004)
Google Scholar
Sudderth, E., Ihler, A., Freeman, W., Willsky, A.: Nonparametric belief propagation (see also MIT AI Lab Memo 2002-020). In: CVPR, vol. 1, pp. 605–612. MIT Press, Cambridge (2003)
Google Scholar
Viola, P., Jones, M.: Rapid object detection using a boosted cascade of simple features. CVPR (2001)
Google Scholar
Viola, P., Jones, M., Snow, D.: Detecting pedestrians using patterns of motion and appearance. In: ICCV, pp. 734–741 (2003)
Google Scholar
Xie, B., Comaniciu, D., Ramesh, V., Simon, M., Boult, T.: Component fusion for face detection in the presence of heteroscedastic noise. In: Annual Conf. of the German Society for Pattern Recognition (DAGM’03), pp. 434–441 (2003)
Google Scholar

Download references

Author information

Authors and Affiliations

Department of Computer Science, Brown University, Providence, RI 02912,
Leonid Sigal & Michael Black
Integrated Data Systems, Siemens Corporate Research, Princeton, NJ 08540,
Dorin Comaniciu
Real Time Vision & Modeling, Siemens Corporate Research, Princeton, NJ 08540,
Ying Zhu

Authors

Leonid Sigal
View author publications
You can also search for this author in PubMed Google Scholar
Ying Zhu
View author publications
You can also search for this author in PubMed Google Scholar
Dorin Comaniciu
View author publications
You can also search for this author in PubMed Google Scholar
Michael Black
View author publications
You can also search for this author in PubMed Google Scholar

Editor information

Bernd Jähne Rudolf Mester Erhardt Barth Hanno Scharr

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Sigal, L., Zhu, Y., Comaniciu, D., Black, M. (2007). Tracking Complex Objects Using Graphical Object Models. In: Jähne, B., Mester, R., Barth, E., Scharr, H. (eds) Complex Motion. IWCM 2004. Lecture Notes in Computer Science, vol 3417. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-540-69866-1_17

Download citation

DOI: https://doi.org/10.1007/978-3-540-69866-1_17
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-69864-7
Online ISBN: 978-3-540-69866-1
eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics