Key Object Driven Multi-category Object Recognition, Localization and Tracking Using Spatio-temporal Context

Li, Yuan; Nevatia, Ram

doi:10.1007/978-3-540-88693-8_30

Part of the book series: Lecture Notes in Computer Science (LNIP, volume 5305)

Included in the following conference series:

European Conference on Computer Vision

Abstract

In this paper we address the problem of recognizing, localizing and tracking multiple objects of different categories in meeting room videos. Difficulties such as lack of detail and multi-object co-occurrence make it hard to apply traditional object recognition methods directly. Under such circumstances, we show that incorporating object-level spatio-temporal relationships can lead to significant improvements in the inference of object category and state. Contextual relationships are modeled by a dynamic Markov random field, in which recognition, localization and tracking are done simultaneously. Further, we define the human as the key object of the scene: humans can be detected relatively robustly and are therefore used to guide the inference of other objects. Experiments are done on the CHIL meeting video corpus. Performance is evaluated in terms of object detection and false alarm rates, the object recognition confusion matrix, and the pixel-level accuracy of object segmentation.
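
To make the idea of key-object-driven context concrete, the following is a minimal sketch and not the authors' model. It replaces the dynamic Markov random field with a much simpler per-object forward filter: at each frame, a weak appearance score is combined with a compatibility term that depends on the state of a nearby human, and the belief is carried forward in time. The category names, human states, compatibility values and the `stay_prob` parameter are all assumptions invented for illustration.

```python
def normalize(scores):
    """Scale non-negative scores so they sum to 1."""
    total = sum(scores.values())
    return {c: s / total for c, s in scores.items()}


def filter_categories(frames, compatibility, stay_prob=0.9):
    """Forward-filter one object's category belief over a sequence of frames.

    frames        : list of (appearance, human_state) pairs, where appearance
                    maps each category to a likelihood from a weak detector
    compatibility : maps (human_state, category) to a context weight
    stay_prob     : assumed probability that the category label persists
    """
    categories = list(frames[0][0])
    belief = {c: 1.0 / len(categories) for c in categories}
    switch_prob = (1.0 - stay_prob) / (len(categories) - 1)
    history = []
    for appearance, human_state in frames:
        # Temporal step: the label mostly persists from the previous frame.
        predicted = {
            c: sum(belief[p] * (stay_prob if p == c else switch_prob)
                   for p in categories)
            for c in categories
        }
        # Context step: weight appearance by compatibility with the human.
        belief = normalize({
            c: predicted[c] * appearance[c] * compatibility[(human_state, c)]
            for c in categories
        })
        history.append(belief)
    return history


# Hypothetical values: appearance alone cannot separate "cup" from "laptop".
ambiguous = {"cup": 0.4, "laptop": 0.4, "whiteboard": 0.2}
compatibility = {
    ("typing", "cup"): 0.2,
    ("typing", "laptop"): 0.7,
    ("typing", "whiteboard"): 0.1,
}
history = filter_categories([(ambiguous, "typing")] * 3, compatibility)
best = max(history[-1], key=history[-1].get)
```

With these made-up numbers, appearance alone leaves "cup" and "laptop" tied, while the human-state term favors "laptop" and the temporal step reinforces that choice across frames. A full dynamic MRF would instead infer the categories and states of all objects jointly rather than one object at a time.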

Author information

Authors and Affiliations

Institute for Robotics and Intelligent Systems, University of Southern California, Los Angeles, CA, USA
Yuan Li & Ram Nevatia

Editor information

Editors and Affiliations

Computer Science Department, University of Illinois at Urbana Champaign, 3310 Siebel Hall, IL 61801, Urbana, USA
David Forsyth
Department of Computing, Wheatley, Oxford Brookes University, OX33 1HX, Oxford, UK
Philip Torr
Department of Engineering Science, University of Oxford, Parks Road, OX1 3PJ, Oxford, UK
Andrew Zisserman

About this paper

Cite this paper

Li, Y., Nevatia, R. (2008). Key Object Driven Multi-category Object Recognition, Localization and Tracking Using Spatio-temporal Context. In: Forsyth, D., Torr, P., Zisserman, A. (eds) Computer Vision – ECCV 2008. ECCV 2008. Lecture Notes in Computer Science, vol 5305. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-540-88693-8_30

DOI: https://doi.org/10.1007/978-3-540-88693-8_30
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-88692-1
Online ISBN: 978-3-540-88693-8
eBook Packages: Computer Science, Computer Science (R0)
