Retrieving Actions in Group Contexts

Lan, Tian; Wang, Yang; Mori, Greg; Robinovitch, Stephen N.

doi:10.1007/978-3-642-35749-7_14

Tian Lan¹⁷,
Yang Wang¹⁷,
Greg Mori¹⁷ &
…
Stephen N. Robinovitch¹⁸

Part of the book series: Lecture Notes in Computer Science ((LNIP,volume 6553))

Included in the following conference series:

European Conference on Computer Vision

1852 Accesses
11 Citations

Abstract

We develop methods for action retrieval from surveillance video using contextual feature representations. The novelty of our proposed approach is two-fold. First, we introduce a new feature representation called the action context (AC) descriptor. The AC descriptor encodes information about not only the action of an individual person in the video, but also the behaviour of other people nearby. This feature representation is inspired by the fact that the context of what other people are doing provides very useful cues for recognizing the actions of each individual. Second, we formulate our problem as a retrieval/ranking task, which is different from previous work on action classification. We develop an action retrieval technique based on rank-SVM, a state-of-the-art approach for solving ranking problems. We apply our proposed approach on two real-world datasets. The first dataset consists of videos of multiple people performing several group activities. The second dataset consists of surveillance videos from a nursing home environment. Our experimental results show the advantage of using contextual information for disambiguating different actions and the benefit of using rank-SVMs instead of regular SVMs for video retrieval problems.

Download to read the full chapter text

Chapter PDF

Improving Human Action Recognition Using Score Distribution and Ranking

Discriminative Dictionary Design for Action Classification in Still Images and Videos

Article 03 March 2021

Motion pattern based representation for improving human action retrieval

Article 12 March 2018

Keywords

These keywords were added by machine and not by the authors. This process is experimental and the keywords may be updated as the learning algorithm improves.

References

Wang, Y., Mori, G.: Human action recognition by semi-latent topic models. IEEE Trans. PAMI 31, 1762–1774 (2009)
Article Google Scholar
Wang, X., Ma, X., Grimson, E.: Unsupervised activity perception in crowded and complicated scenes using hierarchical bayesian models. IEEE Trans. PAMI 31, 539–555 (2009)
Article Google Scholar
Loy, C.C., Xiang, T., Gong, S.: Modelling activity global temporal dependencies using time delayed probabilistic graphical model. In: ICCV (2009)
Google Scholar
Marszalek, M., Laptev, I., Schmid, C.: Actions in context. In: CVPR (2009)
Google Scholar
Han, D., Bo, L., Sminchisescu, C.: Selection and context for action recognition. In: IEEE International Conference on Computer Vision (2009)
Google Scholar
Xiang, T., Gong, S.: Beyond tracking: Modelling activity and understanding behaviour. Int. Journal of Computer Vision 67, 21–51 (2006)
Article Google Scholar
Gupta, A., Srinivasan, P., Shi, J., Davis, L.S.: Understanding videos, constructing plots - learning a visually grounded storyline model from annotated videos. In: CVPR (2009)
Google Scholar
Zhong, H., Shi, J., Visontai, M.: Detecting unusual activity in video. In: Proc. IEEE Comput. Soc. Conf. Comput. Vision and Pattern Recogn. (2004)
Google Scholar
Mehran, R., Oyama, A., Shah, M.: Abnormal crowd behavior detection using social force model. In: CVPR (2009)
Google Scholar
Choi, W., Shahid, K., Savarese, S.: What are they doing?: Collective activity classification using spatio-temporal relationship among people. In: VS (2009)
Google Scholar
Joachims, T.: Optimizing search engines using clickthrough data. In: ACM SIGKDD (2002)
Google Scholar
Felzenszwalb, P., McAllester, D., Ramanan, D.: A discriminatively trained, multiscale, deformable part model. In: CVPR (2008)
Google Scholar
Dalal, N., Triggs, B.: Histogram of oriented gradients for human detection. In: CVPR (2005)
Google Scholar
Schuldt, C., Laptev, I., Caputo, B.: Recognizing human actions: A local svm approach. In: 17th International Conference on Pattern Recognition (2004)
Google Scholar
Blank, M., Gorelick, L., Shechtman, E., Irani, M., Basri, R.: Actions as space-time shapes. In: Proc. 10th Int. Conf. Computer Vision (2005)
Google Scholar
Niebles, J.C., Wang, H., Fei-Fei, L.: Unsupervised learning of human action categories using spatial-temporal words. In: BMVC (2006)
Google Scholar
Laptev, I., Marszalek, M., Schmid, C., Rozenfeld, B.: Learning realistic human actions from movies. In: CVPR (2008)
Google Scholar
Joachims, T.: A support vector method for multivariate performance measures. In: International Conference on Machine Learning (2005)
Google Scholar
Blank, M., Gorelick, L., Shechtman, E., Irani, M., Basri, R.: Actions as space-time shapes. In: ICCV (2005)
Google Scholar
Chapelle, O., Le, Q., Smola, A.: Large margin optimization of ranking measures. In: NIPS Workshop on Learning to Rank (2007)
Google Scholar

Download references

Author information

Authors and Affiliations

School of Computing Science, Simon Fraser University, Canada
Tian Lan, Yang Wang & Greg Mori
School of Kinesiology, Simon Fraser University, Canada
Stephen N. Robinovitch

Authors

Tian Lan
View author publications
You can also search for this author in PubMed Google Scholar
Yang Wang
View author publications
You can also search for this author in PubMed Google Scholar
Greg Mori
View author publications
You can also search for this author in PubMed Google Scholar
Stephen N. Robinovitch
View author publications
You can also search for this author in PubMed Google Scholar

Editor information

Editors and Affiliations

Department of Computer Science, University of Toronto, 10 King’s College Road, ON M5S 3G4, Toronto, Canada
Kiriakos N. Kutulakos

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Lan, T., Wang, Y., Mori, G., Robinovitch, S.N. (2012). Retrieving Actions in Group Contexts. In: Kutulakos, K.N. (eds) Trends and Topics in Computer Vision. ECCV 2010. Lecture Notes in Computer Science, vol 6553. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-35749-7_14

Download citation

DOI: https://doi.org/10.1007/978-3-642-35749-7_14
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-642-35748-0
Online ISBN: 978-3-642-35749-7
eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics

Retrieving Actions in Group Contexts

Abstract

Chapter PDF

Similar content being viewed by others

Improving Human Action Recognition Using Score Distribution and Ranking

Discriminative Dictionary Design for Action Classification in Still Images and Videos

Motion pattern based representation for improving human action retrieval

Keywords

References

Author information

Authors and Affiliations

Editor information

Editors and Affiliations

Rights and permissions

Copyright information

About this paper

Cite this paper

Download citation

Publish with us

Navigation

Retrieving Actions in Group Contexts

Abstract

Chapter PDF

Similar content being viewed by others

Improving Human Action Recognition Using Score Distribution and Ranking

Discriminative Dictionary Design for Action Classification in Still Images and Videos

Motion pattern based representation for improving human action retrieval

Keywords

References

Author information

Authors and Affiliations

Editor information

Editors and Affiliations

Rights and permissions

Copyright information

About this paper

Cite this paper

Download citation

Share this paper

Publish with us

Search

Navigation