Activities as Time Series of Human Postures

Brendel, William; Todorovic, Sinisa

doi:10.1007/978-3-642-15552-9_52

Part of the book series: Lecture Notes in Computer Science (LNIP, volume 6312)

Included in the following conference series:

European Conference on Computer Vision

Abstract

This paper presents an exemplar-based approach to detecting and localizing human actions, such as running, cycling, and swinging, in realistic videos with dynamic backgrounds. We show that such activities can be compactly represented as time series of a few snapshots of human-body parts in their most discriminative postures, relative to other activity classes. This enables our approach to efficiently store multiple diverse exemplars per activity class, and quickly retrieve exemplars that best match the query by aligning their short time-series representations. Given a set of example videos of all activity classes, we extract multiscale regions from all their frames, and then learn a sparse dictionary of the most discriminative regions. The Viterbi algorithm is then used to track detections of the learned codewords across frames of each video, resulting in their compact time-series representations. Dictionary learning is cast within the large-margin framework, wherein we study the effects of ℓ₁ and ℓ₂ regularization on the sparseness of the resulting dictionaries. Our experiments demonstrate the robustness and scalability of our approach on challenging YouTube videos.
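
The tracking step named in the abstract can be illustrated with a minimal sketch of the Viterbi algorithm. The abstract does not state how detections or transitions are scored, so everything specific here is an assumption made for illustration: the function name `viterbi_track`, the per-frame detection scores (`unary`), the smoothness term (`pairwise`, taken as a negative distance between hypothetical 1-D candidate positions), and the toy numbers. It is not the authors' implementation.

```python
import numpy as np

def viterbi_track(unary, pairwise):
    """Pick one candidate per frame so the summed score is maximal.

    unary[t][i]       : assumed detection score of candidate i in frame t
    pairwise[t][i, j] : assumed transition score from candidate i in
                        frame t to candidate j in frame t + 1
    Returns (path, best_score), where path[t] is the chosen candidate index.
    """
    score = np.asarray(unary[0], dtype=float)
    backptr = []
    for t in range(1, len(unary)):
        # total[i, j]: best score ending at i in frame t-1, then moving to j
        total = score[:, None] + np.asarray(pairwise[t - 1], dtype=float)
        backptr.append(total.argmax(axis=0))
        score = total.max(axis=0) + np.asarray(unary[t], dtype=float)
    # Backtrack from the best final candidate
    path = [int(score.argmax())]
    for bp in reversed(backptr):
        path.append(int(bp[path[-1]]))
    path.reverse()
    return path, float(score.max())

# Toy data (hypothetical): 3 frames, 2 candidate detections per frame.
positions = [np.array([0.0, 10.0]), np.array([1.0, 9.0]), np.array([2.0, 8.0])]
unary = [np.array([0.9, 0.8]), np.array([0.2, 0.9]), np.array([0.9, 0.4])]
# Penalize large jumps between consecutive frames.
pairwise = [-0.5 * np.abs(positions[t][:, None] - positions[t + 1][None, :])
            for t in range(len(positions) - 1)]

path, best = viterbi_track(unary, pairwise)
print(path, best)
```

In this toy input, picking the highest-scoring detection in each frame independently would jump between the two candidates (indices 0, 1, 0), whereas the Viterbi path stays on the second candidate throughout because the transition penalty outweighs the per-frame gain.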

Author information

Authors and Affiliations

Kelley Engineering Center, Oregon State University, Corvallis, OR, 97331, USA
William Brendel & Sinisa Todorovic

Editor information

Editors and Affiliations

GRASP Laboratory, University of Pennsylvania, 3330 Walnut Street, 19104, Philadelphia, PA, USA
Kostas Daniilidis
School of Electrical and Computer Engineering, National Technical University of Athens, 15773, Athens, Greece
Petros Maragos
Department of Applied Mathematics, Ecole Centrale de Paris, Grande Voie des Vignes, 92295, Chatenay-Malabry, France
Nikos Paragios

Cite this paper

Brendel, W., Todorovic, S. (2010). Activities as Time Series of Human Postures. In: Daniilidis, K., Maragos, P., Paragios, N. (eds) Computer Vision – ECCV 2010. ECCV 2010. Lecture Notes in Computer Science, vol 6312. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-15552-9_52

DOI: https://doi.org/10.1007/978-3-642-15552-9_52
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-642-15551-2
Online ISBN: 978-3-642-15552-9
eBook Packages: Computer Science, Computer Science (R0)
