Benchmarking Datasets for Human Activity Recognition

Liu, Haowei; Feris, Rogerio; Sun, Ming-Ting

doi:10.1007/978-0-85729-997-0_20

Haowei Liu⁵,
Rogerio Feris⁶ &
Ming-Ting Sun⁵

3420 Accesses
6 Citations

Abstract

Recognizing human activities has become an important topic in the past few years. A variety of techniques for representing and modeling different human activities have been proposed, achieving reasonable performances in many scenarios. On the other hand, different benchmarks have also been collected and published. Different from other chapters focusing on the algorithmic aspects, this chapter gives an overview of different benchmarking datasets, summarizes the performances of the-state-of-the-art algorithms, and analyzes these datasets.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Log in via an institution

Chapter: USD 29.95; Price excludes VAT (USA)

eBook: USD 129.00; Price excludes VAT (USA)

Softcover Book: USD 169.99; Price excludes VAT (USA)

Hardcover Book: USD 169.99; Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

References

Ali, S., Shah, M.: Human action recognition in videos using kinematic features and multiple instance learning. IEEE Trans. Pattern Anal. Mach. Intell. 32(2), 288–303 (2010)
Article Google Scholar
Bregonzio, M., Gong, S., Xiang, T.: Recognising action as clouds of space–time interest points. In: IEEE International Conference on Computer Vision and Pattern Recognition (CVPR) (2009)
Google Scholar
Brendel, W., Todorovic, S.: Activities as time series of human postures. In: IEEE European Conference on Computer Vision (ECCV) (2010)
Google Scholar
Cao, L., Liu, Z., Huang, T.: Cross-dataset action detection. In: IEEE International Conference on Computer Vision and Pattern Recognition (CVPR) (2010)
Google Scholar
Chaudhry, R., Ravichandran, A., Hager, G., Vidal, R.: Histograms of oriented optical flow and Binet–Cauchy kernels on nonlinear dynamical systems for the recognition of human actions. In: IEEE International Conference on Computer Vision and Pattern Recognition (CVPR) (2009)
Google Scholar
Christensen, H., Phillips, J.: Empirical Evaluation Methods in Computer Vision. World Scientific, Singapore (2002)
Book MATH Google Scholar
Efros, A., Berg, A., Mori, G., Malik, J.: Recognizing action at a distance. In: IEEE International Conference on Computer Vision (ICCV) (2003)
Google Scholar
Fathi, A., Mori, G.: Action recognition by learning mid-level motion features. In: IEEE International Conference on Computer Vision and Pattern Recognition (CVPR) (2008)
Google Scholar
Gilbert, A., Illingworth, J., Bowden, R.: Action recognition using mined hierarchical compound features. IEEE Transaction on Pattern Analysis and Machine Intelligence (PAMI) (2010)
Google Scholar
Gorelick, L., Blank, M., Shechtman, E., Irani, M., Basri, R.: Actions as space–time shapes. In: IEEE International Conference on Computer Vision (ICCV) (2005)
Google Scholar
Gupta, A., Kembhavi, A., Davis, L.: Observing human–object interactions: Using spatial and functional compatibility for recognition. IEEE Trans. Pattern Anal. Mach. Intell. 31(10), 1775–1789 (2009)
Article Google Scholar
Han, D., Bo, L., Sminchisescu, C.: Selection and context for action recognition. In: IEEE International Conference on Computer Vision (ICCV) (2009)
Google Scholar
IEEE: Performance Evaluation of Tracking and Surveillance (2004)
Google Scholar
IEEE: Performance Evaluation of Tracking and Surveillance (2007)
Google Scholar
IEEE: Performance Evaluation of Tracking and Surveillance (2009)
Google Scholar
Ikizler-Cinbis, N., Sclaroff, S.: Object, scene and actions: Combining multiple features for human action recognition. In: IEEE European Conference on Computer Vision (ECCV) (2010)
Google Scholar
Jhuang, H., Serre, T., Wolf, L., Poggio, T.: A biologically inspired system for action recognition. In: IEEE International Conference on Computer Vision (ICCV) (2007)
Google Scholar
Jiang, H., Martin, D.: Finding actions using shape flows. In: IEEE European Conference on Computer Vision (ECCV) (2008)
Google Scholar
Ke, Y., Sukthankar, R., Hebert, M.: Event detection in cluttered videos. In: IEEE International Conference on Computer Vision (ICCV) (2007)
Google Scholar
Kjellström, H., Romero, J., Martínez, D., Kragić, D.: Simultaneous visual recognition of manipulation actions and manipulated objects. In: IEEE European Conference on Computer Vision (ECCV) (2008)
Google Scholar
Kovashka, A., Grauman, K.: Learning a hierarchy of discriminative space–time neighborhood features for human action recognition. In: IEEE International Conference on Computer Vision and Pattern Recognition (CVPR) (2010)
Google Scholar
Laptev, I., Perez, P.: Retrieving actions in movies. In: IEEE International Conference on Computer Vision (ICCV), pp. 1–8 (2007)
Chapter Google Scholar
Laptev, I., Marszałek, M., Schmid, C., Rozenfeld, B.: Learning realistic human actions from movies. In: IEEE International Conference on Computer Vision and Pattern Recognition (CVPR) (2008)
Google Scholar
Lin, Z., Jiang, Z., Davis, L.: Recognizing actions by shape-motion prototype trees. In: IEEE International Conference on Computer Vision (ICCV), pp. 444–451 (2009)
Chapter Google Scholar
Liu, J., Shah, M.: Learning human action via information maximization. In: IEEE International Conference on Computer Vision and Pattern Recognition (CVPR) (2008)
Google Scholar
Liu, J., Luo, J., Shah, M.: Recognizing realistic actions from videos in the wild. In: IEEE International Conference on Computer Vision and Pattern Recognition (CVPR) (2009)
Google Scholar
Liu, J., Yang, Y., Shah, M.: Learning semantic visual vocabularies using diffusion distance. In: IEEE International Conference on Computer Vision and Pattern Recognition (CVPR) (2009)
Google Scholar
Lv, F., Nevatia, R.: Single view human action recognition using key pose matching and Viterbi path searching. In: IEEE International Conference on Computer Vision and Pattern Recognition (CVPR) (2007)
Google Scholar
Marszałek, M., Laptev, I., Schmid, C.: Actions in context. In: IEEE International Conference on Computer Vision and Pattern Recognition (CVPR) (2009)
Google Scholar
Matikainen, P., Hebert, M., Sukthankar, R.: Representing pairwise spatial and temporal relations for action recognition. In: IEEE European Conference on Computer Vision (ECCV) (2010)
Google Scholar
Messing, R., Pal, C., Kautz, H.: Activity recognition using the velocity histories of tracked keypoints. In: IEEE International Conference on Computer Vision (ICCV) (2009)
Google Scholar
Niebles, J., Li, F.-F.: A hierarchical model of shape and appearance for human action classification. In: IEEE International Conference on Computer Vision and Pattern Recognition (CVPR) (2007)
Google Scholar
Niebles, J., Chen, C.-W., Li, F.-F.: Modeling temporal structure of decomposable motion segments for activity classification. In: IEEE European Conference on Computer Vision (ECCV) (2010)
Google Scholar
Prabhakar, K., Oh, S., Wang, P., Abowd, G., Rehg, J.: Temporal causality for the analysis of visual events. In: IEEE International Conference on Computer Vision and Pattern Recognition (CVPR) (2010)
Google Scholar
Raptis, M., Soatto, S.: Tracklet descriptors for action modeling and video analysis. In: IEEE European Conference on Computer Vision (ECCV) (2010)
Google Scholar
Ribeiro, P., Santos-Victor, J.: Human activity recognition from video: modeling, feature selection and classification architecture. In: International Workshop on Human Activity Recognition and Modelling (2005)
Google Scholar
Rodriguez, M., Ahmed, J., Shah, M.: Action mach: A spatio-temporal maximum average correlation height filter for action recognition. In: IEEE International Conference on Computer Vision and Pattern Recognition (CVPR) (2008)
Google Scholar
Russell, B., Torralba, A., Murphy, K.: Labelme: A database and web-based tool for image annotation. Int. J. Comput. Vis. 77(1–3), 157–173 (2008)
Article Google Scholar
Satkin, S., Hebert, M.: Modeling the temporal extent of actions. In: IEEE European Conference on Computer Vision (ECCV) (2010)
Google Scholar
Schuldt, C., Laptev, I., Caputo, B.: Recognizing human actions: A local SVM approach. In: International Conference on Pattern Recognition (ICPR) (2004)
Google Scholar
Sigal, L., Balan, A., Black, M.: Humaneva: Synchronized video and motion capture dataset and baseline algorithm for evaluation of articulated human motion. International Journal on Computer Vision (IJCV) 87(1–2) (2010)
Google Scholar
Smeaton, A., Over, P., Kraaij, W.: Evaluation campaigns and trecvid. In: ACM International Conference on Multimedia Information Retrieval (MIR) (2006)
Google Scholar
Sun, J., Wu, X., Yan, S., Cheong, L.-F., Chua, T.-S., Li, J.: Hierarchical spatio-temporal context modeling for action recognition. In: IEEE International Conference on Computer Vision and Pattern Recognition (CVPR) (2009)
Google Scholar
Tran, D., Sorokin, A.: Human activity recognition with metric learning. In: IEEE European Conference on Computer Vision (ECCV) (2008)
Google Scholar
Turaga, P., Chellappa, R.: Machine recognition of human activities: A survey. IEEE Trans. Circuits Syst. Video Technol. 18(11), 1473–1488 (2008)
Article Google Scholar
Uemura, H., Ishikawa, S., Mikolajczyk, K.: Feature tracking and motion compensation for action recognition. In: British Machine Vision Conference (BMVC) (2008)
Google Scholar
Venkata, S., Ahn, I., Jeon, D., Gupta, A., Louie, C., Garcia, S., Belongie, S., Taylor, M.: Sd-vbs: The San Diego Vision Benchmark Suite (2009)
Google Scholar
Wang, P., Abowd, G., Rehg, J.: Quasi-periodic event analysis for social game retrieval. In: IEEE International Conference on Computer Vision (ICCV) (2009)
Google Scholar
Wang, Y., Mori, G.: Learning a discriminative hidden part model for human action recognitio. In: Advances in Neural Information Processing Systems (NIPS) (2008)
Google Scholar
Wang, Y., Mori, G.: Human action recognition by semilatent topic models. IEEE Trans. Pattern Anal. Mach. Intell. 31(10), 1762–1774 (2009)
Article Google Scholar
Wang, Y., Mori, G.: Max-margin hidden conditional random fields for human action recognition. In: IEEE International Conference on Computer Vision and Pattern Recognition (CVPR) (2009)
Google Scholar
Weinland, D., Boyer, E., Ronfard, R.: Action recognition from arbitrary views using 3d exemplars. In: IEEE International Conference on Computer Vision (ICCV) (2007)
Google Scholar
Weinland, D., Ronfard, R., Boyer, E.: Free viewpoint action recognition using motion history volumes. Computer Vision and Image Understanding (2006)
Google Scholar
Yao, B., Fei-Fei, L.: Grouplet: A structured image representation for recognizing human and object interactions. In: IEEE International Conference on Computer Vision and Pattern Recognition (CVPR) (2010)
Google Scholar
Yao, B., Fei-Fei, L.: Modeling mutual context of object and human pose in human–object interaction activities. In: IEEE International Conference on Computer Vision and Pattern Recognition (CVPR) (2010)
Google Scholar
Yao, B., Zhu, S.-C.: Learning deformable action templates from cluttered videos. In: IEEE International Conference on Computer Vision (ICCV) (2009)
Google Scholar
Yeffet, L., Wolf, L.: Local trinary patterns for human action recognition. In: IEEE International Conference on Computer Vision (ICCV) (2009)
Google Scholar
Yuan, J., Liu, Z., Wu, Y.: Discriminative subvolume search for efficient action detection. In: IEEE International Conference on Computer Vision and Pattern Recognition (CVPR) (2009)
Google Scholar
Yuen, J., Russell, B., Liu, C., Torralba, A.: Labelme video: Building a video database with human annotations. In: IEEE International Conference on Computer Vision (ICCV) (2009)
Google Scholar

Download references

Author information

Authors and Affiliations

University of Washington, Seattle, WA, 98195, USA
Haowei Liu & Ming-Ting Sun
IBM T.J. Watson Research Center, Hawthorn, NY, 10532, USA
Rogerio Feris

Authors

Haowei Liu
View author publications
You can also search for this author in PubMed Google Scholar
Rogerio Feris
View author publications
You can also search for this author in PubMed Google Scholar
Ming-Ting Sun
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Haowei Liu .

Editor information

Editors and Affiliations

Department of Media Technology, Aalborg University, Niels Jernes Vej 14, Aalborg, 9220, Denmark
Thomas B. Moeslund
Centre for Vision, Speech & Signal Proc., University of Surrey, Guildford, GU2 7XH, Surrey, United Kingdom
Adrian Hilton
Copenhagen Institute of Technology, Aalborg University, Lautrupvang 2B, Ballerup, 2750, Denmark
Volker Krüger
Disney Research, Forbes Avenue 615, Pittsburgh, 15213, Pennsylvania, USA
Leonid Sigal

Rights and permissions

Reprints and permissions

Copyright information

About this chapter

Cite this chapter

Liu, H., Feris, R., Sun, MT. (2011). Benchmarking Datasets for Human Activity Recognition. In: Moeslund, T., Hilton, A., Krüger, V., Sigal, L. (eds) Visual Analysis of Humans. Springer, London. https://doi.org/10.1007/978-0-85729-997-0_20

Download citation

DOI: https://doi.org/10.1007/978-0-85729-997-0_20
Publisher Name: Springer, London
Print ISBN: 978-0-85729-996-3
Online ISBN: 978-0-85729-997-0
eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics