Probability Matrix SVM+ Learning for Complex Action Recognition

Liu, Fang; Xu, Xiangmin; Qing, Chunmei; Jin, Jianxiu

doi:10.1007/978-981-10-8530-7_39

Fang Liu¹²,
Xiangmin Xu¹²,
Chunmei Qing¹² &
…
Jianxiu Jin¹²

Part of the book series: Communications in Computer and Information Science ((CCIS,volume 819))

Included in the following conference series:

International Conference on Internet Multimedia Computing and Service

1434 Accesses
1 Citations

Abstract

Complex action recognition is a hot topic in computer vision. When training a robust model, a large amount of labeled data is required. However, labeling complex actions is often time-consuming and expensive. Considering that each complex action is composed of a sequence of simple actions, we propose a new perspective to provide more information during training in order to solve the problem of insufficient labeled data. The probability matrix is then designed by manual annotation, which encodes a probability distribution of simple actions in complex actions. So the probability matrix is only available during training but unavailable during testing. Finally, a probability matrix is regared as privileged information in a SVM+ framework, and we regard this setting as probability matrix SVM+(pmSVM+). To validate the proposed model, extensive experiments are carried out on complex action datasets. Experiment results show the effectiveness of pmSVM+ for complex action recognition.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Log in via an institution

Chapter: USD 29.95; Price excludes VAT (USA)

eBook: USD 84.99; Price excludes VAT (USA)

Softcover Book: USD 107.00; Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

References

Corso, J.J., Sadanand, S.: Action bank: a high-level representation of activity in video. In: IEEE Conference on Computer Vision and Pattern Recognition, pp. 1234–1241 (2012)
Google Scholar
Gorelick, L., Blank, M., Shechtman, E., Irani, M., Basri, R.: Actions as space-time shapes. IEEE Trans. Pattern Anal. Mach. Intell. 29(12), 2247–2253 (2007)
Article Google Scholar
Hong, R., Hu, Z., Wang, R., Wang, M., Tao, D.: Multi-view object retrieval via multi-scale topic models. IEEE Trans. Image Process. 25(12), 5814–5827 (2016)
Article MathSciNet Google Scholar
Hong, R., Yang, Y., Wang, M., Hua, X.S.: Learning visual semantic relationships for efficient visual retrieval. IEEE Trans. Big Data 1(4), 152–161 (2017)
Article Google Scholar
Hong, R., Zhang, L., Zhang, C., Zimmermann, R.: Flickr circles: aesthetic tendency discovery by multi-view regularized topic modeling. IEEE Trans. Multimedia 18(8), 1555–1567 (2016)
Article Google Scholar
Kuehne, H., Jhuang, H., Garrote, E., Poggio, T., Serre, T.: HMDB: a large video database for human motion recognition. In: International Conference on Computer Vision, pp. 2556–2563 (2015)
Google Scholar
Liu, F., Xu, X., Qiu, S., Qing, C.: Simple to complex transfer learning for action recognition. IEEE Trans. Image Process. Publ. IEEE Sig. Process. Soc. 25(2), 949–960 (2015)
Article MathSciNet Google Scholar
Liu, J., Luo, J., Shah, M.: Recognizing realistic actions from videos in the wild. In: IEEE Conference on Computer Vision and Pattern Recognition, CVPR 2009, pp. 1996–2003. IEEE (2009)
Google Scholar
Niebles, J.C., Chen, C.-W., Fei-Fei, L.: Modeling temporal structure of decomposable motion segments for activity classification. In: Daniilidis, K., Maragos, P., Paragios, N. (eds.) ECCV 2010. LNCS, vol. 6312, pp. 392–405. Springer, Heidelberg (2010). https://doi.org/10.1007/978-3-642-15552-9_29
Chapter Google Scholar
Pechyony, D., Izmailov, R., Vashist, A., Vapnik, V.: SMO-style algorithms for learning using privileged information. In: International Conference on Data Mining, Dmin 2010, 12–15 July 2010, Las Vegas, Nevada, USA, pp. 235–241 (2010)
Google Scholar
Pechyony, D., Vapnik, V.: Fast optimization algorithms for solving SVM+. Stat. Learn. Data Sci. (2011)
Google Scholar
Reddy, K.K., Shah, M.: Recognizing 50 human action categories of web videos. Mach. Vis. Appl. 24(5), 971–981 (2013)
Article Google Scholar
Schüldt, C., Laptev, I., Caputo, B.: Recognizing human actions: a local SVM approach. In: Proceedings of the 17th International Conference on Pattern Recognition, ICPR 2004, vol. 3, pp. 32–36. IEEE (2004)
Google Scholar
Sharmanska, V., Quadrianto, N., Lampert, C.H.: Learning to rank using privileged information. In: IEEE International Conference on Computer Vision, pp. 825–832 (2013)
Google Scholar
Vapnik, V., Vashist, A.: A new learning paradigm: learning using privileged information. Neural Netw. 22(5), 544–557 (2009)
Article MATH Google Scholar
Wang, H., Kläser, A., Schmid, C., Liu, C.L.: Action recognition by dense trajectories. In: 2011 IEEE Conference on Computer Vision and Pattern Recognition (CVPR), pp. 3169–3176. IEEE (2011)
Google Scholar
Wang, H., Schmid, C.: Action recognition with improved trajectories. In: IEEE International Conference on Computer Vision, pp. 3551–3558 (2013)
Google Scholar
Wang, L., Qiao, Y., Tang, X.: Motionlets: mid-level 3D parts for human motion recognition. In: Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, pp. 2674–2681 (2013)
Google Scholar
Wang, S., Tao, D., Yang, J.: Relative attribute SVM+ learning for age estimation. IEEE Trans. Cybern. 46(3), 827–839 (2016)
Article Google Scholar
Zhang, Z., Wang, C., Xiao, B., Zhou, W., Liu, S.: Attribute regularization based human action recognition. IEEE Trans. Inf. Forensics Secur. 8(10), 1600–1609 (2013)
Article Google Scholar

Download references

Acknowledgment

This work is supported in part by the National Natural Science Founding of China (61171142, 61401163, U1636218), Science and Technology Planning Project of Guangdong Province of China (2014B010111003, 2014B010111006), the Fundamental Research Funds for the Central Universities (2017MS045), and Guangzhou Key Lab of Body Data Science (201605030011).

Author information

Authors and Affiliations

South China University of Technology, Guangzhou, China
Fang Liu, Xiangmin Xu, Chunmei Qing & Jianxiu Jin

Authors

Fang Liu
View author publications
You can also search for this author in PubMed Google Scholar
Xiangmin Xu
View author publications
You can also search for this author in PubMed Google Scholar
Chunmei Qing
View author publications
You can also search for this author in PubMed Google Scholar
Jianxiu Jin
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Xiangmin Xu .

Editor information

Editors and Affiliations

Multimedia Communications Department, EURECOM, Sophia Antipolis, France
Benoit Huet
Shandong University , Qingdao, China
Liqiang Nie
Hefei University of Technology , Hefei, China
Richang Hong

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Liu, F., Xu, X., Qing, C., Jin, J. (2018). Probability Matrix SVM+ Learning for Complex Action Recognition. In: Huet, B., Nie, L., Hong, R. (eds) Internet Multimedia Computing and Service. ICIMCS 2017. Communications in Computer and Information Science, vol 819. Springer, Singapore. https://doi.org/10.1007/978-981-10-8530-7_39

Download citation

DOI: https://doi.org/10.1007/978-981-10-8530-7_39
Published: 01 March 2018
Publisher Name: Springer, Singapore
Print ISBN: 978-981-10-8529-1
Online ISBN: 978-981-10-8530-7
eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics