Abstract
Complex action recognition is a hot topic in computer vision. When training a robust model, a large amount of labeled data is required. However, labeling complex actions is often time-consuming and expensive. Considering that each complex action is composed of a sequence of simple actions, we propose a new perspective to provide more information during training in order to solve the problem of insufficient labeled data. The probability matrix is then designed by manual annotation, which encodes a probability distribution of simple actions in complex actions. So the probability matrix is only available during training but unavailable during testing. Finally, a probability matrix is regared as privileged information in a SVM+ framework, and we regard this setting as probability matrix SVM+(pmSVM+). To validate the proposed model, extensive experiments are carried out on complex action datasets. Experiment results show the effectiveness of pmSVM+ for complex action recognition.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
References
Corso, J.J., Sadanand, S.: Action bank: a high-level representation of activity in video. In: IEEE Conference on Computer Vision and Pattern Recognition, pp. 1234–1241 (2012)
Gorelick, L., Blank, M., Shechtman, E., Irani, M., Basri, R.: Actions as space-time shapes. IEEE Trans. Pattern Anal. Mach. Intell. 29(12), 2247–2253 (2007)
Hong, R., Hu, Z., Wang, R., Wang, M., Tao, D.: Multi-view object retrieval via multi-scale topic models. IEEE Trans. Image Process. 25(12), 5814–5827 (2016)
Hong, R., Yang, Y., Wang, M., Hua, X.S.: Learning visual semantic relationships for efficient visual retrieval. IEEE Trans. Big Data 1(4), 152–161 (2017)
Hong, R., Zhang, L., Zhang, C., Zimmermann, R.: Flickr circles: aesthetic tendency discovery by multi-view regularized topic modeling. IEEE Trans. Multimedia 18(8), 1555–1567 (2016)
Kuehne, H., Jhuang, H., Garrote, E., Poggio, T., Serre, T.: HMDB: a large video database for human motion recognition. In: International Conference on Computer Vision, pp. 2556–2563 (2015)
Liu, F., Xu, X., Qiu, S., Qing, C.: Simple to complex transfer learning for action recognition. IEEE Trans. Image Process. Publ. IEEE Sig. Process. Soc. 25(2), 949–960 (2015)
Liu, J., Luo, J., Shah, M.: Recognizing realistic actions from videos in the wild. In: IEEE Conference on Computer Vision and Pattern Recognition, CVPR 2009, pp. 1996–2003. IEEE (2009)
Niebles, J.C., Chen, C.-W., Fei-Fei, L.: Modeling temporal structure of decomposable motion segments for activity classification. In: Daniilidis, K., Maragos, P., Paragios, N. (eds.) ECCV 2010. LNCS, vol. 6312, pp. 392–405. Springer, Heidelberg (2010). https://doi.org/10.1007/978-3-642-15552-9_29
Pechyony, D., Izmailov, R., Vashist, A., Vapnik, V.: SMO-style algorithms for learning using privileged information. In: International Conference on Data Mining, Dmin 2010, 12–15 July 2010, Las Vegas, Nevada, USA, pp. 235–241 (2010)
Pechyony, D., Vapnik, V.: Fast optimization algorithms for solving SVM+. Stat. Learn. Data Sci. (2011)
Reddy, K.K., Shah, M.: Recognizing 50 human action categories of web videos. Mach. Vis. Appl. 24(5), 971–981 (2013)
Schüldt, C., Laptev, I., Caputo, B.: Recognizing human actions: a local SVM approach. In: Proceedings of the 17th International Conference on Pattern Recognition, ICPR 2004, vol. 3, pp. 32–36. IEEE (2004)
Sharmanska, V., Quadrianto, N., Lampert, C.H.: Learning to rank using privileged information. In: IEEE International Conference on Computer Vision, pp. 825–832 (2013)
Vapnik, V., Vashist, A.: A new learning paradigm: learning using privileged information. Neural Netw. 22(5), 544–557 (2009)
Wang, H., Kläser, A., Schmid, C., Liu, C.L.: Action recognition by dense trajectories. In: 2011 IEEE Conference on Computer Vision and Pattern Recognition (CVPR), pp. 3169–3176. IEEE (2011)
Wang, H., Schmid, C.: Action recognition with improved trajectories. In: IEEE International Conference on Computer Vision, pp. 3551–3558 (2013)
Wang, L., Qiao, Y., Tang, X.: Motionlets: mid-level 3D parts for human motion recognition. In: Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, pp. 2674–2681 (2013)
Wang, S., Tao, D., Yang, J.: Relative attribute SVM+ learning for age estimation. IEEE Trans. Cybern. 46(3), 827–839 (2016)
Zhang, Z., Wang, C., Xiao, B., Zhou, W., Liu, S.: Attribute regularization based human action recognition. IEEE Trans. Inf. Forensics Secur. 8(10), 1600–1609 (2013)
Acknowledgment
This work is supported in part by the National Natural Science Founding of China (61171142, 61401163, U1636218), Science and Technology Planning Project of Guangdong Province of China (2014B010111003, 2014B010111006), the Fundamental Research Funds for the Central Universities (2017MS045), and Guangzhou Key Lab of Body Data Science (201605030011).
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2018 Springer Nature Singapore Pte Ltd.
About this paper
Cite this paper
Liu, F., Xu, X., Qing, C., Jin, J. (2018). Probability Matrix SVM+ Learning for Complex Action Recognition. In: Huet, B., Nie, L., Hong, R. (eds) Internet Multimedia Computing and Service. ICIMCS 2017. Communications in Computer and Information Science, vol 819. Springer, Singapore. https://doi.org/10.1007/978-981-10-8530-7_39
Download citation
DOI: https://doi.org/10.1007/978-981-10-8530-7_39
Published:
Publisher Name: Springer, Singapore
Print ISBN: 978-981-10-8529-1
Online ISBN: 978-981-10-8530-7
eBook Packages: Computer ScienceComputer Science (R0)