Cross-View Action Recognition Based on Statistical Machine Translation

Wang, Jing; Zheng, Huicheng

doi:10.1007/978-3-642-35136-5_32

Cross-View Action Recognition Based on Statistical Machine Translation

Jing Wang²¹ &
Huicheng Zheng²¹

Conference paper

1845 Accesses
2 Citations

Part of the book series: Lecture Notes in Computer Science ((LNIP,volume 7701))

Abstract

In this paper, we propose an approach for human action recognition from different views in a knowledge transfer framework. Each frame in an action is considered as a sentence in an article. We believe that, though the appearance for the same action is quite different in different views, there exists some translation relationship between them. To abstract the relationship, we use the IBM Model 1 in statistical machine translation and the translation probabilities for vocabularies in the source view to those in the target view can be obtained from the training data. Consequently, we can translate an action based on the maximum a posteriori criterion. We validated our method on the public multi-view IXMAS dataset and obtained promising results compared to the state-of-the-art knowledge transfer based methods.

This is a preview of subscription content, log in via an institution.

Chapter: USD 29.95; Price excludes VAT (USA)

eBook: USD 39.99; Price excludes VAT (USA)

Softcover Book: USD 54.99; Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Learn about institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

References

Brown, P.F., Pietra, V.J., Pietra, S.A.D., Mercer, R.L.: The mathematics of statistical machine translation: parameter estimation. Computational Linguistics 19, 263–311 (1993)
Google Scholar
Dollar, P., Rabaud, V., Cottrell, G., Belongie, S.: Behavior recognition via sparse spatio-temporal features. In: IEEE International Workshop on VS-PETS (2005)
Google Scholar
Farhadi, A., Tabrizi, M.K.: Learning to Recognize Activities from the Wrong View Point. In: Forsyth, D., Torr, P., Zisserman, A. (eds.) ECCV 2008, Part I. LNCS, vol. 5302, pp. 154–166. Springer, Heidelberg (2008)
Chapter Google Scholar
Ji, X., Liu, H.: Advances in view-invariant human motion analysis: a review. IEEE Transactions on System, Man, and Cybernetics, Part C: Applications and Reviews 40(1), 13–24 (2010)
Article Google Scholar
Junejo, I.N., Dexter, E., Laptev, I., Pérez, P.: Cross-View Action Recognition from Temporal Self-similarities. In: Forsyth, D., Torr, P., Zisserman, A. (eds.) ECCV 2008, Part II. LNCS, vol. 5303, pp. 293–306. Springer, Heidelberg (2008)
Chapter Google Scholar
Junejo, I.N., Dexter, E., Laptev, I., Pérez, P.: View-independent action recognition from temporal self-similarities. IEEE Transactions on Pattern Analysis and Machine Intelligence 33(1), 172–185 (2011)
Article Google Scholar
Liu, J., Shah, M., Kuipers, B., Savarese, S.: Cross-view action recognition via view knowledge transfer. In: IEEE Conference on Computer Vision and Pattern Recognition (2011)
Google Scholar
Lucas, B., Kanade, T.: An iterative image registration technique with an application to stereo vision. In: International Joint Conference on Artificial Intelligence, vol. 2, pp. 674–679 (1981)
Google Scholar
Lv, F., Nevatia, R.: Single view human action recognition using key pose matching and Viterbi path searching. In: IEEE Conference on Computer Vision and Pattern Recognition (2007)
Google Scholar
Collins, M.: Statistical machine translation: IBM models 1 and 2, http://www.cs.columbia.edu/~cs4705/notes/ibm12.pdf
Tran, D., Sorokin, A.: Human Activity Recognition with Metric Learning. In: Forsyth, D., Torr, P., Zisserman, A. (eds.) ECCV 2008, Part I. LNCS, vol. 5302, pp. 548–561. Springer, Heidelberg (2008)
Chapter Google Scholar
Weinland, D., Boyer, E., Ronfard, R.: Action recognition from arbitrary views using 3D exemplars. In: IEEE International Conference on Computer Vision (2007)
Google Scholar
Weinland, D., Ronfard, R., Boyer, E.: Free viewpoint action recognition using motion history volumes. In: Computer Vision and Image Understanding, vol. 104, pp. 249–257 (2006)
Google Scholar

Download references

Author information

Authors and Affiliations

School of Information Science and Technology, Sun Yat-sen University, 135 West Xingang Road, 510275, Guangzhou, China
Jing Wang & Huicheng Zheng

Authors

Jing Wang
View author publications
You can also search for this author in PubMed Google Scholar
Huicheng Zheng
View author publications
You can also search for this author in PubMed Google Scholar

Editor information

Editors and Affiliations

School of Information Science and Technology, Sun Yat-Sen University, 510275, Guangzhou, P.R. China
Wei-Shi Zheng & Jianhuang Lai &
Institute of Automation, National Laboratory of Pattern Recognition, Chinese Academy of Sciences, 100190, Beijing, P.R. China
Zhenan Sun
School of Computer Science and Engineering, Beihang University, Beijing University of Aeronautics and Astronautics, 100191, Beijing, P.R. China
Yunhong Wang
Institute of Computing Technology, Chinese Academy of Sciences, 100190, Beijing, P.R. China
Xilin Chen
Department of Computer Science, Hong Kong Baptist University, Kowloon Tong, Kowloon, Hong Kong, China
Pong C. Yuen

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Wang, J., Zheng, H. (2012). Cross-View Action Recognition Based on Statistical Machine Translation. In: Zheng, WS., Sun, Z., Wang, Y., Chen, X., Yuen, P.C., Lai, J. (eds) Biometric Recognition. CCBR 2012. Lecture Notes in Computer Science, vol 7701. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-35136-5_32

Download citation

DOI: https://doi.org/10.1007/978-3-642-35136-5_32
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-642-35135-8
Online ISBN: 978-3-642-35136-5
eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics