Skip to main content

Cross-View Action Recognition Based on Statistical Machine Translation

  • Conference paper

Part of the book series: Lecture Notes in Computer Science ((LNIP,volume 7701))

Abstract

In this paper, we propose an approach for human action recognition from different views in a knowledge transfer framework. Each frame in an action is considered as a sentence in an article. We believe that, though the appearance for the same action is quite different in different views, there exists some translation relationship between them. To abstract the relationship, we use the IBM Model 1 in statistical machine translation and the translation probabilities for vocabularies in the source view to those in the target view can be obtained from the training data. Consequently, we can translate an action based on the maximum a posteriori criterion. We validated our method on the public multi-view IXMAS dataset and obtained promising results compared to the state-of-the-art knowledge transfer based methods.

This is a preview of subscription content, log in via an institution.

Buying options

Chapter
USD   29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD   39.99
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD   54.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Learn about institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.

References

  1. Brown, P.F., Pietra, V.J., Pietra, S.A.D., Mercer, R.L.: The mathematics of statistical machine translation: parameter estimation. Computational Linguistics 19, 263–311 (1993)

    Google Scholar 

  2. Dollar, P., Rabaud, V., Cottrell, G., Belongie, S.: Behavior recognition via sparse spatio-temporal features. In: IEEE International Workshop on VS-PETS (2005)

    Google Scholar 

  3. Farhadi, A., Tabrizi, M.K.: Learning to Recognize Activities from the Wrong View Point. In: Forsyth, D., Torr, P., Zisserman, A. (eds.) ECCV 2008, Part I. LNCS, vol. 5302, pp. 154–166. Springer, Heidelberg (2008)

    Chapter  Google Scholar 

  4. Ji, X., Liu, H.: Advances in view-invariant human motion analysis: a review. IEEE Transactions on System, Man, and Cybernetics, Part C: Applications and Reviews 40(1), 13–24 (2010)

    Article  Google Scholar 

  5. Junejo, I.N., Dexter, E., Laptev, I., Pérez, P.: Cross-View Action Recognition from Temporal Self-similarities. In: Forsyth, D., Torr, P., Zisserman, A. (eds.) ECCV 2008, Part II. LNCS, vol. 5303, pp. 293–306. Springer, Heidelberg (2008)

    Chapter  Google Scholar 

  6. Junejo, I.N., Dexter, E., Laptev, I., Pérez, P.: View-independent action recognition from temporal self-similarities. IEEE Transactions on Pattern Analysis and Machine Intelligence 33(1), 172–185 (2011)

    Article  Google Scholar 

  7. Liu, J., Shah, M., Kuipers, B., Savarese, S.: Cross-view action recognition via view knowledge transfer. In: IEEE Conference on Computer Vision and Pattern Recognition (2011)

    Google Scholar 

  8. Lucas, B., Kanade, T.: An iterative image registration technique with an application to stereo vision. In: International Joint Conference on Artificial Intelligence, vol. 2, pp. 674–679 (1981)

    Google Scholar 

  9. Lv, F., Nevatia, R.: Single view human action recognition using key pose matching and Viterbi path searching. In: IEEE Conference on Computer Vision and Pattern Recognition (2007)

    Google Scholar 

  10. Collins, M.: Statistical machine translation: IBM models 1 and 2, http://www.cs.columbia.edu/~cs4705/notes/ibm12.pdf

  11. Tran, D., Sorokin, A.: Human Activity Recognition with Metric Learning. In: Forsyth, D., Torr, P., Zisserman, A. (eds.) ECCV 2008, Part I. LNCS, vol. 5302, pp. 548–561. Springer, Heidelberg (2008)

    Chapter  Google Scholar 

  12. Weinland, D., Boyer, E., Ronfard, R.: Action recognition from arbitrary views using 3D exemplars. In: IEEE International Conference on Computer Vision (2007)

    Google Scholar 

  13. Weinland, D., Ronfard, R., Boyer, E.: Free viewpoint action recognition using motion history volumes. In: Computer Vision and Image Understanding, vol. 104, pp. 249–257 (2006)

    Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2012 Springer-Verlag Berlin Heidelberg

About this paper

Cite this paper

Wang, J., Zheng, H. (2012). Cross-View Action Recognition Based on Statistical Machine Translation. In: Zheng, WS., Sun, Z., Wang, Y., Chen, X., Yuen, P.C., Lai, J. (eds) Biometric Recognition. CCBR 2012. Lecture Notes in Computer Science, vol 7701. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-35136-5_32

Download citation

  • DOI: https://doi.org/10.1007/978-3-642-35136-5_32

  • Publisher Name: Springer, Berlin, Heidelberg

  • Print ISBN: 978-3-642-35135-8

  • Online ISBN: 978-3-642-35136-5

  • eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics