Advertisement

Weakly Aligned Multi-part Bag-of-Poses for Action Recognition from Depth Cameras

  • Lorenzo Seidenari
  • Vincenzo Varano
  • Stefano Berretti
  • Alberto Del Bimbo
  • Pietro Pala
Part of the Lecture Notes in Computer Science book series (LNCS, volume 8158)

Abstract

In this work, we propose an efficient and effective method to recognize human actions based on the estimated 3D positions of skeletal joints in temporal sequences of depth maps. First, the body skeleton is decomposed in a set of kinematic chains, and the position of each joint is expressed in a locally defined reference system, which makes the coordinates invariant to body translations and rotations. A multi-part bag-of-poses approach is then defined, which permits the separate alignment of body parts through a nearest-neighbor classification. Experiments conducted on the MSR Daily Activity dataset show promising results.

Keywords

depth camera action recognition nearest-neighbor classification 

References

  1. 1.
    Baak, A., Muller, M., Bharaj, G., Seidel, H.P., Theobalt, C.: A data-driven approach for real-time full body pose reconstruction from a depth camera. In: Proc. of Int. Conf. on Computer Vision, Barcelona, Spain, pp. 1092–1099 (November 2011)Google Scholar
  2. 2.
    Bagdanov, A.D., Del Bimbo, A., Seidenari, L., Usai, L.: Real-time hand status recognition from RGB-D imagery. In: Proc. of Int. Conf. on Pattern Recognition, Tsukuba, Japan, pp. 2456–2459 (November 2012)Google Scholar
  3. 3.
    Berretti, S., Del Bimbo, A., Pala, P.: Superfaces: A super-resolution model for 3D faces. In: Proc. of Work. on Non-Rigid Shape Analysis and Deformable Image Alignment, Florence, Italy, pp. 73–82 (October 2012)Google Scholar
  4. 4.
    Boiman, O., Shechtman, E., Irani, M.: In defense of nearest-neighbor based image classification. In: Proc. of IEEE Conf. on Computer Vision and Pattern Recognition, Anchorage, Alaska, USA, pp. 1–8 (June 2008)Google Scholar
  5. 5.
    Girshick, R., Shotton, J., Kohli, P., Criminisi, A., Fitzgibbon, A.: Efficient regression of general-activity human poses from depth images. In: Proc. of Int. Conf. on Computer Vision, Barcelona, Spain (November 2011)Google Scholar
  6. 6.
    Hadfield, S., Bowden, R.: Kinecting the dots: Particle based scene flow from depth sensors. In: Proc. of Int. Conf. on Computer Vision, Barcelona, Spain, pp. 2290–2295 (November 2011)Google Scholar
  7. 7.
    Li, W., Zhang, Z., Liu, Z.: Action recognition based on a bag of 3D points. In: Work. on Human Communicative Behavior Analysis, San Francisco, California, pp. 9–14 (June 2010)Google Scholar
  8. 8.
    Muller, M., Röder, T.: Motion templates for automatic classification and retrieval of motion capture data. In: ACM SIGGRAPH/Eurographics Symp. on Computer Animation, Vienna, Austria, pp. 137–146 (September 2006)Google Scholar
  9. 9.
    Raptis, M., Kirovski, D., Hoppe, H.: Real-time classification of dance gestures from skeleton animation. In: ACM SIGGRAPH/Eurographics Symp. on Computer Animation, Vancouver, Canada, pp. 147–156 (August 2011)Google Scholar
  10. 10.
    Shotton, J., Fitzgibbon, A., Cook, M., Sharp, T., Finocchio, M., Moore, R., Kipman, A., Blake, A.: Real-time human pose recognition in parts from single depth images. In: IEEE Conf. on Computer Vision and Pattern Recognition, Colorado Springs, Colorado, pp. 1–8 (June 2011)Google Scholar
  11. 11.
    Vinciarelli, A., Pantic, M., Bourlard, H.: Social signal processing: Survey of an emerging domain. Image and Vison Computing 27(12), 1743–1759 (2009)CrossRefGoogle Scholar
  12. 12.
    Wang, J., Liu, Z., Wu, Y., Yuan, J.: Mining actionlet ensemble for action recognition with depth cameras. In: IEEE Conf. on Computer Vision and Pattern Recognition, Providence, Rhode Island, pp. 1–8 (June 2012)Google Scholar
  13. 13.
    Xia, L., Chen, C.C., Aggarwal, J.K.: View invariant human action recognition using histograms of 3D joints. In: Work. on Human Activity Understanding from 3D Data, Providence, Rhode Island, pp. 20–27 (June 2012)Google Scholar
  14. 14.
    Yang, X., Tian, Y.: Eigenjoints-based action recognition using naive-bayes-nearest-neighbor. In: Work. on Human Activity Understanding from 3D Data, Providence, Rhode Island, pp. 14–19 (June 2012)Google Scholar

Copyright information

© Springer-Verlag Berlin Heidelberg 2013

Authors and Affiliations

  • Lorenzo Seidenari
    • 1
  • Vincenzo Varano
    • 1
  • Stefano Berretti
    • 1
  • Alberto Del Bimbo
    • 1
  • Pietro Pala
    • 1
  1. 1.University of FirenzeFirenzeItaly

Personalised recommendations