Model-Based Human Motion Tracking and Behavior Recognition Using Hierarchical Finite State Automata

Park, Jihun; Park, Sunghun; Aggarwal, J. K.

doi:10.1007/978-3-540-24768-5_33

Model-Based Human Motion Tracking and Behavior Recognition Using Hierarchical Finite State Automata

Jihun Park²⁰,
Sunghun Park²¹ &
J. K. Aggarwal²²

Conference paper

957 Accesses
5 Citations

Part of the book series: Lecture Notes in Computer Science ((LNCS,volume 3046))

Abstract

The generation of motion of an articulated body for computer animation is an expensive and time-consuming task. Recognition of human actions and interactions is important to video annotation, automated surveillance, and content-based video retrieval. This paper presents a new model-based human-intervention-free approach to articulated body motion tracking and recognition of human interaction using static-background monocular video sequences. This paper presents two major applications based on basic motion tracking: motion capture and human behavior recognition.

To determine a human body configuration in a scene, a 3D human body model is postulated and projected on a 2D projection plane to overlap with the foreground image silhouette. We convert the human model body overlapping problem into a parameter optimization problem to avoid the kinematic singularity problem. Unlike other methods, our body tracking does not need any user intervention. A cost function is used to estimate the degree of the overlapping between the foreground input image silhouette and a projected 3D model body silhouette. The configuration the best overlap with the foreground of the image least overlap with the background is sought. The overlapping is computed using computational geometry by converting a set of pixels from the image domain to a polygon in the 2D projection plane domain.

We recognize human interaction motion using hierarchical finite state automata (FA). The model motion data we get from tracking is analyzed to get various states and events in terms of feet, torso, and hands by a low-level behavior recognition model. The recognition model represents human behaviors as sequences of states that classify the configuration of individual body parts in space and time. To overcome the exponential growth of the number of states that usually occurs in a single-level FA, we present a new hierarchical FA that abstracts states and events from motion data at three levels: the low-level FA analyzes body parts only, the middle-level FAs recognize motion and the high-level FAs analyze a human interaction. Motion tracking results and behavior recognition from video sequences are very encouraging.

This is a preview of subscription content, log in via an institution.

Chapter: USD 29.95; Price excludes VAT (USA)

eBook: USD 84.99; Price excludes VAT (USA)

Softcover Book: USD 109.99; Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Learn about institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

References

Morris, D., Rehg, J.: Singularity analysis for articulated object tracking. In: Computer Vision and Pattern Recognition (1998)
Google Scholar
Park, J., Park, S., Aggarwal, J.K.: Human motion tracking by combining viewbased and model-based methods for monocular video sequences. In: Kumar, V., Gavrilova, M.L., Tan, C.J.K., L’Ecuyer, P. (eds.) ICCSA 2003. LNCS, vol. 2669, Springer, Heidelberg (2003)
Google Scholar
Park, J., Park, S., Aggarwal, J.K.: Model-based human motion capture from monocular video sequences. In: Yazıcı, A., Şener, C. (eds.) ISCIS 2003. LNCS, vol. 2869, pp. 405–412. Springer, Heidelberg (2003)
Chapter Google Scholar
Park, S., Park, J., Aggarwal, J.K.: Video retrieval of human interactions using model-based motion tracking and multi-layer finite state automata. In: Bakker, E.M., Lew, M., Huang, T.S., Sebe, N., Zhou, X.S. (eds.) CIVR 2003. LNCS, vol. 2728, Springer, Heidelberg (2003)
Google Scholar
Oliver, N.M., Rosario, B., Pentland, A.P.: A Bayesian computer vision system for modeling human interactions. IEEE Trans. Pattern Analysis and Machine Intelligence 22, 831–843 (2000)
Article Google Scholar
Hongeng, S., Bremond, F., Nevatia, R.: Representation and optimal recognition of human activities. In: IEEE Conf. on Computer Vision and Pattern Recognition., vol. 1, pp. 818–825 (2000)
Google Scholar
Hong, P., Turk, M., Huang, T.S.: Gesture modeling and recognition using finite state machines. In: IEEE Conf. on Face and Gesture Recognition (2000)
Google Scholar
Wada, T., Matsuyama, T.: Appearance based behavior recognition by event driven slective attention. In: IEEE Computer Society Conference on Computer Vision and Pattern Recognition, Orlando, FL, pp. 759–764 (1998)
Google Scholar
Lasdon, L., Waren, A.: GRG2 User’s Guide (1989)
Google Scholar

Download references

Author information

Authors and Affiliations

Department of Computer Engineering, Hongik University, Seoul, Korea
Jihun Park
Department of Management Information Systems, Myongji University, Seoul, Korea
Sunghun Park
Department of Electrical and Computer Engineering, The University of Texas at Austin, Austin, TX, 78712, USA
J. K. Aggarwal

Authors

Jihun Park
View author publications
You can also search for this author in PubMed Google Scholar
Sunghun Park
View author publications
You can also search for this author in PubMed Google Scholar
J. K. Aggarwal
View author publications
You can also search for this author in PubMed Google Scholar

Editor information

Editors and Affiliations

Department of Chemistry, University of Perugia, Via Elce di Sotto, 8, I-06123, Perugia, Italy
Antonio Laganá
Department of Computer Science, University of Calgary, 2500 University Drive N.W., T2N 1N4, Calgary, AB, Canada
Marina L. Gavrilova
William Norris Professor, Head of the Computer Science and Engineering Department, University of Minnesota, USA
Vipin Kumar
School of Computing, Soongsil University, Seoul, Korea
Youngsong Mun
OptimaNumerics Ltd., Cathedral House, 23-31 Waring Street, BT1 2DX, Belfast, UK
C. J. Kenneth Tan
Department of Mathematics and Computer Science, University of Perugia, via Vanvitelli, 1, I-06123, Perugia, Italy
Osvaldo Gervasi

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Park, J., Park, S., Aggarwal, J.K. (2004). Model-Based Human Motion Tracking and Behavior Recognition Using Hierarchical Finite State Automata. In: Laganá, A., Gavrilova, M.L., Kumar, V., Mun, Y., Tan, C.J.K., Gervasi, O. (eds) Computational Science and Its Applications – ICCSA 2004. ICCSA 2004. Lecture Notes in Computer Science, vol 3046. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-540-24768-5_33

Download citation

DOI: https://doi.org/10.1007/978-3-540-24768-5_33
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-22060-2
Online ISBN: 978-3-540-24768-5
eBook Packages: Springer Book Archive

Publish with us

Policies and ethics