Human Action Recognition Using Non-separable Oriented 3D Dual-Tree Complex Wavelets

Minhas, Rashid; Baradarani, Aryaz; Seifzadeh, Sepideh; Wu, Q. M. Jonathan

doi:10.1007/978-3-642-12297-2_22

Rashid Minhas¹⁹,
Aryaz Baradarani¹⁹,
Sepideh Seifzadeh²⁰ &
…
Q. M. Jonathan Wu¹⁹

Part of the book series: Lecture Notes in Computer Science ((LNIP,volume 5996))

Included in the following conference series:

Asian Conference on Computer Vision

1673 Accesses

Abstract

This paper introduces an efficient technique for simultaneous processing of video frames to extract spatio-temporal features for fine activity detection and localization. Such features, obtained through motion-selectivity attribute of 3D dual-tree complex wavelet transform (3D-DTCWT), are used to train a classifier for categorization of an incoming video. The proposed learning model offers three core advantages: 1) significantly faster training stage than traditional supervised approaches, 2) volumetric processing of video data due to the use of 3D transform, 3) rich representation of human actions in view of directionality and shift-invariance of DTCWT. No assumptions of scene background, location, objects of interest, or point of view information are made for activity learning whereas bidirectional 2D-PCA is employed to preserve structure and correlation amongst neighborhood pixels of a video frame. Experimental results compare favorably to recently published results in literature.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Log in via an institution

Chapter: USD 29.95; Price excludes VAT (USA)

eBook: USD 84.99; Price excludes VAT (USA)

Softcover Book: USD 109.99; Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

References

Ali, S., Basharat, A., Shah, M.: Chaotic invariants for human action recognition. In: Proc. of Int. Conf. on CV (2007)
Google Scholar
Baradarani, A., Wu, J.: Moving object segmentation using the 9/7–10/8 dual-tree complex filter bank. In: Proc. of the 19th IEEE Int. Conf. on PR, Florida, pp. 7–11 (2008)
Google Scholar
Black, M.J.: Explaining optical flow events with parameterized spatio-temporal models. In: Proc. of Int. Conf. on CVPR, pp. 1326–1332 (1999)
Google Scholar
Brand, M., Oliver, N., Pentland, A.: Coupled HMM for Complex Action Recognition. In: Proc. of Int. Conf. on CVPR (1997)
Google Scholar
Burns, T.J.: A non-homogeneous wavelet multiresolution analysis and its application to the analysis of motion, PhD thesis, Air Force Institute of Tech. (1993)
Google Scholar
Gorelick, L., Blank, M., Shechtman, E., Irani, M., Basri, R.: Actions as space time shapes. IEEE Trans. on PAMI, 2247–2253 (2007)
Google Scholar
Huang, G.B., Zhu, Q.Y., Siew, C.K.: Extreme learning machine: theory and applications. Neurocomputing (2005)
Google Scholar
Jiang, H., Drew, M.S., Li, Z.N.: Successive convex matching for action detection. In: Proc. of Int. Conf. on CVPR (2006)
Google Scholar
Kingsbury, N.G.: Complex wavelets for shift invariant analysis and filtering of signals. Journal of Applied and Computational Harmonic Analysis 10(3), 234–253 (2001)
Article MATH MathSciNet Google Scholar
Li, L.-J., Li, F.-F.: What, where and who? Classifying events by scene and object recognition. In: Proc. of Int. Conf. on CV (2007)
Google Scholar
Liu, J., Ali, S., Shah, M.: Recognizing human actions using multiple features. In: Proc. of Int. Conf. on CVPR (2008)
Google Scholar
Mori, G., Ren, X., Efros, A.A., Malik, J.: Recovering human body configurations: combining segmentation and recognition. In: Proc. of Int. Conf. on CVPR (2004)
Google Scholar
Niebels, J. L.F.-F.: A hierarchical model of shape and appearance for human action classification. In: Proc. of Int. Conf. on CVPR (2007)
Google Scholar
Selesnick, I.W.: Hilbert transform pairs of wavelet bases. IEEE Signal Processing Letters 8, 170–173 (2001)
Article Google Scholar
Selesnick, I.W., Li, K.Y.: Video denoising using 2D and 3D dual-tree complex wavelet transforms, Wavelet Applications in Signal and Image. In: Proc. SPIE 5207, San Diego (August 2003)
Google Scholar
Selesnick, I.W., Shi, F.: Video denoising using oriented complex wavelet transforms. In: Proc. of the IEEE Int. Conf. on Acoust., Speech, and Signal Proc., May 2004, vol. 2, pp. 949–952 (2004)
Google Scholar
Selesnick, I.W., Baraniuk, R.G., Kingsbury, N.G.: The dual-tree complex wavelet transform – a coherent framework for multiscale signal and image processing. IEEE Signal Processing Magazine 6, 123–151 (2005)
Article Google Scholar
Smith, P., Victoria, N.D., Shah, M.: TemporalBoost for event recognition. In: Proc. of Int. Conf. on CV (2005)
Google Scholar
Strang, G., Nguyen, T.: Wavelets and Filter Banks. Wellesley, Cambridge (1996)
Google Scholar
Yang, J., Zhang, D., Frangi, F., Yang, J.-Y.: Two-dimensional PCA: a new approach to appearance based face representation and recognition. IEEE Trans. on PAMI (1), 131–137 (2004)
Google Scholar
Yilmaz, A., Shah, M.: Actions sketch: a novel action representation. In: Proc. of Int. Conf. on CVPR (2005)
Google Scholar
Yu, R., Baradarani, A.: Sampled-data design of FIR dual filter banks for dual-tree complex wavelet transforms. IEEE Trans. on Signal Proc. 56(7), 3369–3375 (2008)
Article MathSciNet Google Scholar

Download references

Author information

Authors and Affiliations

Department of Electrical and Computer Engineering, University of Windsor, Ontario, N9B3P4, Canada
Rashid Minhas, Aryaz Baradarani & Q. M. Jonathan Wu
School of Computer Science, University of Windsor, Ontario, N9B3P4, Canada
Sepideh Seifzadeh

Authors

Rashid Minhas
View author publications
You can also search for this author in PubMed Google Scholar
Aryaz Baradarani
View author publications
You can also search for this author in PubMed Google Scholar
Sepideh Seifzadeh
View author publications
You can also search for this author in PubMed Google Scholar
Q. M. Jonathan Wu
View author publications
You can also search for this author in PubMed Google Scholar

Editor information

Editors and Affiliations

Department of Machine Intelligence, Peking University, 100871, Beijing, China
Hongbin Zha
Department of Advanced Information Technology, Kyushu University, 819-0395, Fukuoka, Japan
Rin-ichiro Taniguchi
Birkbeck College, Department of Computer Science, University of London, WC1E 7HX, London, UK
Stephen Maybank

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Minhas, R., Baradarani, A., Seifzadeh, S., Wu, Q.M.J. (2010). Human Action Recognition Using Non-separable Oriented 3D Dual-Tree Complex Wavelets. In: Zha, H., Taniguchi, Ri., Maybank, S. (eds) Computer Vision – ACCV 2009. ACCV 2009. Lecture Notes in Computer Science, vol 5996. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-12297-2_22

Download citation

DOI: https://doi.org/10.1007/978-3-642-12297-2_22
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-642-12296-5
Online ISBN: 978-3-642-12297-2
eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics