Skip to main content

DTW Based Clustering to Improve Hand Gesture Recognition

  • Conference paper

Part of the book series: Lecture Notes in Computer Science ((LNIP,volume 7065))

Abstract

Vision based hand gesture recognition systems track the hands and extract their spatial trajectory and shape information, which are then classified with machine learning methods. In this work, we propose a dynamic time warping (DTW) based pre-clustering technique to significantly improve hand gesture recognition accuracy of various graphical models used in the human computer interaction (HCI) literature. A dataset of 1200 samples consisting of the ten digits written in the air by 12 people is used to show the efficiency of the method. Hidden Markov model (HMM), input-output HMM (IOHMM), hidden conditional random field (HCRF) and explicit duration model (EDM), which is a type of hidden semi Markov model (HSMM) are trained on the raw dataset and the clustered dataset. Optimal model complexities and recognition accuracies of each model for both cases are compared. Experiments show that the recognition rates undergo substantial improvement, reaching perfect accuracy for most of the models, and the optimal model complexities are significantly reduced.

This is a preview of subscription content, log in via an institution.

Buying options

Chapter
USD   29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD   54.99
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD   69.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Learn about institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.

References

  1. Bengio, Y., Frasconi, P.: Input-output HMM’s for sequence processing. IEEE Transactions on Neural Networks 7(5), 1231–1249 (1996)

    Article  Google Scholar 

  2. Hu, J., Ray, B., Han, L.: An interweaved hmm/dtw approach to robust time series clustering. In: 18th International Conference on Pattern Recognition, ICPR 2006, vol. 3, pp. 145–148 (August 2006)

    Google Scholar 

  3. Keskin, C., Akarun, L.: Stars: Sign tracking and recognition system using input-output hmms. Pattern Recogn. Lett. 30, 1086–1095 (2009)

    Article  Google Scholar 

  4. Lafferty, J.D., McCallum, A., Pereira, F.C.N.: Conditional random fields: Probabilistic models for segmenting and labeling sequence data. In: Proceedings of the Eighteenth International Conference on Machine Learning, pp. 282–289. Morgan Kaufmann Publishers Inc. (2001)

    Google Scholar 

  5. Lee, H.-K., Kim, J.-H.: Gesture spotting from continuous hand motion. Pattern Recognition Letters 19(5-6), 513–520 (1998)

    Article  Google Scholar 

  6. Liao, T.W.: Clustering of time series data - a survey. Pattern Recognition, 1857–1874 (2005)

    Google Scholar 

  7. Ma, G., Lin, X.: Typical Sequences Extraction and Recognition. In: Sebe, N., Lew, M., Huang, T.S. (eds.) ECCV/HCI 2004. LNCS, vol. 3058, pp. 60–71. Springer, Heidelberg (2004)

    Chapter  Google Scholar 

  8. Meila, M., Shi, J.: A random walks view of spectral segmentation (2001)

    Google Scholar 

  9. Oates, T., Firoiu, L., Cohen, P.: Using Dynamic Time Warping to Bootstrap Hmm-Based Clustering of Time Series. In: Sun, R., Giles, C.L. (eds.) Sequence Learning. LNCS (LNAI), vol. 1828, pp. 35–52. Springer, Heidelberg (2001)

    Chapter  Google Scholar 

  10. Pavlovic, V., Sharma, R., Huang, T.S.: Visual interpretation of hand gestures for human-computer interaction: A review. IEEE Tran. on Patt. Anal. and Machine Intel. 19(7), 677–695 (1997)

    Article  Google Scholar 

  11. Rabiner, L., Juang, B.: An introduction to hidden markov models. In: IEEE Acoustic Speech Signal Processing Magazine, pp. 3–4 (1986)

    Google Scholar 

  12. Shi, J., Malik, J.: Normalized cuts and image segmentation. In: Proceedings of the 1997 Conference on Computer Vision and Pattern Recognition (CVPR 1997), p. 731. IEEE Computer Society, Washington, DC (1997)

    Google Scholar 

  13. Wang, S.B., Quattoni, A., Morency, L.-P., Demirdjian, D.: Hidden conditional random fields for gesture recognition. In: CVPR 2006: Proceedings of the 2006 IEEE Computer Society Conference on Computer Vision and Pattern Recognition, pp. 1521–1527. IEEE Computer Society, Washington, DC (2006)

    Google Scholar 

  14. Yu, S.-Z., Kobayashi, H.: Practical implementation of an efficient forward-backward algorithm for an explicit-duration hidden markov model. IEEE Transactions on Signal Processing 54(5), 1947–1951 (2006)

    Article  Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2011 Springer-Verlag Berlin Heidelberg

About this paper

Cite this paper

Keskin, C., Cemgil, A.T., Akarun, L. (2011). DTW Based Clustering to Improve Hand Gesture Recognition. In: Salah, A.A., Lepri, B. (eds) Human Behavior Understanding. HBU 2011. Lecture Notes in Computer Science, vol 7065. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-25446-8_8

Download citation

  • DOI: https://doi.org/10.1007/978-3-642-25446-8_8

  • Publisher Name: Springer, Berlin, Heidelberg

  • Print ISBN: 978-3-642-25445-1

  • Online ISBN: 978-3-642-25446-8

  • eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics