Abstract
Gesture recognition is important, because it is a useful communication medium between humans and computers. In this paper we use multiple sensors, i.e., PSD cameras for detecting LEDs attached on a body and DataGloves for both hands. One of the major difficulties in gesture recognition is temporal segmentation from continuous motion. We use training samples which are manually segmented and labeled as prior knowledge. A self-organizing map(SOM) is constructed based on training samples. Test gestural data are segmented by systematic search to obtain the best match with reference vectors on a competitive layer. A comparative study is done between the use of a single SOM and 3 SOMs for representing spatio-temporal information obtained from PSD cameras and DataGloves.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Preview
Unable to display preview. Download preview PDF.
References
Chappell, G. J. and Taylor, J. G. (1993). The temporal Kohonen map, Neural Networks, Vol. 6, pp. 441–445.
Harling, P.A. et al. Eds., (1996). Progress in Gestural Interaction: Proceedings of Gesture Workshop’96, Springer.
Wachsmuth, I, Froelich, M. Eds., (1997). Gesture and Sign Language in Human-Computer Interaction, International Gesture Workshop, Bielefeld, Germany, Springer.
Braffort, A. et al. Eds., (1999). Gesture-Based Communication in Human Computer Interaction: International Gesture Workshop, GW’99, Lecture Notes in Computer Science, VOL. 1739, Springer-Verlag.
Ishikawa, M. (2000). Recognition of hand-gestures based on self-organization using a DataGlove, Australian Journal of Intelligent Information Processing Systems, Vol. 6, No. 2, pp. 65–71.
Ishikawa, M. and Suenaga, H. (2001). Self-organization for temporal data of varying length, ICONIP2001, Shanghai, China, pp. 247–252.
Ishikawa, M. and Sasaki, N. (2002). Gesture recognition based on SOM using multiple sensors, 9th International Conference on Neural Information Processing(ICONIP2002), Singapore, pp. 1300–1304.
Kangas, J. (1990). Time-delayed self-organizing maps, IJCNN-90, Vol.2, pp. 331336, San Diego, CA.
Kohonen, T., (2001). Self-organizing Maps, 3rd Ed., Springer.
Kurokawa, T, Morichi, T. and Watanabe, S. (1993). Bidirectional translation between sign language and Japanese for communication with deaf-mute people, Advances in Human Factors/Ergonomics, 19B, pp. 1109–1114.
Kurtenbach, G. and Hulteen, E.A. (1990). Gestures in human-computer communication, The Art of Human-Computer Interface Design, Addison-Wesley, pp. 309–317.
Lee, H-J. and Chen, Z., (1985). Determination of 3D human body postures from a single view, Computer Vision, Graphics, and Image Processing, Vol. 30, pp. 148–168.
Mozer, M. C. (1994). Neural net architectures for temporal sequence processing, in A. Weigend and N, Gershenfeld (Eds.), Time Series Prediction: Forecasting the Future and Understanding the Past, pp.243–264, Addison Wesley.
O’Rourke, J. and Badler, N.I. (1980), Model-Based Image Analysis of Human Motion Using Constraint Propagation, IEEE Transactions on Pattern Analysis and Machine Intelligence, Vol. PAMI-2, No.6, November, pp. 522–536.
Salmela, P., Kuusisto, S., Saarinen, J., Laurila, K. and Haavisto, P. (1996). Isolated spoken number recognition with hybrid of self-organizing map and multilayer Perceptron, Proceedings of the International Conference on Neural Networks, (ICNN’96), Vol. 4, pp. 1912–1917.
Rabiner, L.R. and Juang, B.H. (1986). An introduction to hidden Markov models, IEEE ASSP Magazine, January 1986, pp. 4–16.
Rabiner, L.R. (1989). A tutorial on hidden markov models and selected applications in speech recognition, Proceedings of the IEEE, vol. 77, No. 2, pp. 257–285.
Ritter, H. and Kohonen, T. (1989). Self-organizing semantic maps, Biological Cybernetics, vol. 61, pp. 241–254.
Rohr, K. (1994), Towards Model-Based Recognition of Human Movements in Image Sequences, CVGIP: Image Understanding, Vol. 59, No. 1, pp. 94–115.
Rubine, D. (1991). Specifying gestures by example, Computer Graphics, Vol. 25, No. 4, pp. 329–337.
Takahashi, K., Seki, S., and Oka, R. (1994), Spotting recognition of human gestures from motion images, Time-Varying Image Processing and Moving Object Recognition, 3, in Cappellini, V. (Ed.), Proceedings of the 4th International Workshop, pp.65–72, Elsevier.
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2004 Springer-Verlag Berlin Heidelberg
About this chapter
Cite this chapter
Ishikawa, M. (2004). Gesture Recognition Based on SOM Using Multiple Sensors. In: Rajapakse, J.C., Wang, L. (eds) Neural Information Processing: Research and Development. Studies in Fuzziness and Soft Computing, vol 152. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-540-39935-3_21
Download citation
DOI: https://doi.org/10.1007/978-3-540-39935-3_21
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-642-53564-2
Online ISBN: 978-3-540-39935-3
eBook Packages: Springer Book Archive