Abstract
For any visually-based interaction between persons and acting systems within a real-world environment the localization of a user by the system is a necessary condition. The presented work deals with this visual localization problem of a user concretely referred to the autonomous mobile robot system MILVA of our department. Since this system is applied under real-world conditions especially for the localization some proper techniques are needed which have an adequate robustness. In our opinion, this requires the combination of several components of saliency towards a multi-cue approach, consisting of structure- and color-based features [2].
This paper introduces one of them: the localization based on a typical shape of contour. A simple contour shape prototype model consists of an arrangement of oriented filters doing a piecewise approximation of the upper shape (head, shoulder) of a frontally aligned person. Applying such filter arrangement in a multiresolution manner, this leads to a robust localization of frontally aligned persons even in depth. The central problem of selecting the most promising (salient) image region is treated by means of a three-dimensional dynamic neural field performing a dynamic winner-take-all process (WTA, [1,6]).
After a successful localization of a person one can start a more detailed analysis of the gesture’s meaning: besides the recognition of static gestures we also concentrate on the acquisition and later the recognition of dynamic gestures.
TMR Marie Curie EU Fellowship # ERB FMBI CT 97 2613
Thuringian Ministry of Science, Research and Culture: GESTIK-Project
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Preview
Unable to display preview. Download preview PDF.
References
Amari S.: Dynamics of Pattern Formation in Lateral-Inhibition Type Neural Fields. Biol. Cyb., 27:77–87, 1977.
Boehme H.-J., Braumann U.-D., Brakensiek A., Corradini A., Krabbes M. and Gross H.-M.: User Localization for Visually-based Human-Machine-Interaction, In Proc. of the Third IEEE Int. Conf. on Automatic Face and Gesture Recognition, pp. 486–491, 1998
Davis, J. W. and Bobick, A. F.: The Representation and Recognition of Action Using Temporal Templates. In Proc. of the IEEE Computer Society Conference on Comp. Vis. and Patt. Rec., pp. 928–934, 1997.
Jähne, B.: Digital Image Processing Springer-Verlag, 1995.
Jones, J. P. and Palmer, L. A.: An Evaluation of the Two-Dimensional Gabor Filter Model of Simple Receptive Fields in Cat Striate Cortex. J. Neurophysiol., 58(6):1233–1258, 1987.
Kaski, S. and Kohonen, T.: WTA Networks for Physiological Models of Competitive Learning. Neural Networks, 7(6/7):973–984, 1994.
Koenderink, J. J. and v. Doorn, A. J.: Receptive Field Families. Biol. Cyb., 63:291–297, 1990.
Koffka, K.: Principles of the Gestalt Psychology. Brace & World, 1935.
Oren, M., Papageorgiou, C., Sinha, P., Osuna, E., and Poggio, T.: Pedestrian Detection Using Wavelet Templates. In Proc. of the IEEE Computer Society Conference on Comp. Vis. and Patt. Rec., pp. 193–199, 1997.
Reichardt, W., Poggio, T., and Hausen, K.: Figure-Ground Discrimination by Relative Movement in the Visual System of the Fly — Part II: Towards a Neural Circuitry. Biol Cyb., 46(suppl.):1–30, 1983.
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 1999 Springer-Verlag London Limited
About this paper
Cite this paper
Corradini, A., Braumann, UD., Brakensiek, A., Krabbes, M., Boehme, HJ., Gross, HM. (1999). Visual Person Localization with Dynamic Neural Fields: Towards a Gesture Recognition System. In: Marinaro, M., Tagliaferri, R. (eds) Neural Nets WIRN VIETRI-98. Perspectives in Neural Computing. Springer, London. https://doi.org/10.1007/978-1-4471-0811-5_19
Download citation
DOI: https://doi.org/10.1007/978-1-4471-0811-5_19
Publisher Name: Springer, London
Print ISBN: 978-1-4471-1208-2
Online ISBN: 978-1-4471-0811-5
eBook Packages: Springer Book Archive