Integrating Context-Free and Context-Dependent Attentional Mechanisms for Gestural Object Reference

Heidemann, Gunther; Rae, Robert; Bekel, Holger; Bax, Ingo; Ritter, Helge

doi:10.1007/3-540-36592-3_3

Part of the book series: Lecture Notes in Computer Science (LNCS, volume 2626)

Included in the following conference series:

International Conference on Computer Vision Systems

Abstract

We present a vision system for human-machine interaction based on a small wearable camera that can be mounted on ordinary glasses. The camera views the area in front of the user, in particular the hands. To evaluate hand movements for pointing gestures at objects and to recognise object reference, we introduce an approach that integrates bottom-up generated feature maps with top-down propagated recognition results. In this system, modules for context-free focus of attention work in parallel with a recognition system for hand gestures. In contrast to other approaches, the two branches are fused not on the symbolic but on the sub-symbolic level, by means of attention maps. This method is plausible from a cognitive point of view and facilitates the integration of entirely different modalities.
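
To make the idea of sub-symbolic fusion concrete, the following is a minimal sketch rather than the authors' implementation. It assumes that the context-free branch yields a 2-D saliency map and that the gesture branch yields a fingertip position and pointing direction, which are turned into a second map of the same size; the two maps are then combined by a weighted sum and the maximum is taken as the referenced location. All names (`normalize`, `pointing_map`, `fuse`, `focus_of_attention`), the Gaussian angular weighting, and the parameter `sigma_deg` are hypothetical choices made for illustration only.

```python
import math


def normalize(m):
    """Rescale a 2-D map (list of rows) linearly to [0, 1]; a flat map becomes all zeros."""
    lo = min(min(row) for row in m)
    hi = max(max(row) for row in m)
    if hi == lo:
        return [[0.0 for _ in row] for row in m]
    return [[(v - lo) / (hi - lo) for v in row] for row in m]


def pointing_map(height, width, tip, direction, sigma_deg=15.0):
    """Hypothetical top-down map: weight each pixel by its angular distance
    from the pointing ray that starts at `tip` = (row, col) and runs along
    `direction` = (d_row, d_col). The Gaussian fall-off is an assumption."""
    tip_y, tip_x = tip
    ray_angle = math.atan2(direction[0], direction[1])
    sigma = math.radians(sigma_deg)
    out = []
    for y in range(height):
        row = []
        for x in range(width):
            if (y, x) == (tip_y, tip_x):
                row.append(0.0)  # the fingertip itself is not a candidate object
                continue
            angle = math.atan2(y - tip_y, x - tip_x)
            # Signed angular difference, wrapped to [-pi, pi].
            diff = math.atan2(math.sin(angle - ray_angle),
                              math.cos(angle - ray_angle))
            row.append(math.exp(-0.5 * (diff / sigma) ** 2))
        out.append(row)
    return out


def fuse(maps, weights):
    """Fuse equally sized maps on the map level: weighted sum of normalized maps."""
    normed = [normalize(m) for m in maps]
    height, width = len(normed[0]), len(normed[0][0])
    return [[sum(w * m[y][x] for w, m in zip(weights, normed))
             for x in range(width)]
            for y in range(height)]


def focus_of_attention(m):
    """Return the (row, col) position of the largest value in the map."""
    cells = ((y, x) for y in range(len(m)) for x in range(len(m[0])))
    return max(cells, key=lambda p: m[p[0]][p[1]])


# Toy scene: two equally salient objects left and right of the fingertip.
saliency = [[0.0] * 7 for _ in range(5)]
saliency[2][1] = 1.0
saliency[2][5] = 1.0

# Fingertip at (2, 3), pointing to the right (d_row=0, d_col=1).
top_down = pointing_map(5, 7, tip=(2, 3), direction=(0, 1))
fused = fuse([saliency, top_down], weights=[1.0, 1.0])
target = focus_of_attention(fused)
```

In this toy scene the saliency map alone cannot decide between the two objects, whereas the fused map peaks at the object lying in the pointing direction, position (2, 5). Because the combination happens on maps rather than on symbolic labels, a further modality could in principle be added as one more map in the list passed to `fuse`.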

Author information

Authors and Affiliations

Neuroinformatics Group, Faculty of Technology, Bielefeld University, Postfach 10 01 31, D-33501, Bielefeld, Germany
Gunther Heidemann, Holger Bekel, Ingo Bax & Helge Ritter
PerFact Innovation, Lampingstr. 8, D-33615, Bielefeld, Germany
Robert Rae

Editor information

Editors and Affiliations

INRIA Rhône-Alpes, 655 Ave de l’Europe, 38330, Montbonnot, France
James L. Crowley
Montefiore Institute, University of Liège, 4000, Liège Sart-Tilman, Belgium
Justus H. Piater
Automation and Control Institute, Vienna University of Technology, Gusshausstraße 27/376, 1040, Vienna, Austria
Markus Vincze
Institute of Digital Image Processing, Joanneum Research, Wastiangasse 6, 8010, Graz, Austria
Lucas Paletta

About this paper

Cite this paper

Heidemann, G., Rae, R., Bekel, H., Bax, I., Ritter, H. (2003). Integrating Context-Free and Context-Dependent Attentional Mechanisms for Gestural Object Reference. In: Crowley, J.L., Piater, J.H., Vincze, M., Paletta, L. (eds) Computer Vision Systems. ICVS 2003. Lecture Notes in Computer Science, vol 2626. Springer, Berlin, Heidelberg. https://doi.org/10.1007/3-540-36592-3_3

DOI: https://doi.org/10.1007/3-540-36592-3_3
Published: 14 March 2003
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-00921-4
Online ISBN: 978-3-540-36592-1
eBook Packages: Springer Book Archive