Abstract
In this paper we introduce connectionist techniques for visually mediated interaction, intended for applications such as video-conferencing. First, we briefly present background work on the recognition of identity, expression and pose using Radial Basis Function (RBF) networks. Flexible, example-based learning methods allow a set of specialised networks to be trained. Second, we address the problem of gesture-based communication and attentional focus using Time-Delay versions of the networks. Colour/motion cues are used to direct face detection and the capture of ‘attentional frames’ surrounding the upper torso and head of the subjects, which focus the processing for visually mediated interaction. Third, we present methods for gesture recognition and behaviour (user-camera) coordination in the system. We take an appearance-based approach and use the specific phases of communicative gestures to control the cameras in an integrated system.
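As a rough illustration of the time-delay idea mentioned above, the following minimal sketch concatenates a short window of per-frame feature vectors into one input and classifies it with an example-based RBF layer (one Gaussian unit per training example). It is not the authors' implementation: the names `time_delay_window` and `ExampleBasedRBF`, the toy two-value frame features, the gesture labels, and the value of `sigma` are all hypothetical, and the output layer here is a simple normalised sum of hidden activations per class, which is an assumption made to keep the example short.

```python
import math


def time_delay_window(frames, start, width):
    """Concatenate `width` consecutive frame vectors into one input vector."""
    window = frames[start:start + width]
    if len(window) < width:
        raise ValueError("not enough frames for the requested window")
    return [value for frame in window for value in frame]


class ExampleBasedRBF:
    """Hypothetical RBF classifier with one Gaussian hidden unit per example."""

    def __init__(self, sigma=1.0):
        self.sigma = sigma
        self.centres = []
        self.labels = []

    def fit(self, inputs, labels):
        # Example-based learning: each training input becomes a hidden-unit centre.
        self.centres = [list(x) for x in inputs]
        self.labels = list(labels)
        return self

    def scores(self, x):
        # Assumed output layer: per-class sum of Gaussian activations, normalised.
        totals = {label: 0.0 for label in self.labels}
        for centre, label in zip(self.centres, self.labels):
            dist_sq = sum((a - b) ** 2 for a, b in zip(x, centre))
            totals[label] += math.exp(-dist_sq / (2.0 * self.sigma ** 2))
        norm = sum(totals.values())
        if norm == 0.0:
            # All activations underflowed: fall back to a uniform score.
            return {label: 1.0 / len(totals) for label in totals}
        return {label: total / norm for label, total in totals.items()}

    def predict(self, x):
        class_scores = self.scores(x)
        return max(class_scores, key=class_scores.get)


# Toy sequences of two-value frame features (made-up data for illustration).
left_frames = [[0.5, 0.5], [0.4, 0.5], [0.3, 0.5]]
right_frames = [[0.5, 0.5], [0.6, 0.5], [0.7, 0.5]]
test_frames = [[0.5, 0.52], [0.58, 0.5], [0.69, 0.5]]

WIDTH = 3  # number of frames in each time-delay window
train_inputs = [
    time_delay_window(left_frames, 0, WIDTH),
    time_delay_window(right_frames, 0, WIDTH),
]
train_labels = ["point_left", "point_right"]

network = ExampleBasedRBF(sigma=0.2).fit(train_inputs, train_labels)
test_input = time_delay_window(test_frames, 0, WIDTH)
prediction = network.predict(test_input)
print(prediction)
```

In this sketch the temporal context comes only from the windowing step, so the same classifier could be used on single frames by setting the window width to 1.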
© 2002 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Howell, A.J., Buxton, H. (2002). Visually Mediated Interaction Using Learnt Gestures and Camera Control. In: Wachsmuth, I., Sowa, T. (eds) Gesture and Sign Language in Human-Computer Interaction. GW 2001. Lecture Notes in Computer Science, vol 2298. Springer, Berlin, Heidelberg. https://doi.org/10.1007/3-540-47873-6_29
DOI: https://doi.org/10.1007/3-540-47873-6_29
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-43678-2
Online ISBN: 978-3-540-47873-7
eBook Packages: Springer Book Archive