Abstract
This paper presents our data collection and first evaluations on visual focus of attention in dynamic meeting scenes. We included moving focus targets and unforeseen interruptions in each meeting by guiding it along a predefined script of events that three participating actors were instructed to follow. The remaining meeting attendees were not briefed on the upcoming actions or the general purpose of the meeting, so we were able to capture their natural focus changes within this predefined dynamic scenario using an extensive setup of both visual and acoustic sensors throughout our smart room. We present an adaptive approach to estimating visual focus of attention from head orientation under these unforeseen conditions and show that our system achieves an overall recognition rate of 59%, an improvement of 9% over choosing the best-matching focus target directly from the observed head orientation angles.
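To make the contrast in the abstract concrete, the following sketch illustrates the two strategies it compares: a baseline that assigns the focus target geometrically closest to the observed head orientation, and an adaptive variant that updates each target's expected head-angle online. All target names, angles, and the learning rate are hypothetical illustrations, not values from the paper; the adaptation rule shown is a simple online mean update standing in for the paper's approach, motivated by the observation that people often underturn the head and complete a gaze shift with the eyes.

```python
import numpy as np

# Hypothetical focus targets and their ideal head (pan, tilt) angles in
# degrees, as seen from one attendee's seat. Illustrative values only.
TARGETS = {
    "speaker":    np.array([-30.0,   0.0]),
    "whiteboard": np.array([ 20.0,   5.0]),
    "table":      np.array([  0.0, -25.0]),
}

def baseline_focus(head_angles):
    """Baseline: pick the target whose direction is closest to the
    observed head orientation (pan, tilt)."""
    return min(TARGETS, key=lambda t: np.linalg.norm(head_angles - TARGETS[t]))

class AdaptiveFocus:
    """Sketch of an adaptive estimator: keep a per-target mean head
    orientation and nudge it toward each new observation, so systematic
    head/gaze offsets are absorbed over time. The learning rate and
    update rule are assumptions for illustration."""

    def __init__(self, lr=0.05):
        self.means = {t: a.copy() for t, a in TARGETS.items()}
        self.lr = lr

    def classify(self, head_angles):
        best = min(self.means,
                   key=lambda t: np.linalg.norm(head_angles - self.means[t]))
        # Online adaptation: move the winning target's mean toward the
        # observation (a soft, EM-like update).
        self.means[best] += self.lr * (head_angles - self.means[best])
        return best
```

For example, a head orientation of (-28°, 2°) falls nearest the hypothetical "speaker" target under both strategies; the adaptive version additionally shifts that target's mean slightly toward the observation, so repeated systematic offsets are gradually compensated.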
© 2008 Springer-Verlag Berlin Heidelberg
Cite this paper
Voit, M., Stiefelhagen, R. (2008). Visual Focus of Attention in Dynamic Meeting Scenarios. In: Popescu-Belis, A., Stiefelhagen, R. (eds) Machine Learning for Multimodal Interaction. MLMI 2008. Lecture Notes in Computer Science, vol 5237. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-540-85853-9_1
Print ISBN: 978-3-540-85852-2
Online ISBN: 978-3-540-85853-9