Advertisement

A Generative Approach to Audio-Visual Person Tracking

  • Roberto Brunelli
  • Alessio Brutti
  • Paul Chippendale
  • Oswald Lanz
  • Maurizio Omologo
  • Piergiorgio Svaizer
  • Francesco Tobia
Part of the Lecture Notes in Computer Science book series (LNCS, volume 4122)

Abstract

This paper focuses on the integration of acoustic and visual information for people tracking. The system presented relies on a probabilistic framework within which information from multiple sources is integrated at an intermediate stage. An advantage of the method proposed is that of using a generative approach which supports easy and robust integration of multi source information by means of sampled projection instead of triangulation. The system described has been developed in the EU funded CHIL Project research activities. Experimental results from the CLEAR evaluation workshop are reported.

Keywords

Sound Source Coherence Measure Microphone Array Target Height Coherent Noise 
These keywords were added by machine and not by the authors. This process is experimental and the keywords may be updated as the learning algorithm improves.

Preview

Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.

References

  1. 1.
    CLEAR 2006 evaluation campaign. http://www.clear-evaluation.org/
  2. 2.
    Rich transcription 2005 spring meeting recognition evaluation. http://www.nist.gov/speech/tests/rt/rt2005/spring/
  3. 3.
    Brandstein, M., Ward, D.: Microphone Arrays. Springer, Heidelberg (2001)Google Scholar
  4. 4.
    Comaniciu, D., Ramesh, V., Meer, P.: Real-time tracking of non-rigid objects using mean-shift. In: Int. Conf. on Computer Vision and Pattern Recognition, vol. 2, pp. 142–149 (2000)Google Scholar
  5. 5.
    De Mori, R.: Spoken Dialogue with Computers. Academic Press, London (1998)Google Scholar
  6. 6.
    Doucet, A., de Freitas, N., Gordon, N.: Sequential Monte Carlo Methods in Practice. Springer, Heidelberg (2001)zbMATHGoogle Scholar
  7. 7.
    Isard, M., Blake, A.: Condensation – conditional density propagation for visual tracking. Int. Journal of Computer Vision 1(29), 5–28 (1998)CrossRefGoogle Scholar
  8. 8.
    Isard, M., MacCormick, J.: BraMBLe: A Bayesian multiple-blob tracker. In: Int. Conf. of Computer Vision, vol. 2, pp. 34–41 (2003)Google Scholar
  9. 9.
    Lanz, O.: Approximate bayesian multibody tracking. IEEE Trans. Pattern Analysis and Machine Intelligence, to appear (2006)Google Scholar
  10. 10.
    Omologo, M., Svaizer, P.: Acoustic event localization using a crosspower-spectrum phase based technique. In: Int. Conf. on Acoustics, Speech, and Signal Processing, vol. 2, pp. 273–276 (1994)Google Scholar
  11. 11.
    Sullivan, J., Rittscher, J.: Guiding random particles by deterministic search. In: Int. Conf. of Computer Vision, vol. 1, pp. 323–330 (2001)Google Scholar

Copyright information

© Springer Berlin Heidelberg 2007

Authors and Affiliations

  • Roberto Brunelli
    • 1
  • Alessio Brutti
    • 1
  • Paul Chippendale
    • 1
  • Oswald Lanz
    • 1
  • Maurizio Omologo
    • 1
  • Piergiorgio Svaizer
    • 1
  • Francesco Tobia
    • 1
  1. 1.ITC-irst, Via Sommarive 18, 38050 Povo di TrentoItaly

Personalised recommendations