TUT Acoustic Source Tracking System 2007

  • Conference paper

Part of the book series: Lecture Notes in Computer Science (LNIP, volume 4625)

Abstract

This paper documents the acoustic person tracking system developed by TUT, whose performance was evaluated in the CLEAR 2007 evaluation. The system is designed to track a speaker's position in a meeting room domain using only audio data. The audio data provided for the evaluation consists of recordings from multiple microphone arrays; the meeting rooms are equipped with three to seven arrays.

Speaker localization is performed by mapping pairwise cross-correlations of microphone signals into a three-dimensional likelihood field. The resulting likelihood is used as source evidence for a particle filtering algorithm. A point estimate of the speaker position is derived for each time frame from the resulting sequential process. Results indicate an 85% localization success rate with 15 cm average precision.
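The pipeline described in the abstract can be illustrated with a minimal sketch. It is not the authors' implementation: it uses plain time-domain cross-correlation (the abstract does not say which weighting, if any, is applied), sums the pairwise values to form the likelihood, assumes a random-walk motion model, and takes the mean of the resampled particles as the point estimate. All function names, the speed of sound, and the `sigma` parameter are assumptions made for illustration, and `simulate_signals` exists only to produce synthetic test data.

```python
import math
import random
from itertools import combinations

SPEED_OF_SOUND = 343.0  # m/s; assumed value, not taken from the paper


def cross_correlation(x, y, max_lag):
    """Return {lag: sum_i x[i] * y[i + lag]} for |lag| <= max_lag (in samples)."""
    n = min(len(x), len(y))
    return {
        lag: sum(x[i] * y[i + lag] for i in range(max(0, -lag), min(n, n - lag)))
        for lag in range(-max_lag, max_lag + 1)
    }


def expected_lag(point, mic_a, mic_b, fs):
    """Delay in samples of mic_b's signal relative to mic_a's for a source at `point`."""
    return round((math.dist(point, mic_b) - math.dist(point, mic_a)) / SPEED_OF_SOUND * fs)


def pairwise_correlations(signals, max_lag):
    """Cross-correlate every microphone pair once per time frame."""
    return {(a, b): cross_correlation(signals[a], signals[b], max_lag)
            for a, b in combinations(range(len(signals)), 2)}


def likelihood(point, mics, corrs, fs):
    """Sum, over pairs, the correlation value at the lag that `point` implies."""
    return sum(max(corr.get(expected_lag(point, mics[a], mics[b], fs), 0.0), 0.0)
               for (a, b), corr in corrs.items())


def particle_filter_step(particles, mics, corrs, fs, sigma=0.05, rng=random):
    """One predict / weight / resample cycle; returns (particles, point estimate)."""
    # Predict: random-walk motion model (an assumption of this sketch).
    moved = [tuple(c + rng.gauss(0.0, sigma) for c in p) for p in particles]
    # Weight each particle by the likelihood field; epsilon keeps the sum positive.
    weights = [likelihood(p, mics, corrs, fs) + 1e-12 for p in moved]
    # Resample in proportion to weight, then average for the point estimate.
    resampled = rng.choices(moved, weights=weights, k=len(moved))
    estimate = tuple(sum(p[k] for p in resampled) / len(resampled) for k in range(3))
    return resampled, estimate


def simulate_signals(source, mics, fs, n, rng):
    """Synthetic test data: one white-noise source, delayed per microphone."""
    s = [rng.gauss(0.0, 1.0) for _ in range(n)]
    delays = [round(math.dist(source, m) / SPEED_OF_SOUND * fs) for m in mics]
    return [[s[i - d] if i >= d else 0.0 for i in range(n)] for d in delays]
```

For each time frame, one would call `pairwise_correlations` on that frame's signals and pass the result to `particle_filter_step`, carrying the returned particles into the next frame. Evaluating the likelihood only at particle positions avoids filling a dense three-dimensional grid, which is one common reason to pair a likelihood field with a particle filter.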

Editor information

Rainer Stiefelhagen Rachel Bowers Jonathan Fiscus

Copyright information

© 2008 Springer-Verlag Berlin Heidelberg

About this paper

Cite this paper

Korhonen, T., Pertilä, P. (2008). TUT Acoustic Source Tracking System 2007. In: Stiefelhagen, R., Bowers, R., Fiscus, J. (eds) Multimodal Technologies for Perception of Humans. RT CLEAR 2007 2007. Lecture Notes in Computer Science, vol 4625. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-540-68585-2_8

  • DOI: https://doi.org/10.1007/978-3-540-68585-2_8

  • Publisher Name: Springer, Berlin, Heidelberg

  • Print ISBN: 978-3-540-68584-5

  • Online ISBN: 978-3-540-68585-2

  • eBook Packages: Computer Science, Computer Science (R0)
