Abstract
This paper documents the acoustic person tracking system developed by TUT, whose performance was evaluated in the CLEAR 2007 evaluation. The system is designed to track a speaker's position in a meeting room using only audio data. The audio data provided for the evaluation consists of recordings from multiple microphone arrays; each meeting room is equipped with three to seven arrays.
Speaker localization is performed by mapping pairwise cross-correlations of microphone signals into a three-dimensional likelihood field. The resulting likelihood serves as source evidence for a particle filtering algorithm, and a point estimate of the speaker position is derived for each time frame from the resulting sequential process. Results indicate an 85% localization success rate with an average precision of 15 cm.
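The pipeline described above can be illustrated with a minimal sketch. It is not the authors' implementation: the PHAT weighting, the summation of pairwise correlations, the random-walk motion model, the weighted-mean point estimate, and every name and parameter below (`gcc_phat`, `likelihood`, `pf_step`, `delayed`, the microphone layout, `sigma`, the particle count) are assumptions chosen for illustration. The demo uses synthetic data and reuses one set of correlations for every update, whereas a real system would recompute them for each time frame.

```python
import numpy as np

C = 343.0  # assumed speed of sound, m/s

def gcc_phat(x1, x2):
    """PHAT-weighted circular cross-correlation; lag 0 ends up at index len(x1) // 2."""
    n = len(x1)
    cross = np.fft.rfft(x1) * np.conj(np.fft.rfft(x2))
    cross /= np.abs(cross) + 1e-12          # keep phase only (PHAT weighting)
    return np.fft.fftshift(np.fft.irfft(cross, n))

def likelihood(points, mics, pairs, ccs, fs):
    """Sum, over microphone pairs, the correlation value at each point's expected TDOA."""
    total = np.full(len(points), 1e-9)      # small floor keeps weights positive
    for (i, j), cc in zip(pairs, ccs):
        tdoa = (np.linalg.norm(points - mics[i], axis=1)
                - np.linalg.norm(points - mics[j], axis=1)) / C
        lag = np.rint(tdoa * fs).astype(int) + len(cc) // 2
        total += np.maximum(cc[np.clip(lag, 0, len(cc) - 1)], 0.0)
    return total

def pf_step(particles, weight_fn, rng, sigma=0.05):
    """One bootstrap-filter update: random-walk predict, weight, estimate, resample."""
    particles = particles + rng.normal(0.0, sigma, particles.shape)
    w = weight_fn(particles)
    w = w / w.sum()
    estimate = w @ particles                # weighted mean as the point estimate
    keep = rng.choice(len(particles), size=len(particles), p=w)
    return particles[keep], estimate

def delayed(s, d):
    """Circularly delay s by d (possibly fractional) samples via a phase shift."""
    k = np.arange(len(s) // 2 + 1)
    return np.fft.irfft(np.fft.rfft(s) * np.exp(-2j * np.pi * k * d / len(s)), len(s))

# Synthetic demo: two hypothetical two-microphone arrays and one static source.
fs = 16000
rng = np.random.default_rng(0)
mics = np.array([[0.0, 0.0, 1.0], [0.4, 0.0, 1.0], [4.0, 3.0, 1.0], [4.0, 3.4, 1.0]])
pairs = [(0, 1), (2, 3)]
source = np.array([1.0, 2.0, 1.0])
s = rng.standard_normal(4096)
x = [delayed(s, np.linalg.norm(source - m) / C * fs) for m in mics]
ccs = [gcc_phat(x[i], x[j]) for i, j in pairs]

particles = rng.uniform([0, 0, 0], [4, 4, 2], size=(2000, 3))
for _ in range(10):
    particles, estimate = pf_step(
        particles, lambda p: likelihood(p, mics, pairs, ccs, fs), rng)
print("true:", source, "estimate:", estimate)
```

With only two microphone pairs the source is constrained to a curve rather than a single point, so the printed estimate should not be expected to match the true position closely; the three to seven arrays mentioned in the abstract would provide many more pairs and a far sharper likelihood field.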
Copyright information
© 2008 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Korhonen, T., Pertilä, P. (2008). TUT Acoustic Source Tracking System 2007. In: Stiefelhagen, R., Bowers, R., Fiscus, J. (eds) Multimodal Technologies for Perception of Humans. RT CLEAR 2007 2007. Lecture Notes in Computer Science, vol 4625. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-540-68585-2_8
DOI: https://doi.org/10.1007/978-3-540-68585-2_8
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-68584-5
Online ISBN: 978-3-540-68585-2
eBook Packages: Computer Science, Computer Science (R0)