Localization of Directional Sound Sources Supported by A Priori Information of the Acoustic Environment

Open Access
Research Article

Abstract

Speaker localization with microphone arrays has received significant attention in the past decade as a means for automated speaker tracking of individuals in a closed space for videoconferencing systems, directed speech capture systems, and surveillance systems. Traditional techniques are based on estimating the relative time difference of arrivals (TDOA) between different channels, by utilizing crosscorrelation function. As we show in the context of speaker localization, these estimates yield poor results, due to the joint effect of reverberation and the directivity of sound sources. In this paper, we present a novel method that utilizes a priori acoustic information of the monitored region, which makes it possible to localize directional sound sources by taking the effect of reverberation into account. The proposed method shows significant improvement of performance compared with traditional methods in "noise-free" condition. Further work is required to extend its capabilities to noisy environments.

Keywords

Crosscorrelation Sound Source Traditional Technique Closed Space Noisy Environment 

Publisher note

To access the full article, please see PDF.

Copyright information

© Z. Fodróczi and A. Radványi. 2008

This article is published under license to BioMed Central Ltd. This is an open access article distributed under the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.

Authors and Affiliations

  1. 1.Faculty of Information TechnologyPázmány Péter Catholic UniversityBudapestHungary
  2. 2.Analogic and Neural Computing Laboratory, Computer and Automation Research InstituteHungarian Academy of SciencesBudapestHungary

Personalised recommendations