Performance Improvement of TDOA-Based Speaker Localization in Joint Noisy and Reverberant Conditions

  • Hamid Reza Abutalebi (EURASIP Member)
  • Hossein Momenzadeh
Open Access
Research Article

Abstract

Algorithms based on time difference of arrival (TDOA) are common methods for speech source localization. The generalized cross correlation (GCC) method is the most important approach for estimating the TDOA between microphone pairs. The performance of this method degrades significantly in the presence of noise and reverberation. This paper addresses the problem of 3D localization in joint noisy and reverberant conditions in a single-speaker scenario. We first propose a modification that makes the GCC phase transform (GCC-PHAT) method robust against environmental noise. Then, we use an iterative technique that employs the location estimate to improve TDOA accuracy. Extensive experiments on both simulated and real data (in a single-source scenario) show that the proposed methods significantly improve TDOA accuracy and, consequently, source location estimates.
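For readers unfamiliar with the baseline, the following is a minimal sketch of the standard (textbook) GCC-PHAT delay estimator for one microphone pair. It is not the noise-robust modification or the iterative refinement proposed in the paper, whose details are in the full article. The function names `dft` and `gcc_phat_delay`, the `max_lag` parameter, and the small constant `eps` are illustrative assumptions, and a naive DFT is used only to keep the example dependency-free.

```python
import cmath
import math
import random


def dft(x, inverse=False):
    """Naive O(n^2) discrete Fourier transform (adequate for short signals)."""
    n = len(x)
    sign = 1 if inverse else -1
    out = []
    for k in range(n):
        acc = 0j
        for t in range(n):
            acc += x[t] * cmath.exp(sign * 2j * math.pi * k * t / n)
        out.append(acc / n if inverse else acc)
    return out


def gcc_phat_delay(sig, ref, max_lag):
    """Estimate the lag (in samples) by which `sig` trails `ref`.

    Hypothetical helper: standard GCC-PHAT, not the paper's modified method.
    """
    n = len(sig) + len(ref)  # zero-pad to avoid circular wrap-around
    a = dft(list(sig) + [0.0] * (n - len(sig)))
    b = dft(list(ref) + [0.0] * (n - len(ref)))
    # Cross-power spectrum of the two channels.
    cross = [ak * bk.conjugate() for ak, bk in zip(a, b)]
    # PHAT weighting: discard magnitude, keep only phase (eps avoids 0/0).
    eps = 1e-12
    whitened = [c / (abs(c) + eps) for c in cross]
    # Back to the lag domain; the peak marks the estimated delay.
    cc = [v.real for v in dft(whitened, inverse=True)]
    # Negative lags are stored at the end of cc, hence the modulo.
    lags = range(-max_lag, max_lag + 1)
    return max(lags, key=lambda lag: cc[lag % n])


# Usage: a synthetic noise burst reaching the second microphone 3 samples late.
rng = random.Random(0)
ref = [rng.uniform(-1.0, 1.0) for _ in range(32)]
sig = [0.0] * 3 + ref
delay = gcc_phat_delay(sig, ref, max_lag=8)
```

Dividing the returned lag by the sampling rate (an assumed quantity, not given in this abstract) yields the TDOA in seconds. The example is noise-free and anechoic, so it does not exhibit the degradation under noise and reverberation that the paper sets out to address.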

Keywords

Cross Correlation, Time Difference, Performance Improvement, Quantum Information, Source Localization

Publisher note

To access the full article, please see PDF.

Copyright information

© H. R. Abutalebi and H. Momenzadeh. 2011

This article is published under license to BioMed Central Ltd. This is an open access article distributed under the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.

Authors and Affiliations

  • Hamid Reza Abutalebi (EURASIP Member) (1, 2)
  • Hossein Momenzadeh (1)
  1. Speech Processing Research Lab (SPRL), Electrical and Computer Engineering Department, Yazd University, Yazd, Iran
  2. Idiap Research Institute, Martigny, Switzerland
