Skip to main content
Log in

Speaker Direction-of-Arrival Estimation Based on Orthogonal Dipoles

  • Published:
Circuits, Systems, and Signal Processing Aims and scope Submit manuscript

Abstract

The small aperture microphone array becomes more and more popular in the consumer electronics. However, the small aperture usually limits the performance of the traditional DoA estimation methods. The differential microphone array (DMA) has attracted much attention, recently. The DMA has the frequency-independent beampatterns owing to the small size and the dipole is one of the basic types. In this paper, we investigate the relationship between the direction-of-arrival (DoA) and the dipole beampatterns. It shows that the DoA can be directly yielded by an orthogonal dipole pair for the small aperture microphone array. Based on this relationship, we propose a speaker DoA estimation method with orthogonal dipoles (OD). The OD exhibits a good performance to DoA estimation. Nevertheless, it is vulnerable to the axial directions in the reverberant environment. To increase the robustness to the axial directions, we introduce the anti-reverberation function in OD and propose the improved OD method. Both simulations and experiments show that the proposed methods not only significantly outperform the traditional methods but also are much more computationally efficient without the spatial spectrum search.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Fig. 1
Fig. 2
Fig. 3
Fig. 4
Fig. 5

Similar content being viewed by others

References

  1. J. Allen, D. Berkley, Image method for efficiently simulating small-room acoustics. J. Acoust. Soc. Am. 65(4), 943 (1979)

    Article  Google Scholar 

  2. G. Aneeja, B. Yegnanarayana, Single frequency filtering approach for discriminating speech and nonspeech. IEEE-ACM Trans. Audio Speech Lang. Process. 23(4), 705–717 (2015)

    Article  Google Scholar 

  3. X. Anguera, C. Wooters, J. Hernando, Acoustic beamforming for speaker diarization of meetings. IEEE Trans. Audio Speech Lang. Process. 15(7), 2011–2022 (2007)

    Article  Google Scholar 

  4. M.R. Azimi-Sadjadi, N. Roseveare, A. Pezeshki, Wideband DOA estimation algorithms for multiple moving sources using unattended acoustic sensors. IEEE Trans. Aerosp. Electron. Syst. 44(4), 1585–1599 (2008)

    Article  Google Scholar 

  5. J. Benesty, J. Chen, Study and Design of Differential Microphone Arrays (Springer, Berlin, 2012)

    Google Scholar 

  6. J. Benesty, M. Souden, Y. Huang, A perspective on differential microphone arrays in the context of noise reduction. IEEE Trans. Audio Speech Lang. Process. 20(2), 699–704 (2012)

    Article  Google Scholar 

  7. E. Bezzam, R. Scheibler, J. Azcarreta, H. Pan, M. Simeoni, R. Beuchat, P. Hurley, B. Bruneau, C. Ferry, S. Kashani, Hardware and software for reproducible research in audio array signal processing, in 2017 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) (2017), pp. 6591–6592

  8. C. Blandin, A. Ozerov, E. Vincent, Multi-source TDOA estimation in reverberant audio using angular spectra and clustering. Signal Process. 92, 1950–1960 (2012)

    Article  Google Scholar 

  9. J.D. Chen, J. Benesty, C. Pan, On the design and implementation of linear differential microphone arrays. J. Acoust. Soc. Am. 136(6), 3097–3113 (2014)

    Article  Google Scholar 

  10. M. Cobos, A. Marti, J.J. Lopez, A modified SRP-PHAT functional for robust real-time sound source localization with scalable spatial sampling. IEEE Signal Process. Lett. 18(1), 71–74 (2011)

    Article  Google Scholar 

  11. S. Ding, H. Chen, DOA estimation of multiple speech sources by selecting reliable local sound intensity estimates. Appl. Acoust. 127, 336–345 (2017). https://doi.org/10.1016/j.apacoust.2017.07.002

    Article  Google Scholar 

  12. H. Do, H.F. Silverman, Y. Yu, A real-time SRP-PHAT source location implementation using stochastic region contraction (SRC) on a large-aperture microphone array, in 2007 IEEE International Conference on Acoustics, Speech and Signal Processing—ICASSP, vol. 1, pp. I-121–I-124

  13. G. Feng, L. Huawei, H. Jingchang, Z. Xin, Z. Xingshui, L. Baoqing, Y. Xiaobing, Design of a direction-of-arrival estimation method used for an automatic bearing tracking system. Sensors 16(7), 1145 (2016)

    Article  Google Scholar 

  14. J.S. Garofolo, Getting started with the darpa timit cdrom: an acoustic phonetic continuous speech database, in National Institute of Standards and Technology (NIST) (1988)

  15. F. Guo, J. Huang, X. Zhang, Y. Cheng, H. Liu, B. Li, A two-stage detection method for moving targets in the wild based on microphone array. IEEE Sens. J. 15(10), 5795–5803 (2015)

    Article  Google Scholar 

  16. https://www.bksv.com/en/DIRAC

  17. S. He, H. Chen, Closed-form DOA estimation using first-order differential microphone arrays via joint temporal-spectral-spatial processing. IEEE Sens. J. 17(4), 1558–1748 (2017)

    Article  Google Scholar 

  18. Z. Huang, G. Zhan, D. Ying, Y. Yan, Robust multiple speech source localization using time delay histogram, in 2016 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) (2016), pp. 3191–3195

  19. J.C. Jacob Benesty, I. Cohen, Design of Circular Differential Microphone Arrays (Springer, Berlin, 2015)

    Book  Google Scholar 

  20. F. Jacobsen, Sound intensity, in Sound Intensity (2014), p. 1093–1114

  21. D.M. Kitavi, K.T. Wong, M. Zou, K. Agrawal, Lower bound of the estimation error of an emitter’s direction-of-arrival/polarisation, for a collocated triad of orthogonal dipoles/loops that fail randomly. IET Microw. Antennas Propag. 11(7), 961–970 (2017)

    Article  Google Scholar 

  22. H. Krim, M. Viberg, Two decades of array signal processing research—the parametric approach. IEEE Signal Process. Mag. 13(4), 67–94 (1996)

    Article  Google Scholar 

  23. J. Krolik, D. Swingler, Multiple broad-band source location using steered covariance matrices. IEEE Trans. Acoust. Speech Signal Process. 37(10), 1481–1494 (1989)

    Article  Google Scholar 

  24. C.H. Lee, H.R.L. Lee, K.T. Wong, M. Razo, The spatial-matched-filter beam pattern of a biaxial non-orthogonal velocity sensor. J. Sound Vib. 367, 250–255 (2016)

    Article  Google Scholar 

  25. T.C. Lin, K.T. Wong, M.O. Cordel, J.P. Ilao, Beamforming pointing error of a triaxial velocity sensor under gain uncertainties. J. Acoust. Soc. Am. 140(3), 1675 (2016)

    Article  Google Scholar 

  26. A.H. Moore, C. Evers, P.A. Naylor, Direction of arrival estimation in the spherical harmonic domain using subspace pseudointensity vectors. IEEE/ACM Trans. Audio Speech Lang. Process. 25(1), 178–192 (2017)

    Article  Google Scholar 

  27. M. Muaz, Y.I. Wu, K.T. Wong, D. Su, A higher-order “figure-8” sensor and an isotropic sensor-for azimuth-elevation bivariate direction finding. J. Acoust. Soc. Am. 143(4), 2041 (2018)

    Article  Google Scholar 

  28. A. Palla, L. Fanucci, R. Sannino, M. Settin, Wearable speech enhancement system based on MEMS microphone array for disabled people, In 2015 10th International Conference on Design, Technology of Integrated Systems in Nanoscale Era (DTIS) (2015), pp. 1–5

  29. C. Pan, J.D. Chen, J. Benesty, Theoretical analysis of differential microphone array beamforming and an improved solution. IEEE-ACM Trans. Audio Speech Lang. Process. 23(11), 2093–2105 (2015)

    Article  Google Scholar 

  30. S.U. Pillai, B.H. Kwon, Performance analysis of music-type high-resolution estimators for direction finding in correlated and coherent scenes. IEEE Trans. Acoust. Speech Signal Process. 37(8), 1176–1189 (1989)

    Article  Google Scholar 

  31. R.O. Schmidt, Multiple emitter location and signal parameter estimation. IEEE Trans. Antennas Propag. 34(3), 276–280 (1986)

    Article  Google Scholar 

  32. M. Seifipour, S. Seyedtabaii, Computation saving in a SRP-PHAT sound source locator variant, in 2013 21st Iranian Conference on Electrical Engineering (ICEE), pp. 1–5

  33. E.D. Sena, H. Hacihabiboglu, Z. Cvetkovic, On the design and implementation of higher order differential microphones. IEEE Trans. Audio Speech Lang. Process. 20(1), 162–174 (2012)

    Article  Google Scholar 

  34. Y. Song, K.T. Wong, Acoustic direction finding using a spatially spread tri-axial velocity sensor. IEEE Trans. Aerosp. Electron. Syst. 51(2), 834–842 (2015)

    Article  Google Scholar 

  35. Y. Song, K.T. Wong, Closed-form direction finding using collocated but orthogonally oriented higher-order acoustic sensors. IEEE Sens. J. 12(2), 2604–2608 (2015)

    Google Scholar 

  36. Y. Song, K.T. Wong, Y. Li, Direction finding using a biaxial particle-velocity sensor. J. Sound Vib. 340, 354–367 (2015)

    Article  Google Scholar 

  37. J. Traa, D. Wingate, N.D. Stein, P. Smaragdis, Robust source localization and enhancement with a probabilistic steered response power model. IEEE/ACM Trans. Audio Speech Lang. Process. 24(3), 493–503 (2016)

    Article  Google Scholar 

  38. S. Valaee, P. Kabal, Wideband array processing using a two-sided correlation transformation. IEEE Trans. Signal Process. 43(1), 160–172 (1995)

    Article  Google Scholar 

  39. X. Zhang, J.C. Huang, E.L. Song, H.W. Liu, B.Q. Li, X.B. Yuan, Design of small MEMS microphone array systems for direction finding of outdoors moving vehicles. Sensors 14(3), 4384–4398 (2014)

    Article  Google Scholar 

  40. L. Zhao, J. Benesty, J. Chen, Design of robust differential microphone arrays. IEEE/ACM Trans. Audio Speech Lang. Process. 22(10), 1455–1466 (2014)

    Article  Google Scholar 

  41. Y. Zhuliang, S. Rahardja, DOA estimation using two closely spaced microphones, in IEEE International Symposium on Circuits and Systems, 2002 (ISCAS 2002), vol. 2 (2002), pp. II-193–II-196

  42. M. Zohourian, G. Enzner, R. Martin, Binaural speaker localization integrated into an adaptive beamformer for hearing aids. IEEE/ACM Trans. Audio Speech Lang. Process. 26(3), 515–528 (2018)

    Article  Google Scholar 

Download references

Acknowledgements

This paper is sponsored by Natural Science Foundation of Shanghai, Fund No. 14ZR1447200. The authors would like to thank the associate editor and anonymous reviewers for their valuable comments and suggestions to improve this paper. Furthermore, as the first author, I would like to thank my wife Doctor Chen Wang. The more I know about you, the more deeply I fall in love with you. Without a splendid diamond or even a grand wedding, you married me. Thanks for being with me and supporting me. Love you forever.

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Feng Guo.

Rights and permissions

Reprints and permissions

About this article

Check for updates. Verify currency and authenticity via CrossMark

Cite this article

Guo, F., Cao, Y., Huang, Z. et al. Speaker Direction-of-Arrival Estimation Based on Orthogonal Dipoles. Circuits Syst Signal Process 38, 2320–2334 (2019). https://doi.org/10.1007/s00034-018-0976-4

Download citation

  • Received:

  • Revised:

  • Accepted:

  • Published:

  • Issue Date:

  • DOI: https://doi.org/10.1007/s00034-018-0976-4

Keywords

Navigation