Skip to main content

Improved EMD Usable Speech Detection for Co-channel Speaker Identification

  • Conference paper
Advances in Nonlinear Speech Processing (NOLISP 2013)

Part of the book series: Lecture Notes in Computer Science ((LNAI,volume 7911))

Included in the following conference series:

Abstract

The speech signal is the result of many nonlinearly interacting processes; therefore any linear analysis has the potential risk of missing a great amount of information content. Recently the technique of Empirical Mode Decomposition (EMD) has been proposed as a new tool for analysis of non linear and non stationary data. This paper deals with this new tool, to detect usable speech in co-channel speech. We applied EMD analysis to decompose co-channel speech signal into intrinsic oscillatory modes. Detected usable speech segments are organized into speaker streams, which are applied to speaker identification system (SID). The system is evaluated on co-channel speech across various Targets to Interferer Ratios (TIR). Performance evaluation has shown that EMD performs better than the linear dyadic wavelet decomposition based methods for usable speech detection.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 54.99
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 72.00
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.

References

  1. Reynolds, D.A.: Speaker identification and verification using Gaussian mixture speaker models. Speech Commun. 17, 91–108 (1995)

    Article  Google Scholar 

  2. Khanwalkar, S., Smolenski, Y.: Robert E. Yantorno and S. J. Wenndt, Enhancement of speaker identification using SID usable speech. EUSIPCO (2005)

    Google Scholar 

  3. Yantorno, R.E.: Method for improving speaker identification by determining usable speech. Journal of the Acoustical Society of America 124 (2008)

    Google Scholar 

  4. Lovekin, J., Yantorno, R.E., Benincasa, S., Wenndt, S., Huggins, M.: Developing usable speech criteria for speaker identification. In: Proc. ICASSP, pp. 421–424 (2001)

    Google Scholar 

  5. Krishnamachari, K.R., Yantorno, R.E., Benincasa, D.S., Wenndt, S.J.: Spectral autocorrelation ratio as a usability measure of speech segments under cochannel conditions. In: IEEE International Symposium Intelligent Sig. Process. and Comm. Sys. (2000)

    Google Scholar 

  6. Smolenski, B.Y., Ramachandran, R.P.: Usable Speech processing: a filterless ap-proach in the presence of interference. IEEE Circuits and Systems Magazine (2011)

    Google Scholar 

  7. Kizhanatham, A., Yantorno, R.E.: Peak Difference Autocorrelation of Wavelet Trans-form Algorithm Based Usable Speech Measure. In: 7th World Multi-Conference on Systemic, Cybernetics, and Informatics (2003)

    Google Scholar 

  8. Ghezaiel, W., Ben Slimane, A., Ben Braiek, E.: Usable speech detection for speaker identification system under co-channel conditions. In: International Conference on Electrical System and Automatic Control JTEA, Tunisia (2010)

    Google Scholar 

  9. Ghezaiel, W., Ben Slimane, A., Ben Braiek, E.: Evaluation of a multi-resolution dyadic wavelet transform method for usable speech detection. World Academy of Science, Engineering and Technology Journal WASET, 829–833 (2011) pISSN 2010-376X, eISSN 2010-3778

    Google Scholar 

  10. Huang, N.E., Shen, Z., Long, S.R., et al.: The empirical mode decomposition and Hilbert spectrum for nonlinear and non-stationary time series analysis. Proc. R. Soc. Lond. A 454, 903–995 (1998)

    Article  MathSciNet  MATH  Google Scholar 

  11. Flandrin, P., Rilling, G., Goncalves, P.: Empirical mode decomposition as a filter bank. IEEE Signal Process. Letters 11(2), 112–114 (2004)

    Article  Google Scholar 

  12. Ghezaiel, W., Ben Slimane, A., Ben Braiek, E.: Usable Speech Assignment for Speaker Identification under Co-Channel Situation. International Journal of Computer Applications 59(18), 7–11 (2012)

    Article  Google Scholar 

  13. Jan, T., Wang, W.: Empirical Mode Decomposition for joint denoising and dereverberation. In: Proc. 19th European Signal Processing Conference (EUSIPCO 2011), Barcelona, Spain, August 29-September 2 (2011)

    Google Scholar 

  14. Hess, W.H.: Pitch determination of speech signal: Algorithms and devices, Springer-Verlag. Springer, Heidelberg (1983)

    Book  Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2013 Springer-Verlag Berlin Heidelberg

About this paper

Cite this paper

Ghezaiel, W., Ben Slimane, A., Ben Braiek, E. (2013). Improved EMD Usable Speech Detection for Co-channel Speaker Identification. In: Drugman, T., Dutoit, T. (eds) Advances in Nonlinear Speech Processing. NOLISP 2013. Lecture Notes in Computer Science(), vol 7911. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-38847-7_24

Download citation

  • DOI: https://doi.org/10.1007/978-3-642-38847-7_24

  • Publisher Name: Springer, Berlin, Heidelberg

  • Print ISBN: 978-3-642-38846-0

  • Online ISBN: 978-3-642-38847-7

  • eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics