Skip to main content

Part of the book series: SpringerBriefs in Electrical and Computer Engineering ((BRIEFSSPEECHTECH))

  • 731 Accesses

Abstract

This chapter introduces the concept of speaker recognition (SR) and its applications. It emphasizes on explaining the requirement of developing SR technologies that are robust towards background environments. The intermediate sections provide broad overviews of various stages associated in developing a SR system and different categories of SR. The later sections highlight the issues addressed in the book and its contributions.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 39.99
Price excludes VAT (USA)
  • Available as EPUB and PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 54.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

References

  1. J.P. Campbell, W. Shen, W.M. Campbell, R. Schwartz, J.F. Bonastre, D. Matrouf, Forensic speaker recognition. IEEE Signal Process. Mag. 26(2), 95–103 (2009)

    Article  Google Scholar 

  2. B.G.B. Fauve, D. Matrouf, N. Scheffer, J.F. Bonastre, J.S.D. Mason, State-of-the-art performance in text-independent speaker verification through open-source software. IEEE Trans. Audio Speech Lang. Process. 15(7), 1960–1968 (2007)

    Article  Google Scholar 

  3. T. Kinnunen, H. Li, An overview of text-independent speaker recognition: from features to supervectors. Speech Commun. 52, 12–40 (2010)

    Article  Google Scholar 

  4. D.V. Lancker, J. Kreiman, K. Emmorey, Familiar voice recognition: patterns and parameters Part I: recognition of backward voices. J. Phon. 13, 19–38 (1985)

    Google Scholar 

  5. A. Reich, J. Duke, Effects of selected vocal disguises on speaker identification by listening. J. Acoust. Soc. Am. 66(4), 1023–1029 (1979)

    Article  Google Scholar 

  6. G. Doddington, Speaker recognition—identifying people by their voices. Proc. IEEE 73(11), 1651–1664 (1985)

    Article  Google Scholar 

  7. J. Wolf, Efficient acoustic parameters for speaker recognition J. Acoust. Soc. Am. 6(51), 2044–2056 (1972)

    Article  Google Scholar 

  8. S. Davis, P. Mermelstein, Comparison of parametric representations for monosyllabic word recognition in continuously spoken sentences. IEEE Trans. Acoust. Speech Signal Process. 28(4), 357–366 (1980)

    Article  Google Scholar 

  9. D.A. Reynolds, Experimental evaluation of features for robust speaker identification. IEEE Trans. Speech Audio Process. 2(4), 639–643 (1994)

    Article  Google Scholar 

  10. D.A. Reynolds, R.C. Rose, Robust text-independent speaker identification using Gaussian mixture speaker models. IEEE Trans. Acoust. Speech Signal Process. 3(1), 72–83 (1995)

    Article  Google Scholar 

  11. D. Reynolds, T. Quatieri, R. Dunn, Speaker verification using adapted Gaussian mixture models. Digit. Signal Process. 10(1), 19–41 (2000)

    Article  Google Scholar 

  12. M. BenZeghiba, H. Bourland, On the combination of speech and speaker recognition, in Proceedings of the European Conference of Speech Communication and Technology (EUROSPEECH ’03), Geneva, 2003

    Google Scholar 

  13. D. Burton, Text-dependent speaker verification using vector quantization source coding. IEEE Trans. Acoust. Speech Signal Process. 35(2), 133–143 (1987)

    Article  Google Scholar 

  14. L. Rabiner, B.H. Juang, Fundamentals of Speech Recognition, 1st edn. (Prentice-Hall, Englewood Cliffs, 1993)

    Google Scholar 

  15. W. Campbell, J. Campbell, D. Reynolds, E. Singer, P. Carrasquillo, Support vector machines for speaker and language recognition. Comput. Speech Lang. 20, 210–229 (2006)

    Article  Google Scholar 

  16. K. Farrell, R. Mammone, K. Assaleh, Speaker recognition using neural networks and conventional classifiers. IEEE Trans. Speech Audio Process. 2(1), 195–204 (1994)

    Article  Google Scholar 

  17. W. Campbell, J. Campbell, D. Reynolds, Support vector machines using GMM supervectors for speaker verification. IEEE Signal Process. Lett. 13(5), 308–311 (2006)

    Article  Google Scholar 

  18. B. Yegnanarayana, S.P. Kishore, AANN: an alternative to GMM for pattern recognition. Neural Netw. 15, 456–469 (2002)

    Article  Google Scholar 

  19. F. Bimbot, J.F. Bonastre, C. Fredouille, G. Gravier, I. Magrin-Chagnolleau, S. Meignier, T. Merlin, J. Ortega-Garcia, D. Petrovska-Delacrétaz, D.A. Reynolds, A tutorial on text-independent speaker verification. EURASIP J. Adv. Signal Process. (Spec. Issue Biom. Signal Process.) 4(4), 430–451 (2004)

    Google Scholar 

  20. M. Hebert, Text-dependent speaker recognition, in Springer Handbook of Speech Processing (Springer, Berlin/Hiedelberg, 2008), pp. 743–762

    Google Scholar 

  21. P. Kenny, G. Boulianne, P. Ouellet, P. Dumouchel, Speaker and session variability in GMM-based speaker verification. IEEE Trans. Audio Speech Lang. Process. 15(4), 1448–1460 (2007)

    Article  Google Scholar 

  22. R. Vogt, S. Sridharan, Explicit modeling of session variability for speaker verification. Speech Commun. 1(22), 17–38 (2008)

    Google Scholar 

  23. J. Kahn, N. Audibert, S. Rossato, J.F. Bonastre, Intra-speaker variability effects on speaker verification performance, in The Speaker and Language Recognition Workshop (Odyssey ’10), Brno, 2010

    Google Scholar 

  24. J. Ming, T.J. Hazen, J.R. Glass, D. Reynolds, Robust speaker recognition in noisy conditions. IEEE Trans. Audio Speech Lang. Process. 15(5), 1711–1723 (2007)

    Article  Google Scholar 

  25. A. Acero, Acoustical and environmental robustness in automatic speech recognition. PhD thesis, Carnegie Mellon University, Sept 1990

    Google Scholar 

  26. P. Moreno, Speech recognition in noisy environments. PhD thesis, Electrical & Computer Engineering Department, Carnegie Mellon University, Pittsburgh, 1996

    Google Scholar 

  27. C.M. Bishop, Pattern Recognition and Machine Learning (Springer, New York, 2006)

    MATH  Google Scholar 

  28. J. Gauvain, C. Lee, Maximum a posteriori estimation for multivariate Gaussian mixture observations of Markov chains. IEEE Trans. Speech Audio Process. 2(2), 291–298 (1994)

    Article  Google Scholar 

  29. V. Hautamaki, T. Kinnunen, I. Karkkainen, M. Tuononen, J. Saastamoinen, P. Franti, Maximum a posteriori adaptation of the centroid model for speaker verification. IEEE Signal Process. Lett. 15, 162–165 (2008)

    Article  Google Scholar 

  30. R. Togneri, D. Pullella, An overview of speaker identification: accuracy and robustness issues. IEEE Circuits Syst. Mag. 11(2), 23–61 (2011)

    Article  Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Rights and permissions

Reprints and permissions

Copyright information

© 2014 The Author(s)

About this chapter

Cite this chapter

Rao, K.S., Sarkar, S. (2014). Introduction. In: Robust Speaker Recognition in Noisy Environments. SpringerBriefs in Electrical and Computer Engineering(). Springer, Cham. https://doi.org/10.1007/978-3-319-07130-5_1

Download citation

  • DOI: https://doi.org/10.1007/978-3-319-07130-5_1

  • Published:

  • Publisher Name: Springer, Cham

  • Print ISBN: 978-3-319-07129-9

  • Online ISBN: 978-3-319-07130-5

  • eBook Packages: EngineeringEngineering (R0)

Publish with us

Policies and ethics