Part of the book series: Lecture Notes in Electrical Engineering (LNEE, volume 395)

Abstract

This paper aims to improve the performance of a text-independent speaker recognition system under noisy conditions and with cross-channel recordings of the utterances. It combines Gammatone Frequency Cepstral Coefficients (GFCC), which handle the noisy environment, with i-vectors, which handle session variability. Experiments are evaluated on the NIST-2003 database.
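
As a rough illustration of the GFCC front end named above, the following minimal sketch shows two steps of a common GFCC formulation: spacing gammatone filter centre frequencies evenly on the ERB-rate scale, and turning one frame of filterbank energies into cepstral coefficients by cubic-root compression followed by a DCT-II. The function names, the number of filters, and the frequency range are assumptions made for illustration; the full text of the paper is not shown here, so this is not necessarily the authors' configuration, and the gammatone filtering itself and the i-vector stage are omitted.

```python
import math

def hz_to_erb_rate(f_hz):
    """Glasberg-Moore ERB-rate scale value for a frequency in Hz."""
    return 21.4 * math.log10(1.0 + 0.00437 * f_hz)

def erb_rate_to_hz(erb):
    """Inverse of hz_to_erb_rate."""
    return (10.0 ** (erb / 21.4) - 1.0) / 0.00437

def gammatone_center_freqs(n_filters, f_low, f_high):
    """Centre frequencies equally spaced on the ERB-rate scale (hypothetical helper)."""
    lo, hi = hz_to_erb_rate(f_low), hz_to_erb_rate(f_high)
    step = (hi - lo) / (n_filters - 1)
    return [erb_rate_to_hz(lo + i * step) for i in range(n_filters)]

def gfcc_from_energies(energies, n_coeffs):
    """Cubic-root compression, then an unnormalised DCT-II across the filter axis.

    `energies` is one frame of non-negative gammatone filterbank energies,
    assumed to have been computed elsewhere.
    """
    compressed = [e ** (1.0 / 3.0) for e in energies]
    n = len(compressed)
    return [
        sum(c * math.cos(math.pi * k * (i + 0.5) / n)
            for i, c in enumerate(compressed))
        for k in range(n_coeffs)
    ]

# Example values (assumed, not taken from the paper): 64 filters from 50 Hz to 4 kHz.
center_freqs = gammatone_center_freqs(64, 50.0, 4000.0)
```

The cubic root takes the place of the logarithm used in MFCC extraction. Per-frame coefficient vectors of this kind would then be the input to the i-vector extractor.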

Acknowledgment

The present study is part of an ongoing project on “Personal Authentication using Multimodal Behavioural Biometrics: Voice and Gait”, and the authors express their gratitude to the Department of Science & Technology, Govt. of India, for funding the project.

Author information

Corresponding author

Correspondence to Medikonda Jeevan.

Copyright information

© 2017 Springer India

About this paper

Cite this paper

Jeevan, M., Dhingra, A., Hanmandlu, M., Panigrahi, B.K. (2017). Robust Speaker Verification Using GFCC Based i-Vectors. In: Lobiyal, D., Mohapatra, D., Nagar, A., Sahoo, M. (eds) Proceedings of the International Conference on Signal, Networks, Computing, and Systems. Lecture Notes in Electrical Engineering, vol 395. Springer, New Delhi. https://doi.org/10.1007/978-81-322-3592-7_9

  • DOI: https://doi.org/10.1007/978-81-322-3592-7_9

  • Publisher Name: Springer, New Delhi

  • Print ISBN: 978-81-322-3590-3

  • Online ISBN: 978-81-322-3592-7

  • eBook Packages: Engineering, Engineering (R0)
