Robust Speaker Verification Using GFCC Based i-Vectors

Jeevan, Medikonda; Dhingra, Atul; Hanmandlu, M.; Panigrahi, B. K.

doi:10.1007/978-81-322-3592-7_9

Part of the book series: Lecture Notes in Electrical Engineering (LNEE, volume 395)

Abstract

This paper aims to improve the performance of a text-independent speaker recognition system in noisy environments and on cross-channel recordings of the utterances. It combines Gammatone Frequency Cepstral Coefficients (GFCC), which handle noisy environments, with i-vectors, which handle session variability. Experiments are evaluated on the NIST-2003 database.
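As a rough illustration of the front end named in the abstract, the sketch below computes GFCC-style features with NumPy only: a bank of ERB-spaced gammatone filters, frame-level energies, cube-root compression, and a DCT. It is a simplified sketch under stated assumptions, not the authors' implementation. The function names (`erb_space`, `gammatone_fir`, `gfcc`), the FIR approximation of the gammatone filter, and all parameter values (32 filters, 13 coefficients, 25 ms frames, 10 ms hop, a 50 to 3500 Hz band) are assumptions chosen for illustration. The i-vector stage is not covered.

```python
import numpy as np

def erb_space(low_hz, high_hz, n):
    """Centre frequencies equally spaced on the ERB-rate scale."""
    ear_q, min_bw = 9.26449, 24.7  # Glasberg and Moore constants
    lo = np.log(low_hz + ear_q * min_bw)
    hi = np.log(high_hz + ear_q * min_bw)
    return np.exp(np.linspace(lo, hi, n)) - ear_q * min_bw

def gammatone_fir(fc, fs, duration=0.025, order=4):
    """Truncated 4th-order gammatone impulse response, scaled to unit energy."""
    t = np.arange(int(round(duration * fs))) / fs
    erb = 24.7 * (4.37 * fc / 1000.0 + 1.0)   # equivalent rectangular bandwidth
    b = 1.019 * erb                            # bandwidth parameter
    g = t ** (order - 1) * np.exp(-2 * np.pi * b * t) * np.cos(2 * np.pi * fc * t)
    return g / np.sqrt(np.sum(g ** 2))

def gfcc(signal, fs, n_filters=32, n_ceps=13, frame_len=0.025, hop=0.010,
         low_hz=50.0, high_hz=3500.0):
    """Return an (n_frames, n_ceps) array of GFCC-style features (illustrative)."""
    signal = np.asarray(signal, dtype=float)
    flen, fhop = int(round(frame_len * fs)), int(round(hop * fs))
    n_frames = 1 + (len(signal) - flen) // fhop
    # Sample indices of every frame, shape (n_frames, flen)
    idx = np.arange(flen)[None, :] + fhop * np.arange(n_frames)[:, None]

    energies = np.empty((n_frames, n_filters))
    for k, fc in enumerate(erb_space(low_hz, high_hz, n_filters)):
        # Causal FIR filtering, truncated to the input length
        y = np.convolve(signal, gammatone_fir(fc, fs))[:len(signal)]
        energies[:, k] = np.mean(y[idx] ** 2, axis=1)

    compressed = np.cbrt(energies)             # cube-root loudness compression
    # Unnormalised DCT-II basis, shape (n_ceps, n_filters)
    n = np.arange(n_filters)
    basis = np.cos(np.pi * np.outer(np.arange(n_ceps), 2 * n + 1) / (2 * n_filters))
    return compressed @ basis.T

if __name__ == "__main__":
    rng = np.random.default_rng(0)
    feats = gfcc(rng.standard_normal(8000), fs=8000)  # 1 s of noise at 8 kHz
    print(feats.shape)
```

The cube-root compression and the gammatone filterbank are what distinguish this from a typical MFCC front end, which uses a mel filterbank with logarithmic compression. A practical system would normally add voice activity detection and feature normalisation before modelling.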

Acknowledgment

The present study is part of an ongoing project on “Personal Authentication using Multimodal Behavioural Biometrics: Voice and Gait”, and the authors express their gratitude to the Department of Science & Technology, Govt. of India, for funding the project.

Author information

Authors and Affiliations

Indian Institute of Technology, New Delhi, 110016, India
Medikonda Jeevan, Atul Dhingra, M. Hanmandlu & B. K. Panigrahi

Corresponding author

Correspondence to Medikonda Jeevan.

Editor information

Editors and Affiliations

School of Computer & Systems Sciences, Jawaharlal Nehru University, New Delhi, Delhi, India
Daya K. Lobiyal
Department of Computer Science and Engineering, National Institute of Technology, Rourkela, India
Durga Prasad Mohapatra
Department of Computer Science, Liverpool Hope University Faculty of Science, Liverpool, United Kingdom
Atulya Nagar
Computer Science & Engineering, National Institute of Technology, Rourkela, Odisha, India
Manmath N. Sahoo

About this paper

Cite this paper

Jeevan, M., Dhingra, A., Hanmandlu, M., Panigrahi, B.K. (2017). Robust Speaker Verification Using GFCC Based i-Vectors. In: Lobiyal, D., Mohapatra, D., Nagar, A., Sahoo, M. (eds) Proceedings of the International Conference on Signal, Networks, Computing, and Systems. Lecture Notes in Electrical Engineering, vol 395. Springer, New Delhi. https://doi.org/10.1007/978-81-322-3592-7_9

DOI: https://doi.org/10.1007/978-81-322-3592-7_9
Published: 14 October 2016
Publisher Name: Springer, New Delhi
Print ISBN: 978-81-322-3590-3
Online ISBN: 978-81-322-3592-7
eBook Packages: Engineering, Engineering (R0)
