Phase Based Mel Frequency Cepstral Coefficients for Speaker Identification

Srivastava, Sumit; Chandra, Mahesh; Sahoo, G.

doi:10.1007/978-81-322-2757-1_31

Phase Based Mel Frequency Cepstral Coefficients for Speaker Identification

Sumit Srivastava⁶,
Mahesh Chandra⁷ &
G. Sahoo⁶

Conference paper
First Online: 04 February 2016

1516 Accesses

Part of the book series: Advances in Intelligent Systems and Computing ((AISC,volume 435))

Abstract

In this paper new Phase based Mel frequency Cepstral Coefficient (PMFCC) are used for speaker identification. GMM with VQ are used as a classifier for classification of speakers. The identification performance of proposed features is compared with identification performance of MFCC features and phase features. The performance of PMFCC features has been found superior compared to MFCC features and phase features. Ten Hindi digits database of fifty speakers is used for simulation of results. This paper also explore the usefulness of phase information for speaker recognition.

This is a preview of subscription content, log in via an institution.

Chapter: USD 29.95; Price excludes VAT (USA)

eBook: USD 169.00; Price excludes VAT (USA)

Softcover Book: USD 219.99; Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Learn about institutional subscriptions

References

D.A. Reynolds, and R.C. Rose, “Robust Text-Independent Speaker Identification using Gaussian Mixture Speaker Models,” IEEE Transactions on Speech and Audio Processing, vol. 3, no. 1, pp. 74–77, January 1995.
Google Scholar
Md. Fozur Rahman Chowdhury, “Text independent distributed speaker identification and verification using GMM UBM speaker models for mobile communications,” 10th International Conference on Information Science, Signal Processing and Their Application, 2010, pp 57–60.
Google Scholar
Tomi Kinnunen, Evgeny Karpov and Pasi Franti “Real-time speaker identification and verification”, IEEE Transaction on Audio, Speech and Language Processing, Vol. 14, No. 1, pp. 277–278, 2006.
Google Scholar
L.R. Rabiner and B.H. Juang, Fundamentals of Speech Recognition, 1st ed., Pearson Education, Delhi, 2003.
Google Scholar
J. Makhoul, “Linear prediction: A tutorial review,” Proc. of IEEE, vol. 63, no. 4, pp. 561–580, 19756.
Google Scholar
R.C. Snell and F. Milinazzo, “Formant location from LPC Analysis data,” IEEE Transactions on Speech and Audio Processing, vol. 1, no. 2, pp. 129–134, Apr. 1993.
Google Scholar
S.S. McCandless, “An algorithm for automatic formant extraction using linear prediction spectra,” IEEE Trans. On Acoustic, Speech and Signal Processing, ASSP-22, No. 2, pp. 135–141, 1974.
Google Scholar
Pawan Kumar, Nitika Jakhanwal, Anirban Bhowmick, and Mahesh Chandra, “Gender Classification Using Pitch and Formants” International Conference on Communication, Computing &Security (ICCCS), February 12–14, 2011, Rourkela, Odisha, India, pp. 319–324.
Google Scholar
J.D. Markel, “Digital inverse filtering-A new tool for formant trajectory estimation,” IEEE Trans. AU-20, pp. 129–1 37, 1972.
Google Scholar
A. Holzapfel and Y. Stylianou, “Beat tracking using group delay based onset detection.” in ISMIR, 2008, pp. 653–658.
Google Scholar
M. E. P. Davies and M. Plumbley, “Context-dependent beat tracking of musical audio,” IEEE Trans. on Audio, Speech, and Language Processing, vol. 15, no. 3, pp. 1009–1020, March 2007.
Google Scholar
K. Hofbauer, G. Kubin, and W. Kleijn, “Speech watermarking for analog flat-fading bandpass channels,” IEEE Trans. on Audio, Speech, and Language Processing, vol. 17, no. 8, pp. 1624–1637, Nov. 2009.
Google Scholar
I. Saratxaga, D. Erro, I. Hernez, I. Sainz, and E. Navas, “Use of harmonic phase information for polarity detection in speech signals.” in INTERSPEECH, 2009.
Google Scholar
Munish Bhatia, Navpreet Singh, Amitpal Singh,” Speaker Accent Recognition by MFCC Using KNearest Neighbour Algorithm: A Different Approach”, in IJARCCE.2015.
Google Scholar
Sumit Srivastava, Pratibha Nandi, G. Sahoo, Mahesh Chandra,” Formant Based Linear Prediction Coefficients for Speaker Identification”, SPIN 2014.
Google Scholar

Download references

Author information

Authors and Affiliations

Department of Computer Science & Enginnering, BIT Mesra, Ranchi, India
Sumit Srivastava & G. Sahoo
Department of Electronics & Communication Enginnering, BIT Mesra, Ranchi, India
Mahesh Chandra

Authors

Sumit Srivastava
View author publications
You can also search for this author in PubMed Google Scholar
Mahesh Chandra
View author publications
You can also search for this author in PubMed Google Scholar
G. Sahoo
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Sumit Srivastava .

Editor information

Editors and Affiliations

Department of Computer Science Engineering, Anil Neerukonda Institute of Technology and Sciences, Visakhapatnam, India
Suresh Chandra Satapathy
Kalyani University, Nadia, West Bengal, India
Jyotsna Kumar Mandal
University of Hyderabad, Hyderabad, India
Siba K. Udgata
Department of Electronics and Communication Engineering, Shri Ramswaroop Memorial Group of Professional Colleges, Lucknow, Uttar Pradesh, India
Vikrant Bhateja

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Srivastava, S., Chandra, M., Sahoo, G. (2016). Phase Based Mel Frequency Cepstral Coefficients for Speaker Identification. In: Satapathy, S., Mandal, J., Udgata, S., Bhateja, V. (eds) Information Systems Design and Intelligent Applications. Advances in Intelligent Systems and Computing, vol 435. Springer, New Delhi. https://doi.org/10.1007/978-81-322-2757-1_31

Download citation

DOI: https://doi.org/10.1007/978-81-322-2757-1_31
Published: 04 February 2016
Publisher Name: Springer, New Delhi
Print ISBN: 978-81-322-2756-4
Online ISBN: 978-81-322-2757-1
eBook Packages: EngineeringEngineering (R0)

Publish with us

Policies and ethics