Abstract
Speaker recognition is a process of recognizing a person based on their unique voice signals and it is a topic of great importance in areas of intelligent and security. Considerable research and development has been carried out to extract speaker specific features and to develop features matching techniques. The goal of this paper is to perform text-independent speaker identification. These models rely on Mel Frequency Cepstral Coefficients (MFCC) for extraction of speaker specific features and for speaker modelling Vector Quantization (VQ) is used due to high accuracy and simplicity. The proposed system efficiency was analyzed by using 20 filter banks for extracting features. The performance was evaluated using MATLAB against different speakers in different languages such as Tamil, Malayalam, Hindi, Telugu and English with duration of 2, 3 and 4 s. Experimental result shows that 4 s duration of speech regardless of language is able to produce 98 %, 99 % and 97 % of identification when compared to 2 and 3 s. The system efficiency may further be improved using other speaker modelling techniques like Neural Network, Hidden Markov Model and Gaussian Mixture Model.
This is a preview of subscription content, log in via an institution.
Buying options
Tax calculation will be finalised at checkout
Purchases are for personal use only
Learn about institutional subscriptionsReferences
S. Furui, Speaker-independent and speaker-adaptive recognition techniques, in Advances in Speech Signal Processing, ed. by S. Furui, M.M. Sondhi (Marcel Dekker, New York, 1991), pp. 597–622
S. Furui, Recent advances in speaker recognition, in Proceedings of the First International Conference on Audio- and Video-Based Biometric Person Authentication, 1997, pp. 237–252
S. Furui, Digital Speech Processing, Synthesis, and Recognition, 2nd edn. (Marcel Dekker, New York, 2000)
D.A. Reynolds, Automatic Speaker Recognition: Current Approaches and Future Trends (MIT Lincoln Laboratory, Lexington, 2006)
M. Sigmund, Voice Recognition by Computer (Tectum Verlag DE, Marburg, 2003)
H.S. Jayanna, S.R.M. Prasanna, Analysis, feature extraction, modeling and testing techniques for speaker recognition. IETE Tech. Rev. 26, 181–190 (2009)
A.N. Sigappi, S. Palanivel, Spoken word recognition strategy for Tamil language. IJCSI Int. J. Comput. Sci. Issues, 9(1), No. 3 (2012). ISSN 1694-0814
M.G. Sumithra, K. Thanuskodi, A new speaker recognition system with combined feature extraction techniques. J. Comput. Sci., 7(4), 459–465 (2011), Science Publications. ISSN 1549-3636
Y. Goto, T. Akatsu et al., An investigation on speaker vector-based speaker identification under noisy conditions, in Proceedings of the International Conference on Audio, Language and Image Processing, IEEE Xplore, pp. 1430–1435
S. Menon, M. Lech, N. Maddage, Speaker verification based on different vector quantization techniques with Gaussian mixture models, in Proceedings of the 3rd International Conference on Network and System Security, pp. 403–408
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2014 Springer International Publishing Switzerland
About this paper
Cite this paper
Ambika, D., Radha, V. (2014). Vector Quantization in Language Independent Speaker Identification Using Mel-Frequency Cepstrum Co-efficient. In: Meghanathan, N., Nagamalai, D., Rajasekaran, S. (eds) Networks and Communications (NetCom2013). Lecture Notes in Electrical Engineering, vol 284. Springer, Cham. https://doi.org/10.1007/978-3-319-03692-2_14
Download citation
DOI: https://doi.org/10.1007/978-3-319-03692-2_14
Published:
Publisher Name: Springer, Cham
Print ISBN: 978-3-319-03691-5
Online ISBN: 978-3-319-03692-2
eBook Packages: EngineeringEngineering (R0)