Vector Quantization in Language Independent Speaker Identification Using Mel-Frequency Cepstrum Co-efficient

Ambika, D.; Radha, V.

doi:10.1007/978-3-319-03692-2_14

Vector Quantization in Language Independent Speaker Identification Using Mel-Frequency Cepstrum Co-efficient

D. Ambika⁴ &
V. Radha⁴

Conference paper
First Online: 01 January 2014

702 Accesses

Part of the book series: Lecture Notes in Electrical Engineering ((LNEE,volume 284))

Abstract

Speaker recognition is a process of recognizing a person based on their unique voice signals and it is a topic of great importance in areas of intelligent and security. Considerable research and development has been carried out to extract speaker specific features and to develop features matching techniques. The goal of this paper is to perform text-independent speaker identification. These models rely on Mel Frequency Cepstral Coefficients (MFCC) for extraction of speaker specific features and for speaker modelling Vector Quantization (VQ) is used due to high accuracy and simplicity. The proposed system efficiency was analyzed by using 20 filter banks for extracting features. The performance was evaluated using MATLAB against different speakers in different languages such as Tamil, Malayalam, Hindi, Telugu and English with duration of 2, 3 and 4 s. Experimental result shows that 4 s duration of speech regardless of language is able to produce 98 %, 99 % and 97 % of identification when compared to 2 and 3 s. The system efficiency may further be improved using other speaker modelling techniques like Neural Network, Hidden Markov Model and Gaussian Mixture Model.

This is a preview of subscription content, log in via an institution.

Chapter: USD 29.95; Price excludes VAT (USA)

eBook: USD 169.00; Price excludes VAT (USA)

Softcover Book: USD 219.99; Price excludes VAT (USA)

Hardcover Book: USD 219.99; Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Learn about institutional subscriptions

References

S. Furui, Speaker-independent and speaker-adaptive recognition techniques, in Advances in Speech Signal Processing, ed. by S. Furui, M.M. Sondhi (Marcel Dekker, New York, 1991), pp. 597–622
Google Scholar
S. Furui, Recent advances in speaker recognition, in Proceedings of the First International Conference on Audio- and Video-Based Biometric Person Authentication, 1997, pp. 237–252
Google Scholar
S. Furui, Digital Speech Processing, Synthesis, and Recognition, 2nd edn. (Marcel Dekker, New York, 2000)
Google Scholar
D.A. Reynolds, Automatic Speaker Recognition: Current Approaches and Future Trends (MIT Lincoln Laboratory, Lexington, 2006)
Google Scholar
M. Sigmund, Voice Recognition by Computer (Tectum Verlag DE, Marburg, 2003)
Google Scholar
H.S. Jayanna, S.R.M. Prasanna, Analysis, feature extraction, modeling and testing techniques for speaker recognition. IETE Tech. Rev. 26, 181–190 (2009)
Article Google Scholar
A.N. Sigappi, S. Palanivel, Spoken word recognition strategy for Tamil language. IJCSI Int. J. Comput. Sci. Issues, 9(1), No. 3 (2012). ISSN 1694-0814
Google Scholar
M.G. Sumithra, K. Thanuskodi, A new speaker recognition system with combined feature extraction techniques. J. Comput. Sci., 7(4), 459–465 (2011), Science Publications. ISSN 1549-3636
Google Scholar
Y. Goto, T. Akatsu et al., An investigation on speaker vector-based speaker identification under noisy conditions, in Proceedings of the International Conference on Audio, Language and Image Processing, IEEE Xplore, pp. 1430–1435
Google Scholar
S. Menon, M. Lech, N. Maddage, Speaker verification based on different vector quantization techniques with Gaussian mixture models, in Proceedings of the 3rd International Conference on Network and System Security, pp. 403–408
Google Scholar

Download references

Author information

Authors and Affiliations

Department of Computer Science, Avinashilingam Institute for Home Science and Higher Education for Women, Coimbatore, India
D. Ambika & V. Radha

Authors

D. Ambika
View author publications
You can also search for this author in PubMed Google Scholar
V. Radha
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to D. Ambika .

Editor information

Editors and Affiliations

Department of Computer Science, Jackson State University, Jackson, Mississippi, USA
Natarajan Meghanathan
Wireilla Net Solutions PTY Ltd, Melbourne, Victoria, Australia
Dhinaharan Nagamalai
Dept. of CSE, University of Connecticut, Storrs, Connecticut, USA
Sanguthevar Rajasekaran

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Ambika, D., Radha, V. (2014). Vector Quantization in Language Independent Speaker Identification Using Mel-Frequency Cepstrum Co-efficient. In: Meghanathan, N., Nagamalai, D., Rajasekaran, S. (eds) Networks and Communications (NetCom2013). Lecture Notes in Electrical Engineering, vol 284. Springer, Cham. https://doi.org/10.1007/978-3-319-03692-2_14

Download citation

DOI: https://doi.org/10.1007/978-3-319-03692-2_14
Published: 17 January 2014
Publisher Name: Springer, Cham
Print ISBN: 978-3-319-03691-5
Online ISBN: 978-3-319-03692-2
eBook Packages: EngineeringEngineering (R0)

Publish with us

Policies and ethics