Abstract
This paper addresses the problem of recognizing spoken Kannada words. The designed algorithm recognizes spoken Kannada words independent of speakers. The proposed method normalizes the original speech signal of every isolated word and extracts Linear-Predictive coding (LPC) coefficients, and converts them into Real Cepstrum Coefficient. These Real Cepstrum Coefficient values are subjected to dimensionality reduction through normal fit. These coefficients are used as the representatives of each spoken word. Euclidian distance measure is then used to compute the distance between the test samples to the model data in the database. The model datum in the database at a minimum distance is declared as the recognized word. For experimentation, we have used 294 unique Kannada words. Each of these words was recorded with 10 Speakers yielding 2,940 samples in total. Out of 10 speakers’ data, 8 speakers’ data i.e., 2,352 samples were used to compute the representative co-efficient for each word. Remaining 2 speakers’ data along with re-recorded data of two speakers out of the 8 speakers is used for testing. Totally 2,352 signals are used for training and 1,176 signals are used for testing. The success rate of the proposed system- known speaker data is 98.29 % and unknown speaker data is 91.66 %.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
References
Hemakumar G (2007) A study on hidden markov model for speech recognition. Submitted for the award of M. Phil in Computer Science, Bharathiar University, during Nov 2007
Rabiner L, Jung B-H (1993) Fundamentals of speech recognition. Pearson Education (Singapore) Private Limited, Indian Branch, 482 F.I.E Patpargans, Delhi 110092, India
Swarna Lata, Country Manager, W3C India and Head, TDIL Programme, Department of Information Technology, Government of India (2011) Challenges of multilingual web in India: technology development and standardization perspective. Reported-2011
Rajewara Rao R et al. JNT University, Hyderabad, India (2007) Text-dependent speaker recognition system for Indian languages. IJCSNS 7(11), Nov-2007
Kumar K, Aggarwal RK, Department of Computer Enginerring, National Institute of Technology, Kurukshetra (2011) Hindi speech recognition system using HTK. Int J Comput Bus Res ISSN (Online) 2(2-May issue):2229–6166
Anusuya1 MA, Katti SK (2010) Mel frequency discrete wavelet coefficients for Kannada speech recognition using PCA. In: Proceedings of international conference on advances in computer science 2010
Rajput N, Verma A, Neti C, NCC, Mumbai (2002) A large vocabulary continuous speech recognition system for Hindi. 26–27 Jan 2002
Rao PVS (1993) VOICE: an integrated speech recognition synthesis system for the Hindi language. Speech Commun 13:197–205
Tan P-N, Steinbach M, Kumar V (2009) Introduction to data mining. Dorling Kindersley (India) Pvt. Ltd., Licensees of Pearson Education in South Asia, 4th Impression, 2009
Matlab R2009a help menu
Quatieri TF (2002) Discrete-time speech signal processing principles and practice. Pearson Education (Singapore) Private. Ltd, Indian Branch, 482 F. I. E Patparganj, Delhi 110092, India
Saeed K, Nammous MK (2007) A speech-and-speaker identification system: feature extraction, description, and classification of speech-signal image. IEEE Trans Ind Electron 54(2)
Umesh S (2010) Automatic speech recognition-research and standards. Department of Electrical Engineering, IIT, Madras, May 7th 2010
Three day’s workshop on “Hands on experience in Sphinx and HTK for Speech Recognition” held on Feb-2011 at AU-KBC research center, MIT campus, Chennai
Acknowledgments
The author would like to thank for all my friends who supported me in preparing the speech database and developing Kannada word list of covering all phonemes of the language, reviewers and Editorial staff for their efforts in preparation of this paper.
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2013 Springer India
About this paper
Cite this paper
Hemakumar, G., Punitha, P. (2013). Speaker Independent Isolated Kannada Word Recognizer. In: Swamy, P., Guru, D. (eds) Multimedia Processing, Communication and Computing Applications. Lecture Notes in Electrical Engineering, vol 213. Springer, New Delhi. https://doi.org/10.1007/978-81-322-1143-3_27
Download citation
DOI: https://doi.org/10.1007/978-81-322-1143-3_27
Published:
Publisher Name: Springer, New Delhi
Print ISBN: 978-81-322-1142-6
Online ISBN: 978-81-322-1143-3
eBook Packages: EngineeringEngineering (R0)