Speaker Independent Isolated Kannada Word Recognizer

Hemakumar, G.; Punitha, P.

doi:10.1007/978-81-322-1143-3_27

G. Hemakumar³ &
P. Punitha⁴

Part of the book series: Lecture Notes in Electrical Engineering ((LNEE,volume 213))

926 Accesses
1 Citations

Abstract

This paper addresses the problem of recognizing spoken Kannada words. The designed algorithm recognizes spoken Kannada words independent of speakers. The proposed method normalizes the original speech signal of every isolated word and extracts Linear-Predictive coding (LPC) coefficients, and converts them into Real Cepstrum Coefficient. These Real Cepstrum Coefficient values are subjected to dimensionality reduction through normal fit. These coefficients are used as the representatives of each spoken word. Euclidian distance measure is then used to compute the distance between the test samples to the model data in the database. The model datum in the database at a minimum distance is declared as the recognized word. For experimentation, we have used 294 unique Kannada words. Each of these words was recorded with 10 Speakers yielding 2,940 samples in total. Out of 10 speakers’ data, 8 speakers’ data i.e., 2,352 samples were used to compute the representative co-efficient for each word. Remaining 2 speakers’ data along with re-recorded data of two speakers out of the 8 speakers is used for testing. Totally 2,352 signals are used for training and 1,176 signals are used for testing. The success rate of the proposed system- known speaker data is 98.29 % and unknown speaker data is 91.66 %.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Log in via an institution

Chapter: USD 29.95; Price excludes VAT (USA)

eBook: USD 169.00; Price excludes VAT (USA)

Softcover Book: USD 219.99; Price excludes VAT (USA)

Hardcover Book: USD 219.99; Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

References

Hemakumar G (2007) A study on hidden markov model for speech recognition. Submitted for the award of M. Phil in Computer Science, Bharathiar University, during Nov 2007
Google Scholar
Rabiner L, Jung B-H (1993) Fundamentals of speech recognition. Pearson Education (Singapore) Private Limited, Indian Branch, 482 F.I.E Patpargans, Delhi 110092, India
Google Scholar
http://en.wikipedia.org/wiki/Speech_recognition
Swarna Lata, Country Manager, W3C India and Head, TDIL Programme, Department of Information Technology, Government of India (2011) Challenges of multilingual web in India: technology development and standardization perspective. Reported-2011
Google Scholar
Rajewara Rao R et al. JNT University, Hyderabad, India (2007) Text-dependent speaker recognition system for Indian languages. IJCSNS 7(11), Nov-2007
Google Scholar
Kumar K, Aggarwal RK, Department of Computer Enginerring, National Institute of Technology, Kurukshetra (2011) Hindi speech recognition system using HTK. Int J Comput Bus Res ISSN (Online) 2(2-May issue):2229–6166
Google Scholar
Anusuya1 MA, Katti SK (2010) Mel frequency discrete wavelet coefficients for Kannada speech recognition using PCA. In: Proceedings of international conference on advances in computer science 2010
Google Scholar
Rajput N, Verma A, Neti C, NCC, Mumbai (2002) A large vocabulary continuous speech recognition system for Hindi. 26–27 Jan 2002
Google Scholar
Rao PVS (1993) VOICE: an integrated speech recognition synthesis system for the Hindi language. Speech Commun 13:197–205
Google Scholar
Tan P-N, Steinbach M, Kumar V (2009) Introduction to data mining. Dorling Kindersley (India) Pvt. Ltd., Licensees of Pearson Education in South Asia, 4th Impression, 2009
Google Scholar
Matlab R2009a help menu
Google Scholar
Quatieri TF (2002) Discrete-time speech signal processing principles and practice. Pearson Education (Singapore) Private. Ltd, Indian Branch, 482 F. I. E Patparganj, Delhi 110092, India
Google Scholar
Saeed K, Nammous MK (2007) A speech-and-speaker identification system: feature extraction, description, and classification of speech-signal image. IEEE Trans Ind Electron 54(2)
Google Scholar
Umesh S (2010) Automatic speech recognition-research and standards. Department of Electrical Engineering, IIT, Madras, May 7th 2010
Google Scholar
Three day’s workshop on “Hands on experience in Sphinx and HTK for Speech Recognition” held on Feb-2011 at AU-KBC research center, MIT campus, Chennai
Google Scholar

Download references

Acknowledgments

The author would like to thank for all my friends who supported me in preparing the speech database and developing Kannada word list of covering all phonemes of the language, reviewers and Editorial staff for their efforts in preparation of this paper.

Author information

Authors and Affiliations

Department of Computer Science, Government College for Women, Mandya, India
G. Hemakumar
Department of MCA, PESIT, Banashankari 3rd Stage, 100 Feet Ring Road, Bangalore, India
P. Punitha

Authors

G. Hemakumar
View author publications
You can also search for this author in PubMed Google Scholar
P. Punitha
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to G. Hemakumar .

Editor information

Editors and Affiliations

Master of Computer Applications, PES Institute of Technology, Banashankari 3rd stage, Near Hoskerehalli Cross 100 Feet, Bangalore, 560085, Karnataka, India
Punitha P. Swamy
Studies in Computer Science, University of Mysore, Manasagangotri, Mysore, 570006, Karnataka, India
Devanur S. Guru

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Hemakumar, G., Punitha, P. (2013). Speaker Independent Isolated Kannada Word Recognizer. In: Swamy, P., Guru, D. (eds) Multimedia Processing, Communication and Computing Applications. Lecture Notes in Electrical Engineering, vol 213. Springer, New Delhi. https://doi.org/10.1007/978-81-322-1143-3_27

Download citation

DOI: https://doi.org/10.1007/978-81-322-1143-3_27
Published: 26 May 2013
Publisher Name: Springer, New Delhi
Print ISBN: 978-81-322-1142-6
Online ISBN: 978-81-322-1143-3
eBook Packages: EngineeringEngineering (R0)

Publish with us

Policies and ethics