Skip to main content

Speaker Independent Isolated Kannada Word Recognizer

  • Conference paper
  • First Online:
Multimedia Processing, Communication and Computing Applications

Part of the book series: Lecture Notes in Electrical Engineering ((LNEE,volume 213))

Abstract

This paper addresses the problem of recognizing spoken Kannada words. The designed algorithm recognizes spoken Kannada words independent of speakers. The proposed method normalizes the original speech signal of every isolated word and extracts Linear-Predictive coding (LPC) coefficients, and converts them into Real Cepstrum Coefficient. These Real Cepstrum Coefficient values are subjected to dimensionality reduction through normal fit. These coefficients are used as the representatives of each spoken word. Euclidian distance measure is then used to compute the distance between the test samples to the model data in the database. The model datum in the database at a minimum distance is declared as the recognized word. For experimentation, we have used 294 unique Kannada words. Each of these words was recorded with 10 Speakers yielding 2,940 samples in total. Out of 10 speakers’ data, 8 speakers’ data i.e., 2,352 samples were used to compute the representative co-efficient for each word. Remaining 2 speakers’ data along with re-recorded data of two speakers out of the 8 speakers is used for testing. Totally 2,352 signals are used for training and 1,176 signals are used for testing. The success rate of the proposed system- known speaker data is 98.29 % and unknown speaker data is 91.66 %.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 169.00
Price excludes VAT (USA)
  • Available as EPUB and PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 219.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info
Hardcover Book
USD 219.99
Price excludes VAT (USA)
  • Durable hardcover edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

References

  1. Hemakumar G (2007) A study on hidden markov model for speech recognition. Submitted for the award of M. Phil in Computer Science, Bharathiar University, during Nov 2007

    Google Scholar 

  2. Rabiner L, Jung B-H (1993) Fundamentals of speech recognition. Pearson Education (Singapore) Private Limited, Indian Branch, 482 F.I.E Patpargans, Delhi 110092, India

    Google Scholar 

  3. http://en.wikipedia.org/wiki/Speech_recognition

  4. Swarna Lata, Country Manager, W3C India and Head, TDIL Programme, Department of Information Technology, Government of India (2011) Challenges of multilingual web in India: technology development and standardization perspective. Reported-2011

    Google Scholar 

  5. Rajewara Rao R et al. JNT University, Hyderabad, India (2007) Text-dependent speaker recognition system for Indian languages. IJCSNS 7(11), Nov-2007

    Google Scholar 

  6. Kumar K, Aggarwal RK, Department of Computer Enginerring, National Institute of Technology, Kurukshetra (2011) Hindi speech recognition system using HTK. Int J Comput Bus Res ISSN (Online) 2(2-May issue):2229–6166

    Google Scholar 

  7. Anusuya1 MA, Katti SK (2010) Mel frequency discrete wavelet coefficients for Kannada speech recognition using PCA. In: Proceedings of international conference on advances in computer science 2010

    Google Scholar 

  8. Rajput N, Verma A, Neti C, NCC, Mumbai (2002) A large vocabulary continuous speech recognition system for Hindi. 26–27 Jan 2002

    Google Scholar 

  9. Rao PVS (1993) VOICE: an integrated speech recognition synthesis system for the Hindi language. Speech Commun 13:197–205

    Google Scholar 

  10. Tan P-N, Steinbach M, Kumar V (2009) Introduction to data mining. Dorling Kindersley (India) Pvt. Ltd., Licensees of Pearson Education in South Asia, 4th Impression, 2009

    Google Scholar 

  11. Matlab R2009a help menu

    Google Scholar 

  12. Quatieri TF (2002) Discrete-time speech signal processing principles and practice. Pearson Education (Singapore) Private. Ltd, Indian Branch, 482 F. I. E Patparganj, Delhi 110092, India

    Google Scholar 

  13. Saeed K, Nammous MK (2007) A speech-and-speaker identification system: feature extraction, description, and classification of speech-signal image. IEEE Trans Ind Electron 54(2)

    Google Scholar 

  14. Umesh S (2010) Automatic speech recognition-research and standards. Department of Electrical Engineering, IIT, Madras, May 7th 2010

    Google Scholar 

  15. Three day’s workshop on “Hands on experience in Sphinx and HTK for Speech Recognition” held on Feb-2011 at AU-KBC research center, MIT campus, Chennai

    Google Scholar 

Download references

Acknowledgments

The author would like to thank for all my friends who supported me in preparing the speech database and developing Kannada word list of covering all phonemes of the language, reviewers and Editorial staff for their efforts in preparation of this paper.

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to G. Hemakumar .

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2013 Springer India

About this paper

Cite this paper

Hemakumar, G., Punitha, P. (2013). Speaker Independent Isolated Kannada Word Recognizer. In: Swamy, P., Guru, D. (eds) Multimedia Processing, Communication and Computing Applications. Lecture Notes in Electrical Engineering, vol 213. Springer, New Delhi. https://doi.org/10.1007/978-81-322-1143-3_27

Download citation

  • DOI: https://doi.org/10.1007/978-81-322-1143-3_27

  • Published:

  • Publisher Name: Springer, New Delhi

  • Print ISBN: 978-81-322-1142-6

  • Online ISBN: 978-81-322-1143-3

  • eBook Packages: EngineeringEngineering (R0)

Publish with us

Policies and ethics