Skip to main content

Isolated Tamil Digit Speech Recognition Using Template-Based and HMM-Based Approaches

  • Conference paper
Global Trends in Information Systems and Software Applications (ObCom 2011)

Part of the book series: Communications in Computer and Information Science ((CCIS,volume 270))

Included in the following conference series:

Abstract

For more than three decades, a great amount of research was carried out on various aspects of speech signal processing and its applications. Highly successful application of speech processing is Automatic Speech Recognition (ASR). Early attempts to ASR consisted of making deterministic models of whole words in a small vocabulary and recognizing a given speech utterance as the word whose model comes closest to it. The introduction of Hidden Markov Models (HMMs) in the early 1980 provided much more powerful tool for speech recognition. And the recognition can be done for continuous speech using large vocabulary, in a speaker independent manner. Two approaches like conventional template-based and Hidden Markov Model usually performs speaker independent isolated word recognition. In this work, speaker independent isolated Tamil digit speech recognizers are designed by employing template based and HMM based approaches. The results of the approaches are compared and observed that HMM based model performs well and the word error rate is greatly reduced.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 84.99
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 109.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.

References

  1. Jurafsky, D., Martin, J.H.: Speech and Language Processing - An Introduction to Natural Language Processing, Computational Linguistics, and Speech Recognition. Pearson Education (2002)

    Google Scholar 

  2. Terissi, L.D., Gomez, J.C.: Template-based and HMM-based Approaches for Isolated Spanish Digit Recognition. Intelligencia Artificial.Revista lberoamericana de Intelligencia Artificial 9(26) (2005)

    Google Scholar 

  3. Satori, H., Harti, M., Chenfour, N.: Arabic Speech Recognition System based on CMUSphinx. In: International Symposium on Computational Intelligence and Intelligent Informatics (March 2007)

    Google Scholar 

  4. Rabiner, L., Juang, B.-H.: Fundamentals of Speech Recognition. Prentice-Hall, Inc., Engelwood (1993)

    Google Scholar 

  5. Kamm, T., Hermansky, H., Andreou, A.G.: Learning the Mel-scale and Optimal VTN Mapping. In: Center for Language and Speech Processing, Workshop (WS 1997). Johns Hopkins University (1997)

    Google Scholar 

  6. Hornback, J.R., Lieutenant, S.: Speech Recognition Using The Mellin Transform, MS Thesis report, Air Force Instituite of Technology, Wright-Patterson Air Force Base, Ohio (2006)

    Google Scholar 

  7. Li, D., Strik, H.: Structure-Based and Template-Based Automatic Speech Recognition-Comparing parametric and non-parametric approaches. Microsoft Research, One Microsoft Way, Redmond, WA, USA, CLST, Department of Linguistics, Radboud University, Nijmegen

    Google Scholar 

  8. Hachkar, Z., Farchi, A., Mounir, B., El Abbadi, J.: A Comparison of DHMM and DTW for Isolated Digit Recognition System for Arabic Language. International Journal of Computer Science and Engineering 3(3) (March 2011); ISSN : 0975-3397

    Google Scholar 

  9. Jacob, B., Sondhi, M.M., Huang, Y.: Springer Handbook of Speech Processing, XXXVI (2008)

    Google Scholar 

  10. Rabiner, L.R.: A Tutorial on Hidden Markov Models and Selected Applications in Speech Recognition. Proceedings of the IEEE 77(2), 257–286 (1989)

    Article  Google Scholar 

  11. Thangarajan, R., Natarajan, A.M., Selvam, M.: Word and Triphone based approaches. Continuous Speech Recognition for Tamil Language 4(3) (March 2008)

    Google Scholar 

  12. Anusuya, M.A., Katti, S.K.: Speech Recognition by Machine: A Review. International Journal of Computer Science and Information Security 6(3), 181–205 (2009)

    Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2012 Springer-Verlag Berlin Heidelberg

About this paper

Cite this paper

Karpagavalli, S., Deepika, R., Kokila, P., Usha Rani, K., Chandra, E. (2012). Isolated Tamil Digit Speech Recognition Using Template-Based and HMM-Based Approaches. In: Krishna, P.V., Babu, M.R., Ariwa, E. (eds) Global Trends in Information Systems and Software Applications. ObCom 2011. Communications in Computer and Information Science, vol 270. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-29216-3_48

Download citation

  • DOI: https://doi.org/10.1007/978-3-642-29216-3_48

  • Publisher Name: Springer, Berlin, Heidelberg

  • Print ISBN: 978-3-642-29215-6

  • Online ISBN: 978-3-642-29216-3

  • eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics