Isolated Tamil Digit Speech Recognition Using Template-Based and HMM-Based Approaches

Karpagavalli, S.; Deepika, R.; Kokila, P.; Usha Rani, K.; Chandra, E.

doi:10.1007/978-3-642-29216-3_48

S. Karpagavalli⁴,
R. Deepika⁴,
P. Kokila⁴,
K. Usha Rani⁴ &
…
E. Chandra⁵

Part of the book series: Communications in Computer and Information Science ((CCIS,volume 270))

Included in the following conference series:

International Conference on Computing and Communication Systems

2592 Accesses
1 Citations

Abstract

For more than three decades, a great amount of research was carried out on various aspects of speech signal processing and its applications. Highly successful application of speech processing is Automatic Speech Recognition (ASR). Early attempts to ASR consisted of making deterministic models of whole words in a small vocabulary and recognizing a given speech utterance as the word whose model comes closest to it. The introduction of Hidden Markov Models (HMMs) in the early 1980 provided much more powerful tool for speech recognition. And the recognition can be done for continuous speech using large vocabulary, in a speaker independent manner. Two approaches like conventional template-based and Hidden Markov Model usually performs speaker independent isolated word recognition. In this work, speaker independent isolated Tamil digit speech recognizers are designed by employing template based and HMM based approaches. The results of the approaches are compared and observed that HMM based model performs well and the word error rate is greatly reduced.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Log in via an institution

Chapter: USD 29.95; Price excludes VAT (USA)

eBook: USD 84.99; Price excludes VAT (USA)

Softcover Book: USD 109.99; Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

References

Jurafsky, D., Martin, J.H.: Speech and Language Processing - An Introduction to Natural Language Processing, Computational Linguistics, and Speech Recognition. Pearson Education (2002)
Google Scholar
Terissi, L.D., Gomez, J.C.: Template-based and HMM-based Approaches for Isolated Spanish Digit Recognition. Intelligencia Artificial.Revista lberoamericana de Intelligencia Artificial 9(26) (2005)
Google Scholar
Satori, H., Harti, M., Chenfour, N.: Arabic Speech Recognition System based on CMUSphinx. In: International Symposium on Computational Intelligence and Intelligent Informatics (March 2007)
Google Scholar
Rabiner, L., Juang, B.-H.: Fundamentals of Speech Recognition. Prentice-Hall, Inc., Engelwood (1993)
Google Scholar
Kamm, T., Hermansky, H., Andreou, A.G.: Learning the Mel-scale and Optimal VTN Mapping. In: Center for Language and Speech Processing, Workshop (WS 1997). Johns Hopkins University (1997)
Google Scholar
Hornback, J.R., Lieutenant, S.: Speech Recognition Using The Mellin Transform, MS Thesis report, Air Force Instituite of Technology, Wright-Patterson Air Force Base, Ohio (2006)
Google Scholar
Li, D., Strik, H.: Structure-Based and Template-Based Automatic Speech Recognition-Comparing parametric and non-parametric approaches. Microsoft Research, One Microsoft Way, Redmond, WA, USA, CLST, Department of Linguistics, Radboud University, Nijmegen
Google Scholar
Hachkar, Z., Farchi, A., Mounir, B., El Abbadi, J.: A Comparison of DHMM and DTW for Isolated Digit Recognition System for Arabic Language. International Journal of Computer Science and Engineering 3(3) (March 2011); ISSN : 0975-3397
Google Scholar
Jacob, B., Sondhi, M.M., Huang, Y.: Springer Handbook of Speech Processing, XXXVI (2008)
Google Scholar
Rabiner, L.R.: A Tutorial on Hidden Markov Models and Selected Applications in Speech Recognition. Proceedings of the IEEE 77(2), 257–286 (1989)
Article Google Scholar
Thangarajan, R., Natarajan, A.M., Selvam, M.: Word and Triphone based approaches. Continuous Speech Recognition for Tamil Language 4(3) (March 2008)
Google Scholar
Anusuya, M.A., Katti, S.K.: Speech Recognition by Machine: A Review. International Journal of Computer Science and Information Security 6(3), 181–205 (2009)
Google Scholar

Download references

Author information

Authors and Affiliations

Department of Computer Science (PG), PSGR Krishnammal College for Women, Coimbatore, India
S. Karpagavalli, R. Deepika, P. Kokila & K. Usha Rani
DJ Academy for Managerial Excellence, Coimbatore, India
E. Chandra

Authors

S. Karpagavalli
View author publications
You can also search for this author in PubMed Google Scholar
R. Deepika
View author publications
You can also search for this author in PubMed Google Scholar
P. Kokila
View author publications
You can also search for this author in PubMed Google Scholar
K. Usha Rani
View author publications
You can also search for this author in PubMed Google Scholar
E. Chandra
View author publications
You can also search for this author in PubMed Google Scholar

Editor information

Editors and Affiliations

School of Computing Science and Engineering, VIT University, 632014, Vellore, TN, India
P. Venkata Krishna
School of Computing and Engineering, VIT University, 632014, Vellore, TN, India
M. Rajasekhara Babu
London Metropolitan University, UK
Ezendu Ariwa

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Karpagavalli, S., Deepika, R., Kokila, P., Usha Rani, K., Chandra, E. (2012). Isolated Tamil Digit Speech Recognition Using Template-Based and HMM-Based Approaches. In: Krishna, P.V., Babu, M.R., Ariwa, E. (eds) Global Trends in Information Systems and Software Applications. ObCom 2011. Communications in Computer and Information Science, vol 270. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-29216-3_48

Download citation

DOI: https://doi.org/10.1007/978-3-642-29216-3_48
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-642-29215-6
Online ISBN: 978-3-642-29216-3
eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics