HMM Based Enhanced Dynamic Time Warping Model for Efficient Hindi Language Speech Recognition System

Kumar, Sharma Krishna; Kant, Lavania Krishan; Shachi, Sharma

doi:10.1007/978-3-642-35864-7_28

Part of the book series: Communications in Computer and Information Science (CCIS, volume 296)

Included in the following conference series:

International Conference on Advances in Information Technology and Mobile Communication

Abstract

Dynamic Time Warping (DTW) is a template-based cost-minimization technique. We propose a Hidden Markov Model (HMM) based enhanced DTW technique to efficiently recognize signals spoken at varying rates and to distinguish closely similar utterances. We extend the derivations of the Viterbi and forward algorithms to find the optimal alignment path in the newly proposed technique, and extend the Baum-Welch algorithm to optimize the model parameters. The proposed technique is compared with the conventional DTW technique; the comparative analysis shows that it improves the results from 84%, obtained with the conventional DTW technique, to 94% for Hindi spoken words, across various speech utterances in different environmental conditions and for varying speakers.
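For readers unfamiliar with the baseline, the following is a minimal sketch of conventional template-based DTW, the technique the abstract compares against. It is not the authors' HMM-based enhanced method, which is not available in this preview. The function names `dtw_distance` and `classify`, the absolute-difference local cost, and the use of scalar sequences are all assumptions made for illustration; a real recognizer would typically align sequences of feature vectors rather than scalars.

```python
import math


def dtw_distance(a, b):
    """Accumulated cost of the cheapest monotonic alignment of a and b.

    Illustrative assumptions: 1-D numeric sequences, absolute difference
    as the local cost, and the standard three-way step pattern.
    """
    n, m = len(a), len(b)
    # cost[i][j] is the minimal accumulated cost of aligning a[:i] with b[:j].
    cost = [[math.inf] * (m + 1) for _ in range(n + 1)]
    cost[0][0] = 0.0
    for i in range(1, n + 1):
        for j in range(1, m + 1):
            local = abs(a[i - 1] - b[j - 1])
            # Extend the cheapest of the three allowed predecessor cells.
            cost[i][j] = local + min(
                cost[i - 1][j],      # a advances, b repeats a frame
                cost[i][j - 1],      # b advances, a repeats a frame
                cost[i - 1][j - 1],  # both advance
            )
    return cost[n][m]


def classify(utterance, templates):
    """Return the label of the stored template with the lowest DTW cost.

    `templates` is a hypothetical dict mapping a word label to one
    reference sequence.
    """
    return min(templates, key=lambda label: dtw_distance(utterance, templates[label]))
```

Because the alignment may repeat frames, a slowly spoken version of a word (for example `[1, 2, 2, 3]` against the template `[1, 2, 3]`) can still align at zero cost, which is the sense in which DTW tolerates varying speaking rates.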

Author information

Authors and Affiliations

Department of Computer Science, Central University of Rajasthan, Kishangarh, Ajmer, India
Sharma Krishna Kumar
Department of CS, AIET, Rajasthan Technical University, Jaipur, India
Lavania Krishan Kant & Sharma Shachi

Editor information

Editors and Affiliations

Institute of Doctors Engineering and Scientist, Amsterdam, The Netherlands
Vinu V Das
Guru Jambheshwar University of Science and Technology, Hisar, Haryana, India
Yogesh Chaba

Cite this paper

Kumar, S.K., Kant, L.K., Shachi, S. (2013). HMM Based Enhanced Dynamic Time Warping Model for Efficient Hindi Language Speech Recognition System. In: Das, V.V., Chaba, Y. (eds) Mobile Communication and Power Engineering. AIM 2012. Communications in Computer and Information Science, vol 296. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-35864-7_28

DOI: https://doi.org/10.1007/978-3-642-35864-7_28
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-642-35863-0
Online ISBN: 978-3-642-35864-7
eBook Packages: Computer Science, Computer Science (R0)
