HMM-Based Lightweight Speech Recognition System for Gujarati Language

Tailor, Jinal H.; Shah, Dipti B.

doi:10.1007/978-981-10-3920-1_46

Part of the book series: Lecture Notes in Networks and Systems (LNNS, volume 10)

Abstract

Speech recognition systems (SRS) are a growing research interest in natural language processing (NLP). Developing a speech recognition system for a low-resource language is a difficult task. This paper describes a lightweight speech recognition approach for the Indian language Gujarati using a hidden Markov model (HMM). The aim of this research is to design and implement an SRS for everyday Gujarati, which is difficult because of the language barrier, the complex structure of the language, and its morphological variance. To train the HMM-based SRS, we manually created a speech corpus of 650 routine Gujarati utterances recorded from a total of 40 speakers from the South Gujarat region. Speakers were selected on the basis of gender. We achieved an accuracy of 87.23% with an average error rate of 12.7%, computed using the word error rate (WER).
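The WER figure quoted above is conventionally computed as the word-level edit distance (substitutions, deletions, and insertions) between the recognized text and the reference transcript, divided by the number of reference words. The following is a minimal sketch of that standard definition, not the authors' evaluation code. The function name `word_error_rate` and the placeholder tokens are hypothetical, and treating accuracy as 1 - WER is an assumption.

```python
def word_error_rate(reference: str, hypothesis: str) -> float:
    """Word-level Levenshtein distance divided by the reference length."""
    ref = reference.split()
    hyp = hypothesis.split()
    if not ref:
        raise ValueError("reference must contain at least one word")

    # prev[j] holds the edit distance between the reference words seen
    # so far and the first j hypothesis words.
    prev = list(range(len(hyp) + 1))
    for i, ref_word in enumerate(ref, start=1):
        curr = [i] + [0] * len(hyp)
        for j, hyp_word in enumerate(hyp, start=1):
            substitution = prev[j - 1] + (0 if ref_word == hyp_word else 1)
            deletion = prev[j] + 1       # reference word missing from hypothesis
            insertion = curr[j - 1] + 1  # extra word in hypothesis
            curr[j] = min(substitution, deletion, insertion)
        prev = curr
    return prev[-1] / len(ref)


# Placeholder tokens stand in for Gujarati words (illustrative only).
wer = word_error_rate("w1 w2 w3 w4", "w1 wX w3")
accuracy = 1.0 - wer  # assumption: accuracy reported as 1 - WER
```

In this example one word is substituted and one is deleted out of four reference words, so the WER is 2/4. Note that the WER can exceed 1 when the hypothesis contains many inserted words.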

Author information

Authors and Affiliations

S.P. University, Vallabh Vidhyanagar, Gujarat, India
Jinal H. Tailor
G.H. Patel Post Graduate Department of Computer Science, S.P. University, Vallabh Vidhyanagar, Gujarat, India
Dipti B. Shah

Corresponding author

Correspondence to Jinal H. Tailor.

Editor information

Editors and Affiliations

Microsoft Innovation Centre, Sri Aurobindo Institute of Technology, Indore, Madhya Pradesh, India
Durgesh Kumar Mishra
Dagenham, United Kingdom
Malaya Kumar Nayak
Department of Information Technology, Sabar Institute of Technology, Ahmedabad, Gujarat, India
Amit Joshi

About this paper

Cite this paper

Tailor, J.H., Shah, D.B. (2018). HMM-Based Lightweight Speech Recognition System for Gujarati Language. In: Mishra, D., Nayak, M., Joshi, A. (eds) Information and Communication Technology for Sustainable Development. Lecture Notes in Networks and Systems, vol 10. Springer, Singapore. https://doi.org/10.1007/978-981-10-3920-1_46

DOI: https://doi.org/10.1007/978-981-10-3920-1_46
Published: 08 November 2017
Publisher Name: Springer, Singapore
Print ISBN: 978-981-10-3919-5
Online ISBN: 978-981-10-3920-1
eBook Packages: Engineering, Engineering (R0)