Abstract
Speech recognition system (SRS) is growing research interest in the area of natural language processing (NLP). To develop speech recognition system for low resource language is difficult task. This paper defines a lightweight speech recognition system approach for Indian Gujarati language using hidden Markov model (HMM). The aim of this research is to design and implement SRS for routine Gujarati language which is difficult due to language barrier, complex language framework, and morphological variance. To train the HMM-based SRS we have manually created speech corpora that contained 650 routine Gujarati utterances which are recorded from total 40 speakers of South Gujarat region. Total numbers of speakers are selected on the basis of gender. We have achieved accuracy of 87.23% with average error rate 12.7% based on the word error rate (WER) computing.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
References
Weischedel, R., Carbonell, J., Grosz, B., Lehnert, W., Marcus, M., Perrault, R., & Wilensky, R. (1989, October). White paper on natural language processing. In Proceedings of the workshop on Speech and Natural Language (pp. 481–493). Association for Computational Linguistics.
Sandanalakshmi, R., Viji, P. A., Kiruthiga, M., Manjari, M., & Sharina, M. (2013). Speaker Independent Continuous Speech to Text Converter for Mobile Application. arXiv preprint arXiv:1307.5736.
Uchat, N. S. (2007). Hidden Markov Model and Speech Recognition. In Seminar report, Department of Computer Science and Engineering Indian Institute of Technology, Mumbai.
Jain, D., & Cardona, G. (2007). The Indo-Aryan languages. Routledge.
Gales, M., & Young, S. (2008). The application of hidden Markov models in speech recognition. Foundations and trends in signal processing, 1(3), 195–304.
Samudravijaya, K. (1878). Computer recognition of spoken Hindi. training, 198(56), 93. Samudravijaya, K., Computer Recognition of Spoken Hindi‖. Proceeding of International Conference of Speech, Music and Allied Signal Processing, Triruvananthapuram, pages 8–13, 2000.
Samudravijaya, K., Ahuja, R., Bondale, N., Jose, T., Krishnan, S., Poddar, P., & Raveendran, R. (1998). A feature-based hierarchical speech recognition system for Hindi. Sadhana, 23(4), 313–340.
Kuldeep Kumar and R.K. Aggarwal, “hındı speech recognition system using HTK”, International Journal of Computing and Business Research, vol. 2, no. 2, 2011.
Kumar, M., Rajput, N., & Verma, A. (2004). A large-vocabulary continuous speech recognition system for Hindi. IBM journal of research and development, 48(5.6), 703–715.
Kumar, M., Aggarwal, R. K., Leekha, G., & Kumar, Y. (2012). Ensemble feature extraction modules for improved Hindi speech recognition system. Proc Int J Comput Sci, (9), 3.
Gaurav, G., Deiv, D. S., Sharma, G. K., & Bhattacharya, M. (2012). Development of application specific continuous speech recognition system in Hindi.
Thangarajan, R., Natarajan, A. M., & Selvam, M. (2008). Word and triphone based approaches in continuous speech recognition for Tamil language. WSEAS transactions on signal processing, 4(3), 76–86.
Das, B., Mandal, S., & Mitra, P. (2011, October). Bengali speech corpus for continuous automatic speech recognition system. In Speech Database and Assessments (Oriental COCOSDA), 2011 International Conference on (pp. 51–55). IEEE.
Udhyakumar, N., Swaminathan, R., & Ramakrishnan, S. K. (2004, May). Multilingual speech recognition for information retrieval in Indian context. In Proceedings of the Student Research Workshop at HLT-NAACL 2004 (pp. 1–6). Association for Computational Linguistics.
Lakshmi, A., & Murthy, H. A. (2008). A new approach to continuous speech recognition in Indian languages. In Proceedings national conference communication.
Dua, M., Aggarwal, R. K., Kadyan, V., & Dua, S. (2012). Punjabi automatic speech recognition using HTK. IJCSI International Journal of Computer Science Issues, 9(4), 1694–0814.
Aggarwal, R. K., & Dave, M. (2011). Using Gaussian mixtures for Hindi speech recognition system. International Journal of Signal Processing, Image Processing and Pattern Recognition, 4(4), 157–170.
Mishra, A. N., Chandra, M., Biswas, A., & Sharan, S. N. (2011). Robust features for connected Hindi digits recognition. International Journal of Signal Processing, Image Processing and Pattern Recognition, 4(2), 79–90.
Kumar, R., Kishore, S., Gopalakrishna, A., Chitturi, R., Joshi, S., Singh, S., & Sitaram, R. (2005). Development of Indian language speech databases for large vocabulary speech recognition systems. In International Conference on Speech and Computer (SPECOM) Proceedings.
Nilsson, M., & Ejnarsson, M. (2002). Speech recognition using hidden markov model.
Huang, X., & Deng, L. (2010). An Overview of Modern Speech Recognition.
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2018 Springer Nature Singapore Pte Ltd.
About this paper
Cite this paper
Tailor, J.H., Shah, D.B. (2018). HMM-Based Lightweight Speech Recognition System for Gujarati Language. In: Mishra, D., Nayak, M., Joshi, A. (eds) Information and Communication Technology for Sustainable Development. Lecture Notes in Networks and Systems, vol 10. Springer, Singapore. https://doi.org/10.1007/978-981-10-3920-1_46
Download citation
DOI: https://doi.org/10.1007/978-981-10-3920-1_46
Published:
Publisher Name: Springer, Singapore
Print ISBN: 978-981-10-3919-5
Online ISBN: 978-981-10-3920-1
eBook Packages: EngineeringEngineering (R0)