Abstract
Speech recognition is one of the entrancing fields in the zone of computer science. Exactness of speech recognition framework may decrease because of the nearness of noise exhibited by the speech signal. Consequently, noise removal is a fundamental advance in automatic speech recognition (ASR) system. ASR is researched for various languages in light of the fact that every language has its particular highlights. Particularly, the requirement for ASR framework in Tamil language has been expanded broadly over the most recent couple of years. In this work, bidirectional recurrent neural network (BRNN) with self-organizing map (SOM)-based classification scheme is suggested for Tamil speech recognition. At first, the input speech signal is pre-prepared by utilizing Savitzky–Golay filter keeping in mind the end goal to evacuate the background noise and to improve the signal. At that point, Multivariate Autoregressive based highlights by presenting discrete cosine transformation piece to give a proficient signal investigation. And in addition, perceptual linear predictive coefficients likewise separated to enhance the classification accuracy. The feature vector is shifted in measure, for picking the right length of feature vector SOM utilized. At long last, Tamil digits and words are ordered by utilizing BRNN classifier where the settled length feature vector from SOM is given as input, named as BRNN-SOM. The experimental analysis demonstrates that the suggested conspire accomplished preferable outcomes looked at over exist deep neural network–hidden Markov model algorithm regarding signal-to-noise ratio, classification accuracy, and mean square error.
Similar content being viewed by others
Change history
19 December 2022
This article has been retracted. Please see the Retraction Notice for more detail: https://doi.org/10.1007/s00521-022-08144-x
References
Varatharajan R, Manogaran G, Priyan MK, Sundarasekar R (2017) Wearable sensor devices for early detection of Alzheimer disease using dynamic time warping algorithm. Cluster Comput. https://doi.org/10.1007/s10586-017-0977-2
Varatharajan R, Manogaran G, Priyan MK, Balaş VE, Barna C (2017) Visual analysis of geospatial habitat suitability model based on inverse distance weighting with paired comparison analysis. Multimedia Tools Appl. https://doi.org/10.1007/s11042-017-4768-9
Balan EV, Priyan MK, Gokulnath C, Devi GU (2015) Fuzzy based intrusion detection systems in MANET. Procedia Comput Sci 50:109–114
Devi GU, Balan EV, Priyan MK, Gokulnath C (2015) Mutual authentication scheme for IoT application. Indian J Sci Technol 8(26). https://doi.org/10.17485/ijst/2015/v8i26/80996
Manogaran G, Varatharajan R, Priyan MK (2018) Hybrid recommendation system for heart disease diagnosis based on multiple kernel learning with adaptive neuro-fuzzy inference system. Multimedia Tools Appl 77(4):4379–4399
Priyan MK, Devi GU (2017) Energy efficient node selection algorithm based on node performance index and random waypoint mobility model in internet of vehicles. Cluster Comput. https://doi.org/10.1007/s10586-017-0998-x
Varatharajan R, Manogaran G, Priyan MK (2017) A big data classification approach using LDA with an enhanced SVM method for ECG signals in cloud computing. Multimedia Tools Appl. https://doi.org/10.1007/s11042-017-5318-1
Devi GU, Priyan MK, Balan EV, Nath CG, Chandrasekhar M (2015) Detection of DDoS attack using optimized hop count filtering technique. Indian J Sci Technol 8(26):1–6. https://doi.org/10.17485/ijst/2015/v8i26/83981
Gokulnath C, Priyan MK, Balan EV, Prabha KR, Jeyanthi R (2015) Preservation of privacy in data mining by using PCA based perturbation technique. In: 2015 international conference on smart technologies and management for computing, communication, controls, energy and materials (ICSTM). IEEE, pp 202–206
Thota C, Sudarasekhar R, Manogaran G, Varatharajan R, Priyan MK (2017) Centralized fog computing security platform for IoT and cloud in healthcare system. In: Krishna Prasad AV (ed) Exploring the convergence of big data and the internet of things. IGI Global, Hershey, pp 141–154
Kumar PM, Gandhi U, Varatharajan R, Manogaran G, Jidhesh R, Vadivel T (2017) Intelligent face recognition and navigation system using neural learning for smart security in Internet of Things. Cluster Comput. https://doi.org/10.1007/s10586-017-1323-4
Manogaran G, Varatharajan R, Lopez D, Kumar PM, Sundarasekar R, Thota C (2017) A new architecture of Internet of Things and big data ecosystem for secured smart healthcare monitoring and alerting system. Future Gener Comput Syst 82:375–387
Kumar PM, Gandhi UD (2017) A novel three-tier Internet of Things architecture with machine learning algorithm for early detection of heart diseases. Comput Electr Eng 65:222–235
Radha V, Vimala C, Krishnaveni M (2012) Continuous speech recognition system for Tamil language using monophone-based hidden markov model. In: Proceedings of the second international conference on computational science, engineering and information technology. ACM, pp 227–231
Radha V, Vimala C, Krishnaveni M (2011) Isolated word recognition system for Tamil spoken language using back propagation neural network based on LPCC features. Comput Sci Eng 1(4):1–11
Patel I, Rao YS (2010) Speech recognition using HMM with MFCC: an analysis using frequency spectral decomposition technique. Signal Image Process Int J (SIPIJ) 1(2):101–110
Chandrasekar M, Ponnavaikko M (2008) Tamil speech recognition: a complete model. Electron J Tech Acoust, article no. 20. http://www.ejta.org/en/chandrasekar2
Rojathai S, Venkatesulu M (2012) A novel speech recognition system for Tamil word recognition based on MFCC and FFBNN. Eur J Sci Res 85(4):578–590
Sigappi AN, Palanivel S (2012) Spoken word recognition strategy for Tamil language. Int J Comput Sci Issues 9(1):1694-0814
Sivaraj P, Rama M (2012) Recognition of isolated spoken words using DWT. Int J Eng Sci Res 2(9):1187–1196
Thangarajan R, Natarajan AM, Selvam M (2008) Word and triphone based approaches in continuous speech recognition for Tamil language. WSEAS Trans Signal Process 4(3):76–86
Saraswathi S, Geetha TV (2010) Design of language models at various phases of Tamil speech recognition system. Int J Eng Sci Technol 2(5):244–257
Karpagavalli S, Rani KU, Deepika R, Kokila P (2012) Isolated Tamil digits speech recognition using vector quantization. Int J Eng Res Technol 1(4):1–12
Iswarya P, Radha V (2012) Speech based query processing architecture for Tamil-English in cross language text retrieval system. Int J Emerg Trends Eng Dev 7(2):437–442
Schafer R (2011) What is a Savitzky-Golay filter? IEEE Signal Process Mag 28:111–117 (lecture notes)
Savitzky A, Golay MJE (1964) Smoothing and differentiation of data by simplified least squares procedures. Anal Chem 36:1627–1639
Neumaier A, Schneider T (2001) Estimation of parameters and eigenmodes of multivariate autoregressive models. ACM Trans Math Softw (TOMS) 27(1):27–57
Lütkepohl H (2005) New introduction to multiple time series analysis. Springer, Berlin
Box GE, Jenkins GM, Reinsel GC, Ljung GM (2015) Time series analysis: forecasting and control. Wiley, Hoboken
Misra H (2006) Multi-stream processing for noise robust speech recognition. Doctoral thesis, Swiss Federal Institute of Technology (EPFL), Lausanne, Switzerland, March 2006
Chen R, Jamieson LH (1996) Experiments on the implementation of recurrent neural networks for speech phone recognition. In: Proceedings of the thirtieth annual Asilomar conference on signals, systems and computers, Pacific Grove, California, November, pp 779–782
Lee SJ, Kim KC, Yoon H, Cho JW (1991) Application of fully neural networks for speech recognition. In: Korea Advanced Institute of Science and Technology, Korea, pp 77–80
He J, Liu L (1999) Speaker verification performance and the length of test sentence. In: Proceedings on ICASSP 1999, vol 1, pp 305–308
Gingras F, Bengio Y (1998) Handling asynchronous or missing data with recurrent networks. Int J Comput Intell Organ 1(3):154–163
Schuster M, Paliwal KK (1997) Bidirectional recurrent neural networks. IEEE Trans Signal Process 45:2673–2681
Fredes J, Novoa J, King S, Stern RM, Yoma NB (2017) Locally normalized filter banks applied to deep neural-network-based robust speech recognition. IEEE Signal Process Lett 24(4):377–381
Author information
Authors and Affiliations
Corresponding author
Ethics declarations
Conflict of interest
This statement is to certify that all authors have seen and approved the manuscript being submitted. We warrant that the article is the authors’ original work. We warrant that the article has not received prior publication and is not under consideration for publication elsewhere. On behalf of all co-authors, the corresponding author shall bear full responsibility for the submission. The author(s) declare that there is no conflict of interest.
Additional information
This article has been retracted. Please see the retraction notice for more detail: https://doi.org/10.1007/s00521-022-08144-x
Rights and permissions
Springer Nature or its licensor (e.g. a society or other partner) holds exclusive rights to this article under a publishing agreement with the author(s) or other rightsholder(s); author self-archiving of the accepted manuscript version of this article is solely governed by the terms of such publishing agreement and applicable law.
About this article
Cite this article
Lokesh, S., Malarvizhi Kumar, P., Ramya Devi, M. et al. RETRACTED ARTICLE: An Automatic Tamil Speech Recognition system by using Bidirectional Recurrent Neural Network with Self-Organizing Map. Neural Comput & Applic 31, 1521–1531 (2019). https://doi.org/10.1007/s00521-018-3466-5
Received:
Accepted:
Published:
Issue Date:
DOI: https://doi.org/10.1007/s00521-018-3466-5