Abstract
In this paper, speaker-independent isolated word recognition system is proposed using the Mel-Frequency Cepstral Coefficients feature extraction method to create the feature vector. Support vector machine, sigmoid neural net, and the novel wavelet neural network are used as classifiers and the results are compared in terms of the maximum accuracy obtained and the number of iterations taken to achieve this. The effect of stretch factor on the accuracy of classification for WaveNets is shown in the results. The number of features is also varied using dimension reduction technique and its effect on the accuracies is studied. The data is prepared using feature scaling and dimensionality reduction before training SVM and NN classifiers.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
References
Besbes S, Lachiri Z (2016) Multi-class SVM for stressed speech recognition. In: 2016 2nd international conference on advanced technologies for signal and image processing (ATSIP), 21–23 March 2016. https://doi.org/10.1109/atsip.2016.7523188
Padrell-Sendra J, MartÃn-Iglesias D, DÃaz-de-MarÃa F (2006) Support vector machines for continuous speech recognition. In: 2006 14th European on signal processing conference, 4–8 Sept 2006
Gurban M, Thiran J-P (2005) Audio-visual speech recognition with a hybrid SVM-HMM system. In: 2005 13th European on signal processing conference, 4–8 Sept 2005
Riis SK (1998) Hidden neural networks: application to speech recognition. In: Proceedings of the 1998 IEEE international conference on acoustics, speech and signal processing, 1998, 15–15 May 1998. https://doi.org/10.1109/icassp.1998.675465
Barua P, Ahmad K, Khan AAS, Sanaullah M (2014) Neural network based recognition of speech using MFCC features. In: 2014 international conference on informatics, electronics & vision (ICIEV), 23–24 May 2014. https://doi.org/10.1109/iciev.2014.6850680
Renals S, Swietojanski P (2014) Neural networks for distant speech recognition. In: 2014 4th joint workshop on hands-free speech communication and microphone arrays (HSCMA), 12–14 May 2014. https://doi.org/10.1109/hscma.2014.6843274
Zainuddin Z, Pauline O (2007) Function approximation using artificial neural networks. Int J Syst Appl Eng Dev 1(4)
Wang G, Guo L, Duan H (2013) Wavelet neural network using multiple wavelet functions in target threat assessment. Sci World J 2013 (Article ID 632437)
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2018 Springer Nature Singapore Pte Ltd.
About this paper
Cite this paper
Alex, J.S.R., Das, A., Kodgule, S.A., Venkatesan, N. (2018). A Comparative Study of Isolated Word Recognizer Using SVM and WaveNet. In: Nandi, A., Sujatha, N., Menaka, R., Alex, J. (eds) Computational Signal Processing and Analysis. Lecture Notes in Electrical Engineering, vol 490. Springer, Singapore. https://doi.org/10.1007/978-981-10-8354-9_13
Download citation
DOI: https://doi.org/10.1007/978-981-10-8354-9_13
Published:
Publisher Name: Springer, Singapore
Print ISBN: 978-981-10-8353-2
Online ISBN: 978-981-10-8354-9
eBook Packages: EngineeringEngineering (R0)