Abstract
In this paper we investigated Artificial Neural Networks (ANN) based Automatic Speech Recognition (ASR) by using limited Arabic vocabulary corpora. These limited Arabic vocabulary subsets are digits and vowels carried by specific carrier words. In addition to this, Hidden Markov Model (HMM) based ASR systems are designed and compared to ANN based systems by using the same corpora. All systems are isolated word speech recognizers. The ANN based recognition system achieved 99.5% correct digit recognition. On the other hand, the HMM based recognition system achieved 98.1% correct digit recognition. With vowels carrier words, the ANN based recognition system achieved 92.13% correct vowel recognition; but the HMM based recognition system achieved 91.6% correct vowel recognition.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Preview
Unable to display preview. Download preview PDF.
References
Alkhouli, M.: Alaswaat Alaghawaiyah. Daar Alfalah, Jordan (1990) (in Arabic)
Elshafei, M.: Toward an Arabic Text-to -Speech System. The Arabian Journal for Science and Engineering 16(4B), 565–583 (1991)
Deller, J., Proakis, J., Hansen, J.H.: Discrete-Time Processing of Speech Signal. Macmillan, NYC (1993)
Alghamdi, M.: Arabic Phonetics, Al-Toubah Bookshop, Riyadh 2001 (in Arabic)
Omar, A.: Derasat Alaswat Aloghawi, Aalam Alkutob, Eygpt (1991) (in Arabic)
El-Imam, Y.A.: An Unrestricted Vocabulary Arabic Speech Synthesis System. IEEE Trans. on Acoustic, Speech, and Signal Processing 37(12), 1829–1845 (1989)
Kirchhoff, K., Bilmes, J., Das, S., Duta, N., Egan, M., Gang, J., Feng, H., Henderson, J., Daben, L., Noamany, M., Schone, P., Schwartz, R., Vergyri, D.: Novel approaches to Arabic speech recognition: report from the 2002 Johns-Hopkins Summer Workshop. In: Proceedings of ICASSP 2003, vol. 1, pp. 344–347 (April 2003)
Hagos, E.: Implementation of an Isolated Word Recognition System, UMI Dissertation Service (1985)
Iqbal, H.R., Awais, M.M., Masud, S., Shamail, S.: New Challenges in Applied Intelligence Technologies. In: On Vowels Segmentation and Identification Using Formant Transitions in Continuous Recitation of Quranic Arabic, pp. 155–162. Springer, Berlin (2008)
Razak, Z., Ibrahim, N.J., Tamil, E.M., Idris, M.Y.I., Yakub, M., Yusoff, Z.B.M.: Quranic Verse Recitation Feature Extraction Using Mel-Frequency Cepstral Coefficient (MFCC). In: Proceedings of the 4th IEEE International Colloquium on Signal Processing and its Application (CSPA 2008), Kuala Lumpur, Malaysia, March 7-9 (2008)
Tolba, M.F., Nazmy, T., Abdelhamid, A.A., GadallahA, M.E.: A Novel Method for Arabic Consonant/Vowel Segmentation using Wavelet Transform. International Journal on Intelligent Cooperative Information Systems, IJICIS, 5(1), 353–364 (2005)
Alghamdi, M.M.: A spectrographic analysis of Arabic vowels: A cross-dialect study. Journal of King Saud University 10(1), 3–24 (1998)
Newman, D.L., Verhoeven, J.: Frequency Analysis of Arabic Vowels in Connected Speech, pp. 77–87
Lippmann, R.: Review of Neural Networks for Speech Recognition. Neural Computation, 1–38 (1989)
Haykin, S.: Neural Networks: A Comprehensive Foundation, 2nd edn. Prentice Hall, Englewood Cliffs (1999)
Loizou, P.C., Spanias, A.S.: High-Performance Alphabet Recognition. IEEE Trans. on Speech and Audio Processing 4(6), 430–445 (1996)
Rabiner, L.R.: A Tutorial on Hidden Markov Models and Selected Applications in Speech Recognition. Proceedings of the IEEE 77(2), 257–286 (1989)
Juang, B., Rabiner, L.: Hidden Markov Models for Speech Recognition. Technometrics 33(3), 251–272 (1991)
Young, S., Evermann, G., Gales, M., Hain, T., Kershaw, D., Moore, G., Odell, J., Ollason, D., Povey, D., Valtchev, V., Woodland, P.: The HTK Book (for HTK Version. 3.4) Cambridge University Engineering Department (2006), http://htk.eng.cam.ac.uk/prot-doc/ktkbook.pdf
Alotaibi, Y.A.: High Performance Arabic Digits Recognizer Using Neural Networks. In: The 2003 International Joint Conference on Neural Networks IJCNN2003, Portland, Oregon (2003)
Linguistic Data Consortium (LDC) Catalog Number LDC2002S02 (2002), http://www.ldc.upenn.edu/
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2011 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Alotaibi, Y.A. (2011). Experiments on ANN Based ASR Systems Using Limited Arabic Vocabulary. In: Corchado, E., Snášel, V., Sedano, J., Hassanien, A.E., Calvo, J.L., Ślȩzak, D. (eds) Soft Computing Models in Industrial and Environmental Applications, 6th International Conference SOCO 2011. Advances in Intelligent and Soft Computing, vol 87. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-19644-7_48
Download citation
DOI: https://doi.org/10.1007/978-3-642-19644-7_48
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-642-19643-0
Online ISBN: 978-3-642-19644-7
eBook Packages: EngineeringEngineering (R0)