Experiments on ANN Based ASR Systems Using Limited Arabic Vocabulary

Alotaibi, Yousef Ajami

doi:10.1007/978-3-642-19644-7_48

Part of the book series: Advances in Intelligent and Soft Computing (AINSC, volume 87)

Abstract

This paper investigates Artificial Neural Network (ANN) based Automatic Speech Recognition (ASR) using limited-vocabulary Arabic corpora. The vocabulary subsets are digits and vowels, the latter embedded in specific carrier words. Hidden Markov Model (HMM) based ASR systems were also designed and compared with the ANN based systems on the same corpora. All systems are isolated-word speech recognizers. On digits, the ANN based system achieved 99.5% correct recognition, compared with 98.1% for the HMM based system. On the vowel carrier words, the ANN based system achieved 92.13% correct vowel recognition, compared with 91.6% for the HMM based system.
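
The reported accuracies can also be read as error rates, which makes the size of each gap easier to compare across the two tasks. The following is a minimal sketch of that arithmetic, not part of the paper: the four accuracy figures are taken from the abstract, while the function names and the choice of relative error reduction as a summary measure are assumptions made for illustration.

```python
# Accuracies (percent correct) exactly as reported in the abstract.
REPORTED = {
    "digits": {"ANN": 99.5, "HMM": 98.1},
    "vowels": {"ANN": 92.13, "HMM": 91.6},
}


def error_rate(accuracy_pct):
    """Convert a percent-correct figure into a percent-error figure."""
    return 100.0 - accuracy_pct


def relative_error_reduction(baseline_acc, improved_acc):
    """Fraction of the baseline's errors removed by the improved system.

    Hypothetical helper for illustration; the paper does not report this.
    """
    base_err = error_rate(baseline_acc)
    new_err = error_rate(improved_acc)
    return (base_err - new_err) / base_err


def summarize(reported=REPORTED):
    """Per task: absolute accuracy gain and relative error reduction (ANN vs. HMM)."""
    rows = {}
    for task, acc in reported.items():
        rows[task] = {
            "abs_gain_pts": acc["ANN"] - acc["HMM"],
            "rel_err_reduction": relative_error_reduction(acc["HMM"], acc["ANN"]),
        }
    return rows


if __name__ == "__main__":
    for task, row in summarize().items():
        print(
            f"{task}: +{row['abs_gain_pts']:.2f} points, "
            f"{row['rel_err_reduction']:.1%} relative error reduction"
        )
```

Under these assumptions, the digit gap (1.4 points) corresponds to roughly 74% fewer errors than the HMM system, while the vowel gap (0.53 points) corresponds to roughly 6% fewer. The abstract does not give test-set sizes, so this sketch says nothing about whether either difference is statistically significant.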

Author information

Authors and Affiliations

Computer Engineering Dept., College of Computer & Information Sciences, King Saud University, P.O. Box 57168, Riyadh, 11574, Saudi Arabia
Yousef Ajami Alotaibi

Editor information

Editors and Affiliations

Universidad de Salamanca, Plaza de la Merced S/N, 37008, Salamanca, Spain
Emilio Corchado
VŠB-TU Ostrava, 17. listopadu 15, 70833, Ostrava, Czech Republic
Václav Snášel
University of Burgos, Avenida Cantaria S/N, 09006, Burgos, Spain
Javier Sedano
Cairo University, 5 Ahmed Zewal St., Orman, Cairo, Egypt
Aboul Ella Hassanien
University of La Coruña, Avda. 19 de Febrero, S/N, A Coruña, 15403, Ferrol, Spain
José Luis Calvo
Infobright, 47 Colborne Street, Suite 403, M5E1P8, Toronto, Ontario, Canada
Dominik Ślęzak

About this paper

Cite this paper

Alotaibi, Y.A. (2011). Experiments on ANN Based ASR Systems Using Limited Arabic Vocabulary. In: Corchado, E., Snášel, V., Sedano, J., Hassanien, A.E., Calvo, J.L., Ślęzak, D. (eds) Soft Computing Models in Industrial and Environmental Applications, 6th International Conference SOCO 2011. Advances in Intelligent and Soft Computing, vol 87. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-19644-7_48

DOI: https://doi.org/10.1007/978-3-642-19644-7_48
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-642-19643-0
Online ISBN: 978-3-642-19644-7
eBook Packages: Engineering, Engineering (R0)
