Impact of a Newly Developed Modern Standard Arabic Speech Corpus on Implementing and Evaluating Automatic Continuous Speech Recognition Systems

Abushariah, Mohammad A. M.; Ainon, Raja N.; Zainuddin, Roziati; Al-Qatab, Bassam A.; Alqudah, Assal A. M.

doi:10.1007/978-3-642-16202-2_1

Mohammad A. M. Abushariah^23,24,
Raja N. Ainon²³,
Roziati Zainuddin²³,
Bassam A. Al-Qatab²³ &
…
Assal A. M. Alqudah²³

Part of the book series: Lecture Notes in Computer Science ((LNAI,volume 6392))

Included in the following conference series:

International Workshop on Spoken Dialogue Systems Technology

474 Accesses
2 Citations

Abstract

Being current formal linguistic standard and only acceptable form of Arabic language for all native speakers, Modern Standard Arabic (MSA) still lacks sufficient spoken corpora compared to other forms like Dialectal Arabic. This paper describes our work towards developing a new speech corpus for MSA, which can be used for implementing and evaluating any Arabic automatic continuous speech recognition system. The speech corpus contains 415 (367 training and 48 testing) sentences recorded by 42 (21 male and 21 female) Arabic native speakers from 11 countries representing three major regions (Levant, Gulf, and Africa). The impact of using this speech corpus on overall performance of Arabic automatic continuous speech recognition systems was examined. Two development phases were conducted based on the size of training data, Gaussian mixture distributions, and tied states (senones). Overall results indicate that larger training data size result higher word recognition rates and lower Word Error Rates (WER).

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Log in via an institution

Chapter: USD 29.95; Price excludes VAT (USA)

eBook: USD 39.99; Price excludes VAT (USA)

Softcover Book: USD 54.99; Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

References

Elmahdy, M., Gruhn, R., Minker, W., Abdennadher, S.: Survey on common Arabic language forms from a speech recognition point of view. In: International Conference on Acoustics (NAG-DAGA), Rotterdam, Netherlands, pp. 63 – 66 (2009)
Google Scholar
Alotaibi, Y.A.: Comparative Study of ANN and HMM to Arabic Digits Recognition Systems. Journal of King Abdulaziz University: Engineering Sciences 19(1), 43–59 (2008)
Article Google Scholar
Kirchhoff, K., Bilmes, J., Das, S., Duta, N., Egan, M., Ji, G., He, F., Henderson, J., Liu, D., Noamany, M., Schone, P., Schwartz, R., Vergyri, D.: Novel approaches to Arabic speech recognition. In: Report from the 2002 Johns-Hopkins Summer Workshop, ICASSP 2003, Hong Kong, vol. 1, pp. 344–347 (2003)
Google Scholar
Al-Sulaiti, L., Atwell, E.: The design of a corpus of Contemporary Arabic. International Journal of Corpus Linguistics, John Benjamins Publishing Company, 1 – 36 (2006)
Google Scholar
Nikkhou, M., Choukri, K.: Survey on Industrial needs for Language Resources. Technical Report, NEMLAR – Network for Euro-Mediterranean Language Resources (2004)
Google Scholar
Nikkhou, M., Choukri, K.: Survey on Arabic Language Resources and Tools in the Mediterranean Countries. Technical Report, NEMLAR – Network for Euro-Mediterranean Language Resources (2005)
Google Scholar
Alghamdi, M., Alhamid, A.H., Aldasuqi, M.M.: Database of Arabic Sounds: Sentences. Technical Report, King Abdulaziz City of Science and Technology, Saudi Arabia, In Arabic (2003)
Google Scholar
Ali, M., Elshafei, M., Alghamdi, M., Almuhtaseb, H., Al-Najjar, A.: Generation of Arabic Phonetic Dictionaries for Speech Recognition. In: IEEE Proceedings of the International Conference on Innovations in Information Technology, UAE, pp. 59 – 63 (2008)
Google Scholar
Elshafei, A.M.: Toward an Arabic Text-to-Speech System. The Arabian Journal of Science and Engineering 16(4B), 565–583 (1991)
MathSciNet Google Scholar

Download references

Author information

Authors and Affiliations

Faculty of Computer Science and Information Technology, University of Malaya, 50603, Kuala Lumpur, Malaysia
Mohammad A. M. Abushariah, Raja N. Ainon, Roziati Zainuddin, Bassam A. Al-Qatab & Assal A. M. Alqudah
Department of Computer Information Systems, King Abdullah II School for Information Technology, University of Jordan, 11942, Amman, Jordan
Mohammad A. M. Abushariah

Authors

Mohammad A. M. Abushariah
View author publications
You can also search for this author in PubMed Google Scholar
Raja N. Ainon
View author publications
You can also search for this author in PubMed Google Scholar
Roziati Zainuddin
View author publications
You can also search for this author in PubMed Google Scholar
Bassam A. Al-Qatab
View author publications
You can also search for this author in PubMed Google Scholar
Assal A. M. Alqudah
View author publications
You can also search for this author in PubMed Google Scholar

Editor information

Editors and Affiliations

Department of Computer Science and Engineering, Pohang University of Science and Technology, San 31, Hyoja-dong, Nam-gu, 790-784, Pohang, South Korea
Gary Geunbae Lee
Laboratoire d’Informatique pour la Mécanique et les Sciences de L’ Ingénieur, Centre National de la Recherche Scientifique, B.P. 133 91403, Orsy cedex, France
Joseph Mariani
Institute of Information Technology, University of Ulm, Albert-Einstein-Allee 43, 89081, Ulm, Germany
Wolfgang Minker
national Institute of Information and Communications Technology, 3-5 Hikaridai, Keihanna Science City, Kyoto, Japan
Satoshi Nakamura

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Abushariah, M.A.M., Ainon, R.N., Zainuddin, R., Al-Qatab, B.A., Alqudah, A.A.M. (2010). Impact of a Newly Developed Modern Standard Arabic Speech Corpus on Implementing and Evaluating Automatic Continuous Speech Recognition Systems. In: Lee, G.G., Mariani, J., Minker, W., Nakamura, S. (eds) Spoken Dialogue Systems for Ambient Environments. IWSDS 2010. Lecture Notes in Computer Science(), vol 6392. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-16202-2_1

Download citation

DOI: https://doi.org/10.1007/978-3-642-16202-2_1
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-642-16201-5
Online ISBN: 978-3-642-16202-2
eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics