Speech Recognition System of Arabic Alphabet Based on a Telephony Arabic Corpus

Ajami Alotaibi, Yousef; Alghamdi, Mansour; Alotaiby, Fahad

doi:10.1007/978-3-642-13681-8_15

Part of the book series: Lecture Notes in Computer Science (LNIP, volume 6134)

Included in the following conference series:

International Conference on Image and Signal Processing

Abstract

Automatic recognition of spoken alphabet letters is one of the difficult tasks in computer speech recognition. This research investigates spoken Arabic letters as a speech recognition problem. The system is designed to recognize the spelling of an isolated word. The Hidden Markov Model Toolkit (HTK) is used to implement the isolated-word recognizer with phoneme-based hidden Markov models (HMMs). For both training and testing, the isolated-letter data sets are taken from SAAVB, a telephony Arabic speech corpus. This standard corpus was developed by KACST and is classified as a noisy speech database. Four experiments were conducted on three subsets of this data: each of the first three trained and tested on one individual subset, and the fourth trained and tested on the three subsets together. In the fourth experiment, with the training and testing subsets mixed, the system achieved 64.06% overall correct letter recognition.

Author information

Authors and Affiliations

Computer Engineering Department, King Saud University, Riyadh, Saudi Arabia
Yousef Ajami Alotaibi
King Abdulaziz City for Science and Technology, Riyadh, Saudi Arabia
Mansour Alghamdi
Department of Electrical Engineering, King Saud University, Riyadh, Saudi Arabia
Fahad Alotaiby

Editor information

Editors and Affiliations

Université de Caen Basse-Normandie GREYC UMR CNRS 6072, ENSICAEN, 14050, Caen, France
Abderrahim Elmoataz & Olivier Lezoray
Département de Mathématiques et d’informatique, Université du Québec à Trois-Rivières, C.P. 500, G9A 5H7, Trois-Rivières, Québec, Canada
Fathallah Nouboud
Faculté des Sciences, Université IbnZohr, Agadir, Morocco
Driss Mammass
Département d’Informatique et de Recherche Opérationnelle, Université de Montréal, H3C 3J7, Montréal, QC, Canada
Jean Meunier

About this paper

Cite this paper

Ajami Alotaibi, Y., Alghamdi, M., Alotaiby, F. (2010). Speech Recognition System of Arabic Alphabet Based on a Telephony Arabic Corpus. In: Elmoataz, A., Lezoray, O., Nouboud, F., Mammass, D., Meunier, J. (eds) Image and Signal Processing. ICISP 2010. Lecture Notes in Computer Science, vol 6134. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-13681-8_15

DOI: https://doi.org/10.1007/978-3-642-13681-8_15
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-642-13680-1
Online ISBN: 978-3-642-13681-8
eBook Packages: Computer Science, Computer Science (R0)
