Abstract
Automatic recognition of spoken alphabets is one of the difficult tasks in the field of computer speech recognition. In this research, spoken Arabic alphabets are investigated from the speech recognition problem point of view. The system is designed to recognize spelling of an isolated word. The Hidden Markov Model Toolkit (HTK) is used to implement the isolated word recognizer with phoneme based HMM models. In the training and testing phase of this system, isolated alphabets data sets are taken from the telephony Arabic speech corpus, SAAVB. This standard corpus was developed by KACST and it is classified as a noisy speech database. A hidden Markov model based speech recognition system was designed and tested with automatic Arabic alphabets recognition. Four different experiments were conducted on these subsets, the first three trained and tested by using each individual subset, the fourth one conducted on these three subsets collectively. The recognition system achieved 64.06% overall correct alphabets recognition using mixed training and testing subsets collectively.
Chapter PDF
Similar content being viewed by others
References
http://en.wikipedia.org/wiki/List_of_languages_by_number_of_native_speakers , http://en.wikipedia.org/wiki/Arab_world .
Alkhouli, M.: Alaswaat Alaghawaiyah. Daar Alfalah, Jordan (1990) (in Arabic)
Deller, J., Proakis, J., Hansen, J.H.: Discrete-Time Processing of Speech Signal. Macmillan, Basingstoke (1993)
Elshafei, M.: Toward an Arabic Text-to-Speech System. The Arabian Journal for Scince and Engineering 16(4B), 565–583 (1991)
Cole, R., Fanty, M., Muthusamy, Y., Gopalakrishnan, M.: Speaker-Independent Recognition of Spoken English Letters. In: International Joint Conference on Neural Networks (IJCNN), vol. 2, pp. 45–51 (June 1990)
Loizou, P.C., Spanias, A.S.: High-Performance Alphabet Recognition. IEEE Trans. on Speech and Audio Processing 4(6), 430–445 (1996)
Karnjanadecha, M., Zahorian, Z.: Signal Modeling for High-Performance Robust Isolated Word Recognition. IEEE Trans. on Speech and Audio Processing 9(6), 647–654 (2001)
Cosi, P., Hosom, J., Valente, A.: High Performance Telephone Bandwidth Speaker Independent Continuous Digit Recognition. In: Automatic Speech Recognition and Understanding Workshop (ASRU), Trento, Italy (2001)
Hagos, E.: Implementation of an Isolated Word Recognition System. UMI Dissertation Service (1985)
Abdulah, W., Abdul-Karim, M.: Real-time Spoken Arabic Recognizer. Int. J. Electronics 59(5), 645–648 (1984)
Al-Otaibi, A.: Speech Processing. The British Library in Association with UMI (1988)
Rabiner, L.R.: A Tutorial on Hidden Markov Models and Selected Applications in Speech Recognition. Proceedings of the IEEE 77(2), 257–286 (1989)
Juang, B., Rabiner, L.: Hidden Markov Models for Speech Recognition. Technometrics 33(3), 251–272 (1991)
Young, S., Evermann, G., Gales, M., Hain, T., Kershaw, D., Moore, G., Odell, J., Ollason, D., Povey, D., Valtchev, V., Woodland, P.: The HTK Book (for HTK Version 3.4). Cambridge University Engineering Department, Cambridge (2006), http://htk.eng.cam.ac.uk/prot-doc/ktkbook.pdf
Alghamdi, M., Alhargan, F., Alkanhal, M., Alkhairi, A., Aldusuqi, M.: Saudi Accented Arabic Voice Bank (SAAVB). Final report, Computer and Electronics Research Institute, King Abdulaziz City for Science and technology, Riyadh, Saudi Arabia (2003)
Alghamdi, M., El Hadj, Y., Alkanhal, M.: A Manual System to Segment and Transcribe Arabic Speech. In: IEEE International Conference on Signal Processing and Communication (ICSPC’07), Dubai, UAE, November 24-27 (2007)
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2010 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Ajami Alotaibi, Y., Alghamdi, M., Alotaiby, F. (2010). Speech Recognition System of Arabic Alphabet Based on a Telephony Arabic Corpus. In: Elmoataz, A., Lezoray, O., Nouboud, F., Mammass, D., Meunier, J. (eds) Image and Signal Processing. ICISP 2010. Lecture Notes in Computer Science, vol 6134. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-13681-8_15
Download citation
DOI: https://doi.org/10.1007/978-3-642-13681-8_15
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-642-13680-1
Online ISBN: 978-3-642-13681-8
eBook Packages: Computer ScienceComputer Science (R0)