Speech Recognition System of Arabic Alphabet Based on a Telephony Arabic Corpus

  • Yousef Ajami Alotaibi
  • Mansour Alghamdi
  • Fahad Alotaiby
Part of the Lecture Notes in Computer Science book series (LNCS, volume 6134)

Abstract

Automatic recognition of spoken alphabets is one of the difficult tasks in the field of computer speech recognition. In this research, spoken Arabic alphabets are investigated from the speech recognition problem point of view. The system is designed to recognize spelling of an isolated word. The Hidden Markov Model Toolkit (HTK) is used to implement the isolated word recognizer with phoneme based HMM models. In the training and testing phase of this system, isolated alphabets data sets are taken from the telephony Arabic speech corpus, SAAVB. This standard corpus was developed by KACST and it is classified as a noisy speech database. A hidden Markov model based speech recognition system was designed and tested with automatic Arabic alphabets recognition. Four different experiments were conducted on these subsets, the first three trained and tested by using each individual subset, the fourth one conducted on these three subsets collectively. The recognition system achieved 64.06% overall correct alphabets recognition using mixed training and testing subsets collectively.

Keywords

Arabic alphabets SAAVB HMM Recognition Telephony corpus 

References

  1. 1.
  2. 2.
    Alkhouli, M.: Alaswaat Alaghawaiyah. Daar Alfalah, Jordan (1990) (in Arabic)Google Scholar
  3. 3.
    Deller, J., Proakis, J., Hansen, J.H.: Discrete-Time Processing of Speech Signal. Macmillan, Basingstoke (1993)Google Scholar
  4. 4.
    Elshafei, M.: Toward an Arabic Text-to-Speech System. The Arabian Journal for Scince and Engineering 16(4B), 565–583 (1991)MathSciNetGoogle Scholar
  5. 5.
    Cole, R., Fanty, M., Muthusamy, Y., Gopalakrishnan, M.: Speaker-Independent Recognition of Spoken English Letters. In: International Joint Conference on Neural Networks (IJCNN), vol. 2, pp. 45–51 (June 1990)Google Scholar
  6. 6.
    Loizou, P.C., Spanias, A.S.: High-Performance Alphabet Recognition. IEEE Trans. on Speech and Audio Processing 4(6), 430–445 (1996)CrossRefGoogle Scholar
  7. 7.
    Karnjanadecha, M., Zahorian, Z.: Signal Modeling for High-Performance Robust Isolated Word Recognition. IEEE Trans. on Speech and Audio Processing 9(6), 647–654 (2001)CrossRefGoogle Scholar
  8. 8.
    Cosi, P., Hosom, J., Valente, A.: High Performance Telephone Bandwidth Speaker Independent Continuous Digit Recognition. In: Automatic Speech Recognition and Understanding Workshop (ASRU), Trento, Italy (2001)Google Scholar
  9. 9.
    Hagos, E.: Implementation of an Isolated Word Recognition System. UMI Dissertation Service (1985)Google Scholar
  10. 10.
    Abdulah, W., Abdul-Karim, M.: Real-time Spoken Arabic Recognizer. Int. J. Electronics 59(5), 645–648 (1984)CrossRefGoogle Scholar
  11. 11.
    Al-Otaibi, A.: Speech Processing. The British Library in Association with UMI (1988)Google Scholar
  12. 12.
    Rabiner, L.R.: A Tutorial on Hidden Markov Models and Selected Applications in Speech Recognition. Proceedings of the IEEE 77(2), 257–286 (1989)CrossRefGoogle Scholar
  13. 13.
    Juang, B., Rabiner, L.: Hidden Markov Models for Speech Recognition. Technometrics 33(3), 251–272 (1991)MATHCrossRefMathSciNetGoogle Scholar
  14. 14.
    Young, S., Evermann, G., Gales, M., Hain, T., Kershaw, D., Moore, G., Odell, J., Ollason, D., Povey, D., Valtchev, V., Woodland, P.: The HTK Book (for HTK Version 3.4). Cambridge University Engineering Department, Cambridge (2006), http://htk.eng.cam.ac.uk/prot-doc/ktkbook.pdf
  15. 15.
    Alghamdi, M., Alhargan, F., Alkanhal, M., Alkhairi, A., Aldusuqi, M.: Saudi Accented Arabic Voice Bank (SAAVB). Final report, Computer and Electronics Research Institute, King Abdulaziz City for Science and technology, Riyadh, Saudi Arabia (2003)Google Scholar
  16. 16.
    Alghamdi, M., El Hadj, Y., Alkanhal, M.: A Manual System to Segment and Transcribe Arabic Speech. In: IEEE International Conference on Signal Processing and Communication (ICSPC’07), Dubai, UAE, November 24-27 (2007)Google Scholar

Copyright information

© Springer-Verlag Berlin Heidelberg 2010

Authors and Affiliations

  • Yousef Ajami Alotaibi
    • 1
  • Mansour Alghamdi
    • 2
  • Fahad Alotaiby
    • 3
  1. 1.Computer Engineering DepartmentKing Saud UniversityRiyadhSaudi Arabia
  2. 2.King Abdulaziz City for Science and TechnologyRiyadhSaudi Arabia
  3. 3.Department of Electrical EngineeringKing Saud UniversityRiyadhSaudi Arabia

Personalised recommendations