Wavelet network for recognition system of Arabic word
Driven by the development of new information technologies, research in speech communication is expanding rapidly. Several disciplines and skills interact to improve the performance of Human Machine Communication (HMC) systems, and various techniques, including Hidden Markov Models (HMM) and Neural Networks (NN), are used to that end.
In this paper, we propose a new approach to modelling acoustic units and a new speech recognition method, aimed in particular at the recognition of Arabic words, that is adapted to this type of modelling based on Wavelet Networks (WN). The new recognition system is a hybrid classifier: it uses an NN as its general model, with wavelets serving as the activation functions.
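To make the idea of wavelets acting as activation functions concrete, the following is a minimal sketch of the output of a one-hidden-layer wavelet network for a scalar input. The Mexican hat mother wavelet, the function names, and the parameter names (`weights`, `translations`, `dilations`) are assumptions chosen for illustration; the abstract does not state which wavelet family or network topology is used.

```python
import math


def mexican_hat(t):
    """Mexican hat (Ricker) mother wavelet, unnormalised.

    Assumption: chosen only for illustration; the abstract does not
    name the wavelet family used by the authors.
    """
    return (1.0 - t * t) * math.exp(-0.5 * t * t)


def wavelet_network(x, weights, translations, dilations):
    """Output of a one-hidden-layer wavelet network at scalar input x.

    Each hidden unit applies the mother wavelet to a translated and
    dilated copy of the input; the output is the weighted sum.
    """
    return sum(
        w * mexican_hat((x - b) / a)
        for w, b, a in zip(weights, translations, dilations)
    )


# Hypothetical parameters: two hidden units.
y = wavelet_network(0.5, weights=[1.0, -0.5],
                    translations=[0.0, 1.0], dilations=[1.0, 2.0])
```

Compared with a conventional NN, the only structural change in this sketch is that each hidden unit carries its own translation and dilation and uses a wavelet in place of a sigmoid-type activation.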
Our approach to speech recognition is divided into two phases: training and recognition. The training phase relies on an audio corpus. After all training signals are converted from their original format to a specific parameterisation, each acoustic vector is modelled by a WN, so that one WN is generated for every training signal. Each model is intended to refine the vectors and cover all the properties of the signal. The recognition phase has three steps. The first extracts features from the input to be recognized. The second estimates the resulting vectors from every training WN. The third evaluates the distance between the vector to be recognized and each reconstructed vector.
The results obtained show that our WN-based system is very competitive with HMM-based systems.
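The three recognition steps can be sketched as a nearest-reconstruction classifier. This is an illustrative reading of the description above, not the authors' implementation: the Mexican hat wavelet, the representation of a trained network as a list of `(weight, translation, dilation)` triples, the Euclidean distance, and the labels `"word_a"` and `"word_b"` are all assumptions. Feature extraction and training are taken as already done.

```python
import math


def mexican_hat(t):
    """Mexican hat mother wavelet (assumed for illustration)."""
    return (1.0 - t * t) * math.exp(-0.5 * t * t)


def reconstruct(network, n_samples):
    """Evaluate a trained network at n_samples evenly spaced points in [0, 1].

    `network` is a list of (weight, translation, dilation) triples;
    n_samples must be at least 2.
    """
    points = [i / (n_samples - 1) for i in range(n_samples)]
    return [
        sum(w * mexican_hat((x - b) / a) for w, b, a in network)
        for x in points
    ]


def recognize(features, trained_networks):
    """Return the label whose reconstructed vector is closest to `features`.

    `trained_networks` maps a label to the network trained on it.
    """
    def distance(label):
        estimate = reconstruct(trained_networks[label], len(features))
        return math.dist(features, estimate)  # Euclidean distance

    return min(trained_networks, key=distance)


# Hypothetical trained models for two words.
trained = {
    "word_a": [(1.0, 0.2, 0.1)],
    "word_b": [(1.0, 0.8, 0.1)],
}
# A feature vector that lies close to the second model's reconstruction.
features = [v + 0.01 for v in reconstruct(trained["word_b"], 16)]
label = recognize(features, trained)
```

In this sketch the decision rule is simply the minimum distance over all training networks; a real system would first need the parameterisation and training stages that the abstract mentions but does not detail.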
Keywords: Speech communication; Hidden Markov models; Speech recognition; Recognition of Arabic words; Acoustic vector; Wavelet; Wavelet network