A Speech Recognition System using an Auditory Model and TOM Neural Network
This paper is devoted to a neurobiologically plausible approach for the design of speech processing systems. The temporal organization map (TOM) neural net model is a connectionist model for time representation. The definition of a generic neural unit, inspired by the neurobiological model of the cortical column, allows the model to be used for problems including the temporal dimension. In the framework of automatic speech recognition, TOM has been previously tested with conventional techniques of signal processing. An auditory model as front-end processor is now used with TOM, in order to test the efficiency and the accuracy of a physiologically based speech recognition system. Preliminary results axe presented for speaker-dependent and speaker-independent speech recognition experiments. The interest of auditory model is the possibility to develop more valuable processing and communication strategies between TOM and the front-end processor, including afferent and efferent information flow.
KeywordsSpeech Recognition Automatic Speech Recognition Basilar Membrane Automatic Gain Control Auditory Nerve Fibre
Unable to display preview. Download preview PDF.
- W. A. Ainsworth. Auditory mechanisms for speech perception. In Proc. of Euro speech’95, pages 171–178, Madrid, Spain, 1995.Google Scholar
- F. Berthommier. Intégration neuronale dans le système auditif. Modélisation de réseaux neuronaux temporo-dépendants. PhD thesis, Université Joseph Fourier — Grenoble I, 1992.Google Scholar
- Y. Burnod. An adaptive neural network: The cerebral cortex. Masson Paris, 1988.Google Scholar
- S. Durand and F. Alexandre. Tom, a new temporal neural net architecture for speech signal processing. In IEEE International Conference on Acoustic Speech and Signal Processing, Atlanta, USA, 1996.Google Scholar
- B. Fritzke. A growing neural gas network learns topologies. In G. Tesauro, D.S. Touretzky, and T.K. Leen, editors, Advances in Neural Information Processing Systems 7. MIT Press, Cambridge MA, 1995.Google Scholar
- Y. Gao, T. Huang, S. Chen, and J. P. Haton. Auditory model based speech processing. In Proc. of ICSLP, pages 73–76, Alberta, Canada, 1992.Google Scholar
- T. Kohonen. Self-Organization and Associative Memory. Springer Series in Information Sciences. Springer-Verlag, third edition, 1989.Google Scholar
- T. M. Martinetz and K. J. Schulten. A “neural-gas” network learns topologies. In T. Kohonen, K. Mäkisara, O. Simula, and J. Kangas, editors, Artificial Neural Network, pages 397–402. North-Holland, Amsterdam, 1991.Google Scholar
- S. Seneff. A joint synchrony/mean-rate model of auditory speech processing. Journal of Phonetics, 16:55–76, 1988.Google Scholar