Abstract
Multilayer perceptrons generate a posteriori probabilities related to emission probabilities of Hidden Markov Models through Bayes rule. This property is used to improve the discrimination of H M M. Moreover, it gives rise to many statistical interpretations which can be cast in neural architectures for nonlinear prediction and triphone probability estimation.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Preview
Unable to display preview. Download preview PDF.
References
F. Jelinek, Continuous Recognition by Statistical Methods, Proceedings IEEE, vol. 64 no 4, pp.532–555, 1976.
A. Waibel, T. Hanazawa, G. Hinton, K. Shikano, K. Lang, Phoneme Recognition: Neural Networks vs. Hidden Markov Models, Proc. Intl. Conf. on Acoustics, Speech and Signal Processing, New York, vol.1, pp107–110, 1988.
H. Bourlard & C.J. Wellekens, Multilayer Perceptrons and Speech Recognition, IEEE Proc. of the First International Conf. on Neural Networks, vol IV, pp.407–416, San Diego, CA, 1987.
H. Bourlard & C.J. Wellekens, Links between Markov Models and Multilayer Perceptrons, Computer, Speech and Language, vol.3, pp. 1–19, 1989
R.P. Lippmann, Neural Network Classifiers Estimate Bayesian a Posteriori Probabilities, Neural Computation, vol.3, no 4, pp. 461–484, 1991.
R.P. Lippmann, An Introduction to Computing with Neural Nets, IEEE Acoustic, Speech, and Signal Processing Magazine, vol. 4, no 2, pp.4–22, 1987.
M. Cohen, H. Franco, N. Morgan, D.E. Rumelhart & V. Abrash, Hybrid Neural Network/Hidden Markov Model Continuous Speech Recognition, Proc. Intl. Conf. on Spoken Language Processing, vol2, pp. 915–918, Banff, Canada, 1992.
H. Bourlard, Continuous Speech Recognition: From Hidden Markov Models to Artificial Neural Networks, Doctoral Dissertation, Faculté Polytechnique de Mons, Belgium, Feb.1992.
H. Bourlard, N. Morgan, Chuck Wooters & Steve Renais, CDNN: A Context Dependent Neural Network for Continuous Speech Recognition, IEEE Proc. Intl. Conf. on Acoustics, Speech and Signal Processing San Francisco, CA, vol.2 pp. 349–352, 1992.
C.J. Wellekens, Explicit Time Correlation in Hidden Markov Models in Speech Recognition, Proc. IEEE Conference on Audio, Speech and Signal Processing, pp.384–387, April 1987.
E. Levin, Speech Recognition Using Hidden Control Neural Network Architecture, IEEE Proc. Intl. Conf. on Acoustics, Speech, and Signal Processing, pp. 433–436, Albuquerque, NM, 1990.
S. Furui, Speaker Independent Isolated Word Recognizer Using Dynamic Features of Speech Spectrum, IEEE Trans. on Acoustic, Speech, ans Signal Processing, vol.34, no 1, pp 52–59, 1986.
M. Minsky & S. Papert, Perceptrons, Cambridge, MA: MIT Press, 1969.
D.E. Rumelhart, G.E. Hinton & R.J. Williams, Parallel Distributed Processing. Exploration of the Microstructure of Cognition, vol.1: Foundations, Ed. D.E. Rumelhart & J.L. McClelland, MIT Press, 1986.
N. Morgan & H. Bourlard, Continuous Speech Recognition Using Multilayer Perceptrons with Hidden Markov Models, IEEE Intl. Conf. on Audio, Speech and Signal Processing, pp. 413–416, Albuquerque, New Mexico, 1990.
N. Morgan, J. Beck, P. Kohn, J. Bilmes, E. Allman & J. Beer, The Ring Array Processor (RAP): A multiprocessing peripheral for connectionist applications, Journal of Parallel and Distributed Computing, 1992.
L.R. Rabiner, B.H. Juang, “Fundamentals of Speech Recognition”, Prentice Hall, 1993.
L.R. Rabiner, R.W. Schafer, “Digital Processing of Speech Signals”, Prentice Hall, 1978.
J.R. Deller, J.G. Proakis, J.H. L.Hansen,”Discrete-Time Processing of Speech Signals”, Mc Millan, 1993.
H. Hermansky, Perceptual Linear Predictive (PLP) Analysis of Speech, Jl of Acoustical Soc. of America, vol 87, no 4.
H.Hermansky, J-C Junqua, Optimization of Perceptually-Based ASR Front-End, ICASSP 88, pp. 219–222.
Y. Linde, A. Buzo, R.M. Gray, An Algorithm for Vector Quantizer Design, IEEE Trans. on Communications, vol 28, no 1, pp. 84–95, 1980.
H. Ney, The Use of One-Stage Dynamic Programming Algorithm for Connected Word Recognition, IEEE Trans. on Acoustic, Speech and Signal Processing, pp 833–836. (1984)
J.S. Bridie, M.D. Brown, R.M. Chamberlain, An Algorithm for Connected Word Recognition, ICASSP 1982, pp.899–902.
H. Bourlard, C.J. Wellekens, Links between Markov Models and Multilayer Perceptrons, IEEE Trans. on Pattern Analysis and Machine Intelligence, vol 12, no 12, pp. 1167–1178.
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 1994 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Wellekens, C.J. (1994). Improved Hidden Markov Models for Speech Recognition Through Neural Network Learning. In: Cherkassky, V., Friedman, J.H., Wechsler, H. (eds) From Statistics to Neural Networks. NATO ASI Series, vol 136. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-79119-2_11
Download citation
DOI: https://doi.org/10.1007/978-3-642-79119-2_11
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-642-79121-5
Online ISBN: 978-3-642-79119-2
eBook Packages: Springer Book Archive