A more flexible method for recognizing signals using back propagation: Piecewise linear regression vectors
Using the neural network architecture of back propagation applied to speech recognition, a new input data structure is developed which improves shift invariance when recognizing signal data in the form such as a formant. The preliminary development of the data structure, piecewise linear regression vectors, is reported. The new input data structure reduces the amount of data presented to the network by as much as an order of magnitude, giving a computational advantage in execution speed.
KeywordsSpeech Recognition Back Propagation Time Slice Test Pattern Dynamic Time Warping
Unable to display preview. Download preview PDF.
- Y. Bengio and R. De Mori. Speaker normalization and automatic speech recognition using spectral lines and neural networks. In Proceedings of the 1988 Connectionist Models Summer School, pages 388–397. Carnegie Mellon University, 1988.Google Scholar
- Michael A. Franzini. Learning to recognize spoken words: A study in connectionist speech recognition. In Proceedings of the 1988 Connectionist Models Summer School, pages 407–416. Carnegie Mellon University, 1988.Google Scholar
- Richard P. Lippmann. Neural network classifiers for speech recognition. The Lincoln Laboratory Journal, Vol 1, No 1:107–124, 1988.Google Scholar
- Richard P. Lippmann. Review of neural networks for speech recognition. Neural Computation, Vol 1, No 1:1–38, 1989.Google Scholar
- Thomas W. Parsons. Voice and Speech Processing. McGraw-Hill, 1987.Google Scholar
- M. L. Rossen and J. A. Anderson. Representational issues in a neural network model of syllable recognition. In International Joint Conference on Neural Networks, pages I.19–I.21. IEEE, 1989.Google Scholar
- David E. Rumelhart and James L. McClelland. Explorations in Parallel Distributed Processing. MIT Press, 1988.Google Scholar
- David E. Rumelhart and James L. McClelland. Parallel Distributed Processing, Vol I,II. MIT Press, 1988.Google Scholar
- Terrence J. Sejnowski and Charles R. Rosenberg. Nettalk: a parallel network that learns to read aloud. Technical Report JHU/EECS-86/01, 32pp., The Johns Hopkins University, Electrical Engineering and Computer Science, Johns Hopkins, 1986.Google Scholar
- Alex Waibel, H. Sawai, and K. Shikano. Modularity and scaling in large phonemic neural networks. Technical Report TR-I-0034, 25pp, ATR Interpreting Telephony Research Laboratories, Japan, 1988.Google Scholar