Abstract
A new input data structure is developed for speech recognition with back-propagation neural networks. It improves shift invariance when recognizing signal data that takes a form such as a formant. The preliminary development of this data structure, piecewise linear regression vectors, is reported. The new structure reduces the amount of data presented to the network by as much as an order of magnitude, which gives a computational advantage in execution speed.
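The abstract does not spell out how the vectors are built, so the following is only a minimal sketch of one plausible reading: cut a sampled signal track into fixed-length segments, fit a least-squares line to each, and pass the resulting (slope, intercept) pairs to the network instead of the raw samples. The function names `fit_line` and `piecewise_linear_vectors`, the fixed segment length, and the choice of (slope, intercept) as the vector are all assumptions made for illustration, not the paper's method.

```python
def fit_line(ys):
    """Least-squares slope and intercept for ys sampled at x = 0, 1, ..., len(ys) - 1."""
    n = len(ys)
    mean_x = (n - 1) / 2.0
    mean_y = sum(ys) / n
    sxx = sum((x - mean_x) ** 2 for x in range(n))
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in enumerate(ys))
    slope = sxy / sxx
    return slope, mean_y - slope * mean_x


def piecewise_linear_vectors(signal, segment_len):
    """Hypothetical helper: one (slope, intercept) pair per full segment.

    Assumption: segments are consecutive, non-overlapping, and of equal
    length; trailing samples that do not fill a segment are dropped.
    """
    if segment_len < 2:
        raise ValueError("segment_len must be at least 2 to fit a line")
    vectors = []
    for start in range(0, len(signal) - segment_len + 1, segment_len):
        vectors.append(fit_line(signal[start:start + segment_len]))
    return vectors


# Illustrative input: a synthetic straight-line track of 100 samples.
signal = [2.0 * t + 1.0 for t in range(100)]
vectors = piecewise_linear_vectors(signal, segment_len=20)
# 100 raw samples become 5 pairs, i.e. 10 numbers presented to the network.
```

With a segment length of 20, each segment of 20 samples is replaced by two numbers, which is the kind of order-of-magnitude reduction the abstract mentions. In this sketch the slope of a segment is unchanged when a constant is added to every sample, so only the intercept carries the absolute level.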
Copyright information
© 1991 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Makowski, G. (1991). A more flexible method for recognizing signals using back propagation: Piecewise linear regression vectors. In: Sherwani, N.A., de Doncker, E., Kapenga, J.A. (eds) Computing in the 90's. Great Lakes CS 1989. Lecture Notes in Computer Science, vol 507. Springer, New York, NY. https://doi.org/10.1007/BFb0038480
DOI: https://doi.org/10.1007/BFb0038480
Publisher Name: Springer, New York, NY
Print ISBN: 978-0-387-97628-0
Online ISBN: 978-0-387-34815-5
eBook Packages: Springer Book Archive