Abstract
Many algorithms for identifying speech models are based, directly or indirectly, on linear predictive coding (LPC) analysis. LPC analysis is tantamount to identifying an autoregressive (AR) model by short-term batch processing of the observations.(1) The LPC model is therefore a special case of the discrete-time linear-in-parameters models treated in the foregoing chapters. Accordingly, many speech processing tasks are natural domains for applying bounded-error methods. This chapter discusses the fundamental principles required to apply optimal-bounded-ellipsoid (OBE) processing to problems in speech analysis, recognition, and coding. The focus is the general problem of LPC identification of speech using OBE methods, including the significant issue of tracking the time-varying parameters of this highly dynamic signal. Potential applications of this work in specific speech-processing endeavors include:
1. General modeling and analysis by predictive methods for spectral (formant) estimation, pitch detection, glottal waveform deconvolution, and pathology detection.(1)
2. Automated recognition of speech in which LPC parameters, or related parameters to which LPC coefficients are converted, are used as features in classifying phones, words, or complete messages in isolated utterances or continuous speech.
3. Speaker recognition, or speaker verification, in which the speaker’s identity is determined or verified, respectively, through parametric feature analysis.
4. Compression and synthesis of speech in which LPC parameters are used in strategies that remove redundancy in the acoustic waveform as a means of bandwidth compression or improving storage requirements. Similarly, spectral compression based on LPC analysis can be used for translation of the spectrum for hearing aids.
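To make concrete the statement that LPC analysis amounts to identifying an AR model by short-term batch processing, the following is a minimal sketch in Python. It uses the conventional autocorrelation (least-squares) formulation, not an OBE algorithm, and it is not taken from the chapter. The function names `lpc_autocorrelation` and `short_term_lpc`, the Hamming window, and the frame length and hop parameters are all illustrative assumptions.

```python
import numpy as np


def lpc_autocorrelation(frame, order):
    """Estimate AR (LPC) coefficients of one frame by the autocorrelation method.

    Assumed model: s(n) = sum_{i=1..order} a[i-1] * s(n-i) + e(n).
    Returns (a, err), where err is the residual energy over the frame.
    Note: an all-zero frame gives a singular system and will raise an error.
    """
    x = np.asarray(frame, dtype=float)
    # Autocorrelation at lags 0..order
    r = np.array([np.dot(x[: len(x) - k], x[k:]) for k in range(order + 1)])
    # Toeplitz normal equations R a = [r(1), ..., r(order)]
    R = np.array([[r[abs(i - j)] for j in range(order)] for i in range(order)])
    a = np.linalg.solve(R, r[1:])
    err = r[0] - np.dot(a, r[1:])
    return a, err


def short_term_lpc(signal, order, frame_len, hop):
    """Apply batch LPC to successive windowed frames (short-term analysis).

    The Hamming window, frame_len, and hop are assumptions for illustration.
    Returns an array of shape (num_frames, order), one coefficient set per frame.
    """
    s = np.asarray(signal, dtype=float)
    window = np.hamming(frame_len)
    coeffs = []
    for start in range(0, len(s) - frame_len + 1, hop):
        a, _ = lpc_autocorrelation(s[start:start + frame_len] * window, order)
        coeffs.append(a)
    return np.array(coeffs)


if __name__ == "__main__":
    # Synthetic AR(2) signal with known coefficients (hypothetical test data)
    rng = np.random.default_rng(0)
    true_a = np.array([1.3, -0.6])
    s = np.zeros(4000)
    e = rng.standard_normal(4000)
    for n in range(2, len(s)):
        s[n] = true_a[0] * s[n - 1] + true_a[1] * s[n - 2] + e[n]
    a_hat, _ = lpc_autocorrelation(s, order=2)
    print("single-batch estimate:", a_hat)
    print("per-frame estimates shape:", short_term_lpc(s, 2, 400, 200).shape)
```

Each frame yields its own coefficient vector, so the sequence of per-frame estimates is what tracks the time-varying parameters in this conventional approach. A bounded-error method would instead assume a bound on e(n) and maintain a set of feasible parameter vectors.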
References
J. R. Deller, Jr., J. G. Proakis, and J. H. L. Hansen, Discrete-Time Processing of Speech Signals, Macmillan, New York (1993).
K. Steiglitz and B. Dickinson, IEEE Trans. Acoust., Speech, Signal Process. 25, 34 (1977).
M. G. Berouti, D. G. Childers, and A. Paige, in: Proceedings of the IEEE International Conference on Acoustics, Speech, and Signal Processing 1, Hartford, CT, pp. 33-36 (1977).
D. Y. Wong, J. D. Markel, and A. H. Gray, IEEE Trans. Acoust., Speech, Signal Process. 27, 350 (1979).
J. R. Deller, Jr., IEEE Trans. Acoust., Speech, Signal Process. 29, 917 (1981).
J. N. Larar, Y. A. Alsaka, and D. G. Childers, in: Proceedings of the IEEE International Conference on Acoustics, Speech, and Signal Processing 2, Tampa, FL, pp. 1089-1092 (1985).
A. K. Krishnamurthy, in: Proceedings of the IEEE International Conference on Acoustics, Speech, and Signal Processing 3, San Diego, CA, pp. 36.3.1-36.3.4 (1984).
D. E. Veeneman and S. L. BeMent, IEEE Trans. Acoust., Speech, Signal Process. 33, 369 (1985).
A. K. Krishnamurthy and D. G. Childers, IEEE Trans. Acoust., Speech, Signal Process. 34, 730 (1985).
Y. Miyoshi, K. Yamamoto, R. Mizoguchi, Y. Masuzo, and O. Kakusho, IEEE Trans. Acoust., Speech, Signal Process. 35, 1233 (1987).
G. P. Pichaché, A Givens Rotation Algorithm for Single Channel Formant Tracking and Glottal Waveform Deconvolution, M.S. Dissertation, Northeastern University, Boston, MA (1988).
L. V. R. Arruda and G. Favier, in: Proceedings of the 9th IFAC/IFORS Symposium on Identification and System Parameter Estimation 2, Budapest, Hungary, pp. 1027-1032 (1991).
J. R. Deller, Jr., M. Nayeri, and S. F. Odeh, Proc. IEEE 81, 813 (1993).
J. R. Deller, Jr., M. Nayeri, and M. S. Liu, Int. J. Adapt. Control Signal Process. 8, 43 (1994).
J. R. Deller, Jr. and T. C. Luk, Comput. Speech Lang. 3, 301 (1989).
R. Lozano-Leal and R. Ortega, Automatica 23, 247 (1987).
S. M. Veres and J. P. Norton, Int. J. Control 50, 639 (1989).
L. Ljung and T. Söderström, Theory and Practice of Recursive Identification, MIT Press, Cambridge, MA (1983).
G. H. Golub and C. F. van Loan, Matrix Computations, 2nd Ed., Johns Hopkins Univ. Press, Baltimore, MD (1989).
W. M. Gentleman and H. T. Kung, in: Proceedings of the Society of Photo-Optical Instrumentation Engineers: Real Time Signal Processing IV, San Diego, CA, pp. 19-26 (1981).
J. G. McWhirter, in: Proceedings of the Society of Photo-Optical Instrumentation Engineers: Real Time Signal Processing IV, San Diego, CA, pp. 105-112 (1983).
J. R. Deller, Jr. and D. Hsu, IEEE Trans. Circuits Systems 34, 782 (1987).
A. K. Rao and Y. F. Huang, IEEE Trans. Signal Process. 41, 1140 (1993).
Y. F. Huang and J. R. Deller, Jr., in: Proceedings of the 39th Annual Allerton Conference on Communications, Control, and Computing, Monticello, IL, pp. 50-59 (1992).
S. F. Odeh, Algorithms and Architectures for Adaptive Set Membership-based Signal Processing, Ph.D. Dissertation, Michigan State University, East Lansing, MI (1990).
J. P. Norton and S. H. Mo, Math. Comput. Simul. 32, 527 (1990).
S. F. Odeh and J. R. Deller, Jr., in: Proceedings of the IEEE International Conference on Acoustics, Speech, and Signal Processing 5, Albuquerque, NM, pp. 2419-2422 (1990).
J. R. Deller, Jr. and S. F. Odeh, in: Proceedings of the 9th IFAC/IFORS Symposium on Identification and System Parameter Estimation 2, Budapest, Hungary, pp. 1044-1049 (1991).
J. R. Deller, Jr., IEEE Trans. Acoust., Speech, Signal Process. 37, 1432 (1989).
S. Dasgupta and Y. F. Huang, IEEE Trans. Inf. Theory 33, 383 (1987).
J. R. Deller, Jr. and S. F. Odeh, in: IEEE International Conference on Acoustics, Speech, and Signal Processing 2, Glasgow, Scotland, pp. 1067-1070 (1989).
Copyright information
© 1996 Springer Science+Business Media New York
About this chapter
Cite this chapter
Deller, J.R. (1996). Applications of OBE Algorithms to Speech Processing. In: Milanese, M., Norton, J., Piet-Lahanier, H., Walter, É. (eds) Bounding Approaches to System Identification. Springer, Boston, MA. https://doi.org/10.1007/978-1-4757-9545-5_29
DOI: https://doi.org/10.1007/978-1-4757-9545-5_29
Publisher Name: Springer, Boston, MA
Print ISBN: 978-1-4757-9547-9
Online ISBN: 978-1-4757-9545-5
eBook Packages: Springer Book Archive