Applications of OBE Algorithms to Speech Processing

doi:10.1007/978-1-4757-9545-5_29

John R. Deller Jr.

Abstract

Many algorithms for identification of speech models are directly or indirectly based on linear predictive coding (LPC) analysis. LPC analysis is tantamount to identification of an autoregressive (AR) model using short-term batch processing of the observations.⁽¹⁾ The LPC model, therefore, is a special case of the discrete-time linear-in-parameters models treated in foregoing chapters. Accordingly, many speech processing tasks represent natural domains for applying bounded-error methods. This chapter discusses the fundamental principles required to apply optimal-bounded-ellipsoid (OBE) processing to problems in speech analysis, recognition, and coding. The focus is the general problem of LPC identification of speech using OBE methods, including the significant issue of tracking the time-varying parameters of this very dynamic signal. Potential applications of this work in specific speech-processing endeavors include:

1. General modeling and analysis by predictive methods for spectral (formant) estimation, pitch detection, glottal waveform deconvolution, and pathology detection.⁽¹⁾
2. Automated recognition of speech, in which LPC parameters, or related parameters to which LPC coefficients are converted, are used as features in classifying phones, words, or complete messages in isolated utterances or continuous speech.
3. Speaker recognition, or speaker verification, in which the speaker’s identity is determined or verified, respectively, through parametric feature analysis.
4. Compression and synthesis of speech, in which LPC parameters are used in strategies that remove redundancy in the acoustic waveform as a means of bandwidth compression or of reducing storage requirements. Similarly, spectral compression based on LPC analysis can be used for translation of the spectrum for hearing aids.
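
As a rough illustration of the statement that LPC analysis amounts to identifying an AR model by short-term batch processing, the following Python sketch estimates AR coefficients frame by frame using the autocorrelation method and the Levinson-Durbin recursion. It shows conventional least-squares LPC only, not the OBE algorithms of the chapter. All function names, the rectangular window, the non-overlapping frames, and the synthetic AR(2) test signal are assumptions made for this example.

```python
import random

def autocorrelation(frame, max_lag):
    """Short-term autocorrelation r[0..max_lag] of one frame (rectangular window)."""
    n = len(frame)
    return [sum(frame[t] * frame[t - k] for t in range(k, n))
            for k in range(max_lag + 1)]

def levinson_durbin(r, order):
    """Return (a, err) with s[n] ~ sum_i a[i-1] * s[n-i] and err the residual energy."""
    a = [0.0] * (order + 1)          # a[0] is unused
    err = r[0]
    for i in range(1, order + 1):
        if err <= 0.0:               # silent or degenerate frame: stop early
            break
        # Reflection coefficient for model order i
        k = (r[i] - sum(a[j] * r[i - j] for j in range(1, i))) / err
        prev = a[:]
        a[i] = k
        for j in range(1, i):
            a[j] = prev[j] - k * prev[i - j]
        err *= 1.0 - k * k
    return a[1:], err

def lpc_by_frames(signal, order, frame_len):
    """Batch LPC: one AR coefficient vector per non-overlapping frame."""
    result = []
    for start in range(0, len(signal) - frame_len + 1, frame_len):
        frame = signal[start:start + frame_len]
        coeffs, _ = levinson_durbin(autocorrelation(frame, order), order)
        result.append(coeffs)
    return result

def synth_ar2(n, a1=1.3, a2=-0.6, seed=0):
    """Hypothetical test signal: stable AR(2) process driven by white Gaussian noise."""
    rng = random.Random(seed)
    s = [0.0, 0.0]
    for _ in range(n):
        s.append(a1 * s[-1] + a2 * s[-2] + rng.gauss(0.0, 1.0))
    return s[2:]

if __name__ == "__main__":
    signal = synth_ar2(4000)
    for coeffs in lpc_by_frames(signal, order=2, frame_len=1000):
        print([round(c, 3) for c in coeffs])
```

Each printed vector is the estimate for one frame. Because every frame is processed independently, the estimates can follow slow changes in the underlying parameters, which is the tracking issue the abstract raises for recursive bounded-error methods.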

References

1. J. R. Deller, Jr., J. G. Proakis, and J. H. L. Hansen, Discrete-Time Processing of Speech Signals, Macmillan, New York (1993).
2. K. Steiglitz and B. Dickinson, IEEE Trans. Acoust., Speech, Signal Process. 25, 34 (1977).
3. M. G. Berouti, D. G. Childers, and A. Paige, in: Proceedings of the IEEE International Conference on Acoustics, Speech, and Signal Processing 1, Hartford, CT, pp. 33-36 (1977).
4. D. Y. Wong, J. D. Markel, and A. H. Gray, IEEE Trans. Acoust., Speech, Signal Process. 27, 350 (1979).
5. J. R. Deller, Jr., IEEE Trans. Acoust., Speech, Signal Process. 29, 917 (1981).
6. J. N. Larar, Y. A. Alsaka, and D. G. Childers, in: Proceedings of the IEEE International Conference on Acoustics, Speech, and Signal Processing 2, Tampa, FL, pp. 1089-1092 (1985).
7. A. K. Krishnamurthy, in: Proceedings of the IEEE International Conference on Acoustics, Speech, and Signal Processing 3, San Diego, CA, pp. 36.3.1-36.3.4 (1984).
8. D. E. Veeneman and S. L. BeMent, IEEE Trans. Acoust., Speech, Signal Process. 33, 369 (1985).
9. A. K. Krishnamurthy and D. G. Childers, IEEE Trans. Acoust., Speech, Signal Process. 34, 730 (1985).
10. Y. Miyoshi, K. Yamamoto, R. Mizoguchi, Y. Masuzo, and O. Kakusho, IEEE Trans. Acoust., Speech, Signal Process. 35, 1233 (1987).
11. G. P. Pichaché, A Givens Rotation Algorithm for Single Channel Format Tracking and Glottal Waveform Deconvolution, M.S. Dissertation, Northeastern University, Boston, MA (1988).
12. L. V. R. Arruda and G. Favier, in: Proceedings of the 9th IFAC/IFORS Symposium on Identification and System Parameter Estimation 2, Budapest, Hungary, pp. 1027-1032 (1991).
13. J. R. Deller, Jr., M. Nayeri, and S. F. Odeh, Proc. IEEE 81, 813 (1993).
14. J. R. Deller, Jr., M. Nayeri, and M. S. Liu, Int. J. Autom. Control Signal Process. 8, 43 (1994).
15. J. R. Deller, Jr. and T. C. Luk, Comput. Speech Lang. 3, 301 (1989).
16. R. Lozano-Leal and R. Ortega, Automatica 23, 247 (1987).
17. S. M. Veres and J. P. Norton, Int. J. Control 50, 639 (1989).
18. L. Ljung and T. Söderström, Theory and Practice of Recursive Identification, MIT Press, Cambridge, MA (1983).
19. G. H. Golub and C. F. van Loan, Matrix Computations, 2nd Ed., Johns Hopkins Univ. Press, Baltimore, MD (1989).
20. W. M. Gentleman and H. T. Kung, in: Proceedings of the Society of Photo-Optical Instrumentation Engineers: Real Time Signal Processing IV, San Diego, CA, pp. 19-26 (1981).
21. J. G. McWhirter, in: Proceedings of the Society of Photo-Optical Instrumentation Engineers: Real Time Signal Processing IV, San Diego, CA, pp. 105-112 (1983).
22. J. R. Deller, Jr. and D. Hsu, IEEE Trans. Circuits Systems 34, 782 (1987).
23. A. K. Rao and Y. F. Huang, IEEE Trans. Signal Process. 41, 1140 (1993).
24. Y. F. Huang and J. R. Deller, Jr., in: Proceedings of the 39th Annual Allerton Conference on Communications, Control, and Computing, Monticello, IL, pp. 50-59 (1992).
25. S. F. Odeh, Algorithms and Architectures for Adaptive Set Membership-based Signal Processing, Ph.D. Dissertation, Michigan State University, East Lansing, MI (1990).
26. J. P. Norton and S. H. Mo, Math. Comput. Simul. 32, 527 (1990).
27. S. F. Odeh and J. R. Deller, Jr., in: Proceedings of the IEEE International Conference on Acoustics, Speech, and Signal Processing 5, Albuquerque, NM, pp. 2419-2422 (1990).
28. J. R. Deller, Jr. and S. F. Odeh, in: Proceedings of the 9th IFAC/IFORS Symposium on Identification and System Parameter Estimation 2, Budapest, Hungary, pp. 1044-1049 (1991).
29. J. R. Deller, Jr., IEEE Trans. Acoust., Speech, Signal Process. 37, 1432 (1989).
30. S. Dasgupta and Y. F. Huang, IEEE Trans. Inf. Theory 33, 383 (1987).
31. J. R. Deller, Jr. and S. F. Odeh, in: Proceedings of the IEEE International Conference on Acoustics, Speech, and Signal Processing 2, Glasgow, Scotland, pp. 1067-1070 (1989).

Author information

Authors and Affiliations

Department of Electrical Engineering, Michigan State University, East Lansing, MI, 48824, USA
John R. Deller Jr.

Editor information

Editors and Affiliations

Dipartimento di Automatica e Informatica, Politecnico di Torino, Torino, 10129, Italy
Mario Milanese
School of Electronic and Electrical Engineering, University of Birmingham, Edgbaston, Birmingham, B15 2TT, UK
John Norton
Direction des Études de Synthèse, SM Office National d’Études et de Recherches Aérospatiales, Châtillon Cedex, F-92322, France
Hélène Piet-Lahanier
Laboratoire des Signaux et Systèmes, CNRS-École Supérieure d’Électricité, Gif-sur-Yvette Cedex, 91192, France
Éric Walter

About this chapter

Cite this chapter

Deller, J.R. (1996). Applications of OBE Algorithms to Speech Processing. In: Milanese, M., Norton, J., Piet-Lahanier, H., Walter, É. (eds) Bounding Approaches to System Identification. Springer, Boston, MA. https://doi.org/10.1007/978-1-4757-9545-5_29

DOI: https://doi.org/10.1007/978-1-4757-9545-5_29
Publisher Name: Springer, Boston, MA
Print ISBN: 978-1-4757-9547-9
Online ISBN: 978-1-4757-9545-5
eBook Packages: Springer Book Archive