Skip to main content

Applications of OBE Algorithms to Speech Processing

  • Chapter
Bounding Approaches to System Identification
  • 397 Accesses

Abstract

Many algorithms for identification of speech models are directly or indirectly based on linear predictive coding (LPC) analysis. LPC analysis is tantamount to identification of an autoregressive (AR) model using short-term batch processing of the observations.(1) The LPC model, therefore, is a special case of the discrete-time linear-in-parameters models treated in foregoing chapters. Accordingly, many speech processing tasks represent natural domains for applying bounded-error methods. This chapter discusses the fundamental principles requisite to application of optimal-bounded-ellipsoid (OBE) processing to problems in speech analysis, recognition and coding. The focus is the general problem of LPC identification of speech using OBE methods, including the significant issue of tracking the time-varying parameters of this very dynamic signal. Potential applications of this work in specific speech-processing endeavors include:

  1. 1.

    General modeling and analysis by predictive methods for spectral (formant) estimation, pitch detection, glottal waveform deconvolution, and pathology detection.(1)

  2. 2.

    Automated recognition of speech in which LPC parameters, or related parameters to which LPC coefficients are converted, are used as features in classifying phones, words, or complete messages in isolated utterances or continuous speech.

  3. 3.

    Speaker recognition, or speaker verification, in which the speaker’s identity is determined or verified, respectively, through parametric feature analysis.

  4. 4.

    Compression and synthesis of speech in which LPC parameters are used in strategies which remove redundancy in the acoustic waveform as a means of bandwidth compression or improving storage requirements. Similarly, spectral compression based on LPC analysis can be used for translation of the spectrum for hearing aids.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 169.00
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 219.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info
Hardcover Book
USD 219.99
Price excludes VAT (USA)
  • Durable hardcover edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.

References

  1. J. R. Deller, Jr., J. G. Proakis, and J. H. L. Hansen, Discrete-Time Processing of Speech Signals, Macmillan, New York (1993).

    Google Scholar 

  2. K. Steiglitz and B. Dickinson, IEEE Trans. Acoust., Speech, Signal Process. 25, 34 (1977).

    Article  Google Scholar 

  3. M. G. Berouti, D. G. Childers, and A. Paige, in: Proceedings of the IEEE International Conference on Acoustics, Speech, and Signal Processing 1, Hartford, CT, pp. 33-36 (1977).

    Google Scholar 

  4. D. Y. Wong, J. D. Markel, and A. H. Gray, IEEE Trans. Acoust., Speech, Signal Process. 27, 350 (1979).

    Article  Google Scholar 

  5. J. R. Deller, Jr., IEEE Trans. Acoust., Speech, Signal Process. 29, 917 (1981).

    Article  Google Scholar 

  6. J. N. Larar, Y. A. Alsaka, and D. G. Childers, in: Proceedings of the IEEE International Conference on Acoustics, Speech, and Signal Processing 2, Tampa, FL, pp. 1089-1092 (1985).

    Google Scholar 

  7. A. K. Krishnamurthy, in: Proceedings of the IEEE International Conference on Acoustics, Speech, and Signal Processing 3, San Diego, CA, pp. 36.3.1-36.3.4 (1984).

    Google Scholar 

  8. D. E. Veeneman and S. L. BeMent, IEEE Trans. Acoust., Speech, Signal Process. 33, 369 (1985).

    Article  Google Scholar 

  9. A. K. Krishnamurthy and D. G. Childers, IEEE Trans. Acoust., Speech, Signal Process. 34, 730 (1985).

    Article  Google Scholar 

  10. Y. Miyoshi, K. Yamamoto, R. Mizoguchi, Y. Masuzo, and O. Kakusho, IEEE Trans. Acoust., Speech, Signal Process. 35, 1233 (1987).

    Article  Google Scholar 

  11. G. P. Pichaché, A Givens Rotation Algorithm for Single Channel Format Tracking and Glottal Waveform Deconvolution, M.S. Dissertation, Northeastern University, Boston, MA (1988).

    Google Scholar 

  12. L. V. R. Arruda and G. Favier, in: Proceedings of the 9th IFAC/IFORS Symposium on Identification and System Parameter Estimation 2, Budapest, Hungary, pp. 1027-1032 (1991).

    Google Scholar 

  13. J. R. Deller, Jr., M. Nayeri, and S. F. Odeh, Proc. IEEE 81, 813 (1993).

    Article  Google Scholar 

  14. J. R. Deller, Jr., M. Nayeri, and M. S. Liu, Int. J. Autom. Control Signal Process. 8, 43 (1994).

    Article  MathSciNet  MATH  Google Scholar 

  15. J. R. Deller, Jr. and T. C. Luk, Comput. Speech Lang. 3, 301 (1989).

    Article  Google Scholar 

  16. R. Lozano-Leal and R. Ortega, Automatica 23, 247 (1987).

    Article  MathSciNet  MATH  Google Scholar 

  17. S. M. Veres and J. P. Norton, Int. J. Control 50, 639 (1989).

    Article  MathSciNet  MATH  Google Scholar 

  18. L. Ljung and T. Söderström, Theory and Practice of Recursive Identification, MIT Press, Cambridge, MA (1983).

    MATH  Google Scholar 

  19. G. H. Golub and C. F. van Loan, Matrix Computations, 2nd Ed., Johns-Hopkins Univ. Press, Baltimore, MD (1989).

    MATH  Google Scholar 

  20. W. M. Gentleman and H. T. Kung, in: Proceedings of the Society of Photoptical Instrumentation Engineers: Real Time Signal Processing IV, San Diego, CA, pp. 19-26 (1981).

    Google Scholar 

  21. J. G. McWhirter, in: Proceedings of the Society of Photoptical Instrumentation Engineers: Real Time Signal Processing IV, San Diego, CA, pp. 105-112 (1983).

    Google Scholar 

  22. J. R. Deller, Jr. and D. Hsu, IEEE Trans. Circuits Systems 34, 782 (1987).

    Article  Google Scholar 

  23. A. K. Rao and Y. F. Huang, IEEE Trans. Signal Process. 41, 1140 (1993).

    Article  MATH  Google Scholar 

  24. Y. F. Huang and J. R. Deller, Jr., in: Proceedings of the 39th Annual Allerton Conference on Communications, Control, and Computing, Monticello, IL, pp. 50-59 (1992).

    Google Scholar 

  25. S. F. Odeh, Algorithms and Architectures for Adaptive Set Membership-based Signal Processing, Ph.D. Dissertation, Michigan State University, East Lansing, MI (1990).

    Google Scholar 

  26. J. P. Norton and S. H. Mo, Math. Comput. Simul. 32, 527 (1990).

    Article  MathSciNet  Google Scholar 

  27. S. F. Odeh and J. R. Deller, Jr., in: Proceedings of the IEEE International Conference on Acoustics, Speech, and Signal Processing 5, Albuquerque, NM, pp. 2419-2422 (1990).

    Google Scholar 

  28. J. R. Deller, Jr. and S. F. Odeh, Proceedings of the 9th IFAC/IFORS Symposium on Identification and System Parameter Estimation 2, Budapest, Hungary, pp. 1044-1049 (1991).

    Google Scholar 

  29. J. R. Deller, Jr., IEEE Trans. Acoust., Speech, Signal Process. 37, 1432 (1989).

    Article  MathSciNet  MATH  Google Scholar 

  30. S. Dasgupta and Y. F. Huang, IEEE Trans. Inf. Theory 33, 383 (1987).

    Article  MATH  Google Scholar 

  31. J. R. Deller, Jr., and S. F. Odeh, in: IEEE International Conference on Acoustics, Speech, and Signal Processing 2, Glasgow, Scotland, pp. 1067-1070 (1989).

    Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 1996 Springer Science+Business Media New York

About this chapter

Cite this chapter

Deller, J.R. (1996). Applications of OBE Algorithms to Speech Processing. In: Milanese, M., Norton, J., Piet-Lahanier, H., Walter, É. (eds) Bounding Approaches to System Identification. Springer, Boston, MA. https://doi.org/10.1007/978-1-4757-9545-5_29

Download citation

  • DOI: https://doi.org/10.1007/978-1-4757-9545-5_29

  • Publisher Name: Springer, Boston, MA

  • Print ISBN: 978-1-4757-9547-9

  • Online ISBN: 978-1-4757-9545-5

  • eBook Packages: Springer Book Archive

Publish with us

Policies and ethics