Advertisement

Linear Prediction Techniques in Speech Coding

  • A. Gersho
Part of the International Centre for Mechanical Sciences book series (CISM, volume 324)

Abstract

Recent advances in speech coding algorithms and techniques based on the use of linear prediction now permit high quality voice reproduction at remarkably low bit rates. This paper reviews some of the main ideas underlying the algorithms of major interest today. The concept of removing redundancy by linear prediction is reviewed, first in the context of predictive quantization or DPCM. Then linear predictive coding, adaptive predictive coding, and vector quantization are discussed. The concepts of excitation coding via analysis-by-synthesis linear predictive coding is explained and some important enhancements such as vector sum excitations, and adaptive postfiltering are described. Low-delay coding by backward computation of LPC parameters is explained. The concept of phonetic segmentation of speech for closed-loop coding systems is also presented. Linear prediction is the key technique that underlies almost all of the important algorithms for speech coding of interest today. Finally, we discuss some recent work on nonlinear prediction of speech and its potential for the future of speech coding.

Keywords

Vector Quantization Linear Prediction Speech Code Code Vector Pitch Period 
These keywords were added by machine and not by the authors. This process is experimental and the keywords may be updated as the learning algorithm improves.

Preview

Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.

References

  1. [1]
    J. D. Markel and A. H. Gray, Jr., Linear Prediction of Speech, Springer-Verlag, New York, NY, 1976.CrossRefMATHGoogle Scholar
  2. [2]
    A. Buzo, A. H. Gray, R. M. Gray, and J. D. Markel, “Speech Coding Based upon Vector Quantization,” IEEE Trans. Acoust., Speech, and Signal Processing, vol. ASSP-28, no. 5, pp. 562–574, October 1980.MathSciNetCrossRefMATHGoogle Scholar
  3. [3]
    V. Cuperman and A. Gersho, “Vector Predictive Coding of Speech at 16 kbits/s,” IEEE Transactions on Communications, vol. COM-33, pp. 685–696, July 1985.CrossRefGoogle Scholar
  4. [4]
    J. H. Chen and A. Gersho, “Vector Adaptive Predictive Coding of Speech at 9.6 kb/s,” Proc. IEEE Inter. Conference on Acoust., Speech, and Signal Processing, pp. 1693–1696, Tokyo, Japan, April 1986.Google Scholar
  5. [5]
    I. A. Gerson, M. A. Jasiuk, “Vector Sum Excited Linear Prediction,” IEEE Workshop on Speech Coding for Telecommunications, Vancouver, September 1989.Google Scholar
  6. [6]
    G. Davidson, A. Gersho, “Speech Waveforms,” Proc. Inter. Conf. Acoust., Speech, & Signal Processing, pp. 163–166, April 1988.Google Scholar
  7. [7]
    S. Singhal and B. S. Atal, “Improving Performance of Multi-Pulse LPC Coders at Low Rates,” Proc. IEEE Inter. Conf Acoustics, Speech, and Signal Processing, vol. 1, pp. 1.3.1–1.3.4, San Diego, March 1984.Google Scholar
  8. [8]
    R. C. Ross and T. P. Barnwell, “The Self-Excited Vocoder,” Proceedings of IEEE International Conference on Acoustics, Speech, and Signal Processing, vol. 1, pp. 453–456, Japan, April, 1986.Google Scholar
  9. [9]
    P. Kabal, J.L. Moncet, and C.C. Chu, “Synthesis Filter Optimization and Coding: Applications to CELP,” Proc. IEEE Inter. Conf. Acoust., Speech, and Signal Processing, vol. 1, pp. 147–150, New York City, April 1988.Google Scholar
  10. [10]
    W. B. Kleijn, D. J. Krasinski, R. H. Ketchum, and Improved Speech Quality and Efficient Vector Quantization in SELP, Proceedings of IEEE International Conference on Acoustics, Speech, and Signal Processing, vol. 1, pp. 155–158, New York, April, 1988.Google Scholar
  11. [11]
    V. Ramamoorthy, N.S. Jayant, “Enhancement of ADPCM Speech by Adaptive Postfiltering,” Conf. Rec, IEEE Conf. on Commun., pp. 917–920, June 1985.Google Scholar
  12. [12]
    Y. Yatsuzuka, S. Iizuka, T. Yamazaki, “A variable Rate Coding by APC with Maximum Likelihood Quantization from 4.8 bit/s to 16 kbit/s,” Proc. Inter. Conf Acoust., Speech, & Signal Processing, pp. 3071–3074, April 1986.Google Scholar
  13. [13]
    J. H. Chen and A. Gersho, “Real-Time Vector APC Speech Coding at 4800 bps with Adaptive Postfiltering,” Proc. Int. Conf. on Acoust., Speech, Signal Processing Speech, and Signal Processing, vol. 4, pp. 2185–2188, Dallas, April 1987.Google Scholar
  14. [14]
    J.P. Campbell, Jr., V.C. Welch, T.E. Tremain, “An Expandable Error-Protected 4800 BPS CELP Coder (U.S. Federal Standard 4800 BPS Voice Coder),” Proc. Inter. Conf Acoust., Speech, & Signal Processing, pp. 735–738, May 1989.Google Scholar
  15. [15]
    V. Cuperman, A. Gersho, R. Pettigrew, J. Shynk, J. Yao and J. H. Chen, “Backward Adaptive Configurations for Low-Delay Speech Coding,” Proc, IEEE Global Commun. Conf, November 1989.Google Scholar
  16. [16]
    J. H. Chen, “A Robust Low-Delay CELP Speech Coder at 16 kb/s,” Proc, IEEE Global Commun. Conf, November 1989.Google Scholar
  17. [17]
    Shihua Wang and Allen Gersho, “Phonetically-Based Vector Excitation Coding of Speech at 3.6 kbit/s,” Proc. IEEE Inter. Conf. Acousi., Speech, and Signal Processing, Glasgow, May 1989.Google Scholar
  18. [18]
    Shihua Wang and Allen Gersho, “Phonetic Segmentation for Low Rate Speech Coding,” Advances in Speech Coding, Kluwer Academic Publishers, to appear 1990.Google Scholar
  19. [19]
    A. Gersho, “Optimal Nonlinear Interpolative Vector Quantization,” IEEE Trans. on Comm., vol. COM-38, No. 9, pp. 1285–1287, September 1990.CrossRefGoogle Scholar

Copyright information

© Springer-Verlag Wien 1991

Authors and Affiliations

  • A. Gersho
    • 1
  1. 1.University of CaliforniaSanta BarbaraUSA

Personalised recommendations