Skip to main content

Speech Coding

  • Chapter

Part of the book series: The Kluwer International Series in Engineering and Computer Science ((SECS,volume 155))

Abstract

Speech is the communication mechanism that distinguishes humans from lower animal forms and is an essential part of what allows man to function in civilization — our sophisticated ability to use language and communicate directly with one another via an acoustic channel. With the invention of the telephone by A. G. Bell, a major advance in human communication took place. Now we can communicate “in real-time” (not by writing letters or sending telegrams) with one another while geographically separated, perhaps around the world or in an aircraft or space vehicle. Of course the telephone was until recently based on analog communication: a simple modulation of an electric current in proportion to the instantaneous intensity of an acoustic signal. In recent decades digital communications emerged as a revolutionary new technology for the transportation of information and allowed us to develop new digital highways and superhighways carrying a variety of traffic such as data, video, and multiple channels of voice with greater reliability, cost effectiveness, privacy and security. Advances in error control and modulation techniques, including spread-spectrum and trellis-coded modulation allow reliable digital communication over radio channels that often suffer from interference, fading, and other degradations.

This work was supported in part by the National Science Foundation under grant NCR 8914741 and by Bell Communications Research, Inc., Bell-Northern Research, Inc., Rockwell International Corp., and the State of California MICRO program.

This is a preview of subscription content, log in via an institution.

Buying options

Chapter
USD   29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD   129.00
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD   169.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info
Hardcover Book
USD   169.99
Price excludes VAT (USA)
  • Durable hardcover edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Learn about institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.

References

  1. A. Buzo, A. H. Gray, R. M. Gray, and J. D. Markel, “Speech Coding Based upon Vector Quantization,” IEEE Trans. Acoust., Speech, and Signal Processing, vol. ASSP-28, no. 5, pp. 562–574, October 1980.

    Article  MathSciNet  Google Scholar 

  2. V. Cuperman and A. Gersho, “Vector Predictive Coding of Speech at 16 kbits/s,” IEEE Transactions on Communications, vol. COM-33, pp. 685–696, July 1985.

    Article  Google Scholar 

  3. A. Gersho, S. Wang, and K. Zeger, Vector Quantization Techniques in Speech Coding, Marcel Dekker, 1991.

    Google Scholar 

  4. A. Gersho and R. M. Gray, Vector Quantization and Signal Compression, Kluwer Academic Publishers, Norwell, Massachusetts, 1991.

    Google Scholar 

  5. J. H. Chen and A. Gersho, “Vector Adaptive Predictive Coding of Speech at 9.6 kb/s,” Proc. IEEE Inter. Conference on Acoust., Speech, and Signal Processing, pp. 1693-1696, Tokyo, Japan, April 1986.

    Google Scholar 

  6. I. A. Gerson, M. A. Jasiuk, “Vector Sum Excited Linear Prediction,” IEEE Workshop on Speech Coding for Telecommunications, Vancouver, September 1989.

    Google Scholar 

  7. G. Davidson, A. Gersho, “Speech Waveforms,” Proc. Inter. Conf. Acoust., Speech, & Signal Processing, pp. 163-166, April 1988.

    Google Scholar 

  8. M. Johnson and T. Taniguchi, “Pitch-Orthogonal Code-Excited LPC,” Proc. IEEE Global Communications Conference, pp. 542-546, Dec. 1990.

    Google Scholar 

  9. S. Singhal and B. S. Atal, “Improving Performance of Multi-Pulse LPC Coders at Low Rates,” Proc. IEEE Inter. Conf. Acoustics, Speech, and Signal Processing, vol. 1, pp. 1.3.1–1.3.4, San Diego, March 1984.

    Google Scholar 

  10. R. C. Ross and T. P. Barnwell, “The Self-Excited Vocoder,” Proceedings of IEEE International Conference on Acoustics, Speech, and Signal Processing, vol. 1, pp. 453–456, Japan, April, 1986.

    Google Scholar 

  11. P. Kabal, J.L. Moncet, and C.C. Chu, “Synthesis Filter Optimization and Coding: Applications to CELP,” Proc. IEEE Inter. Conf. Acoust., Speech, and Signal Processing, vol. 1, pp. 147–150, New York City, April 1988.

    Google Scholar 

  12. W. B. Kleijn, D. J. Krasinski, R. H. Ketchum, and Improved Speech Quality and Efficient Vector Quantization in SELP, Proceedings of IEEE International Conference on Acoustics, Speech, and Signal Processing, vol. 1, pp. 155–158, New York, April, 1988.

    Google Scholar 

  13. P. Kroon and B. S. Atal, and T. Moriya, L. G. Neumeyer, W. P. LeBlanc, and S. A. Mahmoud, “A Low-Delay 8 kb/s Backward-Adaptive CELP Coder,” Proc. International Mobile Satellite Conference, Ottawa%P 684-689, vol. 2, pp. 15.16.1–15.16.4, Albuquerque, 1990.

    Google Scholar 

  14. V. Ramamoorthy, N.S. Jayant, “Enhancement of ADPCM Speech by Adaptive Postfiltering,” Conf. Rec., IEEE Conf. on Commun., pp. 917-920, June 1985.

    Google Scholar 

  15. Y. Yatsuzuka, S. Iizuka, T. Yamazaki, “A variable Rate Coding by APC with Maximum Likelihood Quantization from 4.8 bit/s to 16 kbit/s,” Proc. Inter. Conf. Acoust., Speech, & Signal Processing, pp. 3071-3074, April 1986.

    Google Scholar 

  16. J. H. Chen and A. Gersho, “Real-Time Vector APC Speech Coding at 4800 bps with Adaptive Postfiltering,” Proc. Int. Conf on Acoust., Speech, Signal Processing Speech, and Signal Processing, vol. 4, pp. 2185–2188, Dallas, April 1987.

    Google Scholar 

  17. J.P. Campbell, Jr., V.C. Welch, T.E. Tremain, “An Expandable Error-Protected 4800 BPS CELP Coder (U.S. Federal Standard 4800 BPS Voice Coder),” Proc. Inter. Conf. Acoust., Speech, & Signal Processing, pp. 735-738, May 1989.

    Google Scholar 

  18. J. H. Chen, “A Robust Low-Delay CELP Speech Coder at 16 kb/s,” Proc., IEEE Global Commun. Conf., November 1989.

    Google Scholar 

  19. V. Cuperman, A. Gersho, R. Pettigrew, J. Shynk, J. Yao and J. H. Chen, “Backward Adaptive Configurations for Low-Delay Speech Coding,” Proc., IEEE Global Commun. Conf., November 1989.

    Google Scholar 

  20. J.-H. Yao, J. J. Shynk, and A. Gersho, “Low Delay Vector Excitation Coding of Speech at 8 kbit/s,” Proc. IEEE Global Commun. Conf, submitted for publication, 1991. 1991.

    Google Scholar 

  21. Shihua Wang and Allen Gersho, “Phonetically-Based Vector Excitation Coding of Speech at 3.6 kbit/s,” Proc. IEEE Inter. Conf. Acoust., Speech, and Signal Processing, Glasgow, May 1989.

    Google Scholar 

  22. Shihua Wang and Allen Gersho, “Phonetic Segmentation for Low Rate Speech Coding,” Advances in Speech Coding, Kluwer Academic Publishers, 1991.

    Google Scholar 

  23. A. Gersho, “Optimal Nonlinear Interpolative Vector Quantization,” IEEE Trans, on Comm., vol. COM-38, No. 9, pp. 1285–1287, September 1990.

    Article  Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 1992 Springer Science+Business Media New York

About this chapter

Cite this chapter

Gersho, A. (1992). Speech Coding. In: Ince, A.N. (eds) Digital Speech Processing. The Kluwer International Series in Engineering and Computer Science, vol 155. Springer, Boston, MA. https://doi.org/10.1007/978-1-4757-2148-5_3

Download citation

  • DOI: https://doi.org/10.1007/978-1-4757-2148-5_3

  • Publisher Name: Springer, Boston, MA

  • Print ISBN: 978-1-4419-5128-1

  • Online ISBN: 978-1-4757-2148-5

  • eBook Packages: Springer Book Archive

Publish with us

Policies and ethics