Speech Coding

Gersho, Allen

doi:10.1007/978-1-4757-2148-5_3

Part of the book series: The Kluwer International Series in Engineering and Computer Science (SECS, volume 155)

Abstract

Speech is the communication mechanism that distinguishes humans from lower animal forms and is an essential part of what allows man to function in civilization: our sophisticated ability to use language and communicate directly with one another via an acoustic channel. With the invention of the telephone by A. G. Bell, a major advance in human communication took place. Now we can communicate “in real-time” (not by writing letters or sending telegrams) with one another while geographically separated, perhaps around the world or in an aircraft or space vehicle. Of course, the telephone was until recently based on analog communication: a simple modulation of an electric current in proportion to the instantaneous intensity of an acoustic signal. In recent decades, digital communications emerged as a revolutionary new technology for the transportation of information and allowed us to develop new digital highways and superhighways carrying a variety of traffic such as data, video, and multiple channels of voice with greater reliability, cost effectiveness, privacy, and security. Advances in error control and modulation techniques, including spread-spectrum and trellis-coded modulation, allow reliable digital communication over radio channels that often suffer from interference, fading, and other degradations.

This work was supported in part by the National Science Foundation under grant NCR 8914741 and by Bell Communications Research, Inc., Bell-Northern Research, Inc., Rockwell International Corp., and the State of California MICRO program.

Author information

Authors and Affiliations

Center for Information Processing Research, Dept. of Electrical & Computer Engineering, University of California, Santa Barbara, CA, 93106, USA
Allen Gersho

Editor information

Editors and Affiliations

Marmara Research Centre, Gebze-Kocaeli, Turkey
A. Nejat Ince

About this chapter

Cite this chapter

Gersho, A. (1992). Speech Coding. In: Ince, A.N. (eds) Digital Speech Processing. The Kluwer International Series in Engineering and Computer Science, vol 155. Springer, Boston, MA. https://doi.org/10.1007/978-1-4757-2148-5_3

DOI: https://doi.org/10.1007/978-1-4757-2148-5_3
Publisher Name: Springer, Boston, MA
Print ISBN: 978-1-4419-5128-1
Online ISBN: 978-1-4757-2148-5
eBook Packages: Springer Book Archive
