Abstract
Speech is the communication mechanism that distinguishes humans from lower animal forms. Our sophisticated ability to use language and communicate directly with one another via an acoustic channel is an essential part of what allows man to function in civilization. With the invention of the telephone by A. G. Bell, a major advance in human communication took place. Now we can communicate “in real-time” (not by writing letters or sending telegrams) with one another while geographically separated, perhaps around the world or in an aircraft or space vehicle. Of course, the telephone was until recently based on analog communication: a simple modulation of an electric current in proportion to the instantaneous intensity of an acoustic signal. In recent decades, digital communications emerged as a revolutionary new technology for the transportation of information and allowed us to develop new digital highways and superhighways carrying a variety of traffic such as data, video, and multiple channels of voice with greater reliability, cost effectiveness, privacy, and security. Advances in error control and modulation techniques, including spread-spectrum and trellis-coded modulation, allow reliable digital communication over radio channels that often suffer from interference, fading, and other degradations.
This work was supported in part by the National Science Foundation under grant NCR 8914741 and by Bell Communications Research, Inc., Bell-Northern Research, Inc., Rockwell International Corp., and the State of California MICRO program.
Copyright information
© 1992 Springer Science+Business Media New York
About this chapter
Cite this chapter
Gersho, A. (1992). Speech Coding. In: Ince, A.N. (eds) Digital Speech Processing. The Kluwer International Series in Engineering and Computer Science, vol 155. Springer, Boston, MA. https://doi.org/10.1007/978-1-4757-2148-5_3
DOI: https://doi.org/10.1007/978-1-4757-2148-5_3
Publisher Name: Springer, Boston, MA
Print ISBN: 978-1-4419-5128-1
Online ISBN: 978-1-4757-2148-5
eBook Packages: Springer Book Archive