How is speech processed in a cell phone conversation?

  • T. Dutoit
  • N. Moreau
  • P. Kroon

Abstract

Although most people see the cell phone as an extension of conventional wired phone service, or POTS (plain old telephone service), cell phone technology is in fact extremely complex. Few people realize that these small devices perform hundreds of millions of operations per second to maintain a phone conversation. A closer look at the module that converts the electrical version of the speech signal into a sequence of bits shows that, for every 20 ms of input speech, a set of speech model parameters is computed and transmitted to the receiver. The receiver converts these parameters back into speech. In this chapter, we will see how linear predictive (LP) analysis-synthesis lies at the very heart of mobile phone transmission of speech. We start with an introduction to linear predictive speech modeling and follow with a MATLAB-based proof of concept.
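The frame-by-frame idea described above can be illustrated with a minimal sketch. It is not the chapter's MATLAB proof of concept; it is a standard-library Python illustration under several assumptions that do not come from the abstract: an 8 kHz sampling rate (so a 20 ms frame holds 160 samples), a prediction order of 10, a Hamming analysis window, the autocorrelation method with the Levinson-Durbin recursion, and a synthetic two-tone test frame in place of real speech. All function names are hypothetical. The sketch passes the full residual to the synthesis filter, so reconstruction is exact; a practical coder would instead transmit a compact description of the excitation.

```python
import math
import random

FS = 8000        # assumed sampling rate (Hz)
FRAME_MS = 20    # frame length taken from the abstract
LP_ORDER = 10    # assumed prediction order


def autocorrelation(x, max_lag):
    """Return the autocorrelation values r[0..max_lag] of x."""
    n = len(x)
    return [sum(x[i] * x[i + lag] for i in range(n - lag))
            for lag in range(max_lag + 1)]


def levinson_durbin(r, order):
    """Solve the LP normal equations.

    Returns (a, err) where A(z) = 1 + sum_k a[k] z^-k is the inverse
    filter and err is the final prediction error energy.
    """
    a = [1.0] + [0.0] * order
    err = r[0]
    for i in range(1, order + 1):
        acc = r[i] + sum(a[j] * r[i - j] for j in range(1, i))
        k = -acc / err                      # reflection coefficient
        prev = a[:]
        for j in range(1, i):
            a[j] = prev[j] + k * prev[i - j]
        a[i] = k
        err *= 1.0 - k * k
    return a, err


def inverse_filter(x, a):
    """Analysis: residual e[n] = sum_k a[k] x[n-k], zero initial state."""
    return [sum(a[k] * x[n - k] for k in range(len(a)) if n - k >= 0)
            for n in range(len(x))]


def synthesis_filter(e, a):
    """Synthesis: y[n] = e[n] - sum_{k>=1} a[k] y[n-k], zero initial state."""
    y = []
    for n, sample in enumerate(e):
        past = sum(a[k] * y[n - k] for k in range(1, len(a)) if n - k >= 0)
        y.append(sample - past)
    return y


def lp_roundtrip_demo():
    """Analyze one synthetic 20 ms frame and resynthesize it."""
    random.seed(0)
    n = FS * FRAME_MS // 1000               # 160 samples per frame
    # Synthetic stand-in for speech: two tones plus a little noise.
    frame = [math.sin(2 * math.pi * 500 * i / FS)
             + 0.5 * math.sin(2 * math.pi * 1500 * i / FS)
             + 0.05 * random.gauss(0.0, 1.0)
             for i in range(n)]
    window = [0.54 - 0.46 * math.cos(2 * math.pi * i / (n - 1))
              for i in range(n)]
    windowed = [w * s for w, s in zip(window, frame)]

    r = autocorrelation(windowed, LP_ORDER)
    a, err = levinson_durbin(r, LP_ORDER)   # "model parameters" of the frame

    residual = inverse_filter(frame, a)     # sender side
    rebuilt = synthesis_filter(residual, a)  # receiver side
    max_diff = max(abs(u - v) for u, v in zip(frame, rebuilt))
    return a, err, r[0], max_diff


if __name__ == "__main__":
    a, err, r0, max_diff = lp_roundtrip_demo()
    print("LP coefficients:", [round(c, 3) for c in a])
    print("max reconstruction error:", max_diff)
```

In this sketch the ten coefficients play the role of the per-frame model parameters, and the inverse and synthesis filters are exact inverses of each other, which is why feeding the unmodified residual back in recovers the frame up to rounding error.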

Keywords

  • Vocal Tract
  • Spectral Envelope
  • Synthetic Speech
  • Pitch Period
  • Inverse Filter
These keywords were added by machine and not by the authors. This process is experimental and the keywords may be updated as the learning algorithm improves.

Copyright information

© Springer Science+Business Media New York 2009

Authors and Affiliations

  • T. Dutoit (1)
  • N. Moreau (2)
  • P. Kroon (3)

  1. Faculté Polytechnique de Mons, Belgium
  2. Ecole Nationale Supérieure des Télécommunications, Paris, France
  3. LSI, Allentown, USA
