Advertisement

The Speech Signal

  • Melvyn J. Hunt
Part of the The Kluwer International Series in Engineering and Computer Science book series (SECS, volume 155)

Abstract

This chapter provides a non-mathematical introduction to the speech signal. The production of speech is first described, including a survey of the categories into which speech sounds are grouped. This is followed by an account of some properties of human perception of sounds in general and of speech in particular. Speech is then compared with other signals. It is argued that it is more complex than artificial message bearing signals, and that unlike such signals speech contains no easily identified context-independent units that can be used in bottom-up decoding. Words and phonemes are examined, and phonemes are shown to have no simple manifestation in the acoustic signal. Speech communication is presented as an interactive process, in which the listener actively reconstructs the message from a combination of acoustic cues and prior knowledge, and the speaker takes the listener’s capacities into account in deciding how much acoustic information to provide. The final section compares speech and text, arguing that our cultural emphasis on written communication causes us to project properties of text onto speech and that there are large differences between the styles of language appropriate for the two modes of communication. These differences are often ignored, with unfortunate results.

Keywords

Vocal Cord Speech Signal Speech Perception Vocal Tract Speech Sound 
These keywords were added by machine and not by the authors. This process is experimental and the keywords may be updated as the learning algorithm improves.

Preview

Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.

References

  1. 1.
    Hunt M.J., “Studies of Glottal Excitation using Inverse Filtering and an Electroglottograph,” Proc. XI’th Intl. Congress of Phonetic Sciences, Tallinn, Estonia, August 1–7, 1987, Vol 3, pp. 23–26.Google Scholar
  2. 2.
    Markel J.D. & Gray A.H. Linear Prediction of Speech, Springer-Verlag, Berlin, 1976.zbMATHCrossRefGoogle Scholar
  3. 3.
    Klatt D.H., “Prediction of perceived phonetic distance from critical-band spectra: a first step” Proc. IEEE Int. Conf. Acoust., Speech, Signal Processing, Paris, May 1982, pp.1278-1281.Google Scholar
  4. 4.
    Bordn, G.J & Harris, K.S., Speech Science Primer (2nd ed.), Williams & Wilkins, Baltimore, 1984.Google Scholar
  5. 5.
    Liberman A.M, Cooper F.S, Harris K.S. & Macneilage P.F “A motor theory of speech perception,” Proc. Stockholm Speech Comm. Seminar, R.I.T., Stockholm, September 1962.Google Scholar
  6. 6.
    McGurk H. & MacDonald J. “Hearing lips and seeing voices,” Nature Vol. 264 #5588, pp.746–748, 1976.CrossRefGoogle Scholar
  7. 7.
    Leeberman, P. “Some effects of semantic and grammatical context on the production and perception of speech,” Language and Speech, Vol. 6, 1963, pp.172–187.Google Scholar
  8. 8.
    Hunnicutt, S. “Intelligibility versus redundancy — conditions of dependency,” Language and Speech, Vol. 28, 1985, pp.47–56.Google Scholar
  9. 9.
    Stubbs, M. Discourse Analysis: The Sociolinguistic Analysis of Natural Language, Chicago, University of Chicago Press, 1983.Google Scholar
  10. 10.
    Chapanis, A. “Interactive Human Communication,” Scientific American, Vol. 232, No. 3, March 1975, pp.36–49.CrossRefGoogle Scholar
  11. 11.
    Martin, H. & Pelletier, C. Vocabulaire de la téléphonie, Quebec City, Government of Quebec, June 1984, p.15.Google Scholar

Copyright information

© Springer Science+Business Media New York 1992

Authors and Affiliations

  • Melvyn J. Hunt
    • 1
  1. 1.Marconi Speech & Information SystemsPortsmouth, HantsEngland

Personalised recommendations