How a Full Account of Segmental Perception Depends on Prosody and Vice Versa

  • Quentin Summerfield
Part of the Communication and Cybernetics book series (COMMUNICATION, volume 11)

Abstract

The synthesis time-base of a synthetic precursor phrase is one factor determining the position of the phoneme boundary on a continuum of synthetic CV target syllables, which vary in the durational cue Voice Onset Times (VOT) and are introduced by the precursor. Reducing the time-base, thereby increasing the rate of speech in the precursor, increases the probability of the consonant in the target being perceived as voiceless. The results of such perceptual experiments are compared to those of a production study in which six adult male speakers of British English produced examples of CV syllables composed of the consonants /b,p,g,k/ and the vowels /i,a/ in a sentence frame. VOTs in productions of /p/ and /k/ were 20 milliseconds shorter at fast than at slow rates of speech, warranting a normalisation of VOT duration in perception like that actually obtained.

VOTs were longer in /ki/ than in /ka/. It has been suggested that voicing onset is retarded before high vowels because the oral constriction is reduced more slowly than before lower vowels, thus delaying the attainment of the transglottal pressure drop required for voicing. However, VOTs in /pi/ were shorter than those in /pa/. In the case of voiceless bilabials the aerodynamic factor appears to be outweighed by a mechanical influence of anticipatory co-articulation of tongue position which leads to larynx elevation and less vocal cord abduction before high vowels. The production result for velars, but not that for bilabials, is paralleled in perception where longer values of VOT are required for stops to be perceived as voiceless before /i/ compared to /a/ at all places of production. This failure of the perceptual process to follow exactly the constraints in production provides an illustration of a heuristic rather than an algorithmic perceptual strategy presumably designed to allow fast decisions while tolerating some loss of accuracy in exceptional cases.

The ‘precursor target’ paradigm used in these experiments could be extended to examine prosodie influences on other segmental distinctions, but also, to determine the perceptual substrates of prosodie variables such as “rate of speech” and to measure the precision of perceptual expectations for vowel and consonant durations in different sentential and syntactic environments.

Keywords

Adduct Acoustics Poss Verse Estima 

Preview

Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.

References

  1. KIM, C.W. (1970). A theory of aspiration. Phonetica, 21, 107–116.CrossRefGoogle Scholar
  2. KLATT, D.H. (1973). Voice onset time, frication and aspiration in word-initial consonant clusters. M.I.T. Research Laboratory of Electronics, Quarterly Progress Report No. 109, 124–136.Google Scholar
  3. KLATT, D.H. and COOPER, W.E. (1975). Perception of vowel duration in sentence contexts. Paper presented to the 89th meeting of the Acoustical Society of America, Austin, Texas.Google Scholar
  4. KOZHEVNIKOV, V.A. and CHISTOVICH, L.A. Speech Articulation and Perception. JPRS 30, 543. Washington: U.S. Department of Commerce, 1965.Google Scholar
  5. LACKNER, J.R. and LEVINE, B.K. (1975). Speech production: evidence for syntactically and phonologically determined units. Perception and Psycho-physics, 17, 107–113.CrossRefGoogle Scholar
  6. LINDBLOM, B. (1968). Temporal organisation of syllable production. Speech Transmission Laboratory Quarterly Progress Report, 2/3, 1–5. (Royal Institute of Technology, Stockholm.)Google Scholar
  7. LISKER, L. (1974). Is it VOT or a first formant transition detector? Paper presented to the autumn meeting of the American Association of Phonetic Sciences.Google Scholar
  8. LISKER, L. and ABRAMSON, A.S. (1967). Some effects of context on Voice Onset Time in English stops. Language and Speech, 10, 1–28.Google Scholar
  9. LISKER, L. and ABRAMSON, A.S. (1970). The voicing dimension: some experiments in comparative phonetics. Proceedings of the 6th International Congress of Phonetic Sciences. Prague, 1967. Prague: Academia.Google Scholar
  10. OLLER, D.K. (1973). The effect of position in utterance of speech segment duration in English. Journal of the Acoustical Society of America, 54, 1217–1247.CrossRefGoogle Scholar
  11. PETERSON, G.E. and BARNEY, H.L. (1952). Control methods used in a study of the vowels. Journal of the Acoustical Society of America, 24, 175–184.CrossRefADSGoogle Scholar
  12. SUMMERFIELD, A.Q. (1971). Some tests of an information-flow model of speech perception. Unpublished Bachelor’s Thesis, University of Cambridge.Google Scholar
  13. SUMMERFIELD, A.Q. (1974). Towards a detailed model for the perception of voicing constrasts Speech Perception No. 3, 1–26. (Progress Report, Department of Psychology, The Queen’s University of Belfast.)Google Scholar
  14. SUMMERFIELD, A.Q. (1975a). Cues, contexts and complications in the perception of voicing contrasts. Speech Perception No. 4, (in press). (Progress Report, Department of Psychology, The Queen’s University of Belfast.)Google Scholar
  15. SUMMERFIELD, A.Q. (1975b).Aerodynamics versus mechanics in the control of voicing onset in consonant-vowel syllables. Speech Perception No. 4, (in press). (Progress Report, Department of Psychology, The Queen’s University of Belfast.)Google Scholar
  16. SUMMERFIELD, A.Q. and HAGGARD, M.P. (1974). Perceptual processing of multiple cues,and contexts: effects of following vowel on stop consonant voicing. Journal of Phonetics, 2, 279–295.Google Scholar
  17. TAYLOR, M.M. and CREELMAN, D.D. (1967). PEST: Efficient estimates on probability functions. Journal of the Acoustical Society of America, 41, 782–787.CrossRefADSGoogle Scholar

Copyright information

© Springer-Verlag Berlin Heidelberg 1975

Authors and Affiliations

  • Quentin Summerfield

There are no affiliations available

Personalised recommendations