Abstract
One of the most intractable problems in the description of utterance prosody is that of understanding speech timing control. Gösta Bruce’s seminal monograph on Swedish word accents in sentence context (Bruce 1977) highlighted an aspect of this problem that is crucially important for the perception of intonational contrasts in many languages — that of modeling what speakers do when they utter a given tune for a sentence and coordinate it with the string of words that constitute its text. By decomposing the tune into independent contributions from word accent, focal phrase accent, and final boundary events, Bruce was able to show that a robust correlate of the contrast between Accent I and Accent II involves a difference in timing, relative to rhythmically critical events in the text, of tonal events that are common to both word accent types. While much work remained to be done (cf. Bruce 1990 and the literature reviewed there), Bruce’s careful attention to prosodic structure, both at the level of lexical contrasts and at the level of focal prominence contrasts for the phrase or utterance as a whole, allowed him to build a phonological model of Swedish intonation contours that could generate precise, testable quantitative predictions. This work was a foundation stone for Pierrehumbert’s later phonological model for English intonation (Pierrehumbert 1980; Liberman and Pierrehumbert 1984; Beckman and Pierrehumbert 1986), a model which produced a synthesis system capable of generating by rule the full range of grammatical intonations. for English (Anderson et al. 1984). Summarizing such developments in intonation synthesis over the last fifteen years, we can say that Bruce’s work on the Swedish model has inspired, directly or indirectly, implementable phonological models of intonation systems for many languages (see reviews in Ladd 1992, 1996; Pierrehumbert, This volume). The English model, in particular, underscores the importance of accurate tonal timing, since there are pairs of accent types that are like the Swedish word accents in contrasting just in tune-text alignment (Pierrehumbert and Steele 1989), and at the same time, there are differences in timing for the same accent type when it is coordinated with rhythmic events at different structural positions in the text (Silverman and Pierrehumbert 1990).
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Preview
Unable to display preview. Download preview PDF.
References
Abbs, J.H. and Gilbert, B.N. 1973. A strain gauge transducer system for lip and jaw motion in two dimensions. Journal of Speech and Hearing Research 16, 248–256.
Abel, S.M. 1972. Duration discrimination of noise bursts. Journal of the Acoustical Society of America 51, 1219–1223.
Anderson, M.D., Pierrehumbert, J.B. and Liberman, M.Y. 1984. Synthesis by rule of English intonation patterns. Proc. 1984 IEEE International Conference on Acoustics, Speech and Signal Processing, vol. 1, 2.8.1–2. 8. 4.
Bannert, R. 1982. An FO-dependent model for segment duration? Reports from Uppsala University, Department of Linguistics 8, 59–80.
Beckman, M.E. 1986. Stress and Non-Stress Accent. Dordrecht: Foris.
Beckman, M.E. 1996. When is a syllable not a syllable? In T. Otake and A. Cutler (eds), Phonological Structure and Language Processing: Cross-Linguistic Studies. Berlin: Mouton de Gruyter, 95–123.
Beckman, M.E. and Edwards, J. 1994. Articulatory evidence for differentiating stress categories. In P.A. Keating (ed) Phonological Structure and Phonetic Form: Papers in Laboratory Phonology III. Cambridge, UK: Cambridge University Press, 7–33.
Beckman, M., Edwards, J. and Fletcher, J. 1992. Prosodic structure and tempo in a sonority model of articulatory dynamics. In Docherty and Ladd (eds), 68–86.
Beckman, M.E. and Hirschberg, J. 1994. The ToBI annotation conventions. Ms. Ohio State University. [Obtain by writing to tobi@ling.ohio-state.edu.]
Beckman, M.E. and Pierrehumbert, J.B. 1986. Intonational structure in Japanese and English. Phonology Yearbook 3, 255–309.
Bolinger, D. 1958. A theory of pitch accent in English. Word 14, 109–149.
Browman, C.P. and Goldstein, L. 1990. Tiers in articulatory phonology, with some implications for casual speech. In J. Kingston and M.E. Beckman (eds), 341–346.
Bruce, G. 1977. Swedish Word Accents in Sentence Perspective. Lund: Gleerup.
Bruce, G. 1983. Accentuation and timing in Swedish. Folia Linguistica 17, 221–238.
Bruce, G. 1990. Alignment and composition of tonal accents: comments on Silverman and Pierrehumbert’s paper. In Kingston and Beckman (eds), 107–114.
Carlson, R., Granström, B. and Klatt, D.H. 1979. Some notes on the perception of temporal patterns in speech. Proc. IXth International Congress of Phonetic Sciences, Vol. II, 260–267.
Coker, C.H., Umeda, N. and Browman, C. 1973. Automatic synthesis from text. IEEE Transactions on Audio Electroacoustics AU-21, 293–297.
Creelman, C.D. 1962. Human discrimination of auditory duration. Journal of the Acoustical Society of America 34, 582–593.
de Jong, K. 1995a. The supraglottal articulation of prominence in English: linguistic stress as localized hyperarticulation. Journal of the Acoustical Society of America 97, 491–504.
de Jong, K. 1995b. On status of redundant features: the case of backing and rounding in American English. In B. Connell and A. Arvaniti (eds). Phonology and Phonetic Evidence: Papers in Laboratory Phonology IV. Cambridge, UK: Cambridge University Press, 61–86.
de Jong, K., Beckman, M.E. and Edwards, J. 1993. The interplay between prosodic structure and coarticulation. Language and Speech 36, 197–212.
Denes, P. 1955. Effect of duration on the perception of voicing. Journal of the Acoustical Society of America 27, 761–764.
Docherty, G.J. and Ladd, D.R. (eds) 1992. Papers in Laboratory Phonology II: Gesture, Segment, Prosody. Cambridge: Cambridge University Press.
Fry, D.B. 1955. Duration and intensity as physical correlates of linguistic stress. Journal of the Acoustical Society of America 27, 765–768.
Fry, D.B. 1958. Experiments in the perception of stress. Language and Speech 1, 126–152.
Gussenhoven, C. and Rietveld, A.C.M. 1992. Intonation contours, prosodic structure and preboundary lengthening. Journal of Phonetics 20, 283–303.
Harrington, J., Cassidy, S., Fletcher, J. and McVeigh, A. 1993. The mu+ system for corpus based speech research. Computer Speech and Language 7, 305–331.
Harrington, J., Fletcher, J. and Beckman, M.E. 1996. Manner and place conflicts in the articulation of accent in Australian English. Paper presented at the 5th Conference in Laboratory Phonology, Northwestern University, 6–8 July 1996.
Harrington, J., Fletcher, J. and Roberts, C. 1995. Coarticulation and the accented unaccented distinction: evidence from jaw movement data. Journal of Phonetics 23, 305–322.
Harris, K. 1978. Vowel duration change and its underlying physiological mechanisms. Language and Speech 21, 354–361.
House, A.S. and Fairbanks, G. 1953. The influence of consonantal environment upon the secondary acoustical characteristics of vowels. Journal of the Acoustical Society of America 25, 105–113.
Huss, V. 1978. English word stress in post-nuclear position. Phonetica 35, 86–105.
Kelso, J.A.S., Vatikiotis-Bateson, E., Saltzman, E.L. and Kay, B. 1985. A qualitative dynamic analysis of reiterant speech production: phase portraits, kinematics, and dynamic modeling. Journal of the Acoustical Society of America 77, 266–280.
Kent, R. and Netsell, R. 1971. Effects of stress contrasts on certain articulatory parameters. Phonetica 24, 23–44.
Kingston, J. and Beckman, M.E. (eds). 1990. Papers in Laboratory Phonology I: Between the Grammar and the Physics of Speech. Cambridge: Cambridge University Press.
Klatt, D.H. 1976. Linguistic uses of syllable duration in English: Acoustic and perceptual evidence. Journal of the Acoustical Society of America 59, 1208–1221.
Klatt, D.H. 1979. Synthesis by rule of segmental durations in English sentences. In B. Lindblom and S. Öhman (eds) Frontiers of Human Communication Research. New York: Academic Press, 287–299.
Kuehn, D.P. and Moll, K.L. 1976. A cineradiographic study of VC and CV articulatory velocities. Journal of Phonetics 4, 303–320.
Ladd, D.R. 1992. An introduction to intonational phonology. In Docherty and Ladd (eds), 321–334.
Ladd, D.R. 1996. Intonational Phonology. Cambridge: Cambridge University Press. Laeufer, C. 1992. Patterns of voicing-conditioned vowel duration in French and English. Journal of Phonetics 20, 411–440.
Lehiste, 1. 1972. The timing of utterances and linguistic boundaries. Journal of the Acoustical Society of America 51, 2018–2024.
Lehiste, I. 1975. The phonetic structure of paragraphs. In A. Cohen and S. Nooteboom (eds) Structure and Process in Speech Perception. Heidelberg: Springer, 195–203.
Levelt, W.J.M. 1992. Accessing words in speech production: stages, processes and representations. Cognition 42, 1–22.
Liberman, M. and Pierrehumbert, J. 1984. Intonational invariance under changes in pitch range and length. In M. Aronoff and R. Oerhle (eds) Language Sound Structure. Cambridge, MA: MIT Press, 157–233.
Lindblom, B. 1963. Spectrographic study of vowel reduction. Journal of the Acoustical Society of America 35, 1773–1781.
Lindblom, B. 1967. Vowel duration and a model of lip mandible coordination. Speech Transmission Laboratory Quarterly Progress and Status Report 4 1967, 1–29.
Lyberg, B. 1979. Final lengthening–partly a consequence of restrictions on the speed of fundamental frequency change? Journal of Phonetics 7, 187–196.
Nakatani, L.H. and Aston, C.H. 1978. Acoustic and linguistic factors in stress perception. Ms., Bell Laboratories.
Nakatani, L.H., O’Conner, K.D. and Aston, C.H. 1981. Prosodic aspects of American speech rhythm. Phonetica 38, 84–105.
Nittrouer, S., Munhall, K.G., Kelso, J.A.S., Tuller B. and Harris, K. 1988. Patterns of interarticulator phasing and their relation to linguistic structure. Journal of the Acoustical Society of America 84, 1653–1661.
Nooteboom, S.G. 1973. The perceptual reality of some prosodic durations. Journal of Phonetics 1, 25–45.
Öhman, S., Zetterlund, S., Nordstrand, L. and Engstrand, O. 1979. Predicting segment duration in terms of a gesture theory of speech production. Proc. IXth International Congress of Phonetic Sciences, vol. 2, 305–311.
Olive, J.P. 1977. Rule synthesis of English from diadic units. Proc. 1977 IEEEInternational Conference on Acoustics, Speech and Signal Processing, 568–570.
Ostry, D. and Munhall, K. 1985. Control of rate and duration of speech movements.Journal of the Acoustical Society of America 77, 640–648.
Pierrehumbert, J. 1980. The Phonetics and Phonology of English Intonation. Doctoral dissertation, Massachusetts Institute of Technology.
Pierrehumbert, J. This volume. Tonal elements and their alignment.
Pierrehumbert, J. and Steele, S. 1989. Categories of tonal alignment in English. Phonetica 46 (4), 181–196.
Port, R.F. 1981. Linguistic timing factors in combination. Journal of the Acoustical Society of America 69, 262–274.
Saltzman, E. 1986. Task dynamic coordination of the speech articulators: a preliminary model. In H. Heuer and C. Fromm (eds) Generation and Modulation of Action Patterns. New York: Springer-Verlag, 129–144.
Saltzman, E. and K. Munhall. 1989. A dynamical approach to gestural patterning in speech production. Ecological Psychology 1, 333–382.
Silverman, K.E.A. and Pierrehumbert, J.B. 1990. The timing of prenuclear high accents in English. In Kingston and Beckman (eds), 72–106.
Stevens, K.N. and House, A.S. 1963. Perturbation of vowel articulations by consonantal context: an acoustical study. Journal of Speech and Hearing Research 6, 111–128.
Summers, V. 1987. Effects of stress and final-consonant voicing on vowel production: articulatory and acoustic analyses. Journal of the Acoustical Society of America 82, 847–863.
Summers, V. 1988. F1 structure provides information for final-consonant voicing. Journal of the Acoustical Society of America 84, 485–492.
Vanderslice, R. and Ladefoged, P. 1972. Binary suprasegmental features and transformational word-accentual rules. Language 48, 819–839.
Wightman, C.W. and Ostendorf, M. 1994. Automatic labeling of prosodic patterns. IEEE Transactions on Speech and Audio Processing, 469–481.
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2000 Springer Science+Business Media Dordrecht
About this chapter
Cite this chapter
Beckman, M.E., Cohen, K.B. (2000). Modeling the Articulatory Dynamics of Two Levels of Stress Contrast. In: Horne, M. (eds) Prosody: Theory and Experiment. Text, Speech and Language Technology, vol 14. Springer, Dordrecht. https://doi.org/10.1007/978-94-015-9413-4_7
Download citation
DOI: https://doi.org/10.1007/978-94-015-9413-4_7
Publisher Name: Springer, Dordrecht
Print ISBN: 978-90-481-5562-0
Online ISBN: 978-94-015-9413-4
eBook Packages: Springer Book Archive