Paying the meter: Effect of metrical similarity on word lengthening

Myers, Brett R.; Watson, Duane G.

doi:10.3758/s13423-019-01635-4

Paying the meter: Effect of metrical similarity on word lengthening

Brief Report
Published: 09 July 2019

Volume 26, pages 1941–1947, (2019)
Cite this article

Download PDF

Psychonomic Bulletin & Review Aims and scope Submit manuscript

Paying the meter: Effect of metrical similarity on word lengthening

Download PDF

1181 Accesses
3 Citations
1 Altmetric
Explore all metrics

Abstract

Language has a rhythmic structure, but little is known about the mechanisms that underlie how it is planned. Traditional models of language production assume that metrical and segmental planning occur independently and in parallel (Roelofs & Meyer Learning Memory and Cognition, 24(4), 922–939, 1998). We test this claim in two experiments. In Experiment 1, participants completed an event-description task in which a disyllabic target word shared segmental overlap with a prime that either had matching or nonmatching lexical stress. Participants lengthened words in trials with both segmental and metrical overlap, which could either be the result of metrical interference or having uttered a prime with similar segmental realizations. To adjudicate between these possibilities, Experiment 2 included segmentally distinct word pairs with either matching or nonmatching stress. Participants again showed lengthening in trials with both segmental and metrical overlap, but no lengthening from metrical overlap alone. These data suggest that the acoustic-phonetic similarity of the initial syllables of the prime and target creates competition that leads to word lengthening. These are consistent with production models in which segmental and metrical structures are tightly bound at the point of phonological encoding.

When overlap leads to competition: Effects of phonological encoding on word duration

Article 09 April 2015

The Effect of Phonological Encoding on Word Duration: Selection Takes Time

Not just a function of function words: Distal speech rate influences perception of prosodically weak syllables

Article 28 November 2018

Spoken English is composed of continuous chains of stressed and unstressed syllables that create a rhythmic pattern in phrases such as SterlingCooperDraperPryce or DunderMifflinPaperCompany. This rhythmic framework across syllables is called metrical structure or meter. In English, syllables may be stressed by accentuating pitch, intensity, and durational variations, and meter is thought to play a role in segmenting words in the speech signal (Pitt & Samuel, 1990). Meter also plays a role in distinguishing between words that are otherwise identical phonologically, such as DES-ert (a barren landscape) versus dess-ERT (a tasty treat). Despite these observations about meter, it is not clear how word stress is planned and encoded during speech production.

Current theories of lexical production assume that metrical structure is planned independently of segmental structure (Levelt, Roelofs, & Meyer, 1999; Schiller, Jansma, Peters, & Levelt, 2006). According to Levelt et al. (1999), word-form generation involves separately retrieving phonemes and stress patterns, and then associating the phonological segments with the metrical frame. Roelofs and Meyer (1998) conducted a series of implicit priming experiments to test the independence of phonological and metrical encoding within the framework of the Word-form Encoding by Activation and Verification (WEAVER) model (Roelofs, 1997). They tested this by teaching participants prime–target word pairs. During testing, the experimenters presented the subject with a prime and measured the onset latency to articulate the associated target word. Roelofs and Meyer (1998) found that pairs with both segmental and metrical overlap were produced with shorter reaction times, suggesting that overlap can facilitate lexical access by priming the target word. However, they did not find an effect when prime–target pairs shared only one feature—either segmental or metrical structure. Therefore, they concluded that processes associated with planning segmental and metrical structure are independent and run in parallel. They argue that priming one system is not facilitative because the other unprimed system acts as a bottleneck, preventing language production from proceeding until processing in all systems is complete.

Other models have also proposed that segmental and metrical structures are independent (e.g., Keating & Shattuck-Hufnagel, 2002). Evidence for this comes from analyses of speech errors in which speakers misplace sound segments, as in “well-boiled icicle” (well-oiled bicycle) or “Is the bean dizzy?” (Is the dean busy?). Misplaced segments generally maintain their position within a syllable; that is, an onset exchanges for an onset, nucleus for nucleus, and coda for coda (MacKay, 1970; Shattuck-Hufnagel, 1987), suggesting that there is a predetermined metrical outline that is independent of segments. In addition, when speakers produce segmental errors, the overall stress pattern of the utterance is typically preserved (Berg, 1990; Shattuck-Hufnagel, 1986).

In this paper, we investigate whether metrical and segmental representations are actually planned independently or whether they rely on the same representations. To answer this question, we use a paradigm that has been used to understand the processes that underlie phonological encoding. It has been argued that phonological encoding—the process of selecting and ordering the phonemes in a word—occurs serially in time such that phonemes are selected sequentially (e.g., O’Seaghdha & Marin, 2000; Sevald & Dell, 1994). Sevald and Dell (1994) found that word pairs with initial segmental overlap (e.g., pick–pin) are produced at a slower rate than word pairs with final segmental overlap (e.g., pick–tick). They argue that the production of initially overlapping phonemes [pɪ] activate lexical representations for both words, which then compete with one another. This interference leads to overall longer word durations because the system slows down over the course of articulating the entire word in order to accommodate phonological activation time. Lexical competition for overlapping offsets [ɪk] is lower because interference does not occur until the end of the word, which leads to less overall miscuing of the correct sound sequence. These findings suggest that phonological overlap between words may create competition in production planning, for which the system requires additional time for generating the appropriate word form (Watson, Buxó-Lugo, & Simmons, 2015).

This phonological-related lengthening effect has been found in numerous event description experiments (e.g., Buxó-Lugo, Jacobs, & Watson, 2018; Yiu & Watson, 2015). Yiu and Watson (2015) found that when a prime word overlaps phonologically with a target word, the target’s duration increases. In “The beetle shrinks. The beaker flashes,” the target—beaker—has a longer duration than usual due to overlap with the prime—beetle. Using a similar paradigm, Buxó-Lugo et al. (2018) found that these lengthening effects occur even when the prime is produced by another speaker, suggesting that auditory feedback mechanisms may play a role in ordering the sounds of a word (also see Guenther, 2014; Hickok, 2014; Jacobs, Yiu, Watson, & Dell, 2015).

Thus, phonological overlap interference offers a useful tool for investigating speech-planning mechanisms. In the context of the overlap interference paradigm, we can assume that if a target lengthens, the dimension of overlap with the prime is (a) encoded serially and (b) can lead to competition between lexical representations. In the current set of studies, we use this paradigm to examine the organization of metrical and segmental spell-out in two experiments. Our strategy was to first test whether metrical structure has the same effect on duration as segmental structure (Experiment 1). If so, we can then use this paradigm to explore whether segmental and metrical structures are planned independently during phonological encoding or whether they interact (Experiment 2).

In Experiment 1, we manipulated whether primes and targets overlapped metrically while keeping segmental overlap constant. Prime and target words either shared the same metrical structure or had differing metrical structures. If metrical planning engages the same types of encoding mechanisms as segmental planning, we should see similar overlap-driven lengthening effects, with longer productions of a target word when the prime shares the same metrical structure. In Experiment 2, we manipulated both segmental overlap and metrical overlap independently to directly test whether segmental and metrical planning have independent roles in phonological encoding or whether metrical and segmental information interact.

Experiment 1

Method

Participants

Sixty-nine healthy adults (age range: 18–27, M = 20.3 years, SD = 2.4, 51 female) participated in this study, a sample size that was similar to that of previous studies that have used this paradigm (Jacobs et al., 2015; Yiu & Watson, 2015). Participants were native speakers of English recruited from the Vanderbilt University Psychology Department subject pool, and they either received course credit or $10 for participating in the study. All participants provided written informed consent in accordance with the Vanderbilt University Institutional Review Board.

Materials

A set of 144 color images was selected from the Snodgrass and Vanderwart (1980) data set (Rossion & Pourtois, 2001) and clip art. A subset of 72 images served as the critical items, and the remaining 72 images were filler items. Critical items consisted of 18 targets and 54 primes. There were three conditions:

1.
Same meter: The candy shrinks. The candle flashes.
2.
Different meter: The canteen shrinks. The candle flashes.
3.
Control: The giraffe shrinks. The candle flashes.

In the same and different meter conditions, the prime–target pairs had segmental overlap for their initial segments, and the meter of the words either matched (1) or did not match (2). In the control condition (3), the prime–target pairs had no segmental overlap and had nonmatching meter.

A Latin square design yielded three counterbalanced lists of items, such that each participant was presented with 18 critical prime–target pairs. An equal number of trochees and iambs were used as critical targets. Each list had six critical pairs for each of the three conditions. In addition, participants were exposed to 38 noncritical pairs, drawn from the filler items, for a total of 56 trials in the experiment. Trials were randomized for each participant.

Audio recording

Participant responses were recorded via a head-mounted microphone at a sampling rate of 44100 Hz. Participants were instructed to speak directly into the microphone as they described the events on the computer screen.

Procedure

Participants completed the experiment on a Mac computer in MATLAB using the CogToolbox (Fraundorf et al., 2014) and Psychophysics Toolbox 3 (Kleiner, Brainard, & Pelli, 2007). Participants first completed a training task to learn the names of potentially difficult to name items (e.g., Buxó-Lugo et al., 2018). Items were displayed in the center of the screen with the intended label at the top of the screen, and participants recited the label aloud. They were encouraged to use these names during testing.

Following item training, participants received instructions for the experiment. For each trial, four images were displayed equidistant around the center of the screen (see Fig. 1). One image—the prime—would shrink, and participants described the action. Then another image—the target—would flash, and participants described the action. Events occurred in the same order for all trials (i.e., shrinking then flashing). The first three trials were conducted with the experimenter present, and the subject was allowed to ask questions if needed. Trials were randomized and separated into three blocks, allowing participants to take a break between blocks as needed.

Acoustic analysis

Speech recordings were analyzed in Praat (Boersma & Weenink, 2017), using manual segmentation to code the start and end times of target words. Three coders (including the first author) analyzed a subset of all trials in isolation using spectrographic and waveform information, and coders were blind to experimental condition of the trials. Target words were segmented such that they were not identifiable as anything other than the targets. Praat scripting was used to calculate the duration of each target word. Interrater reliability was assessed by comparing manual coding from a random subset of trials (~10%) between all coders and the first author, who was blinded to the original measurements and experimental condition. The intraclass correlation coefficient (ICC) was calculated using a one-way single-measures approach, and the average of these was ICC = 0.931, indicating excellent agreement between coders.

Results

Target-word durations across conditions were analyzed, and only target utterances that matched the intended label were considered in the analyses. Trials were excluded if participants mispronounced the prime or target, or if they used alternate names (e.g., boat for canoe, orchestra for quartet, cologne for perfume). A total of 97 out of 1,242 trials met these criteria and were removed. Scripts and the complete data set are available at https://osf.io/zk4qv/.

To examine the effects of condition on word duration, results were analyzed using a linear mixed-effects model, with condition as a fixed effect and random slopes and intercepts by item and by participant. Models were built using R package lme4 Version 1.1-10 (Bates, Maechler, Bolker, & Walker, 2015). Data were log transformed and centered. Helmert contrasts were used in model development such that the condition with segmental and metrical overlap was compared with the average of the segmental overlap and control conditions, and the segmental overlap condition was compared with the control condition. Significance was assumed for t values with an absolute value above 1.96 in a two-tailed test (Baayen, 2008).

We found that target items with segmental and metrical overlap were significantly longer than target items in the other conditions (β = 0.047, t = 3.729), and target items in the segmental overlap condition were significantly longer than target items with no overlap (β = −0.033, t = −2.446). Table 1 displays parameter estimates for the model. Additionally, iambs were significantly longer than trochees (β = −0.138; t = −2.707), regardless of condition; there was no interaction between meter type and overlap condition. Figure 2 displays average target durations by condition for this experiment.

Table 1 Fixed effects estimates for target word durations in Experiment 1

Full size table

Discussion

In this experiment, we replicate previous findings that have shown that segmental overlap leads to significant word lengthening compared with prime–target pairs that do not overlap. We also found that the addition of metrical overlap leads to even more lengthening. This experiment demonstrates that metrical structure plays a role in the dynamics of phonological encoding, and representational similarity at the metrical level between prime and target has the same planning consequences for encoding as segmental similarity. It is also potentially consistent with the notion that segmental and metrical spell-out occur through separate but similar processes (e.g., Roelofs & Meyer, 1998).

However, an alternative explanation for the additive effect of metrical and segmental overlap is that the two representations are not independent, and a representation that has access to both segmental and metrical information guides phonological encoding. Although there are multiple ways to think about what such a representational system might look like (see the General Discussion), one possibility is that encoding depends on a representation that tracks acoustic-phonetic detail that includes metrical and segmental information. That is to say, the overlapping syllable in words with the same stress pattern (candy/candle) are more similar than overlapping syllables in words with a conflicting stress pattern (canteen/candle). Thus, it is possible that an acoustically detailed representation that includes both metrical and segmental information is used in phonological encoding and is driving the lengthening/competition effects we see in Experiment 1.

Thus, our goal in Experiment 2 was to understand whether segmental and metrical planning occur independently or whether they share representations at the level of phonological encoding. To adjudicate between these possible explanations, we introduced a condition with metrical overlap alone in Experiment 2. If the lengthening effect from Experiment 1 is driven by metrical similarity, rather than acoustic-phonetic similarity, we should see lengthening in a condition in which there is metrical overlap between prime and target but no segmental overlap.