Formant-Frequency Variation and Its Effects on Across-Formant Grouping in Speech Perception

Roberts, Brian; Summers, Robert J.; Bailey, Peter J.

doi:10.1007/978-1-4614-1590-9_36

Part of the book series: Advances in Experimental Medicine and Biology (volume 787)

Abstract

How speech is separated perceptually from other speech remains poorly understood. In a series of experiments, perceptual organisation was probed by presenting three-formant (F1+F2+F3) analogues of target sentences dichotically, together with a competitor for F2 (F2C), or for F2+F3, which listeners must reject to optimise recognition. To control for energetic masking, the competitor was always presented in the opposite ear to the corresponding target formant(s). Sine-wave speech was used initially, and different versions of F2C were derived from F2 using separate manipulations of its amplitude and frequency contours. F2Cs with time-varying frequency contours were highly effective competitors, whatever their amplitude characteristics, whereas constant-frequency F2Cs were ineffective. Subsequent studies used synthetic-formant speech to explore the effects of manipulating the rate and depth of formant-frequency change in the competitor. Competitor efficacy was not tuned to the rate of formant-frequency variation in the target sentences; rather, the reduction in intelligibility increased with competitor rate relative to the rate for the target sentences. Therefore, differences in speech rate may not be a useful cue for separating the speech of concurrent talkers. Effects of competitors whose depth of formant-frequency variation was scaled by a range of factors were explored using competitors derived either by inverting the frequency contour of F2 about its geometric mean (plausibly speech-like pattern) or by using a regular and arbitrary frequency contour (triangle wave, not plausibly speech-like) matched to the average rate and depth of variation for the inverted F2C. Competitor efficacy depended on the overall depth of frequency variation, not depth relative to that for the other formants. Furthermore, the triangle-wave competitors were as effective as their more speech-like counterparts. 
Overall, the results suggest that formant-frequency variation is critical for the across-frequency grouping of formants but that this grouping does not depend on speech-specific constraints.
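
The two contour manipulations named in the abstract (inversion about the geometric mean and scaling of the depth of variation) can be illustrated with a minimal sketch. It assumes both operations are carried out on a logarithmic frequency scale, which the abstract does not state explicitly; the function names and the three-point F2 track are hypothetical and are not taken from the chapter.

```python
import math

def geometric_mean(freqs_hz):
    """Geometric mean of a frequency contour (Hz)."""
    return math.exp(sum(math.log(f) for f in freqs_hz) / len(freqs_hz))

def invert_contour(freqs_hz):
    """Mirror a contour about its geometric mean.

    Assumption: inversion is done in log frequency, so each value f
    maps to gm**2 / f and the geometric mean is unchanged.
    """
    gm = geometric_mean(freqs_hz)
    return [gm * gm / f for f in freqs_hz]

def scale_depth(freqs_hz, factor):
    """Scale the depth of variation about the geometric mean.

    Assumption: scaling is applied to log-frequency excursions.
    factor = 1 leaves the contour unchanged; factor = 0 yields a
    constant-frequency contour at the geometric mean.
    """
    gm = geometric_mean(freqs_hz)
    return [gm * (f / gm) ** factor for f in freqs_hz]

# Hypothetical F2 track in Hz, for illustration only.
f2 = [1000.0, 2000.0, 4000.0]
inverted = invert_contour(f2)      # approximately [4000, 2000, 1000]
flattened = scale_depth(f2, 0.0)   # approximately [2000, 2000, 2000]
```

Under these assumptions, a scale factor of 0 gives a constant-frequency contour of the kind reported as an ineffective competitor, while larger factors increase the depth of variation.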

Acknowledgements

This research was supported by Research Grant EP/F016484/1 from the Engineering and Physical Sciences Research Council (UK).

Author information

Authors and Affiliations

Psychology, School of Life and Health Sciences, Aston University, Birmingham, B4 7ET, UK
Brian Roberts & Robert J. Summers
Department of Psychology, University of York, Heslington, York, YO10 5DD, UK
Peter J. Bailey

Corresponding author

Correspondence to Brian Roberts.

Editor information

Editors and Affiliations

Department of Experimental Psychology, University of Cambridge, Cambridge, United Kingdom
Brian C. J. Moore
Physiology Department, University of Cambridge, Cambridge, United Kingdom
Roy D. Patterson
Physiology Department, University of Cambridge, Cambridge, United Kingdom
Ian M. Winter
MRC-Cognition and Brain Sciences Unit, Cambridge, United Kingdom
Robert P. Carlyon
MRC-Cognition and Brain Sciences Unit, Cambridge, United Kingdom
Hedwig E. Gockel

Cite this paper

Roberts, B., Summers, R.J., Bailey, P.J. (2013). Formant-Frequency Variation and Its Effects on Across-Formant Grouping in Speech Perception. In: Moore, B., Patterson, R., Winter, I., Carlyon, R., Gockel, H. (eds) Basic Aspects of Hearing. Advances in Experimental Medicine and Biology, vol 787. Springer, New York, NY. https://doi.org/10.1007/978-1-4614-1590-9_36

DOI: https://doi.org/10.1007/978-1-4614-1590-9_36
Published: 16 April 2013
Publisher Name: Springer, New York, NY
Print ISBN: 978-1-4614-1589-3
Online ISBN: 978-1-4614-1590-9
eBook Packages: Biomedical and Life Sciences (R0)
