Reviewing the Definition of Timbre as it Pertains to the Perception of Speech and Musical Sounds

Patterson, Roy D.; Walters, Thomas C.; Monaghan, Jessica J. M.; Gaudrain, Etienne

doi:10.1007/978-1-4419-5686-6_21

Roy D. Patterson⁴,
Thomas C. Walters,
Jessica J. M. Monaghan &
…
Etienne Gaudrain

1315 Accesses

Abstract

The purpose of this paper is to draw attention to the definition of timbre as it pertains to the vowels of speech. There are two forms of size information in these “source-filter” sounds, information about the size of the excitation mechanism (the vocal folds), and information about the size of the resonators in the vocal tract that filter the excitation before it is projected into the air. The current definitions of pitch and timbre treat the two forms of size information differently. In this paper, we argue that the perception of speech sounds by humans suggests that the definition of timbre would be more useful if it grouped the size variables together and separated the pair of them from the remaining properties of these sounds.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Log in via an institution

Chapter: USD 29.95; Price excludes VAT (USA)

eBook: USD 169.00; Price excludes VAT (USA)

Softcover Book: USD 219.99; Price excludes VAT (USA)

Hardcover Book: USD 219.99; Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

References

Cohen L (1993) The scale transform. IEEE Trans Acoust 41:3275–3292
Google Scholar
Fitch WT, Giedd J (1999) Morphology and development of the human vocal tract: a study using magnetic resonance imaging. J Acoust Soc Am 106:1511–1522
Article PubMed CAS Google Scholar
Irino T, Patterson RD (2002) Segregating information about the size and shape of the vocal tract using a time-domain auditory model: the stabilized wavelet-Mellin transform. Speech Commun 36:181–203
Article Google Scholar
Ives DT, Smith DRR, Patterson RD (2005) Discrimination of speaker size from syllable phrases. J Acoust Soc Am 118:3816–3822
Article PubMed Google Scholar
Kawahara H, Irino T (2004) Underlying principles of a high-quality speech manipulation system STRAIGHT and its application to speech segregation. In: Divenyi PL (ed) Speech separation by humans and machines. Kluwer Academic, MA
Google Scholar
Krumbholz K, Patterson RD, Pressnitzer D (2000) The lower limit of pitch as determined by rate discrimination. J Acoust Soc Am 108:1170–1180
Article PubMed CAS Google Scholar
Lee S, Potamianos A, Narayanan S (1999) Acoustics of children’s speech: developmental changes and spectral parameters. J Acoust Soc Am 105:1455–1468
Article PubMed CAS Google Scholar
Patterson RD, van Dinther R, Irino T (2007) The robustness of bio-acoustic communication and the role of normalization. In: Proceedings of 19th international congress on acoustics, Madrid, September 2007, pp 7–11
Google Scholar
Patterson RD, Smith DRR, van Dinther R, Walters TC (2008) Size information in the production and perception of communication sounds. In: Yost WA, Popper AN, Fay RR (eds) Auditory perception of sound sources. Springer Science/Business Media, LLC, New York
Google Scholar
Peterson GE, Barney HI (1952) Control methods used in the study of vowels. J Acoust Soc Am 24:75–184
Google Scholar
Pressnitzer D, Patterson RD, Krumbholtz K (2001) The lower limit of melodic pitch. J Acoust Soc Am 109:2074–2084
Article PubMed CAS Google Scholar
Smith DRR, Patterson RD (2005) The interaction of glottal-pulse rate and vocal-tract length in judgements of speaker size, sex, and age. J Acoust Soc Am 118:3177–3186
Article PubMed Google Scholar
Smith DRR, Patterson RD, Turner RE, Kawahara H, Irino T (2005) The processing and perception of size information in speech sounds. J Acoust Soc Am 117:305–318
Article PubMed Google Scholar
Turner RE, Walters TC, Monaghan JJM, Patterson RD (2009) A statistical, formant-pattern model for segregating vowel type and vocal-tract length in developmental formant data. J Acoust Soc Am 125:2374–2386
Article PubMed Google Scholar

Download references

Acknowledgments

Research supported by the UK Medical Research Council [G0500221, G9900369].

Author information

Authors and Affiliations

Centre for the Neural Basis of Hearing, University of Cambridge, Downing Site, Cambridge, CB2 3EG, UK
Roy D. Patterson

Authors

Roy D. Patterson
View author publications
You can also search for this author in PubMed Google Scholar
Thomas C. Walters
View author publications
You can also search for this author in PubMed Google Scholar
Jessica J. M. Monaghan
View author publications
You can also search for this author in PubMed Google Scholar
Etienne Gaudrain
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Roy D. Patterson .

Editor information

Editors and Affiliations

Inst. Neurociencias de Castilla y León, Universidad de Salamanca, Av. Alfonso X El Sabio s/n, Salamanca, 37007, Spain
Enrique A. Lopez-Poveda
MRC Inst.of Hearing Research, University Park, Nottingham, NG7 2RD, United Kingdom
Alan R. Palmer
University of Essex, Wivenhoe Park, Colchester, Essex, CO4 3SQ, United Kingdom
Ray Meddis

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Patterson, R.D., Walters, T.C., Monaghan, J.J.M., Gaudrain, E. (2010). Reviewing the Definition of Timbre as it Pertains to the Perception of Speech and Musical Sounds. In: Lopez-Poveda, E., Palmer, A., Meddis, R. (eds) The Neurophysiological Bases of Auditory Perception. Springer, New York, NY. https://doi.org/10.1007/978-1-4419-5686-6_21

Download citation

DOI: https://doi.org/10.1007/978-1-4419-5686-6_21
Published: 16 February 2010
Publisher Name: Springer, New York, NY
Print ISBN: 978-1-4419-5685-9
Online ISBN: 978-1-4419-5686-6
eBook Packages: Biomedical and Life SciencesBiomedical and Life Sciences (R0)

Publish with us

Policies and ethics