Abstract
The purpose of this paper is to draw attention to the definition of timbre as it pertains to the vowels of speech. There are two forms of size information in these “source-filter” sounds, information about the size of the excitation mechanism (the vocal folds), and information about the size of the resonators in the vocal tract that filter the excitation before it is projected into the air. The current definitions of pitch and timbre treat the two forms of size information differently. In this paper, we argue that the perception of speech sounds by humans suggests that the definition of timbre would be more useful if it grouped the size variables together and separated the pair of them from the remaining properties of these sounds.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
References
Cohen L (1993) The scale transform. IEEE Trans Acoust 41:3275–3292
Fitch WT, Giedd J (1999) Morphology and development of the human vocal tract: a study using magnetic resonance imaging. J Acoust Soc Am 106:1511–1522
Irino T, Patterson RD (2002) Segregating information about the size and shape of the vocal tract using a time-domain auditory model: the stabilized wavelet-Mellin transform. Speech Commun 36:181–203
Ives DT, Smith DRR, Patterson RD (2005) Discrimination of speaker size from syllable phrases. J Acoust Soc Am 118:3816–3822
Kawahara H, Irino T (2004) Underlying principles of a high-quality speech manipulation system STRAIGHT and its application to speech segregation. In: Divenyi PL (ed) Speech separation by humans and machines. Kluwer Academic, MA
Krumbholz K, Patterson RD, Pressnitzer D (2000) The lower limit of pitch as determined by rate discrimination. J Acoust Soc Am 108:1170–1180
Lee S, Potamianos A, Narayanan S (1999) Acoustics of children’s speech: developmental changes and spectral parameters. J Acoust Soc Am 105:1455–1468
Patterson RD, van Dinther R, Irino T (2007) The robustness of bio-acoustic communication and the role of normalization. In: Proceedings of 19th international congress on acoustics, Madrid, September 2007, pp 7–11
Patterson RD, Smith DRR, van Dinther R, Walters TC (2008) Size information in the production and perception of communication sounds. In: Yost WA, Popper AN, Fay RR (eds) Auditory perception of sound sources. Springer Science/Business Media, LLC, New York
Peterson GE, Barney HI (1952) Control methods used in the study of vowels. J Acoust Soc Am 24:75–184
Pressnitzer D, Patterson RD, Krumbholtz K (2001) The lower limit of melodic pitch. J Acoust Soc Am 109:2074–2084
Smith DRR, Patterson RD (2005) The interaction of glottal-pulse rate and vocal-tract length in judgements of speaker size, sex, and age. J Acoust Soc Am 118:3177–3186
Smith DRR, Patterson RD, Turner RE, Kawahara H, Irino T (2005) The processing and perception of size information in speech sounds. J Acoust Soc Am 117:305–318
Turner RE, Walters TC, Monaghan JJM, Patterson RD (2009) A statistical, formant-pattern model for segregating vowel type and vocal-tract length in developmental formant data. J Acoust Soc Am 125:2374–2386
Acknowledgments
Research supported by the UK Medical Research Council [G0500221, G9900369].
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2010 Springer Science+Business Media, LLC
About this paper
Cite this paper
Patterson, R.D., Walters, T.C., Monaghan, J.J.M., Gaudrain, E. (2010). Reviewing the Definition of Timbre as it Pertains to the Perception of Speech and Musical Sounds. In: Lopez-Poveda, E., Palmer, A., Meddis, R. (eds) The Neurophysiological Bases of Auditory Perception. Springer, New York, NY. https://doi.org/10.1007/978-1-4419-5686-6_21
Download citation
DOI: https://doi.org/10.1007/978-1-4419-5686-6_21
Published:
Publisher Name: Springer, New York, NY
Print ISBN: 978-1-4419-5685-9
Online ISBN: 978-1-4419-5686-6
eBook Packages: Biomedical and Life SciencesBiomedical and Life Sciences (R0)