A New Approach to Sound Source Segregation

Lutfi, Robert A.; Liu, Ching-Ju; Stoelinga, Christophe N. J.

doi:10.1007/978-1-4614-1590-9_23

Robert A. Lutfi⁶,
Ching-Ju Liu⁶ &
Christophe N. J. Stoelinga⁶

Part of the book series: Advances in Experimental Medicine and Biology ((volume 787))

4243 Accesses
3 Citations

Abstract

We rely critically on our ability to ‘hear out’ (segregate) individual sound sources in a mixture. Yet, despite its importance, little is known regarding this ability. Perturbation analysis is a psychophysical method that has been successfully applied to related problems in vision [Murray, R.F. 2011. J. of Vision 11, 1–25]. Here the approach is adapted to audition. The application proceeds in three stages: First, simple speech and environmental sounds are synthesized according to a generative model of the sound-producing source. Second, listener decision strategy in segregating target from non-target (noise) sources is determined from decision weights (regression coefficients) relating listener judgments regarding the target to lawful perturbations in acoustic parameters, as dictated by the generative model. Third, factors limiting segregation are identified by comparing the obtained weights and residuals to those of a maximum-likelihood (ML) observer that optimizes segregation based on the equations of motion of the generating source. Here, the approach is applied to test between the two major models of sound source segregation; target enhancement versus noise cancellation. The results indicate a tendency of noise segregation to preempt target enhancement when the noise source is unchanging. However, the results also show individual differences in segregation strategy that are not evident in the measures of performance accuracy alone.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Log in via an institution

Chapter: USD 29.95; Price excludes VAT (USA)

eBook: USD 169.00; Price excludes VAT (USA)

Softcover Book: USD 219.99; Price excludes VAT (USA)

Hardcover Book: USD 219.99; Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

References

ANSI (2004) ANSI S3.6-2004. Specification for audiometers. American National Standards Institute, New York
Google Scholar
Berg BG (1989) Analysis of weights in multiple observation tasks. J Acoust Soc Am 86:1743–1746
Article Google Scholar
Bregman AS (1990) Auditory scene analysis: the perceptual organization of sound. Bradford Books/MIT Press, Cambridge, MA
Google Scholar
Carlyon RP, Moore BCJ (1986) Continuous versus gated pedestals and the severe departure from Weber’s law. J Acoust Soc Am 79:453–460
Article PubMed CAS Google Scholar
de Cheveigné A, McAdams S, Laroche J, Rosenberg M (1995) Identification of concurrent harmonic and inharmonic vowels: a test of the theory of harmonic cancellation and enhancement. J Acoust Soc Am 97:3736–3748
Article PubMed Google Scholar
Durlach N (1963) Equalization and cancellation theory of binaural masking-level differences. J Acoust Soc Am 35:1206–1218
Article Google Scholar
Klatt DH (1980) Software for a cascade/parallel formant synthesizer. J Acoust Soc Am 67:971–995
Article Google Scholar
Lutfi RA (2001) Auditory detection of hollowness. J Acoust Soc Am 110:1010–1019
Article PubMed CAS Google Scholar
Lutfi RA, Liu CJ (2007) Individual differences in source identification from synthesized impact sounds. J Acoust Soc Am 122:1017–1028
Article PubMed Google Scholar
Lutfi RA, Liu CJ (2011a) A method for evaluating the relation between sound source segregation and masking. J Acoust Soc Am 129:EL34–38
Article PubMed Google Scholar
Lutfi RA, Liu CJ (2011b) Target enhancement and noise cancellation in the identification of a rudimentary sound source in noise. J Acoust Soc Am 129:EL52–56
Article PubMed Google Scholar
Lutfi RA, Stoelinga CNJ (2010) Sensory constraints on the identification of the geometric and material properties of struck bars. J Acoust Soc Am 127:350–360
Article PubMed Google Scholar
Lutfi RA, Liu CJ, Stoelinga CNJ (2008) Level dominance in sound source identification. J Acoust Soc Am 124:3784–3792
Article PubMed Google Scholar
Lutfi RA, Wightman FL (1996) Guessing or confusion?: Analytic predictions for two models of target-distracter interference in children. Abstracts of the Assoc. for Research in Otolaryngology 19:142
Google Scholar
Meddis R, Hewitt MJ (1992) Modeling the identification of concurrent vowels with different fundamental frequencies. J Acoust Soc Am 91:233–245
Article PubMed CAS Google Scholar
Morse PM, Ingard KU (1968) Theoretical acoustics. Princeton University Press, Princeton, pp 175–121
Google Scholar
Patterson RD (1976) Auditory filter shapes derived with noise stimuli. J Acoust Soc Am 59:640–654
Article PubMed CAS Google Scholar
Patterson RD, Nimmo-Smith IN (1980) Off-frequency listening and auditory-filter asymmetry. J Acoust Soc Am 67:229–245
Article CAS Google Scholar
Patterson RD, Smith DRR, van Dinter R, Walters TC (2008) Size information in the production and perception of communication sounds. In: Yost WA, Popper AN (eds) Springer handbook of auditory research: auditory perception of sound sources. Springer, New York, pp 43–76
Google Scholar
Piechowiak T, Ewert SD, Dau T (2007) Modeling comodulation masking release using an equalization-cancellation mechanism. J Acoust Soc Am 121:2111–2126
Article PubMed Google Scholar
Werner LA, Bargones JY (1991) Sources of auditory masking in infants: distraction effects. Percept Psychophys 50:405–412
Article PubMed CAS Google Scholar
Wier CC, Jesteadt W, Green DM (1977) Frequency discrimination as a function of frequency and sensation level. J Acoust Soc Am 61:178–184
Article PubMed CAS Google Scholar

Download references

Acknowledgments

Figure 23.1 was reproduced from Lutfi and Liu (2011b) with permission from the Journal of the Acoustical Society of America. This research was supported by NIDCD grants R01DC006875 and R01DC01262.

Author information

Authors and Affiliations

Auditory Behavioral Research Laboratory, Department of Communicative Disorders, University of Wisconsin, Madison, WI, 53706, USA
Robert A. Lutfi, Ching-Ju Liu & Christophe N. J. Stoelinga

Authors

Robert A. Lutfi
View author publications
You can also search for this author in PubMed Google Scholar
Ching-Ju Liu
View author publications
You can also search for this author in PubMed Google Scholar
Christophe N. J. Stoelinga
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Robert A. Lutfi .

Editor information

Editors and Affiliations

Department of Experimental Psychology, University of Cambridge, Cambridge, United Kingdom
Brian C. J. Moore
Physiology Department, University of Cambridge, Cambridge, United Kingdom
Roy D. Patterson
Physiology Department, University of Cambridge, Cambridge, United Kingdom
Ian M. Winter
MRC-Cognition and Brain Sciences Unit, MRC-Cognition and Brain Sciences Unit, Cambridge, United Kingdom
Robert P. Carlyon
MRC-Cognition and Brain Sciences Unit, MRC-Cognition and Brain Sciences Unit, Cambridge, United Kingdom
Hedwig E Gockel

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Lutfi, R.A., Liu, CJ., Stoelinga, C.N.J. (2013). A New Approach to Sound Source Segregation. In: Moore, B., Patterson, R., Winter, I., Carlyon, R., Gockel, H. (eds) Basic Aspects of Hearing. Advances in Experimental Medicine and Biology, vol 787. Springer, New York, NY. https://doi.org/10.1007/978-1-4614-1590-9_23

Download citation

DOI: https://doi.org/10.1007/978-1-4614-1590-9_23
Published: 16 April 2013
Publisher Name: Springer, New York, NY
Print ISBN: 978-1-4614-1589-3
Online ISBN: 978-1-4614-1590-9
eBook Packages: Biomedical and Life SciencesBiomedical and Life Sciences (R0)

Publish with us

Policies and ethics