Skip to main content

A New Approach to Sound Source Segregation

  • Conference paper
  • First Online:
Basic Aspects of Hearing

Abstract

We rely critically on our ability to ‘hear out’ (segregate) individual sound sources in a mixture. Yet, despite its importance, little is known regarding this ­ability. Perturbation analysis is a psychophysical method that has been successfully applied to related problems in vision [Murray, R.F. 2011. J. of Vision 11, 1–25]. Here the approach is adapted to audition. The application proceeds in three stages: First, simple speech and environmental sounds are synthesized according to a generative model of the sound-­producing source. Second, listener decision strategy in segregating target from non-­target (noise) sources is determined from decision weights (regression coefficients) relating listener judgments regarding the target to lawful perturbations in acoustic parameters, as dictated by the generative model. Third, factors limiting segregation are identified by comparing the obtained weights and residuals to those of a maximum-likelihood (ML) observer that optimizes segregation based on the equations of motion of the generating source. Here, the approach is applied to test between the two major models of sound source segregation; target enhancement versus noise cancellation. The results indicate a tendency of noise segregation to preempt target enhancement when the noise source is unchanging. However, the results also show individual differences in segregation strategy that are not evident in the measures of performance accuracy alone.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 169.00
Price excludes VAT (USA)
  • Available as EPUB and PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 219.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info
Hardcover Book
USD 219.99
Price excludes VAT (USA)
  • Durable hardcover edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

References

  • ANSI (2004) ANSI S3.6-2004. Specification for audiometers. American National Standards Institute, New York

    Google Scholar 

  • Berg BG (1989) Analysis of weights in multiple observation tasks. J Acoust Soc Am 86:1743–1746

    Article  Google Scholar 

  • Bregman AS (1990) Auditory scene analysis: the perceptual organization of sound. Bradford Books/MIT Press, Cambridge, MA

    Google Scholar 

  • Carlyon RP, Moore BCJ (1986) Continuous versus gated pedestals and the severe departure from Weber’s law. J Acoust Soc Am 79:453–460

    Article  PubMed  CAS  Google Scholar 

  • de Cheveigné A, McAdams S, Laroche J, Rosenberg M (1995) Identification of concurrent harmonic and inharmonic vowels: a test of the theory of harmonic cancellation and enhancement. J Acoust Soc Am 97:3736–3748

    Article  PubMed  Google Scholar 

  • Durlach N (1963) Equalization and cancellation theory of binaural masking-level differences. J Acoust Soc Am 35:1206–1218

    Article  Google Scholar 

  • Klatt DH (1980) Software for a cascade/parallel formant synthesizer. J Acoust Soc Am 67:971–995

    Article  Google Scholar 

  • Lutfi RA (2001) Auditory detection of hollowness. J Acoust Soc Am 110:1010–1019

    Article  PubMed  CAS  Google Scholar 

  • Lutfi RA, Liu CJ (2007) Individual differences in source identification from synthesized impact sounds. J Acoust Soc Am 122:1017–1028

    Article  PubMed  Google Scholar 

  • Lutfi RA, Liu CJ (2011a) A method for evaluating the relation between sound source segregation and masking. J Acoust Soc Am 129:EL34–38

    Article  PubMed  Google Scholar 

  • Lutfi RA, Liu CJ (2011b) Target enhancement and noise cancellation in the identification of a rudimentary sound source in noise. J Acoust Soc Am 129:EL52–56

    Article  PubMed  Google Scholar 

  • Lutfi RA, Stoelinga CNJ (2010) Sensory constraints on the identification of the geometric and material properties of struck bars. J Acoust Soc Am 127:350–360

    Article  PubMed  Google Scholar 

  • Lutfi RA, Liu CJ, Stoelinga CNJ (2008) Level dominance in sound source identification. J Acoust Soc Am 124:3784–3792

    Article  PubMed  Google Scholar 

  • Lutfi RA, Wightman FL (1996) Guessing or confusion?: Analytic predictions for two models of target-distracter interference in children. Abstracts of the Assoc. for Research in Otolaryngology 19:142

    Google Scholar 

  • Meddis R, Hewitt MJ (1992) Modeling the identification of concurrent vowels with different fundamental frequencies. J Acoust Soc Am 91:233–245

    Article  PubMed  CAS  Google Scholar 

  • Morse PM, Ingard KU (1968) Theoretical acoustics. Princeton University Press, Princeton, pp 175–121

    Google Scholar 

  • Patterson RD (1976) Auditory filter shapes derived with noise stimuli. J Acoust Soc Am 59:640–654

    Article  PubMed  CAS  Google Scholar 

  • Patterson RD, Nimmo-Smith IN (1980) Off-frequency listening and auditory-filter asymmetry. J Acoust Soc Am 67:229–245

    Article  CAS  Google Scholar 

  • Patterson RD, Smith DRR, van Dinter R, Walters TC (2008) Size information in the production and perception of communication sounds. In: Yost WA, Popper AN (eds) Springer handbook of auditory research: auditory perception of sound sources. Springer, New York, pp 43–76

    Google Scholar 

  • Piechowiak T, Ewert SD, Dau T (2007) Modeling comodulation masking release using an equalization-cancellation mechanism. J Acoust Soc Am 121:2111–2126

    Article  PubMed  Google Scholar 

  • Werner LA, Bargones JY (1991) Sources of auditory masking in infants: distraction effects. Percept Psychophys 50:405–412

    Article  PubMed  CAS  Google Scholar 

  • Wier CC, Jesteadt W, Green DM (1977) Frequency discrimination as a function of frequency and sensation level. J Acoust Soc Am 61:178–184

    Article  PubMed  CAS  Google Scholar 

Download references

Acknowledgments

Figure 23.1 was reproduced from Lutfi and Liu (2011b) with permission from the Journal of the Acoustical Society of America. This research was supported by NIDCD grants R01DC006875 and R01DC01262.

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Robert A. Lutfi .

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2013 Springer Science+Business Media New York

About this paper

Cite this paper

Lutfi, R.A., Liu, CJ., Stoelinga, C.N.J. (2013). A New Approach to Sound Source Segregation. In: Moore, B., Patterson, R., Winter, I., Carlyon, R., Gockel, H. (eds) Basic Aspects of Hearing. Advances in Experimental Medicine and Biology, vol 787. Springer, New York, NY. https://doi.org/10.1007/978-1-4614-1590-9_23

Download citation

Publish with us

Policies and ethics