Skip to main content

Rate Versus Temporal Code? A Spatio-Temporal Coherence Model of the Cortical Basis of Streaming

  • Conference paper
  • First Online:
The Neurophysiological Bases of Auditory Perception

Abstract

A better understanding of auditory scene analysis requires uncovering the brain processes that govern the segregation of sound patterns into perceptual streams. Existing models of auditory streaming emphasize tonotopic or “spatial” separation of neural responses as the primary determinant of stream segregation. While partially true, this theory is far from complete. It overlooks the involvement of and interaction between both “sequential” and “simultaneous” grouping mechanisms in the process of scene analysis.

Here, we describe a new neuro-computational model of auditory streaming. Inspired by recent psychophysical (cf. abstract by Micheyl et al.) and physiological findings, this model is based on the premise that perceived segregation results from spatio-temporal incoherence, rather than just tonotopic separation. While tonotopic separation still plays an important role in this model, it is an indirect one: tonotopic overlap tends to reduce temporal incoherence, which in turn impedes segregation. The model simulates responses at the level of the primary auditory cortex and performs a correlative analysis of cortical responses in order to assess how different sound elements evolve in time in relation to each other. An eigenvector decomposition of this coherence analysis is used to predict how the input stimulus is organized into streams. The model is evaluated by comparing its neural and perceptual predictions under various stimulus conditions to physiological and psychophysical results.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 169.00
Price excludes VAT (USA)
  • Available as EPUB and PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 219.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info
Hardcover Book
USD 219.99
Price excludes VAT (USA)
  • Durable hardcover edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

References

  • Blake R, Lee SH (2005) The role of temporal structure in human vision. Behav Cogn Neurosci Rev 4:21–42

    Article  PubMed  Google Scholar 

  • Bregman AS (1990) Auditory scene analysis. (Cambridge MIT Press), MA

    Google Scholar 

  • Bregman AS, Campbell J (1971) Primary auditory stream segregation and perception of order in rapid sequences of tones. J Exp Psychol 89:244–249

    Article  PubMed  CAS  Google Scholar 

  • Carlyon RP, Shamma S (2003) An account of monaural phase sensitivity. J Acoust Soc Am 114:333–348

    Article  PubMed  Google Scholar 

  • Chi T, Gao Y, Guyton MC, Ru P, Shamma S (1999) Spectro-temporal modulation transfer functions and speech intelligibility. J Acoust Soc Am 106:2719–2732

    Article  PubMed  CAS  Google Scholar 

  • Chi T, Ru P, Shamma SA (2005) Multiresolution spectrotemporal analysis of complex sounds. J Acoust Soc Am 118:887–906

    Article  PubMed  Google Scholar 

  • Elhilali M, Chi T, Shamma SA (2003) A spectro-temporal modulation index (STMI) for assessment of speech intelligibility. Speech Commun 41:331–348

    Article  Google Scholar 

  • Elhilali M, Ma L, Micheyl C, Oxenham AJ, Shamma S (2009) Temporal coherence in the perceptual organization and cortical representation of auditory scenes. Neuron 61:317–329

    Article  PubMed  CAS  Google Scholar 

  • Elhilali M, Shamma SA (2007) The correlative brain: a stream segregation model. In: Kollmeier B, Klump G, Hohmann V, Langemann U, Mauermann M, Uppenkamp S, Verhey J (eds) Hearing: from Sensory processing to perception. Springer, New York

    Google Scholar 

  • Fishman YI, Arezzo JC, Steinschneider M (2004) Auditory stream segregation in monkey auditory cortex: effects of frequency separation, presentation rate, and tone duration. J Acoust Soc Am 116:1656–1670

    Article  PubMed  Google Scholar 

  • Fishman YI, Reser DH, Arezzo JC, Steinschneider M (2001) Neural correlates of auditory stream segregation in primary auditory cortex of the awake monkey. Hear Res 151:167–187

    Article  PubMed  CAS  Google Scholar 

  • Micheyl C, Shamma S, Elhilali M, Oxenham A (2010) Sequential and simultaneous auditory grouping measured with synchromy detection. In: E.A. Lopez-Poveda, A-R. Palmer, R. Meddis (eds) The neurophysiological baser of auditory perfection. Springer, New York

    Google Scholar 

  • Micheyl C, Tian B, Carlyon RP, Rauschecker JP (2005) Perceptual organization of tone sequences in the auditory cortex of awake macaques. Neuron 48:139–148

    Article  PubMed  CAS  Google Scholar 

  • Viemeister NF (1979) Temporal modulation transfer functions based upon modulation thresholds. J Acoust Soc Am 66:1364–1380

    Article  PubMed  CAS  Google Scholar 

  • Wang K, Shamma SA (1995) Spectral shape analysis in the central auditory system. IEEE Trans Speech Audio Process 3:382–395

    Article  Google Scholar 

Download references

Acknowledgments

We thank Pingbo Yin and Stephen David for their assistance with physiological recordings. More details concerning the methods and results of the physiological experiment may be found in Elhilali et al. (2009). This work is supported by grants from the National Institute on Deafness and Other Communication Disorders (R01 DC 07657) and the National Institute on Aging, through the Collaborative Research in Computational Neuroscience program (R01 AG 02757301).

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Mounya Elhilali .

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2010 Springer Science+Business Media, LLC

About this paper

Cite this paper

Elhilali, M., Ma, L., Micheyl, C., Oxenham, A., Shamma, S. (2010). Rate Versus Temporal Code? A Spatio-Temporal Coherence Model of the Cortical Basis of Streaming. In: Lopez-Poveda, E., Palmer, A., Meddis, R. (eds) The Neurophysiological Bases of Auditory Perception. Springer, New York, NY. https://doi.org/10.1007/978-1-4419-5686-6_46

Download citation

Publish with us

Policies and ethics