Rate Versus Temporal Code? A Spatio-Temporal Coherence Model of the Cortical Basis of Streaming

Elhilali, Mounya; Ma, Ling; Micheyl, Christophe; Oxenham, Andrew; Shamma, Shihab

doi:10.1007/978-1-4419-5686-6_46

Abstract

A better understanding of auditory scene analysis requires uncovering the brain processes that govern the segregation of sound patterns into perceptual streams. Existing models of auditory streaming emphasize tonotopic or “spatial” separation of neural responses as the primary determinant of stream segregation. While partially true, this theory is far from complete. It overlooks the involvement of, and the interaction between, “sequential” and “simultaneous” grouping mechanisms in the process of scene analysis.

Here, we describe a new neuro-computational model of auditory streaming. Inspired by recent psychophysical (cf. abstract by Micheyl et al.) and physiological findings, this model is based on the premise that perceived segregation results from spatio-temporal incoherence, rather than just tonotopic separation. While tonotopic separation still plays an important role in this model, it is an indirect one: tonotopic overlap tends to reduce temporal incoherence, which in turn impedes segregation. The model simulates responses at the level of the primary auditory cortex and performs a correlative analysis of cortical responses in order to assess how different sound elements evolve in time in relation to each other. An eigenvector decomposition of this coherence analysis is used to predict how the input stimulus is organized into streams. The model is evaluated by comparing its neural and perceptual predictions under various stimulus conditions to physiological and psychophysical results.
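The correlate-then-decompose idea can be illustrated with a minimal sketch. This is not the authors' model: the two-channel toy responses, the zero-lag correlation, the function names, and the 10% eigenvalue threshold are all assumptions chosen for illustration, whereas the model described above operates on simulated primary auditory cortex responses. The sketch only shows why two tones that are well separated in frequency yield one dominant eigenvector when they are synchronous and two when they alternate.

```python
import numpy as np

def toy_responses(synchronous, n_cycles=20):
    """Idealized two-channel responses to repeating tones A and B.

    Each cycle has two time slots. Row 0 is the channel tuned to A, row 1 the
    channel tuned to B. Hypothetical stand-in for simulated cortical responses.
    """
    a = np.tile([1.0, 0.0], n_cycles)
    b = a.copy() if synchronous else np.tile([0.0, 1.0], n_cycles)
    return np.vstack([a, b])

def coherence_matrix(responses):
    """Zero-lag correlation between every pair of channels, averaged over time."""
    return responses @ responses.T / responses.shape[1]

def count_streams(responses, rel_threshold=0.1):
    """Count eigenvalues above a fraction (assumed 10%) of the largest one."""
    eigenvalues = np.linalg.eigvalsh(coherence_matrix(responses))
    return int(np.sum(eigenvalues > rel_threshold * eigenvalues.max()))

print(count_streams(toy_responses(synchronous=True)))   # 1
print(count_streams(toy_responses(synchronous=False)))  # 2
```

In both cases the tonotopic separation is identical, since each tone drives only its own channel. Only the temporal relationship between the channels changes the eigenvalue spectrum, which is the point the abstract makes about coherence rather than separation.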

References

Blake R, Lee SH (2005) The role of temporal structure in human vision. Behav Cogn Neurosci Rev 4:21–42
Bregman AS (1990) Auditory scene analysis. MIT Press, Cambridge, MA
Bregman AS, Campbell J (1971) Primary auditory stream segregation and perception of order in rapid sequences of tones. J Exp Psychol 89:244–249
Carlyon RP, Shamma S (2003) An account of monaural phase sensitivity. J Acoust Soc Am 114:333–348
Chi T, Gao Y, Guyton MC, Ru P, Shamma S (1999) Spectro-temporal modulation transfer functions and speech intelligibility. J Acoust Soc Am 106:2719–2732
Chi T, Ru P, Shamma SA (2005) Multiresolution spectrotemporal analysis of complex sounds. J Acoust Soc Am 118:887–906
Elhilali M, Chi T, Shamma SA (2003) A spectro-temporal modulation index (STMI) for assessment of speech intelligibility. Speech Commun 41:331–348
Elhilali M, Ma L, Micheyl C, Oxenham AJ, Shamma S (2009) Temporal coherence in the perceptual organization and cortical representation of auditory scenes. Neuron 61:317–329
Elhilali M, Shamma SA (2007) The correlative brain: a stream segregation model. In: Kollmeier B, Klump G, Hohmann V, Langemann U, Mauermann M, Uppenkamp S, Verhey J (eds) Hearing: from sensory processing to perception. Springer, New York
Fishman YI, Arezzo JC, Steinschneider M (2004) Auditory stream segregation in monkey auditory cortex: effects of frequency separation, presentation rate, and tone duration. J Acoust Soc Am 116:1656–1670
Fishman YI, Reser DH, Arezzo JC, Steinschneider M (2001) Neural correlates of auditory stream segregation in primary auditory cortex of the awake monkey. Hear Res 151:167–187
Micheyl C, Shamma S, Elhilali M, Oxenham A (2010) Sequential and simultaneous auditory grouping measured with synchrony detection. In: Lopez-Poveda EA, Palmer AR, Meddis R (eds) The neurophysiological bases of auditory perception. Springer, New York
Micheyl C, Tian B, Carlyon RP, Rauschecker JP (2005) Perceptual organization of tone sequences in the auditory cortex of awake macaques. Neuron 48:139–148
Viemeister NF (1979) Temporal modulation transfer functions based upon modulation thresholds. J Acoust Soc Am 66:1364–1380
Wang K, Shamma SA (1995) Spectral shape analysis in the central auditory system. IEEE Trans Speech Audio Process 3:382–395

Acknowledgments

We thank Pingbo Yin and Stephen David for their assistance with physiological recordings. More details concerning the methods and results of the physiological experiment may be found in Elhilali et al. (2009). This work is supported by grants from the National Institute on Deafness and Other Communication Disorders (R01 DC 07657) and the National Institute on Aging, through the Collaborative Research in Computational Neuroscience program (R01 AG 02757301).

Author information

Authors and Affiliations

Department of Electrical and Computer Engineering, Johns Hopkins University, Baltimore, MD, USA
Mounya Elhilali

Corresponding author

Correspondence to Mounya Elhilali.

Editor information

Editors and Affiliations

Inst. Neurociencias de Castilla y León, Universidad de Salamanca, Av. Alfonso X El Sabio s/n, Salamanca, 37007, Spain
Enrique A. Lopez-Poveda
MRC Inst. of Hearing Research, University Park, Nottingham, NG7 2RD, United Kingdom
Alan R. Palmer
University of Essex, Wivenhoe Park, Colchester, Essex, CO4 3SQ, United Kingdom
Ray Meddis

Cite this paper

Elhilali, M., Ma, L., Micheyl, C., Oxenham, A., Shamma, S. (2010). Rate Versus Temporal Code? A Spatio-Temporal Coherence Model of the Cortical Basis of Streaming. In: Lopez-Poveda, E., Palmer, A., Meddis, R. (eds) The Neurophysiological Bases of Auditory Perception. Springer, New York, NY. https://doi.org/10.1007/978-1-4419-5686-6_46

DOI: https://doi.org/10.1007/978-1-4419-5686-6_46
Published: 16 February 2010
Publisher Name: Springer, New York, NY
Print ISBN: 978-1-4419-5685-9
Online ISBN: 978-1-4419-5686-6
eBook Packages: Biomedical and Life Sciences (R0)
