Abstract
A better understanding of auditory scene analysis requires uncovering the brain processes that govern the segregation of sound patterns into perceptual streams. Existing models of auditory streaming emphasize tonotopic or “spatial” separation of neural responses as the primary determinant of stream segregation. While partially true, this theory is far from complete. It overlooks the involvement of and interaction between both “sequential” and “simultaneous” grouping mechanisms in the process of scene analysis.
Here, we describe a new neuro-computational model of auditory streaming. Inspired by recent psychophysical (cf. abstract by Micheyl et al.) and physiological findings, this model is based on the premise that perceived segregation results from spatio-temporal incoherence, rather than just tonotopic separation. While tonotopic separation still plays an important role in this model, it is an indirect one: tonotopic overlap tends to reduce temporal incoherence, which in turn impedes segregation. The model simulates responses at the level of the primary auditory cortex and performs a correlative analysis of cortical responses in order to assess how different sound elements evolve in time in relation to each other. An eigenvector decomposition of this coherence analysis is used to predict how the input stimulus is organized into streams. The model is evaluated by comparing its neural and perceptual predictions under various stimulus conditions to physiological and psychophysical results.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
References
Blake R, Lee SH (2005) The role of temporal structure in human vision. Behav Cogn Neurosci Rev 4:21–42
Bregman AS (1990) Auditory scene analysis. (Cambridge MIT Press), MA
Bregman AS, Campbell J (1971) Primary auditory stream segregation and perception of order in rapid sequences of tones. J Exp Psychol 89:244–249
Carlyon RP, Shamma S (2003) An account of monaural phase sensitivity. J Acoust Soc Am 114:333–348
Chi T, Gao Y, Guyton MC, Ru P, Shamma S (1999) Spectro-temporal modulation transfer functions and speech intelligibility. J Acoust Soc Am 106:2719–2732
Chi T, Ru P, Shamma SA (2005) Multiresolution spectrotemporal analysis of complex sounds. J Acoust Soc Am 118:887–906
Elhilali M, Chi T, Shamma SA (2003) A spectro-temporal modulation index (STMI) for assessment of speech intelligibility. Speech Commun 41:331–348
Elhilali M, Ma L, Micheyl C, Oxenham AJ, Shamma S (2009) Temporal coherence in the perceptual organization and cortical representation of auditory scenes. Neuron 61:317–329
Elhilali M, Shamma SA (2007) The correlative brain: a stream segregation model. In: Kollmeier B, Klump G, Hohmann V, Langemann U, Mauermann M, Uppenkamp S, Verhey J (eds) Hearing: from Sensory processing to perception. Springer, New York
Fishman YI, Arezzo JC, Steinschneider M (2004) Auditory stream segregation in monkey auditory cortex: effects of frequency separation, presentation rate, and tone duration. J Acoust Soc Am 116:1656–1670
Fishman YI, Reser DH, Arezzo JC, Steinschneider M (2001) Neural correlates of auditory stream segregation in primary auditory cortex of the awake monkey. Hear Res 151:167–187
Micheyl C, Shamma S, Elhilali M, Oxenham A (2010) Sequential and simultaneous auditory grouping measured with synchromy detection. In: E.A. Lopez-Poveda, A-R. Palmer, R. Meddis (eds) The neurophysiological baser of auditory perfection. Springer, New York
Micheyl C, Tian B, Carlyon RP, Rauschecker JP (2005) Perceptual organization of tone sequences in the auditory cortex of awake macaques. Neuron 48:139–148
Viemeister NF (1979) Temporal modulation transfer functions based upon modulation thresholds. J Acoust Soc Am 66:1364–1380
Wang K, Shamma SA (1995) Spectral shape analysis in the central auditory system. IEEE Trans Speech Audio Process 3:382–395
Acknowledgments
We thank Pingbo Yin and Stephen David for their assistance with physiological recordings. More details concerning the methods and results of the physiological experiment may be found in Elhilali et al. (2009). This work is supported by grants from the National Institute on Deafness and Other Communication Disorders (R01 DC 07657) and the National Institute on Aging, through the Collaborative Research in Computational Neuroscience program (R01 AG 02757301).
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2010 Springer Science+Business Media, LLC
About this paper
Cite this paper
Elhilali, M., Ma, L., Micheyl, C., Oxenham, A., Shamma, S. (2010). Rate Versus Temporal Code? A Spatio-Temporal Coherence Model of the Cortical Basis of Streaming. In: Lopez-Poveda, E., Palmer, A., Meddis, R. (eds) The Neurophysiological Bases of Auditory Perception. Springer, New York, NY. https://doi.org/10.1007/978-1-4419-5686-6_46
Download citation
DOI: https://doi.org/10.1007/978-1-4419-5686-6_46
Published:
Publisher Name: Springer, New York, NY
Print ISBN: 978-1-4419-5685-9
Online ISBN: 978-1-4419-5686-6
eBook Packages: Biomedical and Life SciencesBiomedical and Life Sciences (R0)