Definition
Auditory stream segregation is a fundamental process that allows us to segregate and focus on distinct sound streams (e.g., voices) in a dynamic noisy environment. Stream segregation has been widely studied with the auditory streaming paradigm, a relatively idealized stimulus for which two sequences of pure tones can be segregated in different streams. Our understanding – still far from complete – of this process is informed by a wealth of behavioral experiments, electrophysiology, brain imaging, and theoretical modelling. This review will highlight modelling efforts that have advanced our understanding of this important dynamical process.
Detailed Description
Auditory scene analysis involves segregating a complex scene into objects or streams (recent reviews: Bendixen (2014) and Snyder and Elhilali (2017)). A valued paradigm involves segregating two interleaved sequences of A tones and B tones repeating in an ABA_ pattern and separable by a perceived difference in pure...
References
Almonte F, Jirsa V, Large E, Tuller B (2005) Integration and segregation in auditory streaming. Physica D 212(1):137–159
Anstis S, Saida S (1985) Adaptation to auditory streaming of frequency-modulated tones. J Exp Psychol Hum Percept Perform 11(3):257–271
Barniv D, Nelken I (2015) Auditory streaming as an online classification process with evidence accumulation. PLoS One 10(12):e0144788
Beauvois M, Meddis R (1991) A computer model of auditory stream segregation. Q J Exp Psychol 43(3):517–541
Bendixen A (2014) Predictability effects in auditory scene analysis: a review. Front Neurosci 8:60
Billig AJ, Davis MH, Carlyon RP (2018) Neural decoding of bistable sounds reveals an effect of intention on perceptual organization. J Neurosci 38(11):2844–2853
Brascamp J, Klink P, Levelt WJ (2015) The ‘laws’ of binocular rivalry: 50 years of levelt’s propositions. Vis Res 109:20–37
Cao R, Pastukhov A, Mattia M, Braun J (2016) Collective activity of many bistable assemblies reproduces characteristic dynamics of multistable perception. J Neurosci 36(26):6957–6972
Costa-Faidella J, Sussman ES, Escera C (2017) Selective entrainment of brain oscillations drives auditory perceptual organization. NeuroImage 159:195–206
Cusimano M, Hewitt L, Tenenbaum JB, McDermott JH (2018) Auditory scene analysis as bayesian inference in sound source models maddie cusimano. In CogSci 2018 Proceedings
Deike S, Heil P, Böckmann-Barthel M, Brechmann A (2012) The build-up of auditory stream segregation: a different perspective. Front Psychol 3(461):1–7
Denham SL, Farkas D, Van Ee R, Taranu M, Kocsis Z, Wimmer M, Carmel D, Winkler I (2018) Similar but separate systems underlie perceptual bistability in vision and audition. Sci Rep 8:7106
Elhilali M, Shamma S (2008) A cocktail party with a cortical twist: how cortical mechanisms contribute to sound segregation. J Acoust Soc Am 124(6):3751–3771
Elhilali M, Ma L, Micheyl C, Oxenham A, Shamma S (2009) Temporal coherence in the perceptual organization and cortical representation of auditory scenes. Neuron 61(2):317–329
Farley BJ, Noreña AJ (2015) Membrane potential dynamics of populations of cortical neurons during auditory streaming. J Neurophysiol 114(4):2418–2430. ISSN 0022-3077, 1522-1598
Fishman Y, Reser D, Arezzo J, Steinschneider M (2001) Neural correlates of auditory stream segregation in primary auditory cortex of the awake monkey. Hear Res 151(1):167–187
Fishman Y, Arezzo J, Steinschneider M (2004) Auditory stream segregation in monkey auditory cortex: effects of frequency separation, presentation rate, and tone duration. J Acoust Soc Am 116(3):1656–1670
Hupé J, Rubin N (2003) The dynamics of bi-stable alternation in ambiguous motion displays: a fresh look at plaids. Vis Res 43(5):531–548. ISSN 0042-6989
Kashino M, Kondo H (2012) Functional brain networks underlying perceptual switching: auditory streaming and verbal transformations. Philos Trans R Soc B 367(1591):977–987
Kondo HM, Pressnitzer D, Shimada Y, Kochiyama T, Kashino M (2018) Inhibition-excitation balance in the parietal cortex modulates volitional control for auditory and visual multistability. Sci Rep 8(1):14548
Krishnan L, Elhilali M, Shamma S (2014) Segregating complex sound sources through temporal coherence. PLoS Comput Biol 10(12):e1003985
Laing C, Chow C (2002) A spiking neuron model for binocular rivalry. J Comput Neurosci 12(1):39–53
Large E, Herrera J, Velasco M (2015) Neural networks for beat perception in musical rhythm. Front Syst Neurosci 9:159
Leopold D, Logothetis N (1999) Multistable phenomena: changing views in perception. Trends Cogn Sci 3(7):254–264
Levelt WJ (1968) On binocular rivalry, vol 2. Mouton, The Hague
Li H-H, Rankin J, Rinzel J, Carrasco M, Heeger D (2017) Attention model of binocular rivalry. Proc Natl Acad Sci U S A (in press)
Long GM, Toppino TC (2004) Enduring interest in perceptual ambiguity: alternating views of reversible figures. Psychol Bull 130(5):748
Ma L (2011) Auditory streaming: behavior, physiology, and modeling PhD thesis
McCabe S, Denham M (1996) A model of auditory streaming. Adv Neural Inf Proces Syst 101(3):1611–1621
Meso AI, Rankin J, Faugeras O, Kornprobst P, Masson GS (2016) The relative contribution of noise and adaptation to competition during tri-stable motion perception. J Vis 16(15):6–6. ISSN 1534-7362
Micheyl C, Tian B, Carlyon R, Rauschecker J (2005) Perceptual organization of tone sequences in the auditory cortex of awake macaques. Neuron 48(1):139–148
Mill R, BÅ‘hm T, Bendixen A, Winkler I, Denham S (2013) Modelling the emergence and dynamics of perceptual organisation in auditory streaming. PLoS Comput Biol 9(3):e1002925
Moreno-Bote R, Shpiro A, Rinzel J, Rubin N (2010) Alternation rate in perceptual bistability is maximal at and symmetric around equi-dominance. J Vis 10(11):1–18. ISSN 1534-7362
Pressnitzer D, Hupé J (2006) Temporal dynamics of auditory and visual bistability reveal common principles of perceptual organization. Curr Biol 16(13):1351–1357
Pressnitzer D, Sayles M, Micheyl C, Winter I (2008) Perceptual organization of sound begins in the auditory periphery. Curr Biol 18(15):1124–1128
Rankin J, Sussman E, Rinzel J (2015) Neuromechanistic model of auditory bistability. PLoS Comput Biol 11(11):e1004555
Rankin J, Osborn Popp P, Rinzel J (2017) Stimulus pauses and perturbations differentially delay or promote the segregation of auditory objects: psychoacoustics and modeling. Front Neurosci 11:198
Rogers W, Bregman A (1993) An experimental evaluation of three theories of auditory stream segregation. Percept Psychophys 53(2):179–189
Shpiro A, Moreno-Bote R, Rubin N, Rinzel J (2009) Balance between noise and adaptation in competition models of perceptual bistability. J Comput Neurosci 27:37–54. ISSN 0929-5313
Snyder JS, Elhilali M (2017) Recent advances in exploring the neural underpinnings of auditory scene perception. Ann N Y Acad Sci 1396:39
Steele S, Tranchina D, Rinzel J (2015) An alternating renewal process describes the buildup of perceptual segregation. Front Comput Neurosci 8(166):1–13
Szabó BT, Denham SL, Winkler I (2016) Computational models of auditory scene analysis: a review. Front Neurosci 10:524
Tal I, Large EW, Rabinovitch E, Wei Y, Schroeder CE, Poeppel D, Golumbic EZ (2017) Neural entrainment to the beat: the missing-pulse phenomenon. J Neurosci 37(26):6331–6341
van Noorden L (1975) Temporal coherence in the perception of tone sequences PhD thesis, Eindhoven University
Wang D, Chang P (2008) An oscillatory correlation model of auditory streaming. Cogn Neurodyn 2(1):7–19
Yates T, Larigaldie N, Beierholm U (2017) A non-parametric bayesian prior for causal inference of auditory streaming. In: bioRxiv. https://www.biorxiv.org/content/early/2017/05/17/139188
Zhang P, Jamison K, Engel S, He B, He S (2011) Binocular rivalry requires visual attention. Neuron 71(2):362–369
Funding
Rankin acknowledges support from an Engineering and Physical Sciences Research Council (EPSRC) New Investigator Award (EP/R03124X/1) and from the EPSRC Centre for Predictive Modelling in Healthcare (EP/N014391/1). This is a review study, and as such did not generate any new data.
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Section Editor information
Rights and permissions
Copyright information
© 2019 Springer Science+Business Media, LLC, part of Springer Nature
About this entry
Cite this entry
Rankin, J., Rinzel, J. (2019). Computational Models of Auditory Stream Segregation. In: Jaeger, D., Jung, R. (eds) Encyclopedia of Computational Neuroscience. Springer, New York, NY. https://doi.org/10.1007/978-1-4614-7320-6_100685-1
Download citation
DOI: https://doi.org/10.1007/978-1-4614-7320-6_100685-1
Received:
Accepted:
Published:
Publisher Name: Springer, New York, NY
Print ISBN: 978-1-4614-7320-6
Online ISBN: 978-1-4614-7320-6
eBook Packages: Springer Reference Biomedicine and Life SciencesReference Module Biomedical and Life Sciences