Bottom-up driven involuntary attention modulates auditory signal in noise processing
- 3.6k Downloads
Auditory evoked responses can be modulated by both the sequencing and the signal-to-noise ratio of auditory stimuli. Constant sequencing as well as intense masking sounds basically lead to N1m response amplitude reduction. However, the interaction between these two factors has not been investigated so far. Here, we presented subjects tone stimuli of different frequencies, which were either concatenated in blocks of constant frequency or in blocks of randomly changing frequencies. The tones were presented either in silence or together with broad-band noises of varying levels.
In silence, tones presented with random sequencing elicited a larger N1m response than tones presented with constant sequencing. With increasing noise level, this difference decreased and even vanished in the condition where noise intensity exceeded the tone intensity by 10 dB. Furthermore, under noisy conditions, the N1m latency was shorter in the constant sequencing condition compared to the random sequencing condition.
Besides the well-known neural habituation mechanisms, bottom-up driven attention plays an important role during auditory processing in noisy environments. This bottom-up driven attention would allow us to track a certain auditory signal in noisy situations without voluntarily paying attention to the auditory modality.
KeywordsTest Stimulus Auditory Cortex Noise Condition Sequencing Condition Source Strength
The ability of humans to disentangle and to perceptually isolate a single sound from multiple simultaneously present irrelevant sounds ("noise") is vitally important. From an evolutionary perspective, this ability might have helped noticing and spotting predators sneaking up in the midst of wind rustling in the trees or against a background of heavy rain. Of course it also plays an important role in today's everyday life (e.g., to be warned of an approaching vehicle in a diffuse traffic noise setting). The segregational and integrational mechanisms at work during this auditory scene analysis  are based upon the physical features of the sounds (such as spectrum, intensity, phase, etc.) coming from distinct or identical sources.
Besides these features neural activity in the auditory cortex is also affected by the sequencing of sounds in time. Repeated applications of a stimulus can decrease the corresponding neural activity . This phenomenon, called habituation, appears on several time scales [3, 4], at several stages of the auditory system [5, 6], and is reflected in auditory evoked potentials [7, 8, 9] and auditory evoked fields . Budd and colleagues  note that N1 response  decrements can stem from different mechanisms, habituation as well as refractoriness. A further term used to describe the phenomenon of neural response decline during sensory stimulation is adaptation [4, 12]. This term refers to a different possible neural mechanism; however, it would as well result in a decrement of neural activity after repetition of identical stimuli.
In a recent experiment, Okamoto et al. ,  presented subjects with tonal stimuli of different frequencies embedded in band-eliminated noises and measured their magnetoencephalographic (MEG) activity. They found that the tonal stimuli of randomly changing frequencies elicited a smaller N1m response (the magnetic counterpart of the electrical N1 response) than stimuli of identical frequency. At first glance, the relatively stronger neural activity elicited by identical auditory stimuli was contrary to what one would have expected considering the habituation effect. The authors hypothesized that in a noisy environment bottom-up and/or top-down driven attention could have compensated for the habituation effect. However, the use of the band-eliminated noises entailed that the tonal stimuli were centred in a silent band within background noise. This led to a different pre-stimulus noise exposure depending on the sequencing condition. In the constant sequencing condition the band-eliminated noises never masked the neural population in the frequency range of the tone stimuli. In the random sequencing condition on the other hand, the frequency region of the respective tonal stimulus had always been covered shortly beforehand by the noise from the preceding stimulation. This might have caused different short- and long-term habituation effects on the neural groups corresponding to the frequencies within and outside of the eliminated bands. Additionally, the authors always used noises of identical intensity. Therefore, the question still remains whether in noisy environments of different level under non-attentive listening conditions the neural activity in human auditory cortex can be enhanced by constant sound signal sequencing. With the present study, we attempted to address this general question and to investigate interactive effects of stimulus sequencing and masking noise level in the auditory cortex. We presented subjects amplitude modulated pure-tone stimuli differing in sequencing (constant vs. random); furthermore, the level of simultaneously presented masking noise (no noise vs. medium noise level vs. high noise level) was varied. We adopted amplitude modulated tones in order to elicit both, the auditory steady state as well as the N1m response . To cause similar masking and habituation effects on the different frequency test tone stimuli, we used broad-band noise.
17 healthy subjects (10 females, age range 21-30 years old) participated in the study. All subjects were distinctly right handed (assessed with the Edinburgh Handedness Inventory ) and had normal hearing. All subjects were fully informed about the study and gave written informed consent for their participation in accordance with procedures approved by the Ethics Commission of the Medical Faculty, University of Münster. The study thus conforms to The Code of Ethics of the World Medical Association (Declaration of Helsinki).
Stimuli and experimental design
We used Presentation (Neurobehavioral Systems, Albany, CA, United States) to control the timing of sound presentation, and SRM-212 electrostatic earphones (Stax, Saitama, Japan) to transduce sound stimuli. All sounds were delivered diotically through silicon tubes (length: 60 cm; inner diameter: 5 mm) and silicon earpieces adjusted to fit into each individual ear.
Before starting the MEG acquisition, each subject's hearing threshold for the 1000 Hz carrier frequency test stimulus (TS) was measured for each ear. During the MEG session, the stimuli were presented at an intensity of 40 dB above this individual sensation level.
Data acquisition and analysis
The auditory evoked fields were measured with a whole-head 275 channels MEG system (Omega; CTF Systems, Coquitlam, British Columbia, Canada) in a magnetically shielded and acoustically silent room. Subjects were instructed not to move their head position during the MEG measurement and monitored by video camera. The MEG data was recorded with a sampling rate of 600 Hz. The magnetic fields evoked by TS were averaged selectively for each noise and sequencing condition (irrespective of the carrier frequency), starting 0.35 s prior to TS-onset, and ending 1 s after TS-onset. Epochs containing field changes larger than 3 pT were rejected as artefacts. The overall percentage of rejected trials was less than 10%, with no significant difference between conditions.
For the analysis of the N1m response, which is the major deflection of the slow auditory evoked field , the averaged evoked fields of all conditions were 30 Hz low-pass filtered, and the baseline was corrected relatively to a pre-stimulus interval of 0.3 s. Initially, the maximal N1m response was identified at the time point of maximal root-mean square value of the global field power around 0.1 s after TS-onset. The N1m source locations and orientations were estimated by an equivalent current dipole model (one dipole in each hemisphere) for each subject individually. A 0.01 s interval around the N1m peak in the grand-averaged data of all conditions of each subject was used to estimate the equivalent dipolar current sources. Source estimations with insufficient goodness-of-fit (smaller than 95%) were excluded from further analysis, reducing the number of subjects from 17 to 15 (8 females, age range 21-30 years old). The estimated source was fixed in its location and orientation for each hemisphere of each subject as a spatial filter  to calculate source strength for each noise condition and each stimulus sequencing condition ('constant sequencing' and 'random sequencing'), irrespective of the TS carrier frequency. The maximal source strength in each noise and sequencing condition in the time range between 0.09 and 0.3 s was used for further statistical analysis of the N1m. Unfortunately, the auditory steady state responses  suffered from the low signal-to-noise ratio, possibly due to the masking sounds and the different carrier frequency TS. Therefore the auditory steady state responses were not analysed in the present study.
The maximum source strengths and latencies of the N1m responses elicited by the TS for each condition were analysed separately via planned comparisons, post hoc tests and repeated-measures analysis of variance (ANOVA) using two factors: SEQUENCING (constant and random) and NOISE_LEVEL (no noise, +/-0 dB, + 10 dB).
In the present study, subjects were exposed either to test stimuli (TS) presented in silence, or to TS embedded in broad-band noise. We found that the N1m source strength decreased with increasing noise level, as previously reported . Additionally, we found that the noise level had different impacts on the constant sequencing and the random sequencing conditions. In the no noise condition, the source strength of the N1m evoked component was larger during random sequencing than during constant sequencing. However, this pattern changed when noise was added, yielding an interaction between the factors SEQUENCING and NOISE_LEVEL. This interaction shows that the processing in the auditory path and ultimately in the auditory cortex was differentially influenced by the acoustic environment depending on the different sequencing conditions.
The significant interaction between SEQUENCING and NOISE_LEVEL may arise from masking effects of simultaneously presented broad-band noise on the neural activity corresponding to the TS. In the silent surrounding, the TS presented in constant sequencing elicited smaller N1m responses compared to random sequencing because the identical neural population was repeatedly activated and therefore habituated. In the noisy surrounding, however, all auditory neurons were constantly stimulated by the broad-band noise - regardless of the sequencing order of the TS. The noise thus might have "evened out" the habituation effect of the TS. The higher source strength for constant stimulation in noise that was reported in earlier studies [13, 14] was probably caused by differential masking effects of the band-eliminated noises for constant and random sequencing.
While habituation might account for some of the interaction between SEQUENCING and NOISE_LEVEL regarding the N1m source strength, it alone cannot explain the latency differences between CONSTANT and RANDOM sequencing in the +10 dB noise conditions. In previous behavioural studies [20, 21] it was shown that signal in noise detection is facilitated by cuing the frequency of the tone to be detected compared to no cuing. In those studies, cuing was most effective when cue and target were identical. Comparable results were also found by Okamoto and colleagues , who reported faster reaction times and lower error rates in a detection task using tone stimuli overlaid with band-eliminated noise presented in constant sequencing (i.e. cued) compared to randomly sequenced stimuli (i.e. not cued). Additionally, they found stronger N1m activity and shorter latencies for the constant condition than for the random condition. These results suggest that auditory focused attention can increase the amplitude and shorten the latency of the N1m cortical source corresponding to the task relevant auditory signal. In our study, however, the auditory stimuli were not only task irrelevant, but subjects even had to focus their attention to another, the visual domain. With the subjects' attention not directed to the auditory domain, similar effects have been shown using band-eliminated noises and tones . In this MEG experiment, the frequency of the tonal stimulus was at the centre frequency of the eliminated frequency band, and the stimuli were presented either in a constant or in a random fashion. As in our present study, the constantly presented stimuli yielded shorter latencies. The latency difference between the constant and random sequencing conditions in the present study might reflect the amount of time needed to involuntarily allocate attentional resources to the neural population tuned to the TS carrier frequency. To explain our results regarding latency differences between sequencing conditions merely with mechanisms of cuing would not suffice though, because then one would as well expect shorter latencies in the no-noise condition. We attribute the results of our present experiment to involuntary bottom-up driven neural mechanisms involved in the automatic tracking of auditory sequences , and we consider our finding to be comparable to simultaneous sound segregation phenomena described in earlier studies. During constant sequencing in broad-band noise, the sequence of TS of the same frequency is clearly separated from the noisy background. In the random sequencing condition, the series of TS is not separated from the noise as clearly as for the constant sequenced stimuli. The series of stimuli with constant frequency could be tracked more easily and the single entities were detected earlier. In the no noise condition, however, this constant tracking was not necessary since detection of the TS was very easy for both, the constant and the random sequencing condition. This seems to be the reason why no significant latency difference between constant and random sequencing in the silent condition was observed.
The present data demonstrate that besides well-known neural response declining phenomena such as refractoriness, stimulus specific adaptation, or habituation another mechanism, elicited by constant signal sequencing, plays an important role in auditory signal in noise processing under distracted attentional conditions. This bottom-up driven involuntary neural mechanism may appropriately and automatically adjust the distribution of processing resources based on the surrounding noise level, resulting in better auditory performance in noisy environments.
We would like to thank Karin Berning, Hildegard Deitermann, and Ute Trompeter for helping to acquire the data, as well as Andreas Wollbrink for technical support and Maximilian Bruchmann for statistical advice.
This work was supported by the "Tinnitus Research Initiative (TRI)", and the "Deutsche Forschungsgemeinschaft (Pa 392/13-1, Pa 392/10-3)"
- 1.Bregman AS: Auditory scene analysis: the perceptual organization of sound. 1990, Cambridge, Mass.: MIT PressGoogle Scholar
This article is published under license to BioMed Central Ltd. This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/2.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.