Frontal cortex electrophysiology in reward- and punishment-related feedback processing during advice-guided decision making: An interleaved EEG-DC stimulation study

Wischnewski, Miles; Bekkering, Harold; Schutter, Dennis J. L. G.

doi:10.3758/s13415-018-0566-8

Open access
Published: 29 January 2018

Volume 18, pages 249–262, (2018)

Cognitive, Affective, & Behavioral Neuroscience

Abstract

During decision making, individuals are prone to rely on external cues such as expert advice when the outcome is not known. However, the electrophysiological correlates associated with outcome uncertainty and the use of expert advice are not completely understood. The feedback-related negativity (FRN), P3a, and P3b are event-related brain potentials (ERPs) linked to dissociable stages of feedback and attentional processing during decision making. Even though these ERPs are influenced by both reward- and punishment-related feedback, it remains unclear how extrinsic information during uncertainty modulates these brain potentials. In this study, the effects of advice cues on decision making were investigated in two separate experiments. In the first experiment, electroencephalography (EEG) was recorded in healthy volunteers during a decision-making task in which the participants received reward or punishment feedback preceded by novice, amateur, or expert advice. The results showed that the P3a component was significantly influenced by the subjective predictive value of an advice cue, whereas the FRN and P3b were unaffected by the advice cues. In the second, sham-controlled experiment, cathodal transcranial direct current stimulation (ctDCS) was administered in conjunction with EEG in order to explore the direct contributions of the frontal cortex to these brain potentials. Results showed no significant change in either advice-following behavior or decision times. However, ctDCS did decrease FRN amplitudes as compared to sham, with no effect on the P3a or P3b. Together, these findings suggest that advice information may act primarily on attention allocation during feedback processing, whereas the electrophysiological correlates of the detection and updating of internal prediction models are not affected.

Decision making is a multifaceted process associated with evaluating and selecting among a finite set of alternatives on the basis of probability and outcome (Lee, 2013). Both implicit and explicit forms of knowledge are used to reduce uncertainty and maximize the likelihood of making the correct choice. An important source of explicit knowledge that guides decision making during uncertainty comes from expert advice. This is advice that is subjectively perceived as a reliable predictor of the desired outcome (Sniezek, Schrah, & Dalal, 2004), which has been further illustrated by results showing that even good decision-makers remain biased toward the opinions of experts (Cook, den Ouden, Heyes, & Cools, 2014; Harvey & Fischer, 1997). The usefulness of advice is determined by its predictive value in terms of rewards and punishments (Bonaccio & Dalal, 2006). Yet how advice affects the cortical processing underlying decision making is still poorly understood.

It is assumed that during feedback processing, an internal prediction model is used to evaluate the current feedback. On the basis of errors in the prediction, this model can be further refined. Electroencephalography (EEG) studies have identified event-related potential (ERP) components that are associated with different stages of feedback processing (Baker & Holroyd, 2011; Cavanagh, Masters, Bath, & Frank, 2014; Enriquez-Geppert, Konrad, Pantev, & Huster, 2010). First, an internal prediction model detects a mismatch between the expected and actual outcomes. The fronto-central midline feedback-related negativity (FRN) is associated with such error detection (Holroyd & Coles, 2002; Ullsperger, Fischer, Nigbur, & Endrass, 2014). Furthermore, several studies have indicated that the FRN is affected by the valence and magnitude of rewards, as well as by the context in which the rewards are presented (Bellebaum, Polezzi, & Daum, 2010; Holroyd, Larsen, & Cohen, 2004; Wu & Zhou, 2009).

Source localization studies have indicated that the neural generator of the FRN lies in the anterior cingulate cortex (ACC; Bocquillon et al., 2014; Hauser et al., 2014; Ullsperger et al., 2014). In addition, functional magnetic resonance imaging (fMRI) studies have supported the importance of the ACC in reward and punishment processing, together with the orbitofrontal cortex (OFC) and ventral striatum (Beckmann, Johansen-Berg, & Rushworth, 2009; Rogers et al., 2004). There is increasing evidence that the neural activity in these regions during reward processing is modulated by the presence of expert information (Engelmann, Capra, Noussair, & Berns, 2009; Engelmann, Moore, Capra, & Berns, 2012; Meshi, Biele, Korn, & Heekeren, 2012; Tomlin, Nedic, Prentice, Holmes, & Cohen, 2013). Hemodynamic activity in the OFC has been shown to increase when advice is more likely to change one’s initial opinion in favor of following the expert’s opinion (Meshi et al., 2012). Furthermore, the OFC has been shown to reflect the subjective value of rewards and external information (Padoa-Schioppa & Cai, 2011; Peters & Büchel, 2010). These results indicate that the OFC is involved in processing the subjective valuation of advice cues, in which seemingly more informative cues are associated with increased OFC activity and, hence, increased following behavior (Meshi et al., 2012). In contrast, Suen, Brown, Morck, and Silverstone (2014) showed increased ACC activity when financially disadvantageous expert advice was opposed, providing a possible way to override following advice. Therefore, whether advice is followed may depend on a balance between the urge to follow experts, mediated by OFC activity, and the actual benefits of following advice, mediated by ACC activity. In addition to cortical structures, this balance determining the subjective value of a cue has also been associated with activity in the ventral striatum (Meshi et al., 2012).

Following the detection of a mismatch between the expected and actual outcomes, the prediction model is updated to make the model more accurate for future feedback. A parietal positive deflection that can be observed between 300 and 600 ms (P300) after reward- and punishment-related feedback is associated with these processes (Goldstein et al., 2006). The P300 component is associated with attention allocation and consists of two subcomponents, the P3a and P3b (Polich, 2007). It has been proposed that the P3a reflects a process of novelty detection (Polich, 2007). Studies have indicated that the amplitude of the P3a is influenced by the expectedness of feedback (Donchin & Coles, 1988; Donchin, Ritter, & McCallum, 1978), as well as by the magnitude of the reward or punishment (Sato et al., 2005; Wu & Zhou, 2009). On the basis of these observations, the allocation of attention toward relevant information accompanies a higher P3a amplitude (Polich, 2007). In accordance, neuroimaging studies have related the P300 to the fronto-parietal attention network (Bengson, Kelley, & Mangun, 2015; Pfabigan et al., 2014). Fronto-parietal network activity, it is proposed, is directly influenced by the amount of uncertainty during decision making (Kopp et al., 2016; Paulus et al., 2001). Expert advice can increase confidence in a decision, especially during uncertain situations (Bonaccio & Dalal, 2006). However, it is unknown whether the predictive value of advice modulates attention allocation, and consequently P3a amplitudes, during reward and punishment feedback. The P3b component has been suggested to underlie the adaptation of behavior in subsequent trials by means of updating one’s internal prediction model on the basis of current reward or punishment feedback (Fischer & Ullsperger, 2013; Polich, 2007). Studies have indicated that changes in behavior associated with learning and optimizing decision making are paralleled by larger P3b amplitudes (Chase, Swainson, Durham, Benham, & Cools, 2011; Fischer & Ullsperger, 2013).

The FRN, P3a, and P3b are thus thought to reflect different aspects underlying feedback processing. fMRI studies have provided evidence that activity in the networks associated with these ERPs is influenced by advice during decision making (Meshi et al., 2012). However, the extent to which advice actually affects these electrophysiological components of feedback processing remains unclear. Although some studies have investigated the ERP components directly related to the presentation of an advice cue (Chen, Wu, Tong, Guan, & Zhou, 2012; Kim, Liss, Rao, Singer, & Compton, 2012; Kimura & Katayama, 2013; Shestakova et al., 2013; Trautmann-Lengsfeld & Herrmann, 2013; Yu & Sun, 2013), to date no study has investigated the effects of advice cues on the ERP components of subsequent feedback processing. If people are to accurately use advice during decision making, the advice needs to be validated and its usefulness (that is, whether the advice is predictive of subsequent gains or losses) needs to be determined. It is therefore possible that feedback- and attention-related electro-cortical components are susceptible to advice cues.

The present study consisted of two experiments aimed at delineating the relation between advice cues and the processes related to feedback and attention. In the first experiment, the effects of advice cues on performance and ERPs were investigated during a forced choice reward–punishment task. We hypothesized that expert cues would be perceived as more predictive than nonexpert cues. Furthermore, we expected that this difference in perceived predictiveness would be reflected in the subsequent feedback-processing ERP components. Specifically, feedback following more predictive cues would be of more importance for making successful choices. Therefore, we hypothesized that advice cues viewed as being more predictive would increase mismatch detection and direct attention toward the feedback, as revealed by larger FRN and P300 amplitudes.

In the second experiment, cathodal transcranial direct current stimulation (ctDCS) targeting the OFC was applied during the forced choice reward–punishment task, in order to directly manipulate decision making on the basis of advice cues and the associated feedback-related brain potentials. Transcranial direct current stimulation (tDCS) has been shown to be effective in modulating both cortical physiological activity (Nitsche et al., 2003; Nitsche & Paulus, 2000, 2001) and cognitive performance (for a review, see Kuo & Nitsche, 2015). Because the OFC is related to processing of the subjective valuation of advice cues (Meshi et al., 2012), we hypothesized that ctDCS-related interference with OFC activity would decrease the subjective biases toward the advice cues. As a result, the percentage following of advice cues in this case would be closer to chance level. Additionally, we explored the effects of online ctDCS on feedback-related ERPs. Because our hypothesis in the first experiment had stated that increased FRN and P300 amplitudes are related to increased advice following, we expected to find a decrease in FRN and P300 amplitudes during ctDCS.

Experiment 1

Materials and method

Participants

Twenty-one right-handed participants with (corrected-to-)normal vision and no history of neurological or psychiatric disorders participated in Experiment 1 (12 females, nine males; mean age ± SD: 22.67 ± 3.18 years). The study protocol was approved by the Committee on Research Involving Human Subjects of the Radboud University Medical Centre.

Experimental design and procedure

In each trial of the decision-making task, two neutral objects of the same type (vases) were presented on a black screen (22-in., 30 × 48 cm; resolution: 1,024 × 768), and participants were asked to indicate which was more expensive. The participants were placed approximately 80 cm from the screen, and the objects (resolution 350 × 250 pixels) were presented 5 cm to the left and right of the screen’s center point, on a white background. In every trial different vases were shown (a total of 240 vases), so as to prevent learning effects. After the objects had appeared, one of three advice cues was randomly presented, indicated by a red frame (1-cm width) surrounding the picture. Participants were informed that the advice cue represented the choice of a group of participants from a previous study. Three types of cues were used (“novices,” “amateurs,” and “experts”), with the labels being shown above the red frame. The level of expertise was manipulated by informing participants that the experts had attained high scores on this task, whereas the novices had low scores. However, participants were free to use any decision strategy. Although advice information purposely implied that following the expert cues was better than following the novice cues, in reality the predictive value of each cue was at chance level. The objects were shown for a maximum of 2,500 ms, and the cue appeared after the first 500 ms. This means that the participants had a maximum of 2,000 ms to make a decision by pressing either the left or the right button with their index finger (Fig. 1). Subsequently, the participants received points that reflected monetary rewards. The points ranged from – 40 (punishment) to + 50 (reward) in steps of 10. When the participants did not respond within 2,000 ms, a message with the text “faster” appeared. The outline of a single trial is shown in Fig. 1. To prevent participants from realizing that the advice cue information was random, they were informed that the points that they received were relative to the points they would have received if they had opted for the alternative. Since the number of points for this alternative was not presented, participants were kept uncertain about the actual correctness of their choice. A total of 120 trials were presented, with 40 trials for each advice cue and an intertrial interval of 100–1,000 ms. Data from the behavioral task were stored for offline analysis using the Presentation software (Neurobehavioral Systems, Berkeley, CA, USA).
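The trial structure described above can be sketched in a few lines of Python. This is only an illustration, not the authors' Presentation script: the function name `build_trials`, the dictionary keys, the uniform sampling of the intertrial interval, and the random choice of the advised side are all assumptions made for the example. Feedback points are left out because the text does not state how they were distributed.

```python
import random

CUES = ("novice", "amateur", "expert")
TRIALS_PER_CUE = 40            # 3 cues x 40 trials = 120 trials
CUE_ONSET_MS = 500             # cue appears 500 ms after the vases
STIMULUS_MAX_MS = 2500         # vases stay on screen for at most 2,500 ms
RESPONSE_WINDOW_MS = STIMULUS_MAX_MS - CUE_ONSET_MS  # 2,000 ms to respond


def build_trials(seed=0):
    """Return a shuffled list of 120 trials (hypothetical layout)."""
    rng = random.Random(seed)
    trials = [{"cue": cue} for cue in CUES for _ in range(TRIALS_PER_CUE)]
    rng.shuffle(trials)
    for trial in trials:
        # Assumption: the advised vase is picked at random, which keeps the
        # cue at chance level with respect to the feedback.
        trial["advised_side"] = rng.choice(("left", "right"))
        # Assumption: intertrial interval drawn uniformly from 100-1,000 ms.
        trial["iti_ms"] = rng.randint(100, 1000)
    return trials


trials = build_trials(seed=1)
```

Fixing the seed makes the trial order reproducible, which is convenient when checking the design; a real session would presumably use a different seed per participant.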

To study the underlying ERP components of advice information processing, EEG measurements were performed during the task. After participants had been informed about the task, EEG electrodes were placed. Then participants performed the decision-making task for approximately 10 min, after which the electrodes were removed and participants were debriefed.

Electroencephalography recording

EEG was recorded continuously during the task using an online 0.1- to 70-Hz band-pass filter with a sampling rate of 1000 Hz on a passive 64-channel EASYCAP with a transcranial magnetic stimulation multitrode system (EASYCAP GmbH, Herrsching, Germany). Recordings were made from a selection of 13 resin-covered sintered Ag/AgCl electrodes (F3, F1, Fz, F2, F4, P3, Pz, P4, T7, T8, O1, Oz, O2), shown in Fig. 1. The reference electrode was positioned on the left mastoid, and the ground electrode was placed at POz. Furthermore, a vertical electro-oculogram was recorded from electrodes above and below the left eye, and a horizontal electro-oculogram was recorded from electrodes at the outer canthi of both eyes. Raw EEG data were recorded and stored for offline analysis using BrainVision Analyzer 2.0 (Brain Products GmbH, München, Germany).

Data reduction and processing

Advice information processing

For each participant, the percentages of following the novice, amateur, and expert cues were calculated. To determine how informative each cue was experienced to be by the participants, the subjective predictive value (SPV) was calculated. This value is the absolute difference between the percentage following and chance level (50%). Therefore, the SPV is a value between 0 and 50, with larger numbers representing higher subjective information value. For example, if a cue is followed on either 100% or 0% of trials, the SPV would be 50. This would mean that these cues were highly informative, since participants either followed or opposed the cue at all times, whereas an SPV of 0 would imply that the cue was uninformative to the participant, and thus was followed at chance level.
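As a minimal sketch, the SPV definition above reduces to a single absolute difference. The function name and the follow rates below are hypothetical and serve only to illustrate the arithmetic.

```python
def subjective_predictive_value(percent_followed, chance=50.0):
    """Absolute difference between the percentage following and chance level."""
    if not 0.0 <= percent_followed <= 100.0:
        raise ValueError("percent_followed must lie between 0 and 100")
    return abs(percent_followed - chance)


# Hypothetical follow rates (in %) for one participant; illustrative only.
follow_rates = {"novice": 30.0, "amateur": 45.0, "expert": 80.0}
spv = {cue: subjective_predictive_value(rate) for cue, rate in follow_rates.items()}
# spv == {"novice": 20.0, "amateur": 5.0, "expert": 30.0}
```

Note that a cue followed on 30% of trials and a cue followed on 70% of trials both yield an SPV of 20: the measure captures how strongly the cue steered choices, not the direction.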

ERP data analysis

All EEG recordings were offline band-pass filtered between 1.5 and 30 Hz (48 dB/Octave) and referenced to the mastoid. ERPs of 1,000 ms were segmented time-locked to the moment at which the participant received feedback immediately following the decision. Epochs were baseline-corrected using the – 100-ms to 0-ms window. Ocular artifacts were controlled using the Gratton and Coles algorithm (Gratton, Coles, & Donchin, 1983). Additional artifacts were removed automatically if the peak-to-peak differences exceeded 100 μV or were below 0.5 μV, followed by visual inspection of the data. The remaining epochs were averaged separately for (1) reward versus punishment and (2) novice versus amateur versus expert trials. All ERP results were analyzed at the Pz electrode. This electrode was chosen in order to compare the findings with experiments in which only the Pz electrode was investigated (see the ERP Data Analysis section of Exp. 2 and Supplementary Table 1). The ERPs recorded at the other channels are displayed in Supplementary Figs. S1 and S2.
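The baseline correction, automatic artifact rejection, and averaging steps can be sketched as follows. This is not the BrainVision Analyzer pipeline used in the study: the function names are hypothetical, filtering and ocular correction are omitted, and two assumptions are made for illustration, namely that each epoch is a list of amplitudes in μV whose first `n_baseline` samples cover the – 100-ms to 0-ms window (100 samples at 1,000 Hz), and that the peak-to-peak criterion is evaluated over the whole epoch.

```python
def baseline_correct(epoch, n_baseline):
    """Subtract the mean of the first n_baseline samples from every sample."""
    baseline = sum(epoch[:n_baseline]) / n_baseline
    return [value - baseline for value in epoch]


def is_artifact(epoch, max_p2p=100.0, min_p2p=0.5):
    """Flag epochs whose peak-to-peak range (in microvolts) is too large or too flat."""
    peak_to_peak = max(epoch) - min(epoch)
    return peak_to_peak > max_p2p or peak_to_peak < min_p2p


def average_erp(epochs, n_baseline=100):
    """Baseline-correct, drop flagged epochs, and average the rest sample by sample."""
    corrected = [baseline_correct(epoch, n_baseline) for epoch in epochs]
    kept = [epoch for epoch in corrected if not is_artifact(epoch)]
    if not kept:
        raise ValueError("no epochs survived artifact rejection")
    erp = [sum(samples) / len(kept) for samples in zip(*kept)]
    return erp, len(kept)
```

In practice the function would be called once per condition (for example, once for reward epochs and once for punishment epochs) to obtain the condition averages that are compared statistically.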

Statistical analysis

Expert information processing was measured by the number of advice cues followed during the decision-making task. Generalized linear model (GLM) repeated measures analyses of variance (ANOVAs), with the dependent variables percentage following cues and reaction time, were used to investigate how the advice cues influenced decision making. Significant results were followed by Bonferroni-corrected paired-samples t tests. Additionally, in a post hoc test, the SPV values were tested against a test value of 0 (indicating no predictive value).

ERP components were investigated using GLM repeated measures ANOVAs in the time windows of the FRN (200–300 ms; Gentsch, Ullsperger, & Ullsperger, 2009), the P3a (350–400 ms; Muller-Gass, Macdonald, Schröger, Sculthorpe, & Campbell, 2007), and the P3b (450–600 ms; Calvo & Beltrán, 2014). Significant results were followed by an exploratory analysis in which the time course of a significant difference was examined by plotting the p value every 4 ms within the investigated time window.
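To illustrate how the three time windows map onto samples of an averaged ERP, the sketch below extracts one value per window. The text does not say whether mean or peak amplitudes entered the ANOVAs, so the use of the window mean here is an assumption, as are the function name, the inclusive window boundaries, and an epoch that starts 100 ms before feedback and is sampled at 1,000 Hz.

```python
WINDOWS_MS = {"FRN": (200, 300), "P3a": (350, 400), "P3b": (450, 600)}


def window_mean(erp, start_ms, end_ms, epoch_start_ms=-100, sfreq_hz=1000):
    """Mean amplitude of erp between start_ms and end_ms (inclusive)."""

    def to_index(time_ms):
        # Convert a latency relative to feedback onset into a sample index.
        return round((time_ms - epoch_start_ms) * sfreq_hz / 1000)

    segment = erp[to_index(start_ms):to_index(end_ms) + 1]
    if not segment:
        raise ValueError("time window lies outside the epoch")
    return sum(segment) / len(segment)


# Toy ERP whose amplitude equals its latency in ms, so each window mean
# is simply the midpoint of the window.
toy_erp = [float(t) for t in range(-100, 900)]
amplitudes = {name: window_mean(toy_erp, *win) for name, win in WINDOWS_MS.items()}
# amplitudes == {"FRN": 250.0, "P3a": 375.0, "P3b": 525.0}
```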

All statistical analyses were performed using IBM SPSS 22.0, and the statistical level of significance was set to α = .05 (two-tailed). All analyses were checked for normality, and Mauchly’s test was used to examine the assumption of sphericity. A Greenhouse–Geisser correction was used if this assumption was violated. All averages are represented as means ± SEMs.

Results

Advice cue processing

Participants chose the left vase in 50.59 ± 1.16% of trials, and the right vase in 48.89 ± 1.09% of trials, which did not differ significantly [t(20) = 0.76, p = .45], and responses occurred too late in 0.51 ± 0.30% of trials. A significant difference was observed between percentages following the three advice cues [F(2, 40) = 31.91, p < .001]. On average, participants followed novices in 33.41 ± 4.23% of trials, amateurs in 37.80 ± 3.26%, and experts in 76.10 ± 4.22%. Bonferroni-corrected pairwise comparisons revealed that experts were followed significantly more than both novices [t(20) = 6.11, p < .001] and amateurs [t(20) = 6.31, p < .001], but the percentages following novices and amateurs were not different (Fig. 2a). Furthermore, we tested whether the SPVs (that is, the absolute differences from chance level, 50%) were significantly greater than zero. Accordingly, Bonferroni-corrected one-sample t tests were performed, and the SPVs of all cues [novice: SPV = 16.59, t(20) = 3.82, p = .003; amateur: SPV = 12.20, t(20) = 3.75, p = .003; expert: SPV = 26.10, t(20) = 6.18, p < .001] were significantly different from chance level (Fig. 2a). Expert cues were subjectively perceived as being most predictive, whereas amateur cues were seen as least predictive. Interestingly, the SPVs were significantly positively correlated between all three conditions (novice–amateur, r = .507, p = .019; novice–expert, r = .529, p = .014; amateur–expert, r = .508, p = .019). This suggests that participants who relied more on advice information in one condition also relied more on advice information in another condition.

Decision times differed significantly between the advice cue categories, as well [F(2, 20) = 19.00, p < .001]. Bonferroni-corrected pairwise comparisons revealed that during expert trials decisions were made significantly faster than during either novice trials [t(20) = 7.02, p < .001] or amateur trials [t(20) = 4.60, p = .001]. No difference was observed between novice and amateur trials [t(20) = 0.08, p = .936] (Fig. 2b). Furthermore, no correlation between the averaged decision times and averaged SPVs was found (r = – .186, p = .420).

Reward and punishment ERPs

A significant difference for the P3b component was observed between reward and punishment trials [Pz, F(1, 20) = 5.99, p = .024; Figs. 3a, b], specifically in a time window of 492–588 ms (Fig. 3c). No significant difference between reward and punishment trials was observed for the FRN [Pz, F(1, 20) = 0.01, p = .965] or the P3a [Pz, F(1, 20) = 0.01, p = .934]. Similar results were observed for electrodes P3 and P4, but no differences between reward and punishment trials were found in other electrodes (Supplementary Fig. S1).

Advice cue ERPs

Analysis of ERPs segmented for different advice cues revealed a trend toward significance at the P3a component [F(2, 40) = 3.30, p = .047; Fig. 3d]. Indeed, analysis of the time points within the P3a window revealed a significant effect between 366 and 386 ms (Fig. 3e). Bonferroni-corrected post hoc comparisons revealed significantly larger P3a amplitudes in response to expert than to amateur cues [t(20) = 2.73, p = .039; Fig. 3f]. The novice–expert and novice–amateur differences in ERP signals did not reach significance. The modulation of the P3a component concurred with the differences in SPVs (Fig. 3f). No effect of advice cues was found on the FRN [F(2, 40) = 2.43, p = .101] or the P3b [F(2, 40) = 0.20, p = .822].

Experiment 2

Indeed, Reinhart and Woodman (2014) have shown that FRN amplitudes are decreased and error positivities are increased after offline cathodal tDCS (ctDCS), and that both are associated with reduced behavioral adjustments after errors. Here we explored the effects of online ctDCS on feedback-related ERPs. On the basis of Experiment 1, we therefore expected P3a amplitudes to be decreased. Furthermore, in accordance with the findings of Reinhart and Woodman, a decrease in FRN amplitudes was also expected.