Abstract
Event-related potentials (ERPs) show promise to be objective indicators of cognitive functioning. The aim of the study was to examine if ERPs recorded during an oddball task would predict cognitive functioning and information processing speed in Multiple Sclerosis (MS) patients and controls at the individual level. Seventy-eight participants (35 MS patients, 43 healthy age-matched controls) completed visual and auditory 2- and 3-stimulus oddball tasks with 128-channel EEG, and a neuropsychological battery, at baseline (month 0) and at Months 13 and 26. ERPs from 0 to 700 ms and across the whole scalp were transformed into 1728 individual spatio-temporal datapoints per participant. A machine learning method that included penalized linear regression used the entire spatio-temporal ERP to predict composite scores of both cognitive functioning and processing speed at baseline (month 0), and months 13 and 26. The results showed ERPs during the visual oddball tasks could predict cognitive functioning and information processing speed at baseline and a year later in a sample of MS patients and healthy controls. In contrast, ERPs during auditory tasks were not predictive of cognitive performance. These objective neurophysiological indicators of cognitive functioning and processing speed, and machine learning methods that can interrogate high-dimensional data, show promise in outcome prediction.
Similar content being viewed by others
References
Amato MP, Ponziani G, Pracucci G, Bracco L, Siracusa G, Amaducci L (1995) Cognitive impairment in early-onset multiple sclerosis. Pattern, predictors, and impact on everyday life in a 4-year follow-up. Arch Neurol 52(2):168–172
Amato MP, Ponziani G, Siracusa G, Sorbi S (2001) Cognitive dysfunction in early-onset multiple sclerosis: a reappraisal after 10 years. Arch Neurol 58(10):1602–1606
Amato MP, Zipoli V, Portaccio E (2006) Multiple sclerosis-related cognitive changes: a review of cross-sectional and longitudinal studies. J Neurol Sci 245(1–2):41–46. https://doi.org/10.1016/j.jns.2005.08.019
Amato MP, Razzolini L, Goretti B, Stromillo ML, Rossi F, Giorgio A et al (2013) Cognitive reserve and cortical atrophy in multiple sclerosis: a longitudinal study. Neurology 80(19):1728–1733. https://doi.org/10.1212/WNL.0b013e3182918c6f
Azcarraga-Guirola E, Rodriguez-Agudelo Y, Velazquez-Cardoso J, Rito-Garcia Y, Solis-Vivanco R (2017) Electrophysiological correlates of decision making impairment in multiple sclerosis. Eur J Neurosc 45(2):321–329. https://doi.org/10.1111/ejn.13465
Bagnato F, Salman Z, Kane R, Auh S, Cantor FK, Ehrmantraut M et al (2010) T1 cortical hypointensities and their association with cognitive disability in multiple sclerosis. Mult Scler 16(10):1203–1212. https://doi.org/10.1177/1352458510377223
Barker-Collo SL (2005) Within session practice effects on the PASAT in clients with multiple sclerosis. Arch Clin Neuropsychol 20(2):145–152. https://doi.org/10.1016/j.acn.2004.03.007
Beck AT, Epstein N, Brown G, Steer RA (1988) An inventory for measuring clinical anxiety: psychometric properties. J Consult Clin Psychol 56(6):893–897
Beck AT, Steer RA, Brown GK (1996) Beck depression inventory-II: manual. Psychological Corporation, San Antonio
Benedict RH, Zivadinov R (2011) Risk factors for and management of cognitive dysfunction in multiple sclerosis. Nat Rev Neurol 7(6):332–342. https://doi.org/10.1038/nrneurol.2011.61
Benedict RHB, Schretlen D, Groninger L, Dobraski M, Shpritz B (1996) Revision of the brief visuospatial memory test: studies of normal performance, reliability, and validity. Psychol Assess 8(2):145–153
Benedict RH, Fischer JS, Archibald CJ, Arnett PA, Beatty WW, Bobholz J et al (2002) Minimal neuropsychological assessment of MS patients: a consensus approach. Clin Neuropsychol 16(3):381–397. https://doi.org/10.1076/clin.16.3.381.13859
Benedict RH, Bruce JM, Dwyer MG, Abdelrahman N, Hussein S, Weinstock-Guttman B et al (2006) Neocortical atrophy, third ventricular width, and cognitive dysfunction in multiple sclerosis. Arch Neurol 63(9):1301–1306. https://doi.org/10.1001/archneur.63.9.1301
Benedict RH, Morrow SA, Weinstock Guttman B, Cookfair D, Schretlen DJ (2010) Cognitive reserve moderates decline in information processing speed in multiple sclerosis patients. J Int Neuropsychol Soc 16(5):829–835. https://doi.org/10.1017/S1355617710000688
Benton AL, Hamsher K (1989) Multilingual aphasia examination. AJA Associates, Iowa City
Bergendal G, Fredrikson S, Almkvist O (2007) Selective decline in information processing in subgroups of multiple sclerosis: an 8-year longitudinal study. Eur Neurol 57(4):193–202. https://doi.org/10.1159/000099158
Cawley GC, Talbot NLC (2010) On over-fitting in model selection and subsequent selection bias in performance evaluation. J Mach Learn Res 11:2079–2107
Chiaravalloti ND, DeLuca J (2008) Cognitive impairment in multiple sclerosis. Lancet Neurol 7(12):1139–1151. https://doi.org/10.1016/S1474-4422(08)70259-X
Costa SL, Genova HM, DeLuca J, Chiaravalloti ND (2017) Information processing speed in multiple sclerosis: past, present, and future. Mult Scler 23(6):772–789. https://doi.org/10.1177/1352458516645869
Covey TJ, Shucard JL, Shucard DW (2016) Evaluation of cognitive control and distraction using event-related potentials in healthy individuals and patients with multiple sclerosis. In: International conference on augmented cognition. Springer International Publishing, pp 165–176
Covey TJ, Shucard JL, Shucard DW (2017) Event-related brain potential indices of cognitive function and brain resource reallocation during working memory in patients with Multiple Sclerosis. Clin Neurophysiol 128(4):604–621. https://doi.org/10.1016/j.clinph.2016.12.030
Crawford JR (1992) Current and premorbid intelligence measures in neuropsychological assessment. In: Crawford JR, McKinlay WW (eds) A handbook of neuropsychological assessment. Erlbaum, London, pp 21–49
De Sonneville LM, Boringa JB, Reuling IE, Lazeron RH, Ader HJ, Polman CH (2002) Information processing characteristics in subtypes of multiple sclerosis. Neuropsychologia 40(11):1751–1765
Delis DC, Kramer JH, Kaplan E, Ober BA (2000) California verbal learning test: second edition (CVLT-II). The Psychological Corporation, San Antonio
Delorme A, Makeig S (2004) EEGLAB: an open source toolbox for analysis of single-trial EEG dynamics including independent component analysis. J Neurosci Methods 134(1):9–21. https://doi.org/10.1016/j.jneumeth.2003.10.009S0165027003003479
Doyle OM, Mehta MA, Brammer MJ (2015) The role of machine learning in neuroimaging for drug discovery and development. Psychopharmacology 232(21–22):4179–4189. https://doi.org/10.1007/s00213-015-3968-0
Filippi M, Rocca MA, Benedict RH, DeLuca J, Geurts JJ, Rombouts SA, Ron M, Comi G (2010) The contribution of MRI in assessing cognitive impairment in multiple sclerosis. Neurology 75(23):2121–2128
Friedman D, Cycowicz YM, Gaeta H (2001) The novelty P3: an event-related brain potential (ERP) sign of the brain’s evaluation of novelty. Neurosci Biobehav Rev 25(4):355–373
Genova HM, DeLuca J, Chiaravalloti N, Wylie G (2013) The relationship between executive functioning, processing speed and white matter integrity in multiple sclerosis. J Clin Exp Neuropsychol 35(6):631
Ghaffar O, Fiati M, Feinstein A (2012) Occupational attainment as a marker of cognitive reserve in multiple sclerosis. PLoS ONE 7(10):e47206. https://doi.org/10.1371/journal.pone.0047206
Gillan CM, Whelan R (2017) What big data can do for treatment in psychiatry. Curr Opin Behav Sci 31(18):34–42
Glanz BI, Healy BC, Hviid LE, Chitnis T, Weiner HL (2012) Cognitive deterioration in patients with early multiple sclerosis: a 5-year study. J Neurol Neurosurg Psychiatry 83(1):38–43. https://doi.org/10.1136/jnnp.2010.237834
Gronwall DMA (1977) Paced auditory serial-addition task: measure of recovery from concussion. Percept Motor Skill 44:367–373
Hamalainen P, Rosti-Otajarvi E (2016) Cognitive impairment in MS: rehabilitation approaches. Acta Neurol Scand 134(Suppl 200):8–13. https://doi.org/10.1111/ane.12650
Hoffmann S, Tittgemeyer M, von Cramon DY (2007) Cognitive impairment in multiple sclerosis. Curr Opin Neurol 20(3):275–280. https://doi.org/10.1097/WCO.0b013e32810c8e8700019052-200706000-00006
Holdnack HA (2001) Wechsler test of adult reading: WTAR. The Psychological Corporation, San Antonio
Jollans L, Whelan R (2016) The clinical added value of imaging: A perspective from outcome prediction. Biol Psychiatry Cogn Neurosci Neuroimaging 1(5):423–432
Jollans L, Whelan R (2017) Neuromarkers for mental disorders: Harnessing population neuroscience. In: Werdecker A (ed) Biomarkers for demographic research. Springer (In press)
Jollans L, Zhipeng C, Icke I, Greene C, Kelly C, Banaschewski T et al (2016) Ventral striatum connectivity during reward anticipation in adolescent smokers. Dev Neuropsychol 41(1–2):6–21
Jollans L, Whelan R, Venables L, Turnbull OH, Cella M, Dymond S (2017) Computational EEG modelling of decision making under ambiguity reveals spatio-temporal dynamics of outcome evaluation. Behav Brain Res 321:28–35
Kalmar JH, Halper J, Gaudino EA, Moore NB, DeLuca J (2008) The relationship between cognitive deficits and everyday functional activities in multiple sclerosis. Neuropsychology 22(4):442–449. https://doi.org/10.1037/08944105.22.4.442
Kendler KS (2012) The dappled nature of causes of psychiatric illness: replacing the organic-functional/hardware-software dichotomy with empirically based pluralism. Mol Psychiatry 17(4):377–388. https://doi.org/10.1038/mp.2011.182
Key AP, Dove GO, Maguire MJ (2005) Linking brainwaves to the brain: an ERP primer. Dev Neuropsychol 27(2):183–215. https://doi.org/10.1207/s15326942dn2702_1
Kiiski H, Reilly RB, Lonergan R, Kelly S, O’Brien M, Kinsella K et al (2011) Change in PASAT performance correlates with change in P3 ERP amplitude over a 12-month period in multiple sclerosis patients. J Neurol Sci 305(1–2):45–52. https://doi.org/10.1016/j.jns.2011.03.018
Kiiski H, Reilly RB, Lonergan R, Kelly S, O’Brien MC, Kinsella K et al (2012) Only low frequency event-related EEG activity is compromised in multiple sclerosis: insights from an independent component clustering analysis. PLoS ONE 7(9):e45536. https://doi.org/10.1371/journal.pone.0045536
Kimiskidis VK, Papaliagkas V, Sotirakoglou K, Kouvatsou ZK, Kapina VK, Papadaki E et al (2016) Cognitive event-related potentials in multiple sclerosis: Correlation with MRI and neuropsychological findings. Mult Scler Relat Disord 10:192–197. https://doi.org/10.1016/j.msard.2016.10.006
Kok A (2001) On the utility of P3 amplitude as a measure of processing capacity. Psychophysiology 38(3):557–577. https://doi.org/10.1017/S0048577201990559
Kurtzke JF (2008) Historical and clinical perspectives of the expanded disability status scale. Neuroepidemiology 31(1):1–9. doi:https://doi.org/10.1159/000136645
Lazeron RH, Rombouts SA, Scheltens P, Polman CH, Barkhof F (2004) An fMRI study of planning-related brain activity in patients with moderately advanced multiple sclerosis. Mult Scler 10(5):549–555. https://doi.org/10.1191/1352458504ms1072oa
Leocani L, Comi G (2000) Neurophysiological investigations in multiple sclerosis. Curr Opin Neurol 13(3):255–261
Lopez-Gongora M, Escartin A, Martinez-Horta S, Fernandez-Bobadilla R, Querol L, Romero S et al (2015) Neurophysiological evidence of compensatory brain mechanisms in early-stage multiple sclerosis. PLoS ONE 10(8):e0136786. https://doi.org/10.1371/journal.pone.0136786
Lowe C, Rabbitt P (1998) Test/re-test reliability of the CANTAB and ISPOCD neuropsychological batteries: theoretical and practical issues. Neuropsychologia 36(9):915–923
Luck SJ, Gaspelin N (2017) How to get statistically significant effects in any ERP experiment (and why you shouldn’t). Psychophysiology 54(1):146–157
McCarthy M, Beaumont JG, Thompson R, Peacock S (2005) Modality-specific aspects of sustained and divided attentional performance in multiple sclerosis. Arch Clin Neuropsychol 20(6):705–718. https://doi.org/10.1016/j.acn.2005.04.007
Moutoussis M, Eldar E, Dolan RJ (2016) Building a new field of computational psychiatry. Biol psychiatry 82(6):388–390. https://doi.org/10.1016/j.biopsych.2016.10.007
Nolan H, Whelan R, Reilly RB (2010) FASTER: fully automated statistical thresholding for EEG artifact rejection. J Neurosci Methods 192(1):152–162. https://doi.org/10.1016/j.jneumeth.2010.07.015
Oostenveld R, Praamstra P (2001) The five percent electrode system for high-resolution EEG and ERP measurements. Clin Neurophysiol 112(4):713–719
Piras MR, Magnano I, Canu ED, Paulus KS, Satta WM, Soddu A et al (2003) Longitudinal study of cognitive dysfunction in multiple sclerosis: neuropsychological, neuroradiological, and neurophysiological findings. J Neurol Neurosurg Psychiatry 74(7):878–885
Polich J (2007) Updating P300: an integrative theory of P3a and P3b. Clin Neurophysiol 118(10):2128–2148. https://doi.org/10.1016/j.clinph.2007.04.019
Polman C, Barkhof F, Sandberg-Wollheim M, Linde A, Nordle O, Nederman T (2005) Treatment with laquinimod reduces development of active MRI lesions in relapsing MS. Neurology 64(6):987–991. https://doi.org/10.1212/01.WNL.0000154520.48391.69
Polman CH, Reingold SC, Banwell B, Clanet M, Cohen JA, Filippi M et al (2011) Diagnostic criteria for multiple sclerosis: 2010 revisions to the McDonald criteria. Ann Neurol 69(2):292–302. https://doi.org/10.1002/ana.22366
Rabbitt P, Lowe C, Shilling V (2001) Frontal tests and models for cognitive ageing. Eur J Cogn Psychol 13:5–28
Rocca MA, Amato MP, De Stefano N, Enzinger C, Geurts JJ, Penner IK et al (2015) Clinical and imaging assessment of cognitive dysfunction in multiple sclerosis. Lancet Neurol 14(3):302–317. https://doi.org/10.1016/S1474-4422(14)70250-9
Rogers JM, Panegyres PK (2007) Cognitive impairment in multiple sclerosis: evidence-based analysis and recommendations. J Clin Neurosci 14(10):919–927. https://doi.org/10.1016/j.jocn.2007.02.006
Santiago O, Guardia J, Casado V, Carmona O, Arbizu T (2007) Specificity of frontal dysfunctions in relapsing-remitting multiple sclerosis. Arch Clin Neuropsychol 22(5):623–629. https://doi.org/10.1016/j.acn.2007.04.003
Scarpazza C, Braghittoni D, Casale B, Malagu S, Mattioli F, di Pellegrino G et al (2013) Education protects against cognitive changes associated with multiple sclerosis. Restor Neurol Neurosci 31(5):619–631. https://doi.org/10.3233/RNN-120261
Smith A (1982) Symbol digit modalities test: manual. Western Psychological Services, Los Angeles
Stern Y, Habeck C, Moeller J, Scarmeas N, Anderson KE, Hilton HJ et al (2005) Brain networks associated with cognitive reserve in healthy young and old adults. Cereb Cortex 15(4):394–402. https://doi.org/10.1093/cercor/bhh142
Sumowski JF, Leavitt VM (2013) Cognitive reserve in multiple sclerosis. Mult Scler 19(9):1122–1127. https://doi.org/10.1177/1352458513498834
Sumowski JF, Chiaravalloti N, Leavitt VM, Deluca J (2012) Cognitive reserve in secondary progressive multiple sclerosis. Mult Scler 18(10):1454–1458. https://doi.org/10.1177/1352458512440205
Sumowski JF, Rocca MA, Leavitt VM, Riccitelli G, Comi G, DeLuca J et al (2013) Brain reserve and cognitive reserve in multiple sclerosis: what you’ve got and how you use it. Neurology 80(24):2186–2193. https://doi.org/10.1212/WNL.0b013e318296e98b
Sundgren M, Nikulin VV, Maurex L, Wahlin A, Piehl F, Brismar T (2015a) P300 amplitude and response speed relate to preserved cognitive function in relapsing-remitting multiple sclerosis. Clin Neurophysiol 126(4):689–697. https://doi.org/10.1016/j.clinph.2014.07.024
Sundgren M, Wahlin A, Maurex L, Brismar T (2015b) Event related potential and response time give evidence for a physiological reserve in cognitive functioning in relapsing-remitting multiple sclerosis. J Neurol Sci 356(1–2):107–112. https://doi.org/10.1016/j.jns.2015.06.025
Titlic M, Mihalj M, Petrovic AB, Suljic E (2016) P300 as an auxiliary method in clinical practice: a review of literature. J Health Sci 6(3):143–148
Trenerry MR, Crossan B, DeBoe J, Leber WR (1989) Stroop neuropsychological screening test: manual. Psychological Assessment Resources, Florida
Van Schependom J, Gielen J, Laton J, D’Hooghe MB, De Keyser J, Nagels G (2014) Graph theoretical analysis indicates cognitive impairment in MS stems from neural disconnection. Neuroimage Clin 4:403–410. https://doi.org/10.1016/j.nicl.2014.01.012
Vazquez-Marrufo M, Gonzalez-Rosa JJ, Galvao-Carmona A, Hidalgo-Munoz A, Borges M, Pena JL et al (2013) Retest reliability of individual p3 topography assessed by high density electroencephalography. PLoS ONE 8(5):e62523. https://doi.org/10.1371/journal.pone.0062523
Vazquez-Marrufo M, Galvao-Carmona A, Gonzalez-Rosa JJ, Hidalgo-Munoz AR, Borges M, Ruiz-Pena JL et al (2014) Neural correlates of alerting and orienting impairment in multiple sclerosis patients. PLoS ONE 9(5):e97226. https://doi.org/10.1371/journal.pone.0097226
Whelan R (2008) Effective analysis of reaction time data. Psychol Rec 58(3):475–482
Whelan R, Garavan H (2014) When optimism hurts: inflated predictions in psychiatric neuroimaging. Biol Psychiatry 75(9):746–748. https://doi.org/10.1016/j.biopsych.2013.05.014
Whelan R, Lonergan R, Kiiski H, Nolan H, Kinsella K, Bramham J et al (2010a) A high-density ERP study reveals latency, amplitude, and topographical differences in multiple sclerosis patients versus controls. Clin Neurophysiol 121(9):1420–1426. https://doi.org/10.1016/j.clinph.2010.03.019
Whelan R, Lonergan R, Kiiski H, Nolan H, Kinsella K, Hutchinson M et al (2010b) Impaired information processing speed and attention allocation in multiple sclerosis patients versus controls: a high-density EEG study. J Neurol Sci 293(1):45–50. https://doi.org/10.1016/j.jns.2010.03.010
Whelan R, Watts R, Orr CA, Althoff RR, Artiges E, Banaschewski T et al (2014) Neuropsychosocial profiles of current and future adolescent alcohol misusers. Nature 512(7513):185–189. https://doi.org/10.1038/nature13402
Woo CW, Chang LJ, Lindquist MA, Wager TD (2017) Building better biomarkers: brain models in translational neuroimaging. Nat Neurosci 20(3):365–377. https://doi.org/10.1038/nn.4478
Yarkoni T, Westfall J (2016) Choosing prediction over explanation in psychology: Lessons from machine learning. Unpublished manuscript. Retrieved from http://jakewestfall.org/publications/Yarkoni_Westfall_choosing_prediction.pdf
Zhao Y, Healy BC, Rotstein D, Guttmann CR, Bakshi R, Weiner HL, Brodley CE, Chitnis T (2017) Exploration of machine learning techniques in predicting multiple sclerosis disease course. PLoS ONE 12(4):e0174866
Ziemann U, Wahl M, Hattingen E, Tumani H (2011) Development of biomarkers for multiple sclerosis as a neurodegenerative disorder. Prog Neurobiol 95(4):670–685. https://doi.org/10.1016/j.pneurobio.2011.04.007
Zou H, Hastie T (2005) Regularization and variable selection via the elastic net. J R Stat Soc Series B Stat Methodol 67:301–320
Funding
This study was partly funded by an Enterprise Ireland (eBiomed: eHealthCare based on Biomedical Signal Processing and ICT for Integrated Diagnosis and Treatment of Disease), a Science Foundation Ireland grant to R.B. Reilly (09/RFP/NE2382), an IRCSET grants to H. Kiiski and S. Ó. Donnchadha (http://www.ircset.ie), a Health Service Executive funding to M.C. O’Brien and a Science Foundation Ireland grant to R. Whelan (16/ERCD/3797). The study sponsors had no involvement in the collection, analysis and interpretation of data and in the writing of the manuscript.
Author information
Authors and Affiliations
Corresponding author
Ethics declarations
Conflict of interest
The authors declare that they have no conflict of interest.
Ethical Approval
All procedures performed in studies involving human participants were in accordance with the ethical standards of the institutional and/or national research committee (the Ethics and Medical Research Committee of the St. Vincent’s Healthcare Group) and with the 1964 Helsinki declaration and its later amendments or comparable ethical standards.
Informed Consent
Written informed consent was obtained from all individual participants included in the study on each testing occasion.
Additional information
Handling Editor: Stefano Seri.
Electronic supplementary material
Below is the link to the electronic supplementary material.
Supplementary Fig. 1
Cognitive functioning composite score (z) in RRMS, SPMS and control participants. (TIFF 312 KB)
Supplementary Fig. 2
Processing speed and working memory composite score (z) in RRMS, SPMS and control participants. (TIFF 323 KB)
Supplementary Video 1 ERP activity over the scalp during visual 2-stimulus oddball task (0-700ms) that predicted cognitive functioning at Month 0. Higher beta choice frequency values denote better accuracy in predicting cognitive functioning score. (AVI 2023 KB)
Supplementary Video 2 ERP activity over the scalp during visual 2-stimulus oddball task (0-700ms) that predicted cognitive functioning at Month 13. Higher beta choice frequency values denote better accuracy in predicting cognitive functioning score. (AVI 2109 KB)
Supplementary Video 3 ERP activity over the scalp during visual 3-stimulus oddball task (0-700ms) that predicted cognitive functioning at Month 13. Higher beta choice frequency values denote better accuracy in predicting cognitive functioning score. (AVI 1947 KB)
Supplementary Video 4 ERP activity over the scalp during visual 2-stimulus oddball task (0-700ms) that predicted processing speed and working memory performance at Month 0. Higher beta choice frequency values denote better accuracy in predicting processing speed and working memory score. (AVI 1935 KB)
Supplementary Video 5 ERP activity over the scalp during visual 2-stimulus oddball task (0-700ms) that predicted processing speed and working memory performance at Month 13. Higher beta choice frequency values denote better accuracy in predicting processing speed and working memory score. (AVI 1990 KB)
Supplementary Video 6 ERP activity (µV) in multiple sclerosis and healthy control participants during visual 2-stimulus oddball task at Month 0. (AVI 1574 KB)
Appendix
Appendix
The Regularized Adaptive Feature Thresholding Algorithm (RAFT)
Nested Cross-Validation
The dataset is initially divided into ten cross-validation (CV) folds. The entire analysis is performed ten times, using 90% of the dataset (the training set) to create a regression model which is then tested on the remaining 10% of the data (the test set). Within the training set, additional ‘nested’ cross-validation with ten partitions is used to support the analyses at the feature selection and model optimisation level. Results from all ten CV folds are finally aggregated, using the frequency with which a variable is found in models from different CV folds as a measure of its robustness.
Threshold Creation
Each feature of the dataset is individually evaluated to assess its utility in predicting the continuous target variable, in what can be considered a filtering step. A simple linear regression model is applied to the training set (81% of the data) for that feature and the target variable, and the resulting regression weight is used to make outcome predictions for the test set (9% of the data). A root mean squared error criterion (mse) is used to quantify the prediction error.
Based on the range of mse values estimated across all CV folds, a set of ten prediction error thresholds is created. These thresholds rank features according to their mse. At each prediction error threshold (t mse ) and for each nested CV partition n in every main CV fold m there is a subset of features f which have smaller prediction error values than that threshold.
A set of ten stability thresholds (tstability) is also used to assess how stable, over samples, the mse value assigned to each feature is. This is quantified as the number of nested CV partitions n in which a feature had a prediction error value lower than each mse threshold.
The ten prediction error thresholds and ten stability thresholds jointly define 100 new summary datasets, which include all features that had a smaller mse value than tmse in the number of CV partitions specified by tstability.
The prediction error thresholds are chosen based on the range of prediction error and feature stability across the sample. The most liberal prediction error threshold t mse (max) is chosen such that in each main CV fold m there is at least one feature which is common to all nested CV partitions, i.e. t mse (max) is the smallest prediction error value at which the following is true:
The strictest prediction error threshold t mse (min) is set as the lowest possible prediction error value at which every nested CV partition n in every main CV fold still contains at least one feature that has a smaller prediction error value than that threshold. That is, t mse (min) is the smallest prediction error value at which the following is true:
Taken together t mse and t stability define how high the individual predictive power of each feature in the knowledge base is, and how stable results with each feature are across subsets of the sample. The creation of these thresholds serves the purpose of integrating the choice of the criterion used to select features from the filtering step into model selection, eliminating researcher input at this point.
Model Optimisation
The feature sets which are created in the first analysis step are used as inputs into a model optimisation algorithm. We chose to use Elastic Net regularisation (Zou and Hastie 2005), but the feature selection framework can be adapted for use with other optimisation algorithms. The Elastic Net uses two parameters: λ and α. Alpha represents the weight of lasso vs. ridge regularisation which the Elastic Net uses, and λ is the regularization coefficient. Both Lasso and Ridge regression apply a penalty for large regression coefficient values, but Lasso regularization favours models with fewer features, making it more prone to excluding features. Eight values of each are considered, resulting in 64 models being built with each feature set, resulting in a total of 6400 models. For each of these models the prediction error is measured using the same root mean squared error criterion which was used to assess the feature mse. For each model an updated feature set is saved, which excludes any features that were excluded by the Elastic Net.
Bootstrap Aggregation
Calculations in the thresholding and model optimisation step are validated using 25-fold bootstrap aggregation (bagging). Instead of performing the analysis once using all data, summary datasets are created by randomly sampling on average two-thirds of the data in each iteration. Results from each iteration are aggregated using the median value. A feature was removed after the model optimization step if the Elastic Net removed the feature in more than half of all bagging iterations. As bagging greatly increases the computational expense we also tested the method without bagging.
Model Validation
After the model optimisation step, the combination of model parameters and thresholds which resulted in the model with the lowest prediction error is identified for each nested CV partition. The optimal model parameters and thresholds from each nested CV partition are used to identify what parameters will be used to create the final prediction model in each main CV fold, using the most frequently occurring values of α, λ, t mse and t stability . To select the features to include in the final model for each CV fold, the stability of all features that were included in the updated feature sets at the optimal prediction error and stability thresholds is re-calculated.
Only the features that were included in at least as many of the ten models with optimal parameters as specified by the optimal stability threshold are used to create the feature set for the final model.
It is possible that this implementation of the stability threshold does not leave any features for inclusion in the model. Should this be the case the closest possible parameter combination is used to create the feature set.
This feature set and the covariates are used as input into the Elastic Net, using the optimal values for α and λ, and the entire training set (90% of the data). The beta weights generated by the Elastic Net are subsequently used to make outcome prediction for the final unseen portion of the data (10%). Each CV fold is used to make outcome predictions for 10% of the data, and the evaluation of model fit is carried out using the complete vector of outcome predictions from all CV folds.
Performing model selection as an integrated step within the CV framework is an essential step in preserving the external validity of the resultant model (Cawley and Talbot 2010).
See Fig. 8.
Rights and permissions
About this article
Cite this article
Kiiski, H., Jollans, L., Donnchadha, S.Ó. et al. Machine Learning EEG to Predict Cognitive Functioning and Processing Speed Over a 2-Year Period in Multiple Sclerosis Patients and Controls. Brain Topogr 31, 346–363 (2018). https://doi.org/10.1007/s10548-018-0620-4
Received:
Accepted:
Published:
Issue Date:
DOI: https://doi.org/10.1007/s10548-018-0620-4