Abstract
We describe a method of achieving emotion classification using ECG and EDA data. There have been many studies conducted on usage of heart rate and EDA data to quantify the arousal level of a user [1–3]. Researchers have identified a connection between a person’s ECG data and the positivity or negativity of their emotional state [4]. The goal of this work is to extend this idea to human computer interaction domain. We will explore whether the valence/arousal level of a subject’s response to computer based stimuli is predictable using ECG and EDA, and whether or not that information can complement recordings of participants’ cognitive data to form a more accurate depiction of emotional state. The experiment consists of presenting three types of stimuli, both interactive and noninteractive, to 9 subjects and recording their physiological response via ECG and EDA data as well as fNIRS device. The stimuli were selected from validated methods of inducing emotion including DEAP dataset [5], Multi Attribute Task Battery [6] and Tetris video game [7]. The participants’ responses were captured using Self-Assessment Manikin [8] surveys which were used as the ground truth labels. The resulting data was analyzed using Machine Learning. The results provide new avenues of research in combining physiological data to classify emotion.
You have full access to this open access chapter, Download conference paper PDF
1 Background and Literature Review
1.1 Nature of Emotion and Connection to Physiological Measures
Emotion is a complex phenomenon, often difficult for humans, not to mention machines, to recognize and respond appropriately [9, 10]. “Valence and arousal” [11] are two dimensions widely used and validated in neuro-physiological research. Arousal is conceptualized as “a unitary force that intensifies motivated behavior [20].” As superordinate dimensions of emotion, arousal is a dimension of emotion that maps intensity, while the dimension of the emotion being pleasant versus unpleasant (i.e., the direction of emotion) is dedicated to valence [17]. Russell’s circumplex model of affect is arguably the most popular model depicting the arousal and valence dimensions and their relation to emotional state [39]. Research states that most people’s neutral (baseline) state is located near point (0, 0) in Russell’s model [39] (Fig. 1).
This study considers valence and arousal as having “Low” and “High” states without concerning ourselves with the granular details of Russel’s model. Therefore we will try to classify between high and low valence, and high and low arousal.
The most common ‘measures’ of emotion are done using self-report surveys, such as the Self-Assessment Manikin (SAM) or the Positive and Negative Affect Scale (PANAS) survey instruments [12]. The SAM uses a non-verbal pictorial assessment technique that directly measures the pleasure, arousal, and dominance associated with a person’s affective reaction to a wide variety of stimuli. In this work we will focus on each of these factors as it relates to the physiological readings.
1.2 Physiological Manifestations of Emotion
Electrodermal Activity on Emotion.
Arousal can be measured as difference in skin conductance, as a result of increased activity in eccrine sweat glands on palm or sole [17]. The activity of eccrine glands is thought to be related to both central and peripheral nervous system. A meta-analysis on a series of studies found that brain regions that are thought to be involved in generating emotional response are also involved in eccrine glands activity [21]. Also, eccrine glands is innervated by the sympathetic branch of autonomic nervous system in peripheral nervous system. Thus, higher activity in skin conductance is thought as an index of higher arousal.
The relationship between body and brain is thought to be interactive and influence each other [9]. This embodiment view of body and mind in human emotion found that brain activity is influenced by afferent and efferent signals from the body [13, 14]. Under this perspective, arousal is related to sympathetic activation in the autonomic branch of the peripheral nervous system, which is observed in skin conductance and heart rate [1, 15, 16]. In this experiment, Skin Conductance was measured in a form of electrodermal activity (EDA) and heart rate was measured as electrocardiogram (ECG).
The connection between heart rate and cognition has been studied in literature [17]. There have been a multitude of studies that attempt to translate the arousal aspect of emotion to EDA [18, 19]. There have been limitations in using ECG data to detect arousal due to the confounding effect of arousal and attention involved in heart rate [1, 2, 16]. By using machine learning technique, this study intends to seek if we can singulate an index for arousal from ECG by combining it with EDA.
Heart Rate on Emotion.
Heart Rate is also known to be related to emotional response on stimuli [1]. However, a confounding effect in heart rate makes it complicated to use it as an index for arousal, such that sympathetic nervous system activation is associated with arousal and faster heartbeat while parasympathetic nervous system activation that is associated with cognitive effort usually takes place as well for stimuli [17]. Although majority of studies [1, 3, 5, 9, 17] focused on the relationship between heart rate and arousal or attention, a more recent study found that the height of R-wave was lower when the stimuli had more arousal potential [22]. This finding suggests that analyzing other artifacts of the PQRST waveform in ECG may reveal direct relationships between ECG artifacts and arousal.
1.3 Machine Learning Efforts on ECG, EDA Data
ECG and EDA data are interesting as physiological responses despite being single channel time series data, there is a variety of features that can be derived from them (SCR, SCL, Heart Rate, Heart Rate Variability). They also represent correlations with emotional and neuronal activity as mentioned above. Methods such as sliding window, time shifting/warping [22] and symbolic representations [23] are used in analysis of time series data. Selvaraj et al. [30] used Bayesian classifier, Regression tree, k-Nearest Neighbor and fuzzy k-Nearest Neighbor on ECG data to classify emotional state. As discussed earlier, One of the most sensitive markers for emotional arousal is Electrodermal Activity (EDA). EDA reflects the amount of sweat secretion from sweat glands triggered by emotional stimulation. To equally assess both the quality of an emotion and its intensity, it is worth combining cognitive data with skin conductance measures to obtain higher accuracy than the individual measures. There have been research into combining EEG data with ECG data in literature [31]. Even combining EEG with other physiological sensors such as Eye Tracking [32] and Electromyography (EMG) [33]. However there is limited literature on combining fNIRS and GSR data [34]. This work incorporates cognitive data from fNIRS and ECG data along with EDA to obtain higher Emotion classification accuracy.
1.4 Functional Near Infrared Spectroscopy
The fNIRS device uses light sources in the wavelength range (690–830 nm) that are pulsed into the brain (Fig. 2).
Deoxygenated hemoglobin (Hb) and oxygenated hemoglobin (HbO) are the main absorbers of near-infrared light in tissues during hemodynamic and metabolic changes associated with neural activity in the brain [40]. These changes can be detected by measuring the diffusively reflected light that has probed the brain cortex [40, 41].
fNIRS has been used to classify various cognitive states while computer users complete tasks under normal working conditions [35]. It has also recently been successfully used to classify emotional state of the user [36] and, as noted above, one purpose of the current study is to complement the utility of fNIRS measurements in the classification of emotion.
2 Experiment
2.1 Stimuli Selection
Music Video Segments.
A subset of music video segments from the Database for Emotional Analysis using Physiological Signals (DEAP) mentioned previously [6] were selected as stimuli to elicit participant emotions. The DEAP research team selected the videos using a semi-automated method of user-tagged videos and subjectively rejected videos where the tag was not reflective of the emotion induced (e.g. a “happy songs about sad topics”) [6]. The original DEAP dataset stimulus materials consisted of 40 music videos that had been found to induce consistent self-report scores in the outer boundaries of the four quadrants of the circumplex model.
Multi Attribute Task Battery (MATB).
Psychology and neuroscience showed that emotions are connected to high-level reasoning; they are tightly linked to decision-making processes [38]. Also researchers have found connections between workload and emotion [37]. Therefore the different levels of workload are expected to elicit different emotional response. The Multi-Attribute Task (MAT) Battery provides a benchmark set of tasks for use in a wide range of laboratory studies of operator workload. The battery also provides a high degree of experimenter control and freedom to use diverse test subjects [28]. In this experiment, the MATB sessions were customized into 3 different workload levels (Low, Medium, High) in order to elicit different levels of workload in the user. This was achieved by presenting different frequencies of tasks to the user.
Tetris.
The Tetris version [29] was customized to elicit different difficulty levels similar to MATB above. The difficulty levels were changed by adjusting the time between steps in Tetris
-
200 ms between Tetris steps-hard
-
300 ms between Tetris steps-medium
-
400 ms between Tetris steps-easy
2.2 Equipment Setup
Electrocardiogram (ECG) electrodermal activity (EDA) and was measured using Biopac MP150 system. The sample rate was set up as 1000 kHz using AcqKnowledge 4.2 software. For ECG, Mason-Likar lead placement was employed [24] (Fig. 3).
fNIRS was setup to take concurrent measurements of cognitive data. The equipment used was a Hitachi ETG-4000 with 3X11 probe configuration (Fig. 4).
2.3 Protocol
9 college age subjects took part in this experiment. 6 were men and 3 were women.
The subjects were provided instructions on how to do the different activities. Then they ran through a demo of each task to get familiar. After that, the ECG and EDA electrodes were placed on their body. And the fNIRS was placed on their head. Next the subject was presented with the tasks in a pseudo randomized block design (Fig. 5).
Before each stimuli, the subject was presented with a REST screen for 30 s. This was to get the subject to baseline level before the start of stimulus. Each task was one minute long.
2.4 Post Task Surveys
Subjects were given a survey after each task to gage their perceived Arousal, Valence, Dominance, Liking and workload levels (Fig. 6).
2.5 Data Analysis
Preprocessing.
The raw data for ECG signal was presented in millivolt (mV), and EDA was in micro Siemens. By using ‘ECG interval extraction’ function in AcqKnowledge 4.2, PQRST waveform was analyzed into variables: RR interval, heart rate, R height, P height, QRS interval, PRQ interval, QT interval, corrected QT interval, and ST interval (Fig. 7).
-
RR interval: Time difference between R spikes of each heartbeat
-
Heart Rate: Number of heartbeats in a minute
-
R height: The height if R spike artifact
-
P Height: The height if P spike artifact
-
QRS interval: time difference between Q and S spikes of a heartbeat
-
PRQ interval: time difference between P and R spikes of a heartbeat
-
QT interval: time difference between Q and T spikes of a heartbeat
-
corrected QT interval: standardization of QT interval corrects shortened/lengthened QT interval in faster/slower heart rate to fit that of heart rate of 60
-
ST interval: time difference between S and T spikes of a heartbeat
The data set contains 11 columns of timestamped data. During preprocessing, the 33 trials data was extracted from the data set and the Mean, Standard Deviation, Maximum and Minimum was calculated for each of the columns.
This resulted in 11 * 4 = 44 attributes and 33 feature vectors for each subject. The survey data was divided into high and low around the average for each person. This was to ensure an even split between high and low data points.
-
Machine Learning.
ECG Data.
The labels thus created were used as the ground truth and fed into the classifier. Which was evaluated by a 10 fold cross validation scheme. The results for each individual and average results for the whole group are given in the results section.
Fusion of ECG and fNIRS Data.
The fusion of ECG and fNIRS data was done using a simple label weighting method. The labels obtained by each classifier were combined linearly to get final labels. The weighting of the classifications has only two possibilities.
Case 1: Weight assigned to ECG data prediction < Weight assigned to fNIRS data prediction.
Case 2: Weight assigned to fNIRS data prediction < Weight assigned to ECG data prediction (Fig. 8).
This prediction was compared with the binary survey label to get the accuracy of prediction.
3 Results
3.1 Valence Classification Using ECG Data
See Table 1.
3.2 Arousal Classification Using ECG Data
See Table 2.
3.3 Results from Combination of Cognitive and ECG/EDA Data
Case 1: Weight assigned to ECG data prediction < Weight assigned to fNIRS data prediction.
Classification accuracy of combined ECG and fNIRS data using Naive Bayes classifier = 85.85 %.
Classification accuracy of combined ECG and fNIRS data using SVM classifier = 66 %.
Case 2: Weight assigned to fNIRS data prediction < Weight assigned to ECG data prediction.
Classification accuracy of combined ECG and fNIRS data using Naive Bayes classifier.
Accuracy = 93.93 %.
Classification accuracy of combined ECG and fNIRS data using SVM classifier = 61.27 %.
The Naive Bayes classifier gave higher accuracy when combining ECG and fNIRS data whereas SVM accuracy was unchanged.
4 Conclusion
The results show that the combination of Electrocardiogram/Electrodermal Activity and cognitive data (fNIRS) provides higher accuracy than Electrocardiogram/Electrodermal Activity alone. A further explication of this work would be to apply it to a larger data set as well as cross subject data. However this work does emphasize the need for more research into fusion of physiological data to boost the accuracy levels.
References
Lang, A.: Involuntary attention and physiological arousal evoked by structural features and emotional content in TV commercials. Commun. Res. 17(3), 275–299 (1990). doi:10.1177/009365090017003001
Zillmann, D.: Excitation transfer in communication-mediated aggressive behavior. J. Exp. Soc. Psychol. 7(4), 419–434 (1971). http://dx.doi.org/10.1016/0022-1031(71)90075-8
Bradley, M.M., Codispoti, M., Cuthbert, B.N., Lang, P.J.: Emotion and motivation I: defensive and appetitive reactions in picture processing. Emotion 1(3), 276–298 (2001). doi:10.1037/1528-3542.1.3.276
Brosschot, J.F., Thayer, J.F.: Heart rate response is longer after negative emotions than after positive emotions. Int. J. Psychophysiol. 50(3), 181–187 (2003)
Kim, K.H., Bang, S.W., Kim, S.R.: Emotion recognition system using short-term monitoring of physiological signals. Med. Biol. Eng. Comput. 42(3), 419–427 (2004)
Koelstra, S., Muhl, C., Soleymani, M., Jong-Seok Lee, A., Yazdani, T., Ebrahimi, T., Pun, A.Nijholt, Patras, I.: DEAP: a database for emotion analysis; using physiological signals. IEEE Trans. Affect. Comput. 3(1), 18–31 (2012). doi:10.1109/T-AFFC.2011.15
Comstock, J.R., Arnegard, R.J.: The Multi-attribute Task Battery for Human Operator Workload and Strategic Behavior Research. National Aeronautics and Space Administration, Langley Research Center, Hampton (1992)
Chanel, G., Rebetez, C., Bétrancourt, M., Pun, T.: Emotion assessment from physiological signals for adaptation of game difficulty. IEEE Trans. Syst. Man Cybern. Part A Syst. Hum. 41(6), 1052–1063 (2011)
Bradley, M.M., Lang, P.J.: Measuring emotion: the self-assessment manikin and the semantic differential. J. Behav. Ther. Exp. Psychiatry 25(1), 49–59 (1994)
Plutchik, R.: The nature of emotions human emotions have deep evolutionary roots, a fact that may explain their complexity and provide tools for clinical practice. Am. Sci. 89(4), 344–350 (2001)
Scherer, K.R.: What are emotions? And how can they be measured? Soc. Sci. Inf. 44(4), 695–729 (2005)
Russell, J.A.: A circumplex model of affect. J. Pers. Soc. Psychol. 39(6), 1161–1178 (1980)
Barrett, L.F., Lindquist, K.A.: The embodiment of emotion. In: Semin, G.R., Smith, E.R. (eds.) Embodied Grounding: Social, Cognitive, Affective, and Neuroscientific Approaches, pp. 237–262. Cambridge University Press, New York (2008)
Andreassi, J.L.: Psychophysiology: Human Behavior and Physiological Response. Psychology Press, New York (2013)
Lane, R.D., McRae, K., Reiman, E.M., Chen, K., Ahern, G.L., Thayer, J.F.: Neural correlates of heart rate variability during emotion. NeuroImage 44(1), 213–222 (2009)
Boucsein, W.: Electrodermal Activity. Springer Science & Business Media, New York (2012)
Potter, R.F., Bolls, P.D.: Psychophysiological Measurement and Meaning: Cognitive and Emotional Processing of Media. Routledge, New York (2012)
Shi, Y., Ruiz, N., Taib, R., Choi, E., Chen, F.: Galvanic skin response (GSR) as an index of cognitive load. In: CHI 2007 Extended Abstracts on Human Factors in Computing Systems, pp. 2651–2656. ACM, April 2007
Westerink, J.H., Van Den Broek, E.L., Schut, M.H., Van Herk, J., Tuinenbreijer, K.: Computing emotion awareness through galvanic skin response and facial electromyography. In: Westerink, J.H.D.M., Ouwerkerk, M., Overbeek, T.J.M., Frank Pasveer, W., de Ruyter, B. (eds.) Probing Experience, pp. 149–162. Springer, Netherlands (2008)
Zillmann, D.: Television Viewing and Arousal Television and Social Behavior. U.S. Government Printing Office, Washington (1981)
Kober, H., Barrett, L.F., Joseph, J., Bliss-Moreau, E., Lindquist, K., Wager, T.D.: Functional grouping and cortical–subcortical interactions in emotion: a meta-analysis of neuroimaging studies. NeuroImage 42(2), 998–1031 (2008). doi:10.1016/j.neuroimage.2008.03.059
Dousty, M., Daneshvar, S., Haghjoo, M.: The effects of sedative music, arousal music, and silence on electrocardiography signals. J. Electrocardiol. 44(3), 396.e391–396.e396 (2011). http://dx.doi.org/10.1016/j.jelectrocard.2011.01.005
AcqKnowledge [Computer Software] (2014). http://www.biopac.com/
Farrell, R.M., Syed, A., Syed, A., Gutterman, D.D.: Effects of limb electrode placement on the 12- and 16-lead electrocardiogram. J. Electrocardiol. 41(6), 536–545 (2008). http://search.proquest.com/docview/216204094?accountid=14214
Electrode placement [Online image] (2014). http://advancedparamedicine.ca/posts/limb-leads. Accessed 10 Feb 2016
Berndt, D.J., Clifford, J.: Using dynamic time warping to find patterns in time series. In: KDD workshop, vol. 10, no. 16, pp. 359–370, July 1994
Lin, J., Keogh, E., Lonardi, S., Chiu, B.: A symbolic representation of time series, with implications for streaming algorithms. In: Proceedings of the 8th ACM SIGMOD Workshop on Research Issues in Data Mining and Knowledge Discovery, pp. 2–11. ACM, June 2003
Comstock Jr., J.R., Arnegard, R.J.: The multi-attribute task battery for human operator workload and strategic behavior research (1992)
McIntyre, R.C.: TetrisCSharp. Computer software (2011)
Selvaraj, J., Murugappan, M., Wan, K., Yaacob, S.: Classification of emotional states from electrocardiogram signals: a non-linear approach based on hurst. Biomed. Eng. Online 12, 44 (2013)
Schmidt, L.A., Trainor, L.J., Santesso, D.L.: Development of frontal electroencephalogram (EEG) and heart rate (ECG) responses to affective musical stimuli during the first 12 months of post-natal life. Brain Cogn. 52(1), 27–32 (2003)
Zheng, W.L., Dong, B.N., Lu, B.L.: Multimodal emotion recognition using EEG and eye tracking data. In: 2014 36th Annual International Conference of the IEEE Engineering in Medicine and Biology Society (EMBC), pp. 5040–5043. IEEE, August 2014
Wexler, B.E., Warrenburg, S., Schwartz, G.E., Janer, L.D.: EEG and EMG responses to emotion-evoking stimuli processed without conscious awareness. Neuropsychologia 30(12), 1065–1079 (1992)
Hirshfield, L.M., Bobko, P., Barelka, A., Hirshfield, S.H., Farrington, M.T., Gulbronson, S., Paverman, D.: Using noninvasive brain measurement to explore the psychological effects of computer malfunctions on users during human-computer interactions. Adv. Hum. Comput. Interact. 2014, 2 (2014)
Hirshfield, L., Costa, M., Bandara, D., Bratt, S.: Measuring situational awareness aptitude using functional near-infrared spectroscopy. In: Schmorrow, D.D., Fidopiastis, C.M. (eds.) AC 2015. LNCS, vol. 9183, pp. 244–255. Springer, Heidelberg (2015)
Bandara, D.S.: Emotion-Reading: Affective Analysis Using fNIRS. https://arxiv.org/submit/1490836/view. Accessed 24 Feb 2016
Myrtek, M., Deutschmann-Janicke, E., Strohmaier, H., Zimmermann, W., Lawerenz, S., Brügner, G., Müller, W.: Physical, mental, emotional, and subjective workload components in train drivers. Ergonomics 37(7), 1195–1203 (1994)
Marg, E.: DESCARTES’ ERROR: emotion, reason, and the human brain. Optom. Vis. Sci. 72(11), 847–848 (1995)
Russell, J.A.: A circumplex model of affect. J. Pers. Soc. Psychol. 39, 1161–1178 (1980)
Chance, B., Zhuang, Z., UnAh, C., Alter, C., Lipton, L.: Cognition-activated low-frequency modulation of light absorption in human brain. Proc. Natl. Acad. Sci. 90(8), 3770–3774 (1993)
Izzetoglu, K., Bunce, S., Onaral, B., Pourrezaei, K., Chance, B.: Functional optical brain imaging using near-infrared during cognitive tasks. Int. J. Hum. Comput. Interact. 17(2), 211–227 (2004)
Atkielski, A.: Schematic diagram of normal sinus rhythm for a human heart as seen on ECG. Digital image. Sinus rhythm labels. Wikimedia commons (2007)
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2016 Springer International Publishing Switzerland
About this paper
Cite this paper
Bandara, D., Song, S., Hirshfield, L., Velipasalar, S. (2016). A More Complete Picture of Emotion Using Electrocardiogram and Electrodermal Activity to Complement Cognitive Data. In: Schmorrow, D., Fidopiastis, C. (eds) Foundations of Augmented Cognition: Neuroergonomics and Operational Neuroscience. AC 2016. Lecture Notes in Computer Science(), vol 9743. Springer, Cham. https://doi.org/10.1007/978-3-319-39955-3_27
Download citation
DOI: https://doi.org/10.1007/978-3-319-39955-3_27
Published:
Publisher Name: Springer, Cham
Print ISBN: 978-3-319-39954-6
Online ISBN: 978-3-319-39955-3
eBook Packages: Computer ScienceComputer Science (R0)