1 Introduction

Sentiment analysis is the process of extracting the feelings and emotions of users [1]. Liu [2] defines it as the field of study that analyzes people's opinions, feelings, evaluations, appraisals, attitudes and emotions towards entities such as products, services, organizations, individuals, themes and events, and their attributes. Chen [3] discusses the different computational approaches with which sentiment analysis can be performed: text-based, voice-based, visual and multimodal. One of the techniques used in this discipline is machine learning, which Mitchell [4] defines as a sub-area of computer science that studies methods to construct predictive computational models from observational data. Sentiment analysis can be applied to a myriad of disciplines and areas: economics, medicine, psychology, state security and politics; in this work, it is applied in psychology, more precisely in job interviews within organizations. The job interview is the most important process in recruitment and is used for various purposes: measurement of cognitive qualities, personality, and motor and physical skills [5].

Job interviews are a popular selection technique from many points of view. In organizations around the world, they remain one of the most widely used methods to evaluate candidates for employment. Among organizational decision makers, interviews have been found to be the assessment method most preferred by supervisors and human resources professionals. In addition, applicants perceive interviews as fair compared to other selection procedures and expect them as part of a selection process. In fact, from the applicant's perspective, obtaining a job interview is essential for success in the job search [6].

For sentiment analysis, there are instruments and techniques that require a specialist to interpret the results, and the costs of some of these devices are relatively high [7,8,9]. In pre-employment interviews, for example, it is a person who analyzes the behavior, gestures and certain key patterns of the interviewee, such as the gaze, the tone of voice and other expressions. One of the most widely used devices is the polygraph, which measures physiological alterations in people [10]; furthermore, Chica [11] mentions several disadvantages of this device, stating that there are also several “tricks” that can alter the test. Another device is the magnetic resonance scanner, which uses a technology considered among the best for lie detection; however, it focuses only on this, is very expensive and requires a rigorous process [12].

Because of these shortcomings and deficiencies in the existing techniques and devices, among other reasons, a low-cost model is proposed that can accurately interpret people's feelings with eye-tracking techniques and support the decision making of the personnel in charge of conducting job interviews in organizations, drawing on information systems, mathematical theories and psychology.

This paper is organized as follows: Sect. 2 presents the related works; Sect. 3, the sentiment analysis model using machine learning; Sect. 4, the experiments and results. The conclusion is presented last.

2 Related Works

There are some related works on sentiment analysis using machine learning:

Borth [13] proposed an approach based on understanding the visual concepts that are strongly related to sentiments, presenting a method built upon psychological theories and web mining to automatically construct a large-scale Visual Sentiment Ontology, with a detector library for visual sentiment analysis. Wang [14] proposed a visual sentiment analysis approach with coupled deep adjective and noun neural networks, which treats visual sentiment analysis as a binary prediction problem: classifying an image as positive or negative from its visual content. Baecchi [15] carried out a study that uses a multimodal feature learning approach, based on neural network models, to address sentiment analysis of micro-blogging content such as Twitter short messages, which are composed of a short text and, possibly, an image.

Zadeh [16] presented a neural-network-based model termed “tensor fusion network” for sentiment analysis, highlighting the growth of research in this area across multiple modalities and the use of machine learning. Similarly, Chinsatit [17] used a neural-network-based pupil center detection method for wearable gaze estimation, mentioning the applications this can have in several disciplines, including psychology. For their part, Poria [18] proposed a multimodal affective data analysis framework that extracts user opinion and emotions from video content by combining text, audio and video; the paper also presents an extensive study on decision-level fusion.

On the other hand, Chen [3] uses a convolutional neural network for the prediction of sentiments through the joint learning of textual and visual sentiments from training examples. George [19] also uses a convolutional neural network, proposing a real-time framework for the classification of eye gaze direction and the estimation of eye accessing cues.

3 Sentiment Analysis Model Using Machine Learning

In this research, a characterization of the approaches to sentiment analysis was first carried out, among which are: textual, voice, visual and multimodal. After this characterization, and given that the final validation was designed to be applied in job interviews, this work focuses on the visual approach. To do so, we register gaze positions through the coordinates of the center of the pupil, following a technique called eye tracking [20], which is the process of measuring the movement of an eye in relation to the head or to the point where the gaze is fixed (Fig. 1).

Fig. 1.
figure 1

Eye Accessing cues [21]

The model shown in Fig. 1 establishes that specific characteristics of the thinking mechanism relate to a non-visual orientation of the gaze [21]; a person's mental activity is related to the way he or she moves the eyes [22]. “According to a popular proverb, the eyes are the window of the soul. And, in fact, people have wondered for a long time if there is something in our eyes indicative of character” [23].

For this study, in addition, an exploration of the different machine learning algorithms used for sentiment analysis was carried out (Fig. 2).

Fig. 2.
figure 2

Approaches and algorithms for sentiment analysis [24].

After comparing the algorithms, the supervised machine learning approach with artificial neural network techniques was selected. The criterion was that people exhibit different eye behaviors when asked about something: the dwell time of the gaze on certain coordinates is not the same for everyone, nor are the coordinates themselves, so the relationship between the variables can be considered to follow a non-linear trend.

3.1 Architecture of the Artificial Neural Network

The prototype used in the sentiment analysis model proposed in this article enables six of the seven patterns (visual defocused, visual created, visual remembered, auditory created, auditory remembered, kinesthetic sensations, internal dialogue), excluding the visual defocused pattern.

Next, the architecture of the neural network proposed to validate the coordinates of the model is described. The input variables are the coordinates on the Cartesian plane (x, y) and the dwell time of the gaze. When a question is asked during the job interview at a certain time t, the person fixes his or her gaze towards certain places; once captured, the number of fixations on each of the coordinates is counted:

  • Number of looks towards visual defocused (#VD)

  • Number of looks towards visual remembered (#VR)

  • Number of looks towards visual created (#VC)

  • Number of looks towards auditory remembered (#AR)

  • Number of looks towards auditory created (#AC)

  • Number of looks towards internal dialogue (#DI)

  • Number of looks towards kinesthetic sensations (#KI)

The input data are normalized by Eq. (1):

$$ x^{\prime} = d_{1} + \frac{{\left( {x - x_{min} } \right)\left( {d_{2} - d_{1} } \right)}}{{x_{max} - x_{min} }} $$
(1)
where:

  • \( x \): value to normalize

  • \( \left[ {x_{min} ,\,x_{max} } \right] \): range of values of \( x \)

  • \( \left[ {d_{1} ,\,d_{2} } \right] \): range to which the value of \( x \) will be rescaled
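Equation (1) is standard min-max rescaling. A minimal sketch in Python (the default target range [0, 1] here is an illustrative choice, not one stated in the paper):

```python
def normalize(x, x_min, x_max, d1=0.0, d2=1.0):
    """Rescale x from [x_min, x_max] into [d1, d2], as in Eq. (1)."""
    return d1 + (x - x_min) * (d2 - d1) / (x_max - x_min)

# Examples: midpoint of [0, 10] maps to the midpoint of the target range.
normalize(5, 0, 10)            # -> 0.5
normalize(10, 0, 10, -1.0, 1.0)  # -> 1.0
```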

The outputs are multiple and correspond to the class of the coordinate, since the output is a multiclass classification layer (0, 1, 2, 3, 4, 5, 6 correspond to VD, VR, VC, AR, AC, DI, KI, respectively).

After the characterization of the machine learning algorithms, several neural network architectures were designed, combining different numbers of hidden layers, different numbers of nodes in those layers, different activation functions and different learning algorithms. A multilayer perceptron was selected with 2 inputs (plus bias), two hidden “sentiment processing” layers of 7 neurons each, and 7 outputs, since, as mentioned, the classification is multiple. The neurons are activated with the sigmoid activation function; as the error function, categorical cross-entropy with the Adam optimizer was chosen (Fig. 3).

Fig. 3.
figure 3

Architecture of the multilayer perceptron neural network used to validate the patterns [Own development].
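The 2-7-7-7 topology described above can be sketched as a forward pass in NumPy. The weights here are random placeholders, not the trained parameters, and a softmax output is shown to yield a proper class distribution (the paper pairs sigmoid activations with categorical cross-entropy); this is an illustrative sketch, not the authors' TensorFlow.js implementation:

```python
import numpy as np

rng = np.random.default_rng(0)

def sigmoid(z):
    return 1.0 / (1.0 + np.exp(-z))

def softmax(z):
    e = np.exp(z - z.max())
    return e / e.sum()

# Illustrative (untrained) weights for the 2-7-7-7 multilayer perceptron.
W1, b1 = rng.normal(size=(7, 2)), np.zeros(7)   # hidden layer 1
W2, b2 = rng.normal(size=(7, 7)), np.zeros(7)   # hidden layer 2
W3, b3 = rng.normal(size=(7, 7)), np.zeros(7)   # output layer, 7 classes

def forward(x):
    """Forward pass; x holds the two normalized inputs (coordinate, dwell time)."""
    h1 = sigmoid(W1 @ x + b1)
    h2 = sigmoid(W2 @ h1 + b2)
    return softmax(W3 @ h2 + b3)   # probabilities over VD, VR, VC, AR, AC, DI, KI

probs = forward(np.array([0.4, 0.7]))
```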

The model proposed in this paper performs visual sentiment analysis of people with machine learning, interpreting the eye accessing cues with the help of the eye-tracking technique. It is applied here to job interviews, in which an interviewee is asked questions of a personal nature and fixes his or her gaze towards certain coordinates that have a meaning according to the areas of study of psychology. With the results obtained from the gaze-fixation variables, the personnel in charge (usually from human resources) analyze those results and make their own decision about the candidate for the job.

3.2 Interview

The interview is based on the work of Costa [25] on the five (5) personality dimensions (Big Five): extraversion, kindness, responsibility, emotional stability and openness to experience. In Rauthmann [23], an eye-tracking study demonstrates with linear mixed models that personality predicts the number of fixations, the fixation duration and the dwell time on two different abstract animations. Hooft [26] investigates whether eye-tracking technology can improve the understanding of people's response processes when answering. For his part, Broz [27] collects gaze data from human conversational pairs in order to understand which characteristics of the interlocutors influence this behavior. These three (3) works made use of the Costa test [25].

Open questions based on the Costa test [25] are asked in order to evaluate each of the above dimensions, so that the person responding involuntarily projects their gaze towards certain coordinates according to their feelings. A human being is capable of manifesting three (3) feelings at the same time with his or her eye cues.

4 Experiments and Results

Next, the experiments and results for the sentiment analysis model are presented. For the experiment, the person should be asked not to wear glasses and not to turn the face to the sides, so that the webcam can constantly monitor the movement of the eyes.

4.1 Prototype

The prototype consists of a client–server system with a web graphical user interface (GUI) client, which captures the coordinates of the gaze and the time spent on those coordinates using a conventional web camera. Eye detection is performed through the webcam with a JavaScript library called WebGazer [28], which uses machine learning internally. The library is first calibrated; the more it is calibrated, the greater the precision of the detection of the gaze coordinates.

In Fig. 4, the user sits in front of a computer that has a conventional web camera capturing the movement of the eyes; this is reflected in the web graphical user interface on the screen of that computer, which captures the needed information. The collection of the variables is done in real time through the eye-tracking prototype and stored in a database for further processing.

Fig. 4.
figure 4

GUI - prototype for eye tracking [Own development].

Fig. 5.
figure 5

Coordinates of a matrix of the eye-chimera dataset, person looking at the center [Own development].

For each coordinate where the interviewee fixes the gaze, the absolute frequency (number of fixations on that coordinate) is calculated; then, the point (x, y) of the data set closest to the interpolation point IP, \( \left( {{ \hbox{min} }\left( {\left\| {X - IP} \right\|} \right)} \right) \), is selected, provided it belongs to the first three (3) largest absolute frequencies. The captured point (x, y) is scaled from the width and height (in pixels) of the browser window viewport to the size of the rectangular image of the eye area. A translation of Cartesian coordinates is then carried out, followed by normalization of the data with respect to the pixel size of the eye-region rectangle [29].
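The two operations above, scaling a viewport gaze point to the eye-region rectangle and selecting the point closest to an interpolation point, can be sketched as follows. The 35 × 18 px default comes from the eye-region size reported later in the paper; the function names are illustrative:

```python
import math

def scale_to_eye_region(x, y, viewport_w, viewport_h, eye_w=35, eye_h=18):
    """Rescale a gaze point from browser-viewport pixels to the eye-region rectangle."""
    return x * eye_w / viewport_w, y * eye_h / viewport_h

def nearest_point(points, ip):
    """Point of the data set closest to the interpolation point IP (min ||X - IP||)."""
    return min(points, key=lambda p: math.dist(p, ip))

# A point at the bottom-right viewport corner maps to the eye-rectangle corner.
scale_to_eye_region(640, 480, 640, 480)          # -> (35.0, 18.0)
nearest_point([(0, 0), (3, 4), (10, 10)], (2, 3))  # -> (3, 4)
```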

4.2 Evaluation

To train the neural network, we take the Eye-Chimera data set used in [21], transformed into coordinate matrices of the images. Each gaze (described in each image) is represented by a (14 × 2) matrix in which the rows are the coordinate points and the columns are the x and y axes. The first five (5) rows and the last two (2) are coordinates of the left eye; rows 6 to 10 are coordinates of the right eye. The pupil coordinates are rows 5 and 10; row 1 is the left end of the left eye, row 6 the left end of the right eye, row 11 the upper end of the left eye, and row 13 the upper end of the right eye (these are the variables of interest). These values are taken and normalized to a rectangle located in the eye region. The images of the data set have a size of 640 px wide by 480 px high, and the eye region is reduced to 35 × 18 px.
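The layout just described can be sketched with a (14 × 2) landmark matrix. The coordinate values below are invented for illustration (the real ones come from the Eye-Chimera images), and the paper's 1-based row numbers are converted to 0-based indices:

```python
import numpy as np

# Hypothetical 14x2 landmark matrix (rows = points, columns = x, y) for one
# 640x480 px Eye-Chimera image; values are illustrative only.
landmarks = np.zeros((14, 2))
landmarks[0] = (160, 240)   # row 1 (1-based): left end of the left eye
landmarks[4] = (200, 240)   # row 5: left pupil
landmarks[5] = (380, 240)   # row 6: left end of the right eye
landmarks[9] = (420, 240)   # row 10: right pupil

def normalize_to_eye_rect(landmarks, img_w=640, img_h=480, eye_w=35, eye_h=18):
    """Rescale landmark coordinates from image size to the 35x18 px eye rectangle."""
    scale = np.array([eye_w / img_w, eye_h / img_h])
    return landmarks * scale

pupils = normalize_to_eye_rect(landmarks)[[4, 9]]  # both pupil rows, rescaled
```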

In Fig. 5, two eyes are shown taking values from a matrix of dotted positions with the pupil in the center.

Fig. 6.
figure 6

End user interview interface. This is a question related to extroversion [Own development].

The eye coordinates were captured from five (5) volunteers, asking one personality question for each dimension of the Big Five (5 questions). The volunteers were only told to answer the questions that would appear on the computer screen: the gaze-capture interface was hidden from them, only the questions to be answered were displayed, and each volunteer clicked after answering verbally, without being told which personality dimension was being measured. At the end of the test, the resulting counts, as shown in Fig. 5, are presented for each question answered.

For each question, the absolute frequencies of fixation on the established coordinates are obtained; from these, the six (6) relative frequencies in each coordinate are calculated and the three (3) largest are taken (relative frequencies greater than 10%). The average duration of a fixation ranges between 200 and 350 ms [30].
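The selection of the three largest relative frequencies above the 10% threshold can be sketched as follows; the fixation counts in the example are invented for illustration:

```python
def top_fixation_patterns(abs_freq, threshold=0.10, k=3):
    """abs_freq: dict mapping pattern -> number of fixations for one question.
    Returns the k patterns with the largest relative frequencies above threshold,
    plus the full relative-frequency table."""
    total = sum(abs_freq.values())
    rel = {p: n / total for p, n in abs_freq.items()}
    major = sorted((p for p in rel if rel[p] > threshold),
                   key=lambda p: rel[p], reverse=True)
    return major[:k], rel

# Hypothetical counts for the six enabled coordinates on one question.
counts = {"VR": 12, "VC": 3, "AR": 9, "AC": 1, "DI": 6, "KI": 2}
top3, rel = top_fixation_patterns(counts)  # -> ["VR", "AR", "DI"]
```

VC falls just below the 10% threshold here (3/33 ≈ 9.1%), so only three patterns survive.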

Table 1 shows the results of the people evaluated and the corresponding sentiments obtained with the eye tracking:

Table 1. Results with absolute frequencies in the coordinates in the determined response time for the five (5) questions.

Each question aims to identify one dimension of the candidate for the job: question 1, extraversion; question 2, kindness; question 3, responsibility; question 4, emotional stability; question 5, openness to experience. The response time the person takes is not relevant to the verification results, although the absolute frequencies depend on it.

After training the model with the neural network shown in Fig. 3 on the Eye-Chimera data set (885 samples in total), the unlabeled results are processed; for this, the “TensorFlow.js” tool is used.

For each of the first three (3) largest ratios in the previous table, the point closest to the described curve is taken from the set of points in each of those three (3) coordinates.

Table 2 shows the validation matrix of the results, with the percentage of success or certainty for each coordinate. For each pattern to be checked, the neural network outputs a result whose corresponding value is highest with respect to the others.

Table 2. Validation of the results in the corresponding coordinates.

5 Conclusion

In this paper, we have presented a sentiment analysis model built with machine learning techniques, an emergent application area in many fields; here, it is applied to decision making in the job interview process to assess personality. There are several approaches to analyzing feelings, but in this case neural network algorithms were chosen because of the advantages analyzed with respect to the problem at hand.

With the results of the personality test for each question, the person in charge of deciding the future of the candidates for a job position (normally a psychologist or human resources professional) relies on the model, correlating the results of the variables, and then gives his or her final judgment in this regard.