Abstract
Detecting depression in speech is a hot topic in recent years. Some inconsistent results in previous researches imply a few important influence factors are ignored. In this paper, we investigated a sample of 184 subjects (108 females, 76 males) to examine the influence of speaking style and emotional valence on depression detection. First, classification accuracy was used to measure the influence of these two factors. Then, two-way analysis of variance was employed to determine interactive acoustical features. Finally, normalized features by subtracting got higher classification accuracies. Results show that both speaking style and emotional valence are important factors. Spontaneous speech is better than automatic speech and neutral is the best choice among three emotional valences in depression detection. Normalized features improve the detection performance.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
References
Toh, W.L., Thomas, N., Rossell, S.L.: Auditory verbal hallucinations in bipolar disorder (BD) and major depressive disorder (MDD): a systematic review. J. Affect. Disord. 184, 18–28 (2015)
Zhang, Y., Zhang, C., Yuan, G., Yao, J., Cheng, Z., Liu, C., et al.: Effect of tryptophan hydroxylase-2 rs7305115 SNP on suicide attempts risk in major depression. Behav. Brain Funct. 6, 1 (2010)
Angeleri, F., Angeleri, V.A., Foschi, N., Giaquinto, S., Nolfe, G.: The influence of depression, social activity, and family stress on functional outcome after stroke. Stroke 24, 1478–1483 (1993)
Zumg, W., Richards, C., Short, M.: Self-rating depression scale in an outpatient clinic: further validation of the SDS. Arch. Gen. Psychiatry 13, 508–515 (1965)
Cohn, J.F., Kruez, T.S., Matthews, I., Yang, Y., Nguyen, M.H., Padilla, M.T., et al.: Detecting depression from facial actions and vocal prosody. In: 3rd International Conference on Affective Computing and Intelligent Interaction and Workshops. ACII 2009, pp. 1–7 (2009)
Cummins, N., Epps, J., Breakspear, M., Goecke, R.: An investigation of depressed speech detection: features and normalization. In: Interspeech, pp. 2997–3000 (2011)
Mundt, J.C., Snyder, P.J., Cannizzaro, M.S., Chappie, K., Geralts, D.S.: Voice acoustic measures of depression severity and treatment response collected via interactive voice response (IVR) technology. J. Neurolinguist. 20, 50–64 (2007)
Scherer, S., Stratou, G., Morency, L.-P.: Audiovisual behavior descriptors for depression assessment. In: Proceedings of the 15th ACM on International Conference on Multimodal Interaction, pp. 135–140 (2013)
Kupfer, D., Foster, F.G.: Interval between onset of sleep and rapid-eye-movement sleep as an indicator of depression. Lancet 300, 684–686 (1972)
Davidson, R.J., Pizzagalli, D., Nitschke, J.B., Putnam, K.: Depression: perspectives from affective neuroscience. Annu. Rev. Psychol. 53, 545–574 (2002)
Ozdas, A., Shiavi, R.G., Silverman, S.E., Silverman, M.K., Wilkes, D.M.: Investigation of vocal jitter and glottal flow spectrum as possible cues for depression and near-term suicidal risk. IEEE Trans. Biomed. Eng. 51, 1530–1540 (2004)
Mundt, J.C., Vogel, A.P., Feltner, D.E., Lenderking, W.R.: Vocal acoustic biomarkers of depression severity and treatment response. Biol. Psychiatry 72, 580–587 (2012)
Nilsonne, A., Sundberg, J., Ternstrom, S., Askenfelt, A.: Measuring the rate of change of voice fundamental frequency in fluent speech during mental depression. J. Acoust. Soc. Am. 83, 716–728 (1988)
Moore, E., Clements, M.A., Peifer, J.W., Weisser, L.: Critical analysis of the impact of glottal features in the classification of clinical depression in speech. IEEE Trans. Biomed. Eng. 55, 96–107 (2008)
Alghowinem, S., Goecke, R., Wagner, M., Epps, J., Gedeon, T., Breakspear, M., et al.: A comparative study of different classifiers for detecting depression from spontaneous speech. In: 2013 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), pp. 8022–8026 (2013)
Ooi, K.E.B., Lech, M., Allen, N.B.: Multichannel weighted speech classification system for prediction of major depression in adolescents. IEEE Trans. Biomed. Eng. 60, 497–506 (2013)
Cummins, N., Epps, J., Ambikairajah, E.: Spectro-temporal analysis of speech affected by depression and psychomotor retardation. In: 2013 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), pp. 7542–7546 (2013)
Kinnunen, T., Lee, K.-A., Li, H.: Dimension reduction of the modulation spectrogram for speaker verification. In: Odyssey, p. 30 (2008)
Breznitz, Z., Share, D.L.: Effects of accelerated reading rate on memory for text. J. Educ. Psychol. 84, 193 (1992)
Alpert, M., Pouget, E.R., Silva, R.R.: Reflections of depression in acoustic measures of the patient’s speech. J. Affect. Disord. 66, 59–69 (2001)
Calev, A., Nigal, D., Chazan, S.: Retrieval from semantic memory using meaningful and meaningless constructs by depressed, stable bipolar and manic patients. Br. J. Clin. Psychol. 28, 67–73 (1989)
Vanger, P., Summerfield, A.B., Rosen, B., Watson, J.: Effects of communication content on speech behavior of depressives. Compr. Psychiatry 33, 39–41 (1992)
Shankayi, R., Vali, M., Salimi, M., Malekshahi, M.: Identifying depressed from healthy cases using speech processing. In: 19th Iranian Conference of Biomedical Engineering (ICBME), pp. 191–194 (2012)
Kroenke, K., Spitzer, R.L., Williams, J.B.: The PHQ-9: validity of a brief depression severity measure. J. Gen. Intern. Med. 16, 606–613 (2001)
Hamilton, M.: A rating scale for depression. J. Neurol. Neurosurg. Psychiatry 23, 56–62 (1960)
Gong, X., Huang, Y., Wang, Y., Luo, Y.: Revision of the Chinese facial affective picture system. Chin. Ment. Health J. 25, 40–46 (2011)
Martinot, M.-L.P., Bragulat, V., Artiges, E., Dollé, F., Hinnen, F., Jouvent, R., et al.: Decreased presynaptic dopamine function in the left caudate of depressed patients with affective flattening and psychomotor retardation. Am. J. Psychiatry 158, 314–316 (2001)
Clark, L., Chamberlain, S.R., Sahakian, B.J.: Neurocognitive mechanisms in depression: implications for treatment. Annu. Rev. Neurosci. 32, 57–74 (2009)
Hönig, F., Batliner, A., Nöth, E., Schnieder, S., Krajewski, J.: Automatic modelling of depressed speech: relevant features and relevance of gender. In: Fifteenth Annual Conference of the International Speech Communication Association, pp. 1248–1252 (2014)
Smolak, L., Munstertieger, B.F.: The relationship of gender and voice to depression and eating disorders. Psychol. Women Q. 26, 234–241 (2002)
Low, L.S.A., Maddage, N.C., Lech, M., Sheeber, L.B., Allen, N.B.: Detection of clinical depression in adolescents’ speech during family interactions. IEEE Trans. Biomed. Eng. 58, 574–586 (2011)
Ellgring, H., Scherer, K.R.: Vocal indicators of mood change in depression. J. Nonverbal Behav. 20, 83–110 (1996)
Acknowledgments
This work was supported by the National Basic Research Program of China (973 Program) (No. 2014CB744600), the National Natural Science Foundation of China (Grant No. 61632014, No. 61210010), Program of Beijing Municipal Science & Technology Commission (No. Z171100000117005), the Program of International S&T Cooperation of MOST (No. 2013DFA11140). Grateful acknowledgement is made to: Xiang Gao, Tianyang Wang, Lihua Yan, Huanyu Kang, for experimental implementation.
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2017 Springer International Publishing AG
About this paper
Cite this paper
Liu, Z., Hu, B., Li, X., Liu, F., Wang, G., Yang, J. (2017). Detecting Depression in Speech Under Different Speaking Styles and Emotional Valences. In: Zeng, Y., et al. Brain Informatics. BI 2017. Lecture Notes in Computer Science(), vol 10654. Springer, Cham. https://doi.org/10.1007/978-3-319-70772-3_25
Download citation
DOI: https://doi.org/10.1007/978-3-319-70772-3_25
Published:
Publisher Name: Springer, Cham
Print ISBN: 978-3-319-70771-6
Online ISBN: 978-3-319-70772-3
eBook Packages: Computer ScienceComputer Science (R0)