Abstract
Emotions in continuous speech were analyzed using inverse filtering and a recently developed glottal flow parameter, the normalized amplitude quotient (NAQ). Simulated emotion portrayals were produced by 9 professional stage actors. Segments of the vowel /a:/ were separated from continuous speech. The segments were inverse filtered and parametrized using NAQ. Statistical analyses showed significant differences between most studied emotions. Results also showed clear gender differences. Inverse filtering together with NAQ was shown to be a suitable method for analysis of emotional content in continuous speech.
This is a preview of subscription content, log in via an institution.
Buying options
Tax calculation will be finalised at checkout
Purchases are for personal use only
Learn about institutional subscriptionsPreview
Unable to display preview. Download preview PDF.
References
Murray, I.R., Arnott, J.L.: Toward the simulation of emotion in synthetic speech: A review of the literature on human vocal emotion. Journal of Acoustical Society of America 93, 1097–1108 (1993)
Pollack, I., Rubenstein, H., Horowitz, A.: Communication of verbal modes of expression. Language and Speech 3, 121–130 (1960)
Scherer, K.R.: Vocal communication of emotion: A review of research paradigms. Speech Communication 40, 227–256 (2003)
Scherer, K.R.: Speech and Emotional States. In: Speech evaluation in psychiatry. Grune & Stratton, pp. 189–203 (1981)
Scherer, K.R.: Vocal affect expression: A review and a model for future research. Psychological Bulletin 99, 143–165 (1986)
Laver, J.: The phonetic description of voice quality. Cambridge University Press, Cambridge (1980)
Miller, R.L.: Nature of the vocal cord wave. Journal of Acoustical Society of America 31, 667–677 (1959)
Rothenberg, M.: A new inverse-filtering technique for deriving the glottal air flow waveform during voicing. Journal of Acoustical Society of America 53, 1632–1645 (1973)
Allen, J.B., Curtis, T.H.: Automatic extraction of glottal pulses by linear estimation. Journal of Acoustical Society of America 55, 396 (1973)
Mataušek, M.R., Batalov, V.S.: A new approach to the determination of the glottal waveform. IEEE Transactions on Acoustics, Speech, and Signal Processing 28, 616–622 (1980)
Alku, P.: Glottal wave analysis with pitch synchronous iterative adaptive inverse filtering. Speech Communication 11, 109–118 (1992)
Holmberg, E.B., Hillman, R.E., Perkell, J.S.: Glottal airflow and transglottal air pressure measurements for male and female speakers in soft, normal, and loud voice. Journal of Acoustical Society of America 84, 511–1787 (1988)
Childers, D.G., Lee, C.K.: Vocal quality factors: Analysis, synthesis, and perception. Journal of Acoustical Society of America 90, 2394–2410 (1991)
Fant, G., Liljencrants, J., Lin, Q.: A four-parameter model of glottal flow. STLQPSR, 1–13 (1985)
Gobl, C.: A preliminary study of acoustic voice quality correlates. STL-QPSR 4, 9–21 (1989)
Cummings, K.E., Clements, M.A.: Analysis of glottal waveforms across stress styles. In: Proceedings of IEEE International Conference on Acoustics, Speech, and Signal Processing, Albuquerque, New, Mexico, pp. 369–372 (1990)
Laukkanen, A.M., Vilkman, E., Alku, P., Oksanen, H.: Physical variations related to stress and emotional state: a preliminary study. Journal of Phonetics 24, 313–335 (1996)
Johnstone, T., Scherer, K.R.: The effects of emotions on voice quality. In: Processings of the XIVth International Congress of Phonetic Sciences, San Francisco, pp. 2029–2032 (1999)
Gobl, C., Ní Chasaide, A.: The role of voice quality in communicating emotion, mood and attitude. Speech Communication 40, 189–212 (2003)
Alku, P., Vilkman, E.: Estimation of the glottal pulseform based on discrete all-pole modeling. In: Proceedings of the International Conference on Spoken Language Processing, Yokohama, pp. 1619–1622 (1994)
Alku, P., Tiitinen, H., Näätänen, R.: A method for generating natural-sounding speech stimuli for cognitive brain research. Clinical Neurophysiology 110, 1329–1333 (1999)
Alku, P., Vilkman, E.: Amplitude domain quotient of the glottal volume velocity waveform estimated by inverse filtering. Speech Communication 18, 131–138 (1996)
Sundberg, J., Cleveland, T.F., Stone, R.E., Iwarsson, J.: Voice source characteristics in six premier country singers. Journal of Voice 13, 168–183 (1999)
Fant, G.: Some problems in voice source analysis. Speech Communication 13, 7–22 (1993)
Alku, P., Vilkman, E.: A comparison of glottal voice source quantification parameters in breathy, normal and pressed phonation of female and male speakers. Folia Phoniatrica et Logopaedica 48, 240–254 (1996)
Alku, P., Bäckström, T., Vilkman, E.: Normalized amplitude quotient for parametrization of the glottal flow. Journal of Acoustical Society of America 112, 701–710 (2002)
Bäckström, T., Alku, P., Vilkman, E.: Time-domain parametrization of the closing phase of glottal airflow waveform from voices over a large intensity range. IEEE Transactions on Speech and Audio Processing 10, 186–192 (2002)
Manstead, A.S.R.: Gender differences in emotion. In: Handbook of individual differences: Biological perspectives, Wiley, Chichester (1992)
Ekman, P.: An argument for basic emotions. Cognition and Emotion 6, 169–200 (1992)
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2004 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Airas, M., Alku, P. (2004). Emotions in Short Vowel Segments: Effects of the Glottal Flow as Reflected by the Normalized Amplitude Quotient. In: André, E., Dybkjær, L., Minker, W., Heisterkamp, P. (eds) Affective Dialogue Systems. ADS 2004. Lecture Notes in Computer Science(), vol 3068. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-540-24842-2_2
Download citation
DOI: https://doi.org/10.1007/978-3-540-24842-2_2
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-22143-2
Online ISBN: 978-3-540-24842-2
eBook Packages: Springer Book Archive