Advertisement

Automatic Recognition of Emotional State in Polish Speech

  • Piotr Staroniewicz
Part of the Lecture Notes in Computer Science book series (LNCS, volume 6456)

Abstract

The paper presents the comparison of scores for emotional state automatic recognition tests. The database of Polish emotional speech used during tests includes recordings of six acted emotional states (anger, sadness, happiness, fear, disgust, surprise) and the neutral state of 13 amateur speakers (2118 utterances). The features based on F0, intensity, formants and LPC coefficients were applied in seven chosen classifiers. The highest scores were reached for SVM, ANN and DTC classifiers.

Keywords

emotional speech emotional state recognition 

Preview

Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.

References

  1. 1.
    Cowie, R.: Describing the Emotional States Expressed in Speech. In: Proc. of ISCA, Belfast 2000, pp. 11–18 (2000)Google Scholar
  2. 2.
    Scherer, K.R.: Vocal communications of emotion: A review of research paradigms. Speech Communication 40, 227–256 (2003)CrossRefzbMATHGoogle Scholar
  3. 3.
    Burkhard, F., Paeschkhe, A., Rolfes, M., Sendlmeier, W., Weiss, B.: A Database of German Emotional Speech. In: Proc. of Interspeech 2005, Lissabon, Portugal (2005)Google Scholar
  4. 4.
    Lugger, M., Yang, B.: The Relevance of Voice Quality Features in Speaker Independent Emotion Recognition. In: IEEE ICASSP 2007, USA, IV, pp. 17–20 (2007)Google Scholar
  5. 5.
    Staroniewicz, P., Majewski, W.: Polish Emotional Speech Database – Recording and Preliminary Validation. In: Esposito, A., Vích, R. (eds.) Cross-Modal Analysis of Speech, Gestures, Gaze and Facial Expressions. LNCS(LNAI), vol. 5641, pp. 42–49. Springer, Heidelberg (2009)CrossRefGoogle Scholar
  6. 6.
    Staroniewicz, P.: Polish emotional speech database–design. In: Proc. of 55th Open Seminar on Acoustics, Wroclaw, Poland, pp. 373–378 (2008)Google Scholar
  7. 7.
    Staroniewicz, P.: Recognition of Emotional State in Polish Speech – Comparison between Human and Automatic Efficiency. In: Fierrez, J., Ortega-Garcia, J., Esposito, A., Drygajlo, A., Faundez-Zanuy, M. (eds.) BioID MultiComm 2009. LNCS, vol. 5707, pp. 33–40. Springer, Heidelberg (2009)CrossRefGoogle Scholar
  8. 8.
    Douglas-Cowie, E., Campbell, N., Cowie, R., Roach, P.: Emotional speech: Towards a new generation of databases. Speech Communication 40, 33–60 (2003)CrossRefzbMATHGoogle Scholar
  9. 9.
    Ververdis, D., Kotropoulos, C.: A State of the Art on Emotional Speech Databases. In: Proc. of 1st Richmedia Conf., Laussane, Switzerland, pp. 109–119 ( October 2003)Google Scholar
  10. 10.
    Vapnik, V.N.: Statistical Learning Theory. Wiley, Chichester (1998)zbMATHGoogle Scholar
  11. 11.
    Kwon, O., Chan, K., Hao, J., Lee, T.: Emotion Recognition by Speech Signals. In: Eurospeech, Geneva, Switzerland, Septmeber 1-3 (2003)Google Scholar
  12. 12.
    Zhou, J., Wang, G., Yang, Y., Chen, P.: Speech emotion recognition based on rough set and SVM. In: 5th IEEE International Conference on Cognitive Informatics, ICCI 2006, Beijing, July 2006, vol. 1, pp. 53–61 (2006)Google Scholar
  13. 13.
    Lee, C.M., Narayanam, S.S., Pieraccini, R.: Classifuing Emotions in Human-Machine Spoken Dialogs. In: Proc. Int. Conf., on Multimedia Expo. (2002)Google Scholar
  14. 14.
    Hsu, Ch.W., Chang, Ch.-Ch., Lin, Ch.-J.: A Practical Guide to Support Vector Classification. Department of Computer Science, National Taiwan University (2008), http://www.csie.ntu.edu.tw/~cjlin (last updated: May 21, 2008)
  15. 15.
    Chang, Ch.-Ch., Lin, Ch.-J.: LIBSVM: a Library for Support Vector Machines (2001), http://www.csie.ntu.edu.tw/~cjlin/libsvm/
  16. 16.
    Witten, I.H., Frank, E.: Data Mining: Practical machine learning tools and techniques, 2nd edn. Morgan Kauffmann, San Francisco (2005)zbMATHGoogle Scholar
  17. 17.
    Breiman, L.: Random Forests. Machine Learning 45, 5–32 (2001)CrossRefzbMATHGoogle Scholar
  18. 18.
    Witten, I.H., Frank, E.: Data Mining. Practical Machine Learning Tools and Techniques. Elsevier, Amsterdam (2005)zbMATHGoogle Scholar
  19. 19.
    COST Action 2102, Cross -Modal Analysis of Verbal and Non-verbal Communication, Memorandum of Understanding, Brussels (July 11, 2006)Google Scholar

Copyright information

© Springer-Verlag Berlin Heidelberg 2011

Authors and Affiliations

  • Piotr Staroniewicz
    • 1
  1. 1.Institute of Telecommunications, Teleinformatics and AcousticsWroclaw University of TechnologyWroclawPoland

Personalised recommendations