Abstract
Laughter is an important social signal that conveys different emotions like happiness, sadness, anger, fear, surprise, and disgust. Therefore, detecting emotions in the laughter is useful for estimating the emotional state of the user. This paper presents work that detects the emotions in Iranian laughter by using audio features and running four machine learning algorithms, namely, Sequential Minimal Optimization (SMO), Multilayer Perceptron (MLP), Logistic, and Radial Basis Function Network (RBFNetwork). We extracted features such as intensity (minimum, maximum, mean, and standard deviation), energy, power, first 3 formants, and the first thirteen Mel Frequency Cepstral Coefficients. Two datasets are used: one that contains segments of full laughter episodes and one that contains only laughter onsets. Results indicate that MLP algorithm produce the highest rate of accuracy which is 86.1372% for first dataset and 85.0123% for second dataset. Besides, using the combination of MFCC and prosodic features led to better results. This means that recognition of emotions is possible at the start of laughter, which is useful for real-time applications.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Notes
- 1.
- 2.
- 3.
- 4.
- 5.
- 6.
- 7.
References
Eyben, F., Petridis, S., Schuller, B., Tzimiropoulos, G.: Audiovisual classification of vocal outbursts in human conversation using long-short-term memory networks. In: 2011 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), pp. 5844–5847. IEEE, Prague (2001)
Galvan, C., Manangan, D., Sanchez, M., Wong, J., Cu, J.: Audiovisual affect recognition in spontaneous filipino laughter. In: 2011 3rd International Conference on Knowledge and Systems Engineering (KSE), pp. 266–271. IEEE, Hanoi (2011)
Hamidi, M., Mansoorizade, M.: Emotion recognition from persian speech with neural network. Int. J. Artif. Intell. Appl. 5, 107–112 (2012)
Iliev, A.I., Zhang, Y., Scordilis, M.S.: Spoken emotion classification using ToBI features and GMM. In: 14th International Workshop on 2007 and 6th EURASIP Conference Focused on Speech and Image Processing, Multimedia Communications and Services, Systems, Signals and Image Processing, pp. 495–498. IEEE, Maribor (2007)
Kennedy, L.S., Ellis, D. P.: Laughter detection in meetings. In: NIST ICASSP (2004)
Kudiri, K.M., Verma, G.K., Gohel, B.: Relative amplitude based features for emotion detection from speech. In: 2010 International Conference on Signal and Image Processing (ICSIP), pp. 301–304. IEEE, Chennai (2010)
Miranda, M., Alonzo, J. A., Campita, J., Lucila, S., Suarez, M.: Discovering emotions in Filipino laughter using audio features. In: 2010 3rd International Conference on Human-Centric Computing (HumanCom), pp. 1–6. IEEE, Cebu (2010)
Oyang, Y.-J., Hwang, W., Ou, Y.-Y., Chen, C.-Y., Chen, Z.-W.: Data classification with radial basis function networks based on a novel kernel density estimation algorithm. IEEE Trans. Neural Netw. 16, 225–236 (2005). IEEE
Rodriguez, R.L., Ataollahi, F., Cabredo, R.: Modelling and detecting emotions of Filipino laughter using audio signal. In: 14th Philippine Computing Science Congress, pp. 168–174. Computing Society of the Philippines (2014)
Szameitat, D.P., Alter, K., Szameitat, A.J., Wildgruber, D., Sterr, A., Darwin, C.J.: Acoustic profiles of distinct emotional expressions in laughter. J. Acoust. Soc. Am. 126, 354–366 (2009)
Tanaka, H., Campbell, N.: Acoustic features of four types of laughter in natural conversational speech. In: ICPHS XVII, Hong Kong, pp. 1958–1961 (2011)
Truong, K.P., Van Leeuwen, D.A.: Automatic detection of laughter. In: INTERSPEECH, pp. 485–488 (2005)
Urbain, J., Bevacqua, E., Dutoit, T., Moinet, A., Niewiadomski, R.: The AVLaughterCycle database. In: LREC 2010, Malta, pp. 2996–3001 (2010)
Urbain, J., Niewiadomski, R., Bevacqua, E., Dutoit, T., Moinet, A.: The AVLaughterCycle. Multimodal User Interfaces 4, 47–58 (2010). Springer
Witten, I.H., Frank, E.: Data Mining – Practical Machine Learning Tools and Techniques, 3rd edn. Morgan Kaufmann, Burlington (2011)
Yap, B.W., Rani, K.A., Rahman, H.A.A., Fong, S., Khairudin, Z., Abdullah, N.N.: An application of oversampling, undersampling, bagging and boosting in handling imbalanced datasets. In: 1st International Conference on Advanced Data and Information Engineering (DaEng-2013), vol. 285, pp. 13–22. Springer, Singapore (2013)
Zare, S.: Home and away: blogging emotions in a Persian virtual Dowreh: a thesis presented in fulfilment of the requirements for the degree of Doctor of Philosophy in Linguistics and Second Language Teaching at Massey University. Massey University, Palmerston North (2011)
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2016 Springer International Publishing Switzerland
About this paper
Cite this paper
Ataollahi, F., Suarez, M.T. (2016). Comparing Affect Recognition in Peaks and Onset of Laughter. In: Baldoni, M., et al. Principles and Practice of Multi-Agent Systems. CMNA IWEC IWEC 2015 2015 2014. Lecture Notes in Computer Science(), vol 9935. Springer, Cham. https://doi.org/10.1007/978-3-319-46218-9_8
Download citation
DOI: https://doi.org/10.1007/978-3-319-46218-9_8
Published:
Publisher Name: Springer, Cham
Print ISBN: 978-3-319-46217-2
Online ISBN: 978-3-319-46218-9
eBook Packages: Computer ScienceComputer Science (R0)