Detecting Human Emotion via Speech Recognition by Using Ensemble Classification Model

Prasomphan, Sathit; Doungwichain, Surinee

doi:10.1007/978-3-319-98752-1_8

Part of the book series: Lecture Notes of the Institute for Computer Sciences, Social Informatics and Telecommunications Engineering (LNICST, volume 248)

Included in the following conference series:

International Conference on Big Data Technologies and Applications

Abstract

Speech emotion recognition is one of the most challenging research problems in the field of Human-Computer Interaction (HCI). The accuracy of emotion detection depends on several factors, for example the type and number of emotions to be classified and the quality of the speech. In this research, we introduce a process for detecting four emotion types (anger, happy, neutral, and sad) in Thai speech recorded from Thai drama shows, which closely resembles everyday speech. The proposed algorithm combines a Support Vector Machine, a Neural Network, and k-Nearest Neighbors in an ensemble classifier that uses weighted majority voting. The experimental results show that the ensemble with weighted majority voting gives better accuracy than a single model. The proposed method performs best when it uses the fundamental frequency (F0) and Mel-frequency cepstral coefficients (MFCC) of the speech as features, reaching an accuracy of 70.69%.
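The voting step described in the abstract can be illustrated with a minimal sketch. It assumes that each of the three base classifiers has already produced one label for an utterance and that each classifier carries a single non-negative weight (for instance its accuracy on a held-out validation set). The function name, the classifier keys, and all weight values are hypothetical and are not taken from the chapter; the chapter's actual weighting scheme is not described in the abstract.

```python
from collections import defaultdict


def weighted_majority_vote(predictions, weights):
    """Return the label whose supporting classifiers have the largest total weight.

    predictions: dict mapping classifier name -> predicted label
    weights:     dict mapping classifier name -> non-negative weight
    """
    if not predictions:
        raise ValueError("at least one prediction is required")
    scores = defaultdict(float)
    for name, label in predictions.items():
        scores[label] += weights[name]
    # max() returns the first maximal key, so ties go to the label seen first.
    return max(scores, key=scores.get)


# Hypothetical predictions for one utterance (illustrative only).
votes = {"svm": "anger", "nn": "sad", "knn": "sad"}

# Hypothetical weights: similar reliability, so the two agreeing models win.
balanced = {"svm": 0.66, "nn": 0.62, "knn": 0.58}
print(weighted_majority_vote(votes, balanced))

# Hypothetical weights: one much stronger model outvotes the other two.
svm_heavy = {"svm": 0.9, "nn": 0.4, "knn": 0.4}
print(weighted_majority_vote(votes, svm_heavy))
```

With the first set of weights the label backed by two classifiers wins, as in plain majority voting. With the second set the single heavily weighted classifier prevails, which is the behaviour that distinguishes weighted voting from a simple head count.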

Acknowledgment

This research was funded by King Mongkut’s University of Technology North Bangkok, contract no. KMUTNB-58-GEN-048.

Author information

Authors and Affiliations

Department of Computer and Information Science, Faculty of Applied Science, King Mongkut’s University of Technology North Bangkok, 1518 Pracharat 1 Road, Wongsawang, Bangsue, Bangkok, 10800, Thailand
Sathit Prasomphan & Surinee Doungwichain

Corresponding author

Correspondence to Sathit Prasomphan.

Editor information

Editors and Affiliations

Chung-Ang University, Seoul, Korea (Republic of)
Jason J. Jung
Chosun University, Gwangju, Korea (Republic of)
Pankoo Kim
Department of Computer Engineering, Chung-Ang University, Seoul, Korea (Republic of)
Kwang Nam Choi

About this paper

Cite this paper

Prasomphan, S., Doungwichain, S. (2018). Detecting Human Emotion via Speech Recognition by Using Ensemble Classification Model. In: Jung, J., Kim, P., Choi, K. (eds) Big Data Technologies and Applications. BDTA 2017. Lecture Notes of the Institute for Computer Sciences, Social Informatics and Telecommunications Engineering, vol 248. Springer, Cham. https://doi.org/10.1007/978-3-319-98752-1_8

DOI: https://doi.org/10.1007/978-3-319-98752-1_8
Published: 09 November 2018
Publisher Name: Springer, Cham
Print ISBN: 978-3-319-98751-4
Online ISBN: 978-3-319-98752-1
eBook Packages: Computer Science; Computer Science (R0)
