Abstract
Nowadays, derogatory comments are often made by one another, not only in offline environment but also immensely in online environments like social networking websites and online communities. So, an Identification combined with Prevention System in all social networking websites and applications, including all the communities, existing in the digital world is a necessity. In such a system, the Identification Block should identify any negative online behavior and should signal the Prevention Block to take action accordingly. This study aims to analyze any piece of text and detect different types of toxicity like obscenity, threats, insults and identity-based hatred. The labeled Wikipedia Comment Dataset prepared by Jigsaw is used for the purpose. A 6-headed Machine Learning tf–idf Model has been made and trained separately, yielding a Mean Validation Accuracy of 98.08% and Absolute Validation Accuracy of 91.64%. Such an Automated System should be deployed for enhancing the healthy online conversation.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
References
Coversation AI Team. https://conversationai.github.io/
Perspective API. https://perspectiveapi.com/#/
Georgakopoulos, S.V., Tasoulis, S.K., Vrahatis, A.G., Plagianakos, V.P.: Convolutional neural networks for toxic comment classification. In: 10th Hellenic Conference on Artificial Intelligence (2018)
Khieu, K., Narwal, N.: Detecting and classifying toxic comments. https://web.stanford.edu/class/cs224n/reports/6837517.pdf
Chu, T., Jue, K., Wang, M.: “Comment abuse classification with deep learning. https://web.stanford.edu/class/cs224n/reports/2762092.pdf
Kohli, M., Kuehler, E., Palowitch, J.: Paying attention to toxic comments online. https://web.stanford.edu/class/cs224n/reports/6856482.pdf
https://www.kaggle.com/c/jigsaw-toxic-comment-classification-challenge/data
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2020 Springer Nature Singapore Pte Ltd.
About this paper
Cite this paper
Chakrabarty, N. (2020). A Machine Learning Approach to Comment Toxicity Classification. In: Das, A., Nayak, J., Naik, B., Pati, S., Pelusi, D. (eds) Computational Intelligence in Pattern Recognition. Advances in Intelligent Systems and Computing, vol 999. Springer, Singapore. https://doi.org/10.1007/978-981-13-9042-5_16
Download citation
DOI: https://doi.org/10.1007/978-981-13-9042-5_16
Published:
Publisher Name: Springer, Singapore
Print ISBN: 978-981-13-9041-8
Online ISBN: 978-981-13-9042-5
eBook Packages: Intelligent Technologies and RoboticsIntelligent Technologies and Robotics (R0)