Detecting Cyberbullying in Social Commentary Using Supervised Machine Learning

Raza, Muhammad Owais; Memon, Mohsin; Bhatti, Sania; Bux, Rahim

doi:10.1007/978-3-030-39442-4_45

Muhammad Owais Raza¹⁷,
Mohsin Memon¹⁷,
Sania Bhatti¹⁷ &
…
Rahim Bux¹⁷

Part of the book series: Advances in Intelligent Systems and Computing ((AISC,volume 1130))

Included in the following conference series:

Future of Information and Communication Conference

1363 Accesses
11 Citations

Abstract

This paper addresses the problem of cyberbullying on various online discussion forums in the form of social commentary. Here, supervised machine learning algorithms are employed to detect whether a particular comment is an insult, threat or a hate message. First of all, a machine learning model is developed with Logistic Regression, Random forest and naive bayes algorithms for classification and then, both Voting and AdaBoost classifiers are applied on the developed model to observe which works best in this case. In Japan, the members of PTA (Parent Teacher Association) perform net-petrol with a manual website monitoring in order to catch and stop cyberbullying activities; however, doing all this manually is very time consuming and hectic process. The main contribution of this paper includes a mechanism to detect cyberbullying and by using supervised machine learning with logistic regression algorithm, model has achieved an accuracy of 82.7%. With voting classifier, an accuracy of 84.4% was observed. The evaluation results show that voting classifier outperforms all other algorithms in detecting cyberbullying.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Log in via an institution

Chapter: USD 29.95; Price excludes VAT (USA)

eBook: USD 169.00; Price excludes VAT (USA)

Softcover Book: USD 219.99; Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

References

Detecting Insults in Social Commentary “Kaggle”, Kaggle.com (2019). https://www.kaggle.com/c/detecting-insults-in-socialcommentary/data. Accessed 09 Apr 2019
Nitta, T., et al.: Detecting cyberbullying entries on informal school websites based on category relevance maximization. In: Proceedings of the Sixth International Joint Conference on Natural Language Processing (2013)
Google Scholar
Reynolds, K., Kontostathis, A., Edwards, L.: Using machine learning to detect cyberbullying. In: 2011 10th International Conference on Machine learning and applications and workshops, vol. 2. IEEE (2011)
Google Scholar
Dadvar, M., et al.: Improved cyberbullying detection using gender information. In: Proceedings of the Twelfth Dutch-Belgian Information Retrieval Workshop (DIR 2012). University of Ghent (2012)
Google Scholar
Kontostathis, A., et al.: Detecting cyberbullying: query terms and techniques. In: Proceedings of the 5th Annual ACM Web Science Conference. ACM (2013)
Google Scholar
Dadvar, M., et al.: Improving cyberbullying detection with user context. In: European Conference on Information Retrieval. Springer, Berlin (2013)
Google Scholar
DeGregory, K.W., et al.: A review of machine learning in obesity. Obes. Rev. 19(5), 668–685 (2018)
Article Google Scholar
Wu, J.-Y., Hsiao, Y.-C., Nian, M.-W.: Using supervised machine learning on large-scale online forums to classify course-related Facebook messages in predicting learning achievement within the personal learning environment. In: Interactive Learning Environments, pp. 1–16 (2018)
Google Scholar
Balyan, R., McCarthy, K.S., McNamara, D.S.: Comparing machine learning classification approaches for predicting expository text difficulty. In: The Thirty-First International Flairs Conference (2018)
Google Scholar
Hoogeveen, D., et al.: Web forum retrieval and text analytics: a survey. Found. Trends® Inf. Retrieval 12(1), 1–163 (2018)
Article Google Scholar
Raisi, E., Huang, B.: Cyberbullying detection with weakly supervised machine learning. In: Proceedings of the 2017 IEEE/ACM International Conference on Advances in Social Networks Analysis and Mining 2017, pp. 409–416. ACM, July 2017
Google Scholar
Al-garadi, M.A., Varathan, K.D., Ravana, S.D.: Cybercrime detection in online communications: the experimental case of cyberbullying detection in the Twitter network. Comput. Hum. Behav. 63, 433–443 (2016)
Article Google Scholar
Randhawa, K., et al.: Credit card fraud detection using AdaBoost and majority voting. IEEE Access 6, 14277–14284 (2018)
Article Google Scholar
Voting Classifier. https://scikitl-earn.org/stable/modules/ensemble.html#voting-classifier. Accessed 24 Apr 2019
Ensemble Methods. Scikit. https://scikit-learn.org/stable/modules/ensemble.html#adaboost. Accessed 24 Apr 2019
Rahman, H.A.A., Wah, Y.B., He, H., Bulgiba, A.: Comparisons of ADABOOST, KNN, SVM and logistic regression in classification of imbalanced dataset. In: International Conference on Soft Computing in Data Science, pp. 54–64. Springer, Singapore, September 2015
Google Scholar

Download references

Author information

Authors and Affiliations

Department of Software Engineering, Mehran University of Engineering Technology, Jamshoro, Pakistan
Muhammad Owais Raza, Mohsin Memon, Sania Bhatti & Rahim Bux

Authors

Muhammad Owais Raza
View author publications
You can also search for this author in PubMed Google Scholar
Mohsin Memon
View author publications
You can also search for this author in PubMed Google Scholar
Sania Bhatti
View author publications
You can also search for this author in PubMed Google Scholar
Rahim Bux
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Muhammad Owais Raza .

Editor information

Editors and Affiliations

Faculty of Science and Engineering, Saga University, Saga, Japan
Kohei Arai
The Science and Information (SAI) Organization, Bradford, West Yorkshire, UK
Supriya Kapoor
The Science and Information (SAI) Organization, Bradford, West Yorkshire, UK
Rahul Bhatia

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Raza, M.O., Memon, M., Bhatti, S., Bux, R. (2020). Detecting Cyberbullying in Social Commentary Using Supervised Machine Learning. In: Arai, K., Kapoor, S., Bhatia, R. (eds) Advances in Information and Communication. FICC 2020. Advances in Intelligent Systems and Computing, vol 1130. Springer, Cham. https://doi.org/10.1007/978-3-030-39442-4_45

Download citation

DOI: https://doi.org/10.1007/978-3-030-39442-4_45
Published: 13 February 2020
Publisher Name: Springer, Cham
Print ISBN: 978-3-030-39441-7
Online ISBN: 978-3-030-39442-4
eBook Packages: Intelligent Technologies and RoboticsIntelligent Technologies and Robotics (R0)

Publish with us

Policies and ethics