Comparative Performance of Machine Learning Algorithms for Fake News Detection

Bali, Arvinder Pal Singh; Fernandes, Mexson; Choubey, Sourabh; Goel, Mahima

doi:10.1007/978-981-13-9942-8_40

Arvinder Pal Singh Bali¹³,
Mexson Fernandes¹³,
Sourabh Choubey¹³ &
…
Mahima Goel¹³

Part of the book series: Communications in Computer and Information Science ((CCIS,volume 1046))

Included in the following conference series:

International Conference on Advances in Computing and Data Sciences

2170 Accesses
34 Citations

Abstract

Automatic detection of fake news, which could negatively affect individuals and the society, is an emerging research area attracting global attention. The problem has been approached in this paper from Natural Language Processing and Machine Learning perspectives. The evaluation is carried out for three standard datasets with a novel set of features extracted from the headlines and the contents. Performances of seven machine learning algorithms in terms of accuracies and F1 scores are compared. Gradient Boosting outperformed other classifiers with mean accuracy of 88% and F1-Score of 0.91.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Log in via an institution

Chapter: USD 29.95; Price excludes VAT (USA)

eBook: USD 84.99; Price excludes VAT (USA)

Softcover Book: USD 109.99; Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

References

Lazer, D., et al.: The science of fake news. Science 359(6380), 1094–1096 (2018)
Article Google Scholar
Vosoughi, S., Roy, D., Aral, S.: The spread of true and false news online. Science 359(6380), 1146–1151 (2018)
Article Google Scholar
Ahmed, H., Traore, I., Saad, S.: Detecting opinion spams and fake news using text classification. Secur. Priv. 1(1) (2017). https://onlinelibrary.wiley.com/doi/full/10.1002/spy2.9
Article Google Scholar
Shu, K., Sliva, A., Wang, S., Tang, J., Liu, H.: Fake news detection on social media: a data mining perspective. ACM SIGKDD Explor. Newsl. 19(1), 22–36 (2017). https://www.kdd.org/exploration_files/19-1-Article2.pdf
Article Google Scholar
Horne, B.D., Adali, S.: This just in: fake news packs a lot in title, uses simpler, repetitive content in text body, more similar to satire than real news. Paper Presented at: The 2nd International Workshop on News and Public Opinion at ICWSM; Montreal, Canada (2017). https://arxiv.org/abs/1703.09398
Horne, B.D., Khedr, S., Adali, S.: Sampling the news producers: a large news and feature data set for the study of the complex media landscape. In: Proceedings of the Twelfth International Conference on Web and Social Media, ICWSM 2018, Stanford, CA, USA, pp. 518–527 (2018)
Google Scholar
Baly, R., Karadzhov, G., Alexandrov, D., Glass, J., Nakov, P.: Predicting Factuality of Reporting and Bias of News Media Sources (2018). https://arxiv.org/abs/1810.01765
Pérez-Rosas, V., Kleinberg, B., Lefevre, A., Mihalcea, R.: Automatic detection of fake news. In: Proceedings of the 27th International Conference on Computational Linguistics, Santa Fe, New Mexico, USA, 20–26 August, pp. 3391–3401 (2018)
Google Scholar
Gilda, S.: Evaluating machine learning algorithms for fake news detection. In: 2017 IEEE 15th Student Conference on Research and Development (SCOReD), Putrajaya, pp. 110–115 (2017)
Google Scholar
Bajaj, S.: The Pope Has a New Baby! Fake News Detection Using Deep Learning. https://web.stanford.edu/class/cs224n/reports/2710385.pdf
http://news.mit.edu/2018/mit-csail-machine-learning-system-detects-fake-news-from-source-1004
Liu, Y., Wu, Y.-F.B.: Early detection of fake news on social media through propagation path, classification with recurrent and convolutional networks. In: AAAI Publications, Thirty-Second AAAI Conference on Artificial Intelligence (2018). https://aaai.org/ocs/index.php/AAAI/AAAI18/paper/view/16826
Shu, K., Wang, S., Liu, H.: Beyond news contents: the role of social context for fake news detection. In: WSDM 2019, 11–15 February (2019). http://www.public.asu.edu/~skai2/files/wsdm_2019_fake_news.pdf
Tacchini, E., Ballarin, G., Della Vedova, M.L., Moret, S., de Alfaro, L.: Some like it hoax: automated fake news detection in social networks. Technical report UCSC-SOE-17-05 School of Engineering, University of California, Santa Cruz (2017). https://www.soe.ucsc.edu/sites/default/files/technical-reports/UCSC-SOE-17-05.pdf
Opensource Dataset. http://www.opensources.co/
Kaggle Dataset. https://www.kaggle.com/jruvika/fake-news-detection
GitHub Repository. https://github.com/GeorgeMcIntire/fake_real_news_dataset
Pennington, J., Socher, R., Manning, C.D.: GloVe: Global Vectors for Word Representation (2014). https://nlp.stanford.edu/pubs/glove.pdf
https://www.ibm.com/cloud/watson-tone-analyzer
https://www.nltk.org/
Furnkranz, J., et al.: Case study in using linguistic phrases for text categorization on the WWW. In: AAAI Technical report WS-98: (1998). https://www.aaai.org/Papers/Workshops/1998/WS-98-05/WS98-05-002.pdf
Seki, Y.: Sentence extraction by tf-idf and position weighting from newspaper articles. In: Proceedings of the 3rd NTCIR Workshop, Tokyo (2002). http://research.nii.ac.jp/ntcir/workshop/OnlineProceedings3/NTCIR3-TSC-SekiY.pdf
Chen, T., Guestrin, C.: XGBoost: a scalable tree boosting system. In: Proceedings of the 22nd ACM SIGKDD, pp. 785–794 (2016)
Google Scholar
Alpaydın, E.: Introduction to Machine Learning, pp. 487–488, 2nd edn. MIT Press, Cambridge (2010)
Google Scholar

Download references

Acknowledgements

Comments on the paper by the anonymous reviewers were immensely helpful in revising the paper.

Author information

Authors and Affiliations

Asia Pacific Institute of Information Technology SD India, Panipat, 132103, Haryana, India
Arvinder Pal Singh Bali, Mexson Fernandes, Sourabh Choubey & Mahima Goel

Authors

Arvinder Pal Singh Bali
View author publications
You can also search for this author in PubMed Google Scholar
Mexson Fernandes
View author publications
You can also search for this author in PubMed Google Scholar
Sourabh Choubey
View author publications
You can also search for this author in PubMed Google Scholar
Mahima Goel
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Arvinder Pal Singh Bali .

Editor information

Editors and Affiliations

University of KwaZulu-Natal, Durban, South Africa
Mayank Singh
Computer Science and Engineering, Jaypee Institute of Information Technology, Waknaghat, Himachal Pradesh, India
P.K. Gupta
Department of Computer Science and Engineering, Jaypee University of Engineering and Technology, Guna, Madhya Pradesh, India
Vipin Tyagi
ÚTIA AV ČR, Institute of Information Theory and Automation, Prague 8, Praha, Czech Republic
Jan Flusser
School of Electrical Engineering and Computer Science, University of Ottawa, Ottawa, ON, Canada
Tuncer Ören
CSE Department, Inderprastha Engineering College, Ghaziabad, Uttar Pradesh, India
Rekha Kashyap

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Bali, A.P.S., Fernandes, M., Choubey, S., Goel, M. (2019). Comparative Performance of Machine Learning Algorithms for Fake News Detection. In: Singh, M., Gupta, P., Tyagi, V., Flusser, J., Ören, T., Kashyap, R. (eds) Advances in Computing and Data Sciences. ICACDS 2019. Communications in Computer and Information Science, vol 1046. Springer, Singapore. https://doi.org/10.1007/978-981-13-9942-8_40

Download citation

DOI: https://doi.org/10.1007/978-981-13-9942-8_40
Published: 19 July 2019
Publisher Name: Springer, Singapore
Print ISBN: 978-981-13-9941-1
Online ISBN: 978-981-13-9942-8
eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics