PUD: Social Spammer Detection Based on PU Learning

Song, Yuqi; Gao, Min; Yu, Junliang; Li, Wentao; Wen, Junhao; Xiong, Qingyu

doi:10.1007/978-3-319-70139-4_18

Yuqi Song^18,19,
Min Gao^18,19,
Junliang Yu^18,19,
Wentao Li²⁰,
Junhao Wen^18,19 &
…
Qingyu Xiong^18,19

Part of the book series: Lecture Notes in Computer Science ((LNTCS,volume 10638))

Included in the following conference series:

International Conference on Neural Information Processing

4677 Accesses
1 Citations

Abstract

Social networks act as the communication channels for people to share various information online. However, spammers who generate spam information reduce the satisfaction of common users. Numerous notable studies have been done to detect social spammers, and these methods can be categorized into three types: unsupervised, supervised and semi-supervised methods. While the performance of supervised and semi-supervised methods is superior in terms of detection accuracy, these methods usually suffer from the dilemma of imbalanced data since the labeled normal users are far more than spammers in real situations. To address the problem, we propose a novel method only relying on normal users to detect spammers. Firstly, a classifier is built from a part of normal and unlabeled samples to pick out reliable spammers from unlabeled samples. Secondly, our well-trained detector, which is based on the given normal users and predicted spammers, can distinguish between normal users and spammers. Experiments conducted on real-world datasets show that the proposed method is competitive with supervised methods.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Log in via an institution

Institutional subscriptions

References

Benevenuto, F., Magno, G., Rodrigues, T., Almeida, V.: Detecting spammers on Twitter. In: Collaboration, Electronic Messaging, Anti-abuse and Spam Conference (CEAS), vol. 6, p. 12 (2010)
Google Scholar
Gao, H., Hu, J., Wilson, C., Li, Z., Chen, Y., Zhao, B.Y.: Detecting and characterizing social spam campaigns. In: Proceedings of 10th ACM SIGCOMM conference on Internet measurement, pp. 35–47. ACM (2010)
Google Scholar
Tan, E., Guo, L., Chen, S., Zhang, X., Zhao, Y.: Unik: unsupervised social network spam detection. In: Proceedings of 22nd ACM international conference on Information & Knowledge Management, pp. 479–488. ACM (2013)
Google Scholar
Zhang, B., Qian, T., Chen, Y., You, Z.: Social spammer detection via structural properties in ego network. In: Li, Y., Xiang, G., Lin, H., Wang, M. (eds.) SMP 2016. CCIS, vol. 669, pp. 245–256. Springer, Singapore (2016). doi:10.1007/978-981-10-2993-6_21
Chapter Google Scholar
Benevenuto, F., Rodrigues, T., Almeida, V., Almeida, J., Gonçalves, M.: Detecting spammers and content promoters in online video social networks. In: Proceedings of 32nd International ACM SIGIR Conference on Research and Development in Information Retrieval, pp. 620–627. ACM (2009)
Google Scholar
Hu, X., Tang, J., Zhang, Y., Liu, H.: Social spammer detection in microblogging. In: IJCAI, vol. 13, pp. 2633–2639. Citeseer (2013)
Google Scholar
Wu, L., Hu, X., Morstatter, F., Liu, H.: Adaptive spammer detection with sparse group modeling. In: ICWSM, p. 319–326 (2017)
Google Scholar
Wu, Z., Wang, Y., Wang, Y., Wu, J., Cao, J., Zhang, L.: Spammers detection from product reviews: a hybrid model. In: 2015 IEEE International Conference on, Data Mining (ICDM), pp. 1039–1044. IEEE (2015)
Google Scholar
Li, W., Gao, M., Rong, W., Wen, J., Xiong, Q., Ling, B.: LSSL-SSD: social spammer detection with laplacian score and semi-supervised learning. In: Lehner, F., Fteimi, N. (eds.) KSEM 2016. LNCS, vol. 9983, pp. 439–450. Springer, Cham (2016). doi:10.1007/978-3-319-47650-6_35
Chapter Google Scholar
Liu, B., Dai, Y., Li, X., Lee, W.S., Yu, P.S.: Building text classifiers using positive and unlabeled examples. In: 3rd IEEE International Conference on Data Mining, ICDM 2003, pp. 179–186. IEEE (2003)
Google Scholar

Download references

Acknowledgments

The work is supported by the Basic and Advanced Research Projects in Chongqing under Grant No. cstc2015jcyjA40049, the National Key Basic Research Program of China (973) under Grant No. 2013CB328903, the Guangxi Science and Technology Major Project under Grant No. GKAA17129002, and the Graduate Scientific Research and Innovation Foundation of Chongqing, China under Grant No. CYS17035.

Author information

Authors and Affiliations

Key Laboratory of Dependable Service Computing in Cyber Physical Society, Chongqing University, Ministry of Education, Chongqing, China
Yuqi Song, Min Gao, Junliang Yu, Junhao Wen & Qingyu Xiong
School of Software Engineering, Chongqing University, Chongqing, China
Yuqi Song, Min Gao, Junliang Yu, Junhao Wen & Qingyu Xiong
Faculty of Engineering and Information Technology, Centre for Artificial Intelligence, School of Software, University of Technology Sydney, Ultimo, Australia
Wentao Li

Authors

Yuqi Song
View author publications
You can also search for this author in PubMed Google Scholar
Min Gao
View author publications
You can also search for this author in PubMed Google Scholar
Junliang Yu
View author publications
You can also search for this author in PubMed Google Scholar
Wentao Li
View author publications
You can also search for this author in PubMed Google Scholar
Junhao Wen
View author publications
You can also search for this author in PubMed Google Scholar
Qingyu Xiong
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Min Gao .

Editor information

Editors and Affiliations

Guangdong University of Technology, Guangzhou, China
Derong Liu
Guangdong University of Technology, Guangzhou, China
Shengli Xie
South China University of Technology, Guangzhou, China
Yuanqing Li
Institute of Automation, Chinese Academy of Sciences, Beijing, China
Dongbin Zhao
King Fahd University of Petroleum and Minerals, Dhahran, Saudi Arabia
El-Sayed M. El-Alfy

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Song, Y., Gao, M., Yu, J., Li, W., Wen, J., Xiong, Q. (2017). PUD: Social Spammer Detection Based on PU Learning. In: Liu, D., Xie, S., Li, Y., Zhao, D., El-Alfy, ES. (eds) Neural Information Processing. ICONIP 2017. Lecture Notes in Computer Science(), vol 10638. Springer, Cham. https://doi.org/10.1007/978-3-319-70139-4_18

Download citation

DOI: https://doi.org/10.1007/978-3-319-70139-4_18
Published: 29 October 2017
Publisher Name: Springer, Cham
Print ISBN: 978-3-319-70138-7
Online ISBN: 978-3-319-70139-4
eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics