Detecting Spam Community Using Retweeting Relationships – A Study on Sina Microblog

Zhao, Bin; Ji, Genlin; Qu, Weiguang; Zhang, Zhigang

doi:10.1007/978-3-319-04048-6_16

Part of the book series: Lecture Notes in Computer Science (LNAI, volume 8178)

Abstract

Microblog marketing is a new trend in social media, and spammers increasingly target these platforms to disseminate spam and promotional messages. Unlike spammers on traditional media, they connect with and support each other to carry out spam tasks on microblogs, so existing methods cannot be applied directly to detect spam communities. In this paper, we examine the behavior of spammers on Sina microblog and report observations about their activity patterns. We then extract content features from tweet text and behavior features from retweeting interactions, and use machine learning to build classification models that identify spammers on microblogs. We evaluate the resulting feature set with three classification methods: Naive Bayes, Decision Tree, and SVM. Extensive experiments show that the proposed feature set lets the classifiers perform well, and that a crawler program combined with the SVM classifier can effectively detect spam communities.
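The pipeline outlined in the abstract (per-account features, a classifier, and a crawler that follows retweeting relationships) might be sketched roughly as follows. This is an illustration only, not the authors' implementation: the two features, the threshold rule standing in for a trained SVM, the account data layout, and all function names are assumptions made for the example.

```python
import re
from collections import deque
from typing import Callable, Dict, List, Set

URL_RE = re.compile(r"https?://\S+")


def extract_features(tweets: List[str], retweets_of: List[str]) -> Dict[str, float]:
    """Compute two toy features for one account (both hypothetical)."""
    n = max(len(tweets), 1)
    return {
        # Content feature: share of posts whose text contains a URL.
        "url_ratio": sum(1 for t in tweets if URL_RE.search(t)) / n,
        # Behavior feature: share of posts that are retweets.
        "retweet_ratio": len(retweets_of) / n,
    }


def toy_classifier(features: Dict[str, float]) -> bool:
    """Stand-in for a trained model such as an SVM; thresholds are arbitrary."""
    return features["url_ratio"] >= 0.5 and features["retweet_ratio"] >= 0.5


def crawl_spam_community(
    seed: str,
    accounts: Dict[str, dict],
    is_spammer: Callable[[Dict[str, float]], bool],
) -> Set[str]:
    """Breadth-first crawl from a seed account along retweet edges.

    Only accounts the classifier flags as spammers are added to the
    community, and only their retweet sources are explored further.
    """
    community: Set[str] = set()
    seen = {seed}
    queue = deque([seed])
    while queue:
        user = queue.popleft()
        data = accounts.get(user)
        if data is None:
            continue  # account not available in the crawled data
        feats = extract_features(data["tweets"], data["retweets_of"])
        if not is_spammer(feats):
            continue  # do not expand through accounts judged legitimate
        community.add(user)
        for source in data["retweets_of"]:
            if source not in seen:
                seen.add(source)
                queue.append(source)
    return community


# Made-up accounts: "retweets_of" lists the source account of each retweet.
ACCOUNTS = {
    "a": {"tweets": ["buy now http://x.example", "deal http://y.example"],
          "retweets_of": ["b", "c"]},
    "b": {"tweets": ["cheap http://z.example", "hello"],
          "retweets_of": ["a"]},
    "c": {"tweets": ["nice weather", "lunch", "see http://news.example"],
          "retweets_of": []},
}

if __name__ == "__main__":
    print(sorted(crawl_spam_community("a", ACCOUNTS, toy_classifier)))
```

In a real setting the threshold rule would be replaced by a model trained on labeled accounts, and the feature set would be far richer than the two ratios used here.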

Author information

Authors and Affiliations

School of Computer Science and Technology, Nanjing Normal University, Nanjing, P.R. China
Bin Zhao, Genlin Ji, Weiguang Qu & Zhigang Zhang

Editor information

Editors and Affiliations

Advanced Analytics Institute, University of Technology, 2-12 Blackfriars Street, Chippendale, Blackfriars Campus, NSW 2008, Sydney, Australia
Longbing Cao
Institute of Scientific and Industrial Research, Osaka University, Japan
Hiroshi Motoda
Department of Computer Science, University of Minnesota, USA
Jaideep Srivastava
School of Information Systems, Singapore Management University, 80 Stamford Road, 178902, Singapore
Ee-Peng Lim
Department of Computer Science and Engineering, The Chinese University of Hong Kong, Shatin, N.T., Hong Kong
Irwin King
Department of Computer Science, University of Illinois at Chicago, 851 S. Morgan St., Rm 1138 SEO, 60607, Chicago, IL, USA
Philip S. Yu
Leibniz Universität Hannover, Germany
Wolfgang Nejdl
Advanced Analytics Institute, University of Technology, Sydney, Australia
Guandong Xu
Deakin University, Burwood, VIC, Australia
Gang Li
Shanghai Jiao Tong University, 200240, Shanghai, China
Ya Zhang

About this paper

Cite this paper

Zhao, B., Ji, G., Qu, W., Zhang, Z. (2013). Detecting Spam Community Using Retweeting Relationships – A Study on Sina Microblog. In: Cao, L., et al. Behavior and Social Computing. BSIC 2013, BSI 2013. Lecture Notes in Computer Science, vol 8178. Springer, Cham. https://doi.org/10.1007/978-3-319-04048-6_16

DOI: https://doi.org/10.1007/978-3-319-04048-6_16
Publisher Name: Springer, Cham
Print ISBN: 978-3-319-04047-9
Online ISBN: 978-3-319-04048-6
eBook Packages: Computer Science, Computer Science (R0)
