Skip to main content

Detecting Spam Community Using Retweeting Relationships – A Study on Sina Microblog

  • Conference paper
Behavior and Social Computing (BSIC 2013, BSI 2013)

Abstract

Microblog marketing is a new trend in social media. Spammers have been increasingly targeting such platforms to disseminate spam and promoting messages. Unlike the past behaviors on traditional media, they connect and support each other to perform spam tasks on microblogs. Therefore existing methods can’t be directly used for detecting spam community. In this paper, we examine the behaviors of spammers on Sina microblog, and obtain some observations about their activities rules. Then we extract content features from tweet text and behavior features from retweeting interactions, perform machine learning to build classification models and identify spammers on microblogs. We evaluate our generated feature set used for detecting spammers under three classification methods, including Naive Bayes, Decision Tree and SVM. Extensive experiments show that our proposed feature set can make the classifiers perform well, and the crawler program combining the SVM classifier can effectively detect spam community.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 39.99
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 54.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.

References

  1. Dasgupta, A., Gurevich, M., Punera, K.: Enhanced email spam filtering through combining similarity graphs. In: WSDM, pp. 785–794 (2011)

    Google Scholar 

  2. Cormack, G.V., Kolcz, A.: Spam filter evaluation with imprecise ground truth. In: SIGIR, pp. 604–611 (2009)

    Google Scholar 

  3. Wei Chang, M., Tau Yih, W., Meek, C.: Partitioned logistic regression for spam filtering. In: KDD, pp. 97–105 (2008)

    Google Scholar 

  4. Fette, I., Sadeh, N.M., Tomasic, A.: Learning to detect phishing emails. In: WWW, pp. 649–656 (2007)

    Google Scholar 

  5. Ott, M., Choi, Y., Cardie, C., Hancock, J.T.: Finding deceptive opinion spam by any stretch of the imagination. In: ACL, pp. 309–319 (2011)

    Google Scholar 

  6. Wang, G., Xie, S., Liu, B., Yu, P.S.: Review graph based online store review spammer detection. In: ICDM, pp. 1242–1247 (2011)

    Google Scholar 

  7. Lim, E.P., Nguyen, V.A., Jindal, N., Liu, B., Lauw, H.W.: Detecting product review spammers using rating behaviors. In: CIKM, pp. 939–948 (2010)

    Google Scholar 

  8. Benevenuto, F., Rodrigues, T., Almeida, V.A.F., Almeida, J.M., Gonçalves, M.A.: Detecting spammers and content promoters in online video social networks. In: SIGIR, pp. 620–627 (2009)

    Google Scholar 

  9. Yang, C., Harkreader, R.C., Zhang, J., Shin, S., Gu, G.: Analyzing spammers’ social networks for fun and profit: a case study of cyber criminal ecosystem on twitter. In: WWW, pp. 71–80 (2012)

    Google Scholar 

  10. Lee, K., Caverlee, J., Webb, S.: Uncovering social spammers: social honeypots + machine learning. In: SIGIR, pp. 435–442 (2010)

    Google Scholar 

  11. Cao, L.: In-depth behavior understanding and use: the behavior informatics approach. Information Sciences 180(17), 3067–3085 (2010)

    Article  Google Scholar 

  12. Jindal, N., Liu, B.: Opinion spam and analysis. In: WSDM, pp. 219–230 (2008)

    Google Scholar 

  13. Mukherjee, A., Liu, B., Glance, N.S.: Spotting fake reviewer groups in consumer reviews. In: WWW, pp. 191–200 (2012)

    Google Scholar 

  14. Xie, S., Wang, G., Lin, S., Yu, P.S.: Review spam detection via temporal pattern discovery. In: KDD, pp. 823–831 (2012)

    Google Scholar 

  15. Zhu, Y., Wang, X., Zhong, E., Liu, N.N., Li, H., Yang, Q.: Discovering spammers in social networks. In: AAAI (2012)

    Google Scholar 

  16. Liu, L., Jia, K.: Detecting spam in chinese microblogs - a study on sina weibo. In: CIS, pp. 578–581 (2012)

    Google Scholar 

  17. Zhang, X., Zhu, S., Liang, W.: Detecting spam and promoting campaigns in the twitter social network. In: ICDM, pp. 1194–1199 (2012)

    Google Scholar 

  18. Liao, Q., Shi, L.: She gets a sports car from our donation: rumor transmission in a chinese microblogging community. In: CSCW, pp. 587–598 (2013)

    Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2013 Springer International Publishing Switzerland

About this paper

Cite this paper

Zhao, B., Ji, G., Qu, W., Zhang, Z. (2013). Detecting Spam Community Using Retweeting Relationships – A Study on Sina Microblog. In: Cao, L., et al. Behavior and Social Computing. BSIC BSI 2013 2013. Lecture Notes in Computer Science(), vol 8178. Springer, Cham. https://doi.org/10.1007/978-3-319-04048-6_16

Download citation

  • DOI: https://doi.org/10.1007/978-3-319-04048-6_16

  • Publisher Name: Springer, Cham

  • Print ISBN: 978-3-319-04047-9

  • Online ISBN: 978-3-319-04048-6

  • eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics