Abstract
Microblog marketing is a new trend in social media. Spammers have been increasingly targeting such platforms to disseminate spam and promoting messages. Unlike the past behaviors on traditional media, they connect and support each other to perform spam tasks on microblogs. Therefore existing methods can’t be directly used for detecting spam community. In this paper, we examine the behaviors of spammers on Sina microblog, and obtain some observations about their activities rules. Then we extract content features from tweet text and behavior features from retweeting interactions, perform machine learning to build classification models and identify spammers on microblogs. We evaluate our generated feature set used for detecting spammers under three classification methods, including Naive Bayes, Decision Tree and SVM. Extensive experiments show that our proposed feature set can make the classifiers perform well, and the crawler program combining the SVM classifier can effectively detect spam community.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Preview
Unable to display preview. Download preview PDF.
References
Dasgupta, A., Gurevich, M., Punera, K.: Enhanced email spam filtering through combining similarity graphs. In: WSDM, pp. 785–794 (2011)
Cormack, G.V., Kolcz, A.: Spam filter evaluation with imprecise ground truth. In: SIGIR, pp. 604–611 (2009)
Wei Chang, M., Tau Yih, W., Meek, C.: Partitioned logistic regression for spam filtering. In: KDD, pp. 97–105 (2008)
Fette, I., Sadeh, N.M., Tomasic, A.: Learning to detect phishing emails. In: WWW, pp. 649–656 (2007)
Ott, M., Choi, Y., Cardie, C., Hancock, J.T.: Finding deceptive opinion spam by any stretch of the imagination. In: ACL, pp. 309–319 (2011)
Wang, G., Xie, S., Liu, B., Yu, P.S.: Review graph based online store review spammer detection. In: ICDM, pp. 1242–1247 (2011)
Lim, E.P., Nguyen, V.A., Jindal, N., Liu, B., Lauw, H.W.: Detecting product review spammers using rating behaviors. In: CIKM, pp. 939–948 (2010)
Benevenuto, F., Rodrigues, T., Almeida, V.A.F., Almeida, J.M., Gonçalves, M.A.: Detecting spammers and content promoters in online video social networks. In: SIGIR, pp. 620–627 (2009)
Yang, C., Harkreader, R.C., Zhang, J., Shin, S., Gu, G.: Analyzing spammers’ social networks for fun and profit: a case study of cyber criminal ecosystem on twitter. In: WWW, pp. 71–80 (2012)
Lee, K., Caverlee, J., Webb, S.: Uncovering social spammers: social honeypots + machine learning. In: SIGIR, pp. 435–442 (2010)
Cao, L.: In-depth behavior understanding and use: the behavior informatics approach. Information Sciences 180(17), 3067–3085 (2010)
Jindal, N., Liu, B.: Opinion spam and analysis. In: WSDM, pp. 219–230 (2008)
Mukherjee, A., Liu, B., Glance, N.S.: Spotting fake reviewer groups in consumer reviews. In: WWW, pp. 191–200 (2012)
Xie, S., Wang, G., Lin, S., Yu, P.S.: Review spam detection via temporal pattern discovery. In: KDD, pp. 823–831 (2012)
Zhu, Y., Wang, X., Zhong, E., Liu, N.N., Li, H., Yang, Q.: Discovering spammers in social networks. In: AAAI (2012)
Liu, L., Jia, K.: Detecting spam in chinese microblogs - a study on sina weibo. In: CIS, pp. 578–581 (2012)
Zhang, X., Zhu, S., Liang, W.: Detecting spam and promoting campaigns in the twitter social network. In: ICDM, pp. 1194–1199 (2012)
Liao, Q., Shi, L.: She gets a sports car from our donation: rumor transmission in a chinese microblogging community. In: CSCW, pp. 587–598 (2013)
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2013 Springer International Publishing Switzerland
About this paper
Cite this paper
Zhao, B., Ji, G., Qu, W., Zhang, Z. (2013). Detecting Spam Community Using Retweeting Relationships – A Study on Sina Microblog. In: Cao, L., et al. Behavior and Social Computing. BSIC BSI 2013 2013. Lecture Notes in Computer Science(), vol 8178. Springer, Cham. https://doi.org/10.1007/978-3-319-04048-6_16
Download citation
DOI: https://doi.org/10.1007/978-3-319-04048-6_16
Publisher Name: Springer, Cham
Print ISBN: 978-3-319-04047-9
Online ISBN: 978-3-319-04048-6
eBook Packages: Computer ScienceComputer Science (R0)