Detecting Spammers with Changing Strategies via a Transfer Distance Learning Method

Chen, Hao; Liu, Jun; Lv, Yanzhang

doi:10.1007/978-3-030-05090-0_24

Conference paper
First Online: 29 December 2018

Part of the book series: Lecture Notes in Computer Science (LNAI, volume 11323)

Abstract

Social spammers harm social networks, affecting both the sites themselves and their legitimate users, and there is broad agreement that they should be detected and filtered. Existing social spammer detection approaches mainly focus on discovering discriminative features and organizing them effectively, e.g., by combining multiple features, to improve detection performance. However, spammers can easily evade detection by changing their spamming strategies. Varying strategies cause the data distribution to differ between training and testing data, so previous fixed approaches struggle to achieve the desired performance in real applications. To address this, we present a transfer distance learning approach that combines distance learning and transfer learning to extract informative knowledge underlying training and testing instances in a unified framework. The proposed approach is validated on large real-world data. Empirical results provide evidence that our method detects spammers with changing spamming strategies efficiently.
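The abstract does not state the formulation, so the following is only a minimal sketch of the two generic ingredients it names, not the authors' algorithm. It assumes a Mahalanobis-style distance for the "distance learning" part and a kernel density-ratio weight for the train/test distribution difference; the function names `mahalanobis`, `gaussian_kde`, and `importance_weights`, the `bandwidth` parameter, and the toy data are all hypothetical.

```python
import math


def mahalanobis(x, y, M):
    """Return sqrt((x - y)^T M (x - y)) for a symmetric PSD matrix M.

    With M equal to the identity this is the Euclidean distance; a
    distance-learning method would fit M from data instead (not shown).
    """
    diff = [a - b for a, b in zip(x, y)]
    n = len(diff)
    quad = sum(diff[i] * M[i][j] * diff[j] for i in range(n) for j in range(n))
    return math.sqrt(max(quad, 0.0))  # clamp tiny negative rounding error


def gaussian_kde(x, points, bandwidth):
    """Unnormalised Gaussian kernel density estimate of `points` at x."""
    total = 0.0
    for p in points:
        sq_dist = sum((a - b) ** 2 for a, b in zip(x, p))
        total += math.exp(-sq_dist / (2.0 * bandwidth ** 2))
    return total / len(points)


def importance_weights(train, test, bandwidth=1.0):
    """Crude estimate of p_test(x) / p_train(x) for every training point.

    Training points that look like the test data receive larger weights.
    The normalising constants of the two estimates cancel in the ratio.
    """
    return [
        gaussian_kde(x, test, bandwidth) / gaussian_kde(x, train, bandwidth)
        for x in train
    ]


# Toy usage with made-up one- and two-dimensional features.
identity = [[1.0, 0.0], [0.0, 1.0]]
print(mahalanobis([0.0, 0.0], [3.0, 4.0], identity))  # 5.0

train = [[0.0], [1.0], [5.0]]
test = [[4.5], [5.0], [5.5]]
print(importance_weights(train, test))
```

In this toy setup the training point at 5.0 lies inside the test cluster and so receives the largest weight, while the points at 0.0 and 1.0 are strongly down-weighted. How such weights and a learned metric are combined in a unified framework is the subject of the paper itself and is not reproduced here.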

Author information

Authors and Affiliations

National Engineering Lab for Big Data Analytics, Xi’an Jiaotong University, Xi’an, 710049, Shaanxi, China
Hao Chen & Yanzhang Lv
School of Electronic and Information Engineering, Xi’an Jiaotong University, Xi’an, 710049, Shaanxi, China
Hao Chen & Jun Liu
Shaanxi Province Key Laboratory of Satellite and Terrestrial Network Tech. R&D, Xi’an, China
Jun Liu & Yanzhang Lv

Corresponding author

Correspondence to Hao Chen.

Editor information

Editors and Affiliations

University of Connecticut, Storrs, CT, USA
Guojun Gan
Nanjing University of Aeronautics and Astronautics, Nanjing, China
Bohan Li
The University of Queensland, Brisbane, QLD, Australia
Xue Li
Beijing Institute of Technology, Beijing, China
Shuliang Wang

About this paper

Cite this paper

Chen, H., Liu, J., Lv, Y. (2018). Detecting Spammers with Changing Strategies via a Transfer Distance Learning Method. In: Gan, G., Li, B., Li, X., Wang, S. (eds) Advanced Data Mining and Applications. ADMA 2018. Lecture Notes in Computer Science, vol 11323. Springer, Cham. https://doi.org/10.1007/978-3-030-05090-0_24

DOI: https://doi.org/10.1007/978-3-030-05090-0_24
Published: 29 December 2018
Publisher Name: Springer, Cham
Print ISBN: 978-3-030-05089-4
Online ISBN: 978-3-030-05090-0
eBook Packages: Computer Science, Computer Science (R0)
