Discovering Opinion Spammer Groups by Network Footprints

Ye, Junting; Akoglu, Leman

doi:10.1007/978-3-319-23528-8_17

Part of the book series: Lecture Notes in Computer Science (LNAI, volume 9284)

Included in the following conference series:

Joint European Conference on Machine Learning and Knowledge Discovery in Databases

Abstract

Online reviews are an important source for consumers evaluating products and services on the Internet (e.g., Amazon, Yelp). However, a growing number of fraudulent reviewers write fake reviews to mislead users. To maximize their impact and share effort, many spam attacks are organized as campaigns carried out by a group of spammers. In this paper, we propose a new two-step method to discover spammer groups and their targeted products. First, we introduce NFS (Network Footprint Score), a new measure that quantifies the likelihood of products being spam campaign targets. Second, we carefully devise GroupStrainer to cluster spammers on a 2-hop subgraph induced by top-ranking products. We demonstrate the efficiency and effectiveness of our approach on both synthetic and real-world datasets from two different domains with millions of products and reviewers. Moreover, through case studies of our detected groups, we discover interesting strategies that spammers employ.
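The second step operates on a 2-hop subgraph induced by top-ranking products. The sketch below illustrates one plausible reading of that construction on a reviewer-product bipartite graph: take the k highest-scoring products, then their reviewers, then every product those reviewers reviewed. It is an illustration only, not the authors' implementation. The function name `two_hop_subgraph`, the toy data, and the `scores` values are hypothetical; in particular, the scores are placeholders and are not computed by NFS, whose definition is not given in the abstract.

```python
from collections import defaultdict


def two_hop_subgraph(reviews, scores, k):
    """Induce a 2-hop subgraph from the k highest-scoring products.

    reviews: iterable of (reviewer, product) pairs (bipartite edges).
    scores:  dict mapping product -> suspiciousness score (placeholder values,
             NOT the paper's NFS, which the abstract does not define).
    """
    by_product = defaultdict(set)
    by_reviewer = defaultdict(set)
    for reviewer, product in reviews:
        by_product[product].add(reviewer)
        by_reviewer[reviewer].add(product)

    # Seed set: the k highest-scoring products (ties broken by name).
    seeds = sorted(scores, key=lambda p: (-scores[p], p))[:k]

    # Hop 1: every reviewer of a seed product.
    reviewers = set()
    for product in seeds:
        reviewers |= by_product[product]

    # Hop 2: every product reviewed by those reviewers.
    products = set()
    for reviewer in reviewers:
        products |= by_reviewer[reviewer]

    edges = {(r, p) for r in reviewers for p in by_reviewer[r]}
    return reviewers, products, edges


# Toy data: product "A" has the highest placeholder score.
reviews = [("u1", "A"), ("u2", "A"), ("u2", "B"), ("u3", "C")]
scores = {"A": 0.9, "B": 0.2, "C": 0.1}
reviewers, products, edges = two_hop_subgraph(reviews, scores, k=1)
```

With this toy input, the subgraph contains reviewers u1 and u2 and products A and B. Reviewer u3 and product C are excluded because they are not reachable within two hops of the seed product. Clustering the reviewers in such a subgraph is the task the abstract assigns to GroupStrainer, which is not sketched here.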

Author information

Authors and Affiliations

Department of Computer Science, Stony Brook University, Stony Brook, USA
Junting Ye & Leman Akoglu

Corresponding author

Correspondence to Junting Ye.

Editor information

Editors and Affiliations

University of Bari Aldo Moro, Bari, Italy
Annalisa Appice
University of Porto, Porto, Portugal
Pedro Pereira Rodrigues
University of Porto - CRACS/INESC TEC, Porto, Portugal
Vítor Santos Costa
University of Porto - INESC TEC, Porto, Portugal
Carlos Soares
University of Porto - INESC TEC, Porto, Portugal
João Gama
University of Porto - INESC TEC, Porto, Portugal
Alípio Jorge

About this paper

Cite this paper

Ye, J., Akoglu, L. (2015). Discovering Opinion Spammer Groups by Network Footprints. In: Appice, A., Rodrigues, P., Santos Costa, V., Soares, C., Gama, J., Jorge, A. (eds) Machine Learning and Knowledge Discovery in Databases. ECML PKDD 2015. Lecture Notes in Computer Science, vol 9284. Springer, Cham. https://doi.org/10.1007/978-3-319-23528-8_17

DOI: https://doi.org/10.1007/978-3-319-23528-8_17
Published: 29 August 2015
Publisher Name: Springer, Cham
Print ISBN: 978-3-319-23527-1
Online ISBN: 978-3-319-23528-8
eBook Packages: Computer Science, Computer Science (R0)
