Skip to main content

Design and Implementation of Chinese Spam Review Detection System

  • Conference paper
Chinese Lexical Semantics (CLSW 2013)

Part of the book series: Lecture Notes in Computer Science ((LNAI,volume 8229))

Included in the following conference series:

Abstract

This paper designs and implements a Chinese spam review detection system based on rules. The main rules include the following three types: (1) Calculating the similarity between two comments, i.e., if the similarity is larger than a specified threshold, the two comments are viewed as review spam; (2) Calculating the correlation degree between comments and the product, i.e., if the degree is smaller than a specified threshold, the comment is viewed as review spam. (3) Detecting whether stuffing exists in the keyword, meta field or keywords of the web page. If they exist, the comments are viewed as review spam. In addition, we proposed a Naive Bayes Classifier in the review detection system. We selected 500 comments randomly and signed the comments true or false manually. Then 400 comments were selected to train and the other 100 comments were used to test. Finally, precision of the algorithm was attained. Experimental results show that the operation effect of our system is satisfactory.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 39.99
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 54.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.

References

  1. China Internet Network Information Center. 2009 annual report of China online shopping market, 38–42 (2009)

    Google Scholar 

  2. Lim, E.-P., Nguyen, V.-A., Jindal, N., Liu, B., Lauw, H.W.: Detecting product review spammers using rating behaviors. In: Proceeding CIKM 2010 Proceedings of the 19th ACM International Conference on Information and Knowledge Management, pp. 939–948 (2010)

    Google Scholar 

  3. Wu, G., Greene, D., Smyth, B., Cunningham, P.: Distortion as a validation criterion in the identification of suspicious reviews. Technical Report UCD-CSI-2010-04, University College Dublin (2010)

    Google Scholar 

  4. Jindal, N., Liu, B., Lim, E.: Finding Unusual Review Patterns Using Unexpected Rules. In: CIKM 2010: 19th ACM International Conference on Information and Knowledge Management, Toronto, Ontario (2010)

    Google Scholar 

  5. Li, F., Liu, N., Jin, H., Zhao, K., Yang, Q., Zhu, X.: Incorporate Reviewer and Product Information for Review Rating Prediction. In: Proceedings of the Twenty-Second International Joint Conference on Artificial Intelligence, IJCAI 2011 (2011)

    Google Scholar 

  6. Wang, G., Xie, S., Liu, B., Yu, P.S.: Identify Online Store Review Spammers via Social Review Graph. Journal of ACM Transactions on Intelligent Systems and Technology (TIST) 3(4) (September 2012)

    Google Scholar 

  7. Xia, H., Liu, J.: Credibility Analysis of Comments of Virtual Community Based on Text Similarity Computing. Journal of Modern Information 31(9), 33–37 (2011)

    MathSciNet  Google Scholar 

  8. Yang, F., Li, J.: Survey on research of opinion spam in user-generated-content. Application Research of Computers 28(10), 3601–3605 (2011)

    Google Scholar 

  9. ICTCLAS, http://ictclas.org/

  10. Shi, C., Xu, C., Yang, X.: Study of TFIDF algorithm. Journal of Computer Application 29(6), 167–180 (2009)

    Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2013 Springer-Verlag Berlin Heidelberg

About this paper

Cite this paper

Xu, X., Han, T., Xu, Z., Wang, Y., Liu, Y. (2013). Design and Implementation of Chinese Spam Review Detection System. In: Liu, P., Su, Q. (eds) Chinese Lexical Semantics. CLSW 2013. Lecture Notes in Computer Science(), vol 8229. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-45185-0_53

Download citation

  • DOI: https://doi.org/10.1007/978-3-642-45185-0_53

  • Publisher Name: Springer, Berlin, Heidelberg

  • Print ISBN: 978-3-642-45184-3

  • Online ISBN: 978-3-642-45185-0

  • eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics