Feature Selection Methods in Persian Sentiment Analysis

  • Conference paper
Natural Language Processing and Information Systems (NLDB 2013)

Part of the book series: Lecture Notes in Computer Science (LNISA, volume 7934)

Abstract

With the enormous growth of digital content on the internet, online reviews such as product and movie reviews offer a wealth of subjective information that can be very helpful to potential users. Sentiment analysis aims to use automated tools to detect subjective information in reviews. Little research has so far been conducted on feature selection in sentiment analysis, and work on Persian sentiment analysis is rarer still. This paper considers the problem of sentiment classification using different feature selection methods for online customer reviews in Persian. Three challenges of Persian text are its wide variety of declensional suffixes, inconsistent word spacing, and many informal or colloquial words. We address these challenges by proposing a model for sentiment classification of Persian review documents. The proposed model is based on stemming and feature selection and employs the Naive Bayes algorithm for classification. We evaluate the model on a collection of cellphone reviews; the results show the effectiveness of the proposed approaches.
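
The pipeline the abstract describes (stemming, then feature selection, then Naive Bayes) can be illustrated with a minimal sketch. This is not the authors' implementation: the abstract does not say which feature selection methods or which stemmer were used, so mutual information is assumed here as one common choice, the `stem` function is a hypothetical suffix stripper standing in for a real Persian stemmer, and the four English toy reviews are invented purely for illustration.

```python
import math
from collections import Counter

def stem(token):
    # Hypothetical placeholder stemmer: strips a few English suffixes.
    # A real system would use a Persian stemmer and spacing normalizer.
    for suffix in ("ing", "ed", "s"):
        if token.endswith(suffix) and len(token) > len(suffix) + 2:
            return token[: -len(suffix)]
    return token

def tokenize(text):
    return [stem(t) for t in text.lower().split()]

def mutual_information(docs, labels):
    """Score each term by the mutual information between its presence and the class."""
    n = len(docs)
    classes = sorted(set(labels))
    doc_sets = [set(d) for d in docs]
    scores = {}
    for term in set().union(*doc_sets):
        score = 0.0
        for present in (True, False):
            n_t = sum(1 for d in doc_sets if (term in d) == present)
            for c in classes:
                n_c = labels.count(c)
                n_tc = sum(1 for d, y in zip(doc_sets, labels)
                           if (term in d) == present and y == c)
                if n_tc:  # zero-count cells contribute nothing
                    score += (n_tc / n) * math.log2(n * n_tc / (n_t * n_c))
        scores[term] = score
    return scores

def select_features(docs, labels, k):
    scores = mutual_information(docs, labels)
    return set(sorted(scores, key=scores.get, reverse=True)[:k])

def train_nb(docs, labels, features):
    """Multinomial Naive Bayes with add-one (Laplace) smoothing."""
    classes = sorted(set(labels))
    prior = {c: math.log(labels.count(c) / len(labels)) for c in classes}
    counts = {c: Counter() for c in classes}
    for d, y in zip(docs, labels):
        counts[y].update(t for t in d if t in features)
    loglik = {}
    for c in classes:
        total = sum(counts[c].values()) + len(features)
        loglik[c] = {t: math.log((counts[c][t] + 1) / total) for t in features}
    return prior, loglik

def predict(model, doc):
    prior, loglik = model
    # Tokens outside the selected feature set are ignored.
    scores = {c: prior[c] + sum(loglik[c][t] for t in doc if t in loglik[c])
              for c in prior}
    return max(scores, key=scores.get)

# Invented toy data, for illustration only.
reviews = [
    ("great battery and great screen", "pos"),
    ("excellent camera loved it", "pos"),
    ("poor battery and bad screen", "neg"),
    ("terrible camera hated it", "neg"),
]
docs = [tokenize(text) for text, _ in reviews]
labels = [label for _, label in reviews]

features = select_features(docs, labels, k=8)
model = train_nb(docs, labels, features)
print(predict(model, tokenize("poor battery bad screen")))
```

In this toy setting, terms such as "battery" and "screen" occur equally often in both classes, so they receive a mutual information score of zero and are dropped, leaving only the polarity-bearing words as features for the classifier.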

Copyright information

© 2013 Springer-Verlag Berlin Heidelberg

About this paper

Cite this paper

Saraee, M., Bagheri, A. (2013). Feature Selection Methods in Persian Sentiment Analysis. In: Métais, E., Meziane, F., Saraee, M., Sugumaran, V., Vadera, S. (eds) Natural Language Processing and Information Systems. NLDB 2013. Lecture Notes in Computer Science, vol 7934. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-38824-8_29

  • DOI: https://doi.org/10.1007/978-3-642-38824-8_29

  • Publisher Name: Springer, Berlin, Heidelberg

  • Print ISBN: 978-3-642-38823-1

  • Online ISBN: 978-3-642-38824-8

  • eBook Packages: Computer Science, Computer Science (R0)
