Sentiment Analysis in Movie Reviews Using Document Frequency Difference, Gain Ratio and Kullback-Leibler Divergence as Feature Selection Methods and Multi-layer Perceptron Classifier

  • S. VigneshwaranEmail author
Conference paper
Part of the Lecture Notes on Data Engineering and Communications Technologies book series (LNDECT, volume 49)


Both industries and people are interested in knowing the reviews of the products or movies. If the number of reviews are large, manual classification of reviews into either positive or negative classes, is tedious and time consuming. As an alternative, Sentiment Analysis, a subdomain in Natural Language Processing aims to automate the above process, by training the models by using reviews as the training data. These are popular because this is very essential for making decisions by knowing the other person’s opinion. In this paper, Document Frequency Difference, Gain Ratio and Kullback-Leibler divergence are used for feature selection and the classification is done with Multi-Layer Perceptron classifier. This is done with movie reviews dataset like SAR14, IMDB11, Cornell open source benchmark dataset. The results show that Document Frequency Difference and Gain Ratio and feature selection methods have better sentiment classification performance i.e. with better accuracy and reduced error.


Document Frequency Difference Gain Ratio Kullback-Leibler Divergence Feature selection Movie reviews Sentiment classification Multi-layer Perceptron 


  1. 1.
    Singh, V.K., Piryani, R., Uddin, A., Waila, P.: Sentiment analysis of movie reviews: a new feature-based heuristic for aspect-level sentiment classification. In: 2013 International Multi-Conference on Automation, Computing, Communication, Control and Compressed Sensing (iMac4s), pp. 712–717. IEEE, March 2013Google Scholar
  2. 2.
    Thet, T.T., Na, J.C., Khoo, C. S., Shakthikumar, S.: Sentiment analysis of movie reviews on discussion boards using a linguistic approach. In: Proceedings of the 1st International CIKM Workshop on Topic-Sentiment Analysis for Mass Opinion, pp. 81–84. ACM, November 2009Google Scholar
  3. 3.
    Singh, V.K., Piryani, R., Uddin, A., Waila, P.: Sentiment analysis of movie reviews and Blog posts. In: 2013 3rd IEEE International Advance Computing Conference (IACC), pp. 893–898. IEEE, February 2013Google Scholar
  4. 4.
    Liu, C.L., Hsaio, W.H., Lee, C.H., Lu, G.C., Jou, E.: Movie rating and review summarization in mobile environment. IEEE Trans. Syst. Man Cybern. Part C Appl. Rev. 42(3), 397–407 (2011)CrossRefGoogle Scholar
  5. 5.
    Lin, C., He, Y.: Joint sentiment/topic model for sentiment analysis. In: Proceedings of the 18th ACM Conference on Information and Knowledge Management, pp. 375–384. ACM, November 2009Google Scholar
  6. 6.
    Zhuang, L., Jing, F., Zhu, X.Y.: Movie review mining and summarization. In: Proceedings of the 15th ACM International Conference on Information and Knowledge Management, pp. 43–50. ACM, November 2006Google Scholar
  7. 7.
    Khotimah, D.A.K., Sarno, R.: Sentiment detection of comment titles in using probabilistic latent semantic analysis. In: 2018 6th International Conference on Information and Communication Technology (ICoICT), pp. 514–519. IEEE, May 2018Google Scholar
  8. 8.
    Amolik, A., Jivane, N., Bhandari, M., Venkatesan, M.: Twitter sentiment analysis of movie reviews using machine learning techniques. Int. J. Eng.Technol. 7(6), 1–7 (2016)Google Scholar
  9. 9.
    Pang, B., Lee, L.: A sentimental education: Sentiment analysis using subjectivity summarization based on minimum cuts. In: Proceedings of the 42nd annual meeting on Association for Computational Linguistics, p. 271. Association for Computational Linguistics, July 2004Google Scholar
  10. 10.
    Shirani-Mehr, H.: Applications of deep learning to sentiment analysis of movie reviews. Technical report. Stanford University (2014)Google Scholar
  11. 11.
    Annett, M., Kondrak, G.: A comparison of sentiment analysis techniques: polarizing movie blogs. In: Conference of the Canadian Society for Computational Studies of Intelligence, pp. 25–35. Springer, Heidelberg, May 2008CrossRefGoogle Scholar
  12. 12.
    Yasen, M., Tedmori, S.: Movies Reviews sentiment analysis and classification. In: 2019 IEEE Jordan International Joint Conference on Electrical Engineering and Information Technology (JEEIT), pp. 860–865. IEEE, April 2019Google Scholar
  13. 13.
    The dataset can be downloaded from
  14. 14.
    Nicholls, C., Song, F.: Comparison of feature selection methods for sentiment analysis. In: Canadian Conference on Artificial Intelligence, pp. 286–289. Springer, Heidelberg, May 2010CrossRefGoogle Scholar
  15. 15.
    Manek, A.S., Shenoy, P.D., Mohan, M.C., Venugopal, K.R.: Aspect term extraction for sentiment analysis in large movie reviews using Gini Index feature selection method and SVM classifier. World Wide Web 20(2), 135–154 (2017)CrossRefGoogle Scholar

Copyright information

© Springer Nature Switzerland AG 2020

Authors and Affiliations

  1. 1.Applied Mathematics and Computational SciencesPSG College of Engineering and TechnologyCoimbatoreIndia

Personalised recommendations