Sentiment Analysis for Movies Prediction Using Machine Leaning Techniques

  • Manisha Jadon
  • Ila SharmaEmail author
  • Arvind K. Sharma
Conference paper
Part of the Lecture Notes on Data Engineering and Communications Technologies book series (LNDECT, volume 38)


The main intention of this paper is to determine the sentimental analysis for movies prediction using machine learning techniques. A novel methodology along with necessary algorithms is presented here. The input dataset obtained from IMDb is chosen for prediction of box office collection. Firstly preprocessing is performed to remove the redundant features such as stop words, punctuations, numbers, whitespace, lowercase conversion, word stemming, sparse term removal etc.. Preprocessing technique involves two steps namely tokenization and stop word removal. The preprocessed text is sent for transformation process. This formation involves calculating the values of TF and IDF. These are performed for estimating the sentiment analysis. Afterwards, fuzzy clustering is performed that helps in getting both positive and negative outcomes on basis of different movies datasets. In this way, a classification model is made utilizing SVM classifier for foreseeing the pattern of the box office incomes from the reviews of movies.


Machine learning Sentiment analysis Movie reviews IMDb SVM classifier 


  1. 1.
    Nagamma, P., et. al.: An improved sentiment analysis of online movie reviews based on clustering for box-office prediction. In: International Conference on Computing, Communication & Automation (2015)Google Scholar
  2. 2.
    Tyagi, E., Sharma, A.K.: An intelligent framework for sentiment analysis of text and emotions-a review. In: 2017 International Conference on Energy, Communication, Data Analytics and Soft Computing (ICECDS) (2017)Google Scholar
  3. 3.
    Luo, F., et al.: Affective-feature-based sentiment analysis using SVM classifier. In: 20th International Conference on Computer Supported Cooperative Work in Design. IEEE (2016)Google Scholar
  4. 4.
    Aydoan, E., et al.: A comprehensive survey for sentiment analysis tasks using machine learning techniques. IEEE (2016)Google Scholar
  5. 5.
    Qasem, M., et al.: Twitter sentiment classification using machine learning techniques for stock markets. IEEE (2015)Google Scholar
  6. 6.
    Trupthi, M., et al.: Improved feature extraction and classification-sentiment analysis. In: International Conference on Advances in Human Machine Interaction, HMI-2016. R.L. Jalappa Institute of Technology, Bangalore, 3–5 March 2016Google Scholar
  7. 7.
    Apple, O., et al.: A hybrid approach to sentiment analysis. IEEE (2016)Google Scholar
  8. 8.
    Brindha, S., et al.: A survey on classification techniques for text mining. In: 3rd International Conference on Advanced Computing and Communication Systems, ICACCS-2016, Coimbatore, India, 22–23 January 2016Google Scholar
  9. 9.
    Zou, H., et al.: Sentiment classification using machine learning techniques with syntax features. In: International Conference on Computational Science and Computational Intelligence. IEEE (2015)Google Scholar
  10. 10.
    Kalaivani, P., et al.: An improved K-nearest-neighbor algorithm using genetic algorithm for sentiment classification. In: International Conference on Circuit, Power and Computing Technologies (ICCPCT). IEEE (2014)Google Scholar
  11. 11.
    Karamibekr, M., et al.: A structure for opinion in social domains. IEEE (2013)Google Scholar
  12. 12.
    Saif, H., et al.: Semantic sentiment analysis of Twitter. In: 11th International Semantic Web Conference, ISWC-2012 (2012)CrossRefGoogle Scholar
  13. 13.
    Internet Movie Data Base.
  14. 14.
    Tyagi, E., Sharma, A.K.: Sentiment analysis of product reviews using support vector machine learning algorithm. Indian J. Sci. Technol. 10(35), 1–9 (2017)CrossRefGoogle Scholar
  15. 15.
    Kumari, U., Sharma, A.K., Soni, D.: Sentiment analysis of smart phone product review using SVM classification technique. In: IEEE-2017 International Conference on Energy, Communication, Data Analytics & Soft Computing (ICECDS) (2018)Google Scholar
  16. 16.
    Zhao, K., Jin, Y.: A hybrid method for sentiment classification in Chinese movie reviews based on sentiment labels. In: International Conference on Asian Language Processing (IALP) (2015)Google Scholar

Copyright information

© Springer Nature Switzerland AG 2020

Authors and Affiliations

  1. 1.Department of CSER.N. Modi College of EngineeringKotaIndia
  2. 2.Computer Science & EngineeringUniversity of KotaKotaIndia

Personalised recommendations