Sentiment Analysis Using Tuned Ensemble Machine Learning Approach

  • Conference paper
Advances in Data and Information Sciences

Part of the book series: Lecture Notes in Networks and Systems (LNNS, volume 38)

Abstract

With the recent emergence of Web-based applications and the growing use of social networking sites, a large number of people are eager to express their views and opinions online. Sentiment analysis, also referred to as opinion mining, aims at processing user reviews (about products, movies, services, books, places, etc.). These reviews are often unstructured and need processing before they yield useful knowledge. The majority of sentiment analysis work classifies opinion polarity with simple classifiers. Handling diverse data distributions is one of the major issues from which simple classifiers suffer. To address this issue, in this paper we apply ensemble learners to the polarity prediction of movie reviews. The proposed work processes the review data through the elementary steps commonly used for feature extraction in sentiment analysis. In addition to feature extraction, we perform feature selection to reduce dimensionality. In contrast to the conventional simple learner, we apply ensemble learners in the proposed model and evaluate their performance. To assess the competence of the ensemble model, we conducted experiments on both individual and ensemble learners (random forest, AdaBoost, extra trees) and computed classification measures for both kinds of model. The IMDB dataset is used, and the polarity of a review, i.e., whether it is positive or negative, is predicted. Extensive experimentation shows that ensemble classifiers outperform individual learners in the classification of sentiment polarity.
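The pipeline described in the abstract (feature extraction, feature selection, then an individual versus an ensemble learner) can be sketched roughly as follows. This is an illustrative sketch, not the authors' implementation: the abstract names only the three ensemble learners, so the use of scikit-learn, TF-IDF features, chi-squared selection, Multinomial Naive Bayes as the individual baseline, the value of `k`, and the tiny hand-written corpus standing in for IMDB reviews are all assumptions made here.

```python
from sklearn.ensemble import (
    AdaBoostClassifier,
    ExtraTreesClassifier,
    RandomForestClassifier,
)
from sklearn.feature_extraction.text import TfidfVectorizer
from sklearn.feature_selection import SelectKBest, chi2
from sklearn.naive_bayes import MultinomialNB
from sklearn.pipeline import make_pipeline

# Hypothetical toy corpus standing in for IMDB reviews (1 = positive, 0 = negative).
train_texts = [
    "a wonderful film with brilliant acting",
    "I loved every minute of this great movie",
    "an excellent story and a superb cast",
    "truly enjoyable and beautifully directed",
    "a terrible film with awful acting",
    "I hated every minute of this boring movie",
    "a dreadful story and a weak cast",
    "truly painful and badly directed",
]
train_labels = [1, 1, 1, 1, 0, 0, 0, 0]
test_texts = [
    "brilliant and enjoyable movie",
    "great cast and a wonderful story",
    "boring and awful film",
    "a weak and terrible movie",
]
test_labels = [1, 1, 0, 0]


def build_pipeline(classifier, k=10):
    """Feature extraction (TF-IDF), feature selection (chi-squared), classifier."""
    return make_pipeline(TfidfVectorizer(), SelectKBest(chi2, k=k), classifier)


# One assumed individual learner and the three ensemble learners named in the abstract.
models = {
    "naive_bayes": MultinomialNB(),
    "random_forest": RandomForestClassifier(n_estimators=50, random_state=0),
    "adaboost": AdaBoostClassifier(n_estimators=50, random_state=0),
    "extra_trees": ExtraTreesClassifier(n_estimators=50, random_state=0),
}

# Fit each pipeline and record its accuracy on the held-out reviews.
scores = {}
for name, classifier in models.items():
    pipeline = build_pipeline(classifier)
    pipeline.fit(train_texts, train_labels)
    scores[name] = pipeline.score(test_texts, test_labels)

for name, accuracy in scores.items():
    print(f"{name}: accuracy = {accuracy:.2f}")
```

On a corpus this small the accuracies say nothing about the relative merit of the learners. The sketch only shows how the same extraction and selection steps can be shared so that individual and ensemble models are compared on equal footing.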

Corresponding author

Correspondence to Pradeep Singh.

Copyright information

© 2018 Springer Nature Singapore Pte Ltd.

About this paper

Cite this paper

Singh, P. (2018). Sentiment Analysis Using Tuned Ensemble Machine Learning Approach. In: Kolhe, M., Trivedi, M., Tiwari, S., Singh, V. (eds) Advances in Data and Information Sciences. Lecture Notes in Networks and Systems, vol 38. Springer, Singapore. https://doi.org/10.1007/978-981-10-8360-0_27

  • DOI: https://doi.org/10.1007/978-981-10-8360-0_27

  • Publisher Name: Springer, Singapore

  • Print ISBN: 978-981-10-8359-4

  • Online ISBN: 978-981-10-8360-0

  • eBook Packages: Engineering, Engineering (R0)
