Skip to main content

A Stacked Ensemble Approach to Bengali Sentiment Analysis

  • Conference paper
  • First Online:
Intelligent Human Computer Interaction (IHCI 2019)

Part of the book series: Lecture Notes in Computer Science ((LNISA,volume 11886))

Included in the following conference series:

Abstract

Sentiment analysis is a crucial step in the social media data analysis. The majority of research works on sentiment analysis focus on sentiment polarity detection which identifies whether an input text is positive, negative or neutral. In this paper, we have implemented a stacked ensemble approach to sentiment polarity detection in Bengali tweets. The basic concept of stacked generalization is to fuse the outputs of the first level base classifiers using a second-level Meta classifier in an ensemble. In our ensemble method, we have used two types of base classifiers- multinomial Naïve Bayes classifiers and SVM that make use of a diverse set of features. Our proposed approach shows an improvement over some existing Bengali sentiment analysis approaches reported in the literature.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 39.99
Price excludes VAT (USA)
  • Available as EPUB and PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 54.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

References

  1. Bowker, J.: The Oxford Dictionary of World Religions. Oxford University Press, Oxford (1997)

    Google Scholar 

  2. Zhao, J., Liu, K., Wang, G.: Adding redundant features for CRFs-based sentence sentiment classification. In: Proceedings of the Conference on Empirical Methods in Natural Language Processing, pp. 117–126. Association for Computational Linguistics (2008)

    Google Scholar 

  3. Joachims, T.: Making large scale SVM learning practical. In: Schölkopf, B., Burges, C.J.C., der Smola, A. (eds.) Advances in Kernel Methods-Support Vector Learning. MITPress, Cambridge (1999)

    Google Scholar 

  4. Pang, B., Lee, L., Vaithyanathan, S.: Thumbs up?: sentiment classification using machine learning techniques. In: Proceedings of the ACL-02 Conference on Empirical Methods in Natural Language Processing, vol. 10, pp. 79–86. Association for Computational Linguistics (2002)

    Google Scholar 

  5. Dave, K., Lawrence, S., Pennock, D.M.: Mining the peanut gallery: opinion extraction and semantic classification of product reviews. In: Proceedings of the 12th International Conference on World Wide Web, pp. 519–528. ACM (2003)

    Google Scholar 

  6. Mullen, T., Collier, N.: Sentiment analysis using support vector machines with diverse information sources. In: EMNLP, vol. 4, pp. 412–418 (2004)

    Google Scholar 

  7. Pang, B., Lee, L.: Opinion mining and sentiment analysis. Found. Trends Inf. Retrieval 2(1–2), 1–135 (2008)

    Article  Google Scholar 

  8. Goldberg, A.B., Zhu, X.: Seeing stars when there aren’t many stars: graph-based semi-supervised learning for sentiment categorization. In: Proceedings of the First Workshop on Graph Based Methods for Natural Language Processing, pp. 45–52. Association for Computational Linguistics (2006)

    Google Scholar 

  9. Miao, Q., Li, Q., Zeng, D.: Fine grained opinion mining by integrating multiple review sources. J. Am. Soc. Inform. Sci. Technol. 61(11), 2288–2299 (2010)

    Article  Google Scholar 

  10. Riloff, E., Wiebe, J.: Learning extraction patterns for subjective expressions. In: Proceedings of the 2003 Conference on Empirical Methods in Natural Language Processing, pp. 105–112. Association for Computational Linguistics (2003)

    Google Scholar 

  11. Prabowo, R., Thelwall, M.: Sentiment analysis: a combined approach. J. Inf. 3(2), 143–157 (2009)

    Google Scholar 

  12. Narayanan, R., Liu, B., Choudhary, A.: Sentiment analysis of conditional sentences. In: Proceedings of the 2009 Conference on Empirical Methods in Natural Language Processing, vol. 1, pp. 180–189. Association for Computational Linguistics (2009)

    Google Scholar 

  13. Wiegand, M., Balahur, A., Roth, B., Klakow, D., Montoyo, A.: A survey on the role of negation in sentiment analysis. In: Proceedings of the Workshop on Negation and Speculation in Natural Language Processing, pp. 60–68. Association for Computational Linguistics (2010)

    Google Scholar 

  14. Ku, L.-W., Liang, Y.T., Chen, H-H.: Opinion extraction, summarization and tracking in news and blog corpora. In: AAAI Spring Symposium: Computational Approaches to Analyzing Weblogs (2006)

    Google Scholar 

  15. Kim, J., Chern, G., Feng, D., Shaw, E., Hovy, E.: Mining and assessing discussions on the web through speech act analysis. In: Proceedings of the Workshop on Web Content Mining with Human Language Technologies at the 5th International Semantic Web Conference (2006)

    Google Scholar 

  16. Pang, B., Lee, L.: A sentimental education: sentiment analysis using subjectivity summarization based on minimum cuts. In: Proceedings of the 42nd Annual Meeting on Association for Computational Linguistics, p. 271 (2004)

    Google Scholar 

  17. Zhu, F., Zhang, X.: Impact of online consumer reviews on sales: the moderating role of product and consumer characteristics. J. Mark. 74(2), 133–148 (2010)

    Article  Google Scholar 

  18. Melville, P., Gryc, W., Lawrence, R.D.: Sentiment analysis of blogs by combining lexical knowledge with text classification. In: Proceedings of the 15th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, pp. 1275–1284. ACM (2009)

    Google Scholar 

  19. Ramakrishnan, G., Jadhav, A., Joshi, A., Chakrabarti, S., Bhattacharyya, P.: Question answering via Bayesian inference on lexical relations. In: Proceedings of the ACL 2003 Workshop on Multilingual Summarization and Question Answering, vol. 12, pp. 1–10. Association for Computational Linguistics (2003)

    Google Scholar 

  20. Jiao, J., Zhou, Y.: Sentiment Polarity Analysis based multi-dictionary. Phys. Procedia 22, 590–596 (2011)

    Article  Google Scholar 

  21. Macdonald, C., Ounis, I.: The TREC Blogs06 collection: creating and analysing a blog test collection. Department of Computer Science, University of Glasgow Technical report TR-2006-224, 1, 3-1, (2006)

    Google Scholar 

  22. Hatzivassiloglou, V., McKeown, K.R.: Predicting the semantic orientation of adjectives. In: Proceedings of the 35th Annual Meeting of the Association for Computational Linguistics and Eighth Conference of the European Chapter of the Association for Computational Linguistics, pp. 174–181. Association for Computational Linguistics, July 1997

    Google Scholar 

  23. Wiebe, J.: Learning subjective adjectives from corpora. In: AAAI/IAAI, pp. 735–740, July 2000

    Google Scholar 

  24. Yu, H., Hatzivassiloglou, V.: Towards answering opinion questions: separating facts from opinions and identifying the polarity of opinion sentences. In: Proceedings of the 2003 Conference on Empirical Methods in Natural Language Processing, pp. 129–136. Association for Computational Linguistics, July 2003

    Google Scholar 

  25. Riloff, E., Wiebe, J.: Learning extraction patterns for subjective expressions. In: Proceedings of the 2003 Conference on Empirical Methods in Natural Language Processing, pp. 105–112. Association for Computational Linguistics, July 2003

    Google Scholar 

  26. Esuli, A., Sebastiani, F.: SENTIWORDNET: a publicly available lexical resource for opinion mining. In: Proceedings of LREC, vol. 6, pp. 417–422, May 2006

    Google Scholar 

  27. Fellbaum, C.: WordNet. Blackwell Publishing Ltd., Hoboken (1999)

    MATH  Google Scholar 

  28. Pang, B., Lee, L.: A sentimental education: sentiment analysis using subjectivity summarization based on minimum cuts. In: ACL (2004)

    Google Scholar 

  29. Chen, C.C., Tseng, Y.D.: Quality evaluation of product reviews using an information quality framework. Decis. Support Syst. 50(4), 755–768 (2011)

    Article  Google Scholar 

  30. Kang, H., Yoo, S.J., Han, D.: Senti-lexicon and improved Naïve Bayes algorithms for sentiment analysis of restaurant reviews. Expert Syst. Appl. 39(5), 6000–6010 (2012)

    Article  Google Scholar 

  31. Clarke, D., Lane, P., Hender, P.: Developing robust models for favourability analysis. In: Proceedings of the 2nd Workshop on Computational Approaches to Subjectivity and Sentiment Analysis, pp. 44–52. Association for Computational Linguistics (2011)

    Google Scholar 

  32. Reyes, A., Rosso, P.: Making objective decisions from subjective data: detecting irony in customer reviews. Decis. Support Syst. 53(4), 754–760 (2012)

    Article  Google Scholar 

  33. Moraes, R., Valiati, J.F., Neto, W.P.G.: Document-level sentiment classification: an empirical comparison between SVM and ANN. Expert Syst. Appl. 40(2), 621–633 (2013)

    Article  Google Scholar 

  34. Martín-Valdivia, M.T., Martínez-Cámara, E., Perea-Ortega, J.M., Ureña-López, L.A.: Sentiment polarity detection in Spanish reviews combining supervised and unsupervised approaches. Expert Syst. Appl. 40(10), 3934–3942 (2013)

    Article  Google Scholar 

  35. Sarkar, K., Chakraborty, S.: A sentiment analysis system for Indian Language Tweets. In: Prasath, R., Vuppala, A.K., Kathirvalavakumar, T. (eds.) MIKE 2015. LNCS (LNAI), vol. 9468, pp. 694–702. Springer, Cham (2015). https://doi.org/10.1007/978-3-319-26832-3_66

    Chapter  Google Scholar 

  36. Li, Y.M., Li, T.Y.: Deriving market intelligence from microblogs. Decis. Support Syst. 55(1), 206–217 (2013)

    Article  Google Scholar 

  37. Patra, B.G., Das, D., Das, A., Prasath, R.: Shared task on Sentiment Analysis in Indian Languages (SAIL) tweets - an overview. In: Prasath, R., Vuppala, A.K., Kathirvalavakumar, T. (eds.) MIKE 2015. LNCS (LNAI), vol. 9468, pp. 650–655. Springer, Cham (2015). https://doi.org/10.1007/978-3-319-26832-3_61

    Chapter  Google Scholar 

  38. Sarkar, K., Bhowmik, M.: Sentiment polarity detection in bengali tweets using multinomial Naïve Bayes and support vector machines. In: CALCON 2017, Kolkata. IEEE (2017)

    Google Scholar 

  39. Sarkar, K.: Sentiment polarity detection in Bengali tweets using deep convolutional neural networks. J. Intell. Syst. 28(3), 377–386 (2018). https://doi.org/10.1515/jisys-2017-0418. Accessed 7 July 2019

    Article  MathSciNet  Google Scholar 

  40. Sarkar, K.: Using character N gram features and multinomial Naïve Bayes for sentiment polarity detection in Bengali tweets. In: Proceedings of Fifth International Conference on Emerging Applications of Information Technology (EAIT), Kolkata. IEEE (2018)

    Google Scholar 

  41. Sarkar, K.: Sentiment polarity detection in Bengali tweets using LSTM recurrent neural networks. In: Proceedings of Second International Conference on Advanced Computational and Communication Paradigms (ICACCP), Sikkim, India, 25–28 February 2019. IEEE (2019)

    Google Scholar 

  42. Das, A., Bandyopadhyay, S.: SentiWordNet for Indian languages. In: Proceedings of 8th Workshop on Asian Language Resources (COLING 2010), Beijing, China, pp. 56–63 (2010)

    Google Scholar 

  43. Vapnik, V.: Estimation of Dependences Based on Empirical Data, vol. 40. Springer-Verlag, New York (1982). https://doi.org/10.1007/0-387-34239-7

    Book  MATH  Google Scholar 

Download references

Acknowledgments

This research work has received support from the project entitled ‘‘Indian Social Media Sensor: an Indian Social Media Text Mining System for Topic Detection, Topic Sentiment Analysis and Opinion Summarization’’ funded by the Department of Science and Technology, Government of India under the SERB scheme.

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Kamal Sarkar .

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2020 Springer Nature Switzerland AG

About this paper

Check for updates. Verify currency and authenticity via CrossMark

Cite this paper

Sarkar, K. (2020). A Stacked Ensemble Approach to Bengali Sentiment Analysis. In: Tiwary, U., Chaudhury, S. (eds) Intelligent Human Computer Interaction. IHCI 2019. Lecture Notes in Computer Science(), vol 11886. Springer, Cham. https://doi.org/10.1007/978-3-030-44689-5_10

Download citation

  • DOI: https://doi.org/10.1007/978-3-030-44689-5_10

  • Published:

  • Publisher Name: Springer, Cham

  • Print ISBN: 978-3-030-44688-8

  • Online ISBN: 978-3-030-44689-5

  • eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics