Skip to main content

An Empirical Study on Machine Learning-Based Sentiment Classification Using Polarity Clues

  • Conference paper

Part of the book series: Lecture Notes in Business Information Processing ((LNBIP,volume 75))

Abstract

In recent years a variety of approaches in classifying the sentiment polarity of texts have been proposed. While in the majority of approaches the determination of subjectivity or polarity-related term features is at the center, the number of publicly available dictionaries is rather limited. In this paper, we investigate the performance of combining lexical resources with machine learning based classifier for the task of sentiment classification.We systematically analyze four different English and three different German polarity dictionaries as a resources for a sentiment-based feature selection. The evaluation results show that smaller but more controlled dictionaries used for feature selection perform within a SVM-based classification setup equally good compared to the biggest available resources.

This is a preview of subscription content, log in via an institution.

Buying options

Chapter
USD   29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD   39.99
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD   54.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Learn about institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.

References

  1. Agarwal, A., Biadsy, F., McKeown, K.: Contextual phrase-level polarity analysis using lexical affect scoring and syntactic n-grams. In: EACL 2009, Athens, Greece (2009)

    Google Scholar 

  2. Annett, M., Kondrak, G.: A comparison of sentiment analysis techniques: Polarizing movie blogs. In: Canadian Conference on AI, pp. 25–35 (2008)

    Google Scholar 

  3. Becker-Asano, C., Wachsmuth, I.: Affective computing with primary and secondary emotions in a virtual human. In: Autonomous Agents and Multi-Agent Systems (2009)

    Google Scholar 

  4. Chandler, D.: Introduction to Modern Statistical Mechanics. Oxford University Press, Oxford (1987)

    Google Scholar 

  5. Chaovalit, P., Zhou, L.: Movie review mining: a comparison between supervised and unsupervised classification approaches. In: Hawaii International Conference on System Sciences, vol. 4, p. 112c (2005)

    Google Scholar 

  6. Clematide, S., Klenner, M.: Evaluation and extension of a polarity lexicon for german. In: Proceedings of the 1st Workshop on Computational Approaches to Subjectivity and Sentiment Analysis, WASSA (2010)

    Google Scholar 

  7. Dave, K., Lawrence, S., Pennock, D.M.: Mining the peanut gallery: opinion extraction and semantic classification of product reviews. In: WWW 2003: Proceedings of the Twelfth International Conference on World Wide Web, pp. 519–528. ACM Press, New York (2003)

    Chapter  Google Scholar 

  8. Denecke, K.: Using sentiwordnet for multilingual sentiment analysis. In: ICDE Workshops, pp. 507–512. IEEE Computer Society, Los Alamitos (2008)

    Google Scholar 

  9. Esuli, A., Sebastiani, F.: Sentiwordnet: A publicly available lexical resource for opinion mining. In: Proceedings of the 5th Conference on Language Resources and Evaluation (LREC 2006), pp. 417–422 (2006)

    Google Scholar 

  10. Fellbaum, C. (ed.): WordNet. An Electronic Lexical Database. MIT Press, Cambridge (1998)

    MATH  Google Scholar 

  11. Hatzivassiloglou, V., McKeown, K.R.: Predicting the semantic orientation of adjectives. In: Proceedings of the Eighth Conference on European Chapter of the Association for Computational Linguistics, pp. 174–181. Association for Computational Linguistics, Morristown (1997)

    Chapter  Google Scholar 

  12. Hu, M., Liu, B.: Mining and summarizing customer reviews. In: KDD 2004: Proceedings of the tenth ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, pp. 168–177. ACM Press, New York (2004)

    Chapter  Google Scholar 

  13. Joachims, T.: SVM light (2002), http://svmlight.joachims.org

  14. Joachims, T.: Learning to Classify Text Using Support Vector Machines: Methods, Theory and Algorithms. Kluwer Academic Publishers, Norwell (2002)

    Book  Google Scholar 

  15. Kennedy, A., Inkpen, D.: Sentiment classification of movie reviews using contextual valence shifters. Computational Intelligence 22(2), 110–125 (2006)

    Article  MathSciNet  Google Scholar 

  16. Kugatsu Sadamitsu, S.S., Yamamoto, M.: Sentiment analysis based on probabilistic models using inter-sentence information. In: Calzolari, N., Choukri, K., B.M.J.M.J.O.S.P.D.T. (eds.) Proceedings of the Sixth International Language Resources and Evaluation (LREC 2008), European Language Resources Association, Marrakech (2008)

    Google Scholar 

  17. Liu, B.: Sentiment analysis and subjectivity. Handbook of Natural Language Processing 2, 568 (2010)

    Google Scholar 

  18. Maarten, J.K., Marx, M., Mokken, R.J., Rijke, M.D.: Using wordnet to measure semantic orientations of adjectives. In: National Institute for, pp. 1115–1118 (2004)

    Google Scholar 

  19. Mehler, A., Geibel, P., Pustylnikov, O.: Structural classifiers of text types: Towards a novel model of text representation. Journal for Language Technology and Computational Linguistics (JLCL) 22(2), 51–66 (2007)

    Google Scholar 

  20. Mullen, T., Collier, N.: Sentiment analysis using support vector machines with diverse information sources. In: Lin, D., Wu, D. (eds.) Proceedings of EMNLP 2004, pp. 412–418. Association for Computational Linguistics, Barcelona (2004)

    Google Scholar 

  21. Pang, L.A.: sentimental education: Sentiment analysis using subjectivity summarization based on minimum cuts. In: Proceedings of the ACL, pp. 271–278 (2004)

    Google Scholar 

  22. Pang, B., Lee, L.: Seeing stars: exploiting class relationships for sentiment categorization with respect to rating scales. In: ACL 2005: Proceedings of the 43rd Annual Meeting on Association for Computational Linguistics, pp. 115–124. Association for Computational Linguistics, Morristown (2005)

    Chapter  Google Scholar 

  23. Pang, B., Lee, L.: Opinion Mining and Sentiment Analysis. Now Publishers Inc. (2008)

    Google Scholar 

  24. Pang, B., Lee, L., Vaithyanathan, S.: Thumbs up?: sentiment classification using machine learning techniques. In: EMNLP 2002: Proceedings of the ACL 2002 Conference on Empirical Methods in Natural Language Processing, pp. 79–86. Association for Computational Linguistics, Morristown (2002)

    Chapter  Google Scholar 

  25. Prabowo, R., Thelwall, M.: Sentiment analysis: A combined approach. J. Informetrics 3(2), 143–157 (2009)

    Article  Google Scholar 

  26. Remus, R., Quasthoff, U., Heyer, G.: Sentiws – a publicly available german-language resource for sentiment analysis. In: Proceedings of the 7th International Language Resources and Evaluation (LREC 2010), pp. 1168–1171 (2010)

    Google Scholar 

  27. Stone, P.J., Dunphy, D.C., Smith, M.S., Ogilvie, D.M.: The General Inquirer: A Computer Approach to Content Analysis. MIT Press, Cambridge (1966)

    Google Scholar 

  28. Strapparava, C., Valitutti, A.: WordNet-Affect: an affective extension of WordNet. In: Proceedings of LREC, vol. 4, pp. 1083–1086 (2004)

    Google Scholar 

  29. Taboada, M., Brooke, J., Stede, M.: Genre-based paragraph classification for sentiment analysis. In: Proceedings of the SIGDIAL 2009 Conference, pp. 62–70. Association for Computational Linguistics, London (2009)

    Google Scholar 

  30. Takamura, H., Inui, T., Okumura, M.: Extracting semantic orientations of words using spin model. In: ACL 2005: Proceedings of the 43rd Annual Meeting on Association for Computational Linguistics, pp. 133–140. Association for Computational Linguistics, Morristown (2005)

    Chapter  Google Scholar 

  31. Tan, S., Zhang, J.: An empirical study of sentiment analysis for chinese documents. Expert Syst. Appl. 34(4), 2622–2629 (2008)

    Article  Google Scholar 

  32. Turney, P.D.: Thumbs up or thumbs down?: semantic orientation applied to unsupervised classification of reviews. In: ACL 2002: Proceedings of the 40th Annual Meeting on Association for Computational Linguistics, pp. 417–424. Association for Computational Linguistics, Morristown (2001)

    Chapter  Google Scholar 

  33. Turney, P.D., Littman, M.L.: Unsupervised learning of semantic orientation from a hundred-billion-word corpus. CoRR cs.LG/0212012 (2002)

    Google Scholar 

  34. Waltinger, U.: Polarity reinforcement: Sentiment polarity identification by means of social semantics. In: Proceedings of the IEEE Africon 2009, Nairobi, Kenya, September 23-25 (2009)

    Google Scholar 

  35. Waltinger, U.: Germanpolarityclues: A lexical resource for german sentiment analysis. In: Calzolari, N., Choukri, K., B.M.J.M.J.O.S.P.D.T. (eds.) Proceedings of the Seventh conference on International Language Resources and Evaluation (LREC 2010), European Language Resources Association, Valletta (2010)

    Google Scholar 

  36. Waltinger, U.: Sentiment analysis reloaded: A comparative study on sentiment polarity identification combining machine learning and subjectivity features. In: Proceedings of the 6th International Conference on Web Information Systems and Technologies (WEBIST 2010), Valencia (2010)

    Google Scholar 

  37. Wiebe, J., Riloff, E.: Creating subjective and objective sentence classifiers from unannotated texts. In: Gelbukh, A. (ed.) CICLing 2005. LNCS, vol. 3406, pp. 486–497. Springer, Heidelberg (2005)

    Chapter  Google Scholar 

  38. Wiebe, J., Wilson, T., Cardie, C.: Annotating expressions of opinions and emotions in language. Language Resources and Evaluation 1(2), (2005)

    Google Scholar 

  39. Wiegand, M., Klakow, D.: The role of knowledge-based features in polarity classification at sentence level

    Google Scholar 

  40. Wilson, T., Wiebe, J., Hoffmann, P.: Recognizing contextual polarity in phrase-level sentiment analysis. In: HLT 2005: Proceedings of the conference on Human Language Technology and Empirical Methods in Natural Language Processing, pp. 347–354. Association for Computational Linguistics, Morristown (2005)

    Chapter  Google Scholar 

  41. Yu, H., Hatzivassiloglou, V.: Towards answering opinion questions: Separating facts from opinions and identifying the polarity of opinion sentences. In: Proceedings of EMNLP 2003(2003)

    Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2011 Springer-Verlag Berlin Heidelberg

About this paper

Cite this paper

Waltinger, U. (2011). An Empirical Study on Machine Learning-Based Sentiment Classification Using Polarity Clues. In: Filipe, J., Cordeiro, J. (eds) Web Information Systems and Technologies. WEBIST 2010. Lecture Notes in Business Information Processing, vol 75. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-22810-0_15

Download citation

  • DOI: https://doi.org/10.1007/978-3-642-22810-0_15

  • Publisher Name: Springer, Berlin, Heidelberg

  • Print ISBN: 978-3-642-22809-4

  • Online ISBN: 978-3-642-22810-0

  • eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics