Abstract
In recent years a variety of approaches in classifying the sentiment polarity of texts have been proposed. While in the majority of approaches the determination of subjectivity or polarity-related term features is at the center, the number of publicly available dictionaries is rather limited. In this paper, we investigate the performance of combining lexical resources with machine learning based classifier for the task of sentiment classification.We systematically analyze four different English and three different German polarity dictionaries as a resources for a sentiment-based feature selection. The evaluation results show that smaller but more controlled dictionaries used for feature selection perform within a SVM-based classification setup equally good compared to the biggest available resources.
This is a preview of subscription content, log in via an institution.
Buying options
Tax calculation will be finalised at checkout
Purchases are for personal use only
Learn about institutional subscriptionsPreview
Unable to display preview. Download preview PDF.
References
Agarwal, A., Biadsy, F., McKeown, K.: Contextual phrase-level polarity analysis using lexical affect scoring and syntactic n-grams. In: EACL 2009, Athens, Greece (2009)
Annett, M., Kondrak, G.: A comparison of sentiment analysis techniques: Polarizing movie blogs. In: Canadian Conference on AI, pp. 25–35 (2008)
Becker-Asano, C., Wachsmuth, I.: Affective computing with primary and secondary emotions in a virtual human. In: Autonomous Agents and Multi-Agent Systems (2009)
Chandler, D.: Introduction to Modern Statistical Mechanics. Oxford University Press, Oxford (1987)
Chaovalit, P., Zhou, L.: Movie review mining: a comparison between supervised and unsupervised classification approaches. In: Hawaii International Conference on System Sciences, vol. 4, p. 112c (2005)
Clematide, S., Klenner, M.: Evaluation and extension of a polarity lexicon for german. In: Proceedings of the 1st Workshop on Computational Approaches to Subjectivity and Sentiment Analysis, WASSA (2010)
Dave, K., Lawrence, S., Pennock, D.M.: Mining the peanut gallery: opinion extraction and semantic classification of product reviews. In: WWW 2003: Proceedings of the Twelfth International Conference on World Wide Web, pp. 519–528. ACM Press, New York (2003)
Denecke, K.: Using sentiwordnet for multilingual sentiment analysis. In: ICDE Workshops, pp. 507–512. IEEE Computer Society, Los Alamitos (2008)
Esuli, A., Sebastiani, F.: Sentiwordnet: A publicly available lexical resource for opinion mining. In: Proceedings of the 5th Conference on Language Resources and Evaluation (LREC 2006), pp. 417–422 (2006)
Fellbaum, C. (ed.): WordNet. An Electronic Lexical Database. MIT Press, Cambridge (1998)
Hatzivassiloglou, V., McKeown, K.R.: Predicting the semantic orientation of adjectives. In: Proceedings of the Eighth Conference on European Chapter of the Association for Computational Linguistics, pp. 174–181. Association for Computational Linguistics, Morristown (1997)
Hu, M., Liu, B.: Mining and summarizing customer reviews. In: KDD 2004: Proceedings of the tenth ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, pp. 168–177. ACM Press, New York (2004)
Joachims, T.: SVM light (2002), http://svmlight.joachims.org
Joachims, T.: Learning to Classify Text Using Support Vector Machines: Methods, Theory and Algorithms. Kluwer Academic Publishers, Norwell (2002)
Kennedy, A., Inkpen, D.: Sentiment classification of movie reviews using contextual valence shifters. Computational Intelligence 22(2), 110–125 (2006)
Kugatsu Sadamitsu, S.S., Yamamoto, M.: Sentiment analysis based on probabilistic models using inter-sentence information. In: Calzolari, N., Choukri, K., B.M.J.M.J.O.S.P.D.T. (eds.) Proceedings of the Sixth International Language Resources and Evaluation (LREC 2008), European Language Resources Association, Marrakech (2008)
Liu, B.: Sentiment analysis and subjectivity. Handbook of Natural Language Processing 2, 568 (2010)
Maarten, J.K., Marx, M., Mokken, R.J., Rijke, M.D.: Using wordnet to measure semantic orientations of adjectives. In: National Institute for, pp. 1115–1118 (2004)
Mehler, A., Geibel, P., Pustylnikov, O.: Structural classifiers of text types: Towards a novel model of text representation. Journal for Language Technology and Computational Linguistics (JLCL) 22(2), 51–66 (2007)
Mullen, T., Collier, N.: Sentiment analysis using support vector machines with diverse information sources. In: Lin, D., Wu, D. (eds.) Proceedings of EMNLP 2004, pp. 412–418. Association for Computational Linguistics, Barcelona (2004)
Pang, L.A.: sentimental education: Sentiment analysis using subjectivity summarization based on minimum cuts. In: Proceedings of the ACL, pp. 271–278 (2004)
Pang, B., Lee, L.: Seeing stars: exploiting class relationships for sentiment categorization with respect to rating scales. In: ACL 2005: Proceedings of the 43rd Annual Meeting on Association for Computational Linguistics, pp. 115–124. Association for Computational Linguistics, Morristown (2005)
Pang, B., Lee, L.: Opinion Mining and Sentiment Analysis. Now Publishers Inc. (2008)
Pang, B., Lee, L., Vaithyanathan, S.: Thumbs up?: sentiment classification using machine learning techniques. In: EMNLP 2002: Proceedings of the ACL 2002 Conference on Empirical Methods in Natural Language Processing, pp. 79–86. Association for Computational Linguistics, Morristown (2002)
Prabowo, R., Thelwall, M.: Sentiment analysis: A combined approach. J. Informetrics 3(2), 143–157 (2009)
Remus, R., Quasthoff, U., Heyer, G.: Sentiws – a publicly available german-language resource for sentiment analysis. In: Proceedings of the 7th International Language Resources and Evaluation (LREC 2010), pp. 1168–1171 (2010)
Stone, P.J., Dunphy, D.C., Smith, M.S., Ogilvie, D.M.: The General Inquirer: A Computer Approach to Content Analysis. MIT Press, Cambridge (1966)
Strapparava, C., Valitutti, A.: WordNet-Affect: an affective extension of WordNet. In: Proceedings of LREC, vol. 4, pp. 1083–1086 (2004)
Taboada, M., Brooke, J., Stede, M.: Genre-based paragraph classification for sentiment analysis. In: Proceedings of the SIGDIAL 2009 Conference, pp. 62–70. Association for Computational Linguistics, London (2009)
Takamura, H., Inui, T., Okumura, M.: Extracting semantic orientations of words using spin model. In: ACL 2005: Proceedings of the 43rd Annual Meeting on Association for Computational Linguistics, pp. 133–140. Association for Computational Linguistics, Morristown (2005)
Tan, S., Zhang, J.: An empirical study of sentiment analysis for chinese documents. Expert Syst. Appl. 34(4), 2622–2629 (2008)
Turney, P.D.: Thumbs up or thumbs down?: semantic orientation applied to unsupervised classification of reviews. In: ACL 2002: Proceedings of the 40th Annual Meeting on Association for Computational Linguistics, pp. 417–424. Association for Computational Linguistics, Morristown (2001)
Turney, P.D., Littman, M.L.: Unsupervised learning of semantic orientation from a hundred-billion-word corpus. CoRR cs.LG/0212012 (2002)
Waltinger, U.: Polarity reinforcement: Sentiment polarity identification by means of social semantics. In: Proceedings of the IEEE Africon 2009, Nairobi, Kenya, September 23-25 (2009)
Waltinger, U.: Germanpolarityclues: A lexical resource for german sentiment analysis. In: Calzolari, N., Choukri, K., B.M.J.M.J.O.S.P.D.T. (eds.) Proceedings of the Seventh conference on International Language Resources and Evaluation (LREC 2010), European Language Resources Association, Valletta (2010)
Waltinger, U.: Sentiment analysis reloaded: A comparative study on sentiment polarity identification combining machine learning and subjectivity features. In: Proceedings of the 6th International Conference on Web Information Systems and Technologies (WEBIST 2010), Valencia (2010)
Wiebe, J., Riloff, E.: Creating subjective and objective sentence classifiers from unannotated texts. In: Gelbukh, A. (ed.) CICLing 2005. LNCS, vol. 3406, pp. 486–497. Springer, Heidelberg (2005)
Wiebe, J., Wilson, T., Cardie, C.: Annotating expressions of opinions and emotions in language. Language Resources and Evaluation 1(2), (2005)
Wiegand, M., Klakow, D.: The role of knowledge-based features in polarity classification at sentence level
Wilson, T., Wiebe, J., Hoffmann, P.: Recognizing contextual polarity in phrase-level sentiment analysis. In: HLT 2005: Proceedings of the conference on Human Language Technology and Empirical Methods in Natural Language Processing, pp. 347–354. Association for Computational Linguistics, Morristown (2005)
Yu, H., Hatzivassiloglou, V.: Towards answering opinion questions: Separating facts from opinions and identifying the polarity of opinion sentences. In: Proceedings of EMNLP 2003(2003)
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2011 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Waltinger, U. (2011). An Empirical Study on Machine Learning-Based Sentiment Classification Using Polarity Clues. In: Filipe, J., Cordeiro, J. (eds) Web Information Systems and Technologies. WEBIST 2010. Lecture Notes in Business Information Processing, vol 75. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-22810-0_15
Download citation
DOI: https://doi.org/10.1007/978-3-642-22810-0_15
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-642-22809-4
Online ISBN: 978-3-642-22810-0
eBook Packages: Computer ScienceComputer Science (R0)