An Evaluation Framework and Adaptive Architecture for Automated Sentiment Detection

Gindl, Stefan; Liegl, Johannes; Scharl, Arno; Weichselbraun, Albert

doi:10.1007/978-3-642-02184-8_15

Stefan Gindl⁶,
Johannes Liegl⁶,
Arno Scharl⁶ &
…
Albert Weichselbraun⁷

Part of the book series: Studies in Computational Intelligence ((SCI,volume 221))

658 Accesses

Abstract

Analysts are often interested in how sentiment towards an organization, a product or a particular technology changes over time. Popular methods that process unstructured textual material to automatically detect sentiment based on tagged dictionaries are not capable of fulfilling this task, even when coupled with part-of speech tagging, a standard component of most text processing toolkits that distinguishes grammatical categories such as article, noun, verb, and adverb. Small corpus size, ambiguity and subtle incremental change of tonal expressions between different versions of a document complicate sentiment detection. Parsing grammatical structures, by contrast, outperforms dictionary-based approaches in terms of reliability, but usually suffers from poor scalability due to its computational complexity. This work provides an over view of different dictionary- and machine-learning-based sentiment detection methods and evaluates them on several Web corpora.After identifying the shortcomings of these methods, the paper proposes an approach based on automatically building Tagged Linguistic Unit (TLU) databases to overcome the restrictions of dictionaries with a limited set of tagged tokens.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Log in via an institution

Chapter: USD 29.95; Price excludes VAT (USA)

eBook: USD 129.00; Price excludes VAT (USA)

Softcover Book: USD 169.00; Price excludes VAT (USA)

Hardcover Book: USD 169.99; Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

References

Berger, A.L., Pietra, S.D., Pietra, V.J.D.: A maximum entropy approach to natural language processing. Computational Linguistics 22(1), 39–71 (1996)
Google Scholar
Blitzer, J., Dredze, M., Pereira, F.: Biographies, bollywood, boom-boxes and blenders: Domain adaptation for sentiment classification. In: Proceedings of the 45th Annual Meeting of the Association of Computational Linguistics, Prague, Czech Republic, pp. 440–447 (June 2007)
Google Scholar
Ding, X., Liu, B., Yu, P.S.: A holistic lexicon-based approach to opinion mining. In: WSDM 2008: Proceedings of the international conference on Web search and web data mining, Palo Alto, California, USA, pp. 231–240. ACM, New York (2008)
Chapter Google Scholar
Hatzivassiloglou, V., McKeown, K.R.: Predicting the semantic orientation of adjectives. In: Proceedings of the eighth conference on European chapter of the Association for Computational Linguistics, Morristown, NJ, pp. 174–181. Association for Computational Linguistics (1997)
Google Scholar
Kilgarriff, A., Evans, R., Koeling, R., Rundell, M., Tugwell, D.: Waspbench: A lexicographer’s workbench supporting state-of-the-art word sense disambiguation. In: 10th Conference on European Chapter of the Association For Computational Linguistics, Morristown, USA, Association for Computational Linguistics (2003)
Google Scholar
Kilgarriff, A., Rychl, P., Smrz, P., Tugwell, D.: The Sketch engine. In: 11th Euralex international Congress. Lorient, France (2004)
Google Scholar
Kushal, D., Lawrence, S., Pennock, D.M.: Mining the peanut gallery: Opinion extraction and semantic classification of product reviews. In: WWW 2003: Proceedings of the twelfth international conference on World Wide Web, pp. 519–528. ACM Press, New York (2003)
Google Scholar
Liu, W., Weichselbraun, A., Scharl, A., Chang, E.: Semi-automatic ontology extension using spreading activation. Journal of Universal Knowledge Management (1), 50–58 (2005), http://www.jukm.org/jukm_0_1/semi_automatic_ontology_extension
Manning, C.D., Schütze, H.: Foundations of Statistical Natural Language Processing. The MIT Press, Cambridge (1999)
MATH Google Scholar
Mullen, T., Collier, N.: Sentiment analysis using support vector machines with diverse information sources (2004)
Google Scholar
Pang, B., Lee, L.: A Sentimental Education: Sentiment Analysis Using Subjectivity Summarization Based on Minimum Cuts (September 2004)
Google Scholar
Pang, B., Lee, L., Vaithyanathan, S.: Thumbs up? Sentiment Classification using Machine Learning Techniques. In: Proceedings of the 2002 Conference on Empirical Methods in Natural Language Processing, EMNLP (2002)
Google Scholar
Ratnaparkhi, A.: Maximum entropy models for natural language ambiguity resolution (1998)
Google Scholar
Riloff, E., Wiebe, J.: Learning extraction patterns for subjective expressions. In: Proceedings of the 2003 Conference on Empirical Methods in Natural Language Processing (EMNLP 2003) (2003)
Google Scholar
Scharl, A., Dickinger, A., Weichselbraun, A.: Analyzing news media coverage to acquire and structure tourism knowledge. Information Technology and Tourism 10(1), 3–17 (2008)
Article Google Scholar
Scharl, A., Pollach, I., Bauer, C.: Determining the semantic orientation of web-based corpora. In: Liu, J., Cheung, Y.-m., Yin, H. (eds.) IDEAL 2003. LNCS, vol. 2690, pp. 840–849. Springer, Heidelberg (2003)
Google Scholar
Scharl, A., Weichselbraun, A.: An automated approach to investigating the online media coverage of us presidential elections. Journal of Information Technology & Politics 5(1), 121–132 (2008)
Article Google Scholar
Stone, P.J.: The General Inquirer: A Computer Approach to Content Analysis. The MIT Press, Cambridge (1966)
Google Scholar
Subasic, P., Huettner, A.: Affect analysis of text using fuzzy semantic typing. IEEE Transaction on Fuzzy Systems 9(4), 483–496 (2001)
Article Google Scholar
Weichselbraun, A.: Ontologiebasierende Textklassifikation mittels mathematischer Verfahren. PhD thesis, Vienna University of Economics and Business Administration (2004)
Google Scholar
Weichselbraun, A., Wohlgenannt, G., Scharl, A., Granitzer, M., Neidhart, T., Juffinger, A.: Applying vector space models to ontology link type suggestion. In: 4th International Conference on Innovations in Information Technology, Dubai, United Arab Emirates, pp. 566–570. IEEE Press, Los Alamitos (2007)
Google Scholar
Whitelaw, C., Garg, N., Argamon, S.: Using Appraisal Taxonomies for Sentiment Analysis. In: Proceedings of MCLC 2005, the 2nd Midwest Computational Linguistic Colloquium, Columbus, US (2005)
Google Scholar
Wiebe, J., Riloff, E.: Creating subjective and objective sentence classifiers from unannotated texts. In: Gelbukh, A. (ed.) CICLing 2005. LNCS, vol. 3406, pp. 486–497. Springer, Heidelberg (2005)
Google Scholar
Wilson, T., Wiebe, J., Hoffmann, P.: Recognizing contextual polarity in phrase-level sentiment analysis. In: Proceedings of Human Language Technologies Conference/Conference on Empirical Methods in Natural Language Processing (HLT/EMNLP 2005), Vancouver, CA (2005)
Google Scholar
Yu, H., Hatzivassiloglou, V.: Towards answering opinion questions: Separating facts from opinions and identifying the polarity of opinion sentences. In: Collins, M., Steedman, M. (eds.) Proceedings of EMNLP 2003, 8th Conference on Empirical Methods in Natural Language Processing, Sapporo, JP, pp. 129–136 (2003)
Google Scholar
Zhang, H.: The optimality of naive bayes. In: Barr, V., Markov, Z. (eds.) FLAIRS Conference. AAAI Press, Menlo Park (2004)
Google Scholar

Download references

Author information

Authors and Affiliations

Department of New Media Technology, MODUL University Vienna, Austria
Stefan Gindl, Johannes Liegl & Arno Scharl
Research Institute for Computational Methods, Vienna University of Economics and Business Administration, Austria
Albert Weichselbraun

Authors

Stefan Gindl
View author publications
You can also search for this author in PubMed Google Scholar
Johannes Liegl
View author publications
You can also search for this author in PubMed Google Scholar
Arno Scharl
View author publications
You can also search for this author in PubMed Google Scholar
Albert Weichselbraun
View author publications
You can also search for this author in PubMed Google Scholar

Editor information

Editors and Affiliations

SemanticWeb School, Zentrum fü rWissenstransfer , Lerchenfelder Gürtel 43, 1160, Wien Top 5/2, Austria
Tassilo Pellegrini
Institute for Applied Informatics, University of Leipzig , Johannisgasse 26, 04109, Leipzig, Germany
Sóren Auer
Know-Center GmbH , Inffeldgasse 21, 8010, Graz, Austria
Klaus Tochtermann
ForschungsgesellschaftmbH, Salzburg Research , Jakob Haringer Straße 5/III, 5020, Salzburg, Austria
Sebastian Schaffert

Rights and permissions

Reprints and permissions

Copyright information

About this chapter

Cite this chapter

Gindl, S., Liegl, J., Scharl, A., Weichselbraun, A. (2009). An Evaluation Framework and Adaptive Architecture for Automated Sentiment Detection. In: Pellegrini, T., Auer, S., Tochtermann, K., Schaffert, S. (eds) Networked Knowledge - Networked Media. Studies in Computational Intelligence, vol 221. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-02184-8_15

Download citation

DOI: https://doi.org/10.1007/978-3-642-02184-8_15
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-642-02183-1
Online ISBN: 978-3-642-02184-8
eBook Packages: EngineeringEngineering (R0)

Publish with us

Policies and ethics