Abstract
Sentiment lexicons are an essential component on most state-of-the-art sentiment analysis methods. However, the terms included are usually restricted to verbs and adjectives because they (1) usually have similar meanings among different domains and (2) are the main indicators of subjectivity in the text. This can lead to a problem in the classification of short informal texts since sometimes the absence of these types of parts of speech does not mean an absence of sentiment. Therefore, our hypothesis states that knowledge of terms regarding certain events and respective sentiment (public opinion) can improve the task of sentiment analysis. Consequently, to complement traditional sentiment dictionaries, we present a system for lexicon expansion that extracts the most relevant terms from news and assesses their positive or negative score through Twitter. Preliminary results on a labelled dataset show that our complementary lexicons increase the performance of three state-of-the-art sentiment systems, therefore proving the effectiveness of our approach.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
References
Amazon: Amazon mechanical turk. https://www.mturk.com/mturk/welcome (2016). Accessed 21 Aug 2016
Apache: Opennlp: http://opennlp.apache.org (2010). Accessed 07 Mar 2016
Baccianella, S., Esuli, A., Sebastiani, F.: Sentiwordnet 3.0: an enhanced lexical resource for sentiment analysis and opinion mining. In: Calzolari, N., Choukri, K., Maegaard, B., Mariani, J., Odijk, J., Piperidis, S., Rosner, M., Tapias, D. (eds.) Proceedings of the Seventh International Conference on Language Resources and Evaluation (LREC’10), Valletta. European Language Resources Association (ELRA), Paris (2010)
Bradley, M.M., Lang, P.J.: Affective norms for English words (ANEW): stimuli, instruction manual, and affective ratings. Technical Report, Center for Research in Psychophysiology, University of Florida, Gainesville (1999)
Butler, S.: The Macquarie thesaurus/[general editor] J.R.L. Bernard, new budget edn. Herron Publications, West End (1987)
Cambria, E., Olsher, D., Rajagopal, D.: Senticnet 3: a common and common-sense knowledge base for cognition-driven sentiment analysis. In: Proceedings of the Twenty-Eighth AAAI Conference on Artificial Intelligence, AAAI’14, pp. 1515–1521. AAAI Press, Palo Alto (2014)
Crowdflower: Data for everyone. http://www.crowdflower.com/data-for-everyone/ (2016). Accessed 10 Apr 2016
Esuli, A., Sebastiani, F.: Sentiwordnet: a publicly available lexical resource for opinion mining. In: Proceedings of the 5th Conference on Language Resources and Evaluation (LREC’06), pp. 417–422 (2006)
Facebook: Facebook Graph API. https://developers.facebook.com/docs/graph-api/reference/v2.8/object/comments (2016). Accessed 19 Oct 2016
Fellbaum, C. (ed.): WordNet: An Electronic Lexical Database. MIT Press, Cambridge (1998)
Foster, A.: Terror attacks timeline: from Paris and Brussels terror to most recent attacks in Europe. http://www.express.co.uk/news/world/693421/Terror-attacks-timeline-France-Brussels-Europe-ISIS-killings-Germany-dates-terrorism (2016). Accessed 21 Aug 2016
Go, A., Bhayani, R., Huang, L.: Twitter sentiment classification using distant supervision. CS224N Project Report, Stanford 1, 12 (2009)
Gonçalves, P., Araújo, M., Benevenuto, F., Cha, M.: Comparing and combining sentiment analysis methods. In: Proceedings of the First ACM Conference on Online Social Networks, pp. 27–38. ACM, New York (2013)
Guimaraes, N., Torgo, L., Figueira, A.: Lexicon expansion system for domain and time oriented sentiment analysis. In: Proceedings of the 8th International Joint Conference on Knowledge Discovery, Knowledge Engineering and Knowledge Management - Volume 1: KDIR (IC3K 2016), pp. 463–471 (2016)
Hannak, A., Anderson, E., Barrett, L.F., Lehmann, S., Mislove, A., Riedewald, M.: Tweetin’ in the rain: exploring societal-scale effects of weather on mood. In: Proceedings of the 6th International Conference on Weblogs and Social Media (2012)
Hatzivassiloglou, V., McKeown, K.R.: Predicting the semantic orientation of adjectives. In: Proceedings of the 35th Annual Meeting of the Association for Computational Linguistics and Eighth Conference of the European Chapter of the Association for Computational Linguistics, ACL’98, pp. 174–181. Association for Computational Linguistics, Stroudsburg (1997)
Haynie, D.: The U.S. and U.K. are the world’s most influential countries, survey finds. www.usnews.com/news/best-countries/best-international-influence (2015). Accessed 23 May 2016
Hu, M., Liu, B.: Mining and summarizing customer reviews. In: Proceedings of the Tenth ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, KDD’04, pp. 168–177. ACM, New York (2004)
Hutto, C.J., Gilbert, E.: Vader: a parsimonious rule-based model for sentiment analysis of social media text. In: Adar, E., Resnick, P., Choudhury, M.D., Hogan, B., Oh, A.H. (eds.) ICWSM. The AAAI Press, Menlo Park (2014)
Kim, S.-M., Hovy, E.: Determining the sentiment of opinions. In: Proceedings of the 20th International Conference on Computational Linguistics, COLING’04. Association for Computational Linguistics, Stroudsburg (2004)
Levallois, C.: Umigon: sentiment analysis for tweets based on lexicons and heuristics. In: Proceedings of 7th International Workshop on Semantic Evaluation (SemEval 2013), Atlanta (2013). Zenodo
LikeAlyzer: Likealyzer: analyze and monitor your Facebook pages. http://likealyzer.com/ (2016). Accessed 21 Sept 2016
Messias, J., Diniz, J.P., Soares, E., Ferreira, M., Araújo, M., Bastos, L., Miranda, M., Benevenuto, F.: Towards sentiment analysis for mobile devices. In: Kumar, R., Caverlee, J., Tong, H. (eds.) 2016 IEEE/ACM International Conference on Advances in Social Networks Analysis and Mining, ASONAM 2016, San Francisco, 18–21 August 2016, pp. 1390–1391. IEEE Computer Society, New York (2016)
Mohammad, S.M.: #emotional tweets. In: Proceedings of the First Joint Conference on Lexical and Computational Semantics - Volume 1: Proceedings of the Main Conference and the Shared Task, and Volume 2: Proceedings of the Sixth International Workshop on Semantic Evaluation, SemEval’12, pp. 246–255. Association for Computational Linguistics, Stroudsburg (2012)
Mohammad, S.M., Turney, P.D.: Emotions evoked by common words and phrases: using Mechanical Turk to create an emotion lexicon. In: Proceedings of the NAACL HLT 2010 Workshop on Computational Approaches to Analysis and Generation of Emotion in Text, CAAGET’10, pp. 26–34. Association for Computational Linguistics, Stroudsburg (2010)
Mohammad, S., Dunne, C., Dorr, B.: Generating high-coverage semantic orientation lexicons from overtly marked words and a thesaurus. In: Proceedings of the 2009 Conference on Empirical Methods in Natural Language Processing: Volume 2 - Volume 2, EMNLP’09, pp. 599–608. Association for Computational Linguistics, Stroudsburg (2009)
Nielsen, F.Å.: A new ANEW: evaluation of a word list for sentiment analysis in Microblogs. In: Proceedings of the ESWC2011 Workshop on “Making Sense of Microposts”: Big Things Come in Small Packages, pp. 93–98 (2011)
Oxford: Oxford Learner’s Dictionaries topic dictionaries. http://www.oxfordlearnersdictionaries.com/topic/ (2016). Accessed 03 Jul 2016
Pappas, N., Popescu-Belis, A.: Sentiment analysis of user comments for one-class collaborative filtering over ted talks. In: 36th ACM SIGIR Conference on Research and Development in Information Retrieval. ACM, New York (2013)
Phipps, C.: Brussels: Islamic state launches attacks on airport and station – as it happened. https://www.theguardian.com/world/live/2016/mar/22/brussels-airport-explosions-live-updates (2015). Accessed 28 Sept 2016
Qiu, G., Liu, B., Bu, J., Chen, C.: Opinion word expansion and target extraction through double propagation. Comput. Linguist. 37(1), 9–27 (2011)
Ribeiro, F.N., Araújo, M., Gonçalves, P., Gonçalves, M.A., Benevenuto, F.: Sentibench-a benchmark comparison of state-of-the-practice sentiment analysis methods. EPJ Data Sci. 5(1), 1–29 (2016)
Socher, R., Perelygin, A., Wu, J.Y., Chuang, J., Manning, C.D., Ng, A.Y., Potts, C.: Recursive deep models for semantic compositionality over a sentiment Treebank. In: Proceedings of the Conference on Empirical Methods in Natural Language Processing (2013)
Stone, P.J., Dunphy, D.C., Smith, M.S., Ogilvie, D.M.: The General Inquirer: A Computer Approach to Content Analysis. MIT Press, Cambridge (1966)
Taboada, M., Brooke, J., Tofiloski, M., Voll, K., Stede, M.: Lexicon-based methods for sentiment analysis. Comput. Linguist. 37(2), 267–307 (2011)
Tenuto, J.: Classification accuracy is not enough: more performance measures you can use. http://machinelearningmastery.com/classification-accuracy-is-not-enough-more-performance-measures-you-can-use/ (2014). Accessed 20 Aug 2016
Tenuto, J.: #deflategate was just a chance for us to make some really bad jokes. https://www.crowdflower.com/deflategate-sentiment/ (2015). Accessed 12 Aug 2016
Thelwall, M.: Heart and soul: sentiment strength detection in the social web with Sentistrength. In: Proceedings of CyberEmotions, p. 1–14 (2013)
Thelwall, M., Buckley, K., Paltoglou, G., Cai, D., Kappas, A.: Sentiment in short strength detection informal text. J. Am. Soc. Inf. Sci. Technol. 61(12), 2544–2558 (2010)
Thelwall, M., Buckley, K., Paltoglou, G.: Sentiment strength detection for the social web. J. Am. Soc. Inf. Sci. Technol. 63(1), 163–173 (2012)
Twitter: Rest API documentation. https://dev.twitter.com/rest/public (2015). Accessed 19 Oct 2015
Twitter: Twitter Developers . https://dev.twitter.com/ (2016). Accessed 08 Mar 2016
Valitutti, R.: Wordnet-affect: an affective extension of wordnet. In: Proceedings of the 4th International Conference on Language Resources and Evaluation, pp. 1083–1086 (2004)
Wang, H., Can, D., Kazemzadeh, A., Bar, F., Narayanan, S.: A system for real-time twitter sentiment analysis of 2012 U.S. presidential election cycle. In: Proceedings of the ACL 2012 System Demonstrations, ACL’12, pp. 115–120. Association for Computational Linguistics, Stroudsburg (2012)
Wilson, T., Hoffmann, P., Somasundaran, S., Kessler, J., Wiebe, J., Choi, Y., Cardie, C., Riloff, E., Patwardhan, S.: Opinionfinder: a system for subjectivity analysis. In: Proceedings of HLT/EMNLP on Interactive Demonstrations, HLT-Demo’05, pp. 34–35. Association for Computational Linguistics, Stroudsburg (2005)
Acknowledgements
This work is supported by the ERDF European Regional Development Fund through the COMPETE Programme (operational programme for competitiveness) and by National Funds through the FCT (Portuguese Foundation for Science and Technology) within project Reminds/UTAP-ICDT/EEI-CTP/0022/2014.
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2018 Springer International Publishing AG, part of Springer Nature
About this chapter
Cite this chapter
Guimarães, N., Torgo, L., Figueira, Á. (2018). Twitter as a Source for Time- and Domain-Dependent Sentiment Lexicons. In: Kaya, M., Kawash, J., Khoury, S., Day, MY. (eds) Social Network Based Big Data Analysis and Applications. Lecture Notes in Social Networks. Springer, Cham. https://doi.org/10.1007/978-3-319-78196-9_1
Download citation
DOI: https://doi.org/10.1007/978-3-319-78196-9_1
Published:
Publisher Name: Springer, Cham
Print ISBN: 978-3-319-78195-2
Online ISBN: 978-3-319-78196-9
eBook Packages: Social SciencesSocial Sciences (R0)