Twitter as a Source for Time- and Domain-Dependent Sentiment Lexicons

Guimarães, Nuno; Torgo, Luís; Figueira, Álvaro

doi:10.1007/978-3-319-78196-9_1

Nuno Guimarães¹⁷,
Luís Torgo¹⁸ &
Álvaro Figueira¹⁷

Part of the book series: Lecture Notes in Social Networks ((LNSN))

786 Accesses
3 Citations

Abstract

Sentiment lexicons are an essential component on most state-of-the-art sentiment analysis methods. However, the terms included are usually restricted to verbs and adjectives because they (1) usually have similar meanings among different domains and (2) are the main indicators of subjectivity in the text. This can lead to a problem in the classification of short informal texts since sometimes the absence of these types of parts of speech does not mean an absence of sentiment. Therefore, our hypothesis states that knowledge of terms regarding certain events and respective sentiment (public opinion) can improve the task of sentiment analysis. Consequently, to complement traditional sentiment dictionaries, we present a system for lexicon expansion that extracts the most relevant terms from news and assesses their positive or negative score through Twitter. Preliminary results on a labelled dataset show that our complementary lexicons increase the performance of three state-of-the-art sentiment systems, therefore proving the effectiveness of our approach.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Log in via an institution

Chapter: USD 29.95; Price excludes VAT (USA)

eBook: USD 79.99; Price excludes VAT (USA)

Softcover Book: USD 99.99; Price excludes VAT (USA)

Hardcover Book: USD 109.99; Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

References

Amazon: Amazon mechanical turk. https://www.mturk.com/mturk/welcome (2016). Accessed 21 Aug 2016
Apache: Opennlp: http://opennlp.apache.org (2010). Accessed 07 Mar 2016
Baccianella, S., Esuli, A., Sebastiani, F.: Sentiwordnet 3.0: an enhanced lexical resource for sentiment analysis and opinion mining. In: Calzolari, N., Choukri, K., Maegaard, B., Mariani, J., Odijk, J., Piperidis, S., Rosner, M., Tapias, D. (eds.) Proceedings of the Seventh International Conference on Language Resources and Evaluation (LREC’10), Valletta. European Language Resources Association (ELRA), Paris (2010)
Google Scholar
Bradley, M.M., Lang, P.J.: Affective norms for English words (ANEW): stimuli, instruction manual, and affective ratings. Technical Report, Center for Research in Psychophysiology, University of Florida, Gainesville (1999)
Google Scholar
Butler, S.: The Macquarie thesaurus/[general editor] J.R.L. Bernard, new budget edn. Herron Publications, West End (1987)
Google Scholar
Cambria, E., Olsher, D., Rajagopal, D.: Senticnet 3: a common and common-sense knowledge base for cognition-driven sentiment analysis. In: Proceedings of the Twenty-Eighth AAAI Conference on Artificial Intelligence, AAAI’14, pp. 1515–1521. AAAI Press, Palo Alto (2014)
Google Scholar
Crowdflower: Data for everyone. http://www.crowdflower.com/data-for-everyone/ (2016). Accessed 10 Apr 2016
Esuli, A., Sebastiani, F.: Sentiwordnet: a publicly available lexical resource for opinion mining. In: Proceedings of the 5th Conference on Language Resources and Evaluation (LREC’06), pp. 417–422 (2006)
Google Scholar
Facebook: Facebook Graph API. https://developers.facebook.com/docs/graph-api/reference/v2.8/object/comments (2016). Accessed 19 Oct 2016
Fellbaum, C. (ed.): WordNet: An Electronic Lexical Database. MIT Press, Cambridge (1998)
Google Scholar
Foster, A.: Terror attacks timeline: from Paris and Brussels terror to most recent attacks in Europe. http://www.express.co.uk/news/world/693421/Terror-attacks-timeline-France-Brussels-Europe-ISIS-killings-Germany-dates-terrorism (2016). Accessed 21 Aug 2016
Go, A., Bhayani, R., Huang, L.: Twitter sentiment classification using distant supervision. CS224N Project Report, Stanford 1, 12 (2009)
Google Scholar
Gonçalves, P., Araújo, M., Benevenuto, F., Cha, M.: Comparing and combining sentiment analysis methods. In: Proceedings of the First ACM Conference on Online Social Networks, pp. 27–38. ACM, New York (2013)
Google Scholar
Guimaraes, N., Torgo, L., Figueira, A.: Lexicon expansion system for domain and time oriented sentiment analysis. In: Proceedings of the 8th International Joint Conference on Knowledge Discovery, Knowledge Engineering and Knowledge Management - Volume 1: KDIR (IC3K 2016), pp. 463–471 (2016)
Google Scholar
Hannak, A., Anderson, E., Barrett, L.F., Lehmann, S., Mislove, A., Riedewald, M.: Tweetin’ in the rain: exploring societal-scale effects of weather on mood. In: Proceedings of the 6th International Conference on Weblogs and Social Media (2012)
Google Scholar
Hatzivassiloglou, V., McKeown, K.R.: Predicting the semantic orientation of adjectives. In: Proceedings of the 35th Annual Meeting of the Association for Computational Linguistics and Eighth Conference of the European Chapter of the Association for Computational Linguistics, ACL’98, pp. 174–181. Association for Computational Linguistics, Stroudsburg (1997)
Google Scholar
Haynie, D.: The U.S. and U.K. are the world’s most influential countries, survey finds. www.usnews.com/news/best-countries/best-international-influence (2015). Accessed 23 May 2016
Hu, M., Liu, B.: Mining and summarizing customer reviews. In: Proceedings of the Tenth ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, KDD’04, pp. 168–177. ACM, New York (2004)
Google Scholar
Hutto, C.J., Gilbert, E.: Vader: a parsimonious rule-based model for sentiment analysis of social media text. In: Adar, E., Resnick, P., Choudhury, M.D., Hogan, B., Oh, A.H. (eds.) ICWSM. The AAAI Press, Menlo Park (2014)
Google Scholar
Kim, S.-M., Hovy, E.: Determining the sentiment of opinions. In: Proceedings of the 20th International Conference on Computational Linguistics, COLING’04. Association for Computational Linguistics, Stroudsburg (2004)
Google Scholar
Levallois, C.: Umigon: sentiment analysis for tweets based on lexicons and heuristics. In: Proceedings of 7th International Workshop on Semantic Evaluation (SemEval 2013), Atlanta (2013). Zenodo
Google Scholar
LikeAlyzer: Likealyzer: analyze and monitor your Facebook pages. http://likealyzer.com/ (2016). Accessed 21 Sept 2016
Messias, J., Diniz, J.P., Soares, E., Ferreira, M., Araújo, M., Bastos, L., Miranda, M., Benevenuto, F.: Towards sentiment analysis for mobile devices. In: Kumar, R., Caverlee, J., Tong, H. (eds.) 2016 IEEE/ACM International Conference on Advances in Social Networks Analysis and Mining, ASONAM 2016, San Francisco, 18–21 August 2016, pp. 1390–1391. IEEE Computer Society, New York (2016)
Google Scholar
Mohammad, S.M.: #emotional tweets. In: Proceedings of the First Joint Conference on Lexical and Computational Semantics - Volume 1: Proceedings of the Main Conference and the Shared Task, and Volume 2: Proceedings of the Sixth International Workshop on Semantic Evaluation, SemEval’12, pp. 246–255. Association for Computational Linguistics, Stroudsburg (2012)
Google Scholar
Mohammad, S.M., Turney, P.D.: Emotions evoked by common words and phrases: using Mechanical Turk to create an emotion lexicon. In: Proceedings of the NAACL HLT 2010 Workshop on Computational Approaches to Analysis and Generation of Emotion in Text, CAAGET’10, pp. 26–34. Association for Computational Linguistics, Stroudsburg (2010)
Google Scholar
Mohammad, S., Dunne, C., Dorr, B.: Generating high-coverage semantic orientation lexicons from overtly marked words and a thesaurus. In: Proceedings of the 2009 Conference on Empirical Methods in Natural Language Processing: Volume 2 - Volume 2, EMNLP’09, pp. 599–608. Association for Computational Linguistics, Stroudsburg (2009)
Google Scholar
Nielsen, F.Å.: A new ANEW: evaluation of a word list for sentiment analysis in Microblogs. In: Proceedings of the ESWC2011 Workshop on “Making Sense of Microposts”: Big Things Come in Small Packages, pp. 93–98 (2011)
Google Scholar
Oxford: Oxford Learner’s Dictionaries topic dictionaries. http://www.oxfordlearnersdictionaries.com/topic/ (2016). Accessed 03 Jul 2016
Pappas, N., Popescu-Belis, A.: Sentiment analysis of user comments for one-class collaborative filtering over ted talks. In: 36th ACM SIGIR Conference on Research and Development in Information Retrieval. ACM, New York (2013)
Google Scholar
Phipps, C.: Brussels: Islamic state launches attacks on airport and station – as it happened. https://www.theguardian.com/world/live/2016/mar/22/brussels-airport-explosions-live-updates (2015). Accessed 28 Sept 2016
Qiu, G., Liu, B., Bu, J., Chen, C.: Opinion word expansion and target extraction through double propagation. Comput. Linguist. 37(1), 9–27 (2011)
Article Google Scholar
Ribeiro, F.N., Araújo, M., Gonçalves, P., Gonçalves, M.A., Benevenuto, F.: Sentibench-a benchmark comparison of state-of-the-practice sentiment analysis methods. EPJ Data Sci. 5(1), 1–29 (2016)
Google Scholar
Socher, R., Perelygin, A., Wu, J.Y., Chuang, J., Manning, C.D., Ng, A.Y., Potts, C.: Recursive deep models for semantic compositionality over a sentiment Treebank. In: Proceedings of the Conference on Empirical Methods in Natural Language Processing (2013)
Google Scholar
Stone, P.J., Dunphy, D.C., Smith, M.S., Ogilvie, D.M.: The General Inquirer: A Computer Approach to Content Analysis. MIT Press, Cambridge (1966)
Google Scholar
Taboada, M., Brooke, J., Tofiloski, M., Voll, K., Stede, M.: Lexicon-based methods for sentiment analysis. Comput. Linguist. 37(2), 267–307 (2011)
Article Google Scholar
Tenuto, J.: Classification accuracy is not enough: more performance measures you can use. http://machinelearningmastery.com/classification-accuracy-is-not-enough-more-performance-measures-you-can-use/ (2014). Accessed 20 Aug 2016
Tenuto, J.: #deflategate was just a chance for us to make some really bad jokes. https://www.crowdflower.com/deflategate-sentiment/ (2015). Accessed 12 Aug 2016
Thelwall, M.: Heart and soul: sentiment strength detection in the social web with Sentistrength. In: Proceedings of CyberEmotions, p. 1–14 (2013)
Google Scholar
Thelwall, M., Buckley, K., Paltoglou, G., Cai, D., Kappas, A.: Sentiment in short strength detection informal text. J. Am. Soc. Inf. Sci. Technol. 61(12), 2544–2558 (2010)
Article Google Scholar
Thelwall, M., Buckley, K., Paltoglou, G.: Sentiment strength detection for the social web. J. Am. Soc. Inf. Sci. Technol. 63(1), 163–173 (2012)
Article Google Scholar
Twitter: Rest API documentation. https://dev.twitter.com/rest/public (2015). Accessed 19 Oct 2015
Twitter: Twitter Developers . https://dev.twitter.com/ (2016). Accessed 08 Mar 2016
Valitutti, R.: Wordnet-affect: an affective extension of wordnet. In: Proceedings of the 4th International Conference on Language Resources and Evaluation, pp. 1083–1086 (2004)
Google Scholar
Wang, H., Can, D., Kazemzadeh, A., Bar, F., Narayanan, S.: A system for real-time twitter sentiment analysis of 2012 U.S. presidential election cycle. In: Proceedings of the ACL 2012 System Demonstrations, ACL’12, pp. 115–120. Association for Computational Linguistics, Stroudsburg (2012)
Google Scholar
Wilson, T., Hoffmann, P., Somasundaran, S., Kessler, J., Wiebe, J., Choi, Y., Cardie, C., Riloff, E., Patwardhan, S.: Opinionfinder: a system for subjectivity analysis. In: Proceedings of HLT/EMNLP on Interactive Demonstrations, HLT-Demo’05, pp. 34–35. Association for Computational Linguistics, Stroudsburg (2005)
Google Scholar

Download references

Acknowledgements

This work is supported by the ERDF European Regional Development Fund through the COMPETE Programme (operational programme for competitiveness) and by National Funds through the FCT (Portuguese Foundation for Science and Technology) within project Reminds/UTAP-ICDT/EEI-CTP/0022/2014.

Author information

Authors and Affiliations

CRACS - INESC TEC & University of Porto, Porto, Portugal
Nuno Guimarães & Álvaro Figueira
LIAAD - INESC TEC & University of Porto, Porto, Portugal
Luís Torgo

Authors

Nuno Guimarães
View author publications
You can also search for this author in PubMed Google Scholar
Luís Torgo
View author publications
You can also search for this author in PubMed Google Scholar
Álvaro Figueira
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Nuno Guimarães .

Editor information

Editors and Affiliations

Department of Computer Engineering, Firat University, Elazig, Turkey
Mehmet Kaya
Department of Computer Science, University of Calgary, Calgary, Alberta, Canada
Jalal Kawash
American University of Sharjah, Sharjah, United Arab Emirates
Suheil Khoury
Tamkang University, Taipei, Taiwan
Min-Yuh Day

Rights and permissions

Reprints and permissions

Copyright information

About this chapter

Cite this chapter

Guimarães, N., Torgo, L., Figueira, Á. (2018). Twitter as a Source for Time- and Domain-Dependent Sentiment Lexicons. In: Kaya, M., Kawash, J., Khoury, S., Day, MY. (eds) Social Network Based Big Data Analysis and Applications. Lecture Notes in Social Networks. Springer, Cham. https://doi.org/10.1007/978-3-319-78196-9_1

Download citation

DOI: https://doi.org/10.1007/978-3-319-78196-9_1
Published: 11 May 2018
Publisher Name: Springer, Cham
Print ISBN: 978-3-319-78195-2
Online ISBN: 978-3-319-78196-9
eBook Packages: Social SciencesSocial Sciences (R0)

Publish with us

Policies and ethics