Skip to main content

Twitter as a Source for Time- and Domain-Dependent Sentiment Lexicons

  • Chapter
  • First Online:
Social Network Based Big Data Analysis and Applications

Part of the book series: Lecture Notes in Social Networks ((LNSN))

Abstract

Sentiment lexicons are an essential component on most state-of-the-art sentiment analysis methods. However, the terms included are usually restricted to verbs and adjectives because they (1) usually have similar meanings among different domains and (2) are the main indicators of subjectivity in the text. This can lead to a problem in the classification of short informal texts since sometimes the absence of these types of parts of speech does not mean an absence of sentiment. Therefore, our hypothesis states that knowledge of terms regarding certain events and respective sentiment (public opinion) can improve the task of sentiment analysis. Consequently, to complement traditional sentiment dictionaries, we present a system for lexicon expansion that extracts the most relevant terms from news and assesses their positive or negative score through Twitter. Preliminary results on a labelled dataset show that our complementary lexicons increase the performance of three state-of-the-art sentiment systems, therefore proving the effectiveness of our approach.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 79.99
Price excludes VAT (USA)
  • Available as EPUB and PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 99.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info
Hardcover Book
USD 109.99
Price excludes VAT (USA)
  • Durable hardcover edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

References

  1. Amazon: Amazon mechanical turk. https://www.mturk.com/mturk/welcome (2016). Accessed 21 Aug 2016

  2. Apache: Opennlp: http://opennlp.apache.org (2010). Accessed 07 Mar 2016

  3. Baccianella, S., Esuli, A., Sebastiani, F.: Sentiwordnet 3.0: an enhanced lexical resource for sentiment analysis and opinion mining. In: Calzolari, N., Choukri, K., Maegaard, B., Mariani, J., Odijk, J., Piperidis, S., Rosner, M., Tapias, D. (eds.) Proceedings of the Seventh International Conference on Language Resources and Evaluation (LREC’10), Valletta. European Language Resources Association (ELRA), Paris (2010)

    Google Scholar 

  4. Bradley, M.M., Lang, P.J.: Affective norms for English words (ANEW): stimuli, instruction manual, and affective ratings. Technical Report, Center for Research in Psychophysiology, University of Florida, Gainesville (1999)

    Google Scholar 

  5. Butler, S.: The Macquarie thesaurus/[general editor] J.R.L. Bernard, new budget edn. Herron Publications, West End (1987)

    Google Scholar 

  6. Cambria, E., Olsher, D., Rajagopal, D.: Senticnet 3: a common and common-sense knowledge base for cognition-driven sentiment analysis. In: Proceedings of the Twenty-Eighth AAAI Conference on Artificial Intelligence, AAAI’14, pp. 1515–1521. AAAI Press, Palo Alto (2014)

    Google Scholar 

  7. Crowdflower: Data for everyone. http://www.crowdflower.com/data-for-everyone/ (2016). Accessed 10 Apr 2016

  8. Esuli, A., Sebastiani, F.: Sentiwordnet: a publicly available lexical resource for opinion mining. In: Proceedings of the 5th Conference on Language Resources and Evaluation (LREC’06), pp. 417–422 (2006)

    Google Scholar 

  9. Facebook: Facebook Graph API. https://developers.facebook.com/docs/graph-api/reference/v2.8/object/comments (2016). Accessed 19 Oct 2016

  10. Fellbaum, C. (ed.): WordNet: An Electronic Lexical Database. MIT Press, Cambridge (1998)

    Google Scholar 

  11. Foster, A.: Terror attacks timeline: from Paris and Brussels terror to most recent attacks in Europe. http://www.express.co.uk/news/world/693421/Terror-attacks-timeline-France-Brussels-Europe-ISIS-killings-Germany-dates-terrorism (2016). Accessed 21 Aug 2016

  12. Go, A., Bhayani, R., Huang, L.: Twitter sentiment classification using distant supervision. CS224N Project Report, Stanford 1, 12 (2009)

    Google Scholar 

  13. Gonçalves, P., Araújo, M., Benevenuto, F., Cha, M.: Comparing and combining sentiment analysis methods. In: Proceedings of the First ACM Conference on Online Social Networks, pp. 27–38. ACM, New York (2013)

    Google Scholar 

  14. Guimaraes, N., Torgo, L., Figueira, A.: Lexicon expansion system for domain and time oriented sentiment analysis. In: Proceedings of the 8th International Joint Conference on Knowledge Discovery, Knowledge Engineering and Knowledge Management - Volume 1: KDIR (IC3K 2016), pp. 463–471 (2016)

    Google Scholar 

  15. Hannak, A., Anderson, E., Barrett, L.F., Lehmann, S., Mislove, A., Riedewald, M.: Tweetin’ in the rain: exploring societal-scale effects of weather on mood. In: Proceedings of the 6th International Conference on Weblogs and Social Media (2012)

    Google Scholar 

  16. Hatzivassiloglou, V., McKeown, K.R.: Predicting the semantic orientation of adjectives. In: Proceedings of the 35th Annual Meeting of the Association for Computational Linguistics and Eighth Conference of the European Chapter of the Association for Computational Linguistics, ACL’98, pp. 174–181. Association for Computational Linguistics, Stroudsburg (1997)

    Google Scholar 

  17. Haynie, D.: The U.S. and U.K. are the world’s most influential countries, survey finds. www.usnews.com/news/best-countries/best-international-influence (2015). Accessed 23 May 2016

  18. Hu, M., Liu, B.: Mining and summarizing customer reviews. In: Proceedings of the Tenth ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, KDD’04, pp. 168–177. ACM, New York (2004)

    Google Scholar 

  19. Hutto, C.J., Gilbert, E.: Vader: a parsimonious rule-based model for sentiment analysis of social media text. In: Adar, E., Resnick, P., Choudhury, M.D., Hogan, B., Oh, A.H. (eds.) ICWSM. The AAAI Press, Menlo Park (2014)

    Google Scholar 

  20. Kim, S.-M., Hovy, E.: Determining the sentiment of opinions. In: Proceedings of the 20th International Conference on Computational Linguistics, COLING’04. Association for Computational Linguistics, Stroudsburg (2004)

    Google Scholar 

  21. Levallois, C.: Umigon: sentiment analysis for tweets based on lexicons and heuristics. In: Proceedings of 7th International Workshop on Semantic Evaluation (SemEval 2013), Atlanta (2013). Zenodo

    Google Scholar 

  22. LikeAlyzer: Likealyzer: analyze and monitor your Facebook pages. http://likealyzer.com/ (2016). Accessed 21 Sept 2016

  23. Messias, J., Diniz, J.P., Soares, E., Ferreira, M., Araújo, M., Bastos, L., Miranda, M., Benevenuto, F.: Towards sentiment analysis for mobile devices. In: Kumar, R., Caverlee, J., Tong, H. (eds.) 2016 IEEE/ACM International Conference on Advances in Social Networks Analysis and Mining, ASONAM 2016, San Francisco, 18–21 August 2016, pp. 1390–1391. IEEE Computer Society, New York (2016)

    Google Scholar 

  24. Mohammad, S.M.: #emotional tweets. In: Proceedings of the First Joint Conference on Lexical and Computational Semantics - Volume 1: Proceedings of the Main Conference and the Shared Task, and Volume 2: Proceedings of the Sixth International Workshop on Semantic Evaluation, SemEval’12, pp. 246–255. Association for Computational Linguistics, Stroudsburg (2012)

    Google Scholar 

  25. Mohammad, S.M., Turney, P.D.: Emotions evoked by common words and phrases: using Mechanical Turk to create an emotion lexicon. In: Proceedings of the NAACL HLT 2010 Workshop on Computational Approaches to Analysis and Generation of Emotion in Text, CAAGET’10, pp. 26–34. Association for Computational Linguistics, Stroudsburg (2010)

    Google Scholar 

  26. Mohammad, S., Dunne, C., Dorr, B.: Generating high-coverage semantic orientation lexicons from overtly marked words and a thesaurus. In: Proceedings of the 2009 Conference on Empirical Methods in Natural Language Processing: Volume 2 - Volume 2, EMNLP’09, pp. 599–608. Association for Computational Linguistics, Stroudsburg (2009)

    Google Scholar 

  27. Nielsen, F.Å.: A new ANEW: evaluation of a word list for sentiment analysis in Microblogs. In: Proceedings of the ESWC2011 Workshop on “Making Sense of Microposts”: Big Things Come in Small Packages, pp. 93–98 (2011)

    Google Scholar 

  28. Oxford: Oxford Learner’s Dictionaries topic dictionaries. http://www.oxfordlearnersdictionaries.com/topic/ (2016). Accessed 03 Jul 2016

  29. Pappas, N., Popescu-Belis, A.: Sentiment analysis of user comments for one-class collaborative filtering over ted talks. In: 36th ACM SIGIR Conference on Research and Development in Information Retrieval. ACM, New York (2013)

    Google Scholar 

  30. Phipps, C.: Brussels: Islamic state launches attacks on airport and station – as it happened. https://www.theguardian.com/world/live/2016/mar/22/brussels-airport-explosions-live-updates (2015). Accessed 28 Sept 2016

  31. Qiu, G., Liu, B., Bu, J., Chen, C.: Opinion word expansion and target extraction through double propagation. Comput. Linguist. 37(1), 9–27 (2011)

    Article  Google Scholar 

  32. Ribeiro, F.N., Araújo, M., Gonçalves, P., Gonçalves, M.A., Benevenuto, F.: Sentibench-a benchmark comparison of state-of-the-practice sentiment analysis methods. EPJ Data Sci. 5(1), 1–29 (2016)

    Google Scholar 

  33. Socher, R., Perelygin, A., Wu, J.Y., Chuang, J., Manning, C.D., Ng, A.Y., Potts, C.: Recursive deep models for semantic compositionality over a sentiment Treebank. In: Proceedings of the Conference on Empirical Methods in Natural Language Processing (2013)

    Google Scholar 

  34. Stone, P.J., Dunphy, D.C., Smith, M.S., Ogilvie, D.M.: The General Inquirer: A Computer Approach to Content Analysis. MIT Press, Cambridge (1966)

    Google Scholar 

  35. Taboada, M., Brooke, J., Tofiloski, M., Voll, K., Stede, M.: Lexicon-based methods for sentiment analysis. Comput. Linguist. 37(2), 267–307 (2011)

    Article  Google Scholar 

  36. Tenuto, J.: Classification accuracy is not enough: more performance measures you can use. http://machinelearningmastery.com/classification-accuracy-is-not-enough-more-performance-measures-you-can-use/ (2014). Accessed 20 Aug 2016

  37. Tenuto, J.: #deflategate was just a chance for us to make some really bad jokes. https://www.crowdflower.com/deflategate-sentiment/ (2015). Accessed 12 Aug 2016

  38. Thelwall, M.: Heart and soul: sentiment strength detection in the social web with Sentistrength. In: Proceedings of CyberEmotions, p. 1–14 (2013)

    Google Scholar 

  39. Thelwall, M., Buckley, K., Paltoglou, G., Cai, D., Kappas, A.: Sentiment in short strength detection informal text. J. Am. Soc. Inf. Sci. Technol. 61(12), 2544–2558 (2010)

    Article  Google Scholar 

  40. Thelwall, M., Buckley, K., Paltoglou, G.: Sentiment strength detection for the social web. J. Am. Soc. Inf. Sci. Technol. 63(1), 163–173 (2012)

    Article  Google Scholar 

  41. Twitter: Rest API documentation. https://dev.twitter.com/rest/public (2015). Accessed 19 Oct 2015

  42. Twitter: Twitter Developers . https://dev.twitter.com/ (2016). Accessed 08 Mar 2016

  43. Valitutti, R.: Wordnet-affect: an affective extension of wordnet. In: Proceedings of the 4th International Conference on Language Resources and Evaluation, pp. 1083–1086 (2004)

    Google Scholar 

  44. Wang, H., Can, D., Kazemzadeh, A., Bar, F., Narayanan, S.: A system for real-time twitter sentiment analysis of 2012 U.S. presidential election cycle. In: Proceedings of the ACL 2012 System Demonstrations, ACL’12, pp. 115–120. Association for Computational Linguistics, Stroudsburg (2012)

    Google Scholar 

  45. Wilson, T., Hoffmann, P., Somasundaran, S., Kessler, J., Wiebe, J., Choi, Y., Cardie, C., Riloff, E., Patwardhan, S.: Opinionfinder: a system for subjectivity analysis. In: Proceedings of HLT/EMNLP on Interactive Demonstrations, HLT-Demo’05, pp. 34–35. Association for Computational Linguistics, Stroudsburg (2005)

    Google Scholar 

Download references

Acknowledgements

This work is supported by the ERDF European Regional Development Fund through the COMPETE Programme (operational programme for competitiveness) and by National Funds through the FCT (Portuguese Foundation for Science and Technology) within project Reminds/UTAP-ICDT/EEI-CTP/0022/2014.

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Nuno Guimarães .

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2018 Springer International Publishing AG, part of Springer Nature

About this chapter

Check for updates. Verify currency and authenticity via CrossMark

Cite this chapter

Guimarães, N., Torgo, L., Figueira, Á. (2018). Twitter as a Source for Time- and Domain-Dependent Sentiment Lexicons. In: Kaya, M., Kawash, J., Khoury, S., Day, MY. (eds) Social Network Based Big Data Analysis and Applications. Lecture Notes in Social Networks. Springer, Cham. https://doi.org/10.1007/978-3-319-78196-9_1

Download citation

  • DOI: https://doi.org/10.1007/978-3-319-78196-9_1

  • Published:

  • Publisher Name: Springer, Cham

  • Print ISBN: 978-3-319-78195-2

  • Online ISBN: 978-3-319-78196-9

  • eBook Packages: Social SciencesSocial Sciences (R0)

Publish with us

Policies and ethics