Word sense disambiguation application in sentiment analysis of news headlines: an applied approach to FOREX market prediction

Article
  • 27 Downloads

Abstract

Sentiment analysis of textual content has become a popular approach for market prediction. However, lack of a process for word sense disambiguation makes it questionable whether the sentiment expressed by the context is correctly identified. Meanwhile, many studies in natural language processing have focused on word sense disambiguation. However, there has been a weak link between the two logically relevant fields of study. Therefore, with two motivations, we propose a system for FOREX market prediction that exploits word sense disambiguation in sentiment analysis of news headlines and predicts the directional movement of a currency pair. The first motivation is the implementation of a novel word sense disambiguation that can determine the proper senses of all significant words in a news headline. The main contributions of this work that make the first motivation possible, are the introduction of novel approaches termed Relevant Gloss Retrieval, Similarity Threshold, Verb Nominalization, and also optimization measures to decrease execution time. The second motivation is to prove that determination of proper senses of significant words in textual contents can improve the determination of sentiment, conveyed by the context, and consequently any application based on sentiment analysis. Inclusion of the word sense disambiguation into the proposed system proves the achievement of the second motivation. Carried out tests with the same dataset prove that the proposed system outperforms one of the best systems (to our best knowledge) proposed for market prediction and improves accuracy from 83.33% to 91.67%. The detail for reproduction of the system is amply provided.

Keywords

Sentiment analysis Semantic analysis Polysemous word Word sense disambiguation FOREX prediction 

Notes

Acknowledgements

We want to thank Arman Khadjeh Nassirtoussi (Nassirtoussi et al. 2015) for letting us have access to their data. Without using the same data, we could not compare the results.

References

  1. Anastasakis, L., & Mort, N. (2009). Exchange rate forecasting using a combined parametric and nonparametric self-organising modelling approach. Expert Systems with Applications, 36(10), 12001–12011.CrossRefGoogle Scholar
  2. Baccianella, S., Esuli, A., Sebastiani, F. (2010). Sentiwordnet 3.0: an enhanced lexical resource for sentiment analysis and opinion mining. In LREC (Vol. 10 pp. 2200–2204).Google Scholar
  3. Banerjee, S., & Pedersen, T. (2002). An adapted lesk algorithm for word sense disambiguation using wordnet. In International conference on intelligent text processing and computational linguistics (pp. 136–145). Berlin: Springer.Google Scholar
  4. Bollen, J., Mao, H., Zeng, X. (2011). Twitter mood predicts the stock market. Journal of Computational Science, 2(1), 1–8.CrossRefGoogle Scholar
  5. Dong, R., O’Mahony, M.P., Schaal, M., McCarthy, K., Smyth, B. (2016). Combining similarity and sentiment in opinion mining for product recommendation. Journal of Intelligent Information Systems, 46(2), 285–312.CrossRefGoogle Scholar
  6. Evans, M.D., & Lyons, R.K. (2008). How is macro news transmitted to exchange rates? Journal of Financial Economics, 88(1), 26–50.CrossRefGoogle Scholar
  7. Farooq, U., Dhamala, T.P., Nongaillard, A., Ouzrout, Y., Qadir, M.A. (2015). A word sense disambiguation method for feature level sentiment analysis. In 2015 9th international conference on software, knowledge, information management and applications (SKIMA) (pp. 1–8). IEEE.Google Scholar
  8. Fellbaum C. (1998). WordNet. Wiley Online Library.Google Scholar
  9. Hagenau, M., Liebmann, M., Neumann, D. (2013). Automated news reading: stock price prediction based on financial news using context-capturing features. Decision Support Systems, 55(3), 685– 697.CrossRefGoogle Scholar
  10. Howe, D.C. (2009). Rita: creativity support for computational literature. In Proceedings of the seventh ACM conference on creativity and cognition (pp. 205–210). ACM.Google Scholar
  11. Huang, C.J., Liao, J.J., Yang, D.X., Chang, T.Y., Luo, Y.C. (2010). Realization of a news dissemination agent based on weighted association rules and text mining techniques. Expert Systems with Applications, 37(9), 6409–6413.CrossRefGoogle Scholar
  12. Jiang, J.J., & Conrath, D.W. (1997). Semantic similarity based on corpus statistics and lexical taxonomy. arXiv:cmp-lg/9709008.
  13. Kehagias, A., Petridis, V., Kaburlasos, V.G., Fragkou, P. (2003). A comparison of word-and sense-based text categorization using several classification algorithms. Journal of Intelligent Information Systems, 21(3), 227–247.CrossRefGoogle Scholar
  14. Lesk, M. (1986). Automatic sense disambiguation using machine readable dictionaries: how to tell a pine cone from an ice cream cone. In Proceedings of the 5th annual international conference on systems documentation (pp. 24–26). ACM.Google Scholar
  15. Levinson, M. et al. (2014). The economist guide to financial markets: why they exist and how they work. The Economist, 1, 17. 6th edition.Google Scholar
  16. Li, X., Szpakowicz, S., Matwin, S. (1995). A wordnet-based algorithm for word sense disambiguation. In IJCAI (Vol. 95 pp. 1368–1374).Google Scholar
  17. Li, Q., Wang, T., Li, P., Liu, L., Gong, Q., Chen, Y. (2014). The effect of news and public mood on stock movements. Information Sciences, 278, 826–840.CrossRefGoogle Scholar
  18. Lin, D. et al. (1998). An information-theoretic definition of similarity. In Icml, (Vol. 98 pp. 296–304).Google Scholar
  19. Liu, Y., Scheuermann, P., Li, X., Zhu, X. (2007). Using wordnet to disambiguate word senses for text classification. In Computational Science–ICCS 2007 (pp. 781–789).Google Scholar
  20. Luk, A. (1993). Statistical sense disambiguation with relatively small corpora using dictionary definitions. In Proceedings of the 33rd annual meeting of ACL (pp. 181–188).Google Scholar
  21. Miller, G., & Fellbaum, C. (1998). Wordnet: an electronic lexical database. MIT Press, Cambridge.Google Scholar
  22. Mittermayer, M.A. (2004). Forecasting intraday stock price trends with text mining techniques. In Proceedings of the 37th annual Hawaii international conference on system sciences, 2004 (p. 10). IEEE.Google Scholar
  23. Mladenović, M., Mitrović, J., Krstev, C., Vitas, D. (2016). Hybrid sentiment analysis framework for a morphologically rich language. Journal of Intelligent Information Systems, 46(3), 599–620.CrossRefGoogle Scholar
  24. Mostafa, M.M. (2013). More than words: social networks’ text mining for consumer brand sentiments. Expert Systems with Applications, 40(10), 4241–4251.CrossRefGoogle Scholar
  25. Nassirtoussi, A.K., Aghabozorgi, S., Wah, T.Y., Ngo, D.C.L. (2014). The text mining homepage. sites.google.com/site/bigdatasetmining/Projects/textmining.
  26. Nassirtoussi, A.K., Aghabozorgi, S., Wah, T.Y., Ngo, D.C.L. (2015). Text mining of news-headlines for forex market prediction: a multi-layer dimension reduction algorithm with semantics and sentiment. Expert Systems with Applications, 42(1), 306–324.CrossRefGoogle Scholar
  27. Nizer, P., & Nievola, J.C. (2012). Predicting published news effect in the Brazilian stock market. Expert Systems with Applications, 39(12), 10674–10680.CrossRefGoogle Scholar
  28. Patwardhan, S., Banerjee, S., Pedersen, T. (2007). Umnd1: unsupervised word sense disambiguation using contextual semantic relatedness. In Proceedings of the 4th international workshop on semantic evaluations, association for computational linguistics (pp. 390–393).Google Scholar
  29. Peramunetilleke, D., & Wong, R.K. (2002). Currency exchange rate forecasting from news headlines. Australian Computer Science Communications, 24(2), 131–139.Google Scholar
  30. Rao, T., & Srivastava, S. (2012). Using twitter sentiments and search volumes index to predict oil, gold, forex and markets indices. Tech. rep.Google Scholar
  31. Resnik, P. (1995). Using information content to evaluate semantic similarity in a taxonomy. arXiv:cmp-lg/9511007.
  32. Schumaker, R.P., Zhang, Y., Huang, C.N., Chen, H. (2012). Evaluating sentiment in financial news articles. Decision Support Systems, 53(3), 458–464.CrossRefGoogle Scholar
  33. Siganos, A., Vagenas-Nanos, E., Verwijmeren, P. (2014). Facebook’s daily sentiment and international stock markets. Journal of Economic Behavior & Organization, 107, 730–743.CrossRefGoogle Scholar
  34. Sprenger, T.O., Tumasjan, A., Sandner, P.G., Welpe, I.M. (2014). Tweets and trades: the information content of stock microblogs. European Financial Management, 20(5), 926–957.CrossRefGoogle Scholar
  35. Sul, H.K., Dennis, A.R., Yuan, L.I. (2014). Trading on Twitter: the financial information content of emotion in social media. In 2014 47th Hawaii international conference on system sciences (HICSS) (pp. 806–815). IEEE.Google Scholar
  36. Toutanova, K., Klein, D., Manning, C.D., Singer, Y. (2003). Feature-rich part-of-speech tagging with a cyclic dependency network. In Proceedings of the 2003 conference of the North American chapter of the association for computational linguistics on human language technology (Vol. 1 pp. 173–180). Association for Computational Linguistics.Google Scholar
  37. Wilson, T., Wiebe, J., Hoffmann, P. (2005). Recognizing contextual polarity in phrase-level sentiment analysis. In Proceedings of the conference on human language technology and empirical methods in natural language processing (pp. 347–354). Association for Computational Linguistics.Google Scholar
  38. Yarowsky, D. (1995). Unsupervised word sense disambiguation rivaling supervised methods. In Proceedings of the 33rd annual meeting on association for computational linguistics (pp. 189–196). Association for Computational Linguistics.Google Scholar
  39. Zhang, X., Fuehres, H., Gloor, P.A. (2011). Predicting stock market indicators through twitter “hope it is not as bad as i fear”. Procedia-Social and Behavioral Sciences, 26, 55–62.CrossRefGoogle Scholar

Copyright information

© Springer Science+Business Media, LLC, part of Springer Nature 2018

Authors and Affiliations

  1. 1.Department of Computer Engineering and Information TechnologyAmirkabir University of Technology (Tehran Polytechnic)TehranIran

Personalised recommendations