Skip to main content

Sentiment Analysis Using Automatically Labelled Financial News Items

  • Chapter
  • First Online:
Book cover Affective Computing and Sentiment Analysis

Part of the book series: Text, Speech and Language Technology ((TLTB,volume 45))

Abstract

Given a corpus of financial news items labelled according to the market reaction following their publication, we investigate ‘cotemporeneous’ and forward-looking price stock movements. Our approach is to provide a pool of relevant textual features to a machine learning algorithm to detect substantial stock price variations. Our two working hypotheses are that the market reaction to a news item is a good indicator for labelling financial news items, and that a machine learning algorithm can be trained on those news items to build models detecting price movement effectively.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 109.00
Price excludes VAT (USA)
  • Available as EPUB and PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 139.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info
Hardcover Book
USD 139.99
Price excludes VAT (USA)
  • Durable hardcover edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Notes

  1. 1.

    http://www.cs.waikato.ac.nz/ml/weka/

  2. 2.

    This list is augmented by the words up, down, above and below to follow [7].

  3. 3.

    The corpus was a random selection of texts from Yahoo, Motley Fool and other financial sites.

  4. 4.

    Sanjiv Das, personal communication.

  5. 5.

    http://wordnet.princeton.edu/

  6. 6.

    Wordnet synset number 100005598: causal agency#n#1, cause#n#4 and causal agent#n#1

  7. 7.

    Using the PERL package [13].

  8. 8.

    http://vse.marketwatch.com/

References

  1. Banerjee, S. and T. Pedersen. 2003. Extended gloss overlaps as a measure of semantic relatedness. In Proceedings of the Eighteenth International Joint Conference on Artificial Intelligence, 805–810, Acapulco.

    Google Scholar 

  2. Das, Sanjiv, Asis Martinez-Jerez, and Peter Tufano. 2005. E-information: A clinical study of investor discussion and sentiment. Financial Management 34(5):103–137.

    Article  Google Scholar 

  3. Ann Devitt, and Khurshid Ahmad. 2007. Sentiment polarity identification in financial news: A cohesion-based approach. In Proceedings of ACL-07, the 45th Annual Meeting of the Association of Computational Linguistics, 984–991, Prague, CZ, June 2007. ACL.

    Google Scholar 

  4. Joachims, Thorsten. 2001. Learning to classify text using support vector machines. Boston: Kluwer Academic Publishers.

    Google Scholar 

  5. Kamps, J., M. Marx, R. Mokken, and M. de Rijke. Using Wordnet to measure semantic orientation of adjectives. In LREC 2004 IV:1115–1118.

    Google Scholar 

  6. Knowles, Francis. 2004. Lexicographical aspects of health metaphors in financial texts. In Proceedings Part II of Euralex 1996, pp. 789–796, Department of Swedish, Göteborg University, 1996.

    Google Scholar 

  7. Koppel, Moshe, and Itai Shtrimberg. Good news or bad news? Let the market decide. In AAI Spring Symposium on Exploring Attitude and Affect in Text, pp. 86–88. Stanford University, March 2004.

    Google Scholar 

  8. Mishne, Gilad. 2007. Applied text analytics for blogs. PhD thesis, University of Amsterdam.

    Google Scholar 

  9. Morris, Michael W., Oliver J. Sheldon, Daniel R. Ames, and Maia J. Young. 2007. Metaphors and the market: Consequences and preconditions of agent and object metaphors in stock market commentary. Journal of Organizational Behavior and Human Decision Processes 102(2):174–192, March 2007.

    Article  Google Scholar 

  10. Mullen, Tony, and Nigel Collier. 2004. Sentiment analysis using support vector machines with diverse information sources. In Empirical Methods in NLP.

    Google Scholar 

  11. Osgood, Charles E., George J. Suci, and Percy H. Tannenbaum. 1957. The Measurement of Meaning. University of Illinois.

    Google Scholar 

  12. Pang, Bo, Lillian Lee, and Shivakumar Vaithyanathan. 2002. Thumbs up? Sentiment classification using machine learning techniques. In Proceedings of the 2002 Conf. on Empirical Methods in Natural Language Processing.

    Google Scholar 

  13. Pedersen, Ted. 2004. Wordnet::similarity – measuring the relatedness of concepts. In Proceedings of the Nineteenth National Conference on Artificial Intelligence (AAAI-04).

    Google Scholar 

  14. Schmid, Helmut. 1994. Probabilistic part-of-speech tagging using decision trees. In Int. Conference on New Methods in Language Processing, Manchester.

    Google Scholar 

  15. Lawrie, D., P. Ogilvie, D. Jensen, V. Lavrenko, M. Schmill, and J. Allan. 2000. Mining of concurrent text and time series. In 6th ACM SIGKDD Int. Conf. on Knowledge Discovery and Data Mining, August 2000.

    Google Scholar 

  16. Shanahan, J.G., Y. Qu, J. Weibe, eds. 2006. Computing attitude and affect in text: Theory and applications. Dordrecht: Springer.

    Google Scholar 

  17. Wilson, T., and J. Wiebe. 2003. Annotating opinions in the world press. In Proceedings of SIGdial-03, 13–22.

    Google Scholar 

  18. Yang, Yiming, and Jan O. Pedersen. A comparative study on feature selection in text categorization. In Douglas H. Fisher, editor, Proc. of ICML-97, 14th Int. Conf. on Machine Learning, pp. 412–420, Nashville, US, 1997. Morgan Kaufmann Publishers, San Francisco.

    Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Michel Généreux .

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2011 Springer Science+Business Media B.V.

About this chapter

Cite this chapter

Généreux, M., Poibeau, T., Koppel, M. (2011). Sentiment Analysis Using Automatically Labelled Financial News Items. In: Ahmad, K. (eds) Affective Computing and Sentiment Analysis. Text, Speech and Language Technology, vol 45. Springer, Dordrecht. https://doi.org/10.1007/978-94-007-1757-2_9

Download citation

  • DOI: https://doi.org/10.1007/978-94-007-1757-2_9

  • Published:

  • Publisher Name: Springer, Dordrecht

  • Print ISBN: 978-94-007-1756-5

  • Online ISBN: 978-94-007-1757-2

  • eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics