Abstract
Given a corpus of financial news items labelled according to the market reaction following their publication, we investigate ‘cotemporeneous’ and forward-looking price stock movements. Our approach is to provide a pool of relevant textual features to a machine learning algorithm to detect substantial stock price variations. Our two working hypotheses are that the market reaction to a news item is a good indicator for labelling financial news items, and that a machine learning algorithm can be trained on those news items to build models detecting price movement effectively.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Notes
- 1.
- 2.
This list is augmented by the words up, down, above and below to follow [7].
- 3.
The corpus was a random selection of texts from Yahoo, Motley Fool and other financial sites.
- 4.
Sanjiv Das, personal communication.
- 5.
- 6.
Wordnet synset number 100005598: causal agency#n#1, cause#n#4 and causal agent#n#1
- 7.
Using the PERL package [13].
- 8.
References
Banerjee, S. and T. Pedersen. 2003. Extended gloss overlaps as a measure of semantic relatedness. In Proceedings of the Eighteenth International Joint Conference on Artificial Intelligence, 805–810, Acapulco.
Das, Sanjiv, Asis Martinez-Jerez, and Peter Tufano. 2005. E-information: A clinical study of investor discussion and sentiment. Financial Management 34(5):103–137.
Ann Devitt, and Khurshid Ahmad. 2007. Sentiment polarity identification in financial news: A cohesion-based approach. In Proceedings of ACL-07, the 45th Annual Meeting of the Association of Computational Linguistics, 984–991, Prague, CZ, June 2007. ACL.
Joachims, Thorsten. 2001. Learning to classify text using support vector machines. Boston: Kluwer Academic Publishers.
Kamps, J., M. Marx, R. Mokken, and M. de Rijke. Using Wordnet to measure semantic orientation of adjectives. In LREC 2004 IV:1115–1118.
Knowles, Francis. 2004. Lexicographical aspects of health metaphors in financial texts. In Proceedings Part II of Euralex 1996, pp. 789–796, Department of Swedish, Göteborg University, 1996.
Koppel, Moshe, and Itai Shtrimberg. Good news or bad news? Let the market decide. In AAI Spring Symposium on Exploring Attitude and Affect in Text, pp. 86–88. Stanford University, March 2004.
Mishne, Gilad. 2007. Applied text analytics for blogs. PhD thesis, University of Amsterdam.
Morris, Michael W., Oliver J. Sheldon, Daniel R. Ames, and Maia J. Young. 2007. Metaphors and the market: Consequences and preconditions of agent and object metaphors in stock market commentary. Journal of Organizational Behavior and Human Decision Processes 102(2):174–192, March 2007.
Mullen, Tony, and Nigel Collier. 2004. Sentiment analysis using support vector machines with diverse information sources. In Empirical Methods in NLP.
Osgood, Charles E., George J. Suci, and Percy H. Tannenbaum. 1957. The Measurement of Meaning. University of Illinois.
Pang, Bo, Lillian Lee, and Shivakumar Vaithyanathan. 2002. Thumbs up? Sentiment classification using machine learning techniques. In Proceedings of the 2002 Conf. on Empirical Methods in Natural Language Processing.
Pedersen, Ted. 2004. Wordnet::similarity – measuring the relatedness of concepts. In Proceedings of the Nineteenth National Conference on Artificial Intelligence (AAAI-04).
Schmid, Helmut. 1994. Probabilistic part-of-speech tagging using decision trees. In Int. Conference on New Methods in Language Processing, Manchester.
Lawrie, D., P. Ogilvie, D. Jensen, V. Lavrenko, M. Schmill, and J. Allan. 2000. Mining of concurrent text and time series. In 6th ACM SIGKDD Int. Conf. on Knowledge Discovery and Data Mining, August 2000.
Shanahan, J.G., Y. Qu, J. Weibe, eds. 2006. Computing attitude and affect in text: Theory and applications. Dordrecht: Springer.
Wilson, T., and J. Wiebe. 2003. Annotating opinions in the world press. In Proceedings of SIGdial-03, 13–22.
Yang, Yiming, and Jan O. Pedersen. A comparative study on feature selection in text categorization. In Douglas H. Fisher, editor, Proc. of ICML-97, 14th Int. Conf. on Machine Learning, pp. 412–420, Nashville, US, 1997. Morgan Kaufmann Publishers, San Francisco.
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2011 Springer Science+Business Media B.V.
About this chapter
Cite this chapter
Généreux, M., Poibeau, T., Koppel, M. (2011). Sentiment Analysis Using Automatically Labelled Financial News Items. In: Ahmad, K. (eds) Affective Computing and Sentiment Analysis. Text, Speech and Language Technology, vol 45. Springer, Dordrecht. https://doi.org/10.1007/978-94-007-1757-2_9
Download citation
DOI: https://doi.org/10.1007/978-94-007-1757-2_9
Published:
Publisher Name: Springer, Dordrecht
Print ISBN: 978-94-007-1756-5
Online ISBN: 978-94-007-1757-2
eBook Packages: Computer ScienceComputer Science (R0)