Abstract
The impact of financial risk events on stock market is a fairly established area of research in the financial domain. However, the analysts require these events to be represented in a structured form in order to carry out statistical analysis. In this work, we aim is to identify and extract various financial risk events from news articles along with associated organizations to facilitate integrated analysis with structured business data. We propose a two-phase risk extraction algorithm involving a CNN based semi-supervised risk event identification and gradient boosting based entity association algorithm to extract risk events from news and associate them to their target organizations. We have analyzed large volumes of past available data using Granger causality to assess the impact of these events on various stock indices. Further, the utility of extracted risk events in predicting stock movement has been shown using a Bi-LSTM network based prediction model. The proposed system outperforms state of the art linear SVM on data for different stock indices.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
References
Dasgupta, T., Dey, L., Dey, P., Saha, R.: A framework for mining enterprise risk and risk factors from news documents. In: Proceedings of COLING 2016, the 26th International Conference on Computational Linguistics: System Demonstrations, pp. 180–184 (2016)
Finkel, J.R., Grenager, T., Manning, C.: Incorporating non-local information into information extraction systems by Gibbs sampling. In: Proceedings of the 43rd Annual Meeting on Association for Computational Linguistics, pp. 363–370. Association for Computational Linguistics (2005)
Friedman, J.H.: Greedy function approximation: a gradient boosting machine. Ann. Stat. 25, 1189–1232 (2001)
Kim, Y.: Convolutional neural networks for sentence classification. In: Proceedings of the 2014 Conference on Empirical Methods in Natural Language Processing (EMNLP), pp. 1746–1751. Association for Computational Linguistics, October 2014
Kogan, S., Levin, D., Routledge, B.R., Sagi, J.S., Smith, N.A.: Predicting risk from financial reports with regression. In: Proceedings of Human Language Technologies: The 2009 Annual Conference of the North American Chapter of the Association for Computational Linguistics, pp. 272–280. Association for Computational Linguistics (2009)
Lam, J.: Enterprise Risk Management: From Incentives to Controls. Wiley, Hoboken (2014)
Lee, G., Jeong, J., Seo, S., Kim, C., Kang, P.: Sentiment classification with word localization based on weakly supervised learning with a convolutional neural network. Knowl.-Based Syst. 152, 70–82 (2018)
Leidner, J.L., Schilder, F.: Hunting for the black swan: risk mining from text. In: Proceedings of the ACL 2010 System Demonstrations, pp. 54–59. Association for Computational Linguistics (2010)
Liaw, A., Wiener, M., et al.: Classification and regression by randomforest. R News 2(3), 18–22 (2002)
Lu, H.-M., Huang, N.W.H., Zhang, Z., Chen, T.-J.: Identifying firm-specific risk statements in news articles. In: Chen, H., Yang, C.C., Chau, M., Li, S.-H. (eds.) PAISI 2009. LNCS, vol. 5477, pp. 42–53. Springer, Heidelberg (2009). https://doi.org/10.1007/978-3-642-01393-5_6
Mendes, P., Jakob, M., GarcÃa-Silva, A., Bizer, C.: DBpedia spotlight: shedding light on the web of documents, pp. 1–8, September 2011
Nugent, T., Leidner, J.L.: Risk mining: company-risk identification from unstructured sources. In: 2016 IEEE 16th International Conference on Data Mining Workshops (ICDMW), pp. 1308–1311. IEEE (2016)
Olson, D.L., Wu, D.D.: Enterprise risk management, vol. 3. World Scientific Publishing Company (2015)
Pennington, J., Socher, R., Manning, C.D.: Glove: global vectors for word representation. EMNLP 14, 1532–1543 (2014)
Satoh, N., Samejima, M.: Risk words suggestion for information security audit by Bayesian inference. Electron. Commun. Jpn. 102(1), 42–48 (2019)
Taleb, N.N.: The Black Swan: The Impact of the Highly Improbable, vol. 2. Random House, New York City (2007)
Tsai, M.F., Wang, C.J.: On the risk prediction and analysis of soft information in finance reports. Eur. J. Oper. Res. 257(1), 243–250 (2017)
Verma, I., Dey, L., Meisheri, H.: Detecting, quantifying and accessing impact of news events on Indian stock indices. In: Proceedings of the International Conference on Web Intelligence, pp. 550–557. ACM (2017)
Wang, W.Y., Hua, Z.: A semiparametric Gaussian copula regression model for predicting financial risks from earnings calls. In: Proceedings of the 52nd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), vol. 1, pp. 1155–1165 (2014)
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2020 Springer Nature Switzerland AG
About this paper
Cite this paper
Bhadani, S., Verma, I., Dey, L. (2020). Mining Financial Risk Events from News and Assessing their Impact on Stocks. In: Bitetta, V., Bordino, I., Ferretti, A., Gullo, F., Pascolutti, S., Ponti, G. (eds) Mining Data for Financial Applications. MIDAS 2019. Lecture Notes in Computer Science(), vol 11985. Springer, Cham. https://doi.org/10.1007/978-3-030-37720-5_7
Download citation
DOI: https://doi.org/10.1007/978-3-030-37720-5_7
Published:
Publisher Name: Springer, Cham
Print ISBN: 978-3-030-37719-9
Online ISBN: 978-3-030-37720-5
eBook Packages: Computer ScienceComputer Science (R0)