Abstract
Textual data are an important information source for risk management for business organizations. To effectively identify, extract, and analyze risk-related statements in textual data, these processes need to be automated. We developed an annotation framework for firm-specific risk statements guided by previous economic, managerial, linguistic, and natural language processing research. A manual annotation study using news articles from the Wall Street Journal was conducted to verify the framework. We designed and constructed an automated risk identification system based on the annotation framework. The evaluation using manually annotated risk statements in news articles showed promising results for automated risk identification.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Preview
Unable to display preview. Download preview PDF.
References
Slywotzky, A.J., Drzik, J.: Countering the Biggest Risk of All. Harvard Business Review, 1–11 (2005)
Chapelle, A., Crama, Y., Hubner, G., Peters, J.-P.: Practical methods for measuring and managing operational risk in the financial sector: A clinical study. Journal of Banking & Finance 32, 1049–1061 (2008)
Merriam-Webster: Risk: Definition from the Merrian-Webster Online Dictionary (2008)
COSO: Enterprise Risk Management - Intergrated Framework. COSO(Committee of Sponsoring Organizations of the Treadway Commission) (2004)
Mas-Colell, A., Whinston, M., Green, J.R.: Microeconomic Theory, Oxford (1995)
Bruce, R.F., Wiebe, J.M.: Recognizing subjectivity: a case study of manual tagging. Natural Language Engineering 1, 1–16 (1999)
Wiebe, J., Wilson, T., Cardie, C.: Annotating Expressions of Opinions and Emotions in Language. Language Resources and Evaluation 39, 165–210 (2005)
Rubin, V.L., Liddy, E.D.: Certainty Identification in Texts: Categorization Model and Manual Tagging Results. In: Shanahan, J.G., Qu, Y., Wiebe, J. (eds.) Computing Attitude and Affect in Text: Theory and Applications. Springer, Heidelberg (2005)
Rubin, V.L.: Certainty Categorization Model. In: AAAI Spring Symposium: Exploring Attitude and Affect in Text: Theories and Applications, Stanford, CA (2004)
Rubin, V.L.: Starting with Certainty or Starting with Doubt: Intercoder Reliability Results for Manual Annotation of Epistemically Modalized Statements. In: Proceedings of NAACL HLT 2007, pp. 141–144 (2007)
Coates, J.: Epistemic Modality and Spoken Discourse. Transactions of the Philological Society 85, 110–131 (1987)
de Haan, F.: Evidentiality and Epistemic Modality: Setting Boundaries. Southwest Journal of Linguistic 18, 83–101 (1999)
Porter, M.F.: An Algorithm for Suffix Stripping. Program 14, 130–137 (1980)
Berger, A.L., Pietra, V.J.D., Pietra, S.A.D.: A maximum entropy approach to natural language processing. Comput. Linguist. 22, 39–71 (1996)
Joachims, T.: Making large-Scale SVM Learning Practical. In: Schölkopf, B., Burges, C., Smola, A. (eds.) Advances in Kernel Methods - Support Vector Learning. MIT-Press, Cambridge (1999)
Cortes, C., Mohri, M.: Confidence Intervals for the Area under the ROC Curve. In: Advances in Neural Information Processing Systems (NIPS 2004), vol. 17. MIT Press, Vancouver (2005)
Abbasi, A., Chen, H.: Writeprints: A stylometric approach to identity-level identification and similarity detection in cyberspace. ACM Trans. Inf. Syst. 26, 1–29 (2008)
Witten, I.H., Frank, E.: Data Mining: Practical Machine Learning Tools and Techniques. Elsevier, Amsterdam (2005)
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2009 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Lu, HM., Huang, N.W., Zhang, Z., Chen, TJ. (2009). Identifying Firm-Specific Risk Statements in News Articles. In: Chen, H., Yang, C.C., Chau, M., Li, SH. (eds) Intelligence and Security Informatics. PAISI 2009. Lecture Notes in Computer Science, vol 5477. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-01393-5_6
Download citation
DOI: https://doi.org/10.1007/978-3-642-01393-5_6
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-642-01392-8
Online ISBN: 978-3-642-01393-5
eBook Packages: Computer ScienceComputer Science (R0)