Abstract
This chapter presents a theoretical framework and preliminary results for manual categorization of explicit certainty information in 32 English newspaper articles. Our contribution is in a proposed categorization model and analytical framework for certainty identification. Certainty is presented as a type of subjective information available in texts. Statements with explicit certainty markers were identified and categorized according to four hypothesized dimensions — level, perspective, focus, and time of certainty. The preliminary results reveal an overall promising picture of the presence of certainty information in texts, and establish its susceptibility to manual identification within the proposed four-dimensional certainty categorization analytical framework. Our findings are that the editorial sample group had a significantly higher frequency of markers per sentence than did the sample group of the news stories. For editorials, high level of certainty, writer’s point of view, and future and present time were the most populated categories. For news stories, the most common categories were high and moderate levels, directly involved third party’s point of view, and past time. These patterns have positive practical implications for automation.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Preview
Unable to display preview. Download preview PDF.
7. Bibliography
Anick, P. and Bergler, S. (1992) Lexical structures for linguistic inference. In Pustejovsky, J. and Bergler, S. (Eds.) Lexical Semantics and Knowledge Representation. Berlin, Springer Verlag: 121–135.
Banfield, A. (1982) Unspeakable Sentences. Routledge and Kegan Paul, Boston.
Bergler, S., Doandes, M., Gerard, C., and Witte, R. (2004) Attributions. In Qu, Y., Shanahan, J. G., Wiebe, J. (Eds.) Proceedings of AAAI Spring Symposium: Exploring Attitude and Affect in Text: Theories and Applications, Stanford, CA. AAAI Press.
Cappon, R. J. (2000) The Associated Press Guide to News Writing. Foster City, CA, IDG Books Worldwide Inc.
Chafe, W. (1986) Evidentiality in English Conversation and Academic Writing. In Chafe, W. and Nichols, J. (Eds.) Evidentiality: The Linguistic Coding of Epistemology. Norwood, New Jersey, Ablex Publishing Corporation. 20: 261–273.
Coates, J. (1983) The Semantics of the Modal Auxiliaries. London & Canberra, Croom Helm.
Holmes, J. (1990) Hedges and boosters in women’s and men’s speech. Language and communication 10(3): 185–205.
Hyland, K. (1998) Hedging in Scientific Research Articles. Amsterdam, Philadelphia, John Benjamin Publishing Company.
Kando, N. (1996) Text structure analysis based on human recognition: Cases of Japanese newspaper and English newspaper. Bulletin of National Center for Science Information Systems, No. 8, pp.107–126 (Japanese)
Lackoff, G. (1972) Hedges: a study of meaning criteria and the logic of fuzzy concepts. Chicago Linguistic Society Papers.
Liddy, E.D., McVearry, K., Paik, W., Yu, E.S., and McKenna, M. (1993) Development, implementation & Testing of a Discourse Model for Newspaper Texts. Proceedings of the ARPA Workshop on Human Language Technology, Princeton, NJ, March 21–24, 1993.
Liddy, E.D., Paik, W., and McKenna, M. (1995) Development and Implementation of a discourse model for newspaper texts. Proceedings of the AAAI Symposium on Empirical Methods in Discourse Interpretation and Generation. Stanford, CA.
Merriam-Webster Online Dictionary, http://www.m-w.com/. Accessed on January 30, 2004.
Mushin, I. (2001) Evidentiality and Epistemological Stance: Narrative Retelling. Amsterdam, John Benjamins Publishing Co.
Rubin, V. L., Stanton, J. M., and Liddy E. D. (2004) Discerning Emotions in Texts. AAAI Spring Symposium: Exploring Attitude and Affect in Text: Theories and Applications, Stanford, CA.
Searle, J. R. (1979) Expression and Meaning: Studies in the Theory of Speech Acts. Cambridge, London, New York, Melbourne, Cambridge University Press.
van Dijk, T. A. (1981) Studies in the Pragmatics of Discourse, Mouton Publishers, The Hague, The Netherlands
Wiebe, J. M. (1994) Tracking Point of View in Narrative. Computational Linguistics 20(2): 233–287.
Wiebe, J. M. (2000) Learning Subjective Adjectives from Corpora. Proceedings of the 17th National Conference on Artificial Intelligence (AAAI-2000). Austin, Texas, July 2000.
Wiebe, J., Bruce, R., Bell, M., Martin, M., and Wilson, T. (2001) A Corpus Study of Evaluative and Speculative Language. Proceedings of the 2nd ACL SIGdial Workshop on Discourse and Dialogue. Aalborg, Denmark, September, 2001.
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2006 Springer
About this chapter
Cite this chapter
Rubin, V.L., Liddy, E.D., Kando, N. (2006). Certainty Identification in Texts: Categorization Model and Manual Tagging Results. In: Shanahan, J.G., Qu, Y., Wiebe, J. (eds) Computing Attitude and Affect in Text: Theory and Applications. The Information Retrieval Series, vol 20. Springer, Dordrecht. https://doi.org/10.1007/1-4020-4102-0_7
Download citation
DOI: https://doi.org/10.1007/1-4020-4102-0_7
Publisher Name: Springer, Dordrecht
Print ISBN: 978-1-4020-4026-9
Online ISBN: 978-1-4020-4102-0
eBook Packages: Computer ScienceComputer Science (R0)