Abstract
Factuality classification is used for classifying information based on degrees of certainty. It has been actively used in different applications including information extraction, textual entailment, finding semantic uncertainty and certainty, or fact extraction. In this paper, we propose an approach to improve factuality classification by analyzing information in Elementary Discourse Units (EDUs) and their relations. We use news articles as our case study since it contains information that has various degrees of certainty or factuality values (i.e., information about certain events or uncertain information from factual and opinionated information). In this work, we use five sets of facets for factuality classification, which are (1) Epistemic Modality set, (2) Subjectivity Type set, (3) Rhetorical Structure Theory (RST) set, (4) Semantic Implicative and Factive Patterns set and (5) Weasel Words set. Unlike previous works on factuality classification, we use multiple facets of EDU to examine certainty and unambiguity level of information. We performed experiments based on news articles in FactBank corpus. We evaluated our method by comparing with several state-of-the-art factuality classification techniques and the results clearly show that our method can improve accuracy in terms of precision, recall and F1-measure as 94.1%, 93.9% and 93.9%, respectively.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
References
Agarwal, S., Yu, H.: Detecting Hedge Cues and their scope in Biomedical Text with conditional random fields. J. of Bio. Info. 43, 953–961 (2010)
Diab, M.T., Levin, L.: Committed belief annotation and tagging. In: Proceedings of the Third Linguistic Annotation Workshop, ACL-IJNLP 09 (2009)
Ganter, V., Strube ,M.: Finding hedges by chasing weasels: hedge detection using wikipedia tags and shallow linguistic features. In: Proceedings of the ACL-IJCNLP 2009 Conference Short Papers, pp.173-176 (2009)
Kilicoglu, H., Bergler, S.: Recognizing speculative language in biomedical research articles: a linguistically motivated perspective. In: Current Trends in Biomedical Natural Language Processing, pp. 46–53, Ohio, USA (2008)
Karttunen, L.: Implicative verbs. Language 47, 340–358 (1971)
Karttunen, L.: Simple and phrasal implicatives
Karttunen, L., Zaenen, A.: Veridicity. In: Annotating, Extracting and Reasoning about Time and Events (2005)
Karttunen, L.: Simple and phrasal implicatives. In: Proceedings of the Sixth International Workshop on Semantic Evaluation, pp. 124–131 (2012)
Li, X., Gao, W., Shavlik, J.W.: Detecting semantic uncertainty by learning hedge cues in sentences using an HMM. In: Proceedings of Workshop on Semantic Matching in Information Retrieval, pp. 30–37, Queensland, Australia (2014)
Mann, W.C., Thompson, S.A.: Rhetorical structure theory: toward a functional theory of text organization. Text 8, 243–281 (1988)
Marneffe, M.C.D., Manning, C.D., Potts, C.: Veridicality and utterance understanding. In: Fifth IEEE International Conference on Semantic Computing, pp. 430–436 (2011)
Marneffe, M.C.D., Manning, C.D., Potts, C.: Did it happen? The pragmatic complexity of veridicality assessment. Com. Ling. 38, 301–333 (2012)
Medlock, B., Briscoe, T.: Weakly supervised learning for hedge classification in scientific literature. In: Proceedings of the 45th Annual Meeting of the Association of Computational Linguistics, pp. 992—99, Prague, Czech Republic (2007)
Medlock, B.: Exploring hedge identification in biomedical literature. J. of Bio. Info. 41, 636–654 (2008)
Moncecchi, G., Minel, J.L., Wonserver, D.: Improving speculative language detection using linguistic knowledge. In: Proceedings of the ACL-2012 Workshop on Extra-Propositional Aspects of Meaning in Computational Linguistics( EXProM-2012), pp. 37–46, Jeju, Republic of Korea (2012)
Morante, R., Sporleder, C.: Modality and negation: an introduction to the special issue. Com. Ling. 38, 223–260 (2012)
Prabhakaran, V., Rambow, O., Diab, M.: Automatic committed belief tagging. In: Coling 2010: Poster Volume, pp. 1014–1022, Beijing, China (2010)
Pustejovsky, J., Hanks P., See, A., Sauri, R.: The timebank corpus. In: Proceedings of Corpus Linguistics, pp. 647–656 (2003)
Rubin, V.L.: Epistemic modality: from uncertainty to certainty in the context of information seeking as interactions with texts. Inf. Process. Manage. 46, 533–540 (2010)
Sauri, R.: Determining modality and factuality for text entailment. In: Proceedings of the First IEEE International Conference on Semantic Computing, pp. 509–516. Irvine, California (2007)
Sauri, R., Pustejovsky, J.: From structure to interpretation: a double-layered annotation for event factuality, In:Prooceedings of the 2nd Linguistic Annotation Workshop. The Sixth International Conference on Language Resources and Evaluation, pp. 1–8 (2008)
Sauri, R.: A factuality profiler for eventualities in text. Ph. D. thesis, Brandeis University (2008)
Sauri, R., Pustejovsky, J.: FactBank : a corpus annotated with event factuality. Lang. Resour. Eval. 43, 227–268 (2009)
Sauri, R., Pustejovsky, J.: Are you sure that this happened? Assessing the factuality degree of events in text. Comp. Ling. 38, 261–299 (2012)
Soricut, R., Marcu, D.: Sentence level parsing using syntactic and lexical information. In: Proceedings of the 2003 Conference of the North American Chapter of the Association for Computational Linguistics on Human Language Technology, vol. 1, pp. 149–156 (2003)
Szarvas, G., Vincze, V., Farkas, R., Mora, G., Gurevych, I.: Cross-genre and cross domain detection of semantic uncertainty. Com. Ling. 38, 335–368 (2012)
Velupillai, S., Skeppstedt, M., Kvist, M., Mwery, D.: Cue-based assertion classification for Swedish clinical text-developing a lexicon for pyConTextSwe. Artif. Intell. Med. 61, 137–144 (2014)
Vlachos, A., Riedel, S.: Fact checking: task definition and dataset construction. In : ACL 2014 Workshop on Language Technologies and Computational Social Science, pp. 18–22 (2014)
Weibe, J., Wilson, T., Cardie, C.: Annotating expressions of opinions and emotions in Language. Lang. Resour. Eval. pp. 165–210 (2005)
Wiebe, J., Riloff, E.: Creating subjective and objective sentence classifiers from unannotated texts. In: CICLing05 Proceedings of the 6th international conference on Computational Linguistics and Intelligent Text Processing, pp. 486–497 (2005)
Acknowledgement
This research was supported by the Center of Excellence in Intelligent Informatics, Speech and Language Technology and Service Innovation (CILS), Intelligent Informatics and Service Innovation (IISI) and NRU grant at SIIT, Thammasat University.
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2018 Springer International Publishing AG
About this paper
Cite this paper
Swe Wynn, K., Usanavasin, S. (2018). Factuality Classification Using Multi-facets Based on Elementary Discourse Units for News Articles. In: Theeramunkong, T., Skulimowski, A., Yuizono, T., Kunifuji, S. (eds) Recent Advances and Future Prospects in Knowledge, Information and Creativity Support Systems. KICSS 2015. Advances in Intelligent Systems and Computing, vol 685. Springer, Cham. https://doi.org/10.1007/978-3-319-70019-9_8
Download citation
DOI: https://doi.org/10.1007/978-3-319-70019-9_8
Published:
Publisher Name: Springer, Cham
Print ISBN: 978-3-319-70018-2
Online ISBN: 978-3-319-70019-9
eBook Packages: EngineeringEngineering (R0)