Factuality Classification Using Multi-facets Based on Elementary Discourse Units for News Articles

Swe Wynn, Khaing; Usanavasin, Sasiporn

doi:10.1007/978-3-319-70019-9_8

Khaing Swe Wynn¹⁸ &
Sasiporn Usanavasin¹⁹

Part of the book series: Advances in Intelligent Systems and Computing ((AISC,volume 685))

Included in the following conference series:

International Conference on Knowledge, Information, and Creativity Support Systems

475 Accesses

Abstract

Factuality classification is used for classifying information based on degrees of certainty. It has been actively used in different applications including information extraction, textual entailment, finding semantic uncertainty and certainty, or fact extraction. In this paper, we propose an approach to improve factuality classification by analyzing information in Elementary Discourse Units (EDUs) and their relations. We use news articles as our case study since it contains information that has various degrees of certainty or factuality values (i.e., information about certain events or uncertain information from factual and opinionated information). In this work, we use five sets of facets for factuality classification, which are (1) Epistemic Modality set, (2) Subjectivity Type set, (3) Rhetorical Structure Theory (RST) set, (4) Semantic Implicative and Factive Patterns set and (5) Weasel Words set. Unlike previous works on factuality classification, we use multiple facets of EDU to examine certainty and unambiguity level of information. We performed experiments based on news articles in FactBank corpus. We evaluated our method by comparing with several state-of-the-art factuality classification techniques and the results clearly show that our method can improve accuracy in terms of precision, recall and F1-measure as 94.1%, 93.9% and 93.9%, respectively.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Log in via an institution

Chapter: USD 29.95; Price excludes VAT (USA)

eBook: USD 129.00; Price excludes VAT (USA)

Softcover Book: USD 169.99; Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

References

Agarwal, S., Yu, H.: Detecting Hedge Cues and their scope in Biomedical Text with conditional random fields. J. of Bio. Info. 43, 953–961 (2010)
Article Google Scholar
Diab, M.T., Levin, L.: Committed belief annotation and tagging. In: Proceedings of the Third Linguistic Annotation Workshop, ACL-IJNLP 09 (2009)
Google Scholar
Ganter, V., Strube ,M.: Finding hedges by chasing weasels: hedge detection using wikipedia tags and shallow linguistic features. In: Proceedings of the ACL-IJCNLP 2009 Conference Short Papers, pp.173-176 (2009)
Google Scholar
Kilicoglu, H., Bergler, S.: Recognizing speculative language in biomedical research articles: a linguistically motivated perspective. In: Current Trends in Biomedical Natural Language Processing, pp. 46–53, Ohio, USA (2008)
Google Scholar
Karttunen, L.: Implicative verbs. Language 47, 340–358 (1971)
Article Google Scholar
Karttunen, L.: Simple and phrasal implicatives
Google Scholar
Karttunen, L., Zaenen, A.: Veridicity. In: Annotating, Extracting and Reasoning about Time and Events (2005)
Google Scholar
Karttunen, L.: Simple and phrasal implicatives. In: Proceedings of the Sixth International Workshop on Semantic Evaluation, pp. 124–131 (2012)
Google Scholar
Li, X., Gao, W., Shavlik, J.W.: Detecting semantic uncertainty by learning hedge cues in sentences using an HMM. In: Proceedings of Workshop on Semantic Matching in Information Retrieval, pp. 30–37, Queensland, Australia (2014)
Google Scholar
Mann, W.C., Thompson, S.A.: Rhetorical structure theory: toward a functional theory of text organization. Text 8, 243–281 (1988)
Article Google Scholar
Marneffe, M.C.D., Manning, C.D., Potts, C.: Veridicality and utterance understanding. In: Fifth IEEE International Conference on Semantic Computing, pp. 430–436 (2011)
Google Scholar
Marneffe, M.C.D., Manning, C.D., Potts, C.: Did it happen? The pragmatic complexity of veridicality assessment. Com. Ling. 38, 301–333 (2012)
Article Google Scholar
Medlock, B., Briscoe, T.: Weakly supervised learning for hedge classification in scientific literature. In: Proceedings of the 45th Annual Meeting of the Association of Computational Linguistics, pp. 992—99, Prague, Czech Republic (2007)
Google Scholar
Medlock, B.: Exploring hedge identification in biomedical literature. J. of Bio. Info. 41, 636–654 (2008)
Article Google Scholar
Moncecchi, G., Minel, J.L., Wonserver, D.: Improving speculative language detection using linguistic knowledge. In: Proceedings of the ACL-2012 Workshop on Extra-Propositional Aspects of Meaning in Computational Linguistics( EXProM-2012), pp. 37–46, Jeju, Republic of Korea (2012)
Google Scholar
Morante, R., Sporleder, C.: Modality and negation: an introduction to the special issue. Com. Ling. 38, 223–260 (2012)
Article MathSciNet Google Scholar
Prabhakaran, V., Rambow, O., Diab, M.: Automatic committed belief tagging. In: Coling 2010: Poster Volume, pp. 1014–1022, Beijing, China (2010)
Google Scholar
Pustejovsky, J., Hanks P., See, A., Sauri, R.: The timebank corpus. In: Proceedings of Corpus Linguistics, pp. 647–656 (2003)
Google Scholar
Rubin, V.L.: Epistemic modality: from uncertainty to certainty in the context of information seeking as interactions with texts. Inf. Process. Manage. 46, 533–540 (2010)
Article Google Scholar
Sauri, R.: Determining modality and factuality for text entailment. In: Proceedings of the First IEEE International Conference on Semantic Computing, pp. 509–516. Irvine, California (2007)
Google Scholar
Sauri, R., Pustejovsky, J.: From structure to interpretation: a double-layered annotation for event factuality, In:Prooceedings of the 2nd Linguistic Annotation Workshop. The Sixth International Conference on Language Resources and Evaluation, pp. 1–8 (2008)
Google Scholar
Sauri, R.: A factuality profiler for eventualities in text. Ph. D. thesis, Brandeis University (2008)
Google Scholar
Sauri, R., Pustejovsky, J.: FactBank : a corpus annotated with event factuality. Lang. Resour. Eval. 43, 227–268 (2009)
Article Google Scholar
Sauri, R., Pustejovsky, J.: Are you sure that this happened? Assessing the factuality degree of events in text. Comp. Ling. 38, 261–299 (2012)
Article Google Scholar
Soricut, R., Marcu, D.: Sentence level parsing using syntactic and lexical information. In: Proceedings of the 2003 Conference of the North American Chapter of the Association for Computational Linguistics on Human Language Technology, vol. 1, pp. 149–156 (2003)
Google Scholar
Szarvas, G., Vincze, V., Farkas, R., Mora, G., Gurevych, I.: Cross-genre and cross domain detection of semantic uncertainty. Com. Ling. 38, 335–368 (2012)
Article Google Scholar
Velupillai, S., Skeppstedt, M., Kvist, M., Mwery, D.: Cue-based assertion classification for Swedish clinical text-developing a lexicon for pyConTextSwe. Artif. Intell. Med. 61, 137–144 (2014)
Article Google Scholar
Vlachos, A., Riedel, S.: Fact checking: task definition and dataset construction. In : ACL 2014 Workshop on Language Technologies and Computational Social Science, pp. 18–22 (2014)
Google Scholar
Weibe, J., Wilson, T., Cardie, C.: Annotating expressions of opinions and emotions in Language. Lang. Resour. Eval. pp. 165–210 (2005)
Google Scholar
Wiebe, J., Riloff, E.: Creating subjective and objective sentence classifiers from unannotated texts. In: CICLing05 Proceedings of the 6th international conference on Computational Linguistics and Intelligent Text Processing, pp. 486–497 (2005)
Google Scholar

Download references

Acknowledgement

This research was supported by the Center of Excellence in Intelligent Informatics, Speech and Language Technology and Service Innovation (CILS), Intelligent Informatics and Service Innovation (IISI) and NRU grant at SIIT, Thammasat University.

Author information

Authors and Affiliations

School of Information, Computer and Communication Technology (ICT), Thammasat University, Khlong Nung, Thailand
Khaing Swe Wynn
Sirindhorn International Institute of Technology (SIIT), Thammasat University, Khlong Nung, Thailand
Sasiporn Usanavasin

Authors

Khaing Swe Wynn
View author publications
You can also search for this author in PubMed Google Scholar
Sasiporn Usanavasin
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Khaing Swe Wynn .

Editor information

Editors and Affiliations

School of Information, Computer, and Communication Technology (ICT), Sirindhorn International Institute of Technology, Thammasat University, Pathum Thani, Thailand
Thanaruk Theeramunkong
Department of Automatic Control and Biomedical Engineering, AGH University of Science and Technology, Kraków, Poland
Andrzej M.J. Skulimowski
Graduate School of Advanced Science and Technology, Japan Advanced Institute of Science and Technology, Nomi-shi, Ishikawa, Japan
Takaya Yuizono
School of Knowledge Science, Japan Advanced Institute of Science and Technology, Nomi-shi, Ishikawa, Japan
Susumu Kunifuji

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Swe Wynn, K., Usanavasin, S. (2018). Factuality Classification Using Multi-facets Based on Elementary Discourse Units for News Articles. In: Theeramunkong, T., Skulimowski, A., Yuizono, T., Kunifuji, S. (eds) Recent Advances and Future Prospects in Knowledge, Information and Creativity Support Systems. KICSS 2015. Advances in Intelligent Systems and Computing, vol 685. Springer, Cham. https://doi.org/10.1007/978-3-319-70019-9_8

Download citation

DOI: https://doi.org/10.1007/978-3-319-70019-9_8
Published: 02 December 2017
Publisher Name: Springer, Cham
Print ISBN: 978-3-319-70018-2
Online ISBN: 978-3-319-70019-9
eBook Packages: EngineeringEngineering (R0)

Publish with us

Policies and ethics