Abstract
In this paper we present experiments with different workflows for annotating discourse relations in the Hindi Discourse Relation Bank (HDRB). Given the growing interest in developing discourse databanks based on the Penn Discourse TreeBank (PDTB) framework, and the complexities of discourse annotation, it is important to study and analyze the approaches and practices followed in the annotation process. The ultimate goal is to find an optimal balance between accurate description of discourse relations and maximal inter-rater reliability. We address the question of how the choice of annotation workflow affects the consistency, and hence the quality, of discourse annotation. We conduct multiple annotation experiments using different workflow strategies and evaluate their impact on inter-annotator agreement. Our results show that the choice of annotation workflow has a significant effect on the annotation load and on annotators' comprehension of discourse relations, as reflected in the inter-annotator agreement results.
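The abstract does not say which statistic is used to measure inter-annotator agreement. As an illustration only, the sketch below computes Fleiss' kappa, a common chance-corrected agreement measure for a fixed number of annotators, assumed here rather than taken from the paper. The function name `fleiss_kappa` and the sample labels are hypothetical; the labels borrow the four PDTB top-level sense classes, and the annotations themselves are invented.

```python
from collections import Counter


def fleiss_kappa(ratings):
    """Compute Fleiss' kappa for categorical annotations.

    ratings: list of items; each item is a list of labels, one per annotator.
    Every item must be labelled by the same number of annotators (at least 2).
    """
    n_items = len(ratings)
    n_raters = len(ratings[0])
    if n_raters < 2 or any(len(item) != n_raters for item in ratings):
        raise ValueError("each item needs the same number (>= 2) of labels")

    # Count how many annotators chose each label, per item.
    per_item = [Counter(item) for item in ratings]

    # Observed agreement: mean over items of the proportion of agreeing rater pairs.
    pair_norm = n_raters * (n_raters - 1)
    p_items = [
        (sum(c * c for c in counts.values()) - n_raters) / pair_norm
        for counts in per_item
    ]
    p_observed = sum(p_items) / n_items

    # Expected agreement by chance, from the overall label proportions.
    totals = Counter()
    for counts in per_item:
        totals.update(counts)
    n_labels = n_items * n_raters
    p_expected = sum((count / n_labels) ** 2 for count in totals.values())

    if p_expected == 1.0:
        # Only one label was ever used; kappa is undefined in this case.
        return float("nan")
    return (p_observed - p_expected) / (1.0 - p_expected)


if __name__ == "__main__":
    # Invented example: three annotators label four discourse relations.
    sample = [
        ["Contingency", "Contingency", "Contingency"],
        ["Comparison", "Comparison", "Expansion"],
        ["Temporal", "Temporal", "Temporal"],
        ["Expansion", "Contingency", "Expansion"],
    ]
    print(f"Fleiss' kappa: {fleiss_kappa(sample):.3f}")
```

Computing such a score separately for each workflow gives one way to compare workflows on the same set of annotated items.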
© 2013 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Sharma, H., Dakwale, P., Sharma, D.M., Prasad, R., Joshi, A. (2013). Assessment of Different Workflow Strategies for Annotating Discourse Relations: A Case Study with HDRB. In: Gelbukh, A. (eds) Computational Linguistics and Intelligent Text Processing. CICLing 2013. Lecture Notes in Computer Science, vol 7816. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-37247-6_42
DOI: https://doi.org/10.1007/978-3-642-37247-6_42
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-642-37246-9
Online ISBN: 978-3-642-37247-6
eBook Packages: Computer Science, Computer Science (R0)