Skip to main content

Assessment of Different Workflow Strategies for Annotating Discourse Relations: A Case Study with HDRB

  • Conference paper
Book cover Computational Linguistics and Intelligent Text Processing (CICLing 2013)

Part of the book series: Lecture Notes in Computer Science ((LNTCS,volume 7816))

Abstract

In this paper we present our experiments with different annotation workflows for annotating discourse relations in the Hindi Discourse Relation Bank(HDRB). In view of the growing interest in the development of discourse data-banks based on the PDTB framework and the complexities associated with the discourse annotation, it is important to study and analyze approaches and practices followed in the annotation process. The ultimate goal is to find an optimal balance between accurate description of discourse relations and maximal inter-rater reliability. We address the question of the choice of annotation work-flow for discourse and how it affects the consistency and hence the quality of annotation. We conduct multiple annotation experiments using different work-flow strategies, and evaluate their impact on inter-annotator agreement. Our results show that the choice of annotation work-flow has a significant effect on the annotation load and the comprehension of discourse relations for annotators, as is reflected in the inter-annotator agreement results.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 39.99
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 54.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.

References

  1. Prasad, R., Dinesh, N., Lee, A., Miltsakaki, E., Robaldo, L., Joshi, A.K., Webber, B.L.: The Penn Discourse TreeBank 2.0. In: LREC (2008)

    Google Scholar 

  2. Webber, B., Stone, M., Joshi, A., Knott, A.: Anaphora and Discourse Structure. Computational Linguistics 29, 545–587 (2003)

    Article  MATH  Google Scholar 

  3. Yuping, Z., Nianwen, X.: PDTB-style discourse annotation of Chinese text. In: Proceedings of the 50th Annual Meeting of the Association for Computational Linguistics, ACL 2012, pp. 69–77 (2012)

    Google Scholar 

  4. Zeyrek, D., Demirşahin, I., Sevdik-Çalli, A., Balaban, H.Ö., Yalçinkaya, İ., Turan, Ü.D.: The Annotation Scheme of the Turkish Discourse Bank and an evaluation of inconsistent annotations. In: Proceedings of the Fourth Linguistic Annotation Workshop, LAW IV 2010, pp. 282–289 (2010)

    Google Scholar 

  5. Al-Saif, A., Markert, K.: The Leeds Arabic Discourse Treebank: Annotating Discourse Connectives for Arabic. In: LREC (2010)

    Google Scholar 

  6. Oza, U., Prasad, R., Kolachina, S., Sharma, D.M., Joshi, A.: The Hindi Discourse Relation Bank. In: Proceedings of the Third Linguistic Annotation Workshop, ACL-IJCNLP 2009, pp. 158–161 (2009)

    Google Scholar 

  7. Mladová, L., Zikánová, Š., Hajičová, E.: From Sentence to Discourse: Building an Annotation Scheme for Discourse based on Prague Dependency Treebank. In: Proceedings of Language Resources and Evaluation, LREC (2008)

    Google Scholar 

  8. Kolachina, S., Prasad, R., Sharma, D.M., Joshi, A.: Evaluation of Discourse Relation Annotation in the Hindi Discourse Relation Bank. In: Proceedings of the Eight International Conference on Language Resources and Evaluation, LREC 2012 (2012)

    Google Scholar 

  9. Begum, R., Husain, S., Dhwaj, A., Sharma, D.M., Bai, L., Sangal, R.: Dependency Annotation Scheme for Indian Languages. In: Proceedings of the Third International Joint Conference on Natural Language Processing, IJCNLP (2008)

    Google Scholar 

  10. Prasad, R., Joshi, A., Webber, B.: Realization of Discourse Relations by Other Means: Alternative Lexicalizations. In: Coling 2010: Posters, Coling 2010 Organizing Committee, pp. 1023–1031 (2010)

    Google Scholar 

  11. Miltsakaki, E., Prasad, R., Joshi, A., Webber, B.: Annotating Discourse Connectives And Their Arguments. In: Proceedings of the HLT/NAACL Workshop on Frontiers in Corpus Annotation (2004)

    Google Scholar 

  12. Fleiss, J.: Measuring nominal scale agreement among many raters. Psychological Bulletin (1971)

    Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2013 Springer-Verlag Berlin Heidelberg

About this paper

Cite this paper

Sharma, H., Dakwale, P., Sharma, D.M., Prasad, R., Joshi, A. (2013). Assessment of Different Workflow Strategies for Annotating Discourse Relations: A Case Study with HDRB. In: Gelbukh, A. (eds) Computational Linguistics and Intelligent Text Processing. CICLing 2013. Lecture Notes in Computer Science, vol 7816. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-37247-6_42

Download citation

  • DOI: https://doi.org/10.1007/978-3-642-37247-6_42

  • Publisher Name: Springer, Berlin, Heidelberg

  • Print ISBN: 978-3-642-37246-9

  • Online ISBN: 978-3-642-37247-6

  • eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics