Analyzing Text Coherence via Multiple Annotation in the Prague Dependency Treebank

  • Conference paper
Text, Speech, and Dialogue (TSD 2015)

Part of the book series: Lecture Notes in Computer Science (LNAI, volume 9302)

Abstract

Corpus-based research demonstrates a mutual interaction between bridging anaphoric relations in a text and sentence information structure. The research is carried out on the large corpus data of the Prague Dependency Treebank 3.0, which contains almost 50 thousand sentences manually annotated for both sentence information structure and bridging anaphora. We investigate how bridging anaphora relations interconnect contextually bound and non-bound sentence items and how such connections contribute to text coherence.
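
The kind of question posed in the abstract can be illustrated with a minimal sketch that tallies how often a bridging relation links a contextually bound or non-bound item to a bound or non-bound antecedent. This is not the authors' procedure and does not read the treebank's actual file format: the `Item` and `BridgingLink` classes, the boolean `contextually_bound` flag, and the toy data are all assumptions made for illustration.

```python
from collections import Counter
from dataclasses import dataclass


@dataclass(frozen=True)
class Item:
    """Hypothetical sentence item with a simplified boundness flag."""
    item_id: str
    contextually_bound: bool


@dataclass(frozen=True)
class BridgingLink:
    """Hypothetical bridging relation from an anaphor to its antecedent."""
    anaphor_id: str
    antecedent_id: str


def boundness_label(item):
    """Map the boolean flag to a readable label."""
    return "bound" if item.contextually_bound else "non-bound"


def count_boundness_pairs(items, links):
    """Count (anaphor boundness, antecedent boundness) pairs over all links."""
    by_id = {item.item_id: item for item in items}
    counts = Counter()
    for link in links:
        pair = (
            boundness_label(by_id[link.anaphor_id]),
            boundness_label(by_id[link.antecedent_id]),
        )
        counts[pair] += 1
    return counts


# Toy data (invented, not taken from the corpus).
items = [
    Item("a", True),
    Item("b", False),
    Item("c", True),
    Item("d", False),
]
links = [
    BridgingLink("a", "b"),
    BridgingLink("c", "a"),
    BridgingLink("d", "b"),
    BridgingLink("c", "b"),
]

counts = count_boundness_pairs(items, links)
for pair, n in counts.most_common():
    print(pair, n)
```

On real data, the resulting table of pair frequencies would be the starting point for asking which combinations of bound and non-bound items bridging relations connect most often.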

Author information

Corresponding author

Correspondence to Kateřina Rysová.

Copyright information

© 2015 Springer International Publishing Switzerland

About this paper

Cite this paper

Rysová, K., Rysová, M. (2015). Analyzing Text Coherence via Multiple Annotation in the Prague Dependency Treebank. In: Král, P., Matoušek, V. (eds) Text, Speech, and Dialogue. TSD 2015. Lecture Notes in Computer Science, vol 9302. Springer, Cham. https://doi.org/10.1007/978-3-319-24033-6_8

  • DOI: https://doi.org/10.1007/978-3-319-24033-6_8

  • Publisher Name: Springer, Cham

  • Print ISBN: 978-3-319-24032-9

  • Online ISBN: 978-3-319-24033-6

  • eBook Packages: Computer Science, Computer Science (R0)
