Identifying the Content Zones of German Court Decisions

  • Manfred Stede
  • Florian Kuhn
Part of the Lecture Notes in Business Information Processing book series (LNBIP, volume 37)


A central step in the automatic processing of court decisions is the identification of the various content zones, i.e., breaking up the document into functionally independent areas. We assembled a corpus of German court decisions and argue that this genre belongs to the class of semi-structured text documents. Currently, we are implementing zone identification by means of a set of recognition rules, following up on our earlier experiences with a different genre (film reviews).
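To make the idea of rule-based zone identification concrete, here is a minimal sketch. It is not the authors' rule set: the function name `identify_zones`, the zone labels, and the heading patterns are all assumptions chosen for illustration. They rely only on section headings that commonly appear in German court decisions (Tenor, Tatbestand, Entscheidungsgründe or Gründe).

```python
import re

# Hypothetical heading rules: illustrative assumptions only,
# not the recognition rules used by the authors.
ZONE_RULES = [
    ("tenor", re.compile(r"^\s*Tenor\s*:?\s*$", re.IGNORECASE)),
    ("facts", re.compile(r"^\s*Tatbestand\s*:?\s*$", re.IGNORECASE)),
    ("reasons", re.compile(r"^\s*(Entscheidungsgründe|Gründe)\s*:?\s*$", re.IGNORECASE)),
]


def identify_zones(text):
    """Split a decision into (zone_label, lines) pairs using heading rules."""
    # Everything before the first recognised heading goes into "header".
    zones = [("header", [])]
    for line in text.splitlines():
        for label, pattern in ZONE_RULES:
            if pattern.match(line):
                # A heading line opens a new zone and is not kept as content.
                zones.append((label, []))
                break
        else:
            # Non-heading, non-empty lines belong to the current zone.
            if line.strip():
                zones[-1][1].append(line.strip())
    return zones


if __name__ == "__main__":
    sample = (
        "Amtsgericht Musterstadt\n"
        "Urteil\n"
        "Tenor\n"
        "Die Klage wird abgewiesen.\n"
        "Tatbestand\n"
        "Der Kläger verlangt Zahlung.\n"
        "Entscheidungsgründe\n"
        "Die Klage ist unbegründet.\n"
    )
    for label, lines in identify_zones(sample):
        print(label, lines)
```

A heading-driven splitter like this only works because the genre is semi-structured. Real decisions would likely need further rules for zones that carry no explicit heading, such as the parties or the court and date.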


Keywords: court decisions, content zones, document parsing




  1. Bieler, H., Dipper, S., Stede, M.: Identifying formal and functional zones in film reviews. In: Proceedings of the 8th SIGdial Workshop on Discourse and Dialogue, Antwerpen (2007)
  2. Dipper, S.: XML-based Stand-off Representation and Exploitation of Multi-Level Linguistic Annotation. In: Proc. of Berliner XML-Tage (BXML 2005), pp. 39–50 (2005)
  3. Hachey, B., Grover, C.: Extractive summarization of legal texts. Artificial Intelligence and Law 14, 305–345 (2006)
  4. Miller, R.C.: Lightweight Structure in Text. PhD thesis, Computer Science Department, School of Computer Science, Carnegie Mellon University (May 2002)
  5. Moens, M.-F., Uyttendaele, C., Dumortier, J.: Abstracting of Legal Cases: The SALOMON Experience. In: Proc. of the 6th Int’l Conference on Artificial Intelligence and Law, Melbourne (1996)
  6. Teufel, S., Moens, M.: Summarizing Scientific Articles – Experiments with Relevance and Rhetorical Status. Computational Linguistics 28(4) (2002)
  7. Walter, S., Pinkal, M.: Linguistic support for legal ontology construction. In: Proceedings of ICAIL, pp. 242–243 (2005)
  8. Stede, M., Bieler, H., Dipper, S., Suryiawongkul, A.: SUMMaR: Combining linguistics and statistics for text summarization. In: Proceedings of ECAI, Riva del Garda (2006)
  9. Stede, M., Sauermann, A.: Linearization of arguments in commentary texts. In: Proceedings of the Workshop on Multidisciplinary Approaches to Discourse, Oslo (2008)

Copyright information

© Springer-Verlag Berlin Heidelberg 2009

Authors and Affiliations

  • Manfred Stede (1)
  • Florian Kuhn (1)
  1. Applied Computational Linguistics, Dept. of Linguistics, University of Potsdam, Golm, Germany
