Abstract
This paper describes the corpus annotation effort in the FLAG project and its application to the development of controlled-language and grammar-checking applications. A USENET corpus was collected and annotated using the error typology developed in the project. The DiET tool was used to support the automatic annotation effort and to evaluate and validate the data. Finally, we report on some notable aspects of the data that emerged from our evaluation.
Copyright information
© 2003 Springer Science+Business Media Dordrecht
About this chapter
Cite this chapter
Becker, M., Bredenkamp, A., Crysmann, B., Klein, J. (2003). Annotation of Error Types for German Newsgroup Corpus. In: Abeillé, A. (ed.) Treebanks. Text, Speech and Language Technology, vol 20. Springer, Dordrecht. https://doi.org/10.1007/978-94-010-0201-1_6
DOI: https://doi.org/10.1007/978-94-010-0201-1_6
Publisher Name: Springer, Dordrecht
Print ISBN: 978-1-4020-1335-5
Online ISBN: 978-94-010-0201-1
eBook Packages: Springer Book Archive