Abstract
Coreference resolution, determining the appropriate discourse referent for an anaphoric expression, is an essential but difficult task in natural language processing. It has been observed that an important source of errors in machine-learning based approaches to this task, is the wrong disambiguation of the third person singular neuter pronoun as either referential or non-referential. In this paper, we investigate whether a machine learning based approach can be successfully applied to the disambiguation of the neuter pronoun in Dutch and show a modest potential effect of this disambiguation on the results of a machine learning based coreference resolution system for Dutch.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Preview
Unable to display preview. Download preview PDF.
References
Paice, C., Husk, G.: Towards an automatic recognition of anaphoric features in english text: the impersonal pronoun ’it’. Computer Speech and Language 2, 109–132 (1987)
Lappin, S., Leass, H.: An algorithm for pronominal anaphora resolution. Computational Linguistics 20(4), 535–561 (1994)
Boyd, A., Gegg-Harrison, W., Byron, D.: Identifiying non-referential it: a machine learning approach incorporating linguistically motivated patterns. In: Proceedings of the ACL Workshop on Feature Engineering for Machine Learning in NLP, pp. 40–47 (2005)
Evans, R.: Applying machine learning toward an automatic classification of it. Literary and Linguistic Computing 16(1), 45–57 (2001)
Ng, V., Cardie, C.: Identifying anaphoric and non-anaphoric noun phrases to improve coreference resolution. In: Proceedings of the 19th International Conference on Computational Linguistics (COLING-2002) (2002)
Hoste, V.: Optimization Issues in Machine Learning of Coreference Resolution. PhD thesis, Antwerp University (2005)
Daelemans, W., van den Bosch, A.: Memory-based Language Processing. Cambridge University Press, Cambridge (2005)
McCarthy, J.: A Trainable Approach to Coreference Resolution for Information Extraction. PhD thesis, Department of Computer Science, University of Massachusetts, Amherst MA (1996)
Soon, W., Ng, H., Lim, D.: A machine learning approach to coreference resolution of noun phrases. Computational Linguistics 27(4), 521–544 (2001)
Ng, V., Cardie, C.: Combining sample selection and error-driven pruning for machine learning of coreference rules. In: Proceedings of the 2002, Conference on Empirical Methods in Natural Language Processing (EMNLP 2002), pp. 55–62 (2002)
Yang, X., Zhou, G., Su, J., Tan, C.L: Coreference resolution using competition learning approach. In: Proceedings of the 41st Annual Meeting of the Association for Compuatiational Linguistics (ACL 2003), pp. 176–183. Sapporo, Japan (2003)
Uryupina, O.: Linguistically motivated sample selection for coreference resolution. In: Proceedings of DAARC-2004 (2004)
Hendrickx, I., Hoste, V., Daelemans, W.: Evaluating hybrid versus data-driven coreference resolution. In: Anaphora: Analysis, Algorithms and Applications (LNAI 4410) (2007)
Author information
Authors and Affiliations
Editor information
Rights and permissions
Copyright information
© 2007 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Hoste, V., Hendrickx, I., Daelemans, W. (2007). Disambiguation of the Neuter Pronoun and Its Effect on Pronominal Coreference Resolution. In: Matoušek, V., Mautner, P. (eds) Text, Speech and Dialogue. TSD 2007. Lecture Notes in Computer Science(), vol 4629. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-540-74628-7_9
Download citation
DOI: https://doi.org/10.1007/978-3-540-74628-7_9
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-74627-0
Online ISBN: 978-3-540-74628-7
eBook Packages: Computer ScienceComputer Science (R0)