A Comparison of Rule-Based and Machine Learning Methods for Identifying Non-nominal It

Evans, Richard

doi:10.1007/3-540-45154-4_22

A Comparison of Rule-Based and Machine Learning Methods for Identifying Non-nominal It

Richard Evans²

Conference paper
First Online: 01 January 2000

985 Accesses
3 Citations

Part of the book series: Lecture Notes in Computer Science ((LNAI,volume 1835))

Abstract

The pronoun it is noted to be used in a variety of non-nominal ways. The identification of non-nominal pronouns is important in information retrieval, machine translation and automatic summarisation. Given that previous work has only tackled a subset of those non-nominal uses, a machine learning method for identification of all instances of non-nominal it is presented. The machine learning method is compared with a rule-based approach. The performance of each implementation is evaluated. The construction of an annotated corpus and training data are also described.

This is a preview of subscription content, log in via an institution.

Chapter: USD 29.95; Price excludes VAT (USA)

eBook: USD 39.99; Price excludes VAT (USA)

Softcover Book: USD 54.99; Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Learn about institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

References

Burnard, L. (1995) Users Reference Guide British National Corpus Version 1.0, Oxford University Computing Services, UK.
Google Scholar
Daelemans, W. (1999) TiMBL: Tilburg Memory Based Learner version 2 Reference Guide, ILK Technical Report-ILK 99-01, Tilburg University, The Netherlands
Google Scholar
Denber, M. (1998) Automatic Resolution of Anaphora in English Eastman Kodak Co., Imaging Science Division
Google Scholar
Harabagiu, S.M. and Maiorano, S.J. (1999) Knowledge-Lean Coreference Resolution and its Relation to Textual Cohesion and Coherence, in Proceedings of the Workshop The Relation of Discourse / Dialogue Structure and Reference, ACL’ 99, Maryland, US.
Google Scholar
Hirschmann, L (1997) MUC-7 Coreference Task Definition at http://www.muc.saic.com/proceedings/co_task.pdf
Hirst, G. (1981) Anaphora in Natural Language Understanding, Springer Verlag, Germany
Google Scholar
Lappin, S. and Leass, H.J. (1994) An Algorithm for Pronominal Anaphora Resolution, in Computational Linguistics Volume 20, Number 4
Google Scholar
Litman, D. J. (1996) Cue Phrase Classification Using Machine Learning, in Journal of Artificial Intelligence Research, vol 5, pp. 53–94
Google Scholar
Mikheev, A. (1996) LTCHUNK V 2.1, Language Technology Group, University of Edinburgh, available from http://www.ltg.ed.ac.uk/software/chunk/index.html
Mitkov, R., Belguith, L. and Stys, M. (1998) Multilingual Robust Anaphora Resolution, in Proceedings of The Third International Conference on Empirical Methods in Natural Language Processing, Granada, Spain.
Google Scholar
Paice, C.D. and Husk, G.D. (1987) Towards the automatic recognition of anaphoric features in English text: the impersonal pronoun ‘it,’ in Computer Speech and Language, 2 p. 109–132, Academic Press, US.
Google Scholar
Quinlan, J.R. (1993) C4.5: Programs for Machine Learning, Morgan Kaufmann, US.
Google Scholar
Quirk, R. et al. (1985) A Comprehensive Grammar of the English Language, Longman, UK.
Google Scholar
Sampson, G. (1995) English for the Computer: The SUSANNE Corpus and analytic scheme, Oxford Univerity Press, UK.
Google Scholar
Sinclair, J. et al. (1995) English Grammar, Harper Collins Publishers, UK.
Google Scholar
Swan, M. (1995) Practical English Usage, Oxford University Press, UK.
Google Scholar
Tapanainen, P. and Järvinen, T. (1997) A Non-Projective Dependency Parser, in The Proceedings of The 5th Conference of Applied Natural Language Processing, pages 64–71, ACL, US.
Google Scholar

Download references

Author information

Authors and Affiliations

School of Humanities, Languages and Social Sciences, University of Wolverhampton, Stafford Street, Wolverhampton, WV1 1SB, UK
Richard Evans

Authors

Richard Evans
View author publications
You can also search for this author in PubMed Google Scholar

Editor information

Editors and Affiliations

Computer Engineering Department and Computer Technology Institute, University of Patras, 26500, Patras, Greece
Dimitris N. Christodoulakis

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Evans, R. (2000). A Comparison of Rule-Based and Machine Learning Methods for Identifying Non-nominal It . In: Christodoulakis, D.N. (eds) Natural Language Processing — NLP 2000. NLP 2000. Lecture Notes in Computer Science(), vol 1835. Springer, Berlin, Heidelberg. https://doi.org/10.1007/3-540-45154-4_22

Download citation

DOI: https://doi.org/10.1007/3-540-45154-4_22
Published: 25 May 2000
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-67605-8
Online ISBN: 978-3-540-45154-9
eBook Packages: Springer Book Archive

Publish with us

Policies and ethics