Skip to main content

A Comparison of Rule-Based and Machine Learning Methods for Identifying Non-nominal It

  • Conference paper
  • First Online:

Part of the book series: Lecture Notes in Computer Science ((LNAI,volume 1835))

Abstract

The pronoun it is noted to be used in a variety of non-nominal ways. The identification of non-nominal pronouns is important in information retrieval, machine translation and automatic summarisation. Given that previous work has only tackled a subset of those non-nominal uses, a machine learning method for identification of all instances of non-nominal it is presented. The machine learning method is compared with a rule-based approach. The performance of each implementation is evaluated. The construction of an annotated corpus and training data are also described.

This is a preview of subscription content, log in via an institution.

Buying options

Chapter
USD   29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD   39.99
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD   54.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Learn about institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.

References

  1. Burnard, L. (1995) Users Reference Guide British National Corpus Version 1.0, Oxford University Computing Services, UK.

    Google Scholar 

  2. Daelemans, W. (1999) TiMBL: Tilburg Memory Based Learner version 2 Reference Guide, ILK Technical Report-ILK 99-01, Tilburg University, The Netherlands

    Google Scholar 

  3. Denber, M. (1998) Automatic Resolution of Anaphora in English Eastman Kodak Co., Imaging Science Division

    Google Scholar 

  4. Harabagiu, S.M. and Maiorano, S.J. (1999) Knowledge-Lean Coreference Resolution and its Relation to Textual Cohesion and Coherence, in Proceedings of the Workshop The Relation of Discourse / Dialogue Structure and Reference, ACL’ 99, Maryland, US.

    Google Scholar 

  5. Hirschmann, L (1997) MUC-7 Coreference Task Definition at http://www.muc.saic.com/proceedings/co_task.pdf

  6. Hirst, G. (1981) Anaphora in Natural Language Understanding, Springer Verlag, Germany

    Google Scholar 

  7. Lappin, S. and Leass, H.J. (1994) An Algorithm for Pronominal Anaphora Resolution, in Computational Linguistics Volume 20, Number 4

    Google Scholar 

  8. Litman, D. J. (1996) Cue Phrase Classification Using Machine Learning, in Journal of Artificial Intelligence Research, vol 5, pp. 53–94

    Google Scholar 

  9. Mikheev, A. (1996) LTCHUNK V 2.1, Language Technology Group, University of Edinburgh, available from http://www.ltg.ed.ac.uk/software/chunk/index.html

  10. Mitkov, R., Belguith, L. and Stys, M. (1998) Multilingual Robust Anaphora Resolution, in Proceedings of The Third International Conference on Empirical Methods in Natural Language Processing, Granada, Spain.

    Google Scholar 

  11. Paice, C.D. and Husk, G.D. (1987) Towards the automatic recognition of anaphoric features in English text: the impersonal pronoun ‘it,’ in Computer Speech and Language, 2 p. 109–132, Academic Press, US.

    Google Scholar 

  12. Quinlan, J.R. (1993) C4.5: Programs for Machine Learning, Morgan Kaufmann, US.

    Google Scholar 

  13. Quirk, R. et al. (1985) A Comprehensive Grammar of the English Language, Longman, UK.

    Google Scholar 

  14. Sampson, G. (1995) English for the Computer: The SUSANNE Corpus and analytic scheme, Oxford Univerity Press, UK.

    Google Scholar 

  15. Sinclair, J. et al. (1995) English Grammar, Harper Collins Publishers, UK.

    Google Scholar 

  16. Swan, M. (1995) Practical English Usage, Oxford University Press, UK.

    Google Scholar 

  17. Tapanainen, P. and Järvinen, T. (1997) A Non-Projective Dependency Parser, in The Proceedings of The 5th Conference of Applied Natural Language Processing, pages 64–71, ACL, US.

    Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2000 Springer-Verlag Berlin Heidelberg

About this paper

Cite this paper

Evans, R. (2000). A Comparison of Rule-Based and Machine Learning Methods for Identifying Non-nominal It . In: Christodoulakis, D.N. (eds) Natural Language Processing — NLP 2000. NLP 2000. Lecture Notes in Computer Science(), vol 1835. Springer, Berlin, Heidelberg. https://doi.org/10.1007/3-540-45154-4_22

Download citation

  • DOI: https://doi.org/10.1007/3-540-45154-4_22

  • Published:

  • Publisher Name: Springer, Berlin, Heidelberg

  • Print ISBN: 978-3-540-67605-8

  • Online ISBN: 978-3-540-45154-9

  • eBook Packages: Springer Book Archive

Publish with us

Policies and ethics