Part-of-Speech Tagging Using Rules

Language Processing with Perl and Prolog

Part of the book series: Cognitive Technologies (COGTECH)

Abstract

We saw that looking up a word in a lexicon or carrying out a morphological analysis on a word can leave it with an ambiguous part of speech. The word chair, which can be assigned two tags, noun or verb, is an example of ambiguity. It is a noun in the phrase a chair, and a verb in to chair a session.
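
The ambiguity described above can be illustrated with a minimal sketch. It is not the chapter's own method: the toy lexicon, the tag names, and the two contextual rules (a determiner is followed by a noun; the particle "to" is followed by a verb) are assumptions chosen only to reproduce the chair example.

```python
# Hypothetical toy lexicon: each word maps to the set of tags it may carry.
LEXICON = {
    "a": {"det"},
    "to": {"to"},
    "chair": {"noun", "verb"},   # ambiguous: noun or verb
    "session": {"noun"},
}


def tag_sentence(words):
    """Look up each word, then apply two hand-written contextual rules.

    Returns a list of (word, tag) pairs. A word that stays ambiguous
    keeps all its candidate tags, joined with '|'.
    """
    tagged = []
    for word in words:
        # Unknown words get a placeholder tag (an assumption of this sketch).
        candidates = set(LEXICON.get(word.lower(), {"unknown"}))
        previous_tag = tagged[-1][1] if tagged else None
        if len(candidates) > 1:
            # Rule 1: after a determiner, prefer the noun reading.
            if previous_tag == "det" and "noun" in candidates:
                candidates = {"noun"}
            # Rule 2: after the particle "to", prefer the verb reading.
            elif previous_tag == "to" and "verb" in candidates:
                candidates = {"verb"}
        tagged.append((word, "|".join(sorted(candidates))))
    return tagged


if __name__ == "__main__":
    print(tag_sentence(["a", "chair"]))
    print(tag_sentence(["to", "chair", "a", "session"]))
    print(tag_sentence(["chair"]))
```

Without context, the lookup leaves chair with both tags; the left neighbor is what lets a rule select one of them.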

Copyright information

© 2014 Springer-Verlag Berlin Heidelberg

About this chapter

Cite this chapter

Nugues, P.M. (2014). Part-of-Speech Tagging Using Rules. In: Language Processing with Perl and Prolog. Cognitive Technologies. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-41464-0_7

  • DOI: https://doi.org/10.1007/978-3-642-41464-0_7

  • Publisher Name: Springer, Berlin, Heidelberg

  • Print ISBN: 978-3-642-41463-3

  • Online ISBN: 978-3-642-41464-0

  • eBook Packages: Computer Science (R0)
