Selection Restrictions Acquisition for Parsing Improvement

Agustini, Alexandre; Gamallo, Pablo; Lopes, Gabriel P.

doi:10.1007/3-540-36524-9_11

Alexandre Agustini⁵,
Pablo Gamallo⁵ &
Gabriel P. Lopes⁵

Part of the book series: Lecture Notes in Computer Science ((LNAI,volume 2543))

Included in the following conference series:

International Conference on Applications of Prolog

392 Accesses
1 Citations

Abstract

Partially parsed corpora is used for automatically extracting semantic and syntactic subcategorization information for words, helping to cluster them according to their sense which is highly restricted by the syntactic contexts where words do occur. In this paper we propose the use of a parsing platform, based on chart parsing and tabling, in order to check if the syntactic and semantic information extracted automatically leads to better parses than assuming that words do not subcategorize anything.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Log in via an institution

Chapter: USD 29.95; Price excludes VAT (USA)

eBook: USD 39.99; Price excludes VAT (USA)

Softcover Book: USD 54.99; Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

References

Roberto Basili, Maria Pazienza, and Paola Velardi. Hierarchical clustering of verbs. In Workshop on Acquisition of Lexical Knowledge from Text, pages 56–70, Ohio State University, USA, 1993.
Google Scholar
Gilles Bisson, Claire Nédellec, and Dolores Canamero. Designing clustering methods for ontology building: The mo’k workbench. In Internal rapport, citerseer.nj.nec.com/316335.html, 2000.
Google Scholar
Michael Brent. From grammar to lexicon: unsupervised learning of lexical syntax. Computational Linguistics, 19(3):243–262, 1993.
Google Scholar
Eric Brill and Philip Resnik. A rule-based approach to prepositional phrase attachment disambiguation. In COLING, 1994.
Google Scholar
Ted Briscoe and John Carrol. Automatic extraction of subcategorization from corpora. In 5th Conference on Applied Natural Languague Processing (ANCP97), Washington, DC, USA, 1997.
Google Scholar
Ido Dagan, Lillian Lee, and Fernando Pereira. Similarity-based methods of word coocurrence probabilities. Machine Learning, 43, 1998.
Google Scholar
David Faure. Conception de méthode d'aprentissage symbolique et automatique pour l’acquisition de cadres de sous-catégorisation de verbes et de connaissances sémantiques à partir de textes: le système ASIUM. PhD thesis, Université Paris XI Orsay, Paris, France, 2000.
Google Scholar
David Faure and Claire Nédellec. Asium: Learning subcategorization frames and restrictions of selection. In ECML98, Workshop on Text Mining, 1998.
Google Scholar
Francesc Ribas Framis. On learning more appropriate selectional restrictions. In Proceedings of the 7th Conference of the European Chapter of the Association for Computational Linguistics, Dublin, 1995.
Google Scholar
Pablo Gamallo. Construction conceptuelle d'expressions complexes: traitement de la combinaison nom-adjectif. PhD thesis, Université Blaise Pascal, Clermont-Ferrand, France, 1998.
Google Scholar
Pablo Gamallo, Alexandre Agustini, and Gabriel P. Lopes. Selection restrictions acquisition from corpora. In 10th Portuguese Conference on Artificial Intelligence (EPIA’01), Porto, Portugal, 2001. LNAI, Springer-Verlag. Selection Restrictions Acquisition for Parsing Improvement 143
Google Scholar
Pablo Gamallo, Caroline Gasperin, Alexandre Agustini, and Gabriel P. Lopes. Syntactic-based methods for measuring word similarity. In V. Mautner, R. Moucek, and K. Moucek, editors, Text, Speech, and Discourse (TSD-2001), pages 116–125. Berlin: Springer Verlag, 2001.
Google Scholar
Gregory Grefenstette. Explorations in Automatic Thesaurus Discovery. Kluwer Academic Publishers, USA, 1994.
MATH Google Scholar
Gregory Grefenstette. Evaluation techniques for automatic semantic extraction: Comparing syntatic and window based approaches. In Branimir Boguraev and James Pustejovsky, editors, Corpus processing for Lexical Acquisition, pages 205–216. The MIT Press, 1995.
Google Scholar
Ralph Grishman and John Sterling. Generalizing automatically generated selectional patterns. In Proceedings of the 15th International on Computational Linguistics (COLING-94), 1994.
Google Scholar
Donald Hindle and Mats Rooth. Structural ambiguity and lexical relations. Computational Linguistics, 19(1):103–120, 1993.
Google Scholar
Dekang Lin. Automatic retrieval and clustering of similar words. In COLINGACL’ 98, Montreal, 1998.
Google Scholar
N. Marques and J.G.P. Lopes. Tagging with small training corpora. In Advances in Intelligent Data Analysi: 4th International Conference, IDA 2001, number 2189 in Lecture Notes in Computer Science (LNCS), pages 63–72, Berlin, Germany, 2001. Springer Verlag.
Google Scholar
Nuno Marques. Uma Metodologia para a ModelaÇão Estatística da SubcategorizaÇão Verbal. PhD thesis, Universidade Nova de Lisboa, Lisboa, Portugal, 2000.
Google Scholar
Fernando Pereira, Naftali Tishby, and Lillian Lee. Distributional clustering of english words. In Proceedings of the 30th Annual Meeting of the Association of Comptutational Linguistics, pages 183–190, Columbos, Ohio, 1993.
Google Scholar
James Pustejovsky. The Generative Lexicon. MIT Press, Cambridge, 1995.
Google Scholar
Philip Resnik. Selectional preference and sense disambiguation. In ACL-SIGLEX Workshop on Tagging with Lexical Semantics, Washinton DC, 1997.
Google Scholar
Philip Resnik. Semantic similarity in taxonomy: An information-based measure and its application to problems of ambiguity in natural language. Journal of Artificial Intelligence Research, 11:95–130, 1999.
MATH Google Scholar
V. Rocio, E. de la Clergerie, and J.G.P. Lopes. Tabulation for multi-purpose partial parsing. Journal of Grammars, 4(1), 2001.
Google Scholar
Vitor Rocio. Syntactic infrastructure for fault findind and repair in Natural Language Processing Systemss. PhD thesis, Faculdade de Ciências e Tecnologia, Universidade Nova de Lisboa, 2002. written in Portuguese.
Google Scholar
Luis Talavera and Javier Béjar. Integrating declarative knowledge in hierarchical clustering tasks. In Intelligent Data Analysis, pages 211–222, 1999.
Google Scholar

Download references

Author information

Authors and Affiliations

CENTRIA, Departamento de Informática Universidade Nova de Lisboa, Portgual
Alexandre Agustini, Pablo Gamallo & Gabriel P. Lopes

Authors

Alexandre Agustini
View author publications
You can also search for this author in PubMed Google Scholar
Pablo Gamallo
View author publications
You can also search for this author in PubMed Google Scholar
Gabriel P. Lopes
View author publications
You can also search for this author in PubMed Google Scholar

Editor information

Editors and Affiliations

IF Computer Japan, 5-28-2 Sendagi, Bunkyo-ku, 113-0022, Tokyo, Japan
Oskar Bartenstein
Fraunhofer FIRST, Kekulé 7, 12489, Berlin, Germany
Ulrich Geske
think-cell Software GmbH, Invalidenstraße 34, 10115, Berlin, Germany
Markus Hannebauer
Waseda University, 2-7 Hibikino, Wakamatsu-ku, Kitakyushu, Fukuoka, Japan
Osamu Yoshie

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Agustini, A., Gamallo, P., Lopes, G.P. (2003). Selection Restrictions Acquisition for Parsing Improvement. In: Bartenstein, O., Geske, U., Hannebauer, M., Yoshie, O. (eds) Web Knowledge Management and Decision Support. INAP 2001. Lecture Notes in Computer Science(), vol 2543. Springer, Berlin, Heidelberg. https://doi.org/10.1007/3-540-36524-9_11

Download citation

DOI: https://doi.org/10.1007/3-540-36524-9_11
Published: 14 March 2003
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-00680-0
Online ISBN: 978-3-540-36524-2
eBook Packages: Springer Book Archive

Publish with us

Policies and ethics