Abstract
Partially parsed corpora is used for automatically extracting semantic and syntactic subcategorization information for words, helping to cluster them according to their sense which is highly restricted by the syntactic contexts where words do occur. In this paper we propose the use of a parsing platform, based on chart parsing and tabling, in order to check if the syntactic and semantic information extracted automatically leads to better parses than assuming that words do not subcategorize anything.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Preview
Unable to display preview. Download preview PDF.
References
Roberto Basili, Maria Pazienza, and Paola Velardi. Hierarchical clustering of verbs. In Workshop on Acquisition of Lexical Knowledge from Text, pages 56–70, Ohio State University, USA, 1993.
Gilles Bisson, Claire Nédellec, and Dolores Canamero. Designing clustering methods for ontology building: The mo’k workbench. In Internal rapport, citerseer.nj.nec.com/316335.html, 2000.
Michael Brent. From grammar to lexicon: unsupervised learning of lexical syntax. Computational Linguistics, 19(3):243–262, 1993.
Eric Brill and Philip Resnik. A rule-based approach to prepositional phrase attachment disambiguation. In COLING, 1994.
Ted Briscoe and John Carrol. Automatic extraction of subcategorization from corpora. In 5th Conference on Applied Natural Languague Processing (ANCP97), Washington, DC, USA, 1997.
Ido Dagan, Lillian Lee, and Fernando Pereira. Similarity-based methods of word coocurrence probabilities. Machine Learning, 43, 1998.
David Faure. Conception de méthode d'aprentissage symbolique et automatique pour l’acquisition de cadres de sous-catégorisation de verbes et de connaissances sémantiques à partir de textes: le système ASIUM. PhD thesis, Université Paris XI Orsay, Paris, France, 2000.
David Faure and Claire Nédellec. Asium: Learning subcategorization frames and restrictions of selection. In ECML98, Workshop on Text Mining, 1998.
Francesc Ribas Framis. On learning more appropriate selectional restrictions. In Proceedings of the 7th Conference of the European Chapter of the Association for Computational Linguistics, Dublin, 1995.
Pablo Gamallo. Construction conceptuelle d'expressions complexes: traitement de la combinaison nom-adjectif. PhD thesis, Université Blaise Pascal, Clermont-Ferrand, France, 1998.
Pablo Gamallo, Alexandre Agustini, and Gabriel P. Lopes. Selection restrictions acquisition from corpora. In 10th Portuguese Conference on Artificial Intelligence (EPIA’01), Porto, Portugal, 2001. LNAI, Springer-Verlag. Selection Restrictions Acquisition for Parsing Improvement 143
Pablo Gamallo, Caroline Gasperin, Alexandre Agustini, and Gabriel P. Lopes. Syntactic-based methods for measuring word similarity. In V. Mautner, R. Moucek, and K. Moucek, editors, Text, Speech, and Discourse (TSD-2001), pages 116–125. Berlin: Springer Verlag, 2001.
Gregory Grefenstette. Explorations in Automatic Thesaurus Discovery. Kluwer Academic Publishers, USA, 1994.
Gregory Grefenstette. Evaluation techniques for automatic semantic extraction: Comparing syntatic and window based approaches. In Branimir Boguraev and James Pustejovsky, editors, Corpus processing for Lexical Acquisition, pages 205–216. The MIT Press, 1995.
Ralph Grishman and John Sterling. Generalizing automatically generated selectional patterns. In Proceedings of the 15th International on Computational Linguistics (COLING-94), 1994.
Donald Hindle and Mats Rooth. Structural ambiguity and lexical relations. Computational Linguistics, 19(1):103–120, 1993.
Dekang Lin. Automatic retrieval and clustering of similar words. In COLINGACL’ 98, Montreal, 1998.
N. Marques and J.G.P. Lopes. Tagging with small training corpora. In Advances in Intelligent Data Analysi: 4th International Conference, IDA 2001, number 2189 in Lecture Notes in Computer Science (LNCS), pages 63–72, Berlin, Germany, 2001. Springer Verlag.
Nuno Marques. Uma Metodologia para a ModelaÇão Estatística da SubcategorizaÇão Verbal. PhD thesis, Universidade Nova de Lisboa, Lisboa, Portugal, 2000.
Fernando Pereira, Naftali Tishby, and Lillian Lee. Distributional clustering of english words. In Proceedings of the 30th Annual Meeting of the Association of Comptutational Linguistics, pages 183–190, Columbos, Ohio, 1993.
James Pustejovsky. The Generative Lexicon. MIT Press, Cambridge, 1995.
Philip Resnik. Selectional preference and sense disambiguation. In ACL-SIGLEX Workshop on Tagging with Lexical Semantics, Washinton DC, 1997.
Philip Resnik. Semantic similarity in taxonomy: An information-based measure and its application to problems of ambiguity in natural language. Journal of Artificial Intelligence Research, 11:95–130, 1999.
V. Rocio, E. de la Clergerie, and J.G.P. Lopes. Tabulation for multi-purpose partial parsing. Journal of Grammars, 4(1), 2001.
Vitor Rocio. Syntactic infrastructure for fault findind and repair in Natural Language Processing Systemss. PhD thesis, Faculdade de Ciências e Tecnologia, Universidade Nova de Lisboa, 2002. written in Portuguese.
Luis Talavera and Javier Béjar. Integrating declarative knowledge in hierarchical clustering tasks. In Intelligent Data Analysis, pages 211–222, 1999.
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2003 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Agustini, A., Gamallo, P., Lopes, G.P. (2003). Selection Restrictions Acquisition for Parsing Improvement. In: Bartenstein, O., Geske, U., Hannebauer, M., Yoshie, O. (eds) Web Knowledge Management and Decision Support. INAP 2001. Lecture Notes in Computer Science(), vol 2543. Springer, Berlin, Heidelberg. https://doi.org/10.1007/3-540-36524-9_11
Download citation
DOI: https://doi.org/10.1007/3-540-36524-9_11
Published:
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-00680-0
Online ISBN: 978-3-540-36524-2
eBook Packages: Springer Book Archive