Abstract
TreeLex is a subcategorization lexicon of French, automatically extracted from a syntactically annotated corpus. The lexicon comprises 2006 verbs (25076 occurrences). The goal of the project is to obtain a list of subcategorization frames of contemporary French verbs and to estimate the number of different verb frames available in French in general. A few more frames are discovered when the corpus size changes, but the average number of frames per verb remains relatively stable (about 1.91–2.09 frames per verb).
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Preview
Unable to display preview. Download preview PDF.
References
Abeillé, A., Clément, L., Toussenel, F.: Building a treebank for French. In: Treebanks, Kluwer, Dordrecht (2003)
Bourigault, D., Frérot, C.: Acquisition et évaluation sur corpus de propriétés de sous-catégorisation syntaxique. In: Actes des 12èmes journées sur le Traitement Automatique des Langues Naturelles (2005)
Bresnan, J. (ed.): The Mental Representation of Grammatical Relations. MIT Press Series on Cognitive Theory and Mental Representation. MIT Press, Cambridge (1982)
Briscoe, T., Carroll, J.: Generalised probabilistic LR parsing for unification-based grammars. Computational linguistics (1993)
Cahill, A., et al.: Parsing with PCFGs and automatic f-structure annotation. In: Butt, M., King, T.H. (eds.) Proceedings of the LFG 2002 Conference, CSLI Publications (2002)
Carroll, J., Fang, A.: The automatique acquisition of verb subcategorisations and their impact on the performance of an HPSG parser. In: Proceedings of the 1st International Conference on Natural Language Processing, Sanya City, China (2004)
Chesley, P., Salmon-Alt, S.: Le filtrage probabiliste dans l’extraction automatique de cadres de sous-catǵorisation. In: Journé ATALA sur l’interface lexique-grammaire, Paris (2005)
Danlos, L., Sagot, B.: Comparaison du Lexique-Grammaire et de Dicovalence: vers une intégration dans le Lefff. In: Proceedings TALN 2007 (2007)
Danlos, L.: La génération automatique de textes, Masson (1985)
Davis, A., Koenig, J.-P.: Linking as constraints on word classes in a hierachical lexicon. Language 76(1), 56–91 (2000)
Falk, I., Francopoulo, G., Gardent, C.: Evaluer SynLex. In: Proceedings of TALN 2007 (2007)
Fort, K., Guillaume, B.: Preplex: a lexicon of French prepositions for parsing. In: ACL SIGSEM 2007 (2007)
Frank, A., et al.: From treebank resources to LFG f-structures. In: Treebanks, Kluwer, Dordrecht (2002)
Gardent, C., et al.: Extraction d’information de sous-catégorisation à partir du lexique-grammaire de Maurice Gross. In: TALN 2006 (2006)
Gross, M.: Méthodes en syntaxe. Hermann (1975)
Guillet, A., Leclère, C.: La structure des phrases simples en français. Droz, Genève (1992)
C.-H. Han, J., Yoon, N., Kim, M.: Palmer, A feature-based lexicalized tree adjoining grammar for Korean. Technical report, IRCS (2000)
Hornby, A.S.: Oxford Advanced Learner’s Dictionary of Current English, 4th edn. Oxford University Press, Oxford (1989)
Mel’cuk, I., Arbatchewsky-Jumarie, N., Clas, A.: 1984, 1988, 1992, 1999. Dictionnaire explicatif et combinatoire du français contemporain. Recherches lexico-sémantiques, vol. I, II, III, IV. Les Presses de l’Université de Montréal
Pollard, C., Sag, I.A.: Head-driven Phrase Structure Grammar. Chicago University Press / CSLI Publications, Chicago, IL (1994)
Procter, P. (ed.): Longman Dictionary of Contemporary English, Longman, Burnt Mill, Harlow (1978)
Sagot, B., Danlos, L.: Améliorer un lexique syntaxique à l’aide des tables du lexique-grammaire. constructions impersonnelles. In: Proceedings of TALN 2007 (2007)
Sagot, B., et al.: The lefff 2 syntactic lexicon for french: architecture, acquisition, use. In: Actes de LREC 2006, Gênes, Italie (2006)
Surdeanu, M., et al.: Using predicate-argument structures for information extraction (2003)
van den Eynde, K., Mertens, P.: La valence: l’approche pronominale et son application au lexique verbal. French Language Studies 13, 63–104 (2003)
Author information
Authors and Affiliations
Editor information
Rights and permissions
Copyright information
© 2008 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Kupść, A., Abeillé, A. (2008). Growing TreeLex. In: Gelbukh, A. (eds) Computational Linguistics and Intelligent Text Processing. CICLing 2008. Lecture Notes in Computer Science, vol 4919. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-540-78135-6_3
Download citation
DOI: https://doi.org/10.1007/978-3-540-78135-6_3
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-78134-9
Online ISBN: 978-3-540-78135-6
eBook Packages: Computer ScienceComputer Science (R0)