Abstract
In this paper, we present a new approach in Natural Language Processing (NLP) aimed at knowledge representation and acquisition using a formal syntactic frame. We describe our implementation on an encyclopedic corpus in the botanical domain and illustrate the algorithm on a set of preliminary tests.
Research partially supported by the Spanish Government under project TIN2004-07246-C03-01, and by the Autonomous Government of Galicia under project PGIDIT05PXIC30501PN and the Network for Language Processing and Information Retrieval.
Copyright information
© 2007 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Fernández, M., de la Clergerie, E.V., Vilares, M. (2007). From Text to Knowledge. In: Moreno Díaz, R., Pichler, F., Quesada Arencibia, A. (eds) Computer Aided Systems Theory – EUROCAST 2007. EUROCAST 2007. Lecture Notes in Computer Science, vol 4739. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-540-75867-9_34
DOI: https://doi.org/10.1007/978-3-540-75867-9_34
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-75866-2
Online ISBN: 978-3-540-75867-9
eBook Packages: Computer Science (R0)