Abstract
We present results of the Prolex project. The project aims at the automated analysis of proper names, and in particular at describing the relations between the different proper names that occur in a text. The system currently handles geographical proper names in French: place names, the adjectives derived from them, and names of inhabitants.
The system consists of a database that stores specific types of proper names together with the relations between them. Using these names and relations, the program groups the proper names in a text that are likely to belong together (for example Beijing-Chinese-Pekinese-China, or American-United States-Washington). To do so, it builds a Boolean association matrix over the names and computes the transitive closure of that matrix. The method is explained with an example.
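The grouping step can be illustrated with a minimal sketch. It is not the Prolex implementation: the function names, the list of direct relations, and the choice to treat relations as symmetric are assumptions made for illustration, and the closure is computed here with Warshall's algorithm, which the abstract does not name. Only the two example groups come from the abstract.

```python
def transitive_closure(matrix):
    """Return the transitive closure of a square Boolean matrix (Warshall's algorithm)."""
    n = len(matrix)
    closure = [row[:] for row in matrix]  # work on a copy, leave the input untouched
    for k in range(n):
        for i in range(n):
            if closure[i][k]:
                for j in range(n):
                    if closure[k][j]:
                        closure[i][j] = True
    return closure


def group_names(names, relations):
    """Group names that are linked, directly or indirectly, by the given relations."""
    index = {name: i for i, name in enumerate(names)}
    n = len(names)
    # Association matrix: every name is related to itself.
    matrix = [[i == j for j in range(n)] for i in range(n)]
    for a, b in relations:
        # Assumption: a relation is usable in both directions.
        matrix[index[a]][index[b]] = True
        matrix[index[b]][index[a]] = True
    closure = transitive_closure(matrix)

    groups, seen = [], set()
    for i in range(n):
        if i in seen:
            continue
        members = [j for j in range(n) if closure[i][j]]
        seen.update(members)
        groups.append([names[j] for j in members])
    return groups


# Names taken from the abstract's example, in order of appearance.
NAMES = ["Beijing", "Chinese", "Pekinese", "China",
         "American", "United States", "Washington"]

# Hypothetical direct relations, standing in for entries of the database.
RELATIONS = [
    ("Beijing", "Pekinese"),         # city and its inhabitant name
    ("Beijing", "China"),            # capital and its country
    ("China", "Chinese"),            # country and its derived adjective
    ("United States", "American"),   # country and its derived adjective
    ("United States", "Washington"), # country and its capital
]

GROUPS = group_names(NAMES, RELATIONS)
for group in GROUPS:
    print("-".join(group))
```

With these assumed relations, "Chinese" and "Pekinese" are never linked directly, yet they end up in the same group because the closure propagates the links through "China" and "Beijing".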
Copyright information
© 2001 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Piton, O., Maurel, D. (2001). “Beijing Frowns and Washington Pays Close Attention” Computer Processing of Relations between Geographical Proper Names in Foreign Affairs. In: Bouzeghoub, M., Kedad, Z., Métais, E. (eds) Natural Language Processing and Information Systems. NLDB 2000. Lecture Notes in Computer Science, vol 1959. Springer, Berlin, Heidelberg. https://doi.org/10.1007/3-540-45399-7_6
DOI: https://doi.org/10.1007/3-540-45399-7_6
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-41943-3
Online ISBN: 978-3-540-45399-4
eBook Packages: Springer Book Archive