Abstract
The Semantic Web (SW) originally aimed at system interoperability based on a shared common knowledge base (an ontology). It now focuses on the semantic coordination of resources that represent a community's parlance, as a complement to a common knowledge base shared by users. The challenge is not only to apply techniques that handle large amounts of data, but also to adopt approaches that preserve the features of the community's parlance. Web documents and folksonomies are therefore the main semantic vehicles. Because they are weakly structured, Natural Language Processing (NLP) methods are useful for analyzing language specificities with a view to automating text-related tasks. This paper describes a use of NLP techniques for the SW through a document engineering application: information retrieval in a catalogue of online medical resources. Our approach emphasizes the benefits of NLP techniques for handling multi-granular terminological resources.
Copyright information
© 2009 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Lortal, G., Chaignaud, N., Kotowicz, JP., Pécuchet, JP. (2009). NLP Contribution to the Semantic Web: Linking the Term to the Concept. In: Velásquez, J.D., Ríos, S.A., Howlett, R.J., Jain, L.C. (eds) Knowledge-Based and Intelligent Information and Engineering Systems. KES 2009. Lecture Notes in Computer Science, vol 5711. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-04595-0_38
DOI: https://doi.org/10.1007/978-3-642-04595-0_38
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-642-04594-3
Online ISBN: 978-3-642-04595-0
eBook Packages: Computer Science, Computer Science (R0)