Abstract
This paper describes a method and a system, ONTOQUERY, for content-based querying of texts that relies on an ontology for the concepts in the text domain. A key principle of the system is the extraction of the conceptual content of noun phrases into descriptors, which form an integral part of the ontology.
The retrieval of text passages rests on matching descriptors from the text against descriptors from the noun phrases in the query. The match need not be exact; it is mediated by the ontology, invoking in particular taxonomic reasoning over sub- and superconcepts. The paper also reports on a prototype implementation of the system.
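The idea of an inexact, taxonomy-mediated match can be illustrated with a minimal sketch. It is not the ONTOQUERY implementation: descriptors are reduced here to atomic concept names, and the toy taxonomy `IS_A`, the functions `superconcepts`, `taxonomic_distance` and `retrieve`, and the `max_distance` threshold are all hypothetical names introduced for illustration.

```python
# Toy taxonomy (hypothetical): each concept maps to its direct superconcept.
IS_A = {
    "dog": "mammal",
    "cat": "mammal",
    "mammal": "animal",
    "animal": "entity",
    "vitamin": "substance",
    "substance": "entity",
}


def superconcepts(concept):
    """Return the chain of superconcepts of `concept`, nearest first."""
    chain = []
    while concept in IS_A:
        concept = IS_A[concept]
        chain.append(concept)
    return chain


def taxonomic_distance(query, text):
    """Count is-a steps between two concepts lying on one taxonomic path.

    Returns 0 for an exact match and None when neither concept
    subsumes the other (for example, two sibling concepts).
    """
    if query == text:
        return 0
    up = superconcepts(text)
    if query in up:  # the text concept is a subconcept of the query concept
        return up.index(query) + 1
    up = superconcepts(query)
    if text in up:  # the text concept is a superconcept of the query concept
        return up.index(text) + 1
    return None


def retrieve(query_descriptors, passages, max_distance=1):
    """Return ids of passages with a descriptor within `max_distance`
    of some query descriptor, closest matches first."""
    hits = []
    for passage_id, descriptors in passages.items():
        distances = [
            dist
            for q in query_descriptors
            for d in descriptors
            if (dist := taxonomic_distance(q, d)) is not None
            and dist <= max_distance
        ]
        if distances:
            hits.append((min(distances), passage_id))
    return [passage_id for _, passage_id in sorted(hits)]


# Assumed example data: descriptors already extracted from three passages.
passages = {"p1": ["dog"], "p2": ["mammal"], "p3": ["vitamin"]}
print(retrieve(["mammal"], passages))
```

In this sketch a query for "mammal" retrieves the passage about mammals first and the passage about dogs second, while the passage about vitamins is not matched. The descriptors in the paper are extracted from noun phrases and are richer than single concept names, so a fuller treatment would compare structured descriptors rather than atoms.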
Copyright information
© 2002 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Andreasen, T., Anker Jensen, P., Fischer Nilsson, J., Paggio, P., Sandford Pedersen, B., Erdman Thomsen, H. (2002). Ontological Extraction of Content for Text Querying. In: Andersson, B., Bergholtz, M., Johannesson, P. (eds) Natural Language Processing and Information Systems. NLDB 2002. Lecture Notes in Computer Science, vol 2553. Springer, Berlin, Heidelberg. https://doi.org/10.1007/3-540-36271-1_11
DOI: https://doi.org/10.1007/3-540-36271-1_11
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-00307-6
Online ISBN: 978-3-540-36271-5