Abstract
This paper describes our large-scale effort to build a conceptual Information Retrieval system that converts a large volume of natural language text into Conceptual Graph representation by means of knowledge-based processing. In order to automatically extract concepts and conceptual relations between concepts from texts, we constructed a knowledge base consisting of over 12,000 case frames for verbs and a large number of other linguistic patterns that reveal conceptual relations. They were used to process a Wall Street Journal database covering a period of three years. We describe our methods for constructing the knowledge base, how the linguistic knowledge is used to process the text, and how the retrieval system makes use of the rich representation of documents and information needs.
Keywords
These keywords were added by machine and not by the authors. This process is experimental and the keywords may be updated as the learning algorithm improves.
This is a preview of subscription content, log in via an institution.
Preview
Unable to display preview. Download preview PDF.
References
Myaeng, S. H. & Liddy, E. (1993) Information Retrieval with Semantic Representation of Texts, in Proc. of Symposium on Document Analysis and Information Retrieval, in April, Las Vegas.
Myaeng, S. H. (1992). Using conceptual graphs for information retrieval:A framework for representation and flexible inferencing. Proceedings of Symposium on Document Analysis and Information Retrieval, Las Vegas, March 16–18.
Sowa, J. (1984). Conceptual Structures: Information Processing in Mind and Machine. Reading, MA: Addison-Wesley.
Fox, E. (1980). Lexical relations: Enhancing effectiveness of information retrieval systems. SIGIR Forum, 14, 6–35.
Wang, Y. et al. (1985). Relational thesauri in information retrieval. Journal of American Society for Information Science, 36, 15–27.
Spark Jones, K. & Kay, M. (1973). Linguistics and Information Science. New York: Academic Press.
Farradane, J. (1980). Relation indexing: Part I and part II. Journal of Information Science, 1, 267–276 & 313-24.
Lu, X. (1990). Document retrieval:A structural approach. Information Processing & Management, 26 (2), 209–218.
Fillmore, C.J. (1968). The case for case. In: Universals in Linguistic Theory, ed. Bach & Harms, 1–88. New York: Holt, Rinehart, and Winston.
Cook, W. (1989). Case Grammar Theory. Washington, D.C.: Georgetown University Press.
Somers, H. L. (1987) Valency and Case in Computational Linguistics. Edinburgh: Edinburgh University Press.
Dick, J. (1992). A conceptual, case-relation representation of text for intelligent retrieval. Technical Report CSRI-265, Computer Systems Research Institute, Univ. of Toronto.
Wendlandt, E. & Driscoll, J. (1991). Incorporating a semantic analysis into a document retrieval strategy. Proc. 14th International ACM/SIGIR Conference on Research and Development in Information Retrieval, Chicago, October.
Rosner, M. & Somers, H.L. (1980). Case in linguistics and cognitive science. UEA Papers in Linguistics, 13, 1–29.
Meteer, M., Schwarte, R. & Weischedel, R. (1991). POST: Using probabilities in language processing. Proceedings of the Twelfth International Conference on Artificial Intelligence, Sydney, Australia.
Hobbs, J. et al. (1992). FASTUS: System summary. Unpublished manuscript.
Myaeng, S. H. & Lopez-lopes, Aurelio (1992). A conceptual graph matching: a flexible algorithm and experiments. Journal of Experimental and Theoretical Artificial Intelligence, 4, 107–126.
Liddy, E., Paik, W. & Woelfel, J. (1992). Use of subject field codes from a machine-readable dictionary for automatic classification of documents. Proc. of 3rd ASIS Classification Research Workshop.
Myaeng, S. H. & Khoo, C. (1992). On uncertainty handling in plausible reasoning with conceptual graphs. Proc. of 7th Workshop on Conceptual Graphs, Las Craces, NM, July, 1992.
Shafer, G. (1976). A Mathematical Theory of Evidence. Princeton, N.J.: Princeton University Press.
Author information
Authors and Affiliations
Editor information
Rights and permissions
Copyright information
© 1994 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Myaeng, S.H., Khoo, C., Li, M. (1994). Linguistic processing of text for a large-scale conceptual Information Retrieval system. In: Tepfenhart, W.M., Dick, J.P., Sowa, J.F. (eds) Conceptual Structures: Current Practices. ICCS 1994. Lecture Notes in Computer Science, vol 835. Springer, Berlin, Heidelberg. https://doi.org/10.1007/3-540-58328-9_5
Download citation
DOI: https://doi.org/10.1007/3-540-58328-9_5
Published:
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-58328-8
Online ISBN: 978-3-540-38675-9
eBook Packages: Springer Book Archive