Information Retrieval Tools for Literary Analysis
The advent of the CD-ROM as a means of distributing massive bodies of textual data increases the importance of developing automatic techniques for textual analysis. To accomplish this task, we should be alert to existing techniques, perhaps developed for other purposes, that can be of value. We report here on observations, made while carrying out research on information storage and retrieval, that promise to be helpful. Specifically, auxiliary information and data structures created incidentally during our IR investigations are rich in semantic content and can be useful in suggesting or confirming relations among concepts in text. Two examples are given: one based on a term weighting scheme for IR, the other on a tree structure for compressing bitmaps.
Keywords: Information Retrieval, Semantic Content, Retrieval Mechanism, Hebrew Word, Singleton Cluster