Abstract
As more information becomes available in electronic form, integrating textual data into database systems is becoming increasingly important. Motivated by recent technology developments, we describe how a preprocessor for simple text retrieval can be realised on top of a relational database system. This approach performs surprisingly well compared with both a commercially available information retrieval system and another relational preprocessor product for text search.
Copyright information
© 1995 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Kaufmann, H., Schek, HJ. (1995). Text search using database systems revisited - Some experiments. In: Goble, C., Keane, J. (eds) Advances in Databases. BNCOD 1995. Lecture Notes in Computer Science, vol 940. Springer, Berlin, Heidelberg. https://doi.org/10.1007/BFb0000549
DOI: https://doi.org/10.1007/BFb0000549
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-60100-5
Online ISBN: 978-3-540-49427-0
eBook Packages: Springer Book Archive