Text search using database systems revisited - Some experiments

  • Conference paper
Advances in Databases (BNCOD 1995)

Part of the book series: Lecture Notes in Computer Science (LNCS, volume 940)

Abstract

With the increasing availability of information in electronic form, integrating textual data into database systems is becoming increasingly important. Motivated by recent technology developments, we describe how a preprocessor for simple text retrieval can be realised on top of a relational database system. This approach performs surprisingly well compared with a commercially available information retrieval system and with another relational preprocessor product for text search.
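
The abstract gives no implementation details, so the following is only a minimal sketch of the general idea of layering simple text retrieval on a relational system. It is not the authors' design. It assumes an inverted-index relation `posting(term, doc_id)` and a boolean AND query that the preprocessor translates into a single SQL statement. The table names, the `tokenize` rule, and the use of SQLite are all assumptions made for illustration.

```python
import re
import sqlite3


def tokenize(text):
    """Lower-case the text and split it into alphanumeric words (assumed rule)."""
    return re.findall(r"[a-z0-9]+", text.lower())


def build_index(conn, docs):
    """Store documents plus a hypothetical (term, doc_id) inverted-index relation."""
    conn.execute("CREATE TABLE doc (doc_id INTEGER PRIMARY KEY, body TEXT)")
    conn.execute(
        "CREATE TABLE posting (term TEXT, doc_id INTEGER, "
        "PRIMARY KEY (term, doc_id))"
    )
    for doc_id, body in docs.items():
        conn.execute("INSERT INTO doc VALUES (?, ?)", (doc_id, body))
        # One row per distinct term, so COUNT(*) below counts matched terms.
        conn.executemany(
            "INSERT INTO posting VALUES (?, ?)",
            [(term, doc_id) for term in set(tokenize(body))],
        )


def search_all(conn, query):
    """Return ids of documents that contain every query term (boolean AND)."""
    terms = sorted(set(tokenize(query)))
    if not terms:
        return []
    placeholders = ",".join("?" * len(terms))
    sql = (
        f"SELECT doc_id FROM posting WHERE term IN ({placeholders}) "
        "GROUP BY doc_id HAVING COUNT(*) = ? ORDER BY doc_id"
    )
    return [row[0] for row in conn.execute(sql, (*terms, len(terms)))]


conn = sqlite3.connect(":memory:")
build_index(
    conn,
    {
        1: "Text search in database systems",
        2: "Relational database systems",
        3: "Text retrieval experiments",
    },
)
print(search_all(conn, "database systems"))  # [1, 2]
print(search_all(conn, "text search"))       # [1]
```

Keeping the index in an ordinary relation means the database system's own indexing, transactions, and query optimiser are reused. Whether the paper's preprocessor works this way cannot be determined from the abstract alone.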

Editor information

Carole Goble John Keane

Copyright information

© 1995 Springer-Verlag Berlin Heidelberg

About this paper

Cite this paper

Kaufmann, H., Schek, HJ. (1995). Text search using database systems revisited - Some experiments. In: Goble, C., Keane, J. (eds) Advances in Databases. BNCOD 1995. Lecture Notes in Computer Science, vol 940. Springer, Berlin, Heidelberg. https://doi.org/10.1007/BFb0000549

  • DOI: https://doi.org/10.1007/BFb0000549

  • Publisher Name: Springer, Berlin, Heidelberg

  • Print ISBN: 978-3-540-60100-5

  • Online ISBN: 978-3-540-49427-0

  • eBook Packages: Springer Book Archive
