Abstract
As the number and diversity of text databases on the Internet increase rapidly, users face the task of finding the text databases that are relevant to their queries. Identifying the relevant text databases among many candidates for a given query is called the text database discovery problem. In this paper, we propose a novel neural approach to the text database discovery problem. First, we present a neural agent that learns about the underlying text databases from the user's relevance feedback. For a given query, the neural agent, once sufficiently trained by the neural net learning mechanism, discovers the text databases associated with the relevant documents and retrieves those documents effectively. To scale our approach to a large number of text databases, we also propose a hierarchical organization of neural agents, which reduces the total training cost to an acceptable level. Finally, we evaluate the performance of our approach by comparing it with that of well-known conventional approaches.
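The idea of an agent that learns which databases to consult from relevance feedback can be illustrated with a minimal sketch. It is not the chapter's actual model: the class `DatabaseSelector`, the binary bag-of-words query encoding, the single sigmoid layer, the learning rate, and the toy vocabulary and database names are all assumptions made for illustration. Each database gets one output unit, and every piece of feedback (which databases held relevant documents for a query) nudges the weights with a delta-rule update.

```python
import math
import random


class DatabaseSelector:
    """Hypothetical single-layer network: query terms in, one score per database out."""

    def __init__(self, vocabulary, databases, learning_rate=0.5, seed=0):
        rng = random.Random(seed)
        vocabulary = list(vocabulary)
        self.term_index = {term: i for i, term in enumerate(vocabulary)}
        self.databases = list(databases)
        self.learning_rate = learning_rate
        # One weight row per database, one weight per vocabulary term.
        self.weights = [
            [rng.uniform(-0.01, 0.01) for _ in vocabulary]
            for _ in self.databases
        ]

    def encode(self, query):
        """Binary bag-of-words vector; terms outside the vocabulary are ignored."""
        vector = [0.0] * len(self.term_index)
        for term in query.lower().split():
            if term in self.term_index:
                vector[self.term_index[term]] = 1.0
        return vector

    def scores(self, query):
        """Sigmoid relevance score in (0, 1) for each database."""
        x = self.encode(query)
        return [
            1.0 / (1.0 + math.exp(-sum(w * xi for w, xi in zip(row, x))))
            for row in self.weights
        ]

    def feedback(self, query, relevant_databases):
        """Delta-rule update from one relevance judgment (target 1 if relevant, else 0)."""
        x = self.encode(query)
        outputs = self.scores(query)
        for d, name in enumerate(self.databases):
            target = 1.0 if name in relevant_databases else 0.0
            error = target - outputs[d]
            for i, xi in enumerate(x):
                self.weights[d][i] += self.learning_rate * error * xi

    def discover(self, query, threshold=0.5):
        """Return the databases whose score exceeds the threshold."""
        return [
            name
            for name, score in zip(self.databases, self.scores(query))
            if score > threshold
        ]


# Toy training loop with made-up feedback.
agent = DatabaseSelector(
    vocabulary=["neural", "network", "protein", "genome"],
    databases=["cs_papers", "bio_papers"],
)
for _ in range(200):
    agent.feedback("neural network", {"cs_papers"})
    agent.feedback("protein genome", {"bio_papers"})

print(agent.discover("neural network"))
print(agent.discover("protein genome"))
```

In this sketch, a hierarchical organization would correspond to a top-level selector whose outputs are groups of databases, each served by its own smaller selector, so that no single network has to be trained over every database.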
Copyright information
© 2003 Springer-Verlag Berlin Heidelberg
About this chapter
Cite this chapter
Choi, Y.S. (2003). Neural Agent for Text Database Discovery. In: Szczepaniak, P.S., Segovia, J., Kacprzyk, J., Zadeh, L.A. (eds) Intelligent Exploration of the Web. Studies in Fuzziness and Soft Computing, vol 111. Physica, Heidelberg. https://doi.org/10.1007/978-3-7908-1772-0_15
DOI: https://doi.org/10.1007/978-3-7908-1772-0_15
Publisher Name: Physica, Heidelberg
Print ISBN: 978-3-7908-2519-0
Online ISBN: 978-3-7908-1772-0
eBook Packages: Springer Book Archive