Neural Agent for Text Database Discovery

Choi, Yong S.

doi:10.1007/978-3-7908-1772-0_15

Neural Agent for Text Database Discovery

Yong S. Choi⁶

Chapter

227 Accesses

Part of the book series: Studies in Fuzziness and Soft Computing ((STUDFUZZ,volume 111))

Abstract

As the number and diversity of text databases on the Internet increases rapidly, users are faced with finding the text databases that are relevant to the user query. Identifying the relevant text databases out of many candidates for a given query is called the text database discovery problem. In this paper, we propose a novel approach, a neural approach, to the text database discovery problem. First, we present a neural agent that learns about underlying text databases from the user’s relevance feedback. For a given query, the neural agent, which is sufficiently trained on the basis of the neural net learning mechanism, discovers the text databases associated with the relevant documents and retrieves those documents effectively. In order to scale our approach with the large number of text databases, we also propose the hierarchical organization of neural agents which reduces the total training cost at the acceptable level. Finally, we evaluate the performance of our approach by comparing it to those of the conventional well-known approaches.

This is a preview of subscription content, log in via an institution.

Chapter: USD 29.95; Price excludes VAT (USA)

eBook: USD 129.00; Price excludes VAT (USA)

Softcover Book: USD 169.99; Price excludes VAT (USA)

Hardcover Book: USD 169.99; Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Learn about institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

References

Broomhead D. S., Lowe D., (1988). Multivariable functional interpolation and adaptive networks, Complex Systems, 2 (3), 321–355.
MathSciNet MATH Google Scholar
Chen S., Cowan C. F. N., Grant P. M., (1991). Orthogonal Least Squares Learning Algorithm for Radial Basis Function Networks, IEEE Transactions on Neural Networks, 2 (2), 302–309.
Article Google Scholar
Freeman J.A., Skapura D.M., (1992). Neural Networks Algorithms, Applications, and Programming Techniques, Addison-Wesley, MA.
Google Scholar
Gravano L., Garcia-Molina H., Tomasic A., (1994). The effectiveness of GlOSS for the text-database discovery problem, Proc. of ACM SIGMOD Conference, 126–137.
Google Scholar
Gravano L., Garcia-Molina H., (1995). Generalizing GlOSS to vector-space databases and broker hierarchies, Proc. of VLDB Conference, 78–89.
Google Scholar
Gudivada V.N., Raghavan V.V., Grosky W.I., Kasanagottu R., (1997) Information Retrieval on the World Wide Web, IEEE Internet Computing, 58–68.
Google Scholar
Hartman E.J., Keller J.D., Kowalski, J.M., (1990). Layered neural networks with Gaussian hidden units as approximations, Neural Computation, 2 (2), 210–215.
Article Google Scholar
Hertz J., Krogh A., Palmer R. G., (1991). Introduction to the theory of Neural Computation, Addison-Wesley, New York, NY.
Google Scholar
Howe A., Dreilinger D., (1997). SavvySearch: a meta-search engine that learns which search engines to query, AI Magazine, 18 (2).
Google Scholar
IBM Inc., (1999). http://www.infomarket.com/, IBM InfoMarket.
Google Scholar
Kahle B., Medlar A., (1991). An information system for corporate users: Wide Area Information Servers, Technical Report TMC199, Thinking Machines Corporation.
Google Scholar
Koller D., Sahami M., (1997). Hierarchically classifying documents using very few words, Proc. of Machine Learning Conference, 170–178.
Google Scholar
Langley P., (1988). Machine learning as an experimental science, Machine Learning, 3, 5–8.
Google Scholar
Meng W., Liu K., Yu C., Wang X., Chang Y., Rishe N., (1998). Determining text databases to search in the Internet, Proc. of VLDB Conference, 14–25.
Google Scholar
Minsky M., Papert S., (1969). Perceptrons, Cambridge, MA, MIT Press.
MATH Google Scholar
Salton G., (1971). The SMART Retrieval System - Experiments in Automatic Document Processing, Prentice-Hall Inc., Englewood Cliffs, NJ.
Google Scholar
Salton G., (1991). Developments in automatic text retrieval, Science, 253, 974–979.
Article MathSciNet Google Scholar
Salton G., McGill M., (1983). Introduction to Modern Information Retrieval, McGraw-Hill, New York, NY.
Google Scholar
Selberg E., Etzioni 0., (1995). Multi-service search and comparison using the MetaCrawler, Proc. of the WWW Conference.
Google Scholar
Stone M., (1978). Cross-validation: a review, Mathematische Operationsforschung Statistischen, 9, 127–140.
MATH Google Scholar
Werbos P., (1974). Beyond regression: new tools for prediction and analysis in the behavioral sciences, PhD Thesis, Harvard, Cambridge, MA.
Google Scholar

Download references

Author information

Authors and Affiliations

Department of Computer Science Education, Hanyang University, Seongdong-ku, Seoul, 133-791, Korea
Yong S. Choi

Authors

Yong S. Choi
View author publications
You can also search for this author in PubMed Google Scholar

Editor information

Editors and Affiliations

Institute of Computer Science, Technical University of Lodz, ul. Sterlinga 16/18, 90-217, Lodz, Poland
Piotr S. Szczepaniak
Systems Research Institute, Polish Academy of Sciences, ul. Newelska 6, 01-447, Warsaw, Poland
Piotr S. Szczepaniak & Janusz Kacprzyk &
Facultad de Informática, Universidad Politécnica de Madrid, Campus de Montegancedo, 28660, Madrid, Spain
Javier Segovia
Computer Science Division, Department of Electrical Engineering and Computer Sciences, University of California, 94720-1776, Berkeley, CA, USA
Lotfi A. Zadeh

Rights and permissions

Reprints and permissions

Copyright information

About this chapter

Cite this chapter

Choi, Y.S. (2003). Neural Agent for Text Database Discovery. In: Szczepaniak, P.S., Segovia, J., Kacprzyk, J., Zadeh, L.A. (eds) Intelligent Exploration of the Web. Studies in Fuzziness and Soft Computing, vol 111. Physica, Heidelberg. https://doi.org/10.1007/978-3-7908-1772-0_15

Download citation

DOI: https://doi.org/10.1007/978-3-7908-1772-0_15
Publisher Name: Physica, Heidelberg
Print ISBN: 978-3-7908-2519-0
Online ISBN: 978-3-7908-1772-0
eBook Packages: Springer Book Archive

Publish with us

Policies and ethics