Skip to main content

Neural Agent for Text Database Discovery

  • Chapter
  • 227 Accesses

Part of the book series: Studies in Fuzziness and Soft Computing ((STUDFUZZ,volume 111))

Abstract

As the number and diversity of text databases on the Internet increases rapidly, users are faced with finding the text databases that are relevant to the user query. Identifying the relevant text databases out of many candidates for a given query is called the text database discovery problem. In this paper, we propose a novel approach, a neural approach, to the text database discovery problem. First, we present a neural agent that learns about underlying text databases from the user’s relevance feedback. For a given query, the neural agent, which is sufficiently trained on the basis of the neural net learning mechanism, discovers the text databases associated with the relevant documents and retrieves those documents effectively. In order to scale our approach with the large number of text databases, we also propose the hierarchical organization of neural agents which reduces the total training cost at the acceptable level. Finally, we evaluate the performance of our approach by comparing it to those of the conventional well-known approaches.

This is a preview of subscription content, log in via an institution.

Buying options

Chapter
USD   29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD   129.00
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD   169.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info
Hardcover Book
USD   169.99
Price excludes VAT (USA)
  • Durable hardcover edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Learn about institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.

References

  1. Broomhead D. S., Lowe D., (1988). Multivariable functional interpolation and adaptive networks, Complex Systems, 2 (3), 321–355.

    MathSciNet  MATH  Google Scholar 

  2. Chen S., Cowan C. F. N., Grant P. M., (1991). Orthogonal Least Squares Learning Algorithm for Radial Basis Function Networks, IEEE Transactions on Neural Networks, 2 (2), 302–309.

    Article  Google Scholar 

  3. Freeman J.A., Skapura D.M., (1992). Neural Networks Algorithms, Applications, and Programming Techniques, Addison-Wesley, MA.

    Google Scholar 

  4. Gravano L., Garcia-Molina H., Tomasic A., (1994). The effectiveness of GlOSS for the text-database discovery problem, Proc. of ACM SIGMOD Conference, 126–137.

    Google Scholar 

  5. Gravano L., Garcia-Molina H., (1995). Generalizing GlOSS to vector-space databases and broker hierarchies, Proc. of VLDB Conference, 78–89.

    Google Scholar 

  6. Gudivada V.N., Raghavan V.V., Grosky W.I., Kasanagottu R., (1997) Information Retrieval on the World Wide Web, IEEE Internet Computing, 58–68.

    Google Scholar 

  7. Hartman E.J., Keller J.D., Kowalski, J.M., (1990). Layered neural networks with Gaussian hidden units as approximations, Neural Computation, 2 (2), 210–215.

    Article  Google Scholar 

  8. Hertz J., Krogh A., Palmer R. G., (1991). Introduction to the theory of Neural Computation, Addison-Wesley, New York, NY.

    Google Scholar 

  9. Howe A., Dreilinger D., (1997). SavvySearch: a meta-search engine that learns which search engines to query, AI Magazine, 18 (2).

    Google Scholar 

  10. IBM Inc., (1999). http://www.infomarket.com/, IBM InfoMarket.

    Google Scholar 

  11. Kahle B., Medlar A., (1991). An information system for corporate users: Wide Area Information Servers, Technical Report TMC199, Thinking Machines Corporation.

    Google Scholar 

  12. Koller D., Sahami M., (1997). Hierarchically classifying documents using very few words, Proc. of Machine Learning Conference, 170–178.

    Google Scholar 

  13. Langley P., (1988). Machine learning as an experimental science, Machine Learning, 3, 5–8.

    Google Scholar 

  14. Meng W., Liu K., Yu C., Wang X., Chang Y., Rishe N., (1998). Determining text databases to search in the Internet, Proc. of VLDB Conference, 14–25.

    Google Scholar 

  15. Minsky M., Papert S., (1969). Perceptrons, Cambridge, MA, MIT Press.

    MATH  Google Scholar 

  16. Salton G., (1971). The SMART Retrieval System - Experiments in Automatic Document Processing, Prentice-Hall Inc., Englewood Cliffs, NJ.

    Google Scholar 

  17. Salton G., (1991). Developments in automatic text retrieval, Science, 253, 974–979.

    Article  MathSciNet  Google Scholar 

  18. Salton G., McGill M., (1983). Introduction to Modern Information Retrieval, McGraw-Hill, New York, NY.

    Google Scholar 

  19. Selberg E., Etzioni 0., (1995). Multi-service search and comparison using the MetaCrawler, Proc. of the WWW Conference.

    Google Scholar 

  20. Stone M., (1978). Cross-validation: a review, Mathematische Operationsforschung Statistischen, 9, 127–140.

    MATH  Google Scholar 

  21. Werbos P., (1974). Beyond regression: new tools for prediction and analysis in the behavioral sciences, PhD Thesis, Harvard, Cambridge, MA.

    Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2003 Springer-Verlag Berlin Heidelberg

About this chapter

Cite this chapter

Choi, Y.S. (2003). Neural Agent for Text Database Discovery. In: Szczepaniak, P.S., Segovia, J., Kacprzyk, J., Zadeh, L.A. (eds) Intelligent Exploration of the Web. Studies in Fuzziness and Soft Computing, vol 111. Physica, Heidelberg. https://doi.org/10.1007/978-3-7908-1772-0_15

Download citation

  • DOI: https://doi.org/10.1007/978-3-7908-1772-0_15

  • Publisher Name: Physica, Heidelberg

  • Print ISBN: 978-3-7908-2519-0

  • Online ISBN: 978-3-7908-1772-0

  • eBook Packages: Springer Book Archive

Publish with us

Policies and ethics