Advertisement

Journal of Intelligent Information Systems

, Volume 8, Issue 2, pp 133–153 | Cite as

Learning to Understand Information on the Internet: An Example-Based Approach

  • Mike Perkowitz
  • Robert B. Doorenbos
  • Oren Etzioni
  • Daniel S. Weld
Article

Abstract

The explosive growth of the Web has made intelligent softwareassistants increasingly necessary for ordinary computer users. Bothtraditional approaches—search engines, hierarchical indices—andintelligent software agents require significant amounts of humaneffort to keep up with the Web. As an alternative, we investigate theproblem of automatically learning to interact with informationsources on the Internet. We report on ShopBotand ILA , two implemented agents that learn touse such resources. ShopBot learns how to extract information from onlinevendors using only minimal knowledge about product domains. Giventhe home pages of several online stores, ShopBotautonomously learns how to shop at those vendors. After its learningis complete, ShopBot is able to speedily visitover a dozen software stores and CD vendors, extract productinformation, and summarize the results for the user. ILAlearns to translate information from Internetsources into its own internal concepts. ILAbuilds a model of an information source that specifies the translation between the source's output and ILA 's model of the world. ILA iscapable of leveraging a small amount of knowledge about a domain tolearn models of many information sources. We show that ILA 's learning is fast and accurate, requiring only a smallnumber of queries per information source.

machine learning internet 

Preview

Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.

References

  1. Agre, P. and Chapman, D. (1987). Pengi: An implementation of a theory of activity. In Proc. 6th Nat. Conf. on AI.Google Scholar
  2. Agre, P. and Horswill, I. (1992). Cultural support for improvisation. In Proc. 10th Nat. Conf. on AI(pp. 363–368).Google Scholar
  3. Arens, Yigal, Chee, Chin Y., Hsu, Chun-Nan, and Knoblock, Craig A. (1993). Retrieving and Integrating Data from Multiple Information Sources, International Journal on Intelligent and Cooperative Information Systems, 2(2), 127–158,.Google Scholar
  4. Armstrong, Robert, Freitag, Dayne, Joachims, Thorsten, and Mitchell, Tom. (1995). Webwatcher: A learning apprentice for the world wide web. In Working Notes of the AAAI Spring Symposium: Information Gathering from Heterogeneous, Distributed Environments(pp. 6–12). Stanford University. AAAI Press. To order a copy, contact sss@aaai.org.Google Scholar
  5. Berwick, R.C. and Pilato, S. (1987). Learning Syntax by Automata Induction, Machine Learning, 2, 9–38.Google Scholar
  6. Dent, Lisa, Boticario, Jesus, McDermott, John, Mitchell, Tom, and Zabowski, David. (1992). A personal learning apprentice. In Proc. 10th Nat. Conf. on AI(pp. 96–103).Google Scholar
  7. Doorenbos, R.B., Etzioni, O., and Weld, D.S. (1996). A Scalable Comparison-Shopping Agent for theWorld-Wide Web. Technical Report 96-01-03, University of Washington, Department of Computer Science and Engineering. Available via FTP from pub/ai/ at ftp.cs.washington.edu.Google Scholar
  8. Etzioni, O. and Weld, D. (1994). A Softbot-Based Interface to the Internet, CACM, 37(7), 72–76.Google Scholar
  9. Hammond, Kristen, Burke, Robin, Martin, Charles, and Lytinen, Steven (1995). FAQ finder: A case-based approach to knowledge navigation. In Working Notes of the AAAI Spring Symposium: Information Gathering from Heterogeneous, Distributed Environments(pp. 69–73). Stanford University. AAAI Press. To order a copy, contact sss@aaai.org.Google Scholar
  10. Horswill, I. (1995). Analysis of Adaptation and Environment, Artificial Intelligence, 73(1–2), 1–30.Google Scholar
  11. Kirk, Thomas, Levy, Alon Y., Sagiv, Yehoshua, and Srivastava, Divesh. (1995). The information manifold. In Working Notes of the AAAI Spring Symposium: Information Gathering from Heterogeneous, Distributed Environments(pp. 85–91). Stanford University. AAAI Press.Google Scholar
  12. Knoblock, Craig, Arens, Yigal, and Hsu, Chun-Nan. (1994). Cooperating agents for information retrieval. In Proceedings of the Second International Conference on Cooperative Information Systems. Toronto, Canada.Google Scholar
  13. Knoblock, Craig and Levy, Alon (Eds.), (1995). Working Notes of the AAAI Spring Symposium on Information Gathering from Heterogeneous, Distributed Environments. Stanford University. AAAI Press. To order a copy, contact sss@aaai.org.Google Scholar
  14. Krulwich, B. (1996). The Bargainfinder Agent: Comparison Price Shopping on the Internet. In J. Williams (Ed.), Bots and Other Internet Beasties. SAMS.NET. http://bf.cstar.ac.com/bf/.Google Scholar
  15. Kwok, C. and Weld, D. (1996). Planning to gather information. In Proc. 14th Nat. Conf. on AI.Google Scholar
  16. Levy, A.Y. and Ordille, J.J. (1995). An experiment in integrating internet information sources. In AAAI Fall Symposium on AI Applications on Knowledge Navigation and Retrieval. Cambridge, MA.Google Scholar
  17. Levy, A.Y., Srivastava, Divesh, and Kirk, Thomas. (1995). Data Model and Query Evaluation in Global Information Systems. Journal of Intelligent Information Systems, Special Issue on Networked Information Discovery and Retrieval, 5(2).Google Scholar
  18. Li, Wen-Syan. (1995). Knowledge gathering and matching in heterogeneous databases. In Working Notes of the AAAI Spring Symposium: Information Gathering from Heterogeneous, Distributed Environments. Stanford University. AAAI Press. To order a copy, contact sss@aaai.org.Google Scholar
  19. Lieberman, H. (1995). Letizia: An agent that assists web browsing. In Proc. 15th Int. Joint Conf. on AI(pp. 924–929).Google Scholar
  20. Maes, Pattie. (1994). Agents that ReduceWork and Information Overload, Comm. of the ACM, 37(7), 31–40, 146.Google Scholar
  21. Maes, Pattie and Kozierok, Robyn. (1993). Learning interface agents. In Proceedings of AAAI-93.Google Scholar
  22. Motro, A. and Rakov, I. (1996). Estimating the quality of data in relational databases. In Proceedings of the 1996 Conference on Information Quality(pp. 94–106).Google Scholar
  23. Rajamoney, S. (1993). The Design of Discrimination Experiments, Machine Learning, 12(1/2/3).Google Scholar
  24. Richards, B.L. and Mooney, R.J. (1992). Learning relations by pathfinding. In Proc. 10th Nat. Conf. on AI (pp. 50–55).Google Scholar
  25. Russell, S. (1986). Preliminary steps toward the automation of induction. In Proc. 5th Nat. Conf. on AI(pp. 477–484).Google Scholar
  26. Schlimmer, J.C. and Hermens, L.A. (1993). Software Agents: Completing Patterns and Constructing User Interfaces, Journal of Artificial Intelligence Research, 61–89.Google Scholar
  27. Wiederhold, G. (1992). Mediators in the Architecture of Future Information Systems, IEEE Computer, 38–49.Google Scholar
  28. Wittgenstein, Ludwig. (1958). Philosophical Investigations. Macmillan Publishing Co., Inc. Translated by G.E.M. Anscombe.Google Scholar

Copyright information

© Kluwer Academic Publishers 1997

Authors and Affiliations

  • Mike Perkowitz
    • 1
  • Robert B. Doorenbos
    • 1
  • Oren Etzioni
    • 1
  • Daniel S. Weld
    • 1
  1. 1.Department of Computer Science and EngineeringUniversity of WashingtonSeattle

Personalised recommendations