Abstract
In the Web, there are classes of pages with similar structuring and contents (e.g., call for papers pages, references, etc), which are interrelated forming clusters (e.g., Science). We propose an architecture of cognitive multiagent systems for information retrieval and extraction from these clusters. Each agent processes one class employing reusable ontologies to recognize pages, extract all possible useful information and communicate with the others agents. Whenever it identifies information interesting to another agent, it forwards this information to that agent. These „hot hints” usually contain much less garbage than search engine results do. The agent architecture presents many sorts of reuse: all the code, DB definitions, knowledge and services of the search engines. We got promising results using Java and Jess.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Preview
Unable to display preview. Download preview PDF.
References
Alvares, L., Sichman; J.: Introdução aos Sistemas Multiagentes. Proceedings of EINE—Escola de Informática do Nordeste, Soc. Brasileira de Computação, Recife, Brazil (1997).
Ambite, J.; Knoblock, C.: Agents for Information Gathering. In Software Agents. Bradshaw, J. (ed.), MIT Press, Pittsburgh, PA, USA (1997).
Appelt, D. E.; Israel, D. J.: Introduction to Information Extraction Technology. International Joint Conference of Artificial Intelligence. Stokholm, Sweden (1999).
Ashish, N.; Knoblock, C.: Wrapper Generation for Semi-structured Internet Sources. SIGMOD Record, 26(4):8–15 (1997).
Baeza-Yates, R, Ribeiro-Neto, B: Modern Information Retrieval. Addison Wesley (1999) 167–9
Cohen, W. W.: Learning Rules that Classify E-mail. http://www.parc.xerox.com/istl/projects/mlia/papers/cohen.ps (1996).
Bittencourt, G.: In the Quest of the Missing Link. Proceedings of the International Joint Conference of Artificial Intelligence. Nagoya, Japan (1997).
Craven, M., McCallum, A. M., DiPasquo, D., Mitchell, T., Freitag, D., Nigam, K., Slattery, S.: Learning to Extract Symbolic Knowledge from the World Wide Web. Technical Report CMU-CS-98-122. School of Computer Science. Carnegie Mellon University(1998).
Embley, D., Campbell, D., Liddle, S., Smith, R.:Ontology-Based Extraction of Information from Data-Rich Unstructured Documents. http://www.deg.byu.edu/papers/cikm98.ps (1998).
Flanaghan, D.: Java Examples in a Nutshell. O’Reilly. Sebastopol,CA,USA.(1997)330–333
Freitas, F., Siebra, C., Ferraz, C., Ramalho, G.: Mediation services for agents integration. Proceedings of SEMISH’99. Soc. Brasileira de Computação (SBC). Rio, Brazil (1999).
Friemann-Hill, E. 1997. Jess,The Java Expert System Shell. http://herzberg.ca.sandia.gov/Jess
Gruber, T..R.: Ontolingua: A Mechanism to Support Portable Ontologies. Technical Report KSL-91-66. Stanford University, Knowledge Systems Laboratory. USA. (1996)
Huhns, M.; Singh, M.: The Agent Test. IEEE Internet Computing.Sep/Oct 97(1997)
Koster, M.: Guidelines for Robot Writers. http://www.eskimo.com/~falken/guidelin.html (1993)
Kushmerick, N: Wrapper Induction. http://www.compapp.dcu.ie/~nick/research/wrappers (1999)
Oates, T.; Prasad, M.; Lesser, V.: Cooperative Information Gathering: A Distributed Problem Solving Approach. Technical Report 94–66. University of Mass.,USA (1994)
Pirolli, P., Pitkow, J., Rao, R.: Silk from a Sow’s Ear: Extracting Usable Structures from the Web. http://www.acm.org/sigchi/chi96/proceedings/papers/Pirolli_2/pp2.html (1995)
Riloff, E.: Information Extraction as a Basis for Portable Text Classification Systems. PhD. thesis. Depart. of Computer Science. University of Mass., Amherst. USA (1994)
Russel, S; Norvig; P.: Artificial Intelligence: A Modern Approach, Prentice-Hall (1995) 10
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2000 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Freitas, F.L.G., Bittencourt, G. (2000). Cognitive Multi-agent Systems for Integrated Information Retrieval and Extraction over the Web. In: Monard, M.C., Sichman, J.S. (eds) Advances in Artificial Intelligence. IBERAMIA SBIA 2000 2000. Lecture Notes in Computer Science(), vol 1952. Springer, Berlin, Heidelberg. https://doi.org/10.1007/3-540-44399-1_32
Download citation
DOI: https://doi.org/10.1007/3-540-44399-1_32
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-41276-2
Online ISBN: 978-3-540-44399-5
eBook Packages: Springer Book Archive