Abstract
The exponential growth in the number and size of information services available on the internet has created an urgent need for information agents which act as brokers in that they can autonomously search, gather and integrate information on behalf of a user. Moreover, the inherent volatility of the internet and the wide range of information processing tasks to be carried out, calls for a framework that facilitates both the construction and evolution of such information brokers. This paper proposes such a framework named XIB (eXtensible Information Brokers).
Based on descriptions of relevant information services, XIB supports the interactive generation of an integrated query interface, generates wrappers for each information service dynamically, and returns to the user the composed result to a query. XIB depends heavily on XML-related techniques. More specifically, we will use DTDs to model the input and output of the service, use XML elements to denote the input and output values. By using this representation, service integration is investigated in the form of DTD integration, and query decomposition is studied in the form of XML element decomposition. Within the proposed framework, it is easy to add or remove information services on the internet to a broker, thereby facilitating maintenance, evolution and customization.
Chapter PDF
Similar content being viewed by others
Keywords
References
Vidur Apparao et al, Document Object Model (DOM) Level 1, W3C recommendation, http://www.w3.org/TR/REC-DOM-Level-l/.
Naveen Ashish, Craig Knoblock, Semi-automatic Wrapper Generation for Internet Information Sources, Second IFCIS Conference on Cooperative Information Systems (CoopIS), Charleston, South Carolina, 1997.
P. Atzeni, G. Mecca, P. Merialdo, Semistructured and Structured Data in the Web: Going Back and Forth, In SIGMOD Record, Special Issue on the Workshop on the Management of Semistructured Data, 1997.
C. Baru, A. Gupta, B. Ludaescher, R. Marciano, Y. Papakonstantinou, P. Velikhov, XML-Based Information Mediation with MIX, In Exhibitions Program of ACM SIGMOD 99.
Tim Bray, Jean Paoli, C. M. Sperberg-McQueen, Extensible Markup Language(XML) 1.0, W3C recommendation, http://wwww.w3.org/TR/REC-xml, 1998.
Chen-Chuan K. Chang, Hector Garcia-Molina, Andreas Paepcke, Predicate Rewriting for Translating Boolean Queries in a Heterogeneous Information System, ACM Transactions on Information Systems, vol. 17, no. 1, Jan. 1999.
Chen-Chuan K. Chang, Hector Garcia-Molina, Mind Your Vocabulary: Query Mapping Across Heterogeneous Information Sources, Proc. of the 1999 ACM SIG-MOD International Conference On Management of Data, Jun. 1999.
James Clark, Stephen Deach, Extensible Stylesheet Language(XSL), W3C working draft, http://www/w3/org/TR/WD-xsl, 1998.
K. Decker, K. Sycara, M. Williamson. Matchmaking and Brokering. Proceedings of the Second International Conference on Multi-Agent Systems (ICMAS-96), Dec-96.
Alin Deutsch, Mary Fernandez, Daniela Florescu, Alon Levy, Dan Suciu, XML-QL: A Query Language for XML, W3C note, http://www.w3.org/TR/NOTE-xml-ql, 1998.
Craig A. Knoblock, Steven Minton, Jose Luis Ambite, Naveen Ashish, Pragnesh Jay Modi, Ion Muslea, Andrew G., Philpot, and Sheila Tejada. Modeling web sources for information integration, Proceedings of the Fifteenth National Conference on Artificial Intelligence, Madison, WI, 1998.
David Konopnicki, Oded Shmuedi, A comprehensive framework for querying and integrating WWW Data and services, Fourth IFCIS International Conference on Cooperative Information Systems, Edinburgh, 1999.
Bruce Krulwich, Automating the Internet Agents as User Surrogates, IEEE Internet computing, Vol. 1, No. 4, July/August 1997.
Nicholas Kushmerick, Daniel Weld, Robert Doorenbos, Wrapper induction for information extraction, IJCAI’97.
Andrew Layman, et al, XML Data, W3C note, http://www.w3.org/TR/1998/NOTE-XML-data-0105.
Alon Y. Levy, Anand Rajaraman and Joann J. Ordille, Querying Heterogeneous Information Sources Using Source Descriptions, Proceedings of the 22nd International Conference on Very Large Databases, VLDB-96, Bombay, India, September, 1996.
Hannes Marais and Tom Rodeheffer. Automating the Web with WebL. In Dr. Dobb’s Journal, January 1999.
Mihaila, George and Raschid, Louiqa, Locating Data Repositories using XML, W3C Workshop on XML and Querying the Web, 1998.
Makoto Murata, Automatically Constructing the Intersection/Union/Difference of Two Schemas, XTech’99, march 7–11, 1999.
I. Muslea and S. Minton and C. Knoblock, STALKER: Learning Extraction Rules for Semistructured, Web-based Information Sources, AAAI-98 Workshop on AI and Information Integration, 1998, 74–81.
Y. Papakonstantinou, A. Gupta, L. Haas, Capabilities-Based Query Rewriting in Mediator Systems (Extended Version), in DAPD.
Raschid, Louiqa and Vidal, Maria Esther and Gruser, Jean-Robert. A Flexible Meta-Wrapper Interface for Autonomous Distributed Information Sources, Under Review. http://www.umiacs.umd.edu/users/mvidal/
Arnaud Sahuguet, Fabien Azavant, Wysiwyg Web Wrapper Factory (W4F), 1999.
K. Sycara, J. Lu, M. Klusch, S. Widoff, Matchmaking among Heterogeneous Agents on the Internet, in Proceedings of the 1999 AAAI Spring Symposium on Intelligent Agents in Cyberspace, Stanford University, USA 22–24 March 1999.
Vasilis Vassalos, Y. Papakonstantinou, Expressive Capabilities Description Languages and Query Rewriting Algorithms, http://www-cse.ucsd.edu/yan-nis/papers/vpcap2.ps
Maria Esther Vidal, Louiqa Raschid, Jean Robert Gruser, A Meta-Wrapper for Scaling up to Multiple Autonomous Distributed Information Sources, In Proceedings CoopIS’98.
WIDL: Application Integration with XML, in “XML: Principles, Tools, and Techniques”, the October print issue of O’Reilly’s World Wide Web Journal, fall 1997.
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2000 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Lu, J., Mylopoulos, J., Ho, J. (2000). Towards Extensible Information Brokers Based on XML. In: Wangler, B., Bergman, L. (eds) Advanced Information Systems Engineering. CAiSE 2000. Lecture Notes in Computer Science, vol 1789. Springer, Berlin, Heidelberg. https://doi.org/10.1007/3-540-45140-4_4
Download citation
DOI: https://doi.org/10.1007/3-540-45140-4_4
Published:
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-67630-0
Online ISBN: 978-3-540-45140-2
eBook Packages: Springer Book Archive