Skip to main content

Information Aggregation Using the Caméléon# Web Wrapper

  • Conference paper
E-Commerce and Web Technologies (EC-Web 2005)

Part of the book series: Lecture Notes in Computer Science ((LNISA,volume 3590))

Included in the following conference series:

Abstract

Caméléon# is a web data extraction and management tool that provides information aggregation with advanced capabilities that are useful for developing value-added applications and services for electronic business and electronic commerce. To illustrate its features, we use an airfare aggregation example that collects data from eight online sites, including Travelocity, Orbitz, and Expedia. This paper covers the integration of Caméléon# with commercial database management systems, such as MS SQL Server, and XML query languages, such as XQuery.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 39.99
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 54.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.

References

  1. Alatovic, T.: Capabilities Aware, Planner, Optimizer, Executioner for Context Interchange Project. Thesis (S.M.) MIT, Dept. of EE & CS (2001)

    Google Scholar 

  2. Baumgartner, R., Flesca, S., Gottlob, G.: Declarative information extraction, web crawling, and recursive wrapping with Lixto. In: Eiter, T., Faber, W., Truszczyński, M. (eds.) LPNMR 2001. LNCS (LNAI), vol. 2173, p. 21. Springer, Heidelberg (2001)

    Google Scholar 

  3. Chan, C.: OLE DB for the Context Interchange System. Thesis (S.M.) MIT, Dept. of EE & CS (2000)

    Google Scholar 

  4. Chuang, S.W.: A Taxonomy and Analysis of Web Wrapping Technologies. Thesis (S.M.) MIT, Technology and Policy Program (2004)

    Google Scholar 

  5. Firat, A.: Information Integration Using Contextual Knowledge and Ontology Merging Thesis (Ph.D.) MIT (2003)

    Google Scholar 

  6. Garcia-Molina, H., Hammer, J., Ireland, K., Papakonstantinou, V., Ullman, J., Widom, J.: Integrating and Accessing Heterogeneous Information Sources in TSIMMIS. In: Proceedings of the AAAI Symposium on Information Gathering, Stanford, California, March 1995, pp. 61–64 (1995)

    Google Scholar 

  7. Huck, G., Fankhauser, P., Aberer, K., Neuhold, E.J.: JEDI: Extracting and Synthesizing Information from the Web. In: COOPIS 1998. IEEE Computer Society Press, New York (1998) (submitted to)

    Google Scholar 

  8. Hsu, C., Dung, M.: Wrapping semistructured web pages with finite-state transducers. In: Proceedings of the Conference on Autonomous Learning and Discovery CONALD 1998 (1998)

    Google Scholar 

  9. Knoblock, C., Lerman, K., Minton, S., Muslea, I.: Accurately and reliably extracting data from the web: A machine learning approach. IEEE Data Engineering Bulletin 23(4) (2000)

    Google Scholar 

  10. Kushmerick, N., Doorenbos, R., Weld., D.: Wrapper Induction for Information Extraction. In: IJCAI 1997 (August 1997)

    Google Scholar 

  11. Laender, A., Ribeiro-Neto, B., Silva, A., Teixeira, J.: A Brief Survey of Web Data Extraction Tools. In: SIGMOD Record, vol. 31(2) (2002)

    Google Scholar 

  12. Madnick, S., Siegel, M.: Seizing the Opportunity: Exploiting Web Aggregation. MIS Quarterly Executive 1(1) (2002)

    Google Scholar 

  13. Muslea, I., Minton, S., Knoblock, C.: STALKER: Learning extraction rules for semistructure, Web-based information sources. In: Proc. of AAAI 1998: Workshop on AI and Information Integration (1998)

    Google Scholar 

  14. Sahuguent, A., Azavant, F.: W4F: the WysiWyg Web Wrapper Factory. Technical Report, University of Pennsylvania, Department of Computer and Information Science (1998)

    Google Scholar 

  15. Zhu, H., Siegel, M., Madnick, S.: Information Aggregation – A Value-added E-Service. In: Proc. of the International Conference on Technology, Policy, and Innovation: Critical Infrastructures (2001)

    Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2005 Springer-Verlag Berlin Heidelberg

About this paper

Cite this paper

Firat, A., Madnick, S., Yahaya, N.A., Kuan, C.W., Bressan, S. (2005). Information Aggregation Using the Caméléon# Web Wrapper. In: Bauknecht, K., Pröll, B., Werthner, H. (eds) E-Commerce and Web Technologies. EC-Web 2005. Lecture Notes in Computer Science, vol 3590. Springer, Berlin, Heidelberg. https://doi.org/10.1007/11545163_8

Download citation

  • DOI: https://doi.org/10.1007/11545163_8

  • Publisher Name: Springer, Berlin, Heidelberg

  • Print ISBN: 978-3-540-28467-3

  • Online ISBN: 978-3-540-31736-4

  • eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics