Skip to main content

Data Integration in Web Data Extraction System

  • Living reference work entry
  • First Online:
Book cover Encyclopedia of Database Systems
  • 78 Accesses

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Institutional subscriptions

Recommended Reading

  1. Baumgartner R, Flesca S, Gottlob G. Visual web information extraction with Lixto. In: Proceedings of the 27th International Conference on Very Large Data Bases; 2001. p. 119–28.

    Google Scholar 

  2. Berglund A, Boag S, Chamberlin D, Rernandez MF, Kay M, Robie J, Simeon J. editors. XML XPath language 2.0. W3C recommendation; 2007.

    Google Scholar 

  3. Bernstein PA, Melnik S, Petropoulos M, Quix C. Industrial-strength schema matching. ACM SIGMOD Rec. 2004;33(4):38–43.

    Article  Google Scholar 

  4. Bing L, Chen-Chuan-Chang K. Editorial: special issue on web content mining. ACM SIGKDD Explor Newsl. 2004;6(2):1–4.

    Article  Google Scholar 

  5. Boag S, Chamberlin D, Fernandez MF, Florescu D, Robie J, Simeon J. editors. XQuery 1.0. An XML query language. W3C recommendation; 2007.

    Google Scholar 

  6. Fodor O, Werthner E. Harmonise: a step toward an interoperable e-tourism marketplace. Intl J Electron Commer. 2005;9(2):11–39.

    Google Scholar 

  7. Gravano L, Panagiotis GI, Koudas N, Srivastava D. Text joins in an RDBMS for web data integration. In: Proceedings of the 12th International World Wide Web Conference; 2003. p. 90–101.

    Google Scholar 

  8. Halevy A, Rajaraman A, Ordille J. Data integration: the teenage years. In: Proceedings of the 32nd International Conference on Very Large Data Bases; 2006. p. 9–18.

    Google Scholar 

  9. Harmonise Framework. Available at: http://sourceforge.net/projects/hmafra/

  10. Herzog M, Gottlob G. InfoPipes: a flexible framework for m-commerce applications. In: Proceedings of the 2nd International Workshop on Technologies for E-Services; 2001. p. 175–86.

    Google Scholar 

  11. Kay M, editor. XSL transformations. Version 2.0. W3C recommendation; 2007.

    Google Scholar 

  12. Kirk T, Levy AY, Sagiv Y, Srivastava D. The information manifold. In: Proceedings of the Working Notes of the AAAI Spring Symposium on Information Gathering from Heterogeneous, Distributed Environments. Stanford University: AAAI Press; 1995. p. 85–91.

    Google Scholar 

  13. Ludäscher B, Himmeröder R, Lausen G, May W, Schlepphorst C. Managing semistructured data with florid: a deductive object-oriented perspective. Inf Syst. 1998;23(9):589–613.

    Article  Google Scholar 

  14. May W, Lausen G. A uniform framework for integration of information from the web. Inf Syst. 2004;29:59–91.

    Article  Google Scholar 

  15. Myllymaki J. Effective web data extraction with standard XML technologies. Comput Netw. 2002;39(5):635–44.

    Article  Google Scholar 

  16. Rahm E, Bernstein PA. A survey of approaches to automatics schema matching. VLDB J. 2001;10(4):334–50.

    Article  MATH  Google Scholar 

  17. Salton G, McGill MJ. Introduction to modern information retrieval. New York: McGraw-Hill; 1983.

    MATH  Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Marcus Herzog .

Editor information

Editors and Affiliations

Section Editor information

Rights and permissions

Reprints and permissions

Copyright information

© 2016 Springer Science+Business Media New York

About this entry

Cite this entry

Herzog, M. (2016). Data Integration in Web Data Extraction System. In: Liu, L., Özsu, M. (eds) Encyclopedia of Database Systems. Springer, New York, NY. https://doi.org/10.1007/978-1-4899-7993-3_1161-2

Download citation

  • DOI: https://doi.org/10.1007/978-1-4899-7993-3_1161-2

  • Received:

  • Accepted:

  • Published:

  • Publisher Name: Springer, New York, NY

  • Online ISBN: 978-1-4899-7993-3

  • eBook Packages: Springer Reference Computer SciencesReference Module Computer Science and Engineering

Publish with us

Policies and ethics