GUIs for Web Data Extraction

Ziegler, Cai-Nicolas

doi:10.1007/978-1-4614-8265-9_1163

GUIs for Web Data Extraction

Cai-Nicolas Ziegler³

Reference work entry
First Online: 01 January 2018

22 Accesses

Synonyms

Visual web data extraction; Visual web information extraction; Wrapper generator GUIs

Definition

While content management systems (CMS) are geared towards adding presentational information to relational and structured data from database systems, thus dynamically generating HTML documents, the goal of GUIs for Web data extraction is diametrically opposed: The commonly semi-automatic Web data extraction tools intend to removeall presentational information from Web pages, so that only pure structured content remains. The extraction process itself does not address single documents, but template types, such as the product page of an online retailer or the news page template of an online journal. That is, for each template type, one set of extraction rules is generated. These extraction rules are defined in a graphical manner, by selecting the pieces of information that are relevant and by assigning labels to them. To this end, GUIs are used that largely resemble Web browsers,...

This is a preview of subscription content, log in via an institution.

Chapter: USD 29.95; Price excludes VAT (USA)

eBook: USD 4,499.99; Price excludes VAT (USA)

Hardcover Book: USD 6,499.99; Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Learn about institutional subscriptions

Author information

Authors and Affiliations

Siemens AG, Munich, Germany
Cai-Nicolas Ziegler

Authors

Cai-Nicolas Ziegler
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Cai-Nicolas Ziegler .

Editor information

Editors and Affiliations

Georgia Institute of Technology College of Computing, Atlanta, GA, USA
Ling Liu
University of Waterloo School of Computer Science, Waterloo, ON, Canada
M. Tamer Özsu

Section Editor information

Computing Lab., Oxford Univ., Oxford, UK
Georg Gottlob

Rights and permissions

Reprints and permissions

Copyright information

About this entry

Cite this entry

Ziegler, CN. (2018). GUIs for Web Data Extraction. In: Liu, L., Özsu, M.T. (eds) Encyclopedia of Database Systems. Springer, New York, NY. https://doi.org/10.1007/978-1-4614-8265-9_1163

Download citation

DOI: https://doi.org/10.1007/978-1-4614-8265-9_1163
Published: 07 December 2018
Publisher Name: Springer, New York, NY
Print ISBN: 978-1-4614-8266-6
Online ISBN: 978-1-4614-8265-9
eBook Packages: Computer ScienceReference Module Computer Science and Engineering

Publish with us

Policies and ethics

GUIs for Web Data Extraction

Synonyms

Definition

Recommended Reading

Author information

Authors and Affiliations

Corresponding author

Editor information

Editors and Affiliations

Section Editor information

Rights and permissions

Copyright information

About this entry

Cite this entry

Download citation

Publish with us

Navigation

Synonyms

Definition

Buying options

Recommended Reading

Author information

Authors and Affiliations

Corresponding author

Editor information

Editors and Affiliations

Section Editor information

Rights and permissions

Copyright information

About this entry

Cite this entry

Download citation

Share this entry

Publish with us

Search

Navigation