Information Structuring

  • Dieter Fensel
  • Borys Omelayenko
  • Ying Ding
  • Michel Klein
  • Alan Flett
  • Ellen Schulten
  • Guy Botquin
  • Mike Brown
  • Gloria Dabiri
Chapter
Part of the The Springer International Series in Engineering and Computer Science book series (SECS, volume 710)

Abstract

The input for the overall content management process is product catalogs in various formats and without much standardization. Therefore, the information must first be extracted from various information sources and moulded into a structured and well-defined form. Currently, most of this work is done manually. However, there is a strong need to automate the extraction of information from unstructured and semi-structured multi-media information in order to improve the overall productivity of the content management process.

Keywords

Information Extraction Product Schema Word Sense Extraction Rule Extraction Pattern 
These keywords were added by machine and not by the authors. This process is experimental and the keywords may be updated as the learning algorithm improves.

Preview

Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.

Copyright information

© Springer Science+Business Media New York 2002

Authors and Affiliations

  • Dieter Fensel
    • 1
  • Borys Omelayenko
    • 1
  • Ying Ding
    • 1
  • Michel Klein
    • 1
  • Alan Flett
    • 1
  • Ellen Schulten
    • 2
  • Guy Botquin
    • 3
  • Mike Brown
    • 4
  • Gloria Dabiri
    • 5
  1. 1.Vrije Universiteit AmsterdamThe Netherlands
  2. 2.Heloise Ontology AssociatesLondonUK
  3. 3.AlexsysLasneBelgium
  4. 4.Global Transactions, Ltd.BerlinGermany
  5. 5.Freie Universität BerlinBerlinGermany

Personalised recommendations