Discovering and Building Semantic Models of Web Sources

Knoblock, Craig A.

doi:10.1007/978-3-642-02121-3_2

Craig A. Knoblock²⁵

Part of the book series: Lecture Notes in Computer Science ((LNISA,volume 5554))

Included in the following conference series:

European Semantic Web Conference

2421 Accesses

Abstract

To achieve widespread use of the Semantic Web depends on having a critical mass of Web data available with semantic annotations. Since there are a huge number of sources available today without any such annotations, the challenge is how to find and build semantic models for these sources. In this talk I will describe an integrated end-to-end approach that automatically discovers information-producing web sources, invokes and extracts the data from these sources, builds semantic models of the sources, and validates the results by comparing the data produced by the source with the model of the source. These techniques are implemented in a system called DEIMOS, which integrates a diverse set of technologies to completely automate this task. DEIMOS starts with a “seed” source and finds other similar sources online using data from a social networking web site. Next the system learns how to invoke these sources through experimentation and then extracts data from these sources with automatic wrapping techniques. Finally, DEIMOS learns a semantic model of a source, which identifies the semantic types of the data produced by a source as well as the function that maps the inputs to the outputs. I will describe the challenges in integrating the component technologies into a unified approach to discovering, extracting and modeling new online sources. I will also present an evaluation of the integrated system on three different domains to demonstrate that it can automatically discover and model new Web sources.

This talk is based on joint work with Jose Luis Ambite, Mark Carman, Cenk Gazen, Kristina Lerman, Steven Minton, Anon Plangprasopchok, and Tom Russ. This research is based upon work supported in part by the National Science Foundation under award number IIS-0535182, in part by the Air Force Office of Scientific Research under grant number FA9550-07-1-0416, and in part by the Defense Advanced Research Projects Agency (DARPA) under Contract No. FA8750-07-D-0185/0004.

Download to read the full chapter text

Chapter PDF

Assigning Semantic Labels to Data Sources

Integrating Concepts and Knowledge in Large Content Networks

Article 27 August 2014

Semantic Web Search and Inductive Reasoning

Author information

Authors and Affiliations

Information Sciences Institute, University of Southern California, 4676 Admiralty Way, Marina del Rey, CA 90292, USA
Craig A. Knoblock

Authors

Craig A. Knoblock
View author publications
You can also search for this author in PubMed Google Scholar

Editor information

Editors and Affiliations

VU University Amsterdam, The Netherlands
Lora Aroyo & Eyal Oren &
Fondazione Bruno Kessler (FBK) - IRST Center for Information Technology, Via Sommarive 18, Povo, 38100, Trento, Italy
Paolo Traverso
University of Sheffield, United Kingdom
Fabio Ciravegna
TU Delft, The Netherlands
Philipp Cimiano
Talis Information Ltd., United Kingdom
Tom Heath
Semantic Computing Research Group (SeCo), Helsinki University of Technology, and University of Helsinki,, Finland
Eero Hyvönen
The Institute of Scientific and Industrial Research, Osaka University, 8-1 Mihogaoka, Ibaraki, 567-0047, Osaka, Japan
Riichiro Mizoguchi
Knowledge Media Institute, The Open University, UK
Marta Sabou
STI Innsbruck, University of Innsbruck, Technikerstr. 21a, 6020, Innsbruck, Austria
Elena Simperl

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Knoblock, C.A. (2009). Discovering and Building Semantic Models of Web Sources. In: Aroyo, L., et al. The Semantic Web: Research and Applications. ESWC 2009. Lecture Notes in Computer Science, vol 5554. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-02121-3_2

Download citation

DOI: https://doi.org/10.1007/978-3-642-02121-3_2
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-642-02120-6
Online ISBN: 978-3-642-02121-3
eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics

Discovering and Building Semantic Models of Web Sources

Abstract

Chapter PDF

Similar content being viewed by others

Assigning Semantic Labels to Data Sources

Integrating Concepts and Knowledge in Large Content Networks

Semantic Web Search and Inductive Reasoning

Author information

Authors and Affiliations

Editor information

Editors and Affiliations

Rights and permissions

Copyright information

About this paper

Cite this paper

Download citation

Publish with us

Navigation

Discovering and Building Semantic Models of Web Sources

Abstract

Chapter PDF

Similar content being viewed by others

Assigning Semantic Labels to Data Sources

Integrating Concepts and Knowledge in Large Content Networks

Semantic Web Search and Inductive Reasoning

Author information

Authors and Affiliations

Editor information

Editors and Affiliations

Rights and permissions

Copyright information

About this paper

Cite this paper

Download citation

Share this paper

Publish with us

Search

Navigation