Semantic Data Integration for Life Science Entities

Leser, Ulf

doi:10.1007/978-1-4614-8265-9_627

Reference work entry
First Online: 01 January 2018

Synonyms

Data fusion; Duplicate detection; LSID; Object identification

Definition

An entity is the representation, within a database, of a (not necessarily physical) real-world object such as a gene, a protein, or a disease. To integrate information about the same objects from different databases, these representations must be analyzed to determine which underlying object each one describes. This process is called entity identification. A variant of entity identification is duplicate detection, which analyzes two or more entities to determine whether they represent the same real-world object. Finally, data fusion is the process of generating a single, homogeneous representation from multiple, possibly inconsistent entities that represent the same real-world object.
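The following minimal Python sketch illustrates duplicate detection and data fusion as defined above. It is not taken from the entry: the two records, the field names, the name-similarity test, the 0.9 threshold, and the "prefer the first non-missing value" fusion rule are all assumptions chosen for illustration.

```python
from difflib import SequenceMatcher

# Hypothetical entities from two databases (illustrative values only).
entity_a = {"name": "tumor protein p53", "symbol": "TP53", "organism": "Homo sapiens"}
entity_b = {"name": "Tumor protein P53", "symbol": "TP53", "organism": None, "chromosome": "17"}


def similarity(a, b):
    """Case-insensitive string similarity in the range [0, 1]."""
    return SequenceMatcher(None, a.lower(), b.lower()).ratio()


def is_duplicate(e1, e2, threshold=0.9):
    """Duplicate detection: decide whether two entities appear to
    represent the same real-world object. Comparing only the name
    against a fixed threshold is an assumed, simplistic criterion."""
    return similarity(e1["name"], e2["name"]) >= threshold


def fuse(e1, e2):
    """Data fusion: build one representation from two entities.
    Conflict rule (an assumption): keep e1's value unless it is missing."""
    fused = {}
    for key in sorted(set(e1) | set(e2)):
        v1, v2 = e1.get(key), e2.get(key)
        fused[key] = v1 if v1 is not None else v2
    return fused


if is_duplicate(entity_a, entity_b):
    merged = fuse(entity_a, entity_b)
    print(merged)
```

In this sketch the fused record takes the organism from the first entity and the chromosome from the second. A real system would need a more careful similarity measure and an explicit strategy for resolving conflicting values.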

When entities have globally unique keys, such as ISBN numbers in the case of books, entity identification and duplicate detection are simple. However, in life science databases, one usually has only descriptive...

Author information

Authors and Affiliations

Humboldt University of Berlin, Berlin, Germany
Ulf Leser

Corresponding author

Correspondence to Ulf Leser.

Editor information

Editors and Affiliations

Georgia Institute of Technology College of Computing, Atlanta, GA, USA
Ling Liu
University of Waterloo School of Computer Science, Waterloo, ON, Canada
M. Tamer Özsu

Section Editor information

Robert H. Smith School of Business, University of Maryland, College Park, MD, USA
Louiqa Raschid

Cite this entry

Leser, U. (2018). Semantic Data Integration for Life Science Entities. In: Liu, L., Özsu, M.T. (eds) Encyclopedia of Database Systems. Springer, New York, NY. https://doi.org/10.1007/978-1-4614-8265-9_627

DOI: https://doi.org/10.1007/978-1-4614-8265-9_627
Published: 07 December 2018
Publisher Name: Springer, New York, NY
Print ISBN: 978-1-4614-8266-6
Online ISBN: 978-1-4614-8265-9
eBook Packages: Computer Science; Reference Module Computer Science and Engineering
