Abstract
The growing volume of biomedical data available on the Web has contributed to numerous scientific advancements. At the same time, the complex, versatile and disparate nature of the data can overburden the knowledge discovery and data-driven hypothesis generation by scientists. Ontologies have been proposed to address the data integration challenge, however, creating useful domain-specific ontologies and populating them with high quality instances is tedious and time-consuming. In this paper, we present the mOntage framework to rapidly create ontologies representing data in a specific area of interest. We show how the mOntage framework can be used to create and populate biomedical ontologies from existing data sources. The classes and properties of the ontology being created are mapped to and instantiated from the existing data sources by executing suitable SPARQL queries. We illustrate our framework by creating a Phosphatase Ontology and show how it can serve as an important source of knowledge in the area of phosphatases.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
References
A prototype knowledge base for the life sciences. Available from: http://www.w3.org/TR/hcls-kb/
Bizer, C., Heath, T., Berners-Lee, T.: Linked data - the story so far. Int. J. Seman. Web Inf. Syst. 5, 21 (2009)
Gruber, T.R.: Toward principles for the design of ontologies used for knowledge sharing. Int. J. Hum. Comput. Stud. 43(4–5), 907–928 (1995)
Smith, B., et al.: The OBO foundry: coordinated evolution of ontologies to support biomedical data integration. Nat. BioTechnol. 25(11), 1251–1255 (2007)
Noy, N.F., et al.: BioPortal: ontologies and integrated data resources at the click of a mouse. Nucl. Acids Res. 37(suppl. 2), W170–W173 (2009)
Ashburner, M., et al.: Gene ontology: tool for the unification of biology. Gene Ontol. Consortium. Nat. Genet. 25(1), 25–29 (2000)
Natale, D.A., et al.: The protein ontology: a structured representation of protein forms and complexes. Nucl. Acids Res. 39(Database issue), D539–D545 (2011)
Dastgheib, S., Mesbah, A., Kochut, K.: Montage: creating self-populating domain ontologies from linked open data. Int. J. Seman. Comput. 7(04), 427–453 (2013)
Dastgheib, S., Mesbah, A., Kochut, K.: mOntage: building domain ontologies from linked open data. In: International Conference on Semantic Computing (ICSC). IEEE, Irvine (2013)
Gosal, G., Kochut, K.J., Kannan, N.: ProKinO: an ontology for integrative analysis of protein kinases in cancer. PLoS ONE 6(12), e28782 (2011)
McSkimming, D.I., et al.: ProKinO: a unified resource for mining the cancer kinome. Hum. Mutat. 36(2), 175–186 (2015)
Gosal, G.P.S., Kannan, N., Kochut, K.J.: ProKinO: a framework for protein kinase ontology. In: 2011 IEEE International Conference on Bioinformatics and Biomedicine (BIBM). IEEE (2011)
Forbes, S.A., et al.: The catalogue of somatic mutations in cancer (COSMIC). Current Protoc. Hum. Genet. 10–11 (2008)
Croft, D., et al.: Reactome: a database of reactions, pathways and biological processes. Nucl. Acids Res. 39(suppl. 1), D691–D697 (2011)
Bairoch, A., et al.: The universal protein resource (UniProt). Nucl. Acids Res. 33(suppl. 1), D154–D159 (2005)
He, R.-J., et al.: Protein tyrosine phosphatases as potential therapeutic targets. Acta Pharmacologica Sinica 35, 1227–1246 (2014)
McConnell, J.L., Wadzinski, B.E.: Targeting protein serine/threonine phosphatases for drug development. Mol. Pharmacol. 75(6), 1249–1261 (2009)
Zhang, M., et al.: Viewing serine/threonine protein phosphatases through the eyes of drug designers. FEBS J. 280(19), 4739–4760 (2013)
Wolstencroft, K., et al.: PhosphaBase: an ontology-driven database resource for protein phosphatases. Proteins: Struct. Funct. Bioinf. 58(2), 290–294 (2005)
Horrocks, I.: DAML+OIL: a description logic for the semantic web. IEEE Data Eng. Bull. 25(1), 4–9 (2002)
Apweiler, R., et al.: The InterPro database, an integrated documentation resource for protein families, domains and functional sites. Nucl. Acids Res. 29(1), 37–40 (2001)
Hamosh, A., et al.: Online Mendelian Inheritance in Man (OMIM), a knowledgebase of human genes and genetic disorders. Nucl. Acids Res. 33(suppl. 1), D514–D517 (2005)
Duan, G., Li, X., Köhn, M.: The human DEPhOsphorylation database DEPOD: a 2015 update. Nucl. Acids Res. 43, D531–D535 (2014). doi:10.1093/nar/gku1009
Composer, T.: TopBraid Composer 2007 features and getting started guide version 1.0, created by TopQuadrant, US (2007)
Weiten, M.: OntoSTUDIO® as a ontology engineering environment. In: Davies, J., Grobelnik, M., Mladenić, D. (eds.) Semantic Knowledge Management, pp. 51–60. Springer, Heidelberg (2009)
von Eschenbach, A.C., Buetow, K.: Cancer informatics vision: caBIGâ„¢. Cancer Inf. 2, 22 (2006)
Knoblock, C.A., Szekely, P., Ambite, J.L., Goel, A., Gupta, S., Lerman, K., Muslea, M., Taheriyan, M., Mallick, P.: Semi-automatically mapping structured sources into the semantic web. In: Simperl, E., Cimiano, P., Polleres, A., Corcho, O., Presutti, V. (eds.) ESWC 2012. LNCS, vol. 7295, pp. 375–390. Springer, Heidelberg (2012)
Sahoo, S.S., et al.: An ontology-driven semantic mash-up of gene and biological pathway information: application to the domain of nicotine dependence. J. Biomed. Inf. 41(5), 752 (2008)
Jentzsch, A., et al.: Linking open drug data. In: Triplification Challenge of the International Conference on Semantic Systems (2009)
Hassanzadeh, O., et al.: Linkedct: a linked data space for clinical trials (2009). arXiv preprint arXiv:0908.0567
Lenzerini, M.: Data integration: a theoretical perspective. In: Proceedings of the twenty-first ACM SIGMOD-SIGACT-SIGART Symposium on Principles of Database Systems. ACM (2002)
Queralt-Rosinach, N., Furlong, L.I.: DisGeNET RDF: a gene-disease association linked open data resource. In: SWAT4LS (2013)
Lin, Y.-C., et al.: SCP phosphatases suppress renal cell carcinoma by stabilizing PML and inhibiting mTOR/HIF signaling. Cancer Res. 74(23), 6935–6946 (2014)
Humtsoe, J.O., et al.: Lipid phosphate phosphatase 3 stabilization of β-catenin induces endothelial cell migration and formation of branching point structures. Mol. Cell. Biol. 30(7), 1593–1606 (2010)
Acknowledgment
Funding for NK from the National Science Foundation (MCB-1149106) is acknowledged.
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2015 Springer International Publishing Switzerland
About this paper
Cite this paper
Dastgheib, S., McSkimming, D.I., Kannan, N., Kochut, K. (2015). Creating Biomedical Ontologies Using mOntage. In: Ashish, N., Ambite, JL. (eds) Data Integration in the Life Sciences. DILS 2015. Lecture Notes in Computer Science(), vol 9162. Springer, Cham. https://doi.org/10.1007/978-3-319-21843-4_9
Download citation
DOI: https://doi.org/10.1007/978-3-319-21843-4_9
Published:
Publisher Name: Springer, Cham
Print ISBN: 978-3-319-21842-7
Online ISBN: 978-3-319-21843-4
eBook Packages: Computer ScienceComputer Science (R0)