Abstract
The development of high throughput technology in biological and medical domains has seen a growing intervention of informatics support. Indeed, the big amount of data produced is difficult to analyse and interpret in terms of time consuming and number of different resources used. In this context, the challenge would be to have an integrated and multi component database with a user friendly interface able to solve biological problems without a priori high-level of bioinformatics knowledge. This need arises from the evidence that biologists have multi-task and multi-levels problems to solve. To this aim, we propose a bottom-up, graph-based approach for integrating bioinformatics resources, usually databases, starting from typical biological scenarios, in order to solve novel bioinformatics problems. The integrated resources can be queried by means of a graph traversal language such as Gremlin.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
References
Amberger, J.S., Bocchini, C.A., Schiettecatte, F., Scott, A.F., Hamosh, A.: Omim. org: online mendelian inheritance in man (omim®), an online catalog of human genes and genetic disorders. Nucleic Acids Res. 43(D1), D789–D798 (2015)
Apache tinkerpop: the gremlin traversal language. https://github.com/tinkerpop/gremlin/wiki
Betel, D., Wilson, M., Gabow, A., Marks, D.S., Sander, C.: The microrna. org resource: targets and expression. Nucleic Acids Res. 36(suppl 1), D149–D153 (2008)
Bhattacharya, A., Ziebarth, J.D., Cui, Y.: Polymirts database 3.0: linking polymorphisms in micrornas and their target sites with human diseases and biological pathways. Nucleic Acids Res. 42(D1), D86–D91 (2014)
Botstein, D., Cherry, J.M., Ashburner, M., Ball, C., Blake, J., Butler, H., Davis, A., Dolinski, K., Dwight, S., Eppig, J., et al.: Gene ontology: tool for the unification of biology. Nat. Genet. 25(1), 25–29 (2000)
Brennecke, J., Stark, A., Russell, R.B., Cohen, S.M.: Principles of microrna-target recognition. PLoS Biol. 3(3), e85 (2005)
Croft, D., Mundo, A.F., Haw, R., Milacic, M., Weiser, J., Wu, G., Caudy, M., Garapati, P., Gillespie, M., Kamdar, M.R., et al.: The reactome pathway knowledgebase. Nucleic Acids Res. 42(D1), D472–D477 (2014)
Dennis Jr., G., Sherman, B.T., Hosack, D.A., Yang, J., Gao, W., Lane, H.C., Lempicki, R.A., et al.: David: database for annotation, visualization, and integrated discovery. Genome Biol. 4(5), P3 (2003)
Dweep, H., Gretz, N.: Mirwalk 2.0: a comprehensive atlas of microrna-target interactions. Nat. Methods 12(8), 697–697 (2015)
Rojas, I., Ortuño, F. (eds.): IWBBIO 2017. LNCS, vol. 10209. Springer, Cham (2017)
Fiannaca, A., La Paglia, L., La Rosa, M., Messina, A., Storniolo, P., Urso, A.: Integrated DB for bioinformatics: a case study on analysis of functional effect of MiRNA SNPs in cancer. In: Renda, M.E., Bursa, M., Holzinger, A., Khuri, S. (eds.) ITBAM 2016. LNCS, vol. 9832, pp. 214–222. Springer, Cham (2016). doi:10.1007/978-3-319-43949-5_17
Fiannaca, A., La Rosa, M., La Paglia, L., Rizzo, R., Urso, A.: Analysis of mirna expression profiles in breast cancer using biclustering. BMC Bioinform. 16(Suppl 4), S7 (2015)
Gray, K.A., Yates, B., Seal, R.L., Wright, M.W., Bruford, E.A.: Genenames.org: the HGNC resources in 2015. Nucleic Acids Res. 43(D1), D1079–D1085 (2015)
Huang, D.W., Sherman, B.T., Lempicki, R.A.: Bioinformatics enrichment tools: paths toward the comprehensive functional analysis of large gene lists. Nucleic Acids Res. 37(1), 1–13 (2009)
Kanehisa, M., Sato, Y., Kawashima, M., Furumichi, M., Tanabe, M.: Kegg as a reference resource for gene and protein annotation. Nucleic Acids Res. 44(D1), D457–D462 (2016)
Kozomara, A., Griffiths-Jones, S.: MiRBase: integrating microRNA annotation and deep-sequencing data. Nucleic Acids Res. 39(Database issue), D152–7 (2011)
Kumar, S., Dudley, J.: Bioinformatics software for biologists in the genomics era. Bioinformatics 23(14), 1713–1717 (2007)
Laganà , A., Forte, S., Giudice, A., Arena, M., Puglisi, P.L., Giugno, R., Pulvirenti, A., Shasha, D., Ferro, A.: Miro: a mirna knowledge base. Database 2009, bap. 008 (2009)
Mi, H., Poudel, S., Muruganujan, A., Casagrande, J.T., Thomas, P.D.: Panther version 10: expanded protein families and functions, and analysis tools. Nucleic Acids Res. 44(D1), D336–D342 (2016)
Rodriguez, M.A.: The gremlin graph traversal machine and language (invited talk). In: Proceedings of the 15th Symposium on Database Programming Languages - DBPL 2015, pp. 1–10. ACM Press, New York (2015)
Romero-Cordoba, S.L., Salido-Guadarrama, I., Rodriguez-Dorantes, M., Hidalgo-Miranda, A.: Mirna biogenesis: biological impact in the development of cancer. Cancer Biol. Ther. 15(11), 1444–1455 (2014)
Rosenbloom, K.R., Armstrong, J., Barber, G.P., Casper, J., Clawson, H., Diekhans, M., Dreszer, T.R., Fujita, P.A., Guruvadoo, L., Haeussler, M., et al.: The UCSC genome browser database: 2015 update. Nucleic Acids Res. 43(D1), D670–D681 (2015)
Schuler, G.D., Epstein, J.A., Ohkawa, H., Kans, J.A.: Entrez: molecular biology database and retrieval system. Methods Enzymol. 266, 141–162 (1996)
Sigrist, C.J.A., de Castro, E., Cerutti, L., Cuche, B.A., Hulo, N., Bridge, A., Bougueleret, L., Xenarios, I.: New and continuing developments at PROSITE. Nucleic Acids Res. 41(Database issue), D344–7 (2013)
Szklarczyk, D., Franceschini, A., Wyder, S., Forslund, K., Heller, D., Huerta-Cepas, J., Simonovic, M., Roth, A., Santos, A., Tsafou, K.P., Kuhn, M., Bork, P., Jensen, L.J., von Mering, C.: STRING v10: protein-protein interaction networks, integrated over the tree of life. Nucleic Acids Res. 43(Database issue), D447–52 (2015)
The Gene ontology consortium: going forward. Nucleic Acids Res. 43(D1), D1049–D1056 (2015)
The UniProt Consortium: UniProt: a hub for protein information. Nucleic Acids Res. 43(D1), D204–D212 (2015)
Xie, B., Ding, Q., Han, H., Wu, D.: MiRCancer: a microRNA-cancer association database constructed by text mining on literature. Bioinformatics 29(5), 638–644 (2013)
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2017 Springer International Publishing AG
About this paper
Cite this paper
Fiannaca, A., La Rosa, M., La Paglia, L., Messina, A., Rizzo, R., Urso, A. (2017). A Problem-Driven Approach for Building a Bioinformatics GraphDB. In: Bracciali, A., Caravagna, G., Gilbert, D., Tagliaferri, R. (eds) Computational Intelligence Methods for Bioinformatics and Biostatistics. CIBB 2016. Lecture Notes in Computer Science(), vol 10477. Springer, Cham. https://doi.org/10.1007/978-3-319-67834-4_11
Download citation
DOI: https://doi.org/10.1007/978-3-319-67834-4_11
Published:
Publisher Name: Springer, Cham
Print ISBN: 978-3-319-67833-7
Online ISBN: 978-3-319-67834-4
eBook Packages: Computer ScienceComputer Science (R0)