Abstract
Recent technological advances in genomics allow the production of biological data at unprecedented tera- and petabyte scales. Efficient mining of these vast and complex datasets for the needs of biomedical research critically depends on a seamless integration of the clinical, genomic, and experimental information with prior knowledge about genotype-phenotype relationships. Such experimental data accumulated in publicly available databases should be accessible to a variety of algorithms and analytical pipelines that drive computational analysis and data mining.
We present an integrated computational platform Lynx (Sulakhe et al., Nucleic Acids Res 44:D882–D887, 2016) ( http://lynx.cri.uchicago.edu ), a web-based database and knowledge extraction engine. It provides advanced search capabilities and a variety of algorithms for enrichment analysis and network-based gene prioritization. It gives public access to the Lynx integrated knowledge base (LynxKB) and its analytical tools via user-friendly web services and interfaces. The Lynx service-oriented architecture supports annotation and analysis of high-throughput experimental data. Lynx tools assist the user in extracting meaningful knowledge from LynxKB and experimental data, and in the generation of weighted hypotheses regarding the genes and molecular mechanisms contributing to human phenotypes or conditions of interest. The goal of this integrated platform is to support the end-to-end analytical needs of various translational projects.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
References
Chen J et al (2013) Translational biomedical informatics in the cloud: present and future. Biomed Res Int 2013:658925
Payne PR, Embi PJ, Sen CK (2009) Translational informatics: enabling high-throughput research paradigms. Physiol Genomics 39(3):131–140
Ranganathan S et al (2011) Towards big data science in the decade ahead from ten years of InCoB and the 1st ISCB-Asia Joint Conference. BMC Bioinformatics 12(Suppl 13):S1
Boyd LB et al (2011) The caBIG® Life Science Business Architecture model. Bioinformatics 27(10):1429–1435
Hillman-Jackson, J., et al. (2012) Using Galaxy to perform large-scale interactive data analyses. Curr Protoc Bioinformatics. Chapter 10: p. Unit10.5.
Schuler R et al (2012) A flexible, open, decentralized system for digital pathology networks. Stud Health Technol Inform 175:29–38
Ideker T, Krogan NJ (2012) Differential network biology. Mol Syst Biol 8:565
Koyutürk M (2010) Algorithmic and analytical methods in network biology. Wiley Interdiscip Rev Syst Biol Med 2(3):277–292
Bandyopadhyay S et al (2010) Rewiring of genetic networks in response to DNA damage. Science 330(6009):1385–1389
Chikina MD et al (2009) Global prediction of tissue-specific gene expression and context-dependent gene networks in Caenorhabditis elegans. PLoS Comput Biol 5(6):e1000417
Myers CL, Troyanskaya OG (2007) Context-sensitive data integration and prediction of biological networks. Bioinformatics 23(17):2322–2330
Sharan R, Ideker T (2006) Modeling cellular machinery through biological network comparison. Nat Biotechnol 24(4):427–433
Takemoto K, Kihara K (2013) Modular organization of cancer signaling networks is associated with patient survivability. Biosystems 113(3):149–154
Ideker T, Sharan R (2008) Protein networks in disease. Genome Res 18(4):644–652
Kiemer L, Cesareni G (2007) Comparative interactomics: comparing apples and pears? Trends Biotechnol 25(10):448–454
Nibbe RK et al (2011) Protein-protein interaction networks and subnetworks in the biology of disease. Wiley Interdiscip Rev Syst Biol Med 3(3):357–367
Blank MC et al (2011) Multiple developmental programs are altered by loss of Zic1 and Zic4 to cause Dandy-Walker malformation cerebellar pathogenesis. Development 138(6):1207–1216
Beltrao P, Ryan C, Krogan NJ (2012) Comparative interaction networks: bridging genotype to phenotype. Adv Exp Med Biol 751:139–156
Black DL, Grabowski PJ (2003) Alternative pre-mRNA splicing and neuronal function. Prog Mol Subcell Biol 31:187–216
Ellis JD et al (2012) Tissue-specific alternative splicing remodels protein-protein interaction networks. Mol Cell 46(6):884–892
Greene CS et al (2015) Understanding multicellular function and disease with human tissue-specific networks. Nat Genet 47(6):569–576
Yap K, Makeyev EV (2013) Regulation of gene expression in mammalian nervous system through alternative pre-mRNA splicing coupled with RNA quality control mechanisms. Mol Cell Neurosci 56:420–428
Biamonti G et al (2014) The alternative splicing side of cancer. Semin Cell Dev Biol 32:30–36
Kaida D, Schneider-Poetsch T, Yoshida M (2012) Splicing in oncogenesis and tumor suppression. Cancer Sci 103(9):1611–1616
Zhang J, Manley JL (2013) Misregulation of pre-mRNA alternative splicing in cancer. Cancer Discov 3(11):1228–1237
Wells QS et al (2013) Whole exome sequencing identifies a causal RBM20 mutation in a large pedigree with familial dilated cardiomyopathy. Circ Cardiovasc Genet 6(4):317–326
Stallings-Mann M, Radisky D (2007) Matrix metalloproteinase-induced malignancy in mammary epithelial cells. Cells Tissues Organs 185(1–3):104–110
Sumithra B, Saxena U, Das AB (2016) Alternative splicing within the Wnt signaling pathway: role in cancer development. Cell Oncol (Dordr) 39(1):1–13
Yabas M, Elliott H, Hoyne GF (2016) The role of alternative splicing in the control of immune homeostasis and cellular differentiation. Int J Mol Sci 17(1):3
Schaefer MH et al (2013) Adding protein context to the human protein-protein interaction network to reveal meaningful interactions. PLoS Comput Biol 9(1):e1002860
Shao H et al (2013) Systematically studying kinase inhibitor induced signaling network signatures by integrating both therapeutic and side effects. PLoS One 8(12):e80832
Cordero F et al (2012) Large disclosing the nature of computational tools for the analysis of next generation sequencing data. Curr Top Med Chem 12(12):1320–1330
Hong H et al (2013) Critical role of bioinformatics in translating huge amounts of next-generation sequencing data into personalized medicine. Sci China Life Sci 56(2):110–118
Wang S, Xing J (2013) A primer for disease gene prioritization using next-generation sequencing data. Genomics Inform 11(4):191–199
Warde-Farley D et al (2010) The GeneMANIA prediction server: biological network integration for gene prioritization and predicting gene function. Nucleic Acids Res 38(Web Server issue):W214–W220
Franceschini A et al (2013) STRING v9.1: protein-protein interaction networks, with increased coverage and integration. Nucleic Acids Res 41(Database issue):D808–D815
Szklarczyk D et al (2011) The STRING database in 2011: functional interaction networks of proteins, globally integrated and scored. Nucleic Acids Res 39(Database issue):D561–D568
Chen J et al (2009) ToppGene Suite for gene list enrichment analysis and candidate gene prioritization. Nucleic Acids Res 37(Web Server issue):W305–W311
Tranchevent LC et al (2008) ENDEAVOUR update: a web resource for gene prioritization in multiple species. Nucleic Acids Res 36(Web Server issue):W377–W384
Sifrim A et al (2013) eXtasy: variant prioritization by genomic data fusion. Nat Methods 10(11):1083–1084
Wu J, Li Y, Jiang R (2014) Integrating multiple genomic data to predict disease-causing nonsynonymous single nucleotide variants in exome sequencing studies. PLoS Genet 10(3):e1004237
Jäger M et al (2014) Jannovar: a java library for exome annotation. Hum Mutat 35(5):548–555
Li MX et al (2012) A comprehensive framework for prioritizing variants in exome sequencing studies of Mendelian diseases. Nucleic Acids Res 40(7):e53
Calabrese C et al (2014) MToolBox: a highly automated pipeline for heteroplasmy annotation and prioritization analysis of human mitochondrial variants in high-throughput sequencing. Bioinformatics 30(21):3115–3117
Yao J et al (2014) FamAnn: an automated variant annotation pipeline to facilitate target discovery for family-based sequencing studies. Bioinformatics 30(8):1175–1176
Li X, Montgomery SB (2013) Detection and impact of rare regulatory variants in human disease. Front Genet 4:67
Matthews LR et al (2001) Identification of potential interaction networks using sequence-based searches for conserved protein-protein interactions or "interologs". Genome Res 11(12):2120–2126
Yu H et al (2004) Annotation transfer between genomes: protein-protein interologs and protein-DNA regulogs. Genome Res 14(6):1107–1118
Mewes HW et al (2011) MIPS: curated databases and comprehensive secondary data resources in 2010. Nucleic Acids Res 39(Database issue):D220–D224
St Onge RP et al (2007) Systematic pathway analysis using high-resolution fitness profiling of combinatorial gene deletions. Nat Genet 39(2):199–206
Bakal C et al (2008) Phosphorylation networks regulating JNK activity in diverse genetic backgrounds. Science 322(5900):453–456
Lage K et al (2008) A large-scale analysis of tissue-specific pathology and gene expression of human disease genes and complexes. Proc Natl Acad Sci U S A 105(52):20870–20875
Zuberi K et al (2013) GeneMANIA prediction server 2013 update. Nucleic Acids Res 41(Web Server issue):W115–W122
Kamburov A et al (2013) The ConsensusPathDB interaction database: 2013 update. Nucleic Acids Res 41(Database issue):D793–D800
Niu Y, Otasek D, Jurisica I (2010) Evaluation of linguistic features useful in extraction of interactions from PubMed; application to annotating known, high-throughput and predicted interactions in I2D. Bioinformatics 26(1):111–119
Hu Z et al (2013) VisANT 4.0: Integrative network platform to connect genes, drugs, diseases and therapies. Nucleic Acids Res 41(Web Server issue):W225–W231
Elefsinioti A et al (2011) Large-scale de novo prediction of physical protein-protein association. Mol Cell Proteomics 10(11):M111–010629
Patil A, Nakai K, Nakamura H (2011) HitPredict: a database of quality assessed protein-protein interactions in nine species. Nucleic Acids Res 39(Database issue):D744–D749
Balaji S et al (2012) IMID: integrated molecular interaction database. Bioinformatics 28(5):747–749
Wong AK et al (2012) IMP: a multi-species functional genomics portal for integration, visualization and prediction of protein functions and networks. Nucleic Acids Res 40(Web Server issue):W484–W490
Tamames J, de Lorenzo V (2010) EnvMine: a text-mining system for the automatic extraction of contextual information. BMC Bioinformatics 11:294
Gerner M et al (2012) BioContext: an integrated text mining system for large-scale extraction and contextualization of biomolecular events. Bioinformatics 28(16):2154–2161
Kahn AB et al (2007) SpliceMiner: a high-throughput database implementation of the NCBI Evidence Viewer for microarray splice variant analysis. BMC Bioinformatics 8:75
Thanaraj TA et al (2004) ASD: the Alternative Splicing Database. Nucleic Acids Res 32(Database issue):D64–D69
Latham KE (2006) The Primate Embryo Gene Expression Resource in embryology and stem cell biology. Reprod Fertil Dev 18(8):807–810
Sulakhe D et al (2016) Lynx: a knowledge base and an analytical workbench for integrative medicine. Nucleic Acids Res 44(D1):D882–D887
Lukashin I et al (2011) VISTA Region Viewer (RViewer)--a computational system for prioritizing genomic intervals for biomedical studies. Bioinformatics 27(18):2595–2597
Xie B, et al (2012) Prediction of candidate genes for neuropsychiatric disorders using feature-based enrichment. Proceedings of the ACM conference on bioinformatics, computational biology and biomedicine, Association for Computing Machinery, pp 564–566
Frazer KA et al (2004) VISTA: computational tools for comparative genomics. Nucleic Acids Res 32(Web Server issue):W273–W279
Nitsch D et al (2011) PINTA: a web server for network-based gene prioritization from expression data. Nucleic Acids Res 39(Web Server issue):W334–W338
Xie B et al (2015) Disease gene prioritization using network and feature. J Comput Biol 22(4):313–323
Xie B, et al (2013) Conditional random field for candidate gene prioritization. Proceedings of the international conference on bioinformatics, computational biology and biomedical informatics, Association for Computing Machinery, p 700
Dubchak I et al (2014) An integrative computational approach for prioritization of genomic variants. PLoS One 9(12):e114903
Nitsch D et al (2010) Candidate gene prioritization by network analysis of differential expression using machine learning approaches. BMC Bioinformatics 11:460
Källberg M et al (2012) Template-based protein structure modeling using the RaptorX web server. Nat Protoc 7(8):1511–1522
Rosenbloom KR et al (2015) The UCSC genome browser database: 2015 update. Nucleic Acids Res 43(Database issue):D670–D681
Mirzaa GM et al (2014) The developmental brain disorders database (DBDB): a curated neurogenetics knowledge base with clinical and research applications. Am J Med Genet A 164A(6):1503–1511
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2017 Springer Science+Business Media LLC
About this protocol
Cite this protocol
D’Souza, M. et al. (2017). Strategic Integration of Multiple Bioinformatics Resources for System Level Analysis of Biological Networks. In: Tatarinova, T., Nikolsky, Y. (eds) Biological Networks and Pathway Analysis. Methods in Molecular Biology, vol 1613. Humana Press, New York, NY. https://doi.org/10.1007/978-1-4939-7027-8_5
Download citation
DOI: https://doi.org/10.1007/978-1-4939-7027-8_5
Published:
Publisher Name: Humana Press, New York, NY
Print ISBN: 978-1-4939-7025-4
Online ISBN: 978-1-4939-7027-8
eBook Packages: Springer Protocols