Abstract
Soybean Knowledge Base (SoyKB) is a comprehensive web resource for bridging the gap between soybean translational genomics and molecular breeding. It provides information on six entity types: genes/proteins, microRNAs (miRNAs)/small RNAs (sRNAs), metabolites, single nucleotide polymorphisms (SNPs), plant introduction lines, and traits. Its user-friendly web interface, publicly available at http://soykb.org, integrates and presents data in an intuitive manner for soybean researchers, breeders, and consumers. It also incorporates several informatics and analytical tools for integrating multi-omics datasets.
Acknowledgment
The development of SoyKB has been supported by the Missouri Soybean Merchandising Council, the United Soybean Board, the National Science Foundation (#DBI-0421620), the Department of Energy (DE-SC0004898), and the National Center for Soybean Biotechnology.
Copyright information
© 2017 Springer Science+Business Media New York
About this protocol
Cite this protocol
Joshi, T. et al. (2017). The Evolution of Soybean Knowledge Base (SoyKB). In: van Dijk, A. (eds) Plant Genomics Databases. Methods in Molecular Biology, vol 1533. Humana Press, New York, NY. https://doi.org/10.1007/978-1-4939-6658-5_7
DOI: https://doi.org/10.1007/978-1-4939-6658-5_7
Publisher Name: Humana Press, New York, NY
Print ISBN: 978-1-4939-6656-1
Online ISBN: 978-1-4939-6658-5
eBook Packages: Springer Protocols