SNP development and diversity analysis for Ginkgo biloba based on transcriptome sequencing
- 82 Downloads
Based on Ginkgo biloba transcriptome data, 22 SNP loci using high-resolution melting curve technology were validated and genetic diversity was analyzed in three populations.
Ginkgo biloba (Ginkgo) is a long-lived dioecious gymnosperm with unique morphological characteristics and play a significant role in evolutionary relationship research. In this study, we used Illumina paired-end RNA sequencing technology for facilitating the gene discovery and single nucleotide polymorphism (SNP) development in Ginkgo. We collected 44.08 G clean bases from six transcriptome datasets. The transcriptome data generated 98,919 unigenes among which 42,667 (43.13%) were successfully annotated. A total of 139,854 putative SNPs were identified; most of the SNPs were transition-type with the nucleotide transitions C–T or A–G. Further, 54532 (38.99%) SNPs were found in protein-coding regions: 23483 (43.06%) were synonymous and 31049 (56.94%) were nonsynonymous. 22 SNPs were subjected to PCR amplification and Sanger sequencing, and all of them were validated. To test the practicability of identified SNPs, these validated SNPs were also assessed by genotyping three natural populations with 84 individuals by high-resolution melting curve (HRM) analysis. Observed and expected heterozygosity varied from 0.0119 to 0.9643 and from 0.0581 to 0.5024, respectively. HRM technology was first applied for the SNP genotyping in Ginkgo. SNPs identified by RNA-Seq provided a useful resource for genetic and genomic studies in Ginkgo. Moreover, the collection of nonsynonymous SNPs annotated with their predicted functional effects also provided a valuable asset for further discovery of genes, identification of gene variants, and development of functional markers.
KeywordsGinkgo biloba Transcriptome High-resolution melting curve Single nucleotide polymorphism
This study was supported by the Special Fund for Forest Scientific Research in the Public Welfare (201504105), the Agricultural Science and Technology Independent Innovation Funds of Jiangsu Province (CX(16)1005), the National Key Research and Development Program of China (2017YFD0600700), Postgraduate Research & Practice Innovation Program of Jiangsu Province (KYCX18_0954), and the Priority Academic Program Development of Jiangsu Higher Education Institutions (PAPD).
Compliance with ethical standards
Conflict of interest
The authors declare that they have no conflict of interest.
- Antanaviciute L, Fernández-Fernández F, Jansen J, Banchi E, Evans KM, Viola R, Velasco R, Dunwell JM, Troggio M, Sargent DJ (2012) Development of a dense SNP-based linkage map of an apple rootstock progeny using the Malus infinium whole genome genotyping array. BMC Genom 13:203. https://doi.org/10.1186/1471-2164-13-203 CrossRefGoogle Scholar
- Cabrera A, Rosyara UR, Franceschi PD, Sebolt A, Sooriyapathirana SS, Dirlewanger E, Quero-Garcia J, Schuster M, Iezzoni AF, Knaap EVD (2012) Rosaceae conserved orthologous sequences marker polymorphism in sweet cherry germplasm and construction of a SNP-based map. Tree Genet Genomes 8:237–247. https://doi.org/10.1007/s11295-011-0436-9 CrossRefGoogle Scholar
- Grabherr MG, Haas BJ, Yassour M, Levin JZ, Thompson DA, Amit I, Adiconis X, Fan L, Raychowdhury R, Zeng Q, Chen Z, Mauceli E, Hacohen N, Gnirke A, Rhind N, di Palma F, Birren BW, Nusbaum C, Lindblad-Toh K, Friedman N, Regev A (2011) Full-length transcriptome assembly from RNA-Seq data without a reference genome. Nat Biotechnol 29:644–652. https://doi.org/10.1038/nbt.1883 CrossRefGoogle Scholar
- Jo IH, Lee SH, Kim YC, Kim DH, Kim HS, Kim KH, Chung JW, Bang KH (2015) De novo transcriptome assembly and the identification of gene associated single nucleotide polymorphism markers in Asian and American ginseng roots. Mol Genet Genomics 290:1055–1065. https://doi.org/10.1007/s00438-014-0974-6 CrossRefGoogle Scholar
- Mckenna A, Hanna M, Banks E, Sivachenko A, Cibulskis K, Kernytsky A, Garimella K, Altshuler D, Gabriel S, Daly M, DePristo MA (2010) The genome analysis toolkit: a map reduce framework for analysis next-generation DNA sequencing data. Genome Res 20:1297–1303. https://doi.org/10.1101/gr.107524.110 CrossRefGoogle Scholar
- Nickerson M L,Weirich G,Zbar B,Schmidt L S (2000) Signature-based analysis of MET proto-oncogene mutations using DHPLC. Hum Mutat 16:68–76. https://doi.org/10.1002/1098-1004(200007)16:1%3C68::AID-HUMU12%3E3.0.CO;2-U CrossRefGoogle Scholar
- Plomion C, Bartholomé J, Lesur I, Boury C, Rodríguez-Quilón I, Lagraulet H, Ehrenmann F, Bouffier L, Gion JM, Grivet D, de Miguel M, de María N, Cervera MT, Bagnoli F, Isik F, Vendramin GG, González-Martínez SC (2016) High-density SNP assay development for genetic analysis in maritime pine (Pinus pinaster). Mol Ecol Resour 16:574–587. https://doi.org/10.1111/1755-0998.12464 CrossRefGoogle Scholar
- Sato T, Nakagome S, Watanabe C, Yamaguchi K, Koganebuchi K, Haneji K, Yamaguchi T, Hanihara T, Yamamoto K, Ishida H, Mano S, Kimura R, Oota H (2014) Genome-wide SNP analysis reveals population structure and demographic history of the Ryukyu Islanders in the southern part of the Japanese Archipelago. Mol Biol Evol 31(11):2929–2940. https://doi.org/10.1093/molbev/msu230 CrossRefGoogle Scholar
- Sharma K, Sharma M, Singh S, Modi M, Sharma A, Ray P, Varma S (2017) Real-time PCR followed by high-resolution melting curve analysis: a rapid and pragmatic approach for screening of multidrug-resistant extrapulmonary tuberculosis. Tuberculosis 106:56–61. https://doi.org/10.1016/j.tube.2017.07.002 CrossRefGoogle Scholar