Strategies for Sequencing and Assembling Grapevine Genomes

  • Rosa Figueroa-Balderas
  • Andrea Minio
  • Abraham Morales-Cruz
  • Amanda M. Vondras
  • Dario CantuEmail author
Part of the Compendium of Plant Genomes book series (CPG)


Though grape transcriptomics has expanded dramatically over the last ten years, few additional novel genomic resources were developed since the release of the PN40024 reference genome in 2007. This is partly because of the difficulty associated with assembling grape genomes. Despite a relatively small genome size of ~500 Mb and modest repeat content, high sequence and structural heterozygosity makes assembling grape genomes particularly challenging. Without assemblies representative of the genetic diversity within the cultivated germplasm, identifying cultivar-specific functions not represented in the PN40024 genome has remained elusive. Now, third-generation sequencing technologies and long-range scaffolding methods have made it possible to relatively inexpensively and rapidly generate highly contiguous and complete grape genomes. This chapter will describe the challenges associated with the isolation of high-quality nucleic acids suitable for long-read sequencing and provide an overview of the sequencing and assembling approaches that can be used to successfully reconstruct grape genomes.


Genome assembling Sequencing technologies Grapevine nucleic acid isolation 



This work was supported by J. Lohr Vineyards and Wines, E. & J. Gallo Winery, Dolce Winery, the Louis P. Martini Endowment in Viticulture, and the NSF Grant #1741627. Part of this work was carried out in collaboration with UC Davis Chile and funded by the Chilean Economic Development Agency (CORFO).


  1. Aigrain L, Gu Y, Quail MA (2016) Quantitation of next generation sequencing library preparation protocol efficiencies using droplet digital PCR assays a systematic comparison of DNA library preparation kits for Illumina sequencing. BMC Genom 17:458CrossRefGoogle Scholar
  2. An D, Cao HX, Li C, Humbeck K, Wang W (2018) Isoform sequencing and state-of-art applications for unravelling complexity of plant transcriptomes. Genes 9(1):43. Scholar
  3. Aradhya MK, Dang GS, Prins BH et al (2003) Genetic structure and differentiation in cultivated grape Vitis vinifera L. Genet Res 81:179–192. Scholar
  4. Aranda PS, LaJoie DM, Jorcyk CL (2012) Bleach gel: a simple agarose gel for analyzing RNA quality. Electrophoresis 33(2):366–369PubMedPubMedCentralCrossRefGoogle Scholar
  5. Argout X, Salse J, Aury JM et al (2011) The genome of Theobroma cacao. Nat Genet 43:101–108. Scholar
  6. Ausubel FM, Brent R, Kingston et al (1994) Current protocols in molecular biology. Wiley, New York, pp 2.0.1–2.14.8Google Scholar
  7. Bhattacharjee R, Kolesnikova-Allen M, Aikpokpodion P, Taiwo S, Ingelbrecht I (2004) A semi-automated rapid method of extracting genomic DNA for molecular marker analysis in cocoa, Theobroma cacao L. Plant Mol Biol Report 22:435a–435h458CrossRefGoogle Scholar
  8. Blanco-Ulate B, Hopfer H, Figueroa-Balderas R et al (2017) Red blotch disease alters grape berry development and metabolism by interfering with the transcriptional and hormonal regulation of ripening. J Exp Bot 68(5):1225–1238PubMedPubMedCentralCrossRefGoogle Scholar
  9. Bradnam KR, Fass JN, Alexandrov A et al (2013) Assemblathon 2: evaluating de novo methods of genome assembly in three vertebrate species. Gigascience 2:10PubMedPubMedCentralCrossRefGoogle Scholar
  10. Braidot E, Zancani M, Petrussa E et al (2008) Transport and accumulation of flavonoids in grapevine (Vitis vinifera L.). Plant Signal Behav 3(9):626–632PubMedPubMedCentralCrossRefGoogle Scholar
  11. Cardone MF, D’Addabbo P, Alkan C et al (2016) Inter-varietal structural variation in grapevine genomes. Plant J 88:648–661PubMedCrossRefPubMedCentralGoogle Scholar
  12. Cheng SH, Moore BD, Seemann JR (2000) Purification of uncontaminated, intact plant RNA. In: Rapley R (ed) The nucleic acid protocols handbook. Humana Press, TotowaGoogle Scholar
  13. Chin CS, Peluso P, Sedlazeck FJ et al (2016) Phased diploid genome assembly with single-molecule real-time sequencing. Nat Methods 13:1050–1054. Scholar
  14. Chin CS, Alexander DH, Marks P et al (2013) Nonhybrid, finished microbial genome assemblies from long-read SMRT sequencing data. Nat Methods 10(6):563–569. Scholar
  15. Claros MG, Bautista R, Guerrero-Fernández D et al (2012) Why assembling plant genome sequences is so challenging. Biology (Basel) 1:439–459. Scholar
  16. Costello M, Pugh TJ, Fennell TJ et al (2013) Discovery and characterization of artifactual mutations in deep coverage targeted capture sequencing data due to oxidative DNA damage during sample preparation. Nucleic Acids Res 41(6):e67PubMedPubMedCentralCrossRefGoogle Scholar
  17. Couch JA, Fritz PJ (1990) Isolation of DNA from plants high in polyphenolics. Plant Mol Biol Rep 8:8–12CrossRefGoogle Scholar
  18. Da Silva C, Zamperin G, Ferrarini A et al (2013) The high polyphenol content of grapevine cultivar tannat berries is conferred primarily by genes that are not shared with the reference genome. Plant Cell 25:4777–4788. Scholar
  19. Di Genova A, Almeida AM, Muñoz-Espinoza C et al (2014) Whole genome comparison between table and wine grapes reveals a comprehensive catalog of structural variants. BMC Plant Biol 14:7. Scholar
  20. Doyle JJ, Doyle JL (1987) A rapid DNA isolation procedure for small quantities of fresh leaf tissue. Phytochem Bull 19:11–15Google Scholar
  21. Earl D, Bradnam K, St. John JN et al (2011) Assemblathon 1: a competitive assessment of de novo short read assembly methods. Genome Res 21:2224–2241. Scholar
  22. Edwards D, Batley J (2010) Plant genome sequencing: applications for crop improvement. Plant Biotechnol J 8:2–9. Scholar
  23. Ekblom R, Wolf JBW (2014) A field guide to whole-genome sequencing, assembly and annotation. Evol Appl. Scholar
  24. Endrullat C, Glökler J, Franke P, Frohme M (2016) Standardization and quality management in next-generation sequencing. Appl Transl Genom 10:2–9. Scholar
  25. Fang G, Hammar S, Grumet R (1992) A quick and inexpensive method for removing polysaccharides from plant genomic DNA. Biotechniques 13(1):52–54PubMedCrossRefPubMedCentralGoogle Scholar
  26. Feuillet C, Leach JE, Rogers J et al (2011) Crop genome sequencing: lessons and rationales. Trends Plant Sci 16:77–88. Scholar
  27. Gambino G, Dal Molin A, Boccacci P et al (2017) Whole-genome sequencing and SNV genotyping of ‘Nebbiolo’ (Vitis vinifera L.) clones. Sci Rep 7:17294PubMedPubMedCentralCrossRefGoogle Scholar
  28. Goff SA, Ricke D, Lan T-H et al (2002) A draft sequence of the rice genome (Oryza sativa L. ssp. japonica). Science 296:92–100. Scholar
  29. Green MR, Sambrook J (2016) Precipitation of DNA with ethanol. Cold Spring Harb Protoc. Scholar
  30. Green MR, Sambrook J (2017) Precipitation of DNA with isopropanol. Cold Spring Harb Protoc. Scholar
  31. Guzmán P, Ecker JR (1988) Development of large DNA methods for plants: molecular cloning of large segments of Arabidopsis and carrot DNA into yeast. Nucleic Acids Res 16(23):11091–11105PubMedPubMedCentralCrossRefGoogle Scholar
  32. Hanania U, Velcheva M, Sahar N et al (2004) An improved method for isolating high-quality DNA vitis nuclei. Plant Mol Biol Rep 22:173–177CrossRefGoogle Scholar
  33. Healy A, Furtado A, Cooper T, Henry R (2014) Protocol: a simple method for extracting next-generation sequencing quality genomic DNA from recalcitrant plant species. Plant Methods 10(1):1–8CrossRefGoogle Scholar
  34. Heptinstall J, Rapley R (2000) Spectrophotometric analysis of nucleic acids. In: Rapley R (ed) The Nucleic acid protocols handbook. Humana Press, TotowaGoogle Scholar
  35. Hyma KE, Barba P, Wang M et al (2015) Heterozygous mapping strategy (HetMappS) for high resolution genotyping-by-sequencing markers: a case study in grapevine (N. A. Tinker, Ed.). PLoS ONE 10:e0134880PubMedPubMedCentralCrossRefGoogle Scholar
  36. Iandolino AB, Goes da Silva F, Lim H, Choi H, Williams LE, Cook DR (2004) High-quality Rna, cDNA, and derived EST libraries from grapevine (Vitis vinifera L.). Plant Mol Biol Rep 22:269–278CrossRefGoogle Scholar
  37. Jaillon O, Aury J-M, Noel B et al (2007) The grapevine genome sequence suggests ancestral hexaploidization in major angiosperm phyla. Nature 449:463–467. Scholar
  38. Japelaghi RH, Haddad R, Garoosi GA (2011) Rapid and efficient isolation of high quality nucleic acids from plant tissues rich in polyphenols and polysaccharides. Mol Biotechnol 49:129–137. Scholar
  39. Jiao C, Gao M, Wang X, Fei Z (2015) Transcriptome characterization of three wild Chinese Vitis uncovers a large number of distinct disease related genes. BMC Genom 16:223CrossRefGoogle Scholar
  40. Kajitani R, Toshimoto K, Noguchi H et al (2014) Efficient de novo assembly of highly heterozygous genomes from whole-genome shotgun short reads. Genome Res 24:1384–1395. Scholar
  41. Kim CS, Lee CH, Shin JS, Chung YS, Hyung NI (1997) A simple and rapid method for isolation of high quality genomic DNA from fruit trees and conifers using PVP. Nucleic Acids Res 25:1085–1086PubMedPubMedCentralCrossRefGoogle Scholar
  42. Knebelsberger T, Stöger I (2012) DNA extraction, preservation, and amplification. In: Kress W, Erickson D (eds) DNA Barcodes. Methods in molecular biology (methods and protocols). Humana Press, TotowaGoogle Scholar
  43. Laucou V, Lacombe T, Dechesne F et al (2011) High throughput analysis of grape genetic diversity as a tool for germplasm collection management. Theor Appl Genet 122(6):1233–1245PubMedCrossRefPubMedCentralGoogle Scholar
  44. Lodhi MA, Ye G-N, Weeden NF, Reisch BI (1994) A simple and efficient method for DNA extraction from grapevine cultivars and Vitis species. Plant Mol Biol Rep 12:6–13CrossRefGoogle Scholar
  45. Lutz KA, Wang W, Zdepski A, Michael TP (2011) Isolation and analysis of high quality nuclear DNA with reduced organellar DNA for plant genome sequencing and resequencing. BMC Biotechnol 11:54. Scholar
  46. Maliyakal JE (1992) An efficient method for isolation of RNA and DNA from plants containing polyphenolics. Nucleic Acids Res 20:2381CrossRefGoogle Scholar
  47. Marsal G, Baiges I, Canals JM, Zamora F, Fort F (2013) Comparison of the efficiency of some of the most usual DNA extraction methods for woody plants in different tissues of Vitis vinifera L. J Int Sci Vigne Vin 47:227–237Google Scholar
  48. Massonnet M, Fasoli M, Tornielli GB et al (2017) Ripening transcriptomic program in red and white grapevine varieties correlates with berry skin anthocyanin accumulation. Plant Physiol 174(4):2376–2396PubMedPubMedCentralCrossRefGoogle Scholar
  49. Mayjonade B, Gouzy J, Donnadieu C et al (2016) Extraction of high-molecular-weight genomic DNA for long-read sequencing of single molecules. Biotechniques 61:203–205. Scholar
  50. Michael TP, Jackson S (2013) The first 50 plant genomes. Plant Genome 6:1–7CrossRefGoogle Scholar
  51. Minio A, Lin J, Gaut BS, Cantu D (2017) How single molecule real-time sequencing and haplotype phasing have enabled reference-grade diploid genome assembly of wine grapes. Front Plant Sci 8:1–6. Scholar
  52. Minio A, Massonnet M, Figueroa-Balderas R et al (2019a) Iso-seq allows genome-independent transcriptome profiling of grape berry development. G3 Genes Genomes Genet 9:755–767Google Scholar
  53. Minio A, Massonnet M, Figueroa-Balderas R et al (2019b) Diploid genome assembly of the wine grape carmenere. G3 Genes Genomes Genet 9:1331–1337Google Scholar
  54. Morrell PL, Buckler ES, Ross-Ibarra J (2012) Crop genomics: advances and applications. Nat Rev Genet 13(2):85–96. Scholar
  55. Murray MG, Thompson WF (1980) Rapid isolation of high molecular weight DNA. Nucleic Acids Res 8:4321–4325PubMedPubMedCentralCrossRefGoogle Scholar
  56. Myles S, Chia J-M, Hurwitz B et al (2010) Rapid genomic characterization of the genus Vitis. PLoS ONE 5:e8219. Scholar
  57. Nakayama Y, Yamaguchi H, Einaga N, Esumi M (2016) Pitfalls of DNA quantification using DNA-binding fluorescent dyes and suggested solutions. PLoS ONE 11(3):e0150528. Scholar
  58. Newbury HJ, Possingham JV (1977) Factors affecting the extraction of intact ribonucleic acid from plant tissues containing interfering phenolic compounds. Plant Physiol 60:543–547PubMedPubMedCentralCrossRefGoogle Scholar
  59. Pendleton M, Sebra R, Pang AWC et al (2015) Assembly and diploid architecture of an individual human genome via single-molecule technologies. Nat Methods 12:780–786PubMedPubMedCentralCrossRefGoogle Scholar
  60. Peterson DG, Boehm KS, Stack SM (1997) Isolation of milligram quantities of nuclear DNA from tomato (Lycopersicon esculentum), a plant containing high levels of polyphenolic compounds. Plant Mol Biol Rep 15(2):148–153CrossRefGoogle Scholar
  61. Pryszcz LP, Gabaldón T (2016) Redundans: An assembly pipeline for highly heterozygous genomes. Nucleic Acids Res 44:e113. Scholar
  62. Reid KE, Olsson N, Schlosser J, Peng F, Lund ST (2006) An optimized grapevine RNA isolation procedure and statistical determination of reference genes for real-time RT-PCR during berry development. BMC Plant Biol 6:27. Scholar
  63. Rezadoost MH, Kordrostami M, Kumleh HH (2016) An efficient protocol for isolation of inhibitor-free nucleic acids even from recalcitrant plants. 3 Biotech 6(1):61PubMedPubMedCentralCrossRefGoogle Scholar
  64. Ribeiro RA, Lovato MB (2007) Comparative analysis of different DNA extraction protocols in fresh and herbarium specimens of the genus Dalbergia. Gen Mol Res 6:173–187Google Scholar
  65. Roach MJ, Johnson DL, Bohlmann J et al (2018) Population sequencing reveals clonal diversity and ancestral inbreeding in the grapevine cultivar Chardonnay. PLoS Genet 14:e1007807. Scholar
  66. Romieu C (2010) RNA extraction from young, acidic berries and other organs from Vitis vinifera L. In: Delrot S, Medrano H, Or E, Bavaresco L, Grando S (eds) Methodologies and results in grapevine research. Springer, DordrechtGoogle Scholar
  67. Ruan J, Li H (2019) Fast and accurate long-read assembly with wtdbg2. BioRxiv, 530972.
  68. Safonova Y, Bankevich A, Pevzner PA (2015) dipSPAdes: assembler for highly polymorphic diploid genomes. J Comput Biol A J Comput Mol Cell Biol 22:528–545. Scholar
  69. Sahu SK, Thangaraj M, Kathiresan K (2012) DNA extraction protocol for plants with high levels of secondary metabolites and polysaccharides without using liquid nitrogen and phenol. ISRN Mol Biol 2012:205049. Scholar
  70. Sharma AD, Gill PK, Singh P (2002) DNA isolation from dry and fresh samples of polysaccharide-rich plants. Plant Mol Biol Rep 20:415. Scholar
  71. Simão FA, Waterhouse RM, Ioannidis P et al (2015) BUSCO: assessing genome assembly and annotation completeness with single-copy orthologs. Bioinformatics. Scholar
  72. Tan ZJ, Chen SJ (2005) Nucleic acid helix stability: effects of salt concentration, cation valence and size, and chain length. Biophys J 90(4):1175–1190PubMedPubMedCentralCrossRefGoogle Scholar
  73. The Arabidopsis Genome Initiative (2000) Analysis of the genome sequence of the flowering plant Arabidopsis thaliana. Nature 408:796–815CrossRefGoogle Scholar
  74. Thottathil GP, Jayasekaran K, Othman AS (2016) Sequencing crop genomes: a gateway to improve tropical agriculture. Trop Life Sci Res. Scholar
  75. Tuskan GA, Difazio S, Jansson S et al (2006) The genome of black cottonwood, Populus trichocarpa (Torr. & Gray). Science 313(5793):1596–1604PubMedCrossRefPubMedCentralGoogle Scholar
  76. VanBuren R, Bryant D, Edger PP et al (2015) Single-molecule sequencing of the desiccation-tolerant grass Oropetium thomaeum. Nature 527:508–511PubMedCrossRefPubMedCentralGoogle Scholar
  77. Varma A, Padh H, Shrivastava N (2007) Plant genomic DNA isolation: an art or a science. Biotechnol J 2:386–392PubMedCrossRefPubMedCentralGoogle Scholar
  78. Velasco R, Zharkikh A, Troggio M et al (2007) A high quality draft consensus sequence of the genome of a heterozygous grapevine variety. PLoS ONE 2:e1326. Scholar
  79. Venturini L, Ferrarini A, Zenoni S et al (2013) De novo transcriptome characterization of Vitis vinifera cv. Corvina unveils varietal diversity. BMC Genom 14:41. Scholar
  80. Walenz BP, Koren S, Bergman et al (2017) Canu: scalable and accurate long-read assembly via adaptive k-mer weighting and repeat separation. Genome Res 27(5):722–736. Scholar
  81. Xiao H, Kim WS, Meng B (2015) A highly effective and versatile technology for the isolation of RNAs from grapevines and other woody perennials for use in virus diagnostics. Virol J 12:171. Scholar
  82. Yang C, Li F, Ji X, Wang J (2011) An effective method for RNA extraction from grapevine berry skins. Afr J Biotechnol 10(45):9032–9035CrossRefGoogle Scholar
  83. Yang S, Fresnedo-Ramírez J, Wang M et al (2016) A next-generation marker genotyping platform (AmpSeq) in heterozygous crops: a case study for marker-assisted selection in grapevine. Hortic Res 3:16002. Scholar
  84. Yockteng R, Almeida AM, Yee S, Andre T, Hill C, Specht CD (2013) A method for extracting high-quality RNA from diverse plants for next-generation sequencing and gene expression analyses. Appl Plant Sci. Scholar
  85. Yu J, Wang J, Lin W et al (2005) The genomes of Oryza sativa: a history of duplications. PLoS Biol 3:e38. Scholar
  86. Zhou Y, Massonnet M, Sanjak JS et al (2017) Evolutionary genomics of grape (Vitis vinifera ssp. vinifera) domestication. Proc Natl Acad Sci USA 114:11715–11720. Scholar
  87. Zhou Y, Minio A, Massonnet M, et al (2018) Structural variants, clonal propagation, and genome evolution in grapevine (Vitis vinifera). bioRxiv 508119; doi:

Copyright information

© Springer Nature Switzerland AG 2019

Authors and Affiliations

  • Rosa Figueroa-Balderas
    • 1
  • Andrea Minio
    • 1
  • Abraham Morales-Cruz
    • 1
  • Amanda M. Vondras
    • 1
  • Dario Cantu
    • 1
    Email author
  1. 1.Department of Viticulture and EnologyUniversity of California DavisDavisUSA

Personalised recommendations