Springer Nature is making Coronavirus research free. View research | View latest news | Sign up for updates

Construction and utility of 10-kb libraries for efficient clone-gap closure for rice genome sequencing

  • 164 Accesses

  • 12 Citations


Rice is an important crop and a model system for monocot genomics, and is a target for whole genome sequencing by the International Rice Genome Sequencing Project (IRGSP). The IRGSP is using a clone by clone approach to sequence rice based on minimum tiles of BAC or PAC clones. For chromosomes 10 and 3 we are using an integrated physical map based on two fingerprinted and end-sequenced BAC libraries to identifying a minimum tiling path of clones. In this study we constructed and tested two rice genomic libraries with an average insert size of 10 kb (10-kb library) to support the gap closure and finishing phases of the rice genome sequencing project. The HaeIII library contains 166,752 clones covering approximately 4.6× rice genome equivalents with an average insert size of 10.5 kb. The Sau3AI library contains 138,960 clones covering 4.2× genome equivalents with an average insert size of 11.6 kb. Both libraries were gridded in duplicate onto 11 high-density filters in a 5 × 5 pattern to facilitate screening by hybridization. The libraries contain an unbiased coverage of the rice genome with less than 5% contamination by clones containing organelle DNA or no insert. An efficient method was developed, consisting of pooled overgo hybridization, the selection of 10-kb gap spanning clones using end sequences, transposon sequencing and utilization of in silico draft sequence, to close relatively small gaps between sequenced BAC clones. Using this method we were able to close a majority of the gaps (up to approximately 50 kb) identified during the finishing phase of chromosome-10 sequencing. This method represents a useful way to close clone gaps and thus to complete the entire rice genome.

This is a preview of subscription content, log in to check access.

Fig. 1.
Fig. 2.
Fig. 3.
Fig. 4.
Fig. 5.
Fig. 6.


  1. Arabidopsis Genome Initiative (2000) Analysis of the genome sequence of the flowering plant Arabidopsis thaliana. Nature 408:796–815

  2. Atrazhev AM, Elliott JF (1996) Simplified de-salting of ligation reactions immediately prior to electroporation into E. coli. Biotechniques 21:1024

  3. Barry G (2001) The use of the Monsanto draft rice genome sequence in research. Plant Physiol 125:1164–1165

  4. Batzoglou S, Berger B, Mesirov J, Lander ES (1999) Sequencing a genome by walking with clone-end sequences: a mathematical analysis. Genome Res 9:1163–1174

  5. Budiman MA, Mao L, Wood T, Wing RA (2000) A deep-coverage tomato BAC library and prospects toward development of an STC framework for genome sequencing. Genome Res 10:129–136

  6. Chen M, Presting G, Barbazuk WB, Goicoechea JL, Blackmon B, Fang G, Kim H, Frisch D, Yu Y, Sun S, et al. (2002) An integrated physical and genetic map of the rice genome. Plant Cell 14:1–10

  7. Devine SE, Chissoe SL, Eby Y, Wilson RK, Boeke JD (1997) A transposon-based strategy for sequencing repetitive DNA in eukaryotic genomes. Genome Res 7:551–563

  8. Ewing B, Green P (1998) Base-calling of automated sequencer traces using PhredII. Error probabilities. Genome Res 8:186–194

  9. Ewing B, Hillier L, Wendl MC, Green P (1998) Base-calling of automated sequencer traces using PHRED. I. Accuracy assessment. Genome Res 8:175–185

  10. Feng Q, Zhang Y, Hao P, Wang S, Fu G, Huang Y, Li Y, Zhu J, Liu Y, Hu X, et al. (2002) Sequence and analysis of rice chromosome 4. Nature 420:316–320

  11. Frohme M, Camargo AA, Czink C, Matsukuma AY, Simpson AJG, Hoheisel JD, Verjovski-Almeida S (2001) Directed gap closure in large-scale sequencing projects. Genome Res 11:901–903

  12. Goff SG, Ricke D, Lan TH, Presting G, Wang R, Dunn M, Glazebrook J, Sessions A, Oeller P, Varma H, et al. (2002) A draft sequence of the rice genome (Oryza sativa L. ssp. japonica). Science 296:92–100

  13. Gordon D, Abajian C, Green P (1998) Consed: a graphical tool for sequence finishing. Genome Res 8:195–202

  14. Green P (1997) Against a whole-genome shotgun. Genome Res 7:410–417

  15. International Human Genome Sequencing Consortium (2001) Initial sequencing and analysis of the human genome. Nature 409:860–941

  16. Luo M, Wang YH, Frisch D, Joobeur T, Wing RA, Ralph AD (2001) Melon BAC library construction using improved methods and identification of clones linked to the locus conferring resistance to melon Fusarium Wilt (Fom-2). Genome 44:154–162

  17. McMurray AA, Sulston, JE, Quail MA (1998) Short-insert libraries as a method of problem solving in genome sequencing. Genome Res 8:562–566

  18. Ohmido N, Kijima K, Akiyama Y, de Jong JH, Fukui K (2000) Quantification of total genomic DNA and selected repetitive sequences reveals concurrent changes in different DNA families in indica and jopanica rice. Mol Gen Genet 263:388–394

  19. Roach JC, Thorsson V, Siegel AF (2000) Parking strategies for genome sequencing. Genome Res 10:1020–1030

  20. Sasaki T, Burr B (2000) International rice genome sequencing project: the effort to completely sequence the rice genome. Curr Opin Plant Biol 3:138–141

  21. Sasaki T, Matsumoto T, Yamamoto K, Sakata K, Baba T, Katayose Y, Wu J, Niimura Y, Cheng Z, Nagamura Y, et al. (2002) The genome sequence and structure of rice chromosome 1. Nature 420:312–316

  22. Siegel AF, Trask B, Roach JD, Mahairas GG, Hood L, van den Engh G (1999) Analysis of sequence-tagged-connector strategies for DNA sequencing. Genome Res 9:297–307

  23. Soderlund C, Humphray S, Dunham A, French L (2000) Contigs built with fingerprints, markers and FPC V4.7. Genome Res 10:1772–1787

  24. Tomkins JP, Peterson DG, Yang TJ, Main D, Wilkins TA, Paterson AH, Wing RA (2001) Development of genomic resources for cotton (Gosypium hirsutuum): BAC library construction, preliminary STC analysis, and identification of clones associated with fiber development. Mol Breed 8:255–261

  25. Venter JD, Smith HO, Hood L (1996) A new strategy for genome sequencing. Nature 381:364–366

  26. Venter JD, Adams MD, Myers EW, Li PW, Mural RJ, Sutton SS, Smith HO, Yandell M, Evans CA, Holt RA, et al. (2001) The sequence of the human genome. Science 291:1304–1351

  27. Wendl MC, Marra MA, Hillier LW, Chinwalla AT, Wilson RK, Waterston RH (2001) Theories and applications for sequencing randomly selected clones. Genome Res 11:274–280

  28. Yang TJ, Yu Y, Frisch D, Wing RA (2002) Two series of plasmid vectors for the shotgun library. Plant and Animal Genome Conference X, p 24

  29. Yu J, Hu S, Wang J, Wong GK, Li S, Liu B, Deng Y, Dai L, Zhou Y, Zhang X, et al. (2002) A draft sequence of the rice genome (Oryza sativa L. ssp. indica). Science 296:79–91

  30. Yu Y, Tomkins JP, Waugh R, Frisch D, Kudrna D, Kleinhofs A, Brueggeman RS, Muehlbauer GJ, Wise RP, Wing RA (2000) A bacterial artificial chromosome library for barley (Hordeum vulgare L.) and the identification of clones containing putative resistance genes. Theor Appl Genet 101:1093–1099

  31. Yu Y, Rambo T, Currie J, Sasaki C, Kim HR, Collura K, Thompson S, Simmons J, Yang TJ, Park GN, Patel AJ, Thurmond S, Henry D, Oates R, Palmer M, Pries G, Gibson J, Anderson H, Paradkar M, Crane L, Dale J, Carver MB, Wood T, Frisch D, Engler F, Soderlund C, Palmer LE, Tetylman L, Nascimento L, Bastide M de la, Spiegel L, Ware D, O'Shaughnessy A, Dike S, Dedhia N, Preston R, Huang E, Ferraro K, Kuit K, Miller B, Zutavern T, Katzenberger F, Muller S, Balija V, Martienssen RA, Stein L, Minx P, Johnson D, Cordum H, Mardis E, Cheng Z, Jiang J, Wilson R, McCombie WR, Wing RA, Yuan Q, Ouyang S, Liu J, Jones KM, Gansberger K, Moffat K, Hill J, Tsitrin T, Overton L, Bera J, Kim M, Jin S, Tallon L, Ciecko A, Pai G, Aken SV, Utterback T, Reidmuller S, Bormann J, Feldblyum T, Hsiao J, Zismann V, Blunt S, Vazeilles A de, Shaffer T, Koo H, Suh B, Yang Q, Haas B, Peterson J, Pertea M, Volfovsky N, Worman J, White O, Salzberg SL, Fraser CV, Buell CR, Messing J, Song R, Fuks G, Llaca V, Kovchak S, Young S, Bowers JE, Paterson AH, Johns MA, Mao L, Pan H, Dean RA (2003) In-depth view of structure, activity, and evolution of rice chromosome 10. Science (in press)

  32. Zhang HB, Zhao X, Ding X, Paterson AH, Wing RA (1995) Preparation of megabase-size DNA from plant nuclei. Plant J 7:175–184

Download references


This work was supported by grants from the U.S. Department of Agriculture CREES, the National Science Foundation, and the Department of Energy and postdoctoral fellowship program from Korea Science and Engineering Foundation (KOSEF). Any opinions, findings, and conclusions or recommendations expressed herein are those of the authors and do not necessarily reflect the views of the U.S. Department of Agriculture, the National Science Foundation or the Department of Energy. We thank Monsanto for providing access to their draft sequence data.

Author information

Correspondence to Rod A. Wing.

Additional information

Communicated by Q. Zhang

Rights and permissions

Reprints and Permissions

About this article

Cite this article

Yang, T., Yu, Y., Nah, G. et al. Construction and utility of 10-kb libraries for efficient clone-gap closure for rice genome sequencing. Theor Appl Genet 107, 652–660 (2003).

Download citation


  • Rice genome sequencing
  • 10-kb Library
  • Gap closure