Abstract
Development of molecular markers and the transfer of marker information from one species to another are limiting steps in the assembly of genetic maps and the use of map information in breeding programs. To identify potential marker sequences more efficiently, we have established procedures combining multi-species EST and genome sequence data for a genome-wide, in silico identification of molecular markers. Taking advantage of information from a few related species, comparative EST sequence analysis identifies evolutionarily conserved sequences (ECSs) that with high probability are conserved in less characterised species in the same family. The chance of observing variation between any two mapping parents is increased by selecting ECS that are interrupted by introns in corresponding genomic regions. Our procedure simultaneously optimizes (1) primer selection for stable performance of PCR across species by choosing ECS as the target sequences for priming, (2) the likelihood of polymorphism discovery by selecting intron-containing ECSs, (3) marker transfer between species, and (4) information content by counting copy numbers of homologous sequences in Arabidopsis. We illustrate our procedure in legumes, where model plant genome and EST sequence data have great potential re influencing crop legume breeding programs.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Preview
Unable to display preview. Download preview PDF.
References
Altschul SF, Gish W, Miller W, Myers E, and Lipman DJ. (1990) Basic local alignment search tool. Journal of Molecular Biology 215, 403–410.
Arabidopsis Genome Inititative (2000) Analysis of the genome sequence of the flowering plant Arabidopsis thaliana. Nature 408, 796–815.
Bowers JE, Chapman BA, Rong J, Paterson AH. (2003) Unravelling angiosperm genome evolution by phylogenetic analysis of chromosomal duplication events. Nature 422, 433–438.
Chiapello H, Lisacek F, Caboche M, Henaut A. (1998) Codon usage and gene function are related in sequences of Arabidopsis thaliana. Gene 209, 1–38.
Fulton TM, Van der Hoeven R, Eannetta NT, Tanksley SD. (2002) Identification, analysis, and utilization of conserved ortholog set markers for comparative genomics in higher plants. Plant Cell 14, 1457–67.
Goff SA, Ricke D, Lan TH, Presting G, Wang R, Dunn M, Glazebrook J, Sessions A, Oeller P, Varma H, Hadley D, Hutchison D, Martin C, Katagiri F, Lange BM, Moughamer T, Xia Y, Budworth P, Zhong J, Miguel T, Paszkowski U, Zhang S, Colbert M, Sun WL, Chen L, Cooper B, Park S, Wood TC, Mao L, Quail P, Wing R, Dean R, Yu Y, Zharkikh A, Shen R, Sahasrabudhe S, Thomas A, Cannings R, Gutin A, Pruss D, Reid J, Tavtigian S, Mitchell J, Eldredge G, Scholl T, Miller RM, Bhatnagar S, Adey N, Rubano T, Tusneem N, Robinson R, Feldhaus J, Macalma T, Oliphant A, and Briggs S. A draft sequence of the rice genome (Oryza sativa L ssp. japonica). Science 296:92–100.
Hayashi M, Miyahara A, Sato S, Kato T, Yoshikawa M, Taketa M, Hayashi M, Pedrosa A, Onda R, Imaizumi-Anraku H, Bachmair A, Sandal N, Stougaard J, Murooka Y, Tabata S, Kawasaki S, Kawaguchi M, Harada K. (2001) Construction of a genetic linkage map of the model legume Lotus japonicus using an intraspecific F 2 population. DNA Research 8, 301–310.
Jander G, Norris SR, Rounsley SD, Bush DF, Levin IM, Last RL (2002) Arabidopsis map-based cloning in the post-genome era. Plant Physiology 129, 440–450.
Li P, Kupfer KC, Davies CJ, Burbee D, Evans GA, Garner HR. (1997) PRIMO: A Primer Design Program That Applies Base Quality Statistics for Automated Large-Scale DNA Sequencing. Genomics 40, 476–485.
Pedrosa A, Sandal N, Stougaard J, Schweitzer D, and Bachmair A. (2002) Chromosomal map of the model legume Lotus japonicus. Genetics 161, 1661–1672.
Pertea G, Huang X, Liang F, Antonescu V, Sultana R, Karamycheva S, Lee Y, White J, Cheung F, Parvizi B, Tsai J, and Quackenbush J. (2003). TIGR Gene Indices clustering tools (TGICL): a software system for fast clustering of large EST datasets. Bioinformatics 19, 651–652.
Quackenbush J, Liang F, Holt I, Pertea G, and Upton J. (2000) The TIGR Gene Indices: reconstruction and representation of expressed gene sequences. Nucleic Acids Research 28, 141–145.
Roy SW, Fedorov A, Gilbert W. (2003) Large-scale comparison of intron positions in mammalian genes shows intron loss but no gain. Proceedings of the National Academy of Sciences USA 100, 7158–7162.
Sachidanandam R, Weissman D, Schmidt SC, Kakol JM, Stein LD, Marth G, Sherry S, Mullikin JC, Mortimore BJ, Willey DL, Hunt SE, Cole CG, Coggill PC, Rice CM, Ning Z, Rogers J, Bentley DR, Kwok PY, Mardis ER, Yeh RT, Schultz B, Cook L, Davenport R, Dante M, Fulton L, Hillier L, Waterston RH, McPherson JD, Gilman B, Schaffner S, Van Etten WJ, Reich D, Higgins J, Daly MJ, Blumenstiel B, Baldwin J, Stange-Thomann N, Zody MC, Linton L, Lander ES, Altshuler D; International SNP Map Working Group (2001) A map of human genome sequence variation containing 1.42 million single nucleotide polymorphisms. Nature 409, 928–933.
Sandal N, Krusell L, Radutoiu S, Olbryt M, Pedrosa A, Stracke S, Parniske M, Bachmaier A, Sato S, Tabata S, Ketelsen T, and Stougaard J. (2002) A genetic linkage map of the model legume Lotus japonicus and strategies for fast mapping of new loci. Genetics 161, 1673–1683.
Young ND, Mudge J, and Ellis TH. (2003) Legume genomes: more than peas in a pod. Current Opinion in Plant Biology 6, 199–204.
Yu J, Hu S, Wang J, Wong GK, Li S, Liu B, Deng Y, Dai L, Zhou Y, Zhang X, Cao M, Liu J, Sun J, Tang J, Chen Y, Huang X, Lin W, Ye C, Tong W, Cong L, Geng J, Han Y, Li L, Li W, Hu G, Huang X, Li W, Li J, Liu Z, Li L, Liu J, Qi Q, Liu J, Li L, Li T, Wang X, Lu H, Wu T, Zhu M, Ni P, Han H, Dong W, Ren X, Feng X, Cui P, Li X, Wang H, Xu X, Zhai W, Xu Z, Zhang J, He S, Zhang J, Xu J, Zhang K, Zheng X, Dong J, Zeng W, Tao L, Ye J, Tan J, Ren X, Chen X, He J, Liu D, Tian W, Tian C, Xia H, Bao Q, Li G, Gao H, Cao T, Wang J, Zhao W, Li P, Chen W, Wang X, Zhang Y, Hu J, Wang J, Liu S, Yang J, Zhang G, Xiong Y, Li Z, Mao L, Zhou C, Zhu Z, Chen R, Hao B, Zheng W, Chen S, Guo W, Li G, Liu S, Tao M, Wang J, Zhu L, Yuan L, and Yang H. (2002) A draft sequence of the rice genome (Oryza sativa L ssp. indica). Science 296, 79–92.
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2005 Springer
About this chapter
Cite this chapter
Schauser, L., Subrahmanyam, S., Madsen, L.H., Sandal, N., Stougaard, J. (2005). An in silico strategy towards the development of legume genome anchor markers using comparative sequence analysis. In: Márquez, A.J. (eds) Lotus japonicus Handbook. Springer, Dordrecht. https://doi.org/10.1007/1-4020-3735-X_23
Download citation
DOI: https://doi.org/10.1007/1-4020-3735-X_23
Publisher Name: Springer, Dordrecht
Print ISBN: 978-1-4020-3734-4
Online ISBN: 978-1-4020-3735-1
eBook Packages: Biomedical and Life SciencesBiomedical and Life Sciences (R0)