, Volume 143, Issue 3, pp 299–304 | Cite as

A novel genomic selection method combining GBLUP and LASSO

  • Hengde Li
  • Jingwei Wang
  • Zhenmin Bao


Genetic prediction of quantitative traits is a critical task in plant and animal breeding. Genomic selection is an accurate and efficient method of estimating genetic merits by using high-density genome-wide single nucleotide polymorphisms (SNP). In the framework of linear mixed models, we extended genomic best linear unbiased prediction (GBLUP) by including additional quantitative trait locus (QTL) information that was extracted from high-throughput SNPs by using least absolute shrinkage selection operator (LASSO). GBLUP was combined with three LASSO methods—standard LASSO (SLGBLUP), adaptive LASSO (ALGBLUP), and elastic net (ENGBLUP)—that were used for detecting QTLs, and these QTLs were fitted as fixed effects; the remaining SNPs were fitted using a realized genetic relationship matrix. Simulations performed under distinct scenarios revealed that (1) the prediction accuracy of SLGBLUP was the lowest; (2) the prediction accuracies of ALGBLUP and ENGBLUP were equivalent to or higher than that of GBLUP, except under scenarios in which the number of QTLs was large; and (3) the persistence of prediction accuracy over generations was strongest in the case of ENGBLUP. Building on the favorable computational characteristics of GBLUP, ENGBLUP enables robust modeling and efficient computation to be performed for genomic selection.


Genomic selection Genomic best linear unbiased prediction Least absolute shrinkage selection operator Quantitative trait loci 



This project was supported financially by the National High-Tech R&D Program of China (863 program) (Grant No. 2012AA10A402).

Conflict of interest


Supplementary material

10709_2015_9826_MOESM1_ESM.docx (443 kb)
Supplementary material 1 (DOCX 442 kb)


  1. Chen CY, Misztal I, Aquilar I, Tsuruta S, Meuwissen THE, Aqqrey SE, Winq T, Muir WM (2011) Genome-wide marker-assisted selection combining all pedigree phenotypic information with genotypic data in one step: an example using broiler chickens. J Anim Sci 89:23–28CrossRefPubMedGoogle Scholar
  2. Christensen OF (2012) Compatibility of pedigree-based and marker-based relationship matrices for single-step genetic evaluation. Genet Sel Evol 44:37CrossRefPubMedCentralPubMedGoogle Scholar
  3. Core Team R (2012) R: a language and environment for statistical computing. R Foundation for Statistical Computing, ViennaGoogle Scholar
  4. Fernando RL, Grossman M (1989) Marker assisted selection using best linear unbiased prediction. Genet Sel Evol 21:467–477CrossRefPubMedCentralGoogle Scholar
  5. Forni S, Aguilar I, Misztal I (2011) Different genomic relationship matrices for single-step analysis using phenotypic, pedigree and genomic information. Genet Sel Evol 43:1CrossRefPubMedCentralPubMedGoogle Scholar
  6. Friedman J, Hastie T, Tibshirani R (2010) Regularization paths for generalized linear models via coordinate descent. J Stat Softw 33:1–22PubMedCentralPubMedGoogle Scholar
  7. Grattapaglia D, Resende M (2011) Genomic selection in forest tree breeding. Tree Genet Genomes 7:241–255CrossRefGoogle Scholar
  8. Hayes BJ, Goddard ME (2001) The distribution of the effects of genes affecting quantitative traits in livestock. Genet Sel Evol 33:209–229CrossRefPubMedCentralPubMedGoogle Scholar
  9. Hayes BJ, Bowman PJ, Chamberlain AJ, Goddard ME (2009) Invited review: genomic selection in dairy cattle: progress and challenges. J Dairy Sci 92:433–443CrossRefPubMedGoogle Scholar
  10. Henderson CR (1975) Best linear unbiased estimation and prediction under a selected model. Biometrics 31:423–447CrossRefPubMedGoogle Scholar
  11. Heslot N, Yang HS, Sorells ME, Jannick JL (2012) Genomic selection in plant breeding: a comparison of models. Crop Sci 52:146–160CrossRefGoogle Scholar
  12. Kraemer N, Schaefer J, Boulesteix AL (2009) Regularized estimation of large-scale gene regulatory networks using gaussian graphical models. BMC Bioinformatic 10:384Google Scholar
  13. Le Roy P, Filangi O, Demeure O, Elsen JM (2012) Comparison of analyses of the XVth QTLMAS common dataset III: genomic estimations of breeding values. BMC Proc 6:S3CrossRefPubMedCentralPubMedGoogle Scholar
  14. Luan T, Woolliams JA, Lien S, Kent M, Svendsen M, Meuwissen THE (2009) The accuracy of genomic selection in Norwegian Red cattle assessed by cross-validation. Genetics 183:1119–1126CrossRefPubMedCentralPubMedGoogle Scholar
  15. Madsen P, Jensen J (2000) A user’s guide to DMU. Danish Institute of Agricultural Sciences, Research Center Foulum, DenmarkGoogle Scholar
  16. Meuwissen THE, Hayes BJ, Goddard ME (2001) Prediction of total genetic value using genome-wide dense marker maps. Genetics 157:1819–1829PubMedCentralPubMedGoogle Scholar
  17. Moser G, Tier B, Crump R, Khatkar M, Raadsma H (2009) A comparison of five methods to predict genomic breeding values of dairy bulls from genome-wide SNP markers. Genet Sel Evol 41:56CrossRefPubMedCentralPubMedGoogle Scholar
  18. Pszczola M, Strabel T, Wolc A, Mucha S, Szydlowski M (2011) Comparison of analyses of the QTLMAS XIV common dataset I: genomic selection. BMC Proc 5:S1CrossRefPubMedCentralPubMedGoogle Scholar
  19. Stranden I, Christensen O (2011) Allele coding in genomic evaluation. Genet Sel Evol 43:25CrossRefPubMedCentralPubMedGoogle Scholar
  20. Usai MG, Goddard ME, Hayes BJ (2009) LASSO with cross-validation for genomic selection. Genet Res 91:427–436CrossRefGoogle Scholar
  21. VanRaden PM (2008) Efficient methods to compute genomic predictions. J Dairy Sci 91:4414–4423CrossRefPubMedGoogle Scholar

Copyright information

© Springer International Publishing Switzerland 2015

Authors and Affiliations

  1. 1.Centre for Applied Aquatic GenomicsChinese Academy of Fishery SciencesBeijingChina
  2. 2.College of Marine LifeOcean University of ChinaQingdaoChina
  3. 3.College of Animal ScienceFujian Agriculture and Forestry UniversityFuzhouChina

Personalised recommendations