Abstract
Population stratification (PS) and correcting for PS are studied in Chap. 9. The chapter starts with an introduction to population structure and its impact on inference using the trend test. Different models of PS are given. Methods to correct for PS are discussed, including genomic control, structural association, principal component clustering , and multidimensional scaling plots. How to select marker loci to correct for PS is discussed. Comparison of the several methods is reported using simulations. How to simulate case-control data in the presence of PS is given.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
References
Bacanu, S.A., Devlin, B., Roeder, K.: The power of genomic control. Am. J. Hum. Genet. 66, 1933–1944 (2000)
Balding, D.J., Nichols, R.A.: A method for quantifying differentiation between populations at multi-allelic loci and its implications for investigating identity and paternity. Genetica 96, 3–12 (1995)
Campbell, C.D., Ogburn, E.L., Lunetta, K.L., Lyon, H.N., Freedman, M.L., Groop, L.C., Altshuler, D., Ardlie, K.G., Hirschhorn, J.N.: Demonstrating stratification in a European American population. Nat. Genet. 37, 868–872 (2005)
Cavalli-Sforza, L.L., Menozzi, P., Piazza, A.: The History and Geography of Human Genes. Princeton University Press, Princeton (1994)
Chen, H.S., Zhu, X., Zhao, H., Zhang, S.: Qualitative semi-parametric test for genetic associations in case-control designs under structured populations. Ann. Hum. Genet. 67, 250–264 (2003)
Crow, J.F., Kimura, H.: An Introduction to Population Genetics Theory. Burgess Publication Co, Minneapolis (1970)
Dadd, T., Lewis, C.M., Weale, M.E.: Delta-centralization fails to control for population stratification in genetic association studies. Hum. Hered. 69, 285–294 (2009)
Devlin, B., Roeder, K.: Genomic control for association studies. Biometrics 55, 997–1004 (1999)
Elandt-Johnson, R.C.: Probability Models and Statistical Methods in Genetics. Wiley, New York (1971)
Epstein, M.P., Allen, A.S., Satten, G.A.: A simple and improved correction for population stratification in case-control studies. Am. J. Hum. Genet. 80, 921–930 (2007)
Falush, D., Stephens, M., Pritchard, J.K.: Inference of population structure using multilocus genotype data: linked loci and correlated allele frequencies. Genetics 164, 1567–1587 (2003)
Genovese, G., Friedman, D.J., Ross, M.D., Lecordier, L., Uzureau, P., Freedman, B.I., Bowden, D.W., Langefeld, C.D., Oleksyk, T.K., Uscinski Knob, A.L., Bernhardy, A.J., Hicks, P.J., Nelson, G.W., Vanhollebeke, B., Winkler, C.A., Kopp, J.B., Pays, E., Pollak, M.R.: Association of Trypanolytic ApoL1 variants with kidney disease in African-Americans. Science 7, 1–7 (2010)
Gorroochurn, P., Heiman, G.A., Hodge, S.E., Greenberg, D.A.: Centralizing the non-central chi-square: A new method to correct for population stratification in genetic case-control association studies. Genet. Epidemiol. 30, 277–289 (2006)
Gorroochurn, P., Hodge, S.E., Heiman, G.A., Greenberg, D.A.: A unified approach for quantifying, testing and correcting population stratification in case-control association studies. Hum. Hered. 64, 149–159 (2007)
Kang, H.M., Sul, J.H., Service, S.K., Zaitlen, N.A., Kong, S.Y., Freimer, N.B., Sabatti, C., Eskin, E.: Variance component model to account for sample structure in genome-wide association studies. Nat. Genet. 42, 348–354 (2010)
Knowler, W.C., Williams, R.C., Pettitt, D.J., Steinberg, A.G.: Gm3;5,13,14 and type 2 diabetes mellitus: an association in American Indians with genetic admixture. Am. J. Hum. Genet. 43, 520–526 (1988)
Li, J.Z., Absher, D.M., Tang, H., Southwick, A.M., Casto, A.M., Ramachandran, S., Cann, H.M., Barsh, G.S., Feldman, M., Cavalli-Sforza, L.L., Myers, R.M.: Worldwide human relationships inferred from genome-wide patterns of variation. Science 319, 1100–1104 (2008)
Marchini, J., Cardon, L.R., Phillips, M.S., Donnelly, P.: The effects of human population structure on large genetic association studies. Nat. Genet. 36, 512–517 (2004)
Price, A.L., Patterson, N.J., Plenge, R.M., Weinblatt, M.E., Shadick, N.A., Reich, D.: Principal components analysis corrects for stratification in genome-wide association studies. Nat. Genet. 38, 904–909 (2006)
Price, A.L., Zaitlen, N.A., Reich, D., Patterson, N.: New approaches to population stratification in genome-wide association studies. Nat. Rev. Genet. 11, 459–463 (2010)
Pritchard, J.K., Stephens, M., Rosenberg, N.A., Donnelly, P.: Association mapping in structured populations. Am. J. Hum. Genet. 67, 170–181 (2000)
Qin, H., Morris, N., Kang, S.J., Li, M., Tayo, B., Lyon, H., Hirschhorn, J.N., Cooper, R.S., Zhu, X.: Interrogating local population structure for fine mapping in genome wide association studies. Bioinformatics 26, 2961–2968 (2010)
Reich, D.E., Goldstein, D.B.: Detecting association in a case-control study while correcting for population stratification. Genet. Epidemiol. 20, 4–16 (2001)
Rosenberg, N.A., Nordborg, M.: A general population-genetic model for the production by population structure of spurious genotype-phenotype associations in discrete, admixed or spatially distributed populations. Genetics 173, 1665–1678 (2006)
Rosenberg, N.A., Pritchard, J.K., Weber, J.L., Cann, H.M., Kidd, K.K., Zhivotovsky, L.A., Feldman, M.W.: Genetic structure of human populations. Science 298, 2381–2385 (2002)
Satten, G.A., Flanders, W.D., Yang, Q.: Accounting for unmeasured population substructure in case-control studies of genetic association using a novel latent-class model. Am. J. Hum. Genet. 68, 466–477 (2001)
Tang, H., Choudhry, S., Mei, R., Morgan, M., Rodriguez-Cintron, W., Burchard, E.G., Risch, N.J.: Recent genetic selection in the ancestral admixture of Puerto Ricans. Am. J. Hum. Genet. 81, 626–633 (2007)
Tang, H., Peng, J., Wang, P., Risch, N.J.: Estimation of individual admixture: analytical and study design considerations. Genet. Epidemiol. 28, 289–301 (2005)
Tang, H., Quertermous, T., Rodriguez, B., Kardia, S.L., Zhu, X., Brown, A., Pankow, J.S., Province, M.A., Hunt, S.C., Boerwinkle, E., Schork, N.J., Risch, N.J.: Genetic structure, self-identified race/ethnicity, and confounding in case-control association studies. Am. J. Hum. Genet. 76, 268–275 (2005)
Voight, B.F., Pritchard, J.K.: Confounding from cryptic relatedness in case-control association studies. PLOS Genet. 1(3), e32 (2005)
Whittemore, A.S.: Population structure in genetic association studies. In: 2006 Proceedings of the American Statistical Association, ASA Section on Statistics in Epidemiology [CD-ROM], ASA, Alexandria, VA, pp. 2657–2667 (2006)
Zhang, S., Zhu, X., Zhao, H.: On a semiparametric test to detect associations between quantitative traits and candidate genes using unrelated individuals. Genet. Epidemiol. 24, 44–56 (2003)
Zhang, Z., Ersoz, E., Lai, C.Q., Todhunter, R.J., Tiwari, H.K., Gore, M.A., Bradbury, P.J., Yu, J., Arnett, D.K., Ordovas, J.M., Buckler, E.S.: Mixed linear model approach adapted for genome-wide association studies. Nat. Genet. 42, 355–360 (2010)
Zheng, G., Freidlin, B., Gastwirth, J.L.: Robust genomic control for association studies. Am. J. Hum. Genet. 78, 350–356 (2006)
Zheng, G. Li, Z., Gail, M.H., Gastwirth, J.L.: Impact of population substructure on trend tests for genetic case-control association studies. Biometrics 66, 196–204 (2010)
Zhu, X., Li, S., Cooper, R.S., Elston, R.C.: A unified association analysis approach for family and unrelated samples correcting for stratification. Am. J. Hum. Genet. 82, 352–365 (2008)
Zhu, X., Zhang, S., Zhao, H., Cooper, R.S.: Association mapping, using a mixture model for complex traits. Genet. Epidemiol. 23, 181–196 (2002)
Author information
Authors and Affiliations
Corresponding author
Rights and permissions
Copyright information
© 2012 Springer Science+Business Media, LLC
About this chapter
Cite this chapter
Zheng, G., Yang, Y., Zhu, X., Elston, R.C. (2012). Population Structure. In: Analysis of Genetic Association Studies. Statistics for Biology and Health. Springer, Boston, MA. https://doi.org/10.1007/978-1-4614-2245-7_9
Download citation
DOI: https://doi.org/10.1007/978-1-4614-2245-7_9
Publisher Name: Springer, Boston, MA
Print ISBN: 978-1-4614-2244-0
Online ISBN: 978-1-4614-2245-7
eBook Packages: Mathematics and StatisticsMathematics and Statistics (R0)