Calibrating Population Stratification in Association Analysis

Qin, Huaizhen; Zhu, Xiaofeng

doi:10.1007/978-1-4939-7274-6_21

Huaizhen Qin Ph.D.^3,4 &
Xiaofeng Zhu Ph.D.⁴

Part of the book series: Methods in Molecular Biology ((MIMB,volume 1666))

3387 Accesses
1 Citations

Abstract

In genetic association studies, it is necessary to correct for population structure to avoid inference bias. During the past decade, prevailing corrections often only involved adjustments of global ancestry differences between sampled individuals. Nevertheless, population structure may vary across local genomic regions due to the variability of local ancestries associated with natural selection, migration, or random genetic drift. Adjusting for global ancestry alone may be inadequate when local population structure is an important confounding factor. In contrast, adjusting for local ancestry can more effectively prevent false positives due to local population structure. To more accurately locate disease genes, we recommend adjusting for local ancestries by interrogating local structure. In practice, locus-specific ancestries are usually unknown and must be inferred. For recently admixed populations with known reference ancestral populations, locus-specific ancestries can be inferred accurately using some hidden Markov model-based methods. However, SNP-wise ancestries cannot be accurately inferred when ancestral population information is not available. For such scenarios, we propose employing local principal components (PCs) to present local ancestries and adjusting for local PCs when testing for gene–phenotype association.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Log in via an institution

Institutional subscriptions

References

Knowler WC, Williams RC, Pettitt DJ, Steinberg AG (1988) Gm3;5,13,14 and type 2 diabetes mellitus: an association in American Indians with genetic admixture. Am J Hum Genet 43:520–526
PubMed PubMed Central CAS Google Scholar
Lander ES, Schork NJ (1994) Genetic dissection of complex traits. Science 265:2037–2048
Article CAS PubMed Google Scholar
Devlin B, Roeder K (1999) Genomic control for association studies. Biometrics 55:997–1004
Article CAS PubMed Google Scholar
Pritchard JK, Stephens M, Rosenberg NA, Donnelly P (2000) Association mapping in structured populations. Am J Hum Genet 67:170–181
Article CAS PubMed PubMed Central Google Scholar
Satten GA, Flanders WD, Yang Q (2001) Accounting for unmeasured population substructure in case-control studies of genetic association using a novel latent-class model. Am J Hum Genet 68:466–477
Article CAS PubMed PubMed Central Google Scholar
Zhu X, Zhang S, Zhao H, Cooper RS (2002) Association mapping, using a mixture model for complex traits. Genet Epidemiol 23:181–196
Article PubMed Google Scholar
Zhang S, Zhu X, Zhao H (2003) On a semiparametric test to detect associations between quantitative traits and candidate genes using unrelated individuals. Genet Epidemiol 24:44–56
Article PubMed Google Scholar
Marchini J, Cardon LR, Phillips MS, Donnelly P (2004) The effects of human population structure on large genetic association studies. Nat Genet 36:512–517
Article CAS PubMed Google Scholar
Campbell CD et al (2005) Demonstrating stratification in a European American population. Nat Genet 37:868–872
Article CAS PubMed Google Scholar
Price AL, Patterson NJ, Plenge RM, Weinblatt ME, Shadick NA, Reich D (2006) Principal components analysis corrects for stratification in genome-wide association studies. Nat Genet 38:904–909
Article CAS PubMed Google Scholar
Zhu X et al (2008) A unified association analysis approach for family and unrelated samples correcting for stratification. Am J Hum Genet 82:352–365
Article CAS PubMed PubMed Central Google Scholar
Zhu X et al (2008) Admixture mapping and the role of population structure for localizing disease genes. Adv Genet 60:547–569
PubMed Google Scholar
Qin et al (2010) Interrogating local population structure for fine mapping in genome-wide association studies. Bioinformatics 26(23):2961–2968
Article CAS Google Scholar
Cavalli-Sforza LL, Bodmer WF (1999) The genetics of human populations. Dover, Mineola, New York
Google Scholar
Epstein MP et al (2007) A simple and improved correction for population stratification in case-control studies. Am J Hum Genet 80:921–930
Article CAS PubMed PubMed Central Google Scholar
Novembre J, Stephens M (2008) Interpreting principal component analyses of spatial population genetic variation. Nat Genet 40:646–649
Article CAS PubMed PubMed Central Google Scholar
Tang H et al (2007) Recent genetic selection in the ancestral admixture of Puerto Ricans. Am J Hum Genet 81(3):626–633
Article CAS PubMed PubMed Central Google Scholar
Genovese G et al (2010) Association of trypanolytic ApoL1 variants with kidney disease in African-Americans. Science 7:1–7
Google Scholar
Voight BF et al (2006) A map of recent positive selection in the human genome. PLoS Biol 4:e72
Article PubMed PubMed Central Google Scholar
Sabeti PC et al (2007) Genome-wide detection and characterization of positive selection in human populations. Nature 449:913–918
Article CAS PubMed PubMed Central Google Scholar
Crow JF, Kimura M (1970) An introduction to population genetics theory. Harper & Row, New York, pp 469–478
Google Scholar
Patterson N et al (2004) Methods for high-density admixture mapping of disease genes. Am J Hum Genet 74:979–1000
Article CAS PubMed PubMed Central Google Scholar
Tang H et al (2006) Reconstructing genetic ancestry blocks in admixed individuals. Am J Hum Genet 79:1–12
Article CAS PubMed PubMed Central Google Scholar
Zhu X et al (2006) A classical likelihood based approach for admixture mapping using EM algorithm. Hum Genet 120:431–445
Article PubMed Google Scholar
Sankararaman S et al (2008) Estimating local ancestry in admixed populations. Am J Hum Genet 82:290–303
Article CAS PubMed PubMed Central Google Scholar
Price AL et al (2009) Sensitive detection of chromosomal segments of distinct ancestry in admixed populations. PLoS Genet 5:e1000519
Article CAS PubMed PubMed Central Google Scholar
Guan Y (2014) Detecting Structure of Haplotypes and Local Ancestry. Genetics 196(3):625–642
Article PubMed PubMed Central Google Scholar
Kang SJ et al (2010) Genome wide association of anthropometric traits in African and African derived populations. Hum Mol Genet 19(13):2725–2738
Article CAS PubMed PubMed Central Google Scholar
Levy D et al (2009) Genome-wide association study of blood pressure and hypertension. Nat Genet 41:677–687
Article CAS PubMed PubMed Central Google Scholar
Chang CC, Chow CC, Tellier L, Vattikuti S, Purcell SM, Lee JJ (2015) Second-generation PLINK: rising to the challenge of larger and richer datasets. GigaScience 4(7):1–16
Google Scholar
Roshyara NR, M Scholz M (2014) AfcGENE: a versatile tool for processing and transforming SNP datasets. PLoS One 9(7):e9758
Article Google Scholar
Patterson N, Price AL, Reich D (2006) Population structure and eigenanalysis. PLoS Genet 2(12):2074–2093., e190. doi:10.1371/journal.pgen.0020190
Article CAS Google Scholar
Johnstone I (2001) On the distribution of the largest eigenvalue in principal components analysis. Ann Stat 29:295–327
Article Google Scholar
Zou F et al (2010) Quantification of population structure using correlated SNPs by shrinkage principal components. Hum Hered 70:9–22
Article CAS PubMed PubMed Central Google Scholar
Price AL et al (2010) New approaches to population stratification in genome-wide association studies. Nature Reviews 11:459–463
Article CAS PubMed Google Scholar

Download references

Acknowledgments

This work was funded in part by NHGRI grant HG003054 to X.Z. and by Tulane’s Committee on Research fellowship (600890) and Carol Lavin Bernick Faculty Grant (632119) to H.Q.

Author information

Authors and Affiliations

Department of Global Biostatistics and Data Science, Tulane University School of Public Health and Tropical Medicine, New Orleans, LA, 70112, USA
Huaizhen Qin Ph.D.
Department of Population and Quantitative Health Sciences, Case Western Reserve University School of Medicine, Cleveland, OH, 44106, USA
Huaizhen Qin Ph.D. & Xiaofeng Zhu Ph.D.

Authors

Huaizhen Qin Ph.D.
View author publications
You can also search for this author in PubMed Google Scholar
Xiaofeng Zhu Ph.D.
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Huaizhen Qin Ph.D. .

Editor information

Editors and Affiliations

Case Western Reserve University, Cleveland, OH, USA
Robert C. Elston

Rights and permissions

Reprints and permissions

Copyright information

About this protocol

Cite this protocol

Qin, H., Zhu, X. (2017). Calibrating Population Stratification in Association Analysis. In: Elston, R. (eds) Statistical Human Genetics. Methods in Molecular Biology, vol 1666. Humana Press, New York, NY. https://doi.org/10.1007/978-1-4939-7274-6_21

Download citation

DOI: https://doi.org/10.1007/978-1-4939-7274-6_21
Published: 05 October 2017
Publisher Name: Humana Press, New York, NY
Print ISBN: 978-1-4939-7273-9
Online ISBN: 978-1-4939-7274-6
eBook Packages: Springer Protocols

Publish with us

Policies and ethics