Abstract
In this chapter we will discuss genome wide association studies (GWAS) using SNP. GWAS present some challenges for biostatistics and bioinformatics—the sheer dimensionality of the data can create storage/retrieval and analysis problems. Quality control and data preprocessing are also important steps in GWAS. We will initially discuss basic database usage for data storage and handling and the main metrics for evaluating the quality of genotypes followed by how to perform a GWAS, multiple testing issues and how to visualize results.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
References
Albrechtsen, A., Nielsen, F. C., and Nielsen, R. Ascertainment biases in snp chips affect measures of population divergence. Mol Biol Evol 27, 11 (2010), 2534–47.
Ball, R. D. Designing a gwas: power, sample size, and data structure. Methods in Molecular Biology 1019 (2013), 37–98.
Bush, W. S., and Moore, J. H. Chapter 11: Genome-wide association studies. PLoS Computational Biology 8, 12 (2012), e1002822.
Dudoit, S., and van der Laan, M. J. Multiple Testing Procedures with Applications to Genomics. Springer, New York, 2008.
Goddard, M. Genomic selection: prediction of accuracy and maximisation of long term response. Genetica 136, 2 (2009), 245–57.
Gondro, C., Lee, S. H., Lee, H. K., and Porto-Neto, L. R. Quality control for genome-wide association studies. Methods Mol Biol 1019 (2013), 129–47.
Gondro, C., Porto-Neto, L. R., and Lee, S. H. R for genome-wide association studies. Methods Mol Biol 1019 (2013), 1–17.
Gondro, C., Porto-Neto, L. R., and Lee, S. H. snpqc - an r pipeline for quality control of illumina snp genotyping array data. Anim Genet 45, 5 (2014), 758–61.
Huang, X., and Han, B. Natural variations and genome-wide association studies in crop plants. Annu Rev Plant Biol 65 (2014), 531–51.
Klein, R. J. Power analysis for genome-wide association studies. BMC Genetics 8 (2007), 58.
Kruglyak, L. The road to genome-wide association studies. Nature Reviews Genetics 9, 4 (2008), 314–318.
Lachance, J., and Tishkoff, S. A. Snp ascertainment bias in population genetic analyses: Why it is important, and how to correct it. Bioessays 35, 9 (2013), 780–6.
Neale, B. M., and Purcell, S. The positives, protocols and perils of genome-wide association. American Journal of Medical Genetics Part B 147B, 7 (2008), 1288–1294.
Ramalho, J. A. Learn SQL. Wordware Publishing, Plano, USA, 2000.
Rincon, G., Weber, K. L., Eenennaam, A. L., Golden, B. L., and Medrano, J. F. Hot topic: performance of bovine high-density genotyping platforms in Holsteins and Jerseys. J Dairy Sci 94, 12 (2011), 6116–21.
Spencer, C. C. A., Su, Z., Donnelly, P., and Marchini, J. Designing genome-wide association studies: Sample size, power, imputation, and the choice of genotyping chip. PLoS Genetics 5, 5 (2009), e1000477.
Teo, Y. Y. Common statistical issues in genome-wide association studies: a review on power, data quality control, genotype calling and population structure. Current Opinion in Lipidology 19, 2 (2008), 133–143.
Ziegler, A., Konig, I. R., and Thompson, J. R. Biostatistical aspects of genome-wide association studies. Biometrical Journal of Statistical Software 50, 1 (2008), 8–28.
Author information
Authors and Affiliations
3.1 Electronic supplementary material
Rights and permissions
Copyright information
© 2015 Springer International Publishing Switzerland
About this chapter
Cite this chapter
Gondro, C. (2015). Genome Wide Association Studies. In: Primer to Analysis of Genomic Data Using R. Use R!. Springer, Cham. https://doi.org/10.1007/978-3-319-14475-7_3
Download citation
DOI: https://doi.org/10.1007/978-3-319-14475-7_3
Publisher Name: Springer, Cham
Print ISBN: 978-3-319-14474-0
Online ISBN: 978-3-319-14475-7
eBook Packages: Mathematics and StatisticsMathematics and Statistics (R0)