Abstract
Microarray experiments are complex, multistep processes that represent a considerable investment of time and resources. Proper experimental design and analysis are critical to the success of a microarray experiment, and must be considered early in the planning of the experiment. Many aspects of experimental design from low-throughput experiments, such as randomization, replication, and blocking, remain applicable to microarray experiments as well. Similarly, the analysis of variance (ANOVA) remains a valid approach for analyzing data from most microarray experiments. However, the high-dimensional nature of microarrays introduces additional considerations into the design and analysis. This chapter provides an overview of the unique statistical challenges presented by microarrays and describes computational methods for implementing these statistical algorithms.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
References
Altman, N. S., & Hua, J. (2006). Extending the loop design for two-channel microarray experiments. Genetical Research, 88, 153–163.
Bailey, R. A. (2007). Designs for two-colour microarray experiments. Applied Statistics, 56(4), 365–394.
Baldi, P., & Long, A. D. (2001). A Bayesian framework for the analysis of microarray expression data: Regularized t-test and statistical inferences of gene changes. Bioinformatics, 17(6), 509–519.
Bar-Joseph, Z. (2004). Analyzing time series gene expression data. Bioinformatics, 20, 2493–2503.
Buck, M. J., & Lieb, J. D. (2004). Chip-chip: Considerations for the design, analysis, and application of genome-wide chromatin immunoprecipitation experiments. Genomics, 83, 349–360.
Bueno Filho, J. S., Gilmour, S. G., & Rosa, G. J. (2006). Design of microarray experiments for genetical genomics studies. Genetics, 174, 945–957.
Chai F.-S., Liao C.-T., & Tsai S.-F. (2007). Statistical designs for two-color spotted microarray experiments. Biometrical Journal, 49(2), 259–271.
Churchill, G. A. (2002). Fundamentals of experimental design for cDNA microarrays. Nature Genetics, 32 Suppl, 490–495.
Cox, D. R. (1958). Planning of experiments. New York: Wiley.
Cui, X., & Churchill, G. A. (2003). How many mice and how many arrays? replication of cDNA microarray experiments. In M. L. Simon & T. A. Emily (Eds.), Methods of microarray data analysis III. New York: Kluwer Academic Publishers.
Cui, X., & Churchill, G. A. (2003). Statistical tests for differential expression in cDNA microarray experiments. Genome Biology, 4(4), 210.
Cui, X., Hwang, J. T., Qiu, J., Blades, N. J., & Churchill, G. A. (2005). Improved statistical tests for differential gene expression by shrinking variance components estimates. Biostatistics, 6(1), 59–75.
de Koning, D. J., & Haley, C. S. (2005). Genetical genomics in humans and model organisms. Trends in Genetics, 21, 377–381.
Dobbin, K., Shih, J. H., & Simon, R. (2003). Statistical design of reverse dye microarrays. Bioinformatics, 19(7), 803–810.
Dobbin, K., & Simon, R. (2002). Comparison of microarray designs for class comparison and class discovery. Bioinformatics, 18, 1438–1445.
Dobbin, K. K., Kawasaki, E. S., Petersen, D. W., & Simon, R. M. (2005). Characterizing dye bias in microarray experiments. Bioinformatics, 21, 2430–2437.
Efron, B., Tibshirani, R., Storey, J. D., & Tusher, V. (2001). Empirical Bayes analysis of a microarray experiment. Journal of American Statistical Association, 96(456), 1151–1160.
Fan, J., Chen, Y., Chan, H. M., Tam, P. K. H., & Ren, Y. (2005). Removing intensity effects and identifying significant genes for affymetrix arrays in macrophage migration inhibitory factor-suppressed neuroblastoma cells. Proceedings of the National Academy of Sciences of the United States of America, 102(49), 17,751–17,756.
Firestein, G. S., & Pisetsky, D. S. (2002). DNA microarrays: Boundless technology or bound by technology? Guidelines for studies using microarray technology. Arthritis & Rheumatism, 46(4), 859–861.
Fisher, R. A. (1926). The arrangement of field experiments. Journal of the Ministry of Agriculture of Great Britain, 33, 503–513.
Fisher, R. A. (1935). The design of experiments. Edinburgh: Oliver and Boyd.
Fu, J., & Jansen, R. C. (2006). Optimal design and analysis of genetic studies on gene expression. Genetics, 172, 1993–1999.
Gibson, G., & Weir, B. (2005). The quantitative genetics of transcription. Genetics, 21, 616–623.
Jafari, P., & Azuaje, F. (2006). An assessment of recently published gene expression data analyses: Reporting experimental design and statistical factors. BMC Medical Informatics and Decision Making, 6, 27.
Kendziorski, C. M., Irizarry, R. A., Chen, K. S., Haag, J. D., & Gould, M. N. (2005). On the utility of pooling biological samples in microarray experiments. Proceedings of the National Academy of Sciences of the United of States America, 102, 4252–4257.
Kendziorski, C. M., Newton, M. A., Lan, H., & Gould, M. N. (2003). On parametric empirical Bayes methods for comparing multiple groups using replicated gene expression profiles. Statistics in Medicine, 22(24), 3899–3914.
Kendziorski, C. M., Zhang, Y., Lan, H., & Attie, A. D. (2003). The efficiency of pooling mRNA in microarray experiments. Biostatistics, 4, 465–477.
Kennedy, W. J, & Gentle, J. E. (1980). Statistical computing. New York: Marcel Dekker.
Kerr, M. K. (2003). Design considerations for efficient and effective microarray studies. Biometrics, 59, 822–828.
Kerr, M. K., & Churchill, G. A. (2001). Experimental design for gene expression microarrays. Biostatistics, 2, 183–201.
Kerr, M. K., & Churchill, G. A. (2001). Statistical design and the analysis of gene expression microarray data. Genetical Research, 77(2), 123–128.
Khanin, R., & Wit, E. (2005). Design of large time-course microarray experiments with two channels. Applied Bioinformatics, 4, 253–261.
Kuehl, R. O. (2000). Design of experiments: Statistical principles of research design and analysis. New York: Duxbury Press.
Lonnstedt, I., & Speed, T. (2002). Replicated microarray data. Statistica Sinica, 12, 31–46.
Monahan, J. F. (2001). Numerical methods of statistics. New York: Cambridge.
Newton, M. A., Kendziorski, C. M., Richmond, C. S., Blattner, F. R., & Tsui, K. W. (2001). On differential variability of expression ratios: Improving statistical inference about gene expression changes from microarray data. Journal of Computational Biology, 8(1), 37–52.
Nguyen N.-K., & Williams, E. R. (2006). Experimental designs for 2-colour cDNA microarray experiments. Applied Stochastic Models in Business and Industry, 22, 631–638.
Ponzielli, R., Boutros, P. C., Katz, S., Stojanova, A., Hanley, A. P., Khosravi, F., Bros, C., Jurisica, I., & Penn, L. Z. (2008). Optimization of experimental design parameters for high–throughput chromatin immunoprecipitation studies. Nucleic Acids Research, 36, e144.
Quackenbush, J. (2006). Microarray analysis and tumor classification. New England Journal of Medicine, 354, 2463–2472.
Segal, M. R., Dahlquist, K. D., & Conklin, B. R. (2003). Regression approaches for microarray data analysis. Journal of Computational Biology, 10, 961–980.
Shih, J. H., Michalowska, A. M., Dobbin, K., Ye, Y., Qiu, T. H., & Green, J. E. (2004). Effects of pooling mRNA in microarray class comparisons. Bioinformatics, 20, 3318–3325.
Simon, R. (2003). Diagnostic and prognostic prediction using gene expression profiles in high-dimensional microarray data. British Journal of Cancer, 89, 1599–1604.
Smyth, G. K. (2004). Linear models and empirical Bayes methods for assessing differential expression in microarray experiments. Statistical Applications in Genetics and Molecular Biology, 3, Article3.
Smyth, G. K. (2005). Limma: Linear models for microarray data. In R. Gentleman, V. Carey, S. Dudoit, R. Irizarry, & W. Huber (Eds.), Bioinformatics and computational biology solutions using R and bioconductor (pp. 397–420). New York: Springer.
Steibel, J. P., & Rosa, G. J. (2005). On reference designs for microarray experiments. Statistical Applications in Genetics and Molecular Biology, 4, Article36.
Storey, J. D., & Tibshirani, R. (2003). SAM thresholding and false discovery rates for detecting differential gene expression in DNA microarrays. In G. Parmigiani, E. S. Garret, R. Irizarry, & S. Zeger (Eds.), The analysis of gene expression data: An overview of methods and software (pp. 272–290). New York: Springer.
Tusher, V. G., Tibshirani, R., & Chu, G. (2001). Significance analysis of microarrays applied to the ionizing radiation response. Proceedings of the National Academy of Sciences of the United States of America, 98, 5116–5121.
Wit, E., Nobile, A., & Khanin, R. (2005). Near-optimal designs for dual channel microarray studies. Applied Statistics, 54, 817–830.
Wright, G. W., & Simon, R. M. (2003). A random variance model for detection of differential gene expression in small microarray experiments. Bioinformatics, 19(18), 2448–2455.
Xie, Y., Pan, W., & Khodursky, A. B. (2005). A note on using permutation-based false discovery rate estimates to compare different analysis methods for microarray data. Bioinformatics, 21(23), 4280–4288.
Yang, H., & Churchill, G. A. (2007). Estimating p-values in small microarray experiments. Bioinformatics, 23(1), 38–43.
Yang, I. V., Chen, E., Hasseman, J. P., Liang, W., Frank, B. C., Wang, S., Sharov, V., Saeed, A. I., White, J., Li, J., Lee, N. H., Yeatman, T. J., & Quackenbush, J. (2002). Within the fold: Assessing differential expression measures and reproducibility in microarray assays. Genome Biology, 3(11), research0062.
Yang, Y. H., & Speed, T. (2002). Design issues for cDNA microarray experiments. Nature Reviews. Genetics, 3(8), 579–588.
Yates, F. (1936). Incomplete randomized blocks. Annals of Eugenics, 7, 121–140.
Yates, F. (1936). A new method of arranging variety trials involving a large number of varieties. Journal of Agricultural Science, 26, 424–455.
Zakharkin, S. O., Kim, K., Mehta, T., Chen, L., Barnes, S., Scheirer, K. E., Parrish, R. S., Allison, D. B., & Page, G. P. (2005). Sources of variation in Affymetrix microarray experiments. BMC Bioinformatics, 6, 214.
Zhang, W., Carriquiry, A., Nettleton, D., & Dekkers, J. C. (2007). Pooling mRNA in microarray experiments and its effect on power. Bioinformatics, 23, 1217–1224.
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2011 Springer-Verlag Berlin Heidelberg
About this chapter
Cite this chapter
Kennedy, R.E., Cui, X. (2011). Experimental Designs and ANOVA for Microarray Data. In: Lu, HS., Schölkopf, B., Zhao, H. (eds) Handbook of Statistical Bioinformatics. Springer Handbooks of Computational Statistics. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-16345-6_8
Download citation
DOI: https://doi.org/10.1007/978-3-642-16345-6_8
Published:
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-642-16344-9
Online ISBN: 978-3-642-16345-6
eBook Packages: Mathematics and StatisticsMathematics and Statistics (R0)