Abstract
This chapter outlines a typical workflow for micraorray data analysis. It aims at explaining the background of the methods as this is necessary for deciding upon a specific numerical method to use and for understanding and interpreting the outcomes of the analyses. We focus on error handling, various steps during preprocessing (clipping, imputing missing values, normalization, and transformation of data), statistic tests for variable selection and the use of multiple hypothesis testing procedures, various metrics and clustering algorithms for hierarchical clustering, principles, and results from principal components analysis and discriminant analysis, partitioning, selforganizing map, K-nearest neighbor classifier, and the use of a neural network and a support vector machine for classification.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
References
Troyanska, O., Cantor, M., Sherlock, G., et al. (2001) Missing value estimation methods for DNA microarrays. Bioinformatics 17, 250–525.
Yang, Y. H., Dutoit, S. P., Luu, P., and Speed, T. P. (2001) Normalization for cDNA Microarray Data. Technical Report 589, Department of Statistics, UC Berkeley.
Yang, Y. H., Dutoit, S., Luu, P., et al. (2002) Normalization for cDNA microarray data: a robust composite method addressing single and multiple slide systematic variation. Nucleic Acids Res. 30, E15.
Quackenbush, J. (2001) Computational analysis of microarray data. Nat. Rev. Genet. 2, 418–427.
Quackenbush, J. (2002) Microarray data normalization and transformation. Nature Genetics Suppl. 32, 496–501.
Dobson, J. D. (1992) Applied multivariate data analysis, vol. II: Categorical and Multivariate Methods. Berlin: Springer—Verlag, p 731.
Dudoit, S., Shaffer, J. P., and Boldrick, J. C. (2003) Multiple hypothesis testing in microarray experiments. Statist. Sci. 18, 71–103.
Jain, A. K., Murty, M. N., and Flynn, P. J. (1999) Data clustering: a review. ACM Comput. Surveys 31, 264–323.
Eisen, M. B., Spellman, P. T., Brown, P. O., and Botstein, D. (1998) Cluster analysis and display of genome-wide expression patterns. Proc. Natl. Acad. Sci. USA 95, 14,863–14,868.
Ramaswamy, S., Tamayo, P., Rifkin, R., et al. (2001) Multiclass cancer diagnosis using tumor gene expression signatures. Proc. Natl. Acad. Sci. USA 98, 15,149–15,154.
Alizadeh, A., Eisen, M., Davis, R. E., et al. (2000) Distinct types of diffuse large B-cell lymphoma identified by gene expression profiling. Nature 403, 503.
Raychaudhuri, S., Stuart, J. M., and Altman, R. B. (2000) Principal Components Analysis to Summarize Microarray Experiments: Application to Sporulation Time Series. Pacific Symposium on Biocomputing, Honolulu, Hawaii, pp. 452–463.
Yeung, K. W. and Ruzzo, W. L. (2001) An empirical study on principal component analysis for clustering gene expression data. Bioinformatics 17, 763–774.
Tamayo, P., Slonim, D., Mesirov, J., et al. (1999) Interpreting patterns of gene expression with self-organizing maps: Methods and application to hematopoietic differentiation. Proc. Natl. Acad. Sci. 96, 2907–2912.
Golub, T. R., Slonim, D. K., Tamayo, P., et al. (1999) Molecular classification of cancer: class discovery and class prediction by gene expression monitoring. Science 286, 531–537.
Kohonen, T. (2001) Self-Organizing Maps. Springer, Berlin, Germany.
Burges, C. (1998) A tutorial for support vector machines for pattern recognition. Data Mining Knowledge Discov. 2, 121–167.
Hearst, M. A. (1998) Trends and controversies: support vector machines. IEEE Intell. Syst. 13, 18–28.
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2007 Humana Press Inc., Totowa, NJ
About this protocol
Cite this protocol
De Bruyne, V., Al-Mulla, F., Pot, B. (2007). Methods for Microarray Data Analysis. In: Rampal, J.B. (eds) Microarrays. Methods in Molecular Biology, vol 382. Humana Press. https://doi.org/10.1007/978-1-59745-304-2_23
Download citation
DOI: https://doi.org/10.1007/978-1-59745-304-2_23
Publisher Name: Humana Press
Print ISBN: 978-1-58829-944-4
Online ISBN: 978-1-59745-304-2
eBook Packages: Springer Protocols