Methods for Microarray Data Analysis

De Bruyne, Veronique; Al-Mulla, Fahd; Pot, Bruno

doi:10.1007/978-1-59745-304-2_23

Veronique De Bruyne²,
Fahd Al-Mulla³ &
Bruno Pot⁴

Part of the book series: Methods in Molecular Biology ((MIMB,volume 382))

1063 Accesses
7 Citations

Abstract

This chapter outlines a typical workflow for micraorray data analysis. It aims at explaining the background of the methods as this is necessary for deciding upon a specific numerical method to use and for understanding and interpreting the outcomes of the analyses. We focus on error handling, various steps during preprocessing (clipping, imputing missing values, normalization, and transformation of data), statistic tests for variable selection and the use of multiple hypothesis testing procedures, various metrics and clustering algorithms for hierarchical clustering, principles, and results from principal components analysis and discriminant analysis, partitioning, selforganizing map, K-nearest neighbor classifier, and the use of a neural network and a support vector machine for classification.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Log in via an institution

Protocol: USD 49.95; Price excludes VAT (USA)

eBook: USD 84.99; Price excludes VAT (USA)

Softcover Book: USD 109.00; Price excludes VAT (USA)

Hardcover Book: USD 109.99; Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

References

Troyanska, O., Cantor, M., Sherlock, G., et al. (2001) Missing value estimation methods for DNA microarrays. Bioinformatics 17, 250–525.
Article Google Scholar
Yang, Y. H., Dutoit, S. P., Luu, P., and Speed, T. P. (2001) Normalization for cDNA Microarray Data. Technical Report 589, Department of Statistics, UC Berkeley.
Google Scholar
Yang, Y. H., Dutoit, S., Luu, P., et al. (2002) Normalization for cDNA microarray data: a robust composite method addressing single and multiple slide systematic variation. Nucleic Acids Res. 30, E15.
Article Google Scholar
Quackenbush, J. (2001) Computational analysis of microarray data. Nat. Rev. Genet. 2, 418–427.
Article CAS Google Scholar
Quackenbush, J. (2002) Microarray data normalization and transformation. Nature Genetics Suppl. 32, 496–501.
Article CAS Google Scholar
Dobson, J. D. (1992) Applied multivariate data analysis, vol. II: Categorical and Multivariate Methods. Berlin: Springer—Verlag, p 731.
Google Scholar
Dudoit, S., Shaffer, J. P., and Boldrick, J. C. (2003) Multiple hypothesis testing in microarray experiments. Statist. Sci. 18, 71–103.
Article Google Scholar
Jain, A. K., Murty, M. N., and Flynn, P. J. (1999) Data clustering: a review. ACM Comput. Surveys 31, 264–323.
Article Google Scholar
Eisen, M. B., Spellman, P. T., Brown, P. O., and Botstein, D. (1998) Cluster analysis and display of genome-wide expression patterns. Proc. Natl. Acad. Sci. USA 95, 14,863–14,868.
Article CAS Google Scholar
Ramaswamy, S., Tamayo, P., Rifkin, R., et al. (2001) Multiclass cancer diagnosis using tumor gene expression signatures. Proc. Natl. Acad. Sci. USA 98, 15,149–15,154.
Article CAS Google Scholar
Alizadeh, A., Eisen, M., Davis, R. E., et al. (2000) Distinct types of diffuse large B-cell lymphoma identified by gene expression profiling. Nature 403, 503.
Article CAS Google Scholar
Raychaudhuri, S., Stuart, J. M., and Altman, R. B. (2000) Principal Components Analysis to Summarize Microarray Experiments: Application to Sporulation Time Series. Pacific Symposium on Biocomputing, Honolulu, Hawaii, pp. 452–463.
Google Scholar
Yeung, K. W. and Ruzzo, W. L. (2001) An empirical study on principal component analysis for clustering gene expression data. Bioinformatics 17, 763–774.
Article CAS Google Scholar
Tamayo, P., Slonim, D., Mesirov, J., et al. (1999) Interpreting patterns of gene expression with self-organizing maps: Methods and application to hematopoietic differentiation. Proc. Natl. Acad. Sci. 96, 2907–2912.
Article CAS Google Scholar
Golub, T. R., Slonim, D. K., Tamayo, P., et al. (1999) Molecular classification of cancer: class discovery and class prediction by gene expression monitoring. Science 286, 531–537.
Article CAS Google Scholar
Kohonen, T. (2001) Self-Organizing Maps. Springer, Berlin, Germany.
Google Scholar
Burges, C. (1998) A tutorial for support vector machines for pattern recognition. Data Mining Knowledge Discov. 2, 121–167.
Article Google Scholar
Hearst, M. A. (1998) Trends and controversies: support vector machines. IEEE Intell. Syst. 13, 18–28.
Article Google Scholar

Download references

Author information

Authors and Affiliations

Applied-Maths BVBA, Sint-Martens-Latem
Veronique De Bruyne
Department of Pathology, Molecular Pathology Division, Faculty of Medicine, Kuwait University, Safat, Kuwait
Fahd Al-Mulla
Applied-Maths BVBA, and Bacteriology of Ecosystems, Institut Pasteur de Lille (IBL), Lille Cedex, France
Bruno Pot

Authors

Veronique De Bruyne
View author publications
You can also search for this author in PubMed Google Scholar
Fahd Al-Mulla
View author publications
You can also search for this author in PubMed Google Scholar
Bruno Pot
View author publications
You can also search for this author in PubMed Google Scholar

Editor information

Editors and Affiliations

Beckman Coulter, Inc., Brea, CA
Jang B. Rampal

Rights and permissions

Reprints and permissions

Copyright information

About this protocol

Cite this protocol

De Bruyne, V., Al-Mulla, F., Pot, B. (2007). Methods for Microarray Data Analysis. In: Rampal, J.B. (eds) Microarrays. Methods in Molecular Biology, vol 382. Humana Press. https://doi.org/10.1007/978-1-59745-304-2_23

Download citation

DOI: https://doi.org/10.1007/978-1-59745-304-2_23
Publisher Name: Humana Press
Print ISBN: 978-1-58829-944-4
Online ISBN: 978-1-59745-304-2
eBook Packages: Springer Protocols

Publish with us

Policies and ethics