Skip to main content

Methods for Microarray Data Analysis

  • Protocol
Microarrays

Part of the book series: Methods in Molecular Biology ((MIMB,volume 382))

Abstract

This chapter outlines a typical workflow for micraorray data analysis. It aims at explaining the background of the methods as this is necessary for deciding upon a specific numerical method to use and for understanding and interpreting the outcomes of the analyses. We focus on error handling, various steps during preprocessing (clipping, imputing missing values, normalization, and transformation of data), statistic tests for variable selection and the use of multiple hypothesis testing procedures, various metrics and clustering algorithms for hierarchical clustering, principles, and results from principal components analysis and discriminant analysis, partitioning, selforganizing map, K-nearest neighbor classifier, and the use of a neural network and a support vector machine for classification.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Protocol
USD 49.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 84.99
Price excludes VAT (USA)
  • Available as EPUB and PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 109.00
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info
Hardcover Book
USD 109.99
Price excludes VAT (USA)
  • Durable hardcover edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

References

  1. Troyanska, O., Cantor, M., Sherlock, G., et al. (2001) Missing value estimation methods for DNA microarrays. Bioinformatics 17, 250–525.

    Article  Google Scholar 

  2. Yang, Y. H., Dutoit, S. P., Luu, P., and Speed, T. P. (2001) Normalization for cDNA Microarray Data. Technical Report 589, Department of Statistics, UC Berkeley.

    Google Scholar 

  3. Yang, Y. H., Dutoit, S., Luu, P., et al. (2002) Normalization for cDNA microarray data: a robust composite method addressing single and multiple slide systematic variation. Nucleic Acids Res. 30, E15.

    Article  Google Scholar 

  4. Quackenbush, J. (2001) Computational analysis of microarray data. Nat. Rev. Genet. 2, 418–427.

    Article  CAS  Google Scholar 

  5. Quackenbush, J. (2002) Microarray data normalization and transformation. Nature Genetics Suppl. 32, 496–501.

    Article  CAS  Google Scholar 

  6. Dobson, J. D. (1992) Applied multivariate data analysis, vol. II: Categorical and Multivariate Methods. Berlin: Springer—Verlag, p 731.

    Google Scholar 

  7. Dudoit, S., Shaffer, J. P., and Boldrick, J. C. (2003) Multiple hypothesis testing in microarray experiments. Statist. Sci. 18, 71–103.

    Article  Google Scholar 

  8. Jain, A. K., Murty, M. N., and Flynn, P. J. (1999) Data clustering: a review. ACM Comput. Surveys 31, 264–323.

    Article  Google Scholar 

  9. Eisen, M. B., Spellman, P. T., Brown, P. O., and Botstein, D. (1998) Cluster analysis and display of genome-wide expression patterns. Proc. Natl. Acad. Sci. USA 95, 14,863–14,868.

    Article  CAS  Google Scholar 

  10. Ramaswamy, S., Tamayo, P., Rifkin, R., et al. (2001) Multiclass cancer diagnosis using tumor gene expression signatures. Proc. Natl. Acad. Sci. USA 98, 15,149–15,154.

    Article  CAS  Google Scholar 

  11. Alizadeh, A., Eisen, M., Davis, R. E., et al. (2000) Distinct types of diffuse large B-cell lymphoma identified by gene expression profiling. Nature 403, 503.

    Article  CAS  Google Scholar 

  12. Raychaudhuri, S., Stuart, J. M., and Altman, R. B. (2000) Principal Components Analysis to Summarize Microarray Experiments: Application to Sporulation Time Series. Pacific Symposium on Biocomputing, Honolulu, Hawaii, pp. 452–463.

    Google Scholar 

  13. Yeung, K. W. and Ruzzo, W. L. (2001) An empirical study on principal component analysis for clustering gene expression data. Bioinformatics 17, 763–774.

    Article  CAS  Google Scholar 

  14. Tamayo, P., Slonim, D., Mesirov, J., et al. (1999) Interpreting patterns of gene expression with self-organizing maps: Methods and application to hematopoietic differentiation. Proc. Natl. Acad. Sci. 96, 2907–2912.

    Article  CAS  Google Scholar 

  15. Golub, T. R., Slonim, D. K., Tamayo, P., et al. (1999) Molecular classification of cancer: class discovery and class prediction by gene expression monitoring. Science 286, 531–537.

    Article  CAS  Google Scholar 

  16. Kohonen, T. (2001) Self-Organizing Maps. Springer, Berlin, Germany.

    Google Scholar 

  17. Burges, C. (1998) A tutorial for support vector machines for pattern recognition. Data Mining Knowledge Discov. 2, 121–167.

    Article  Google Scholar 

  18. Hearst, M. A. (1998) Trends and controversies: support vector machines. IEEE Intell. Syst. 13, 18–28.

    Article  Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2007 Humana Press Inc., Totowa, NJ

About this protocol

Cite this protocol

De Bruyne, V., Al-Mulla, F., Pot, B. (2007). Methods for Microarray Data Analysis. In: Rampal, J.B. (eds) Microarrays. Methods in Molecular Biology, vol 382. Humana Press. https://doi.org/10.1007/978-1-59745-304-2_23

Download citation

  • DOI: https://doi.org/10.1007/978-1-59745-304-2_23

  • Publisher Name: Humana Press

  • Print ISBN: 978-1-58829-944-4

  • Online ISBN: 978-1-59745-304-2

  • eBook Packages: Springer Protocols

Publish with us

Policies and ethics