Bioconductor R Packages for Exploratory Analysis and Normalization of cDNA Microarray Data
- 1.4k Downloads
This chapter describes a collection of four R packages for exploratory analysis and normalization of two-color cDNA microarray fluorescence intensity data. R’s object-oriented class/method mechanism is exploited to allow efficient and systematic representation and manipulation of large microarray datasets of multiple types. The marrayClasses package contains class definitions and associated methods for pre- and postnormalization intensity data for batches of arrays. The marrayInput package provides functions and tcltk widgets to automate data input and the creation of microarray-specific R objects for storing these data. Functions for diagnostic plots of microarray spot statistics, such as boxplots, scatterplots, and spatial color images, are provided in marrayPlots. Finally, the marrayNorm package implements robust adaptive location and scale normalization procedures, which correct for different types of dye biases (e.g., intensity, spatial, plate biases) and allow the use of control sequences spotted onto the array and possibly spiked into the mRNA samples. The four new packages were developed as part of the Bioconductor project, which aims more generally to produce an open-source and open-development statistical computing framework for the analysis of genomic data.
KeywordsMedian Absolute Deviation Diagnostic Plot Color Palette Spot Statistic cDNA Microarray Data
Unable to display preview. Download preview PDF.
- Brazma A, Hingamp P, Quackenbush J, Sherlock G, Spellman P, Stoeckert C, Aach J, Ansorge W, Ball CA, Causton HC, Gaasterland T, Glenisson P, Holstege FCP, Kim IF, Markowitz V, Matese JC, Parkinson H, Robinson A, Sarkans U, Schulze-Kremer S, Stewart J, Taylor R, Vilo J, Vingron M (2001). Minimum information about a microarray experiment (MIAME)-toward standards for microarray data. Nature Genetics 29:365–371.CrossRefGoogle Scholar
- Brown PO, Botstein D (1999). Exploring the new world of the genome with DNA microarrays. In: The Chipping Forecast, volume 21, 33–37. Supplement to Nature Genetics.Google Scholar
- Buckley MJ (2000). The Spot user’s guide. CSIRO Mathematical and Information Sciences, Sydney, Australia. //www.cmis.csiro.au/IAP/Spot/spotmanual.htm.Google Scholar
- Ihaka R, Gentleman R (1996). R: A language for data analysis and graphics. Journal of Computational and Graphical Statistics 5:299–314.Google Scholar
- Leisch F (2002). Dynamic generation of statistical reports using literate data analysis. Technical Report 69, SFB Adaptive Information Systems and Modelling in Economics and Management Science, Vienna University of Economics and Business Administration: Vienna.Google Scholar
- Schena M (ed.) (2000). Microarray Biochip Technology. Eaton.Google Scholar
- Yang YH, Dudoit S, Luu P, Speed TP (2001). Normalization for cDNA microarray data. In: ML Bittner, Y Chen, AN Dorsel, ER Dougherty (eds.), Microarrays: Optical Technologies and Informatics, volume 4266 of Proceedings of SPIE, 141–152. SPIE: Bellingham, WA.Google Scholar