Abstract
The use of DNA microarrays and other gene expression analysis techniques throughout the biological sciences has put extremely large, complex datasets in the hands of biologists who, for the most part, are not formally trained in computational or statistical methods. The majority of gene expression datasets have extensive artifactual bias and/or noise, which are not apparent upon superficial inspection. The SNOMAD gene expression analysis tools are an effort to make important normalization and quality control methods available to a wide audience of biological scientists working with gene expression data. Methods available in the SNOMAD tools include background subtraction, global mean normalization, local mean normalization across absolute intensity, local variance correction across absolute intensity, and ratio correction across the physical surface of the microarray. The SNOMAD web-implementation, available free of charge to all researchers at http://pevsnerlab.kennedykrieger.org/snomad.htm provides these tools without the downloading or installation of additional software, and does not require users to have any statistical or computer programming expertise.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Preview
Unable to display preview. Download preview PDF.
References
Alter O, Brown PO, Botstein D (2000). Singular value decomposition for genome-wide expression data processing and modeling. Proceedings of the National Academy of Sciences USA, 97:10101–10106.
Beissbarth T, Fellenberg K, Brors B, Arribas-Prat R, Boer J, Hauser NC, Scheideler M, Hoheisel JD, Schutz G, Poustka A, Vingron M (2000). Processing and quality control of DNA array hybridization data. Bioinformatics, 16:1014–1022.
Brown MP, Grundy WN, Lin D, Cristianini N, Sugnet CW, Furey TS, Ares M, Haussler D (2000). Knowledge-based analysis of microarray gene expression data by using support vector machines. Proceedings of the National Academy of Sciences USA, 97:262–267.
Butte AJ, Tamayo P, Slonim D, Golub TR, Kohane IS (2000). Discovering functional relationships between RNA expression and chemotherapeutic susceptibility using relevance networks. Proceedings of the National Academy of Sciences USA, 97:12182–12186.
Butte AJ, Ye J, Haring HU, Stumvoll M, White MF, Kohane IS (2001). Determining significant fold differences in gene expression analysis. Pacific Symposium on Biocomputing, X:6–17.
Chu S, DeRisi J, Eisen M, Mulholland J, Botstein D, Brown PO, Herskowitz I (1998). The transcriptional program of sporulation in budding yeast. Science, 282:699–705.
Claverie JM (1999). Computational methods for the identification of differential and coordinated gene expression. Human Molecular Genetics, 8:1821–1832.
Cleveland WS (1981). Lowess: Program for smoothing scatterplots by robust locally weighted regression. The American Statistician, 35:54.
Colantuoni C, Jeon OH, Hyder K, Chenchik A, Khimani AH, Narayanan V, Hoffman EP, Kaufmann WE, Naidu S, Pevsner J (2001). Gene expression profiling in postmortem Rett Syndrome brain: Differential gene expression and patient classification. Neurobiology of Disease 8, 847–65.
Eickhoff B, Korn B, Schick M, Poustka A, van der Bosch J (1999). Normalization of array hybridization experiments in differential gene expression analysis. Nucleic Acids Research, 27:e33.
Eisen MB, Spellman PT, Brown PO, Botstein D (1998). Cluster analysis and display of genome-wide expression patterns. Proceedings of the National Academy of Sciences USA, 95:14863–14868.
Golub TR, Slonim DK, Tamayo P, Huard C, Gaasenbeek M, Mesirov JP, Coller H, Loh ML, Downing JR, Caligiuri MA, Bloomfield CD, Lander ES (1999). Molecular classification of cancer: Class discovery and class prediction by gene expression monitoring. Science, 286:531–537.
Hastie T, Tibshirani R (1990). Exploring the nature of covariate effects in the proportional hazards model. Biometrics 46:1005–1016
Hastie T, Tibshirani R, Eisen MB, Alizadeh A, Levy R, Staudt L, Chan WC, Botstein D, Brown P (2000). ‘Gene shaving’ as a method for identifying distinct sets of genes with similar expression patterns. Genome Biology, 1:research0003.
Hegde P, Qi R, Abernathy K, Gay C, Dharap S, Gaspard R, Hughes JE, Snesrud E, Lee N, Quackenbush J (2000). A concise guide to cDNA microarray analysis. Biotechniques, 29:548–550, 552–554, 556 passim.
Hilsenbeck SG, Friedrichs WE, Schiff R, O’Connell P, Hansen RK, Osborne CK, Fuqua SA (1999). Statistical analysis of array expression data as applied to the problem of tamoxifen resistance. Journal of the National Cancer Institute, 91:453–459.
Holter NS, Maritan A, Cieplak M, Fedoroff NV, Banavar JR (2001). Dynamic modeling of gene expression data. Proceedings of the National Academy of Sciences USA, 98:1693–1698.
Holter NS, Mitra M, Maritan A, Cieplak M, Banavar JR, Fedoroff NV (2000). Fundamental patterns underlying gene expression profiles: Simplicity from complexity. Proceedings of the National Academy of Sciences USA, 97:8409–8414.
Iyer VR, Eisen MB, Ross DT, Schuler G, Moore T, Lee JC, Trent JM, Staudt LM, Hudson J, Boguski MS, Lashkari D, Shalon D, Botstein D, Brown PO (1999). The transcriptional program in the response of human fibroblasts to serum. Science, 283:83–87.
Kadota K, Miki R, Bono H, Shimizu K, Okazaki Y, Hayashizaki Y (2001). Preprocessing implementation for microarray (prim): An efficient method for processing cDNA microarray data. Physiological Genomics, 4:183–188.
Khan J, Simon R, Bittner M, Chen Y, Leighton SB, Pohida T, Smith PD, Jiang Y, Gooden GC, Trent JM, Meltzer PS (1998). Gene expression profiling of alveolar rhabdomyosarcoma with cDNA microarrays. Cancer Research, 58:5009–5013.
Lee CK, Klopp RG, Weindruch R, Prolla TA (1999). Gene expression profile of aging and its retardation by caloric restriction. Science, 285:1390–1393.
Liao B, Hale W, Epstein CB, Butow RA, Garner HR (2000). MAD: A suite of tools for microarray data management and processing. Bioinformatics, 16:946–947.
Manduchi E, Grant GR, McKenzie SE, Overton GC, Surrey S, Stoeckert CJ (2000). Generation of patterns from gene expression data by assigning confidence to differentially expressed genes. Bioinformatics, 16:685–698.
Park PJ, Pagano M, Bonetti M (2001). A nonparametric scoring algorithm for identifying informative genes from microarray data. Pacific Symposium on Bio-computing, X:52–63.
Raychaudhuri S, Stuart JM, Altman RB (2000). Principal components analysis to summarize microarray experiments: Application to sporulation time series. Pacific Symposium on Biocomputing, 11:455–466.
Schadt EE, Li C, Su C, Wong WH (2000). Analyzing high-density oligonucleotide gene expression array data. Journal of Cellular Biochemistry, 80:192–202.
Schuchhardt J, Beule D, Malik A, Wolski E, Eickhoff H, Lehrach H, Herzel H (2000). Normalization strategies for cDNA microarrays. Nucleic Acids Research, 28:E47.
Smid-Koopman E, Blok LJ, Chadha-Ajwani S, Helmerhorst TJ, Brinkmann AO, Huikeshoven FJ (2000). Gene expression profiles of human endometrial cancer samples using a cDNA-expression array technique: Assessment of an analysis method. British Journal of Cancer, 83:246–251.
Toronen P, Kolehmainen M, Wong G, Castren E (1999). Analysis of gene expression data using self-organizing maps. FEBS Letters, 451:142–146.
Tseng GC, Oh MK, Rohlin L, Liao JC, Wong WH (2001). Issues in cDNA microarray analysis: Quality filtering, channel normalization, models of variations and assessment of gene effects. Nucleic Acids Research, 29:2549–2557
Tsien CL, Libermann TA, Gu X, Kohane IS (2001). On reporting fold differences. Pacific Symposium on Biocomputing, X:496–507.
Wittes J, Friedman HP (1999). Searching for evidence of altered gene expression: A comment on statistical analysis of microarray data. Journal of the National Cancer Institute, 91:400–401.
Yang YH, Dudoit S, Luu P, Lin DM, Peng V, Ngai J, Speed TP (2002). Normalization for cDNA microarray data: A robust composite method addressing single and multiple slide systematic variation. Nucleic Acids Research, 30:e15.
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2003 Springer-Verlag New York, Inc.
About this chapter
Cite this chapter
Colantuoni, C., Henry, G., Bouton, C.M.L.S., Zeger, S.L., Pevsner, J. (2003). Snomad: Biologist-Friendly Web Tools for the Standardization and NOrmalization of Microarray Data. In: Parmigiani, G., Garrett, E.S., Irizarry, R.A., Zeger, S.L. (eds) The Analysis of Gene Expression Data. Statistics for Biology and Health. Springer, New York, NY. https://doi.org/10.1007/0-387-21679-0_9
Download citation
DOI: https://doi.org/10.1007/0-387-21679-0_9
Publisher Name: Springer, New York, NY
Print ISBN: 978-0-387-95577-3
Online ISBN: 978-0-387-21679-9
eBook Packages: Springer Book Archive