Snomad: Biologist-Friendly Web Tools for the Standardization and NOrmalization of Microarray Data

Colantuoni, Carlo; Henry, George; Bouton, Christopher M. L. S.; Zeger, Scott L.; Pevsner, Jonathan

doi:10.1007/0-387-21679-0_9

Part of the book series: Statistics for Biology and Health (SBH)

Abstract

The use of DNA microarrays and other gene expression analysis techniques throughout the biological sciences has put extremely large, complex datasets in the hands of biologists who, for the most part, are not formally trained in computational or statistical methods. The majority of gene expression datasets have extensive artifactual bias and/or noise, which are not apparent upon superficial inspection. The SNOMAD gene expression analysis tools are an effort to make important normalization and quality control methods available to a wide audience of biological scientists working with gene expression data. Methods available in the SNOMAD tools include background subtraction, global mean normalization, local mean normalization across absolute intensity, local variance correction across absolute intensity, and ratio correction across the physical surface of the microarray. The SNOMAD web implementation, available free of charge to all researchers at http://pevsnerlab.kennedykrieger.org/snomad.htm, provides these tools without the downloading or installation of additional software, and does not require users to have any statistical or computer programming expertise.
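
To make the first three methods named in the abstract more concrete, the following is a minimal Python sketch of background subtraction, global mean normalization, and local mean normalization across intensity for a two-channel array. It is an illustration under stated assumptions, not the SNOMAD implementation: all function names, the `floor` and `window` parameters, and the use of a simple running mean over intensity-ranked spots (in place of whatever local smoother the tools actually apply) are hypothetical choices made here.

```python
import math

# Hypothetical helper names; none of these come from the SNOMAD tools.

def background_subtract(signal, background, floor=1.0):
    """Subtract per-spot background, clamping at `floor` so logs stay defined."""
    return [max(s - b, floor) for s, b in zip(signal, background)]

def log_ratio_and_intensity(ch1, ch2):
    """Return per-spot log2 ratios and mean log2 intensities for two channels."""
    ratios = [math.log2(a / b) for a, b in zip(ch1, ch2)]
    intensities = [0.5 * (math.log2(a) + math.log2(b)) for a, b in zip(ch1, ch2)]
    return ratios, intensities

def global_mean_normalize(ratios):
    """Center the log ratios so that their mean over the whole array is zero."""
    mean_ratio = sum(ratios) / len(ratios)
    return [r - mean_ratio for r in ratios]

def local_mean_normalize(ratios, intensities, window=5):
    """Subtract from each log ratio the mean ratio of spots with similar intensity.

    Assumption: a running mean over the `window` spots nearest in intensity
    rank stands in for a proper local regression smoother.
    """
    order = sorted(range(len(ratios)), key=lambda i: intensities[i])
    half = window // 2
    corrected = [0.0] * len(ratios)
    for rank, i in enumerate(order):
        lo = max(0, rank - half)
        hi = min(len(order), rank + half + 1)
        neighbours = [ratios[order[j]] for j in range(lo, hi)]
        corrected[i] = ratios[i] - sum(neighbours) / len(neighbours)
    return corrected
```

A typical call sequence would apply `background_subtract` to each channel, pass the results to `log_ratio_and_intensity`, and then apply either normalization. When `window` spans every spot, the local correction reduces to the global one, which is one way to see that global mean normalization removes only a constant bias while the local version can also remove bias that varies with intensity.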

Editor information

Editors and Affiliations

Departments of Oncology, Biostatistics, and Pathology, Johns Hopkins University, Baltimore, MD, 21205-2013, USA
Giovanni Parmigiani
Departments of Oncology and Biostatistics, Johns Hopkins University, Baltimore, MD, 21205-2013, USA
Elizabeth S. Garrett
Department of Biostatistics, Johns Hopkins University, Baltimore, MD, 21205-2013, USA
Rafael A. Irizarry
Departments of Biostatistics and Epidemiology, Johns Hopkins University, Baltimore, MD, 21205-2013, USA
Scott L. Zeger

About this chapter

Cite this chapter

Colantuoni, C., Henry, G., Bouton, C.M.L.S., Zeger, S.L., Pevsner, J. (2003). Snomad: Biologist-Friendly Web Tools for the Standardization and NOrmalization of Microarray Data. In: Parmigiani, G., Garrett, E.S., Irizarry, R.A., Zeger, S.L. (eds) The Analysis of Gene Expression Data. Statistics for Biology and Health. Springer, New York, NY. https://doi.org/10.1007/0-387-21679-0_9

DOI: https://doi.org/10.1007/0-387-21679-0_9
Publisher Name: Springer, New York, NY
Print ISBN: 978-0-387-95577-3
Online ISBN: 978-0-387-21679-9
eBook Packages: Springer Book Archive
