Abstract
We present Quintet, a unified, R-based cDNA microarray data analysis system with a GUI. Quintet coherently integrates five principal categories of microarray data analysis: data processing steps such as faulty spot filtering and normalization, data quality assessment (QA), identification of differentially expressed genes (DEGs), clustering of gene expression profiles, and classification of samples. Though many microarray data analysis systems treat DEG identification and clustering/classification as the most important problems, we emphasize that data processing and QA are equally important and should be part of routine data analysis practice, because microarray data are very noisy. In each analysis category, customized plots and statistical summaries are provided for the user's convenience. Using these plots and summaries, analysis results can be easily examined for biological plausibility and compared with other results. Since Quintet is written in R, it is highly extensible: users can insert new algorithms and experiment with them with minimal effort. The GUI makes the system easy to learn and use, and since the R language and its GUI engine, Tcl/Tk, are available on all major operating systems, Quintet is also OS-independent.
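The data processing step named above (faulty spot filtering followed by normalization) can be illustrated with a minimal sketch. This is not Quintet's implementation, which is written in R and not shown here. The function name `normalize_log_ratios`, the `min_intensity` threshold, and the `(red, green, flagged)` spot layout are all assumptions made for illustration, and global median centering is only one common normalization choice.

```python
import math
from statistics import median


def normalize_log_ratios(spots, min_intensity=100.0):
    """Filter faulty spots, then median-center the log2(red/green) ratios.

    spots: iterable of (red, green, flagged) tuples (hypothetical layout).
    min_intensity: assumed cutoff below which a channel is treated as unreliable.
    Returns the centered log2 ratios of the spots that pass the filter.
    """
    # Faulty spot filtering: drop flagged spots and spots too dim in either channel.
    kept = [
        (red, green)
        for red, green, flagged in spots
        if not flagged and red >= min_intensity and green >= min_intensity
    ]
    log_ratios = [math.log2(red / green) for red, green in kept]
    if not log_ratios:
        return []
    # Global median normalization: shift so the median log ratio becomes zero.
    center = median(log_ratios)
    return [value - center for value in log_ratios]


spots = [
    (1600.0, 400.0, False),  # log2 ratio 2
    (800.0, 400.0, False),   # log2 ratio 1
    (400.0, 400.0, False),   # log2 ratio 0
    (50.0, 400.0, False),    # red channel too dim: filtered out
    (900.0, 300.0, True),    # flagged as faulty: filtered out
]
normalized = normalize_log_ratios(spots)
print(normalized)
```

Centering on the median assumes that most genes are not differentially expressed, so the typical log ratio should be zero. Intensity-dependent methods such as lowess address biases that a single global shift cannot.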
Copyright information
© 2005 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Choe, Jk., Chung, TH., Park, S., Cho, H.G., Hur, CG. (2005). An Open Source Microarray Data Analysis System with GUI: Quintet. In: Ślęzak, D., Yao, J., Peters, J.F., Ziarko, W., Hu, X. (eds) Rough Sets, Fuzzy Sets, Data Mining, and Granular Computing. RSFDGrC 2005. Lecture Notes in Computer Science, vol 3642. Springer, Berlin, Heidelberg. https://doi.org/10.1007/11548706_41
DOI: https://doi.org/10.1007/11548706_41
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-28660-8
Online ISBN: 978-3-540-31824-8
eBook Packages: Computer Science (R0)