Abstract
In this chapter, we introduce a few statistical algorithms for calling gains and losses in array-based comparative genomic hybridization (array CGH) data, including CBS, CLAC, CGHseg, and Fused Lasso. We illustrate the performance of the methods through simulated and real data examples. We also provide brief guidance on how to use the corresponding software at the end of this chapter.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
References
Pollack, J., Sorlie, T., Perou, C., Rees, C., Jeffrey, S., Lonning, P., Tibshirani, R., Botstein, D., Borresen-dale, A. and Brown, P. (2002). Microarray analysis reveals a major direct role of DNA copy number alteration in the transcriptional program of human breast tumors. Proc. Natl. Acad. Sci. USA 99, 12963–12968.
Hodgson, G., Hager, J., Volik, S., Hariono, S., Wernick, M., Moore, D., Nowak, N., Albertson, D., Pinkel, D., Collins, C., Hanahan, D. and Gray, J.W. (2001). Genome scanning with array CGH delineates regional alterations in mouse islet carcinomas. Nat. Genet. 29, 491.
Cheng, C., Kimmel, R., Nelman, P. and Zhao, L.P. (2003). Array rank order regression analysis for the detection of gene copy-number changes in human cancer. Genomics 82, 122–129.
Lingjaerde, O., Baumbusch, L., Liestol, K., Glad, I. and AL, B.-D. (2005). CGH-explorer: a program for analysis of array-CGH data. Bioinformatics 21(6).
Fridlyand, J., Snijders, A.M., Pinkel, D., Albertson, D.G. and Jain, A.N. (2004). Hidden Markov models approach to the analysis of array CGH data. J. Multivariate Anal. 90, 132–153.
Olshen, A. and Venkatraman, E. (2004). Circular binary segmentation for the analysis of array-based DNA copy number data. Biostatistics 5, 557–572.
Vostrikova, L.J. (1981). Detecting ‘disorder’ in multidimensional random processes. Sov. Math. Dokl. 24, 55–59.
Zhang, N.R. and Siegmund, D.O. (2007). A modified Bayes information criterion with applications to the analysis of comparative genomic hybridization data. Biometrics 63, 22–32.
Lai, T.L., Xing, H.P. and Zhang, N.R. (2007). Stochastic segmentation models for array-based comparative genomic hybridization data analysis. Biostatistics, doi:10.1093/biostatistics/kxm031
Wang, P., Kim, Y., Pollack, J., Narasimhan, B. and Tibshirani, R. (2005). A method for calling gains and losses in array CGH data. Biostatistics 6, 45–58.
Myers, C.L., Dunham, M.J., Kung, S.Y. and Troyanskaya, O.G. (2004). Accurate detection of aneuploidies in array CGH and gene expression microarray data. Bioinformatics 20, 3533–3543.
Lipson, D., Aumann, Y., Ben-Dor, A., Linial, N. and Yakhini, Z. (2005). Efficient calculation of interval scores for DNA copy number data analysis. In Proceedings of RECOMB 05. Springer-Verlag, Cambridge, MA.
Hupe, P., Stransky, N., Thiery, J.-P., Radvanyi, F. and Barillot, E. (2004). Analysis of array CGH data: from signal ratio to gain and loss of DNA regions. Bioinformatics. 20, 3413–3422.
Picard, F., Robin, S., Lavielle, M., Vaisse, C. and Daudin, J.J. (2005). A statistical approach for array CGH data analysis. BMC Bioinform. 11, 6–27.
Hsu, L., Self, S.G., Grove, D., Randolph, T., Wang, K., Delrow, J.J., Loo, L. and Porter, P. (2005). Denoising array-based comparative genomic hybridization data using wavelets. Biostatistics. 6, 211–226.
Tibshirani, R. and Wang, P. (2007). Spatial smoothing and hot spot detection for CGH data using the fused lasso. Biostatistics, doi:10.1093/biostatistics/kxm013.
Tibshirani, R., Saunders, M., Rosset, S., Zhu, J. and Knight, K. (2004). Sparsity and smoothness via the fused lasso. J. R. Stat. Soc. B. 67(1), 91–108.
Eilers, P.H. and de Menezes, R.X. (2005). Quantile smoothing of array CGH data. Bioinformatics 21(7), 1146–1153.
Li, Y. and Zhu, J. (2007). Analysis of array CGH data for cancer studies using the fused quantile regression. Bioinformatics 23, 2470–2476.
Wen, C., Wu, Y., Huang, Y., Chen, W., Liu, S., Jiang, S., Juang, J., Lin, C., Fang, W., Hsiung, C. and Chang, I. (2006). A Bayes regression approach to array-CGH data. Stat. Appl. Mol. Biol. Berkeley Electron. Press 5(1), 3.
Engler, D., Mohapatra, G., Louis, D. and Betensky, R. (2006). A pseudolikelihood approach for simultaneous analysis of array comparative genomic hybridizations. Biostatistics 7(3), 399–421.
Lai, W.R., Johnson, M.D., Kucherlapati, R. and Park, P.J.(2005). Comparative analysis of algorithms for identifying amplifications and deletions in array CGH data. Bioinformatics 21(19), 3763–3770.
Venkatraman, E.S. and Olshen, A.B. (2007). A faster circular binary segmentation algorithm for the analysis of array CGH data. Bioinformatics 23 (6), 657–663.
Siegmund, D.O. (1988). Approximate tail probabilities for the maxima of some random fields. Ann. Probab. 16, 487–501.
Yao, Q. (1989). Large deviations for boundary crossing probabilities of some random fields. J. Math. Res. Exposit. 9, 181–192.
Hastie, T., Tibshirani, R. and Friedman, J. (2001). The Elements of Statistical Learning. Springer, New York, NY, p. 475.
Benjamini, Y. and Hochberg, Y. (1995). Controlling the false discovery rate: a practical and powerful approach to multiple testing. J. R. Stat. Soc. B 57(1), 289–300.
Tusher, V., Tibshirani, R. and Chu, G. (2001). Significance analysis of microarrays applied to the ionizing radiation response. Proc. Natl. Acad. Sci. USA 98, 5116–5121.
Storey, J. (2002). A direct approach to false discovery rates. J. R. Stat. Soc. 64(3), 479–498.
Efron, B. and Tibshirani, R. (2002). Microarrays, empirical Bayes methods, and false discovery rates. Genetic Epidemiology 23(1), 70–86.
Tibshirani, B. (1996). Regression shrinkage and selection via the lasso. J. R. Stat. Soc. Ser. B 58, 267–288.
Friedman, J., Hastie, T. and Tibshirani, R. (2007). Pathwise coordinate optimization. Ann. Appl. Stat. 1(2), 302–332.
Becker, R.A., Chambers, J.M. and Wilks, A.R. (1988). The New S Language. Wadsworth Brooks Cole, Pacific Grove, CA.
Ruppert, D., Wand, M.P. and Carroll, R. (2003). Semiparametric Regression. Cambridge University Press, New York.
Bredel, M., Bredel, C., Juric, D., Harsh, G.R., Vogel, H., Recht, L.D. and Sikic, B.I. (2005). High-resolution genome-wide mapping of genetic alterations in human glial brain tumors. Cancer Res. 65, 4088–4096.
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2009 Humana Press, a part of Springer Science+Business Media, LLC
About this protocol
Cite this protocol
Wang, P. (2009). Algorithms for Calling Gains and Losses in Array CGH Data. In: Pollack, J. (eds) Microarray Analysis of the Physical Genome. Methods in Molecular Biology™, vol 556. Humana Press. https://doi.org/10.1007/978-1-60327-192-9_8
Download citation
DOI: https://doi.org/10.1007/978-1-60327-192-9_8
Published:
Publisher Name: Humana Press
Print ISBN: 978-1-60327-191-2
Online ISBN: 978-1-60327-192-9
eBook Packages: Springer Protocols