Abstract
Analysis of high-dimensional biomarker data is in general of exploratory nature and aims to discover or dissect subgroups of patients sharing a specific pattern of biomarker measurements. One major challenge is to extract the relevant markers from the extremely large pool of measured markers. Specific techniques such as grouping and ordering and dimension reduction allow to aggregate huge amounts of data into single meaningful graphics. These graphics can guide the direction of exploration during the analysis. We present graphical tools for unsupervised and supervised objectives based on gene expression data of multiple myeloma patients which are part of the MAQC-II project.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Similar content being viewed by others
References
Arabie P, Hubert L (1996) An overview of combinatorial data analysis. In: Arabie P, Hubert L, Soete G (eds) Clustering and classification. World Scientific, River Edge, NJ, pp 5–63
Barlogie B, Tricot G, Anaissie E, Shaughnessy J, Rasmussen E, van Rhee F, Fassas A, Zangari M, Hollmig K, Pineda-Roman M, Lee C, Talamo G, Thertulien R, Kiwan E, Krishna S, Fox M, Crowley J (2006) Thalidomide and hematopoietic-cell transplantation for multiple myeloma. New Engl J Med 354(10):1021–1030
Benjamini Y, Hochberg Y (1995) Controlling the false discovery rate: a practical and powerful approach to multiple testing. J Roy Stat Soc B 57(1):289–300
Binder H, Schumacher M (2008) Adapting prediction error estimates for biased complexity selection in high-dimensional bootstrap samples. Stat Appl Genet Mol Biol 7: Article 1
Biomarkers Definitions Working Group (2001) Biomarkers and surrogate endpoints: preferred definitions and conceptual framework. Clin Pharmacol Therapeut 69:89–95
Eisen M, Spellman P, Brown P, Botstein D (1998) Cluster analysis and display of genome-wide expression patterns. Proc Natl Acad Sci USA 95(25):14863–14868
Friedman J, Hastie T, Tibshirani R (2010) Regularization paths for generalized linear models via coordinate descent. J Stat Softw 33(1):1–22. URL http://www.jstatsoft.org/v33/i01/
Gelman A, Unwin A (2012) Infovis and statistical graphics: different goals, different looks. Unpublished manuscript. URL http://www.stat.columbia.edu/∼gelman/research/published/vis14.pdf
Goeman JJ, Bühlmann P (2007) Analyzing gene expression data in terms of gene sets: methodological issues. Bioinformatics 23(8):980–987
Goeman JJ, van de Geer SA, de Kort F, van Houwelingen HC (2004) A global test for groups of genes: testing association with a clinical outcome. Bioinformatics 20:93–99
Goldman AI (1992) EVENTCHARTS: visualizing survival and other timed-events data. American Statistician 46(1):13–18. URL http://www.jstor.org/stable/2684402
Gribov A, Sill M, Lück S, Rücker F, Döhner K, Bullinger L, Benner A, Unwin A (2010) SEURAT: visual analytics for the integrated analysis of microarray data. BMC Med Genom 3:21. doi:10.1186/1755-8794-3-21, URL http://dx.doi.org/10.1186/1755-8794-3-21
Jolliffe I (2002) Principal components analysis, 2nd edn. Springer series in statistics. Springer, New York
Madeira SC, Oliveira AL (2004) Biclustering algorithms for biological data analysis: a survey. IEEE/ACM Trans Comput Biol Bioinform 1:24–45. doi:http://dx.doi.org/10.1109/TCBB.2004.2, URL http://dx.doi.org/10.1109/TCBB.2004.2
Mechelen IV, Bock HH, Boeck PD (2004) Two-mode clustering methods: a structured overview. Stat Methods Med Res 13(5):363–394
Meinshausen N (2008) Hierarchical testing of variable importance. Biometrika 95(2):265–278
Merico D, Isserlin R, Stueker O, Emili A, Bader G (2010) Enrichment map: a network-based method for gene-set enrichment visualization and interpretation. PLoS One 5(11):e13,984
Monti S, Tamayo P, Mesirov J, Golub T (2003) Consensus clustering – a resampling-based method for class discovery and visualization of gene expression microarray data. In: Machine Learning 52(1–2):91–118
Pittelkow Y, Wilson S (2003) Visualisation of gene expression data - the GE-biplot, the chip-plot and the gene-plot. Stat Appl Genet Mol Biol 2: Article 6
Porzelius C, Binder H, Schumacher M (2009) Parallelized prediction error estimation for evaluation of high-dimensional models. Bioinformatics 25(6):827–829
Rousseeuw P (1987) Silhouettes: a graphical aid to the interpretation and validation of cluster analysis. Comput Appl Math 20:53–65
Shi L, Campbell G, Jones WD et al (2010) The MicroArray quality control (MAQC)-II study of common practices for the development and validation of microarray-based predictive models. Nat Biotechnol 28(8):827–838
Sill M, Kaiser S, Benner A, Kopp-Schneider A (2011) Robust biclustering by sparse singular value decomposition incorporating stability selection. Bioinformatics 27(15):2089–2097 doi:10.1093/bioinformatics/btr322, URL http://dx.doi.org/10.1093/bioinformatics/btr322
Smyth GK (2004) Linear models and empirical bayes methods for assessing differential expression in microarray experiments. Stat Appl Genet Mol Biol 3: Article 1
Smyth GK (2005) Limma: linear models for microarray data. In: Gentleman R, Carey V, Dudoit S, Irizarry R, Huber W (eds) Bioinformatics and computational biology solutions using R and Bioconductor. Springer, New York, pp 397–420
Subramanian A, Tamayo VK P Mootha, Mukherjee S, Ebert BL, Gillette MA, Paulovich A, Pomeroy SL, Golub TR, Lander ES, Mesirov JP (2005) A knowledge-based approach for interpreting genome-wide expression profiles. PNAS 102(43):15545–15550
Tibshirani R (1996) Regression shrinkage and selection via the lasso. J Roy Stat Soc B Methodological 58:267–288
Turner HL, Bailey TC, Krzanowski WJ, Hemingway CA (2005) Biclustering models for structured microarray data. IEEE/ACM Trans Comput Biol Bioinform 2(4):316–329. doi:10.1109/TCBB.2005.49, URL http://dx.doi.org/10.1109/TCBB.2005.49
Walker R, Barlogie B, Haessler J, Tricot G, Anaissie E, Shaughnessy J, Epstein J, Hemert R, Erdem E, Hoering A, Crowley J, Ferris E, Hollmig K, Rhee F, Zangari M, Pineda-Roman M, Mohiuddin A, Yaccoby S, Sawyer J, Angtuaco E (2007) Magnetic resonance imaging in multiple myeloma: diagnostic and clinical implications. J Clin Oncol 25(9):1121–1128
Wang X, Terfve C, Rose J, Markowetz F (2011) HTSanalyzeR: An R/Bioconductor package for integrated network analysis of high-throughput screens. Bioinformatics 27(6):879–880
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
A.1 Appendix
A.1 Appendix
Rights and permissions
Copyright information
© 2012 Springer Science+Business Media, New York
About this chapter
Cite this chapter
Zucknick, M., Hielscher, T., Sill, M., Benner, A. (2012). Graphical Displays for Biomarker Data. In: Krause, A., O'Connell, M. (eds) A Picture is Worth a Thousand Tables. Springer, Boston, MA. https://doi.org/10.1007/978-1-4614-5329-1_8
Download citation
DOI: https://doi.org/10.1007/978-1-4614-5329-1_8
Published:
Publisher Name: Springer, Boston, MA
Print ISBN: 978-1-4614-5328-4
Online ISBN: 978-1-4614-5329-1
eBook Packages: Biomedical and Life SciencesBiomedical and Life Sciences (R0)