The Pancancer DNA Methylation Trackhub: A Window to The Cancer Genome Atlas Epigenomics Data
The Cancer Genome Atlas (TCGA) epigenome data includes the DNA methylation status of tumor and normal tissues of large cohorts for dozens of cancer types. Due to the moderately large data sizes, retrieving and analyzing them requires basic programming skills. Simple data browsing (e.g., candidate gene search) is hampered by the scarcity of easy-to-use data browsers addressed to the broad community of biomedical researchers. We propose a new visualization method depicting the overall DNA methylation status at each TCGA cohort while emphasizing its heterogeneity, thus facilitating the evaluation of the cohort variability and the normal versus tumor differences. Implemented as a trackhub integrated to the University of California Santa Cruz (UCSC) genome browser, it can be easily added to any genome-wide annotation layer.
To exemplify the trackhub usage we evaluate local DNA methylation boundaries, the aberrant DNA methylation of a CpG island located at the estrogen receptor 1 (ESR1) in breast and colon cancer, and the hypermethylation of the Homeobox HOXA gene cluster and the EN1 gene in multiple cancer types. The DNA methylation pancancer trackhub is freely available at http://maplab.cat/tcga_450k_trackhub.
Key wordsDNA methylation Pancancer Data visualization TCGA The Cancer Genome Atlas
We thank Iñaki Martinez de Ilarduya for his excellent technical support. The trackhub published here is based upon data generated by the TCGA Research Network: http://cancergenome.nih.gov/. This work was supported by the Spanish Ministry of Economy and Competitiveness [SAF2011/23638 and SAF2015-64521-R to M.A.P.]. CERCA Program/Generalitat de Catalunya.
- 1.Zhang J, Baran J, Cros A, Guberman JM, Haider S, Hsu J, Liang Y, Rivkin E, Wang J, Whitty B, Wong-Erasmus M, Yao L, Kasprzyk A (2011) International Cancer Genome Consortium Data Portal—a one-stop shop for cancer genomics data. Database (Oxford) 2011:bar026Google Scholar
- 3.Cerami E, Gao J, Dogrusoz U, Gross BE, Sumer SO, Aksoy BA, Jacobsen A, Byrne CJ, Heuer ML, Larsson E, Antipin Y, Reva B, Goldberg AP, Sander C, Schultz N (2012) The cBio cancer genomics portal: an open platform for exploring multidimensional cancer genomics data. Cancer Discov 2(5):401–404CrossRefPubMedGoogle Scholar
- 5.Speir ML, Zweig AS, Rosenbloom KR, Raney BJ, Paten B, Nejad P, Lee BT, Learned K, Karolchik D, Hinrichs AS, Heitner S, Harte RA, Haeussler M, Guruvadoo L, Fujita PA, Eisenhart C, Diekhans M, Clawson H, Casper J, Barber GP, Haussler D, Kuhn RM, Kent WJ (2016) The UCSC Genome Browser database: 2016 update. Nucleic Acids Res 44(D1):D717–D725CrossRefPubMedGoogle Scholar
- 11.Rauch T, Wang Z, Zhang X, Zhong X, Wu X, Lau SK, Kernstine KH, Riggs AD, Pfeifer GP (2007) Homeobox gene methylation in lung cancer studied by genome-wide analysis with a microarray-based methylated CpG island recovery assay. Proc Natl Acad Sci U S A 104(13):5527–5532CrossRefPubMedPubMedCentralGoogle Scholar