Genome-wide epigenomic profiling for biomarker discovery
- 4.2k Downloads
A myriad of diseases is caused or characterized by alteration of epigenetic patterns, including changes in DNA methylation, post-translational histone modifications, or chromatin structure. These changes of the epigenome represent a highly interesting layer of information for disease stratification and for personalized medicine. Traditionally, epigenomic profiling required large amounts of cells, which are rarely available with clinical samples. Also, the cellular heterogeneity complicates analysis when profiling clinical samples for unbiased genome-wide biomarker discovery. Recent years saw great progress in miniaturization of genome-wide epigenomic profiling, enabling large-scale epigenetic biomarker screens for disease diagnosis, prognosis, and stratification on patient-derived samples. All main genome-wide profiling technologies have now been scaled down and/or are compatible with single-cell readout, including: (i) Bisulfite sequencing to determine DNA methylation at base-pair resolution, (ii) ChIP-Seq to identify protein binding sites on the genome, (iii) DNaseI-Seq/ATAC-Seq to profile open chromatin, and (iv) 4C-Seq and HiC-Seq to determine the spatial organization of chromosomes. In this review we provide an overview of current genome-wide epigenomic profiling technologies and main technological advances that allowed miniaturization of these assays down to single-cell level. For each of these technologies we evaluate their application for future biomarker discovery. We will focus on (i) compatibility of these technologies with methods used for clinical sample preservation, including methods used by biobanks that store large numbers of patient samples, and (ii) automation of these technologies for robust sample preparation and increased throughput.
KeywordsGenome-wide epigenetic profiling Biomarker discovery Miniaturization Automation Single cell DNA methylation WGBS ATAC-Seq Stratification Precision medicine
- 450K array
Infinium HumanMethylation450 BeadChip array
Circular chromosome conformation capture
Assay for transposase-accessible chromatin
Canonical correlation analysis
Chronic lymphocytic leukemia
DNAseI hypersensitive site
Fluorescence-activated cell sorting
False discovery rate
Formalin-fixed paraffin-embedded sample
International Human Epigenome Consortium
Methyl-CpG binding domain protein-enriched
Methylation DNA immunoprecipitation
Methylated DNA capture
National Institutes of Health
Non-small cell lung cancer
Pathology tissue chromatin immunoprecipitation
Peripheral blood mononuclear cell
Principle component analysis
Polycomb repressive complex
Reduced representation bisulfite sequencing
Suberanilohydroxamic acid (Vorinostat)
single-cell bisulfite sequencing
followed by sequencing
Topologically associating domain
Whole-genome bisulfite sequencing
Within fundamental and clinical research and in clinical practice, biomarkers play an important role to facilitate disease diagnosis, prognosis, and selection of targeted therapies in patients. As such, biomarkers are critical for personalized medicine to improve disease stratification: the identification of groups of patients with shared (biological) characteristics, such as a favorable response to a particular drug [1, 2]. Biomarkers need to fulfill a number of requirements, the most important of which is to show high predictive value. From a practical perspective, the detection method for a biomarker must be accurate, relatively easy to carry out, and show high reproducibility . Over the last decade, there has been an increasing interest in biomarkers at the hand of rapid developments within high-throughput molecular biology technologies, capable of identifying “molecular biomarkers” [4, 5]. Molecular biomarkers possess a critical advantage over more traditional biomarkers during the exploratory phase of biomarker discovery, as many candidate molecular biomarkers can be assayed in parallel. This particularly involves screening of (epi)genomic features at a genome-wide scale, often making use of powerful next-generation sequencing (NGS)-based technologies. These screens can assess very large numbers of loci for the presence or absence of a certain (epi)genomic feature. Subsequently, these loci can be evaluated as a potential biomarker by determining their correlation between samples with different characteristics, for example, by comparing healthy versus diseased tissue.
To be suitable for biomarker discovery, (epi)genomic profiling assays need to fulfill a number of important requirements. To accommodate sample collection for batch processing, clinical samples are often preserved by freezing or by formaldehyde crosslinking. Therefore, an important requirement for (epi)genomic biomarker screening technologies is that these are compatible with processed samples. Additionally, this allows inclusion of clinical samples that have been processed for biobanking, or to use such samples for replication or validation. Biobanks collect large numbers of samples such as tissues or DNA (deoxyribonucleic acid) and the associated patient information, which is highly valuable for retrospective biomarker studies [6, 7, 8, 9]. Exploratory screens for candidate biomarkers mainly rely on the use of patient specimens, which are obtained in small quantities, while also biobanks often contain limited quantities of patient material. Therefore, a second requirement is that assays used for biomarker discovery are compatible with miniaturization to allow processing of low-input samples. Furthermore, robust biomarker discovery is dependent on the screening of large numbers of samples due to the inherent clinical and biological variability between patient samples . Assays used for biomarker discovery therefore benefit from automation and digitalization, facilitating upscaling while reducing the chance of errors due to human handling.
Genomic features that are utilized for molecular biomarker discovery can be separated in two categories: (i) changes in the DNA sequence itself, such as mutations and rearrangements, and (ii) changes in the epigenome, represented by molecules and structures associated with the DNA such as DNA methylation and post-translational histone modifications. This review will focus on the latter category, as recent developments in epigenetic profiling technologies have not only greatly increased our knowledge on epigenetic regulation, but also allow for large-scale discovery of molecular epigenetic biomarkers. The first section of this review provides an overview of epigenetic features and how these can be assayed. We discuss how misregulation of epigenetic processes may lead to disease, providing mechanistic rationale for the use of epigenetic features as biomarkers. The feasibility of applying epigenetic biomarkers in the clinic is demonstrated by examples of DNA methylation biomarkers that have reached clinical stages. In the second part of this review, we will focus on current genome-wide epigenomic profiling technologies, and whether these are already or will likely become compatible with biomarker discovery in the near future. We will evaluate these approaches with three criteria in mind: (i) the possibility to use frozen or chemically fixed material in these assays, (ii) compatibility with miniaturization and single-cell profiling, and (iii) the current level of automation.
Within a eukaryotic cell, the DNA is packaged to fit into the small volume of the nucleus in a highly organized fashion. The basic unit of chromatin involves the DNA wrapped around nucleosomes consisting of two copies of each of the core histones H2A, H2B, H3, and H4: the so-called beads-on-string structure . Subsequent compaction leads to higher order structures including the formation of very dense arrays of nucleosomes observed in heterochromatin [12, 13]. Despite being tightly packed, the chromatin appears to be highly plastic to allow processes such as transcription, DNA damage repair, DNA remodeling, and DNA replication. This plasticity is facilitated by several factors that influence both local and global chromatin architectures. The most prominent features affecting chromatin structure are reversible covalent modifications of the DNA, e.g., cytosine methylation and hydroxymethylation mainly occurring within the genomic CG context (CpGs), and reversible post-translational modifications of histones, e.g., lysine acetylation, lysine and arginine methylation, serine and threonine phosphorylation, and lysine ubiquitination and sumoylation. These modifications are set by specific classes of enzymes: DNA methyltransferases (DNMTs) in case of cytosine methylation  or histone-modifying enzymes . Besides facilitating chromatin compaction, modifications of the DNA and histones are read by adaptor molecules, chromatin-modifying enzymes, and transcription factors (TFs) that contribute to the regulation of transcription and other chromatin-related processes [15, 16]. Next to modifications of DNA and histones, the three-dimensional (3D) conformation of the DNA within the nucleus imposes an additional regulatory layer of gene expression .
Summary of the main epigenetic features and the principles, caveats, and requirements of the main technologies used for their profiling
DNA methylation. DNA methylation is the process in which a methyl group is added to the 5′ position of cytosines in the DNA, which mainly occurs within the context of CpGs. DNA methylation typically acts to repress gene transcription when located in a gene promoter, while gene-body methylation is positively correlated with expression [153, 154, 155, 156, 157]. Distal regulatory regions like enhancers generally contain low DNA methylation levels when active due to binding of TFs . The role or consequence of DNA methylation at other places of the genome is less well understood . Genome-wide profiling of DNA methylation generally relies on (i) affinity purification of methylated DNA fragments or (ii) the use of sodium bisulfite converting unmethylated cytosines into uracil. The technologies referred to by the first method, MBD-Seq/MethylCap-Seq (methyl-CpG binding domain protein-enriched sequencing/methylated DNA capture sequencing) [140, 141, 159] and MeDIP-Seq (methylation DNA immunoprecipitation sequencing) [160, 161], utilize a methyl binding protein domain or an antibody raised against 5-methylcytosine, respectively, to affinity purify methylated DNA fragments from sheared genomic DNA. Although MethylCap-Seq/MeDIP-Seq provides accurate measurements of DNA methylation , an important caveat is the aspecific background remaining after the affinity purification. These might cause false positive results (in particular in case of copy number variations) if not properly controlled for. The second method makes use of bisulfite on sheared genomic DNA to convert unmethylated cytosines into uracil, while leaving methylated cytosines unaffected . After subsequent amplification to prepare the DNA for readout, the uracil (representing the unmethylated cytosine) is read as a thymidine, while cytosines represent methylated cytosines in the original sample. The readout of bisulfite-based methods is mainly performed by microarrays (including the Infinium HumanMethylation450 BeadChip array (“450K array”) covering 450,000 of the 28 million genomic CpGs)  or by sequencing, referred to as whole-genome bisulfite sequencing (WGBS). In light of the high sequencing costs associated with WGBS, reduced representation bisulfite sequencing (RRBS) selects for CpG-rich fragments before sequencing using methylation-insensitive restriction enzymes such as MspI . An important advantage of bisulfite-based methods (450K array, WGBS, RRBS) over other DNA methylation profiling technologies is that these generate DNA methylation profiles at base-pair resolution. Furthermore, the input requirements for WGBS/RRBS (20 ng of DNA for low-input WGBS/RRBS profiling, equivalent to 3 × 103 cells ) are low as compared to the 450K array (500 ng; 7.5 × 104 cells) and MBD-Seq/MethylCap-Seq/MeDIP-Seq (1 μg DNA; 1.5 × 105 cells). Although dependent on sequencing depth, the coverage of WGBS is usually >90% of all CpGs in the genome [165, 166], as compared to 60–90% for MBD-Seq/MethylCap-Seq/MeDIP-Seq and 2% for the 450K array. In view of the superior specifications, WGBS is considered the “golden standard” for determining the DNA methylome.
Protein binding sites. Characterization of the genomic locations of post-translational histone modifications, histone variants, TFs, and other chromatin associated proteins is generally performed by chromatin immunoprecipitation (ChIP). ChIP relies on the use of a specific antibody to perform affinity purifications on sheared chromatin to isolate fragments bound by the protein of interest. In most workflows, proteins are crosslinked to the DNA by formaldehyde, after which the chromatin is fragmented by sonication or enzymatic digestion. However, in particular in case of histones, ChIP can also be performed on native (meaning non-crosslinked) chromatin fragmented by micrococcal nuclease (MNase) [167, 168]. After ChIP, the purified DNA fragments are sequenced to determine the protein localization on a genome-wide scale (ChIP-Seq) [169, 170]. Loci in the genome which are enriched for mapped sequencing reads (generally referred to as “peaks” according to their visual appearance in genome browsers) represent protein binding sites. ChIP-Seq heavily relies on the availability of antibodies that are specific for their endogenous target and that are compatible with the ChIP conditions. Since ChIP-Seq relies on an enrichment strategy, it generally requires a relative high number of cells as input to distinguish specific signals from background. The number of input cells for ChIP-Seq is typically 0.5–5 × 106 cells, with profiling of histones requiring less cells than profiling TFs .
Chromatin accessibility/footprinting. Transcriptional activation is tightly linked with disruption or eviction of nucleosome organization at control regions such as promoters and enhancers due to binding of TFs. Regulatory DNA thus coincides with open or accessible genomic sites in chromatin [171, 172]. Profiling of these accessible sites is performed using the exonuclease desoxyribonuclease 1 (DNaseI) or using the Tn5 transposase on native chromatin, as both enzymes are able to target accessible genomic regions within chromatin. Selecting and sequencing short fragments (50–150 nt) after treatment with DNaseI (DNAseI-Seq) [173, 174] or transposase (assay for transposase-accessible chromatin (ATAC)-Seq) allows to enrich for TF binding sites, in contrast to larger fragments that might be derived from nucleosomes . Similar to ChIP-Seq, loci in the genome which are enriched for mapped sequencing reads (referred to as “peaks”) represent accessible sites. Within the ATAC-Seq procedure, the Tn5 transposase directly inserts the adapters for sequencing. Therefore, ATAC-Seq has an important advantage in that it requires a relative small number of cells (5 × 104 cells)  to start with as compared to DNAseI-Seq (1–10 × 106 cells ). Both for ATAC-Seq and DNAseI-Seq, characterization of enriched DNA motifs within the accessible sites can be used to infer the identity of sequence-specific TFs. A complementary approach to infer the identity of TFs that are binding within accessible regions is by the use of so-called “footprints.” Sequence-specific TFs protect the genome from DNAseI and transposase digestion at the exact position where they are binding the DNA. This results in a unique, detectable footprint that can be used for characterization of the factor that is binding [174, 176].
Nucleosome occupancy/positioning. Nucleosomes are the basic core particles of the chromatin, consisting of histones and approximately 147 base pairs of DNA wrapped around it. Although the DNA-protein binding within nucleosomes is very stable, nucleosomes can be remodeled or slide along the DNA, thereby facilitating or inhibiting chromatin-related processes such as transcription. Nucleosome positioning is usually determined with the use of MNase on native chromatin [171, 177]. MNase is an endo-exonuclease that digests and cleaves DNA unless it is protected by proteins. Nucleosome position can be determined by sequencing the DNA fragments (115–195 bp in size) isolated from chromatin treated with MNase (MNase-Seq) [178, 179]. A typical MNAse-Seq profiling experiments requires 1–10 × 106 cells.
3D conformation of the genome. Chromatin loops and further high-order chromatin structures are profiled using chromosome confirmation capture . Chromosome confirmation capture relies on digestion of crosslinked chromatin using restriction enzymes, followed by ligation of the sticky ends. Sequencing of DNA ligation products allows to determine the proximity of the ligated fragments and provides insight into the 3D structure within the nucleus. Chromosomal loci that are far apart on a linear chromosome, but close together in nuclear space, can come into proximity and will hence be ligated . For genome-wide profiling, two different variants of chromosome confirmation capture that are popular include Circular chromosome confirmation capture (4C-Seq)  and HiC-Seq [183, 184]. 4C-Seq determines all genomic interaction partners of one specific locus in the genome (referred to as “bait”) at high resolution and sensitivity. In HiC-Seq, all genomic interactions are profiled at low resolution and sensitivity, enabling a global 3D view on the genome. Using HiC-Seq, recent studies in mice and human have revealed that chromosome territories are arranged into large megabase-sized topologically-associating domains (TADs) that are highly conserved and stable across cell types [183, 185]. 4C-Seq experiments typically require 1 × 107 cells , while HiC-Seq experiments require 2.5 × 107 cells .
Comparative analyses of epigenomes are complicated by the epigenetic variability that is present between individuals within a population. Genetic variation such as SNPs (single-nucleotide polymorphisms) or indels in regulatory sequences or mutations in epigenetic enzymes will have a direct effect on the epigenome [26, 27, 28, 29]. Furthermore, environmental factors such as lifestyle, stress, and nutrition influence epigenetic patterns [30, 31, 32, 33]. Also, epigenetic patterns change during aging. In fact, DNA methylation markers in saliva and blood can be used for accurate estimation of age [34, 35, 36, 37]. Thus, epigenetic patterns are plastic and change during development and over time. The variability between individuals has to be accounted for in epigenetic studies including biomarker discovery and hence large cohorts need to be studied to overcome the intra-individual variation. In this respect, it is important to note that the extent of the intra-individual variation is much less as compared to the variation observed between tissues within individuals, at least for DNA methylation [38, 39, 40].
It has become increasingly clear that misregulation or mutations of epigenetic enzymes are at the basis of a broad range of syndromes and diseases . Mutations in epigenetic enzymes are frequently observed in cancer , intellectual disability , neurological disorders such as Alzheimer’s, Parkinson’s, and Huntington’s disease , and autoimmune diseases such as rheumatoid arthritis [45, 46, 47] and type 1 diabetes . Most studies have been performed in cancer: ~30% of all driver genes characterized in cancer are related to chromatin structure and function . Well-known examples of genes in which mutations can promote or drive tumorigenesis include DNMT3A and TET2, involved in DNA methylation and DNA demethylation, respectively, and EZH2, which is part of the polycomb repressive complex 2 (PRC2) complex that trimethylates lysine 27 on histone 3 (H3K27me3) [49, 50, 51]. Apart from mutations in epigenetic enzymes, mistargeting of epigenetic enzymes, such as the silencing of CDKN2A and MLH1 by aberrant promoter DNA methylation, is considered to drive tumor formation . Given their prominent roles in cancer and various other diseases, epigenetics enzymes represent promising targets for therapeutic intervention. For example, small molecules targeting enzymes involved in the post-translational modifications of histones, such as SAHA (suberanilohydroxamic acid; Vorinostat) inhibiting histone deacetylases (HDACs), are effective as therapeutic drugs for a range of tumor types including T cell lymphomas in case of SAHA [53, 54, 55]. See Rodriguez and Miller , Qureshi and Mehler , and various papers within this special issue for excellent recent reviews on the use of small molecules to target epigenetic enzymes and their current status in clinical applications.
Molecular diagnosis and prognosis is traditionally often based on (immuno)histochemistry or immunoassays, for example by assaying prostate-specific antigen (PSA) in case of testing for prostate cancer . Also , changes in RNA (ribonucleic acid) expression, genetic alterations, and chromosomal abnormalities represent powerful biomarkers in various diseases including cancer . Notable examples are mutations in the BRCA1 and BRCA2 genes in breast and ovarian cancer or the presence of the Philadelphia chromosome in leukemia [60, 61, 62]. With the growing understanding that changes in the epigenome and chromatin are related with or causative in disease , it became clear that epigenetic alterations represent promising features to be used as biomarkers. An important characteristic for their use as biomarker is that epigenetic marks, in particular DNA methylation, are known to survive sample storage conditions reasonably well [63, 64]. Another convenient characteristic is that almost every biological tissue sample or body fluid such as blood or saliva can be used for analysis of DNA methylation and other epigenetic marks [22, 65, 66]. This robustness makes the application of epigenetic biomarkers in a clinical environment attractive.
Over the recent years, it has become clear that epigenetic features contain a high predictive value during various stages of disease. These analyses thus far mainly focused on DNA methylation. DNA methylation has been shown to be informative for disease diagnosis, prognosis, and stratification. Some of the DNA methylation-based epigenetic biomarkers, such as the methylation status of VIM and SEPT9 for colorectal cancer, SHOX2 for lung cancer, and GSTP1 for prostate cancer, are in clinical use and diagnostic kits are commercially available [67, 68, 69, 70, 71]. In case of one of the best characterized biomarkers, GSTP1, a meta study (mainly using prostatectomy tissue or prostate sextant biopsies) showed that hypermethylation of the promoter allows to diagnose prostate cancer with a sensitivity of 82% and a specificity of 95% . Importantly, the use of multiple DNA methylation biomarkers (combining hypermethylation of GSTP1, APC, RASSF1, PTGS2, and MDR1) resulted in a sensitivity and specificity of up to 100% . See Heyn and Esteller  for a recent comprehensive overview of DNA methylation biomarkers and its potential use in the clinic. In addition to its diagnostic potential, it has been well established that DNA methylation is informative for patient prognosis in terms of tumor recurrence and overall survival. For example, the hypermethylation of four genes, CDKN2A, cadherin 13 (CDH13), RASSF1, and APC, can be used to predict tumor progression of stage 1 non-small cell lung cancer (NSCLC) . In addition to disease prognosis, DNA methylation has been shown to be valuable for patient stratification to predict response to chemotherapeutic treatment. A well-known example is hypermethylation of MGMT in glioblastoma, which render the tumors sensitive to alkylating agents [76, 77] such as carmustine and temozolomide.
Together, these examples show the power and feasibility of using epigenetic features, and in particular DNA methylation, as biomarkers. Epigenetic biomarkers are complementary to genetic biomarkers. Whereas genetic mutations can (among others) disrupt protein function due to amino acid changes, epigenetic alterations can de-regulate mechanisms such as transcriptional control, leading to the inappropriate silencing or activation of genes. Notably, epigenetic changes occur early and at high frequencies in a wide range of diseases including cancer . It has been suggested that epigenetic alterations occur at higher percentages of tumors than genetic variations, resulting in a higher sensitivity in the detection of tumors .
Genome-wide epigenetic profiling for DNA methylation biomarkers
Thus far, the discovery of the epigenetic biomarkers mostly relied on targeted approaches using individual gene loci known or suspected to be involved in the etiology or progression of the disease or other phenotype under study. Despite the challenges in the identification of biomarkers using such approaches, this yielded a number of important epigenetic biomarkers. However, these approaches require a priori knowledge for the selection of candidate biomarkers.
In order to perform unbiased screens in the exploratory phase of biomarker discovery, genome-wide profiling technologies have spurred molecular biomarker discovery (detailed information on epigenomic profiling assays is presented in Table 1). Using these technologies, the entire (epi)genome can be interrogated for potential biomarkers by comparing healthy versus diseased cells/tissue, malignant versus non-malignant tumors, or drug-sensitive versus drug-resistant tumors. This enables selection of candidate biomarkers that are most informative for disease detection, prognosis, or stratification. The use of genome-wide screens furthermore enables to detect and evaluate combinations of (many) candidate loci, which often results in increased sensitivity and specificity of the biomarker. Importantly, the identification of individual genomic loci or genes as biomarkers from large datasets requires robust statistical testing such as multiple-testing correction (although traditional tests like the Bonferroni correction are over-conservative since there is often correlation between loci, i.e., they are not independent) or stringent false discovery rate (FDR) control (for example, by the Benjamini–Hochberg procedure) [80, 81, 82]. To define sets of biomarkers from large dataset, alternative statistical methods (such as sparse principle component analysis (PCA) or sparse canonical correlation analysis (CCA) [83, 84]) are available as well. In light of (i) challenges with the experimental setup when using patient material, (ii) costs, and (iii) the extensive computational analysis associated with the exploratory phase of biomarker discovery, genome-wide screens are often performed on relatively small cohorts. Independent of the (statistical) methods used, it is essential to validate (sets of) candidate biomarkers in follow-up studies on large cohorts using targeted epigenetic approaches before potential application in the clinic .
Recent years have seen an increasing number of studies using genome-wide epigenetic profiling to predict disease outcome. For a range of tumors, including childhood acute lymphoblastic leukemia , kidney cancer , NSCLC , rectal cancer , cervical cancer [90, 91], breast cancer [92, 93], and glioblastoma , DNA methylome analysis has been shown to be of prognostic value. Most of these studies define changes in DNA methylation at single sites or at small subsets of sites that represent potential disease signatures. Although these studies are often restricted to a subset of CpGs within the genome and mostly rely on relatively small sample sizes, they show the power of performing genome-wide biomarker screens.
Currently, the most popular platform used in the exploratory phase of DNA methylation biomarker discovery represents the Infinium HumanMethylation450 BeadChip array (further referred to as “450K array”; see a short explanation of the 450K array within Table 1). The probes on the 450K array mainly represent functional CpG islands and functional elements such as promoters, enhancers, and TF binding sites. Main advantages of the 450K array for the detection of DNA methylation as compared to other DNA methylation platforms include (i) its high reproducibility, (ii) the straightforward analysis methods, (iii) the large number of samples that have been profiled using the 450K array thus far (which can be used for comparative purposes), and (iv) the relatively low costs. A disadvantage, like with all bisulfite-based methods (unless combined with additional chemical procedures), is that the 450K array is unable to distinguish between DNA methylation and DNA hydroxymethylation. Hydroxymethylated cytosines represent an intermediate step during demethylation of methylated cytosines but is relatively stable and is therefore likely to have specific biological functions as well . It should be noted that levels of DNA hydroxymethylation are generally much lower as compared to levels of DNA methylation (for example, DNA hydroxymethylation levels are >95% lower in case of peripheral blood mononuclear cell (PBMC) ). A further disadvantage of the 450K array is that genetic differences between samples might result in false positives, in particular since a subset of probes on the 450K array target polymorphic CpGs that overlap SNPs [97, 98]. For association studies using large cohorts, computational methods (based on principle components) have been developed to account for population stratification resulting from differences in allele frequencies [98, 99, 100].
To enable robust screening for a (set of) potential biomarker(s), most current studies apply the 450K array on up to several hundred samples. To narrow down and validate candidate biomarkers, more targeted DNA methylation assays are used on the same or a very similar-sized cohort . Subsequently, the remaining candidate biomarkers are further validated on larger cohorts using targeted DNA methylation assays that are compatible with routine clinical use, for example, by amplicon bisulfite sequencing . Using this powerful workflow, tumors for which prognostic biomarkers have been identified include rectal cancer , breast cancer , hepatocellular carcinoma , and chronic lymphocytic leukemia (CLL) [105, 106]. Interestingly, using a similar workflow, sets of DNA methylation biomarkers have recently been identified that are prognostic for the aggressiveness of tumors in prostate cancer [107, 108]. Such studies are very important for improving treatment of prostate cancer by avoiding (radical) prostatectomy in cases where careful monitoring of the tumor over time is preferred.
Biomarkers other than DNA methylation
The majority of epigenetic biomarkers identified thus far involve changes in DNA methylation. However, in light of the various types of epigenetic misregulation associated with diseases, changes in epigenetic features other than DNA methylation are likely to become powerful molecular biomarkers as well. ChIP-Seq profiling has revealed prominent differences in binding sites of post-translational histone modifications and other proteins between healthy and cancer tissue, both in leukemia as well as in solid tumors. For example, localized changes in H3 acetylation have been reported in leukemia (see, for example, Martens et al.  and Saeed et al. ). For solid tumors, differential estrogen receptor (ER) binding and H3K27me3 as determined by ChIP-Seq has been shown to be associated with clinical outcome in breast cancer [111, 112]. Also, androgen receptor (AR) profiling predicts prostate cancer outcome . A recent study identified tumor-specific enhancer profiles in colorectal, breast, and bladder carcinomas using H3K4me2 ChIP-Seq . Next to ChIP-Seq, DNAseI hypersensitivity assays have identified tumor-specific open chromatin sites for several types of cancer (see, for example, Jin et al. ). In terms of chromatin conformation, it has recently been shown that disruption of the 3D conformation of the genome can result in inappropriate enhancer activity causing mis-expression of genes including proto-oncogenes [116, 117]. These examples show that, besides DNA methylation, changes in (i) protein binding sites (including post-translational histone modifications), (ii) accessible (open) chromatin, and (iii) the 3D conformation of the genome represent epigenetic features that are potential effective biomarkers (Fig. 1). The near absence of biomarkers based on these epigenetic features is mainly due to practical reasons. ChIP-Seq as well as other comprehensive epigenetic profiling technologies traditionally require (much) more input material, up to 1 × 106 cells or more, to obtain robust results as compared to DNA methylation profiling (Table 1). This is particularly challenging for (banked) patient samples, which are often available in small quantities that might not be compatible with epigenetic profiling other than DNA methylation profiling. Also, profiling of such epigenetic features often require elaborate and delicate workflows (Table 1). Hence, quantitation and reproducibility of ChIP-Seq and other epigenetic profiling assays besides DNA methylation profiling are challenging. Furthermore, DNA methylation profiling is better compatible with (archived) frozen or fixed samples.
However, the last 2 years have seen a spectacular progress in miniaturization of epigenetic profiling assays. In various instances, this included automation of (part of) the workflow, improving the robustness of the assays and its output. Also, improved workflows for epigenetic profiling of frozen or fixed samples have been reported. Although this involved proof-of-concept studies in basic research settings, these efforts are likely to have significant impetus on genome-wide epigenetic screens for candidate biomarkers. The remainder of this review will provide an overview of the current status of genome-wide epigenetic profiling and the technological advances that facilitate miniaturization, automation, and compatibility with preserved samples.
New developments in epigenetic profiling: compatibility with preservation methods
Most epigenetic profiling assays have been developed using fresh material in order to preserve the native chromatin architecture. However, epigenetic biomarker screens require the use of patient-derived clinical samples that are generally processed to preserve the samples as well as to allow convenient sample handling, for example, for sectioning of biopsies. Also, samples present in biobanks are fixed to allow long-time storage. In particular for retrospective studies, epigenetic profiling technologies that are applied for biomarker screens should therefore be compatible with methodologies that are routinely used for sample preservation: freezing and chemical fixation (in particular FFPE fixation) .
Freezing of tissue specimens is typically performed by snap-freezing with subsequent storage at −80 °C or in liquid nitrogen . Freezing seems to maintain nuclear integrity and chromatin structure very well (Fig. 2). WGBS , ChIP-Seq [121, 122, 123], ATAC-Seq [124, 125], and DNAseI-Seq [126, 127] all have been shown to be compatible with frozen cells or tissues.
Chemical Fixation (FFPE)
FFPE conditions do not affect DNA methylation, and also formaldehyde and paraffin do not interfere with the WGBS profiling procedure . However, epigenetic assays other than bisulfite-based DNA methylation profiling are cumbersome with FFPE samples (Fig. 2). In case of ChIP-Seq, crosslinking generally occurs in much milder conditions (1% formaldehyde for 10 min) as compared to the harsh conditions used for FFPE fixation , which can complicate shearing and epitope accessibility. Pathology tissue (PAT)-ChIP has been reported to prepare FFPE samples for ChIP-Seq by the use of deparaffinization, rehydration, and MNase treatment followed by sonication at high power [130, 131]. However, PAT-ChIP comes with various limitations including the long running time of the protocol (up to 4 days) and the fact that it is not compatible with all ChIP-grade antibodies. Interestingly, some of these issues have been resolved in the very recently developed fixed-tissue (FiT)-Seq procedure, which might open up new avenues for ChIP-Seq profiling of FFPE samples . DNaseI-Seq on FFPE samples has been reported at the expense of a drop in signal-to-noise ratios of around 50% as compared to the use of fresh material .
Despite new developments for ChIP-Seq and DNaseI-Seq, this overview shows that DNA methylation is still the most robust of all epigenetic marks for profiling of samples that are processed by freezing or chemical fixation. Although most other epigenetic profiling assays are compatible with frozen samples (at the expense of signal-to-noise ratios for some of the assays), they are generally not or poorly compatible with FFPE specimens (Fig. 2). This also implies that for these assays, it is much more challenging to make use of laser microdissection to select specific regions of interest from specimens for epigenetic analysis, for example, to separate tumor cells from stromal cells [132, 133]. An additional advantage of using DNA methylation for biomarker screening is that, in contrast to the other epigenetic profiling assays discussed, the profiling can be performed on isolated genomic DNA. This enables the use of genomic DNA from clinical DNA banks to be included in DNA methylation biomarker screens.
It should be noted that in contrast to retrospective studies, it might be feasible to use fresh or fresh-frozen patient material for screening in prospective biomarker studies. However, the use of fresh(-frozen) material in these studies could interfere with further development of potential biomarkers if it turns out that these biomarkers are incompatible with (FFPE-)fixed patient material present in the clinic. In all cases, when collecting patient samples for profiling of epigenetic marks, it is important to keep the time between surgical removal and fixation or freezing as short as possible to avoid epitope destruction and/or breakdown of the chromatin. It would therefore be helpful if the procedure time up to fixation would be documented for banked samples, so as to evaluate whether such banked samples are suitable for the epigenetic profiling technology of choice.
New developments in epigenetic profiling: miniaturization and automation
Overview of the main technological advances that allowed miniaturization and single-cell readout of genome-wide epigenetic profiling assays
WGBS Conventional WGBS profiling is compatible with a relatively low number of cells (Table 1). Recently, WGBS was adapted to enable single-cell profiling (scBS-Seq; single-cell bisulfite sequencing) . Single cells were captured by fluorescence-activated cell sorting (FACS). To cope with the extensive DNA damage caused by the bisulfite treatment, Smallwood et al.  performed tagging of the DNA fragments with sequencing adaptors after bisulfite conversion as developed by Miura et al. , instead of before conversion as performed in traditional WGBS. scBS-Seq allows to get coverage of up to 48.4% over all CpGs. A subsequent study by Farlik et al.  used a similar approach for scBS-Seq but adapted it such that the whole process of library preparation following bisulfite treatment and cleanup is performed in a single tube, minimizing DNA loss and reducing contamination risk .
ChIP-Seq. Traditionally, ChIP-Seq requires a large number of cells (at least several hundred thousands). However, improvements in the sample preparation procedure to prepare the ChIPped DNA for sequencing allowed to perform ChIP-Seq profiling on 1 × 104 cells for H3K4me3 and H3K27me3 [191, 192, 193, 194] and recently even 200 cells for H3K4me3 . Also, the use of MNase for chromatin digestion has been shown to facilitate low-input ChIP-Seq [196, 197]. An alternative approach for downscaling the number of cells for ChIP-Seq is to use carrier material, such as inert proteins and/or mRNA, which do not interfere with the ChIP-Seq procedure but increase efficiency and sensitivity . This strategy allowed to perform ChIP-Seq on the TF Estrogen Receptor (ER) on 1 × 104 cells. Similarly, bacterial DNA has been used as carrier, although this comes at the cost of increased sequencing depth as the bacterial DNA remains included in the sequencing procedure . In more recent studies aiming to obtain ChIP-Seq information from low numbers of cells, barcodes or adaptors for sequencing are ligated or transposed before or during the ChIP procedure instead of after the ChIP. ChIPmentation, the use of transposase to add adaptors to DNA during the ChIP, was shown to be highly efficient and compatible with as low as 1 × 103 cells . An alternative recent strategy for low-input ChIP-Seq relies on the addition of histone octamers during ChIP to outcompete unspecific binding . Ligation of adaptors before the ChIP (indexing-first ChIP (iChIP)) allows to pool multiple samples during the ChIP-Seq procedure, after which the sequence tags can be mapped back to the original sample [202, 203]. Bernstein and his coworkers developed this further using direct adaptor ligation on MNAse treated chromatin in an automated droplet-based microfluidic device to obtain single-cell resolution for H3K4me3 and H3K4me2 ChIP-Seq . Efficient immunoprecipitations were performed by pooling 100 single cells with the addition of carrier material that is not amplified during preparation of the ChIPped DNA for sequencing. This workflow enables the profiling of thousands of individual cells in parallel, mainly due to the continuous flow of droplets that is being generated to capture the individual cells (Fig. 4). Inherent to single-cell enrichment techniques, the coverage per single cell is sparse (~1000 unique reads per cell) and does not allow comprehensive analysis of protein binding sites in individual cells. However, the single-cell ChIP-Seq was shown to be very powerful in identifying functionally-relevant subpopulations within embryonic stem cells .
ATAC-Seq/DNAseI-Seq. ATAC-Seq has been downscaled to less than 200 cells [135, 175]. Next to this, Buenrostro et al.  reported ATAC-Seq to be compatible with single-cell profiling by performing transposition on single cells captured on a commercial microfluidics platform (Fluidigm C1; Fig. 4). This allows capturing of 96 single cells in parallel and subsequent processing steps toward a full library ready for sequencing. Together, this automated epigenetic platform represents the first of its kind in which a single-cell suspension is loaded on a platform that subsequently generates a full library for sequencing without any further manual intervention. An alternative approach for single-cell ATAC-Seq has been developed by Cusanovich et al. . They performed the transposase reaction in intact nuclei on small pools, while simultaneously performing indexing of the tagged sides. Pooling followed by redistribution of the small cell numbers combined with the introduction of a second barcode for each cell allowed to map back the tags obtained after sequencing to individual cells. The advantage of this strategy is that it allows for a higher throughput as shown by the 15,000 individual cells profiled by Cusanovich et al. . Recently, also DNaseI-Seq has been further developed to facilitate low-input profiling (between 1 × 102 and 1 × 104 cells) as well as single-cell profiling . Critically, after FACS sorting of single cells followed by lysis and DNaseI digestion, large amounts of circular plasmid DNA were added during further sample preparation for sequencing. The genomic coverage of both DNaseI-Seq and ATAC-Seq in single cells is inherently low due to the fact that each cell only contains two copies of the genome. The average number of sequence reads per cell was about 317,000 reads for DNaseI-Seq  and 73,000  or 35,000  reads for ATAC-Seq after deep sequencing of the libraries. Clearly, these numbers of sequencing reads do not allow to investigate individual genomic loci within single cells. Rather, the computational analysis in both studies makes use of DNaseI hypersensitive sites (DHSs) determined in pools of cells in order to call DHSs in single cells. Despite this limitation, the single-cell chromatin accessibility assays were shown to be powerful in identifying cell-type specific transcription factors, and their variation on genomic binding within individual cells on a global scale [115, 151].
4C-Seq and HiC-Seq. 4C-Seq and HiC-Seq are relatively new techniques [182, 183, 184], for which optimization to low cell numbers have not been extensively reported yet. However, it has been shown that HiC-Seq is compatible with single-cell profiling by performing in-nuclei DNA digestion and ligation and subsequent manual picking of individual nuclei. Using single-cell HiC-Seq it was shown that the large megabase-sized TADs that have been identified in large populations of cells are also present in single cells [205, 206]. Furthermore, single-cell HiC-Seq was shown to be very powerful to reconstruct chromosome folding. Although providing information at single-cell resolution, the single-cell HiC-Seq protocol requires 1 × 107 cells as starting material to facilitate the early steps of the protocol. Inherent to the HiC-Seq protocol, the resolution obtained in individual cells is low. Currently, between 10,000 and 30,000 ligation events are profiled per cell .
Miniaturization of epigenetic profiling
Overview of the number of cells required for the various epigenetic profiling assays
Epigenomic profiling method
Cell input using traditional profiling on bulk cells to obtain optimal data quality
Cell input using miniaturized profiling
Compatible with single-cell readout
Compatible with single cell as input
3 × 103
3 × 103
0.5–5 × 106*
1 × 104 or more
1–10 × 106
1 × 103
5 × 104
2 × 102
2.5 × 107
1 × 107
1 × 106
Automation of epigenetic profiling
The use of genome-wide epigenetic profiling for biomarker discovery strongly benefits from automated procedures that are compatible with upscaling to facilitate large-scale screens. Main advantages of automation include (i) a reduction in variability and batch effects, both of which are frequently observed in epigenetic profiling, (ii) increased throughput, (iii) reduced procedure and/or hands-on time, and (iv) lower error rates. In light of the limited number of cells within clinical samples, a combination of automation and miniaturization is likely to be beneficial in most cases. This comes with the additional advantage of reduced reagent cost, which can be substantial considering the high costs associated with epigenetic profiling. It should be noted that epigenetic profiling thus far is mainly being performed within basic research settings on relatively small sample sizes, which are well compatible with manual handling. Therefore, most automated platforms have been developed recently to cope with the increasing sample sizes and the profiling of more challenging (clinical) samples. In this section, we focus on automation of bulk and miniaturized epigenetic profiling; information on automation of single-cell technologies is included in Table 2.
Efforts to design automated workflows for epigenetic profiling have mainly been focused on ChIP-Seq and to a lesser extent on DNA methylation profiling. This can be explained by the fact that DNA methylation profiling, and chromatin profiling (ATAC-Seq/DNAseI-Seq) as well, is relatively straightforward and therefore well compatible with manual handling. Considering 4C-Seq and HiC-Seq, these are relatively new technologies for which automated workflows have not been reported yet. For DNA methylation profiling, (parts of) the workflow for MBD-Seq, MethylCap-Seq, and MeDIP-Seq have been designed on custom-programmed robotic liquid handling systems [140, 141, 142]. For ChIP-Seq, immunoprecipitations and subsequent sample preparation for sequencing have been designed on the same or similar robotic systems [143, 144, 145, 146]. However, these robotic workflows require large amounts of starting material in the range of 1 × 106 cells or more. Clearly, with such input requirements, these platforms are not readily compatible with biomarker discovery.
More recently, miniaturized automated platforms have been described for ChIP-Seq using PDMS (polydimethylsiloxane)-based microfluidic devices that have been designed to perform automated immunoprecipitations. These platforms allow to perform ChIP-Seq using as low as 1 × 103 cells  or 100 cells  due to very small reaction volumes, providing proof-of-principle that automated low-input ChIP-Seq profiling is feasible. However, to facilitate high-throughput profiling, it would be important to increase the number of parallel samples to be profiled, as currently these platforms contain a maximum of assaying four samples in parallel [147, 148]. Furthermore, integration with the labor-intensive DNA library preparation procedure would be desirable; stand-alone library preparation platforms on microfluidic devices have been reported [149, 150]. For DNA methylation profiling, various commercial low-input bisulfite conversion kits have been shown to be compatible with automation. However, a fully automated miniaturized DNA methylation profiling platform has not been reported yet.
Biomarkers are highly valuable and desirable in a wide range of clinical settings, ranging from pharmacodynamics to monitoring treatment. Here, we have provided an overview of recent developments within genome-wide profiling technologies that may enable future large-scale screens for candidate epigenetic biomarkers. When comparing compatibility with miniaturization, automation and tissue preservation methods, bisulfite-based DNA methylation profiling is currently by far superior to other epigenetic profiling technologies for large-scale biomarker discovery. DNA methylation assays are technically less challenging than most other profiling assays, as it is not dependent on delicate enzymatic reactions or on immunoprecipitation, but on chemical conversion. A critical advantage of DNA methylation profiling over other assays is that is not affected by freezing or chemical fixation, and therefore very well compatible with (archived) clinical samples. DNA methylation profiling has the additional advantage that it requires a relatively low number of cells as input. In line with these advantages, most of the epigenetic biomarkers that have been identified thus far involve changes in DNA methylation.
Despite the advantages of DNA methylation, various other epigenetic marks are promising biomarkers. Histone-modifying enzymes are frequently mutated in a range of diseases, often directly affecting epigenetic patterns of post-translational histone modifications. The main methodology to profile these post-translational histone modifications is ChIP-Seq. ChIP-Seq is challenging on samples containing low numbers of cells as well as on archived samples, often resulting in variability in signal-to-noise ratios. However, in view of the continuous improvements in ChIP-Seq procedures for (ultra-)low input samples and for fixed samples, large scale ChIP-Seq-based screens for candidate biomarkers is likely to become feasible in the near future. These screens might benefit from the automated ChIP(-Seq) platforms that are currently being developed. The development of such automated platforms will also facilitate robust integration of ChIP assays as a diagnostic tool in clinical practice.
Of the remaining technologies discussed in this paper, ATAC-Seq and DNAseI-Seq seem most compatible with profiling of clinical samples, requiring as low as several hundred cells as input. Both ATAC-Seq and DNAseI-Seq are compatible with frozen patient samples [125, 126, 127, 128], while DNAseI-Seq was recently successfully applied on FFPE samples . However, as compared to DNAseI-Seq, the workflow of ATAC-Seq is much more straightforward as the adaptors for sequencing are inserted as part of the transposition. Also, at least for single-cell ATAC-Seq, a fully automated platform has been developed . For biomarker discovery, compatibility of ATAC-Seq with FFPE samples would be highly desirable, as this would enable to include clinical samples from biobanks in large-scale ATAC-Seq profiling studies. This might be achieved by incorporating critical steps from the FFPE-compatible DNAseI-Seq. Although the use of open chromatin as an epigenetic biomarker has been rare thus far, the flexibility and ease of the recently developed ATAC-Seq (and possibly DNAseI-Seq) will undoubtedly boost the use of open chromatin in clinical research and clinical practice.
Together, this review shows that genome-wide epigenetic profiling technologies have very rapidly matured over the past decade. While originally these technologies were only compatible with large numbers of (in vitro cultured) cells, most of these can now be applied on samples containing very low numbers of primary cells down to single cells. Combined with an increasing number of sophisticated workflows and (automated) platforms, this will pave the way for large-scale epigenetic screens on clinical patient material. Such screens are essential to fill the need for new biomarkers for disease diagnosis, prognosis, and selection of targeted therapies, necessary for personalized medicine.
We thank members of the Marks’ and Stunnenberg laboratories for discussion and insight. We thank Dr. Arjen Brinkman, Dr. Richard Bartfai, and Dr. Joost Martens for their input on the manuscript. We thank COST action “CM1406- Epigenetic Chemical Biology” for their financial support regarding the publication fee.
Research in the group of HGS is supported by the European Union grant BLUEPRINT (FP7/2011: 282510) and ERC-2013-ADG-339431 “SysStemCell.” Research in the group of HM is supported by a grant from the Netherlands Organization for Scientific Research (NWO-VIDI 864.12.007).
Availability of data and materials
HM prepared the main text and tables with help of RD and HGS. RD prepared the figures. All authors contributed to the content. All authors read and approved the final manuscript.
The authors declare that they have no competing interests.
Consent for publication
Ethics approval and consent to participate
- 25.Kretzmer H, Bernhart SH, Wang W, Haake A, Weniger MA, Bergmann AK, et al. DNA methylome analysis in Burkitt and follicular lymphomas identifies differentially methylated regions linked to somatic mutation and transcriptional control. Nat Genet. 2015;47(11):1316–25. doi: 10.1038/ng.3413.PubMedCrossRefGoogle Scholar
- 33.Myzak MC, Tong P, Dashwood WM, Dashwood RH, Ho E. Sulforaphane retards the growth of human PC-3 xenografts and inhibits HDAC activity in human subjects. Exp Biol Med (Maywood). 2007;232(2):227–34.Google Scholar
- 38.Byun HM, Siegmund KD, Pan F, Weisenberger DJ, Kanel G, Laird PW, et al. Epigenetic profiling of somatic tissues from human autopsy specimens identifies tissue- and individual-specific DNA methylation patterns. Hum Mol Genet. 2009;18(24):4808–17. doi: 10.1093/hmg/ddp445.PubMedPubMedCentralCrossRefGoogle Scholar
- 45.Jin B, Tao Q, Peng J, Soo HM, Wu W, Ying J, et al. DNA methyltransferase 3B (DNMT3B) mutations in ICF syndrome lead to altered epigenetic modifications and aberrant expression of genes regulating development, neurogenesis and immune function. Hum Mol Genet. 2008;17(5):690–709. doi: 10.1093/hmg/ddm341.PubMedCrossRefGoogle Scholar
- 48.Miao F, Smith DD, Zhang L, Min A, Feng W, Natarajan R. Lymphocytes from patients with type 1 diabetes display a distinct profile of chromatin histone H3 lysine 9 dimethylation: an epigenetic study in diabetes. Diabetes. 2008;57(12):3189–98. doi: 10.2337/db08-0645.PubMedPubMedCentralCrossRefGoogle Scholar
- 54.Garcia-Manero G, Yang H, Bueso-Ramos C, Ferrajoli A, Cortes J, Wierda WG, et al. Phase 1 study of the histone deacetylase inhibitor vorinostat (suberoylanilide hydroxamic acid [SAHA]) in patients with advanced leukemias and myelodysplastic syndromes. Blood. 2008;111(3):1060–6. doi: 10.1182/blood-2007-06-098061.PubMedCrossRefGoogle Scholar
- 63.Thirlwell C, Eymard M, Feber A, Teschendorff A, Pearce K, Lechner M, et al. Genome-wide DNA methylation analysis of archival formalin-fixed paraffin-embedded tissue using the Illumina Infinium HumanMethylation27 BeadChip. Methods (San Diego, Calif). 2010;52(3):248–54. doi: 10.1016/j.ymeth.2010.04.012.CrossRefGoogle Scholar
- 83.Rousu J, Agranoff DD, Sodeinde O, Shawe-Taylor J, Fernandez-Reyes D. Biomarker discovery by sparse canonical correlation analysis of complex clinical phenotypes of tuberculosis and malaria. PLoS Comput Biol. 2013;9(4):e1003018. doi: 10.1371/journal.pcbi.1003018.PubMedPubMedCentralCrossRefGoogle Scholar
- 86.Milani L, Lundmark A, Kiialainen A, Nordlund J, Flaegstad T, Forestier E, et al. DNA methylation for subtype classification and prediction of treatment outcome in patients with childhood acute lymphoblastic leukemia. Blood. 2010;115(6):1214–25. doi: 10.1182/blood-2009-04-214668.PubMedCrossRefGoogle Scholar
- 100.Daca-Roszak P, Pfeifer A, Zebracka-Gala J, Rusinek D, Szybinska A, Jarzab B, et al. Impact of SNPs on methylation readouts by Illumina Infinium HumanMethylation450 BeadChip Array: implications for comparative population studies. BMC Genomics. 2015;16:1003. doi: 10.1186/s12864-015-2202-0.PubMedPubMedCentralCrossRefGoogle Scholar
- 103.van Veldhoven K, Polidoro S, Baglietto L, Severi G, Sacerdote C, Panico S, et al. Epigenome-wide association study reveals decreased average methylation levels years before breast cancer diagnosis. Clinical epigenetics. 2015;7(1):67. doi: 10.1186/s13148-015-0104-2.PubMedPubMedCentralCrossRefGoogle Scholar
- 108.Zhao S, Geybels MS, Leonardson A, Rubicz R, Kolb S, Yan Q, et al. Epigenome-wide tumor DNA methylation profiling identifies novel prognostic biomarkers of metastatic-lethal progression in men with clinically localized prostate cancer. Clin Cancer Res. 2016. doi: 10.1158/1078-0432.ccr-16-0549.Google Scholar
- 118.Lou JJ, Mirsadraei L, Sanchez DE, Wilson RW, Shabihkhani M, Lucey GM, et al. A review of room temperature storage of biospecimen tissue and nucleic acids for anatomic pathology laboratories and biorepositories. Clin Biochem. 2014;47(4-5):267–73. doi: 10.1016/j.clinbiochem.2013.12.011.PubMedCrossRefGoogle Scholar
- 119.Shabihkhani M, Lucey GM, Wei B, Mareninov S, Lou JJ, Vinters HV, et al. The procurement, storage, and quality assurance of frozen blood and tissue biospecimens in pathology, biorepository, and biobank settings. Clin Biochem. 2014;47(4-5):258–66. doi: 10.1016/j.clinbiochem.2014.01.002.PubMedPubMedCentralCrossRefGoogle Scholar
- 131.Fanelli M, Amatori S, Barozzi I, Soncini M, Dal Zuffo R, Bucci G, et al. Pathology tissue-chromatin immunoprecipitation, coupled with high-throughput sequencing, allows the epigenetic profiling of patient samples. Proc Natl Acad Sci U S A. 2010;107(50):21535–40. doi: 10.1073/pnas.1007647107.PubMedPubMedCentralCrossRefGoogle Scholar
- 133.Schillebeeckx M, Schrade A, Lobs AK, Pihlajoki M, Wilson DB, Mitra RD. Laser capture microdissection-reduced representation bisulfite sequencing (LCM-RRBS) maps changes in DNA methylation associated with gonadectomy-induced adrenocortical neoplasia in the mouse. Nucleic Acids Res. 2013;41(11):e116. doi: 10.1093/nar/gkt230.PubMedPubMedCentralCrossRefGoogle Scholar
- 144.Berguet G, Hendrickx J, Sabatel C, Laczik M, Squazzo S, Mazon Pelaez I, et al. Automating ChIP-Seq experiments to generate epigenetic profiles on 10,000 HeLa cells. J Vis Exp. 2014:(94). doi:10.3791/52150.Google Scholar
- 145.Gasper WC, Marinov GK, Pauli-Behn F, Scott MT, Newberry K, DeSalvo G, et al. Fully automated high-throughput chromatin immunoprecipitation for ChIP-Seq: identifying ChIP-quality p300 monoclonal antibodies. Scientific reports. 2014;4:5152. doi: 10.1038/srep05152.PubMedPubMedCentralCrossRefGoogle Scholar
- 150.Tan SJ, Phan H, Gerry BM, Kuhn A, Hong LZ, Min Ong Y, et al. A microfluidic device for preparing next generation DNA sequencing libraries and for automating other laboratory protocols that require one or more column chromatography steps. PLoS One. 2013;8(7):e64084. doi: 10.1371/journal.pone.0064084.PubMedPubMedCentralCrossRefGoogle Scholar
- 153.Gal-Yam EN, Egger G, Iniguez L, Holster H, Einarsson S, Zhang X, et al. Frequent switching of Polycomb repressive marks and DNA hypermethylation in the PC3 prostate cancer cell line. Proc Natl Acad Sci U S A. 2008;105(35):12979–84. doi: 10.1073/pnas.0806437105.PubMedPubMedCentralCrossRefGoogle Scholar
- 175.Buenrostro JD, Giresi PG, Zaba LC, Chang HY, Greenleaf WJ. Transposition of native chromatin for fast and sensitive epigenomic profiling of open chromatin, DNA-binding proteins and nucleosome position. Nat Methods. 2013;10(12):1213–8. doi: 10.1038/nmeth.2688.PubMedPubMedCentralCrossRefGoogle Scholar
Open AccessThis article is distributed under the terms of the Creative Commons Attribution 4.0 International License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated.