Novel long-range regulatory mechanisms controlling PKD2 gene expression
Cis-regulatory elements control gene expression over large distances through the formation of chromatin loops, which allow contact between enhancers and gene promoters. Alterations in cis-acting regulatory systems could be linked to human genetic diseases. Here, we analyse the spatial organization of a large region spanning the polycystic kidney disease 2 (PKD2) gene, one of the genes responsible of autosomal dominant polycystic kidney disease (ADPKD).
By using chromosome conformation capture carbon copy (5C) technology in primary human renal cyst epithelial cells, we identify novel contacts of the PKD2 promoter with chromatin regions, which display characteristics of regulatory elements. In parallel, by using functional analysis with a reporter assay, we demonstrate that three DNAse I hypersensitive sites regions are involved in the regulation of PKD2 gene expression.
Finally, through alignment of CCCTC-binding factor (CTCF) sites, we suggest that these novel enhancer elements are brought to the PKD2 promoter by chromatin looping via the recruitment of CTCF.
KeywordsADPKD, PKD2, Chromatin organization, Enhancer, Gene expression, Transcription regulation
Chromosome conformation capture
Chromosome conformation capture carbon copy
Chromosome conformation capture carbon copy - sequencing
Autosomal dominant polycystic kidney disease
Renal cyst epithelial cells
DNAse I hypersensitive sites
Polycystic kidney disease
Renal proximal tubule epithelial cells
Topologically associated domain
Transcriptional start site
Autosomal dominant polycystic kidney disease (ADPKD) is one of the most common Mendelian genetic disorders with an incidence of 1/400 to 1/2000 affected individuals worldwide . The disorder is heterogeneous with two main genes, polycystic kidney disease 1 (PKD1) and 2 (PKD2), which are responsible for most disease cases [2, 3]. While PKD1 accounts for 75–80% of patients, 10–15% of patients have defects in PKD2. A third gene, GANAB has recently been identified as accounting for less than 1% of affected patients . Although more than 1200 PKD1 and almost 200 PKD2 pathogenic mutations have been reported (ADPKD Mutation Database, http://pkdb.mayo.edu/), a significant proportion of patients with ADPKD symptoms (5–10%) do not appear to carry these pathogenic gene mutations [5, 6, 7]. There is significant clinical heterogeneity in the severity of the disease among patients as well as within the same family.
PKD1 and PKD2 encode two transmembrane glycoproteins, namely polycystin-1 (PC1) and − 2 (PC2) which interact via their C-terminal tails . The regulated expression of the PC1/PC2 complex is crucial to understand cyst formation and eventually modulate cyst expression.
The correct expression of a gene relies on the regulatory mechanisms that control it. A part of the regulatory network consists of cis-regulatory sequences, such as enhancers and insulators, which may be located far away from their target gene [9, 10, 11, 12]. These regulatory elements control gene expression over long distances via the formation of chromatin loops allowing physical proximity between enhancers and gene promoters . Remote regulatory sequences can be identified by mapping DNAse I hypersensitive sites (DHS), which correspond to regions where DNA is more accessible [14, 15, 16]. In this context, variations in chromatin organization or alterations of long-range acting elements may impact gene expression. Mutations of these cis-acting regulatory sequences can result in human genetic diseases [17, 18], such as aniridia  or blepharophimosis syndrome [20, 21]. Therefore, to fully decipher gene expression and genetic disorders, it is particularly important to investigate genetic regulatory networks.
Since the regulation of polycystin is crucial to understand cystogenesis, we decided to explore the long distance cis-regulation of PKD2. The PKD2 gene is expressed in several tissues and more particularly in the kidney. In addition to its tissue-specific expression, PKD2 is also temporally regulated . Moreover, it has been shown that the PC levels can be strictly controlled [23, 24, 25, 26]. The mechanisms controlling the tight regulation of PKD2 expression remain poorly characterized and understood. The PKD2 promoter does not feature basic regulatory elements such as a TATA or CCAAT box, is GC-rich, and contains multiple putative Sp1, NF-1 and AP-2 protein binding sites [27, 28]. Computational analyses and experimental assays have also described other putative transcription factor binding sites in the PKD2 promoter region [29, 30, 31]. Moreover, several studies have shown that PKD2 expression is regulated by numerous protein interactions including PKD1 or TRPC1 [32, 33, 34, 35, 36, 37, 38]. This control can also occur by other binding protein partners through 5′ and 3′ untranslated regions of PKD2 mRNA [39, 40, 41, 42].
Recent progress in molecular biology and bioinformatics analyses has allowed the development of new techniques to analyse the spatial organization of chromatin. The chromosome conformation capture (3C) technique developed by Dekker and colleagues in 2002  measures the level of contacts between any two defined chromatin regions, and subsequent high-throughput approaches are used to map chromosomal interactions at the scale of the genome. Among these, the chromosome conformation capture carbon copy (5C) measures interaction frequencies within large chromosomal regions at high resolution . We recently used this approach to identify several CFTR enhancers  and applied it here to a large region spanning the PKD2 locus in primary human cyst renal epithelial cells. We choose to firstly conduct our long-range cis-regulation study with the PKD2 gene because of challenging molecular analyses of the PKD1 gene . Approximately 70% of the PKD1 gene (exons 1–33) is duplicated six times within six pseudogenes (PKD1P1 to PKD1P6), which share a 97.7% sequence identity and a high GC content . We identify novel contacts of the PKD2 promoter with chromatin regions, which display characteristics of regulatory elements. In parallel, by using functional analysis with a reporter assay we demonstrate that three DHS regions are involved in the regulation of PKD2 gene expression. Finally, through alignment of CCCTC-binding factor (CTCF) sites, we suggest that these novel enhancer elements are brought to the PKD2 promoter by chromatin looping via CTCF.
Analysing the human PKD2 locus with the 5C technology
Very little is known about the regulatory elements that control PKD2 transcription. To date, no study has examined the involvement of long-range regulatory elements in the mechanisms controlling PKD2 expression. To pinpoint new regulatory elements involved in the control of PKD2 expression, we decided to carry out 5C analysis in primary human cells. We used renal cyst epithelial cells (CRC) isolated from patients with ADPKD (carrying a PKD1 mutation) from the Genkyst cohort who had undergone nephrectomy.
We chose to seek the PKD2 promoter-interacting partners in a larger locus. To delineate the domain where the PKD2 promoter interactions are more likely to occur, we used Hi-C data generated from a human lung carcinoma cell line A549 (one of cell lines with highest expression of PKD2) [48, 49, 50]. Hi-C is a molecular technique, which allows genome-wide quantification of chromosomal interactions in cell populations . This method revealed that chromosomes are organized into topologically associating domains (TADs) corresponding to regions enriched in chromatin contacts [48, 52].
TADs represent folded DNA regions, ranging in size from several hundred kilobases to a few megabases, which is rather conserved across cell types and species [53, 54, 55]. TADs are delimited by boundaries, and loci located in different TADs typically interact less frequently than those within the same TAD, suggesting that boundaries act as physical insulators.
PKD2 promoter is engaged in long-distance interactions
PKD2 contact regions display characteristics of regulatory elements
Some of the 5′ and 3′ regions engaged in long-range contacts with the PKD2 promoter might correspond to DNA elements important for its regulation. We therefore tested if some of these regions displayed features of enhancers by using available databases for different markers of regulatory elements. We aligned our 5C interaction profiles from the four CRC samples with datasets from the human Renal Proximal Tubule Epithelial Cells (RPTECs). We assumed that the regulatory elements controlling PKD2 in this cell line should be, at least in part, similar to the regulatory network in cyst epithelial cells. We specifically looked for the presence of DHS or H3K27Ac marks because these often reflect the presence of regulatory elements.
Several DHS regions show enhancer activities
As shown in Fig. 4b, three of our DHS fragments significantly enhanced PKD2 promoter activity. The fragment encompassing DHS H (hg19, chr4:89,099,146-89,100,467), located at + 170 kb from PKD2 TSS, displayed a strong enhancer function with more than ten-fold effect on the PKD2 promoter. The DHS region F (hg19, chr4: 88,938,944-88,940,420), located at + 10 kb from PKD2 TSS, increased the promoter activity by 5.1-fold and DHS C (hg19, chr4:88,895,792-88,897,206), located at − 32 kb from PKD2 TSS, had a more modest enhancer activity with an almost three-fold effect. However, we observed no evidence of cooperation among these three elements (data not shown). The other DHS regions did not show any enhancer activity in A549 cells.
Formation of a PKD2 “active chromatin hub”
The crucial step in the development of cysts in ADPKD is linked to the amount of functional PC1 or PC2, and a reduction in the amount of these functional polycystins (also called the dosage model) leads to cystogenesis . It is possible that the threshold could vary between tissues and cell types as well as with the type of mutated allele. This highlights the importance of understanding the regulation and expression of polycystins in order to discover new targets to increase their expression and possibly decrease cyst formation and slow disease progression.
Because the regulatory mechanisms of PKD2 transcription remain largely uncharacterized and some cases of ADPKD have no genotypic explanation , it is essential to investigate alternative control mechanisms, particularly long-range regulation. We used 5C technology to identify regions that interact specifically with the PKD2 promoter and may be involved in regulating its expression. We decided to study these interactions in primary cells to avoid altered expression patterns of cell lines. For this purpose, we used human renal epithelial tissues obtained from nephrectomy.
Despite some minor variations among cell samples, our results show that the PKD2 promoter is engaged in numerous long-range interactions (Fig. 2a). These chromatin contacts are mostly located within the PKD2 sub-TAD (Fig. 2b, c). These interacting partners lie downstream and upstream of extragenic regions and within intronic regions, preferentially within the PKD2 gene as well as just upstream and downstream of regions containing SPP1 and ABCG2 genes. Interactions of the PKD2 promoter with neighbouring regions decrease substantially with distance, especially at a region corresponding to a TAD boundary (dashed green box, Fig. 2b, c). This is consistent with studies that recently showed that chromatin interactions within boundary regions is practically non-existent [48, 52].
Our 5C data revealed four strong interacting regions that correlate with marks of regulatory elements (Fig. 3a-c). For example, we detected contacts between the PKD2 promoter and a region located ~ 150 kb upstream (Fig. 3a, region I) containing DHS sites (Fig. 3b) and a high CTCF binding site (Fig. 5a, b).
We also found interactions between the promoter and an element located at ~ 110 kb from the PKD2 TSS in 5′ region (region II), which coincides with DHS and H3K27Ac marks. Furthermore, we detected interactions with two others regions located at 16 kb upstream and 5 kb downstream of the PKD2 TSS (region III and IV), which correspond to strong DHS, H3K27Ac marks and are at proximity of CTCF binding sites.
In addition, by using reporter assays where luciferase expression is driven by a PKD2 promoter fragment, we found that three DHS regions act as enhancers (Fig. 4b). The region H, located at + 170 kb of the PKD2 promoter show the strongest enhancer activity with more than 10-fold increase of the PKD2 expression in A549 cells. DHS C at approximately − 32 kb and DHS F at approximately + 10 kb from the PKD2 TSS have more modest enhancer activities, increasing the promoter activity by 2.8- and 5.1-fold, respectively. These three regions span sequences found to bind different types of transcription factors and chromatin remodelers in an ENCODE compilation of 161 factors with Factorbook motifs in 91 cell types (Additional file 3: Figure S1). To identify the transcription factors driving enhancer activities, these DHS regions were evaluated in subfragments. These segments showed a more modest activity even no enhancement compared with entire regions (data not shown). These results suggest that combinatorial regulatory elements would be involved [66, 67]. For example, transcription factors FOS and JUND could act cooperatively in region F.
In addition, others regions (Fig. 3, regions I to IV that correspond to DHS A, B, D and E, respectively) that correlate with regulatory markers and interact with the PKD2 promoter without increasing its expression, may still function in the regulation of PKD2 through other mechanisms or by acting in a combinatorial fashion. In particular, enhancers C and F are located directly near DHS fragments D and E which are respectively not so far and at close proximity of CTCF binding sites, suggesting that the increase of the PKD2 promoter by these cis-acting regulatory elements could depend on the active chromatin looping driven by these insulator factors which bring them into spatial proximity.
In conclusion, we identified a new set of PKD2 distal cis-regulatory elements in primary human renal epithelial cells. We proposed a looping model in the organization of the PKD2 locus in renal cells via the formation of a chromatin loop with CTCF-CTCF binding around the gene.
It is evident that other sequence variabilities in the coding region, in the promoter sequence or at a distance from the promoter region of the PKD genes are important to explain the regulation of the promoter activity, which can influence cyst formation .
Future work will be aimed at identifying this long-range regulation of PKD2 expression and, in particular, the trans-acting factors that mediate it in greater detail.
Identification of long-range acting regulatory elements that affect expression of genes may also be important for genetic diagnosis of ADPKD. It will also be very interesting to study cis-regulation with the PKD1 promoter and potential co-regulation between these two genes. Mutations in these regulatory elements can also explain many cases of patients who have either a hypomorphic allele in the PKD1 or PKD2 gene coding sequences or none at all. Moreover, genetic alterations within remote-acting regions could lead to PKD dysregulation, which might explain variability in phenotypic features of ADPKD, such as the development of renal insufficiency . Some regulatory variants have already been associated to extensive phenotypic variability. Indeed, recently non-coding variants have been related with cystic fibrosis disease traits . Thus, these PKD2 cis-regulatory elements could be sequenced in ADPKD patients in whom no mutation was found in the PKD gene coding sequences or patients with extreme clinical phenotypes. Maybe, with time, these regions would be analysed by whole-genome sequencing .
In conclusion, we reported for the first time the use of 5C-seq technology in primary human cells to analyse the chromatin conformation of a locus involved in ADPKD and identified new regulatory elements. We demonstrated that three regions surrounding the PKD2 gene interact with the promoter and activate its expression in renal epithelial cells. We proposed a regulatory looping model in the 3D-organization of the PKD2 locus in renal cells via the formation of a chromatin loop thanks to the recruitment of CTCF.
These findings provide valuable information on the basic insights of the regulation of this gene, and the description of PKD2 regulatory elements may be important for gene targeting constructs for therapy development. They also afford new targets for mutation screening in genetic diagnosis of ADPKD.
Cell collection and culture
Human renal tissues were obtained from PKD1 patients with ADPKD (carrying no PKD2 mutation) from the Genkyst cohort who had undergone nephrectomy. Several cyst tops were individually dissected with a scalpel. Each cyst was rinsed thoroughly in ice-cold phosphate buffered saline (PBS) and then placed in a separate tube containing 0.15% Pronase E. Cysts were incubated for 24 h at 4 °C to allow separation of cells from the cyst wall. Dissociation was stopped by addition of foetal bovine serum (FBS). The cyst walls stripped of cells were removed by filtration, and the cells were washed twice in PBS. Renal cyst epithelial cell (CRC) suspensions were centrifuged for 5 min at 2200 g at 4 °C. Cell pellets were resuspended in a selected and optimized cell culture media for human renal epithelial cells (REGM; Lonza), and cells were cultured following the manufacturer’s instructions.
A549 human lung carcinoma cells  were grown in Dulbecco’s modified eagle medium (DMEM; Lonza) with 10% FBS.
All cells were grown on plastic at liquid interface at 37 °C in 5% CO2 saturated humid air.
Cell isolation and fixation for 5C analysis
CRC cells were fixed with 1.5% formaldehyde for 10 min at room temperature. Crosslinking was stopped with glycine (125 mM final concentration) by incubating 5 min at room temperature followed with 15 min on ice. Cells were scraped and centrifuged at 400 g for 10 min at 4 °C. Supernatants were removed and the cell pellets were quick-frozen on dry ice.
3C library preparation
The fixed CRC cell pellets were treated as previously described [73, 74]. Briefly, 10 million cells were incubated in lysis buffer (10 mM Tris (pH 8.0), 10 mM NaCl, 0.2% NP-40, supplemented with fresh protease inhibitor cocktail) for 10 min on ice. Cells were then disrupted on ice with a Dounce homogenizer (pestle B; 2 × 20 strokes). Nuclei were recovered by centrifugation, washed twice with 1X EcoRI buffer (NEB) and resuspended in 100 μl of 1X EcoRI buffer.1X EcoRI buffer (337 μl) was added to 50 μl of cell suspension, and the mixture was incubated for 10 min at 65 °C with 0.1% SDS final concentration (38 μl). Triton X-100 (44 μl of 10% Triton X-100) was added before overnight digestion with EcoRI (400 U). The restriction enzyme was inactivated on the next day by adding 86 μl of 10% SDS and incubating 30 min at 65 °C. Samples were then individually diluted into 7.62 ml of ligation mix (750 μl of 10% Triton X-100, 750 μl of 10X ligation buffer, 80 μl of 10 mg/ml BSA, 80 μl of 100 mM ATP and 3000 Cohesive end Units of T4 DNA ligase). Ligation was carried out at 16 °C for 2 h.
The 3C libraries were next incubated overnight with 45 μL of Proteinase K (10 mg/ml) at 55 °C.The DNA was purified by phenol-chloroform extraction and precipitated with 3 M NaOAc pH 5.2 (800 μl) and cold ethanol. After at least 1 h at − 80 °C, the DNA was recovered by centrifugation, and the pellets were washed with cold 70% ethanol and then resuspended in 400 μl of 1X TE pH 8.0. A second phenol-chloroform extraction and precipitation with 3 M NaOAc pH 5.2 (40 μl) and cold ethanol was performed. DNA was recovered by centrifugation and washed eight times with cold 70% ethanol. The pellets were finally dissolved in 100 μl of 1X TE pH 8.0 and incubated with RNAse A (1 μl at 10 mg/ml) for 15 min at 37 °C.
5C primer and library design
5C primers covering the PKD2 region (hg19, chr4: 88,560,000-89,230,000), the ERCC3 region (hg19, chr2:128,014,866-128,051,752) and the ENr313 region (hg19, chr16:62,276,449-62,776,448) were designed using “my5C.primer”  with the following parameters: optimal primer length of 30 nt, optimal TM of 60 °C, default primer quality parameters (mer: 800, U-blast: 3, S-blasr: 50). Primers were not designed for large (> 20 kb) restriction fragments. Low complexity and repetitive sequences were excluded from our experimental design such that not all fragments could be probed in our assays. Primers with several genomic targets were also removed.
The universal A-key (CCATCTCATCCCTGCGTGTCTCCGACTCAG-(5C-specific)) and the P1-key tails ((5C-specific)-ATCACCGACTGCCCATAGAGAGG) were added to the Forward and Reverse 5C primers, respectively. Reverse 5C primers were phosphorylated at their 5′ ends. An anchored 5C design was used for the PKD2 locus, 2 Reverse 5C primers targeted the PKD2 promoter while 24 Forward 5C primers covered the surrounding region.
For the control regions, an alternating 5C design was used: alternating Forward and Reverse 5C primers covering ERCC3 and ENr313 regions were used to generate the 5C libraries. This design used 31 primers (2 Forward / 3 Reverse for the ERCC3 region, 13 Forward / 13 Reverse ENr313 region). All 5C primer sequences are listed in Additional file 1: Table S1.
5C library preparation
5C libraries were prepared and amplified with the A-key and P1-key primers following a procedure described previously . Briefly, 3C libraries were first titrated by PCR for quality control (single band, absence of primer dimers, etc.) and to verify that contacts are amplified at frequencies similar to what is usually obtained from comparable libraries (same DNA amount from the same species and karyotype) [73, 76, 77].
The 5C primer stocks (20 μM) were diluted individually in water on ice, and mixed to a final concentration of 0.002 μM. The mixed primers were combined with annealing buffer (10X NEBuffer 4, New England Biolabs Inc.) on ice in reaction tubes. Salmon testis DNA was added to each 5C reaction, followed by the 3C libraries and water for a final volume of 10 μl. Samples were denatured at 95 °C for 5 min, and annealed at 50 °C for 16 h before ligation with Taq DNA ligase (10 U) for one hour. Each ligation reaction was then PCR-amplified individually with primers against the A-key and P1-key primer tails. We used 35 cycles based on dilution series showing linear PCR amplification within that cycle range. The 5C PCR products of corresponding 3C libraries were pooled before purifying the DNA on MinElute columns (Qiagen).
The 5C libraries were quantified on agarose gel and diluted to 0.012 ng/μl (for Ion PGM™ Template OT2 200 Kit). A 25 μL volume of the diluted 5C library was used for sequencing with an Ion PGM™ sequencer. Samples were sequenced onto Ion 314™ Chips v2 or Ion 316™ Chips following the Ion PGM™ Template OT2 200 Kit and Ion PGM™ Sequencing 200 Kit v2 protocols recommended by the manufacturer (Life Technologies™).
5C data analyses
Analysis of the 5C sequencing data were performed as described previously [57, 59]. The sequencing data were processed through a Torrent 5C data transformation pipeline on our local instance of Galaxy as previously described [57, 78] (http://galaxy.bci.mcgill.ca/galaxy/). This analysis generates an excel sheet containing interaction frequency lists (IFL) as well as a text file, which were used to visualize results using “my5C-heatmap” .
Normalization between different libraries was done first by read count and then by using the compaction profiles for the ERCC3 region and the gene desert region ENr313 after setting one sample (CRC) as a reference.
The ERCC3 and ENr313 5C data for each sample were normalized by dividing the number of reads of each 5C contact by the total number of reads from the corresponding sequence run. A ratio, calculated by dividing these normalized data by normalized data from a reference sample (CRC), was applied to the corresponding raw data of the study region for each sample.
5C data was averaged, optimally binned at 80 kb (6× step) and are represented in heatmap form where interaction frequencies are colour-coded according to the respective scales. The average interaction frequencies are from the number of sequence reads after normalization. Data was binned based on distance for several reasons. First, by binning the data and re-dividing it into defined genomic distances (or “steps”), 5C signals are averaged along the region such that 5C contacts with greater error rates will not be seen as false positive or negative. This is particularly important given that the data is displayed on a color scale and that the eye does not detect signal linearly. By binning and re-dividing we also avoid displaying gaps which my5C inserts as grey lines that detracts attention from the actual findings. At the same time, this transformation allows us to align the data directly onto the genomic region without having to use “true size” - a display of the data where bin sizes are the same as the fragments sizes and which can enhance or reduce contact frequency to the eye. “True size” displays also insert grey lines in the heatmaps.
Databases and URLs
The Hi-C data from human A549 were downloaded from computational and functional Genomics/Epigenomics Yue Lab website at http://promoter.bx.psu.edu/hi-c/view.php. The Hi-C data shown in a heatmap format were binned at 40 kb. TAD and sub-TADs were outlined manually based on clustering of interaction frequencies.
DHS peaks, CTCF binding sites datasets of H3K27Ac marks for RPTEC and A549 and transcription factor ChIP-seq data of 161 factors from 91 cell types were obtained and visualized using the UCSC genome browser (http://genome.ucsc.edu) .
The “my5C-primer” and “my5C-heatmap” bioinformatics tools are found at http://3dg.umassmed.edu/my5Cheatmap/heatmap.php.
All the cloning steps were done using the “In-fusion®” strategy from Clontech. Using the pGL3-Basic Vector (Promega), the 5′-flanking region of the PKD2 gene (1169 bp, “PPKD2”) was cloned upstream of the firefly luciferase cDNA at the Hind III site. The candidate enhancer elements (A to H) were amplified and inserted downstream of the PPKD2 construct.
All the inserted fragments were verified by sequencing. The PCR primers used to amplify the PKD2 promoter and candidate enhancer sequences are shown in Additional file 2: Table S2.
Cells (2.5 × 105) were seeded in 6-well plates. Transfections were done 24 h later with the transit 2020 reagent (Mirus). 800 ng of the PPKD2 construct and 200 ng of the pCMV-LacZ construct (as internal control) were used for each condition. Every condition was carried out in triplicates.
The cells were washed once with 1X PBS and lysed with Passive lysis buffer (Promega) 48 h post-transfection. Cell lysates were clarified by centrifugation at 15,000 g for 5 min at 4 °C. 20 μL of each protein extract was used to assay the luciferase activity and 25 μL for beta-galactosidase activity. We used Promega reagents and a multiwell plate reader Varioskan (Thermo Fisher). The results were presented as relative luciferase activity with the PPKD2 construct activity equal to 1. Significance determination of the increased luciferase activity was performed using unpaired t-tests using the GraphPad Prism® software.
We thank Christelle Ratajczak, clinical research associate, for cyst sampling, and the patients for their participation. We are grateful to the members of our laboratories for insightful discussions.
This work was conducted with the support of INSERM (annual grant for current equipment and consumables), the Groupement Inter-Régional de Recherche Clinique et d’Innovation (GIRCI grand-ouest) for data collection and the French Fondation Maladies Rares for consumables. JD was supported by a grant from the Canadian Institutes of Health Research [CIHR MOP-142451]. J.D. is a FRSQ Research Scholar (Junior 2).
Availability of data and materials
The Hi-C data from human A549 were downloaded from computational and functional Genomics/Epigenomics Yue Lab website. The Hi-C data shown in a heatmap format were binned at 40 kb (http://promoter.bx.psu.edu/hi-c/view.php?species=human&assembly=hg19&source=inside&tissue=A549&type=raw-rep1&resolution=40&c_url=&transfer=&chr=chr4&start=87720000&end=90000000&sessionID=&browser=none).
DHS peaks, CTCF binding sites datasets of H3K27Ac marks for RPTEC and A549 and transcription factor ChIP-seq data of 161 factors from 91 cell types were obtained and visualized using the UCSC genome browser (http://genome.ucsc.edu/cgi-bin/hgTracks?db=hg19&lastVirtModeType=default&lastVirtModeExtraState=&virtModeType=default&virtMode=0&nonVirtPosition=&position=chr4%3A88576500-89070500&hgsid=676703611_3vY26dSGrdehcr939HtB3zszv3tE) .
The “my5C-primer” and “my5C-heatmap” data are available at https://docs.google.com/spreadsheets/d/1w%2D-3mF1cbf6za6uOxWU6GnpM8XM5XN0Axp0rB24cOpM/edit?usp=sharing.
SM: Designed and supervised the study, carried out the experiments, analyzed the data, wrote the paper. SL: Carried out 5C experiments, analyzed the data. ECLG: Provided material support, edited the paper. YLM: Provided material support. MPA: Provided material support, edited the paper. JD: Provided data analysis software, edited the paper. CF: Supervised lab work, edited the paper. All authors read and approved the manuscript.
Ethics approval and consent to participate
This study with renal tissues from the Genkyst cohort was conducted with the approval of the Comité de Protection des Personnes Ouest VI (DC 2010 1125), of CHU Brest Cavale Blanche (CHU Cavale Blanche - 29,609 BREST CEDEX – FRANCE). Participants gave informed written consent and received clinical information.
Consent for publication
The authors declare that they have no competing interests.
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
- 4.Porath B, Gainullin VG, Cornec-Le Gall E, Dillinger EK, Heyer CM, Hopp K, Edwards ME, Madsen CD, Mauritz SR, Banks CJ, et al. Mutations in GANAB, encoding the glucosidase IIalpha subunit, cause autosomal-dominant polycystic kidney and liver disease. Am J Hum Genet. 2016;98(6):1193–207.CrossRefPubMedPubMedCentralGoogle Scholar
- 8.Fedeles SV, Tian X, Gallagher AR, Mitobe M, Nishio S, Lee SH, Cai Y, Geng L, Crews CM, Somlo S. A genetic interaction network of five genes for human polycystic kidney and liver diseases defines polycystin-1 as the central determinant of cyst formation. Nat Genet. 2011;43(7):639–47.CrossRefPubMedPubMedCentralGoogle Scholar
- 20.Beysen D, Raes J, Leroy BP, Lucassen A, Yates JR, Clayton-Smith J, Ilyina H, Brooks SS, Christin-Maitre S, Fellous M, et al. Deletions involving long-range conserved nongenic sequences upstream and downstream of FOXL2 as a novel disease-causing mechanism in blepharophimosis syndrome. Am J Hum Genet. 2005;77(2):205–18.CrossRefPubMedPubMedCentralGoogle Scholar
- 21.D'Haene B, Attanasio C, Beysen D, Dostie J, Lemire E, Bouchard P, Field M, Jones K, Lorenz B, Menten B, et al. Disease-causing 7.4 kb cis-regulatory deletion disrupting conserved non-coding sequences and their interaction with the FOXL2 promotor: implications for mutation screening. PLoS Genet. 2009;5(6):e1000522.CrossRefPubMedPubMedCentralGoogle Scholar
- 22.Markowitz GS, Cai Y, Li L, Wu G, Ward LC, Somlo S, D'Agati VD. Polycystin-2 expression is developmentally regulated. Am J Phys. 1999;277(1 Pt 2):F17–25.Google Scholar
- 44.Dostie J, Richmond TA, Arnaout RA, Selzer RR, Lee WL, Honan TA, Rubio ED, Krumm A, Lamb J, Nusbaum C, et al. Chromosome conformation capture carbon copy (5C): a massively parallel solution for mapping interactions between genomic elements. Genome Res. 2006;16(10):1299–309.CrossRefPubMedPubMedCentralGoogle Scholar
- 70.Vecchio-Pagan B, Blackman SM, Lee M, Atalar M, Pellicore MJ, Pace RG, Franca AL, Raraigh KS, Sharma N, Knowles MR, et al. Deep resequencing of CFTR in 762 F508del homozygotes reveals clusters of non-coding variants associated with cystic fibrosis disease traits. Human Genome Variation. 2016;3:16038.CrossRefPubMedPubMedCentralGoogle Scholar
- 71.Mallawaarachchi AC, Hort Y, Cowley MJ, McCabe MJ, Minoche A, Dinger ME, Shine J, Furlong TJ. Whole-genome sequencing overcomes pseudogene homology to diagnose autosomal dominant polycystic kidney disease. Eur J Hum Genet. 2016;24(11):1584–90.Google Scholar
- 77.Dostie J, Zhan Y, Dekker J: Chromosome conformation capture carbon copy technology. Current protocols in molecular biology / edited by Frederick M Ausubel [et al] 2007, Chapter 21:Unit 21 14.Google Scholar
Open AccessThis article is distributed under the terms of the Creative Commons Attribution 4.0 International License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated.