A human specific Alu DNA cassette is found flanking the genes of transcription factor AP2
Alu elements are retroposons that invaded the primate genome and shaped its biology. Some Alus inserted recently and are polymorphic in the human population. It is these Alus that are being sought after in disease association studies and regulatory biology. Discovering polymorphic Alus in the human genome can open areas of new research in these fields.
Using the polymerase chain reaction on genomic DNA, we identified a polymorphic Alu in the flanking region of the TFAP2B and TFAP2D genes. The new insert was found in higher frequency in Europeans (0.4) and Asians (0.38) and lower frequency in Africans (0.25). We also show this Alu to be part of a 3 Alu cassette that is human specific. The TFAP2B and TFAP2D genes encode members of the transcription factor AP-2, which plays a role in organ development. The insertion of this Alu cassette flanking the transcription factor genes distinguishes humans from the primates. This cassette can possibly affect the regulation of both genes or alternately provoke genomic deletions, which we have shown in this study. Its presence in such a location is intriguing and unquestionably opens an investigational window in disease association studies and in the field of gene regulation.
KeywordsAlu Polymorphic TFAP2B TFAP2D AP2 Dental Neural Breast cancer
- ACE ID
angiotensin converting enzyme insertion-deletion polymorphism
transcription factor AP2 beta gene
transcription factor AP2 delta gene
American Type Culture Collection
The Transcription Factor AP2 Beta and Delta (TFAP2B and TFAP2D) genes are located on chromosome 6 and encode members of the AP-2 protein family [1, 2]. AP-2 is a transcription factor that binds directly to DNA either as a homo or hetero dimer regulating gene expression . They display differential binding affinities for different gene promoters and function in both inducing and repressing transcription [4, 5, 6]. The AP-2 protein has been shown to be expressed in various tissues including adrenal, esophagus, kidney, placenta, salivary gland, skin and testis . Mutations in these genes have also been discovered in patients with Char syndrome (TFAP2B) [8, 9]. More recently it was shown that mutations in TFAP2B are linked with isolated tooth agenesis, microdontia, supernumerary tooth and tooth root maldevelopment . It has also been shown that the TFAP2B gene product is tumor promoting in breast cancer and appears to be highly expressed in breast cancer cell lines and in tumor tissues of breast cancer patients . Lobular carcinoma in situ and invasive lobular breast cancer also demonstrated enhanced expression of transcription factor AP-2 . In this paper, we present a polymorphic Alu (Arthrobacter luteus restriction enzyme characterized) element insert found flanking both the TFAP2B and TFAP2D genes in humans.
Alu inserts are DNA elements that have invaded the primate genome by retroposition and are now permanent residents in it with over a million interspersed in the human genome [13, 14]. They have contributed to disease by inserting in coding regions, upsetting gene structure [15, 16]. Since coding sequences are less abundant, Alus litter the landscape of non-coding DNA. This contributes to their recruitment for regulatory functions [17, 18] as in the case of the angiotensin converting enzyme (ACE) Alu . Depending on when these Alus have inserted, they can be traced in lineage by their shared descent in the primates at various loci . Alus that are shared between the primate species are used in the phylogenetic studies of primates . Some Alus are of a more recent origin and can be found to be polymorphic in the human population [14, 21, 22]. It is these Alus that are sought after in disease association studies. Thus, the aim of this study is to identify polymorphic Alu elements in the human population, which can be subsequently used in disease association studies. In this paper, we identified a polymorphic Alu element (HK1) flanking the TFAP gene family.
Genomic DNA used in this study was obtained from human primary cell cultures, which have been previously described . The primary cultures of human endothelial cells came from a variety of tissues, which were purchased from the American Type Culture Collection (ATCC). The cells came from the following tissues of origin: umbilical vein; pulmonary vein; iliac vein; abdominal aorta; femoral artery; iliac artery; and pulmonary artery.
Polymerase chain reaction (PCR)
Total genomic DNA (0.3 µg) was amplified by polymerase chain reaction (PCR) in a 30 µl volume, which included: 20 mM Tris–HCl (pH 8.3), 50 mM KCl, 2.5 mM MgCl2, 250 µM dNTP, 0.2 µM each of forward (5′-GCAGTCCAAACTCGGAAGACTGGG-3′) or reverse (5′-GGACAGAAAGAG GCCTGGACTAC-3′) and 5% dimethylsulfoxide (DMSO). Amplification of DNA was performed at 96 °C for 2 min, followed by 35 cycles of denaturation (96 °C for 30 s), annealing (62 °C for 30 s), and elongation (68 °C for 30 s). The last cycle was followed by a final elongation step (65 °C for 5 min). The reactions were stopped by the addition of 3 µl of a 0.25 M EDTA solution at 0 °C, and aliquots were electrophoresed on a 2% (w/v) agarose gel with 0.5 µg/ml ethidium bromide. The resulting bands were scored for the presence of the Alu and sequenced to confirm their identity. The native (empty site) allele was identified by the presence of a single 271 bp PCR product. The insertion allele (Alu) was identified by the presence of a single 582 bp PCR product.
Frequency analysis of the HK1 polymorphic Alu in Genbank’s genomic assemblies
To determine the Allele frequencies for this locus, we screened Genbank’s human genomic database. The Basic local alignment search tool (BLAST) was used to screen the HK1 locus in both complete and partial genomic assemblies . Aligned sequences were included in the data set and filtered to remove redundant alignments from assemblies derived from the same BioSample. Using the BLAST tool as a virtual polymerase chain reaction (vPCR) we screened Genbank assemblies for the HK1 locus.
Alu invasions occurred en masse in the distant past. More recently, however, there is a steady but perhaps slower spread in the collective primate genome . Alus have been studied ever since the advent of genomics, because in fact they make up a significant chunk of the genome. In the beginning, unfortunately, they were labeled as junk DNA to distinguish them from the genic functional parts of the DNA and this has driven their use simply as time markers for studying primate evolution [14, 20]. Over the past decade, however, Alu elements have been recognized as genomic engineers in their own right and their study shifted to the realm of biological function [17, 18, 24, 25]. It has also become obvious that evolutionary change is not limited to the 2% difference in sequence homology between the higher primates but is in fact driven by micro and macro engineering of genomic structure . To distinguish the rapidity of human change from the rest of the primates, it is pertinent to identify other mechanisms of evolutionary change besides that of the single nucleotide polymorphism (SNP) . Here we show a 3 Alu cassette present in the flanking region of TFAP2B and TFAP2D genes. These Alus are all human specific with one being polymorphic in the human population and in opposite orientation to the others. They display high homology to one another having the potential to change local genomic configuration by the formation of DNA secondary structure. The potential for forming this genomic configuration, however, is not available to all humans, since the polymorphic HK1 Alu is a requisite for this formation. Despite the fact that humans carry the HK2 and HK3 Alus, the genomic configuration is not formed unless the cassette is completed with the HK1 Alu, which only some humans carry. In some ways, we can view this genetic system at 3 hierarchical levels. The first level is represented by the primates, which lack all together this cassette and have no potential in accessing this particular genomic configuration. The second level is represented by those humans that carry the HK2 and HK3 Alus, and we can consider them evolutionarily “primed” for this genomic configuration. Finally, humans that carry the full 3 Alu cassette can be considered evolutionarily in an “active state” fully accessing this genomic configuration with all of its positive and negative outcomes. It is in this category that we have discovered genomic deletion events. This discovery sheds light on the continuing dynamic changes occurring in the human genome. It has been shown previously that Alu elements can influence gene expression by either binding transcription factors, forming chromatin looping or secondary structures and even behaving as enhancer elements inducing or repressing transcription [18, 27]. Since they are widely dispersed in the genome, they have the ability to exert powerful regulation on a local as well as a global level. Perhaps the saltatory argument of evolution comes within reach by considering the changes that might occur in the regulation of genes that code for transcription factors. Since transcription factors have a global effect on gene regulation by impacting the expression of hundreds of genes, a small genomic change in their regulation can have an amplifying effect on biology. The insertion of this Alu cassette flanking the transcription factor genes produces a change in the human genome distinguishing humans from primates. If this genomic change leads to a specific human change in the regulation of the AP2 transcription factor (regulating the regulator), this can have an amplifying effect on many genes. The flip side of the argument would be the deleterious effect of Alu elements. Perhaps the presence of the Alu cassette we described might lead to genomic rearrangements/deletions as a result of Alu recombination events resulting in disease outcomes. In this paper, we have seen such deletions; their impact on human biology, however, remains to be elucidated. Regardless of the implied biological role for this Alu cassette, this discovery opens research opportunities in various fields including gene regulation as well as disease association studies.
By identifying strategically located polymorphic Alu sequences in the human genome, we hope to open a doorway to better understand the etiology of human disease and perhaps the driving force of human evolution. After all, one of the most studied genetic polymorphisms in the history of genetic association studies is the ACE ID polymorphism, which is a polymorphic Alu element present in the angiotensin converting enzyme gene . This Alu polymorphism alone has led to over 3700 publications (as of February 2019) and has been shown to associate with varied diseases in the human population . Here we present a newly inserted Alu element flanking the human TFAP2B and TFAP2D genes and like the ACE Alu is also polymorphic in the human population. More interestingly, however, the HK1 Alu is part of a 3 Alu human specific cassette with the potential of reconfiguring chromatin structure.
Although we have found a polymorphic Alu element flanking the genes TFAP2B and 2D, the allelic frequencies were based on a small set of alleles and should be treated with caution. A larger population study of this polymorphic Alu should be conducted across a wide section of the human population to better approximate the allelic frequencies.
HKH helped to conceive and design the study, design the primers, interpret data, and write the manuscript. ND, RE, ZK, TM, LA, BH, MH, RR, and SD participated in mapping the polymorphic element, cell culture, genomic DNA extraction, performing PCR and sequencing analyses. All authors read and approved the final manuscript.
We thank Dr. Ali Al-Saif and Dr. Khalid Al-Johani for their support and encouragement in this project. We would also like to thank Almustqbal University- College of Dentistry for supporting this project and providing us with laboratory space and facilities. HKH would also like to extend his eternal gratitude to the late professor Dr. Achilles Dugaiczyk.
The authors declare that they have no competing interests.
Availability of data
The data presented here was deposited with GenBank under the accession numbers: MK574933 and MK574934.
Consent for publication
Ethics approval and consent to participate
Research was partially funded by HKH. The College of Dentistry at Almustqbal University provided the laboratory space and facilities. No external funding was received for this research.
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
- 25.Buckley RM, Kortschak RD, Adelson DL. Divergent genome evolution caused by regional variation in DNA gain and loss between human and mouse. PLoS Comput Biol. 2018;14:1–36.Google Scholar
- 26.Polak P, Domany E. Alu elements contain many binding sites for transcription factors and may play a role in regulation of developmental processes. BMC Genomics. 2006;15:1–15.Google Scholar
Open AccessThis article is distributed under the terms of the Creative Commons Attribution 4.0 International License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated.