Use of molecular beacons to verify that the serine hydroxymethyltransferase pseudogene SHMT-ps1 is unique to the order Primates
- 6.9k Downloads
The serine hydroxymethyltransferase processed pseudogene SHMT-ps1 has been suggested to be unique to the order Primates because of the failure to amplify this sequence by PCR from genomic DNAs of any non-primate mammal species. Here, 'molecular beacon' probes specific to SHMT-ps1 were used in an attempt to verify this suggestion.
In a search for SHMT-ps1-specific sequences using molecular beacons across a range of mammalian species, SHMT-ps1 was only found in primates. The molecular beacon assays also showed that SHMT-ps1 is present in both Old World and New World species but not among prosimians.
These results suggest that SHMT-ps1 originated close to the origin of the Anthropoidea, some 40 to 50 million years ago.
KeywordsMolecular Beacon Order Primate Serine Hydroxymethyltransferase Molecular Beacon Probe World Monkey Species
Serine hydroxymethyltransferase (EC 22.214.171.124, SHMT) is a pyridoxal-phosphate-binding enzyme that catalyzes the reversible reaction of serine and tetrahydrofolate to glycine and 5,10-methylene tetrahydrofolate . SHMT-catalyzed reactions are important as a primary source of one-carbon units used in folate-dependent metabolism, including purine synthesis . Two isoforms of SHMT are found in eukaryotic cells: one expressed in the cytoplasm (SHMT-cyt) and one in mitochondria (SHMT-mit). The gene encoding the former is located on human chromosome 17p11.2 and the gene encoding the latter is on human chromosome 12q13.2 [3,4]. A third member of this small gene family is a processed pseudogene, SHMT-ps1, located on human chromosome 1p32.3-p33 [5,6]. Analyses of known SHMT amino acid and nucleotide sequences indicate that eukaryotic SHMT-cyt is descended from a prokaryotic glyA-like ancestor and that both SHMT-mit and SHMT-ps1 have been derived from SHMT-cyt [3,4,5,7].
PCR-based analyses of evolutionary conservation of the SHMT-ps1 nucleotide sequence suggest that the reverse transcription event giving rise to SHMT-ps1 occurred after the divergence of the order Primates from the rest of the Mammalia, thus making this locus a genetic marker unique to the primates . Here, this suggestion is verified using two end-point molecular beacon assays. In addition, results presented here further refine the time frame in which the reverse transcription and retrotransposition events occurred.
Results and discussion
Aligned sequences homologous to molecular beacons E8/E9 and E9/E10
(a) Beacon E8/E9 sequence
Spider monkey SHMT-ps1
Human SHMT-cyt cDNA
Rabbit SHMT-cyt cDNA
Sheep SHMT-cyt cDNA
(b) Beacon E9/E10 sequence
Spider monkey SHMT-ps1
Human SHMT-cyt cDNA
Rabbit SHMT-cyt cDNA
Sheep SHMT-cyt cDNA
The appearance of a processed pseudogene such as SHMT-ps1 is due to the reverse transcription of a processed mRNA followed by the random insertion of the derived cDNA into the genome via retrotransposition [12,13]. On the basis of the results presented here, refinement of the point at which those events giving rise to SHMT-ps1 occurred must now exclude the prosimians. Kay et al.  suggest that the divergence of the line leading to the monkeys and apes (Haplorrhini) from that leading to the prosimians (Strepsirrhini) occurred around the beginning of the Eocene some 55 million years ago. They also note that the divergence of the Old World monkeys and apes (Catarrhini) and the New World monkeys (Platyrrhini) probably took place before the end of the Eocene, or before 34 million years ago . Given that SHMT-ps1 is found in both New World and Old World monkeys, the window in which the reverse transcription event could have taken place is further bracketed within a 12 to 15 million year period between the lower and middle Eocene. One species that has not so far been examined that would shed light on this question is, of course, the tarsier. Should the tarsier not have SHMT-ps1 in its genome, the window would be further constrained to a lower bound corresponding to the origin of the Anthropoidea, which is placed somewhat nearer the middle Eocene .
Finally, of interest here is the report of another primate pseudogene, a processed immunoglobulin J chain pseudogene, JΨ, whose origin is via the same process at about the same time. Max et al.  examined several primate species, including Old World and New World monkey species, and concluded that this locus, too, arose in the Primates some 40 to 50 million years ago. We have recently been able to confirm this conclusion by a PCR study of the same genomic DNA panel used in this paper (E.J.D. and H. Trill, unpublished observations).
Is the fact that two retroprocessed pseudogenes were produced in the order Primates at nearly the same time unusual? The answer to this is probably no. Goncalves et al.  suggest that retropseudogenes are a common phenomenon in mammalian genomes and estimate that the human genome contains some 23,000 to 33,000 such loci. They also note several common features among the genes giving rise to retropseudogenes: they are widely expressed; they are highly conserved; they are usually short; and they are relatively GC poor . SHMT-cyt, the antecedent locus of SHMT-ps1, does, in fact, meet most of these criteria, as does the antecedent locus of JΨ. Thus, although SHMT-cyt is quite large [3,4] and neither gene is particularly GC poor [3,4,15], the appearance of these two retroprocessed pseudogenes at about the same time in the evolutionary history of the primates is probably not unusual. Both may, however, prove to be valuable molecular tools in primate phylogenetic research.
Materials and methods
Genomic DNA samples
Genomic DNA samples used in this study were obtained both commercially and through the generosity of a number of colleagues. DNA samples from humans (Homo sapiens) as well as from chimpanzee (Pan troglodytes), orangutan (Pongo pygmaeus), tamarin (Leontopithecus rosalia), horse (Equus equus), cow (Bovis domesticus), dog (Canis familiaris), cat (Felis domesticus), sheep (Ovis aries), pig (Sus scrofa), rabbit (Oryctolagus cunniculatus), rat (Rattus norvegicus), hamster (Mesocricetus auratus), and the northern short-tailed shrew (Blarina brevicauda) were obtained by the author. DNA samples from gibbon (Hylobates lar), macaque species (Macaca mulatta, M. assamensis, M. nemestrina, M. arctoides, and M. fuscata), baboon species (Papio hamadryas and P. anubis), the African green monkey (Chlorocebus aethiops). New World capuchins (Cebus capucinus, C. apella, and C. albifrons), squirrel monkey (Saimiri boliviensis) and spider mokey (Ateles paniscus) were provided by Moses Schanfield. DNA from the ruffed lemur (Varecia variegata) was one of a number of lemur DNAs provided by Edward Lewis (Henry Doorly Zoo, Omaha, Nebraska). DNA from the short-tailed opossum (Monodelphis domestica) was provided by John Vandeberg (Southwest Foundation for Biomedical Research, San Antonio, Texas) and DNA from the blind mole rat superspecies Spalax ehrenbergi by Eviatar Nevo (University of Haifa, Israel).
PCR amplification and molecular beacon design
To date, analyses of SHMT-ps1 in primates and other mammals have focused on a region of the pseudogene that flanks a His-Lys dimer at positions 256-257 [4,6]. This dimer is the pyridoxal-phosphate-binding site and is thus a critical functional region of the protein [17,18]. PCR amplifications of this small region of the SHMT-ps1 sequence are carried out using the primers SHMT-C1 (5'-GGCGTGGTGCCCTCCCCATTT-3') and SHMT-B6 (5'-TGCAGTTGGCCACCACCTGGT-3'). Usual amplification conditions are 94°C for 3 min followed by 30 cycles of 94°C for 30 sec; 62°C for 30 sec; 72°C for 45 sec with a final 72°C extension for 5 min. PCR amplicons are visualized directly on 1.3% low melting temperature agarose gels (1x TBE) after ethidium bromide staining. Amplicons from PCR with the C1/B6 primer pair are three different sizes in primates. The shortest, 281 bp, has been seen in humans, chimpanzees, and gorillas as a result of the 11 bp deletion in those species. The longest, 317 bp, is only seen in C. aethiops, due to the presence of the 25 bp insertion. All other primate species so far examined (18 species in all) display a 292 bp amplicon .
In previous work, evidence for the absence of the SHMT-ps1 locus in non-primate mammals was taken from a consistent failure to amplify the pseudogene sequence from genomic DNAs via PCR using a variety of primers in addition to the C1/B6 primer pair. Among these other primer pairs were sequences specific to published non-primate mammal SHMT cDNAs. Here, this assay is taken further by employing the more sensitive and specific method of molecular beacons . Molecular beacons contain a sequence-specific probe of up to 25 bases flanked by a self-complementary stem of 5-6 bases. On the 5' end of the probe is a fluorescent moiety and on the 3' end is a dark quencher. In the ground state, the molecule is folded such that the fluorophore and the quencher are in direct contact and the transfer of fluorescent energy from the former to the latter is subsequently emitted as heat rather than as light. Thus, in the ground state, molecular beacons are dark. In the presence of a target sequence, the molecular beacon undergoes a conformational change, with the internal probe sequence hybridizing to its target and the fluorophore and quencher becoming separated by a distance sufficient to prevent quenching. In this state, the fluorophore is free to emit light in its specific wavelength [19,20].
Girgis et al.  presented the genomic organization of SHMT-cyt. Their data show that the 281-317 bp SHMT-ps1 amplicon generated by the PCR primer pair C1/B6 contains parts of exons 8 and 10 as well as all of exon 9. SHMT-cyt genomic sequences also show that exons 8 and 9 are separated by a 4,500 bp intron and that exons 9 and 10 are separated by a 2,500 bp intron. To eliminate the possibility of false hybridization to SHMT-cyt genomic sequences instead of the intronless SHMT-ps1 sequence, two molecular beacons were designed to span exon-exon boundaries (Figure 2). Following the general design rules set out by Kramer and colleagues [19,20], the sequences of these two molecular beacons are SHMT-E8/E9: 5'-FAM-CCCGGCACGATCTTCTACAG/GAAAGGAGGGCCGGG-DABCYL-3' and SHMT-E9/E10: 5'-HEX-GCCGCACAACCACACCATTGCTG/GGGTTGCTGCGGC-DABCYL-3'. The self-complementary hairpin sequence in each molecular beacon is in bold and the exon-exon boundaries are indicated by the slashes. Finally, the fluorophores FAM (5-carboxyfluorescein) and HEX (5-hexachlorofluorescein) are paired with the universal fluorescence quencher DABCYL (4-(4'-dimethylaminophenylazo)benzoic acid).
As the analyses carried out in this study were end-point assays having a simple yes/no answer, the PCR amplifications were all performed in the presence of 100 ng genomic DNA. In addition to the target DNAs, 50 pmol of the PCR primers C1 and B6, 10 mM each dNTP, 2.5 U Taq polymerase, and standard Taq PCR buffer (Roche) were used. Amplification conditions were 94°C for 3 min followed by 40 cycles of 94°C for 30 sec, 62°C for 30 sec, 72°C for 45 sec and a final extension step for 7 min at 72°C. Following amplification, 250 pmol of molecular beacon probe was added to the reaction, the mixture was heated to 94°C for 3 min, and then held at 60°C before visualization of the reaction tubes on a UV transilluminator. Each set of reactions was photographed on the transilluminator in the same manner as ethidium bromide-stained agarose gels (Figure 1).
I thank Moses Schanfield, Edward Lewis, John Vandeberg and Eviatar Nevo for their generosity in providing samples of mammalian DNAs.