Evolutionary history of the poly(ADP-ribose) polymerase gene family in eukaryotes
- 6.6k Downloads
The Poly(ADP-ribose)polymerase (PARP) superfamily was originally identified as enzymes that catalyze the attachment of ADP-ribose subunits to target proteins using NAD+ as a substrate. The family is characterized by the catalytic site, termed the PARP signature. While these proteins can be found in a range of eukaryotes, they have been best studied in mammals. In these organisms, PARPs have key functions in DNA repair, genome integrity and epigenetic regulation. More recently it has been found that proteins within the PARP superfamily have altered catalytic sites, and have mono(ADP-ribose) transferase (mART) activity or are enzymatically inactive. These findings suggest that the PARP signature has a broader range of functions that initially predicted. In this study, we investigate the evolutionary history of PARP genes across the eukaryotes.
We identified in silico 236 PARP proteins from 77 species across five of the six eukaryotic supergroups. We performed extensive phylogenetic analyses of the identified PARPs. They are found in all eukaryotic supergroups for which sequence is available, but some individual lineages within supergroups have independently lost these genes. The PARP superfamily can be subdivided into six clades. Two of these clades were likely found in the last common eukaryotic ancestor. In addition, we have identified PARPs in organisms in which they have not previously been described.
Three main conclusions can be drawn from our study. First, the broad distribution and pattern of representation of PARP genes indicates that the ancestor of all extant eukaryotes encoded proteins of this type. Second, the ancestral PARP proteins had different functions and activities. One of these proteins was similar to human PARP1 and likely functioned in DNA damage response. The second of the ancestral PARPs had already evolved differences in its catalytic domain that suggest that these proteins may not have possessed poly(ADP-ribosyl)ation activity. Third, the diversity of the PARP superfamily is larger than previously documented, suggesting as more eukaryotic genomes become available, this gene family will grow in both number and type.
KeywordsCatalytic Triad Dictyostelium Discoideum PARP Protein Major Vault Protein BRCT Domain
Poly(ADP-ribosyl)ation activity was originally identified in the 1960s [1, 2, 3, 4, 5]; it is the rapid and reversible posttranslational covalent attachment of ADP-ribose subunits onto glutamate, aspartate, and lysine residues of target proteins. The ADP-ribose polymer is formed by sequential attachment of ADP-ribosyl moieties from NAD+; the polymers can reach a length of over 200 units and can have multiple branching points. Overall, the ADP-ribose polymer is highly negatively charged and has large physiological consequences on functional and biochemical properties of the proteins modified.
Poly(ADP-ribosyl)ation is done by enzymes called poly(ADP-ribose)polymerases (PARPs). The so-called PARP signature, a catalytic ß-alpha-loop-B-alpha NAD+ fold [6, 7], characterizes these enzymes. PARPs are found in diverse groups of eukaryotes [8, 9], but are best studied in animals. PARPs have been shown to be involved in DNA damage repair, cell death pathways, transcription and chromatin modification/remodelling (reviewed in [10, 11, 12, 13]). PARPs have been implicated in a wide range of human diseases (reviewed in ) and are important targets for anti-cancer therapies . A polymorphism in human PARP1, which causes decreased enzymatic activity, has been reported to be associated with an increased cancer risk and a decreased risk of asthma [16, 17], further underlining the importance of this class of enzymes and their complex roles in disease.
The first PARP purified and cloned, PARP1 from human, remains the best studied. PARP1 was long thought to be the only enzyme with poly(ADP-ribosyl)ation activity until two PARP isoforms were identified in plants  and, simultaneously, tankyrase was identified as a PARP localized at the telomere in humans . Subsequently, studies on PARP1 knock out mice demonstrated that the mutant mice still possessed poly(ADP-ribosyl)ation capacity and developed normally [20, 21], suggesting other enzymes existed. Since these studies, a number of genes containing the PARP signature have been identified, although a minority of them have been functionally characterized.
The PARP-like family has been best characterized in humans, where there are seventeen family members that share the PARP catalytic domain, but vary widely in other parts of the proteins [8, 9]. It is postulated that different PARPs subfamilies participate in diverse events mediated by their variable domain structures. However, only some of the family members have been shown to have PARP activity, mostly in humans (PARP1  and its orthologs from other species (for example, [23, 24]), PARP2 [25, 26], tankyrase1 [19, 27], tankyrase2 [28, 29], and vPARP ). Most of these enzymes contain an evolutionarily conserved catalytic glutamate residue in an "HYE" catalytic triad. This residue was shown to be essential for poly(ADP-ribose) chain elongation in human PARP1 . It is clear that some proteins with PARP signatures missing the catalytic glutamate residue or other residues known to be important for chain elongation do not act in poly(ADP-ribosyl)ation. For example, human PARP10 has transferase activity rather than polymerase activity, adding one ADP-ribose subunit to target proteins . It is thought that other PARP-like proteins may actually function in mono(ADP-ribosyl)ation [32, 33, 34] or even have non-enzymatic functions; human PARP9 appears to not have enzymatic activity . Even enzymes that retain the catalytically important residues that have been identified may not act as PARPs. For example, conflicting reports about the catalytic activity of human PARP3 exist; it has been reported act in poly(ADP-ribosyl)ation  and mono(ADP-ribosyl)ation .
Our knowledge of the PARP gene family is principally based on animals, in particular mammals. This taxon is a member of the Opisthokonts, one of the six eukaryotic "supergroups" [38, 39] and therefore represents only a portion of the evolutionary history and diversity of known eukaryotes. For the other five eukaryotic supergroups, studies on PARPs have been limited or non-existent. A previous study on PARPs indentified new members in more basal animals, amoebas, fungi and plants . However, no representatives from Excavates or Chromalveolates were included in the analysis and only one member of Plantae (Arabidopsis thaliana). Here we use comparative genomics and phylogenetic analysis to investigate the distribution of PARP genes across almost the entire breadth of eukaryotes, to reconstruct the evolutionary history of this protein family and to gain insights into its functional diversification. Our results indicate that the last common ancestor of extant eukaryotes encoded at least two PARP proteins, one similar to human PARP1 and functioning in DNA repair and damage response, the other likely acting in mono(ADP-ribosyl)ation; the cellular role of the last group is not known.
Identification of PARP genes from eukaryotic genomes
We used the information obtained from the Pfam database [41, 42, 43] and Uniprot [44, 45] along with BLAST searches  of sequenced eukaryotic genomes at the DOE Joint Genome Institute (JGI), the Broad Institute, the J. Craig Venter Institute, ToxoDB , NCBI, dictyBase  and the Arabidopsis Information Resource (TAIR)  to compile the sequences of over 300 PARP proteins. After preliminary alignment and phylogenetic analysis, we reduced the number of species representing animals; specifically we choose representative species of vertebrates since the genes from this group are shared by all and kept Drosophila melanogaster or Anopheles gambiae to represent insects, since all of our sequences were from Diptera. This left us with 236 sequences from 77 eukaryotic species (Additional file 1). In addition, another 46 sequences contained regions with high similarity to the PARP catalytic domain (Additional file 2); however, these sequences were incomplete and not included in the alignment. Nonetheless, these sequences likely represent bona fide members of the PARP catalytic domain. The PARP catalytic domain was extracted from the proteins sequences and aligned using MUSCLE . This alignment can be found in Additional file 3.
Phylogenetic analysis of the PARP family suggests that the ancestral eukaryote had at least two PARP enzymes
The PARP lineages (which will be detailed below) include one clade, Clade 1, which contains representatives from five of the six so-called eukaryotic supergroups: Plantae, Opisthokonts, Chromalveolates, Excavates, and Amoebozoa (Figures 1, 2 and 3; [38, 39]). There is no completely sequenced species available from the sixth supergroup, Rhizaria. This broad distribution suggests that the last common ancestor of all extant eukaryotes encoded a gene similar to those of Clade 1. Clade 6 is only found in three of the eukaryotic supergroups; however, the position of this clade as sister group to all other members of the PARP superfamily and the placement of these groups within eukaryotes supports the hypothesis that the last common eukaryote also encoded such a gene (Figure 2).
Clade 1: the PARP1 clade
Clade 1A is found in Amoebozoa (Dictyostelium), Opisthokonta (fungi) and Chromalveolates (the ciliate Paramecium tetraurelia) and is the sister group to most of the other Clade 1 subclades (with the exception of Clade 1I; Figure 3). This subclade is unique within Clade 1 in containing proteins with ankyrin repeats, in addition to WGR, PRD and PARP catalytic domains. Clade 1B contains members from both the Opisthokonta (animals and Choanoflagellata) and the Excavata (the Heterolobosea member Naegleria). This subclade is typified by human PARP1, the founding member of the superfamily. This protein has three N terminal zinc fingers that contribute to DNA binding, a BRCT domain and a PADR1 domain in addition to WGR, PRD, and the catalytic domain (Figure 4; [22, 57, 58]).
Both Clade 1C and 1D both contain proteins that have in common WGR, PRD and PARP catalytic domains and mostly do not contain other functional domains. Clade 1C is confined to several Oomyocete Phytophtora species (within the Excavata) and one basal animal. Clade 1D contains members from Opisthokonta (the animals Xenopus laevis (Q566G1) and Schistosoma japonicum (Q5DAZ0) and the fungus Batrachochytrium dendrobatidis) and Plantae (land plants) as well as ciliate members of the Chromalveolates. Some of the land plant members of Clade 1D have acquired SAP domains DNA binding domains  N terminal to the other domains (Figure 4). In addition, the land plant members of this group have altered their catalytic triad, alone among Clade 1 members (Additional files 5 and 6). All the plant proteins have a cysteine in place of the histidine while all except for the moss protein have a valine instead of the tyrosine in the second position. However, the plant Clade 1D proteins have retained the glutamic acid in the third position. It is unclear what effect these changes might have on the catalytic activity of these proteins.
Clade 1E contains most of the fungal members of Clade 1 and is characterized by proteins with BRCT domains N terminal to WGR, PRD and PARP catalytic domains. Clade 1F is specific to the Excavata. The Toxoplasma gondii representative (TGME49_070840) has a similar domain structure to human PARP1, found in Clade 1B. Clade 1G is confined to the Opisthokonta (both animals and the Choanoflagellate Monosiga brevicollis), contains proteins with only WGR, PRD and PARP catalytic domains and includes human PARP2.
All five eukaryotic supergroups that contain sequenced species are represented in Clade 1H (Figures 1 and 3). This clade includes human PARP3. Interestingly, land plants have duplicated one of their Clade 1H genes; one duplicate lineage appears to be changing rapidly, based on the long-branch length in the phylogenetic tree (Figure 3). These proteins may have acquired a novel function or the original function may have been split between the two copies in these species (neofunctionalization or subfunctionalization), as these processes are hypothesized to increase the probability of retention of duplicate genes .
The final subclade in Clade 1, Clade 1I, consists of two Caenorhabditis elegans (C. elegans) proteins, PME1 and PME2, which have been characterized previously . PME1 contains zinc fingers and PADR1, WGR, PRD and PARP domains, while PME2 only has WGR, PRD and PARP domains. As will be discussed further below, many of the nematode proteins are anomalous.
Clade 2: the RCD1 clade
Clade 2 of PARP-like genes consists of proteins identified only in land plants, with representatives found from bryophytes to angiosperms (Figures 2 and 5), a finding that has also been made by another group . However, there is no genomic information available for any member of the streptophyte algae, the sister group to land plants within Plantae, leaving open the possibility that members of this clade may be found in these organisms (Figure 2). All groups of land plants also contain members of Clade 1 PARPs, while the moss Physcomitrella patens contains Clade 6 proteins in addition (Figure 2).
One interesting observation we made concerning Clade 2 was the large number of independent gene duplications that have occurred within this gene lineage (Figure 5). While this is likely due to the propensity of plant genomes to undergo whole genome duplications (reviewed in ), the retention of many of the gene pairs suggests that Clade 2 proteins are undergoing neofunctionalization and/or subfunctionalization at a high rate [60, 68]. This supposition is supported for a pair of Clade 2A paralogs in Arabidopsis thaliana, RCD1 and SIMILAR TO RCD ONE 1 (SRO1), which have been shown to be only partially redundant despite a relatively recent evolutionary origin [65, 69].
Given the heterogeneous composition of Clade 3, it is difficult to divide into subclades; however, we classified the proteins into six subclades as outlined below, partially for the purpose of discussion, and partially based on common domain structures and features of the catalytic domains (Figures 6 and 7 and Additional file 8). Clade 3A is composed of two proteins, including human PARP10, containing an RRM RNA binding domain , a glycine-rich region (GRD), and a UIM domain, known to bind monoubiquitin and polyubiquitin chains . The proteins found in Clade 3B and 3C contain at least one Macro domain N terminal to their C terminal catalytic domain (Figure 4). Macro domains have been shown to bind to poly(ADP-ribose) (PAR) . Clade 3B includes representatives from the most basal animal in our study Trichoplax adhaerens, while 3C includes two human proteins, PARP14 and PARP15. PARP10, PARP14 and PARP15 have been demonstrated to have mART activity .
Clade 3D consists of the two Dictyostelium discoideum and four Tetrahymena thermophila proteins. Unlike the majority of animal proteins in Clade 3, only one of these proteins have a proline located one amino acid away from the third residue of the catalytic triad (Figure 7). The four proteins from the ciliate Tetrahymena thermophilia have no known functional domains outside of their C terminal PARP catalytic domains and are only similar to one another in this region (data not shown), again supporting the idea that these proteins are not closely evolutionarily related to the other proteins in Clade 3. One of the Tetrahymena proteins has retained the glutamic acid of the "HYE" (Figure 7), again supporting this interpretation. All four proteins also share a H/NNSK motif just past the last amino acid of the putative catalytic triad not found in other members of Clade 3 (Figure 7). The Dictyostelium proteins in 3D do not show high similarity outside of the PARP domain. DDB0304590 is a relatively short protein with only the PARP catalytic domain and a short C terminal extension. DDB0232928 has a Macro domain and, at its very N terminus, a U-box (Figure 4). The U-box is a modified RING finger  found in E3 ubiquitin ligases known to bind ubiquitin E2 enzymes . As Amoebozoa is the sister group to Opisthokonts within eukaryotes and given that DDB0232928 contains a Macro domain as do some other members of Clade 3, it is possible that these proteins are orthologous to at least some of the animal Clade 3 proteins.
Clade 3E is confined to animals, but is not represented in Placozoa (Figure 6). Members of this subclade contain one to two WWE domains, alone or in combination with zinc fingers (either CCCH or CCCH types) in front of their PARP catalytic domains (Figure 4). All members of 3E have replaced the glutamic acid characteristic of PARPs with an isoleucine except for two (human ZCC2/PARP13 and Nematostella vectensis A7RWC0) that contain valines at that site (Figure 7). This subclade also contains human PARP12 and human PARPT/PARP7.
Clade 3F, which is sister group to all other Clade 3 subclades, contains human PARP9 and orthologs from vertebrates. These proteins contain two Macro domains N terminal to their PARP catalytic domains (Figure 4) and have a more divergent catalytic triad than the rest of Clade 3, having Q-Y/S-T/S instead of HYE (Figure 7 and Additional file 8). Human PARP9 has been shown to be inactive , suggesting that no Clade 3F proteins act as enzymes. PARP9 was originally identified as a gene conferring risk for diffuse large B-cell lymphoma and named BAL1 (B-aggressive lymphoma 1) . Interestingly, two proteins identified by their similarity to BAL1, PARP14/BAL2 and PARP15/BAL3, although their domain structures resemble that of PARP9/BAL1, group in subclade 3C (Figures 6 and 7), and act as mARTs .
Clade 4: the tankyrase clade
Our analysis indicates true tankyrases are confined to animals, and in fact do not appear to be found outside of the bilateria (Figures 2 and 8). A duplication event that generated two tankyrase-encoding genes appears to have occurred within the vertebrates, sometime after the separation of the amphibians. The absence of tankyrase orthologs outside of the animals contradicts the report of such proteins in protozoa such as Dictyostelium discoideum and Tetrahymena thermophila . However, these protozoan proteins differ from the canonical tankyrases in structure; although they have ankyrin repeats in their N terminal region, these are followed by WGR and PRD domains rather than a SAM motif (Figure 4). Consistent with the presence of the WGR and PRD domains and the low similarity between their PARP catalytic domain and that of tankyrases, these proteins fall into Clade 1A (Figure 3). This suggests that PARP proteins independently acquired ankyrin repeats at least twice.
Clade 5: The vPARP clade
vPARP is associated with vaults, very large cytoplasmic ribonucleoprotein particles first described in the 1980s whose function is unclear . Vaults have a patchy taxonomic distribution within eukaryotes. Our analysis suggests that the phylogenetic distribution of vPARP is also limited (Figures 2 and 9); members of Clade 5A with the vPARP domain structure are found only in animals that have been shown to contain vaults, while Clade 5B proteins are found in Dictyostelium, which also contains vaults . However, although vaults have been identified in trypanosomes , no evidence of proteins sharing the domain structure of vPARP can be found in this group of organisms, although such proteins may be present in species with currently unsequenced genomes.
mART activity may be ancient
Clade 6 proteins are found in Opisthokonts (animals and fungi), Excavates (Parabasalids and Heterolobosa), and Plantae (chlorophyta and bryophytes) (Figures 1, 2 and 10 and Additional file 11). Based on its position as sister group to all other clades of PARPs (Figure 1) and the distribution of species containing Clade 6 PARPs within the eukaryotes (Figure 2), it is likely that the last common eukaryotic ancestor had at least one Clade 6-like protein encoded in its genome. This clade is characterized by N termini with no known functional domains and C terminal extensions beyond the PARP catalytic domain of varying lengths. Almost all of these proteins contain a PfamB_2311 domain immediately before their PARP catalytic domain (Figure 4), although the function or significance of this domain is unknown, supporting the placement of these proteins in a single clade. Another characteristic of Clade 6 members is changes within the PARP catalytic domain. None of the Clade 6 proteins we identified contain the final glutamic acid of the HYE catalytic triad, although they mostly retain the histidine and tyrosine (Additional file 11). This might lead to an inability to catalyze poly(ADP-ribosyl)ation. In fact, the human proteins in this clade (PARP6, 8, and 16) have been predicted to have mono(ADP-ribosyl)ation activity based on structural models , although this awaits experimental confirmation. None of the Clade 6 PARPs have been functionally characterized.
No other known functional domains can be identified in Clade 6A proteins; however, most of these proteins do share another PfamB domain, 30617, at their very N termini . This domain is confined to fungal species and appears to only occur in Clade 6A family members with the exception of a protein from the fungus Uncinocarpus reesii (EEP82442.1) that consists only of this domain (Additional file 14). Pfam-B_30617 averages 360 amino acids in length and has some secondary structure similarity to the RWD domain when modelled using the Protein Homology/Analogy Recognition Engine (Phyre; ), and is predicted to form an alpha helix/beta strand/alpha helix/beta strand/alpha helix structure (Additional file 13C). The RWA domain has some structural similarity to the UBCc domain , further providing a link between the Clade 6A proteins and Ub. The RWA domain is thought to mediate non-catalytic protein-protein interactions. We propose renaming the Pfam-B_30617 domain FPE, for Fungal PARP E2-associated.
Clade 6B proteins are found in a subset of green algae (Figure 10). These proteins have no other domains of known function but do contain PfamB_2311 domains as well as the PARP catalytic domain. Green algae have not previously been shown to have any PARP-like proteins encoded in their genomes. Clade 6C proteins are animal specific and are found in species from across this group, including human (PARP16; Figure 10). Again, other than a PfamB_2311 domain and a PARP catalytic domain, no other obvious protein motifs are present. Clade 6D is confined to Deuterostomes with the exception of the mollusc Lottia gigantea. These proteins consist of no identifiable domains other than a PfamB_2311 domain and the PARP catalytic domain (Figure 4). Human PARP6 and PARP8 are found within this group of proteins.
Clade 6E consist of seven proteins encoded by Trichomonas vaginalis, the only member of the Parabasalids (Excavata) with a fully sequenced genome and one fungal protein (Nectria haematocca 83215). Trichomonas is the causative agent of the sexually transmitted disease trichomoniasis in humans; without other completed genomes available for the parabasalids, it is impossible to determine if members of Clade 6E are found elsewhere in this group. Besides the PARP catalytic domain, the only other identified domain in these proteins is a PfamB_2311 domain. The Nectria haematocca protein does not have a PfamB_2311 domain or any known functional domain.
Phylogenetic analysis suggest multiple independent losses of PARP genes across the eukaryotes
Eukaryotic organisms with no identifiable PARP genes in their nuclear genomes.
Citation or sequencing group
Evolutionary history of the PARP family
Members of Clade 1 have been characterized in a range of organisms, encompassing three of the six eukaryotic supergroups. While a wide range of functions has been described for these PARPs, most characterized members of Clade 1 have been implicated in or demonstrated to have roles in DNA damage response and repair. In Plantae, two of the Arabidopsis thaliana Clade 1 members, AtPARP1 and AtPARP2, have been shown to be induced by DNA damage and be involved in the response to it [92, 93]. In the Opisthokonts, several animal Clade 1 members have been investigated and shown to be involved in DNA repair. This is a well-known function for the human Clade 1 members, PARP1, PARP2, and PARP3 [26, 94, 95]. In addition, a fungal protein, PrpA from Aspergillus nidulans, has been shown to act early in the DNA damage response , while loss of its ortholog from Neurospora crassa, NPO, causes sensitivity to DNA damage and acceleration of replicative aging . Within the Excavates, a Trypanosoma cruzi Clade 1 member, TcPARP, has been shown to be induced in response by DNA damage, be enzymatically activated by nicked DNA and to require DNA for catalytic activity . Clade 1 members in the Chromalveolates and the Amoebozoa have not been functionally characterized, but are also likely to function in DNA damage response. Dictyostelium discoideum in the Amoebozoa has at least four Clade 1 proteins encoded in its genome (Figure 3). Drug studies have implicated PARP activity in oxidative stress response and DNA damage in this organism , but no direct evidence of which PARP or PARPs is involved has been published. The ubiquitous distribution of Clade 1 members and the consistent association of the proteins with DNA damage response suggests that this gene lineage is ancient and that the original function of this family was in DNA repair and genome integrity.
While Clade 6 is found in only three of the five eukaryotic supergroups with available genome information (Opisthokonta, Excavata, and Plantae), the phylogenetic relationship of these groups within eukaryotes suggests that a Clade 6-like protein was found in the LCEA (Figures 2 and 12A). Subsequently, during the eukaryotic radiation, Amoebozoa (or at least Dictyostelium discoideum) and Chromalveolates lost Clade 6 PARPs. The ancestral Clade 6 protein was likely to consist of a PfamB_2311 domain N terminal to the PARP catalytic domain (Figure 12B). Members of Clade 6 were more difficult to identify than other PARPs; it was necessary to do supplemental BLAST searches with the human PARP6 catalytic domain to find most of these proteins (see Methods). This is consistent with the positioning of Clade 6 as sister group to the rest of the PARP superfamily. The fact that Clade 6 PARPs represent an ancient lineage further suggests that changes in the PARP catalytic domain likely to eliminate or change enzymatic activity evolved early in this protein family or, alternatively, PARP activity evolved from mART activity. It is difficult to speculate on the possible function of the Clade 6 ancestral protein, as none of the extant Clade 6 members have been functionally characterized.
One group of PARPs defined in our study has an unusual distribution. Clade 3 is found in animals (Opisthokonta), Dictylostelium discoideum (Amoebozoa) and the ciliate Tetrahymena thermophila (Chromalveolates), but no other species in our analysis, including the ciliate Paramecium tetraurelia. Our phylogenetic tree is based on the PARP catalytic domain. Clade 3 proteins have evolved to become either mARTs or non-enzymatic (Figure 7; ). We propose that the grouping of the Tetrahymena proteins in Clade 3 is an artefact caused by this group of proteins independently beginning to evolve similar changes in the PARP catalytic domain. Clades 3 and 6 independently acquired somewhat similar changes, supporting the idea that changes within the PARP catalytic domain may be constrained in order to preserve overall structure. The hypothesis that the Tetrahymena proteins are not closely related to the other Clade 3 proteins is supported by the fact that one of them (Q22F17) retains the glutamic acid of the PARP catalytic triad, while another (Q24C77) has a conservative substitution of a glutamine at that position and that they do not share any domains outside of the catalytic domain with other members of Clade 3. When more sequences within the ciliates become available, it may become possible to determine if this hypothesis is correct. The Dictyostelium proteins found in Clade 3 may be orthologous to the animal proteins, since one of them has a Macro domain, a domain found in other members of this clade (Figure 4).
In extant eukaryotes, the animal lineage within Opisthokonta appears to have the most diverse collection of PARPs. Most animal genomes encode representatives of at least two clades of PARPs. In addition, a PARP clade has been acquired in this lineage, Clade 4 (Figure 12A). Vertebrates contain the highest number and type of PARPs of any group examined within the eukaryotes, containing members of Clades 1, 3, 4, 5 and 6; additionally they often encode more than one representative of each clade. However, within animals the nematodes are unusual. C. elegans, within the order Rhabditida, only encodes two Clade 1I proteins, PME1 and PME2 (Figure 3), and a protein (PME5) that did not clearly fall into any clade (Additional file 4). Within Clade 1, the nematode 1I PARPs do not group with other animal PARPs but rather are found as the sister group to all of the Clade 1 proteins. PME5 somewhat resembles tankyrases in domain structure but does not group with them. However, the branches leading to the C. elegans proteins are long. The length of these branches likely results in long-branch effects, causing misplacement of these proteins within the tree. Such long-branch effects can be caused by the independent acquisition of identical character states , phylogenetic signal erosion ("long branch repulsion") , or by symplesiomorphy (retention of an old conserved character state) . In contrast to the situation in C. elegans, we were unable to identify any Clade 1 PARPs in the nematode Brugia malayi, in the order Spirudida, but did identify a clear tankyrase (Figures 2, 3 and 8). The nematodes are clearly outliers within the animal lineage and a closer examination of the PARP family across a greater number of such species would be interesting.
Although PARPs are found throughout the eukaryotes, these proteins are not essential for eukaryotic life. This is illustrated most clearly in the fungal lineage within the Opisthokonta. In contrast to their fellow Opisthokont lineage the animals, fungi encode members of only Clades 1 and 6 PARPs (Figures 2 and 11). Lineages within the fungi have independently lost PARPs at least five times, illustrating that eukaryotic organisms do not absolutely require this family of proteins. In addition, it should be noted that none of the fungal species examined retained Clade 6 PARPs in the absence of Clade 1 PARPs. This underscores the relative importance of the so-called "classical" Clade 1 PARPs in these organisms. Interestingly, many of the fungi that have lost all PARPs, including the model fungal systems Saccharomyces cerevisiae and Schizosaccharomyces pombe, are yeasts. This suggests fungi with more complex life cycles may retain PARPs more readily than yeasts do. It is possible that a selective advantage is found in organisms with relatively rapid generation times in dispensing with this class of proteins. This is supported by the retention of Clade 1 PARPs in the basal Saccharomycia fungus Yarrowia lipolytica while the two other sequenced members of this fungal group have lost all PARPs (Figure 11C). Yarrowia can grow in three forms: as yeast, hyphae and pseudohyphae. Candida albicans, also a Saccharomyces member, is trimorphic but lacks PARPs; however, this diploid organism lacks a known sexual cycle, suggesting a simplification of its life cycle. Sacchromyces cerevisiae is only dimorphic, growing only as yeast or pseudohyphae (reviewed in ). Other groups have noted the association of retention of PARPs with filamentous growth . This correlation is also found in the dimorphic human pathogen Histoplasma capsulatum, the cause of histoplasmosis, which grows as either yeast or hyphae. In this organism, we have found that its Clade 6A PARP gene is expressed only during the filamentous growth stage and not when the fungus is growing in the yeast form (Lee and Lamb, data not shown).
Our conclusions about the function and distribution of PARP proteins in the eukaryotes are limited by the availability of species with sequenced genomes. Currently, there is a dearth of sequences available in many groups of eukaryotes while animals, particularly vertebrates, and fungi are relatively well represented. A number of phylogenetically important groups such as streptophyte algae, glaucophytes, phaeophytes, dinoflagellates, and archamoebe have no sequenced genomes. The eukaryotic supergroup Amoebozoa is represented by only one species, Dictyostelium discoideum, while there are no representatives of Rhizaria sequenced. Despite the limitations of the available sequences, we have identified unique types of PARPs in Naegleria gruberi, Trichomonas vaginalis and green algae and clarified the phylogenetic distribution of tankyrases. There are likely to be additional variations of PARPs discovered as more eukaryotic genomes are sequenced and a further advancement of our understanding of evolution of this important proteins superfamily.
Clade 5 and vaults
The Clade 5 PARPs have a limited phylogenetic distribution, found only in a subset of animals and amoeba (Figure 9). vPARP was originally identified in a two-hybrid screen using the major vault protein (MVP) protein as bait and shown to act as a bona fide PARP . vPARP associates not only with the ribonucleoprotein vault complex, but also can be found in the nucleus, associated with the telomere and the mitotic spindle. The function of vPARP at any of its locations is unclear. Vaults have been best studied in mammals and in these organisms are composed of three proteins, MVP, TEL1 (also found at telomeres), and vPARP. In addition, several vault specific RNAs (vRNAs) are found. The function or functions of vaults are still unclear; they are associated with drug resistance and several signalling pathways (reviewed in ), as well as the nuclear pore complex [103, 104]. vPARP-deficient mice are normal and fertile with no defects in telomeres or vaults . More recently these mice have been found to develop more tumours in response to carcinogens, suggesting a role in chemically induced cancers .
Vaults have been identified in diverse animals and in other eukaryotes such as the amoeba Dictyostelium discoideum, flatworms, and trypanosomatides [81, 82]. However, vaults appear to be missing from fungi, a number of model animals (C. elegans and Drosophila melanogaster) and in plants [107, 108, 109].
The fact that vPARP does not appear essential for normal development or vault structure in mouse  suggests that this protein is not essential for vault function. This may explain why organisms that have been demonstrated to contain vaults in their cells do not always encode proteins that look like vPARP.
Clade 2 plant-specific PARPs are involved in stress responses
In addition to containing three Clade 1 PARPs throughout and Clade 6 PARPs only in the bryophytes, the land plants contain a unique clade of PARP-like proteins. This clade can be subdivided into two subclades, one of which contains proteins with an N terminal WWE domain. Clade 2 is distinct from Clade 3, which also contains proteins with WWE domains. A group within Clade 2, confined to the eudicots within the angiosperms, consists of truncated proteins lacking the N terminal WWE domain. Examination of the phylogeny of Clade 2 clearly illustrates the importance of genome duplication during plant evolution [110, 111, 112]; plant species tend to encode gene pairs (Figure 5).
The plant Clade 2 proteins have only been investigated in the model angiosperm Arabidopsis thaliana. Arabidopsis has two genes, RCD1 and SRO1, which encode full-length members of Clade 2A [64, 113]. RCD1 was originally identified as a stress response gene . It is involved in the response to several abiotic stresses and shows altered hormone accumulation and gene expression [64, 114, 115]. rcd1 mutants also display pleiotropic developmental defects including reduced stature, malformed leaves, and early flowering . Loss of SRO1 causes only minor defects; however rcd1; sro1 double mutants are severely affected with a majority of individuals dying during embryogenesis [65, 69], indicating that this clade of PARP proteins has essential functions in land plants. RCD1 has been shown to bind to a number of transcription factors, suggesting that Clade 2 PARPs may function in transcriptional regulation [69, 113]. RCD1 does not appear to have catalytic activity, consistent with the absence of the HYE catalytic triad in this protein (Figure 1 and Additional file 7); however, other members of this clade do contain variant HYE motifs that may confer activity (Additional file 7). Therefore, it will be necessary to test individual members of this clade for activity.
Four genes in Arabidopsis, SRO2-5, encode proteins within Clade 2 that lack the N terminal WWE domain [64, 113] and consist of two gene pairs: SRO2/SRO3 and SRO4/SRO5 (Figure 5). These genes may be involved in stress signalling; SRO5 is necessary for response to both salt and oxidative stress  and can bind transcription factors  and SRO2 is up regulated in chloroplastic ascorbic peroxidase mutants .
Multiple independent acquisitions of mART activity within the PARP superfamily
Although not closely evolutionarily related (Figure 1), the proteins belonging to Clades 3 and 6 have modified their catalytic domains, replacing the glutamic acid of the "HYE" catalytic triad with various other amino acids (Figure 7 and Additional files 8 and 11). The catalytic activity of several human members of Clade 3 has been experimentally investigated. PARP10, which falls into Clade 3A and has an isoleucine instead of a glutamic acid in its catalytic site, has been reported to have auto(ADP-ribosyl)ation activity and modify core histones [33, 34]. More recently it was shown to have mono(ADP-ribosyl)ation activity, not poly(ADP-ribosyl)ation activity, and therefore function as a mono(ADP-ribosyl) transferase (mART) rather than a PARP . Molecular modelling suggested that this enzyme uses substrate-assisted catalysis in order to activate the NAD+ substrate. This group further demonstrated that PARP14/BAL2, a Clade 3C member with a leucine in place of the glutamic acid, also has mART activity, consistent with an earlier paper demonstrating auto(ADP-ribosyl)ation activity . A human member of Clade 3F, PARP9/BAL1, has not only replaced the glutamic acid within the catalytic PARP signature but have also replaced the histidine (with a glutamic acid). This enzyme has been shown to be inactive [32, 35]. Almost all of the proteins comprising both Clade 3 and Clade 6 have replaced at least the glutamic acid of the "HYE" triad. It is likely that none of these proteins function as bone fide PARPs but rather are either mARTs or are no longer enzymatically active. Clade 3 has a limited taxonomic distribution (Figures 2 and 6); Clade 6, on the other hand, is found in at least three of the six eukaryotic supergroups and was likely present in the LCEA (Figure 12A). This suggests that the evolution of mART activity within the PARP gene family occurred before the full complement of crown groups had formed. In addition, the changes in the catalytic domain of the Clade 2 proteins also suggest that these proteins have altered enzymatic activities (Additional file 6). Therefore, it is likely that mART activity and/or loss of enzymatic activity has evolved at least twice from PARP activity (in Clades 3 and 2) and that mART activity in extant Clade 6 proteins represents an even earlier acquisition of this enzymatic activity.
What functions do PARP-like/mART proteins play? While no members of Clade 6 have been characterized, several members of Clade 3 have, all in mammalian systems. PARP9/BAL1, PARP14/BAL2, and PARP15/BAL3 have been shown to interact with transcription factors and mediate transcriptional repression or activation [35, 75, 117, 118]. PARP13/ZCC2/ZAP has been shown to bind to viral RNA through its zinc fingers and promote degradation of the RNA by the exosome [119, 120, 121, 122, 123, 124]. PARP12 shares significant similarity to PARP13 and is thought to function similarly. PARP10 interacts with MYC and inhibits transformation; its overexpression leads to a loss of cell viability [33, 34]. To date, no clear consensus about the function of Clade 3 proteins can be formulated.
True tankyrases are confined to animals
Human tankyrase1 was originally identified as a telomeric protein interacting with TRF1, a negative regulator of telomere length. It was shown to act as a PARP and automodify itself as well as TRF1 . A second human tankyrase, tankyrase2 (Figure 4), was identified shortly after the initial discovery of tankyrase1 [28, 29, 125]. Human tankyrases can be found both in the nucleus , at the nuclear pore and centrosome , and in the cytoplasm associated with the Golgi or vesicles  or the plasma membrane . Since their initial discovery, the known functions of these proteins have expanded to include spindle assembly and vesicle trafficking (reviewed in ), sister chromatid segregation , and regulation of the WNT pathway [130, 131, 132]. Tankyrases have been identified in a number of animal species, including mouse. In this model organism, it appears tankyrase may not function in telomere length control , but its other functions are conserved and its function is essential . Consistent with functions outside of the telomere, a tankyrase is found in Drosophila melanogaster (Figure 8; ), an organism with a highly divergent telomere consisting of transposons rather than the short repeats found in other eukaryotes .
Our phylogenetic tree places a number of proteins previously reported as tankyrases in Clade 1, rather than within Clade 4 (Figures 3 and 8). These proteins do have a different domain structure than tankyrases, sharing ankyrin repeats with tankyrases but having WGR and PRD domains rather than SAM motifs (Figure 4). It is likely that the Clade 1 ankyrin repeat proteins do not share functions with tankyrases.
PME5 from C. elegans was reported as a tankyrase and has been functionally characterized. As mentioned above, this protein does not clearly group with any clade, including Clade 4 (Additional file 4). In the original paper describing PME5, it was shown to be more closely related to a Dictyostelium discoideum protein we have placed in Clade 1A (Q54E42) and to have a higher similarity within the catalytic domain to human PARP1 than human tankyrase . In addition, the induction of PME5 expression by DNA damaging agents, the increased apoptosis in pme5(RNAi) lines after DNA-damage, and the constitutively nuclear chromatin-associated localization of PME5 [53, 136] is more consistent with a role in DNA damage. However, the difficulty in placing C. elegans PARPs into clades complicates the issue. Further work will need to be done to determine the function of PME5.
Connections between ubiquitination, SUMOylation and poly(ADP-ribosyl)ation
The attachment of ubiquitin to proteins is an important mechanism in regulating many cellular processes. Similarly to ADP-ribosylation, one to many ubiquitin units can be added to proteins, although only on lysine resides. A chain consisting of at least four ubiquitin linked together by Lys48 residues causes destruction of the protein via the 26S proteasome [137, 138], while either monubiquitination or polyubiquitination with chains linked at Lys63 serve as nonproteolytic signals in such processes as trafficking, DNA repair, and signal transduction [139, 140]. Ubiquitination of proteins involves an enzymatic cascade involving ubiqutin-activating (E1), ubiquitin-conjugating (E2), and ubiquitin-ligating (E3) enzymes.
A number of connections between PARP proteins and ubiquitination have emerged. One connection involves the fact that both attachment of ubiquitin and ADP-ribose can be made at lysine residues, suggesting that these post-translational modifications could compete for substrates. In addition, several protein domains found in PARP proteins can also be found in proteins associated with the ubiquitin system (Figure 4). For example, many Clade 1 proteins have BRCT domains; these domains were originally identified in the BRCA1 protein. BRCA1 functions as an E3 ligase in a multi-protein complex in response to DNA damage [141, 142, 143]. Within Clade 6, Clade 6A proteins have a UBCc domain, similar to that found in ubiquitin E2s , at their C termini, as well as FPE domains at their N termini (Additional Figures 12, 13 and 14). This novel domain has some similarity to the RWD domain, which in turn is related to the UBCc domain, although thought to be non-catalytic. WWE domains are found in Clade 2 and 3 proteins and also in certain ubiquitin E3 ligases . Some Clade 3 proteins have UIM domains, which can bind ubiquitin and polyubiquitin chains ; this domain is also found in the BRCA1-interacting protein Rap80 . The Dictyostelium discoideum protein DDB0393590 contains a U-box (Figure 4), found in E3 ubiqutin ligases and known to bind E2 enzymes .
In addition to the structural similarities found between PARPs and classes of Ub enzymes, some functional connections are also known. Human PARP14/BAL2, a Clade 3E member, has been shown to bind to the multifunctional phosphoglucose isomerase/autocrine motility factor (PGI/AMF). This binding inhibits polyubiquitination of PGI/AMF, stabilizing the protein . PARP1 in humans is regulated by ubiquitination  and has been shown to bind to the E2 enzyme hUBC9 . Proteasome-mediated proteolysis of ubiquitinated tankyrase has also been documented; this is promoted by the auto-poly(ADP-ribosyl)ation of tankyrase, which releases the protein into the cytoplasm . This is similar to the mechanism whereby tankyrase poly(ADP-ribosyl)ates the telomeric protein TRF1, releasing it from the telomere, allowing its ubiquitination and degradation  and the regulation of axin by tankyrase . There are likely to be more connections found in the future between post-translational ADP-ribosylation and ubiquitination.
Recently, a connection between poly(ADP-ribosyl)ation and SUMOylation has also been demonstrated. PARP1 itself is SUMOylated [150, 151], and this takes place within its automodification domain and does not regulate poly(ADP-ribosyl)ation activity . Rather, PARP1's transcriptional co-activator activity is modified [150, 151]. PARP1 can also form higher order complexes and influence SUMOylation of other proteins. In response to both heat shock and DNA damage, human PARP1 associates with the SUMO E3 ligase PIASy [151, 152] and this requires a PAR-binding motif in this protein . Upon DNA damage, PIASy associates with PAR on PARP1 and subsequently its target NEMO binds and is SUMOylated by PIASy, leading to NF-kappaB activation . Clearly, the interplay between poly(ADP-ribosyl)ation and other post-translational modifications is just beginning to be explored.
We present here a large-scale phylogenetic analysis of the PARP gene family that extends previous examination of this family. Several main conclusions can be drawn from our study. First, the phylogenetic distribution of the PARP protein family is tremendously broad across the eukaryotes, consistent with the last common ancestor of modern eukaryotes containing at least two PARP-encoding genes. Second, two types of PARP-like proteins were present in the LCEA; one likely functioned in DNA repair and genomic maintenance and resembled modern members of Clade 1. The second probably had mART activity. Third, increasing numbers and types of PARP-like protein are likely to be found as more eukaryotic organisms have their genomes sequenced.
Retrieval of the PARP gene sequences
The initial sequence set was selected from the Pfam database (http://pfam.sanger.ac.uk/; [41, 42, 43]), using the sequences identified as members of the PARP family (PF00644). The full sequences of the proteins were retrieved from UniProt [44, 45], using the links provided by Pfam. Additional sequences were retrieved from other eukaryotic organisms at the DOE Joint Genome Institute (JGI; http://www.jgi.doe.gov/), the Broad Institute http://www.broadinstitute.org, the J. Craig Venter Institute http://www.jcvi.org/, ToxoDB (http://toxodb.org/toxo/; ), and the Arabidopsis Information Resource (TAIR; http://www.arabidopsis.org/) using BLAST searches  based on human or Arabidopsis thaliana PARP catalytic domain sequences as search queries. Specific phylogenetically interesting genomes were also individually searched by BLAST to confirm the absence of PARP proteins (see Table 1). The catalytic domains of most retrieved sequences were delineated using Pfam. Sequences in Clade 6 have lower similarity to the classical PARPs (i.e. Clade 1) used to generate the Pfam HMM, so the PARP catalytic domains for these sequences were identified using BLAST searches based on human PARP6 catalytic domain as the query and identifying the region of retrieved sequences that had similarity to this PARP signature. In addition, many sequences whose catalytic domain was incompletely identified by Pfam were completed by BLAST searches using closely related complete PARP catalytic domains from other closely related species, in order to provide as much sequence information as possible for the alignment and phylogeny inference. The identified PARP catalytic domains were extracted using the extract.pl tool in the Wildcat Toolbox set of Perl utilities (http://proteomics.arizona.edu/wildcat_toolbox; ). Sequences of less than 100 amino acids in length and many that were missing important structural elements of the PARP domain were discarded to allow better alignment and phylogenetic signal recovery. Many of these sequences were obtained from shotgun sequencing and are presumably incomplete.
The collected PARP catalytic domains were aligned using the MUSCLE3.8.31 multiple alignment tool, using default settings . The multiple alignment was subjected to a maximum-likelihood (ML) analysis using PhyML3.0  using the computer facilities at the Ohio Supercomputer Center http://www.osc.edu. The substitution model parameters using for the PhyML analysis were the WAG substitution matrix, Γ8+I correction to model site rate heterogeneity and empirical equilibrium frequencies. These parameters were selected as the optimal substitution model based on analysis by ProtTest v2.4 . A parsimony-based starting tree was used. Branch supports were computed in PhyML using an aLRT non-parametric Shimodaira-Hasegawa-like (SH) procedure . Once a tree with all PARP domains had been generated, it was used to identify the six clades referred to in the text in combination with examination of domains outside of the PARP catalytic domain. After the six clades were defined, sequences from each clade were aligned separately using MUSCLE. These alignments were used to generate individual clade trees using PhyML with identical parameters. The phylogenetic trees were generated for figures using FigTree http://tree.bio.ed.ac.uk/software/figtree. Alignment figures were generated using TEXshade  and Jalview .
Prediction of protein domains
After sequences of PARP family members were retrieved and placed into clades, the sequences were checked for other domains at the Pfam website . Domains identified are shown in Figure 4. PfamB_30617 was identified in Clade 6A fungal proteins and extracted aligned as above. This domain was further analyzed using the Protein homology/analogy recognition engine (Phyre)  and renamed FPE (Fungal PARP E2-associated). Subsequently, a consensus FPE sequence was used in BLAST searches to find other proteins containing this region. The UBCc domains from Clade 6A proteins were similarly processed.
We thank Dr. Iris Meier (Ohio State University) and two anonymous reviewers for critical reading of the manuscript and members of the Lamb laboratory for discussions. This work was supported in part by an allocation of computing time from the Ohio Supercomputer Center. This work was supported by a grant from the Ohio Plant Biotechnology Consortium to RSL and by funds from the Ohio State University.
- 2.Fujimura S, Hasegawa S, Shimizu Y, Sugimura T: Polymerization of the adenosine 5'-diphosphate-ribose moiety of nicotinamide-adenine dinucleotide by nuclear enzyme. I. Enzymatic reactions. Biochim Biophys Acta. 1967, 145 (247-259):Google Scholar
- 3.Chambon P, Weil JD, Doly J, Strosser MT, Mandel P: On the formation of a novel adenylic compound by enzymatic extracts of liver nuclei. Biochem Biophys Res Commun. 1966, 25: 638-643. 10.1016/0006-291X(66)90502-X.Google Scholar
- 5.Doly J, Petek F: Etude de la structure d'un compose "poly(ADP-ribose" synthetise par des extraits nucleaires de foie de poulet. CR Hebd Scanc Acad Sci Ser D Sci Nat. 1966, 263: 1341-1344.Google Scholar
- 15.Fong PC, Boss DS, Yap TA, Tutt A, Wu P, Mergui-Roelvink M, Mortimer P, Swaisland H, Lau A, O'Connor MJ, Ashworth A, Carmichael j, Kaye SB, Schellens JH, de Bono JS: Inhibition of Poly(ADP-Ribose) Polymerase in Tumors from BRCA Mutation Carriers. N Engl J Med. 2009, 361 (2): 123-134. 10.1056/NEJMoa0900212.PubMedGoogle Scholar
- 16.Cottet F, Blanche H, Verasdonck P, Le Gall I, Schachter F, Burkle A, Muiras ML: New polymorphisms in the human poly(ADP-ribose) polymerase-1 coding sequence: lack of association with longevity or with increased cellular poly(ADP-ribosyl)ation capacity. J Mol Med. 2000, 78 (8): 431-440. 10.1007/s001090050488.PubMedGoogle Scholar
- 17.Tezcan G, Gurel CB, Tutluoglu B, Onaran I, Kanigur-Sultuybek G: The Ala allele at Val762Ala polymorphism in poly(ADP-ribose) polymerase-1 (PARP-1) gene is associated with a decreased risk of asthma in a Turkish population. J Asthma. 2009, 46 (4): 371-374. 10.1080/02770900902777791.PubMedGoogle Scholar
- 24.Podesta D, Garcia-Herreros MI, Cannata JJ, Stoppani AO, Fernandez Villamil SH: Purification and properties of poly(ADP-ribose)polymerase from Crithidia fasciculata. Automodification and poly(ADP-ribosyl)ation of DNA topoisomerase I. Mol Biochem Parasitol. 2004, 135 (2): 211-219. 10.1016/j.molbiopara.2004.02.005.PubMedGoogle Scholar
- 33.Yu M, Schreek S, Cerni C, Schamberger C, Lesniewicz K, Poreba E, Vervoorts J, Walsemann G, Grotzinger J, Kremmer E, Mehraein Y, Mertsching J, Kraft R, Austen M, Luscher-Firzlaff J, Luscher B: PARP-10, a novel Myc-interacting protein with poly(ADP-ribose) polymerase activity, inhibits transformation. Oncogene. 2005, 24 (12): 1982-1993. 10.1038/sj.onc.1208410.PubMedGoogle Scholar
- 36.Augustin A, Spenlehauer C, Dumond H, Menissier-De Murcia J, Piel M, Schmit AC, Apiou F, Vonesch JL, Kock M, Bornens M, De Murcia G: PARP-3 localizes preferentially to the daughter centriole and interferes with the G1/S cell cycle progression. J Cell Sci. 2003, 116 (Pt 8): 1551-1562. 10.1242/jcs.00341.PubMedGoogle Scholar
- 41.Coggill P, Finn RD, Bateman A: Identifying protein domains with the Pfam database. Curr Protoc Bioinformatics. 2008, Chapter 2: Unit 2 5Google Scholar
- 43.Finn RD, Tate J, Mistry J, Coggill PC, Sammut SJ, Hotz HR, Ceric G, Forslund K, Eddy SR, Sonnhammer EL, Bateman A: The Pfam protein families database. Nucleic Acids Res. 2008, D281-288. 36 DatabaseGoogle Scholar
- 45.The Universal Protein Resource (UniProt) 2009. Nucleic Acids Res. 2009, D169-174. 37 DatabaseGoogle Scholar
- 47.Gajria B, Bahl A, Brestelli J, Dommer J, Fischer S, Gao X, Heiges M, Iodice J, Kissinger JC, Mackey AJ, Pinney DF, Roos DS, Stoeckert CJ, Wang H, Brunk BP: ToxoDB: an integrated Toxoplasma gondii database resource. Nucleic Acids Res. 2008, D553-556. 36 DatabaseGoogle Scholar
- 48.Fey P, Gaudet P, Curk T, Zupan B, Just EM, Basu S, Merchant SN, Bushmanova YA, Shaulsky G, Kibbe WA, Chisholm RL: dictyBase--a Dictyostelium bioinformatics resource update. Nucleic Acids Res. 2009, D515-519. 10.1093/nar/gkn844. 37 DatabaseGoogle Scholar
- 49.Swarbreck D, Wilks C, Lamesch P, Berardini TZ, Garcia-Hernandez M, Foerster H, Li D, Meyer T, Muller R, Ploetz L, Radenbaugh A, Singh S, Swing V, Tissier C, Zhang P, Huala E: The Arabidopsis Information Resource (TAIR): gene structure and function annotation. Nucleic Acids Res. 2008, D1009-1014. 36 DatabaseGoogle Scholar
- 52.Ohi MD, Link AJ, Ren L, Jennings JL, McDonald WH, Gould KL: Proteomics analysis reveals stable multiprotein complexes in both fission and budding yeasts containing Myb-related Cdc5p/Cef1p, novel pre-mRNA splicing factors, and snRNAs. Mol Cell Biol. 2002, 22 (7): 2011-2024. 10.1128/MCB.22.7.2011-2024.2002.PubMedCentralPubMedGoogle Scholar
- 54.Gravel C, Stergiou L, Gagnon SN, Desnoyers S: The C. elegans gene pme-5: molecular cloning and role in the DNA-damage response of a tankyrase orthologue. DNA Repair (Amst). 2004, 3 (2): 171-182. 10.1016/j.dnarep.2003.10.012.Google Scholar
- 62.Jaspers P, Overmyer K, Wrzaczek M, Vainonen JP, Blomster T, Salojarvi J, Reddy RA, Kangasjarvi J: The RST and PARP-like domain containing SRO protein family: analysis of protein structure, function and conservation in land plants. BMC Genomics. 2010, 11: 170-10.1186/1471-2164-11-170.PubMedCentralPubMedGoogle Scholar
- 63.Overmyer K, Tuominen H, Kettunen R, Betz C, Langebartels C, Sandermann H, Kangasjarvi J: Ozone-sensitive arabidopsis rcd1 mutant reveals opposite roles for ethylene and jasmonate signaling pathways in regulating superoxide-dependent cell death. Plant Cell. 2000, 12 (10): 1849-1862. 10.1105/tpc.12.10.1849.PubMedCentralPubMedGoogle Scholar
- 64.Ahlfors R, Lang S, Overmyer K, Jaspers P, Brosche M, Tauriainen A, Kollist H, Tuominen H, Belles-Boix E, Piippo M, Inze D, Palva ET, Kangasjarvi J: Arabidopsis RADICAL-INDUCED CELL DEATH1 belongs to the WWE protein-protein interaction domain protein family and modulates abscisic acid, ethylene, and methyl jasmonate responses. Plant Cell. 2004, 16 (7): 1925-1937. 10.1105/tpc.021832.PubMedCentralPubMedGoogle Scholar
- 69.Jaspers P, Blomster T, Brosche M, Salojarvi J, Ahlfors R, Vainonen JP, Reddy RA, Immink R, Angenent G, Turck F, Overmyer K, Kangasjarvi J: Unequally redundant RCD1 and SRO1 mediate stress and developmental responses and interact with transcription factors. Plant J. 2009, 60 (2): 268-279. 10.1111/j.1365-313X.2009.03951.x.PubMedGoogle Scholar
- 87.Verhagen AM, Coulson EJ, Vaux DL: Inhibitor of apoptosis proteins and their relatives: IAPs and other BIRPs. Genome Biol. 2001, 2 (7): 10.1186/gb-2001-2-7-reviews3009. REVIEWS3009Google Scholar
- 98.Felsenstein J: Cases in which parsimony or compatibility methods will be positively misleading. Syst Zool. 1978, 27: 401-410. 10.2307/2412923.Google Scholar
- 99.Pol D, Siddall ME: Biases in maximum likelihood and parsimony: a simulation approach to a 10-taxon case. Cladistics. 2001, 17: 266-281. 10.1006/clad.2001.0172.Google Scholar
- 100.Hennig W: Phylogenetic Systematics. 1966, Urbana, IL: University of Illinois PressGoogle Scholar
- 105.Liu Y, Snow BE, Kickhoefer VA, Erdmann N, Zhou W, Wakeham A, Gomez M, Rome LH, Harrington L: Vault poly(ADP-ribose) polymerase is associated with mammalian telomerase and is dispensable for telomerase function and vault structure in vivo. Mol Cell Biol. 2004, 24 (12): 5314-5323. 10.1128/MCB.24.12.5314-5323.2004.PubMedCentralPubMedGoogle Scholar
- 114.Fujibe T, Saji H, Arakawa K, Yabe N, Takeuchi Y, Yamamoto KT: A methyl viologen-resistant mutant of Arabidopsis, which is allelic to ozone-sensitive rcd1, is tolerant to supplemental ultraviolet-B irradiation. Plant Physiol. 2004, 134 (1): 275-285. 10.1104/pp.103.033480.PubMedCentralPubMedGoogle Scholar
- 115.Overmyer K, Brosche M, Pellinen R, Kuittinen T, Tuominen H, Ahlfors R, Keinanen M, Saarma M, Scheel D, Kangasjarvi J: Ozone-induced programmed cell death in the Arabidopsis radical-induced cell death1 mutant. Plant Physiol. 2005, 137 (3): 1092-1104. 10.1104/pp.104.055681.PubMedCentralPubMedGoogle Scholar
- 136.Dequen F, Gagnon SN, Desnoyers S: Ionizing radiations in Caenorhabditis elegans induce poly(ADP-ribosyl)ation, a conserved DNA-damage response essential for survival. DNA Repair (Amst). 2005, 4 (7): 814-825. 10.1016/j.dnarep.2005.04.015.Google Scholar
- 162.Palenik B, Grimwood J, Aerts A, Rouze P, Salamov A, Putnam N, Dupont C, Jorgensen R, Derelle E, Rombauts S, et al: The tiny eukaryote Ostreococcus provides genomic insights into the paradox of plankton speciation. Proc Natl Acad Sci USA. 2007, 104 (18): 7705-7710. 10.1073/pnas.0611046104.PubMedCentralPubMedGoogle Scholar
- 163.Nozaki H, Takano H, Misumi O, Terasawa K, Matsuzaki M, Maruyama S, Nishida K, Yagisawa F, Yoshida Y, Fujiwara T, Takio S, Tamura K, Chung SJ, Nakamura S, Kuroiwa H, Tanaka K, Sato N, Kuroiwa T: A 100%-complete sequence reveals unusually simple genomic features in the hot-spring red alga Cyanidioschyzon merolae. BMC Biol. 2007, 5: 28-10.1186/1741-7007-5-28.PubMedCentralPubMedGoogle Scholar
- 166.Jeffries TW, Grigoriev IV, Grimwood J, Laplaza JM, Aerts A, Salamov A, Schmutz J, Lindquist E, Dehal P, Shapiro H, Jin YS, Passoth V, Richardson PM: Genome sequence of the lignocellulose-bioconverting and xylose-fermenting yeast Pichia stipitis. Nat Biotechnol. 2007, 25 (3): 319-326. 10.1038/nbt1290.PubMedGoogle Scholar
This article is published under license to BioMed Central Ltd. This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/2.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.