A sophisticated, differentiated Golgi in the ancestor of eukaryotes
- 2.3k Downloads
The Golgi apparatus is a central meeting point for the endocytic and exocytic systems in eukaryotic cells, and the organelle’s dysfunction results in human disease. Its characteristic morphology of multiple differentiated compartments organized into stacked flattened cisternae is one of the most recognizable features of modern eukaryotic cells, and yet how this is maintained is not well understood. The Golgi is also an ancient aspect of eukaryotes, but the extent and nature of its complexity in the ancestor of eukaryotes is unclear. Various proteins have roles in organizing the Golgi, chief among them being the golgins.
We address Golgi evolution by analyzing genome sequences from organisms which have lost stacked cisternae as a feature of their Golgi and those that have not. Using genomics and immunomicroscopy, we first identify Golgi in the anaerobic amoeba Mastigamoeba balamuthi. We then searched 87 genomes spanning eukaryotic diversity for presence of the most prominent proteins implicated in Golgi structure, focusing on golgins. We show some candidates as animal specific and others as ancestral to eukaryotes.
None of the proteins examined show a phyletic distribution that correlates with the morphology of stacked cisternae, suggesting the possibility of stacking as an emergent property. Strikingly, however, the combination of golgins conserved among diverse eukaryotes allows for the most detailed reconstruction of the organelle to date, showing a sophisticated Golgi with differentiated compartments and trafficking pathways in the common eukaryotic ancestor.
KeywordsGolgi apparatus GRASP golgin evolutionary cell biology secretion membrane trafficking
At the intersection of the secretory and endocytic membrane-trafficking pathways in eukaryotes lies the Golgi. This organelle comprises a series of compartments termed cisternae, providing a platform for protein transport, glycosylation, and targeting. The Golgi is crucially important for normal cellular function, as demonstrated by the myriad diseases that result when genes associated with it are mutated . The most salient hallmark of Golgi structure is the presence of multiple membranous compartments, differentiated into cis, medial, and trans-Golgi, and organized into flattened stacks, which facilitates many key Golgi functions in mammalian cells . In mammalian cells, numerous proteins are involved in maintaining the structure and positioning of the Golgi, as well as the specificity of membrane trafficking pathways to and from the Golgi , although the precise mechanism of Golgi stacking is unknown.
Golgins and Golgi reassembly and stacking proteins (GRASPs) are the main factors implicated in Golgi organization and stacking, as reviewed previously . The golgins are a collection of 11 proteins in mammalian cells defined by the presence of coiled-coil domains, attachment to Golgi membranes near their C-termini (either by tail-anchor transmembrane domains or through binding to small GTPases), and functions that include tethering/scaffolding [3, 5]. The domain topology and functions of mammalian golgins have been reviewed extensively elsewhere [3, 6]. Striking evidence for a role of GRASP55, GRASP65, GM130, and golgin-45 in stacking was shown by a knock-sideways experiment demonstrating that ectopic expression of GRASP55 on mitochondria is sufficient for the stacking of mitochondrial and Golgi membranes together . A similar ectopic expression of golgin-84 on mitochondrial membranes also caused stacking of mitochondria . In addition to apparent roles in stacking, golgins, including GM130 and golgin-84, are involved in tethering specific transport vesicles destined for different regions of the Golgi . Furthermore, several golgins, including GM130, are involved in connecting the Golgi to the cytoskeleton [9, 10]. Various additional proteins have also been suggested to be involved in Golgi structure and organization (Additional file 1: Table S1).
The integral role of golgins and other implicated structural proteins at the Golgi makes their evolutionary histories essential to reconstructing both the nature of the Golgi in the last eukaryotic common ancestor (LECA) approximately 1.5 billion years ago , and to tracing the subsequent changes that have occurred in the evolution of diverse eukaryotic lineages. While it has been inferred that the LECA possessed a stacked Golgi , whether there are pan-eukaryotic proteins (e.g., golgins) that may have conserved roles in Golgi stacking remains unknown. Furthermore, the extent and details of golgin-mediated vesicle trafficking in the diversity of eukaryotes as compared with mammalian cells is also an open question.
Intriguingly, while Golgi stacking is observed in most organisms across eukaryotic diversity, there are a few lineages of microbial eukaryotes that lack stacked Golgi, as reviewed previously . In the absence of a morphologically recognizable Golgi, the question arose, for each of these lineages as to whether the organelle (1) was ever present, (2) was present but is no longer a feature of the cellular configuration, or (3) is present but has been shifted to an unrecognizable morphology.
Phylogenetic analysis to determine the evolutionary relationships of these organisms has placed them as embedded within various different eukaryotic groups, in almost every case having relatives with canonical stacked Golgi, rather than related to other organisms lacking stacks [13, 14, 15, 16]. Furthermore, in every case yet examined, when genome-scale data became available, genes were identified that encode orthologues of proteins that function at the Golgi in mammalian and yeast systems [16, 17, 18, 19]. Localization data and functional assays have also confirmed that these proteins are expressed and indeed have shown that discrete Golgi, of morphologies other than stacked cisternae, exist in several of these lineages [19, 20, 21, 22]. Recent genomic data for diverse eukaryotes, including from additional organisms with evidence for unstacked Golgi, therefore present the opportunity to understanding the evolution of Golgi structure across the broadest span of eukaryotes and organelle morphologies.
Herein, we report an analysis of golgins and other Golgi structure-associated proteins across eukaryotes, using genomics, molecular cell biology, and bioinformatics techniques to address evolutionary cell biology of the Golgi in eukaryotes.
The genome of the “Golgi-less” amoeba M. balamuthi encodes Golgi proteins
Golgi-like compartments in M. balamuthi are dispersed and punctate
The COPI complex mediates traffic from the Golgi to the ER in eukaryotic cells, and therefore the ER would be a likely location for the COPI complex were a Golgi not present. To ensure that this was not the case, we co-localized the COPI-β with protein disulfide-isomerase (PDI), a well-known ER marker. This showed a PDI signal present in tubular structures close to nuclei as well as in numerous vesicles in the endoplasm, but little overlap with the COPI-β signal (Fig. 2, top row). Furthermore, since hydrogenosomes, the mitochondria-derived organelles in M. balamuthi, can also take the form of small discrete punctae , co-localization experiments were performed (Fig. 2, middle row) showing no overlap between COPI-β and the hydrogenosomal marker malate dehydrogenase. Together, these informatics and microscopy results are most consistent with the presence of a cryptic unstacked Golgi in M. balamuthi, and validate the inclusion of genomic information from this organism in our subsequent searches.
Evolution of the interacting Golgi structural proteins GM130, golgin-45, GRASP55, and GRASP65
To understand the distribution and evolution of proteins with putative roles in Golgi stacking, we performed comparative genomic searches to assess the taxonomic distribution of mammalian golgins, as well as other Golgi proteins that are either golgin-like (e.g., golgin-45), golgin-associated (e.g., ZFPL1), or GRASPs (Additional file 1: Table S1).
The above observations suggest that the origin of both GM130 and golgin-45 predates the duplication that produced separate GRASP55 and GRASP65 paralogues, rather than coordinately appearing with them. Recent structural studies have elucidated the interaction between GRASP65 and GM130 , and between GRASP55 and golgin-45 , suggesting that these binding interactions involve specific residues near the C-terminus of GM130 and golgin-45 interacting with specific residues of GRASP65 and GRASP55, respectively. Evaluation of the conservation of these residues in vertebrates and non-vertebrate holozoan GM130 homologues reveals that residues near the C-termini that are important for binding to GRASP65 are contained in an extended region acquired in a vertebrate ancestor (Additional file 8: Figure S5A). These residues include F975 and I990 of the human orthologue, which have been experimentally shown to be important for binding of GM130 to GRASP65 . GRASP65 may have become specialized for interaction with GM130 in vertebrates through corresponding amino acid substitutions. For example, M164 of GRASP65 is one of several residues that form a hydrophobic cleft occupied by the C-terminus of GM130 . However, while GRASP65 orthologues have either methionine or leucine residues at the position corresponding to M164, GRASP55 orthologues and pre-duplicate GRASP have tyrosine or phenylalanine residues (Additional file 8: Figure S5B). Understanding whether GM130 interacts with preduplicate GRASP proteins in non-vertebrate metazoans will be an important point to resolve to understand both the evolution of Golgi and biology in species of ecological and agricultural importance.
Evolution of cis-Golgi golgins
The cis-Golgi receives material through anterograde vesicle transport from the ER and in a retrograde fashion from the medial-Golgi and trans-Golgi/TGN. Multiple golgins are involved in tethering incoming vesicles at cis-Golgi cisternae. Although GM130 is Holozoa specific, one of its interactors, ZFPL1 , is more widely conserved and likely present in the LECA (Fig. 4a), consistent with previous identification of a homologue in Arabidopsis, which localizes to the cis-Golgi . Similar to GM130, golgin-160 appears restricted to Metazoa, and was present in the earliest metazoans, despite being absent in Drosophila and Caenorhabditis (Fig. 3a). By contrast, its binding partner GCP16 appears to be a more ancient invention, being found in opisthokonts and Amoebozoa (Fig. 4). Even more ancient still are p115 and GMAP210, the homologues of which are found across the diversity of eukaryotes and thus were likely present in the LECA.
Mammalian GMAP210 contains an N-terminal amphipathic alpha helix (ALPS domain), which is important for tethering ER-derived vesicles to the cis-Golgi . Using the HeliQuest web service , we did not identify any such helices in the first 80 residues of GMAP210 sequences from non-vertebrates, suggesting that this is a lineage-specific mechanism for recognition of vesicles by GMAP210, consistent with previous observations . Additionally, GMAP210 orthologues from non-holozoans do not share the N-terminal tryptophan-containing motif also shown to be involved in recognizing vesicles for tethering to the cis-Golgi  (Additional file 8: Figure S5C). This motif was previously shown to be necessary for tethering vesicles containing GalNAc-T2 and giantin, but not those containing golgin-84 instead , which may indicate lineage-specific trafficking mechanisms as giantin is specific to chordates (Fig. 3b). Increased complexity of GMAP210-mediated trafficking pathways may be due to the presence of an ER–Golgi intermediate compartment (ERGIC) in metazoan cells, as GMAP210 has been shown to be involved in trafficking to both ERGIC and the cis-Golgi . In contrast to the N-terminal motifs, the Arf-binding GRAB domain of GMAP210  is conserved in orthologues across eukaryotes (Additional file 8: Figure S5D).
Evolution of cisternal rim golgins
At least four golgins localize to the rims of Golgi cisternae (including medial-Golgi cisternae) in mammalian cells, namely golgin-84, CASP, TMF, and giantin. TMF and golgin-84 have direct roles in vesicle tethering, while giantin appears to be important for organizing Golgi cisternae . Giantin is the most recently evolved, appearing in the chordates (Fig. 3). In contrast to previous suggestions that the Drosophila protein lava lamp is a giantin homologue , no homologues of giantin were identified in Drosophila. However, the origin of the giantin-interacting protein GCP60 (ACBD3)  (Additional file 1: Table S1) predates that of giantin, having originated prior to the common ancestor of extant holozoans. Both CASP and golgin-84, however, appear to have been present in the LECA as they can be identified in taxonomically diverse eukaryotic genomes (Fig. 4a and Additional file 2: Figure S1). While golgin-84 and CASP have been identified previously in plants [46, 47], we also identify orthologues of golgin-84 in Excavata, rhizarians, amoebozoans, and a basal opisthokont, and identify CASP in even more numerous taxa (Fig. 4 and Additional file 2: Figure S1).
Golgin-84, CASP, and giantin are anchored to the Golgi rims by transmembrane domains of similar length that share sequence similarity, even among mammalian and plant homologues . Mutation of a conserved tyrosine in the transmembrane domain (TMD) of mammalian CASP prevents export from the ER, suggesting a similar importance for this residue in the TMDs of golgin-84 and giantin . In addition, residues within 100 residues immediately upstream of the TMD of mammalian golgin-84 and giantin, although dissimilar to each other, were shown to be involved in localization of these proteins to the Golgi . The TMD and 100 residues on the cytoplasmic side are sufficient for Golgi localization of the Arabidopsis orthologues of both golgin-84  and CASP . Here, we confirm that the TMD and upstream cytoplasmic region of CASP and golgin-84 orthologues are conserved across eukaryotes, including Excavata (Additional file 8: Figure S5E). These observations are consistent with conserved mechanisms of localization of golgin-84 and CASP within the Golgi, which would also have occurred in the LECA’s Golgi.
Mammalian golgin-84 and TMF have previously been shown to contain tryptophan-containing N-terminal motifs similar to that of GMAP210 . Like GMAP210, TMF does not show conservation of this motif outside of metazoans. In contrast, golgin-84 orthologues across eukaryotes contain comparable N-terminal motifs (Additional file 8: Figure S5F). TMF shows conservation within the coiled-coil region that is thought to function in vesicle capture  (Additional file 9), as well as its C-terminal Rab6-binding domain  (Additional file 8: Figure S5G).
Evolution of trans-Golgi/TGN golgins
Mammalian GRIP (Golgin-97, RanBP2alpha, Imh1p, and P230/golgin-245) domain-containing golgins at the trans-Golgi/TGN receive vesicles from various endosomal sources (GCC88, golgin-97, and golgin-245) [8, 51]. The presence of four distinct GRIP golgins in mammalian cells suggests that there might be multiple ancient GRIP golgin paralogues; however, this is not what we observe. All four of the human GRIP golgins (the vesicle tethers and GCC185) appear to be restricted to metazoa (Fig. 3). Non-mammalian GRIP domain-containing proteins include the previously identified and characterized golgins Saccharomyces Imh1p , Arabidopsis AtGRIP , and Trypanosoma TbGRIP . Herein, GRIP domain-containing proteins are found across all supergroups (Fig. 4a and Additional file 2: Figure S1).
Further, the coiled-coil domain-containing protein SCY1-like 1 binding protein 1 (SCYL1BP1) binds Rab6 at the trans-Golgi in mammalian cells, but has unknown function . The origin of SCYL1BP1 predates that of the choanoflagellate lineage of Holozoa (Fig. 3). A potential Arabidopsis homologue has been noted previously . This protein was identified but did not meet the criteria for inclusion, whereas proteins that met the E-value cutoffs were identified here in Guillardia and Bigelowiella (Additional file 6: Table S3). Nevertheless, whether these are true homologues remains ambiguous considering the short length of similar sequence regions as well as the numerous independent gene losses implied by such a patchy distribution of homologues. Should these be true orthologues, then SCYL1BP1 would be deduced to have a much earlier evolutionary origin than stated. However, we suggest that conclusions regarding homology be reserved until functional characterization is available.
Evolution of additional proteins implicated in Golgi structure
Three golgin-like proteins with functions that have not been assigned to specific Golgi regions were also included in the analysis, and appear to have originated within the Holozoa or Opisthokonta. First, CG-NAP, a protein with function at both the Golgi and the centrosome  (Additional file 1: Table S1), originated prior to the divergence of Branchiostoma from other chordates. Second, homologues of NECC1/NECC2 were found to have an earlier origin, with identification of a homologue in Nematostella, indicating that the origin possibly predated the diversification of the deepest-branching animal lineages (Fig. 3). Third, SCOCO, an Arl1/Arl3-binding protein of unknown function [58, 59], appears to be opisthokont specific, with homologues only identified in fungi and Holozoa (Fig. 4 and Additional file 2: Figure S1).
Finally, an additional three proteins of interest are relevant to the evolutionary investigation of Golgi structure. First, the existence of metazoan-specific golgins suggested that lineage-specific golgin-like proteins may be present in other eukaryotic lineages as well. One such protein has already been identified in kinetoplastids, and the homologue in Trypanosoma brucei (TbG63) has been implicated in Golgi organization . Our analyses found that this protein is present in the genome of Bodo saltans, the sister lineage to trypanosomatids, but not in any non-kinetoplastids (Additional file 2: Figure S1). Second, although not localized to the Golgi, Sec16 has been shown to be widely conserved  and important for Golgi stacking in the yeast Pichia pastoris, through its function in regulating COPII coat components at tER exit sites [62, 63]. We recapitulate this finding, albeit with increased sampling. Finally, TM9SF3 is one of four widely conserved TM9 superfamily proteins (or nonaspanins) . It is not orthologous to EMP70 in Saccharomyces, which is instead more similar to human TM9SF4. Based on its exclusive Golgi localization and its loss of expression correlated with Golgi fragmentation in mammalian spermatids, TM9SF3 has been implicated in Golgi structure . Our analyses demonstrated that TM9SF3 is found across the span of eukaryotes though not in several taxonomically coherent groups, including ascomycete and basidiomycete fungi, ciliates, and apicomplexans (Fig. 4 and Additional file 2: Figure S1).
By applying comparative information from a broad diversity of eukaryotic organisms, evolutionary cell biology has the potential to provide complementary context to more traditional molecular cell biological studies. We have applied this approach to the evolution and cell biology of the Golgi.
M. balamuthi contains a cryptic Golgi
M. balamuthi was one of the organisms originally proposed to lack a Golgi, consistent with the idea at the time that it had diverged prior to the evolutionary emergence of the organelle . This idea of primitive Golgi absence has been fully disproven , and ultrastructural work has identified compartments proposed as candidate unstacked Golgi cisternae in some Mastigamoeba species (M. balamuthi was not imaged) . Nevertheless, the possibility of complete absence of this organelle in any given organism remains viable, as was recently demonstrated for mitochondria . Our genomic and immunomicroscopy data suggests that M. balamuthi possesses a cryptic Golgi, possibly composed of distributed vesicles. The precise form and dynamics of the organelle remain interesting open questions, ones that must await the technological development of better tools for molecular cell biology in this organism.
Holozoa-specific golgins reflect lineage-specific increases in trafficking complexity
Our comparative analyses identified a set of Golgi proteins that appear to have originated within Holozoa and which may reflect increased complexity of both vesicle traffic at the Golgi and connection to the cytoskeleton, relative to a pre-holozoan ancestor. N-terminal vesicle recognition motifs present in mammalian orthologues of GMAP210, TMF, and GRIP golgins, but absent outside of Holozoa, suggest a potential gain of tethering functions in these proteins relative to the ancestral sequences. Additionally, several of the proteins originating within Holozoa, for which functional information is available, have roles in tethering the Golgi to the cytoskeleton, including golgin-160 , GM130 , GCC185 , CG-NAP , and bicaudal-D . Cytoskeleton-dependent Golgi positioning along microtubules is important for cellular functions that are essential to metazoan multicellularity, including wound healing . This may explain the relatively recent origin of some of these factors. Despite animal-specific gains in complexity, other eukaryotes may also exhibit comparably complex Golgi. One possibility is that proteins, such as TbG63 as well as undiscovered Golgi proteins in other eukaryotic lineages, reflect parallel increases in complexity, which cannot be inferred by characterization of homologues of human Golgi proteins.
Conservation of golgins suggests differentiated Golgi compartments were present in the LECA
Counter to the intuitive idea that the ancient ancestor of eukaryotes was simple, molecular evolutionary reconstruction of the LECA has revealed a complement of cell biological machinery that is consistent with a highly complex cell. This applies not only to membrane-trafficking proteins but also to nuclear proteins, the cytoskeleton, mitochondria, and metabolism . The set of pan-eukaryotic Golgi-structural proteins that can be deemed as ancient, which we identify here, adds to this ancestral complexity. This has important implications for the complexity and organization of the Golgi in diverse eukaryotes and in the LECA. The presence of proteins such as p115 and ZFPL1 in non-metazoan eukaryotes raises important questions about Golgi function to be explored in those organisms, given that known binding partners of those proteins are metazoa specific. Evolutionarily, although homologues of p115, GMAP210, golgin-84, CASP, TMF, ZFPL1, and GRIP-containing golgins have been previously identified and localized in plant cells [37, 46, 47, 72], identification of homologues in the extensive taxonomic sampling used here confirms that these were present in the LECA for two reasons. First, it makes the possibility of lateral gene transfer even less likely. Second, identification of CASP, golgin-84, TMF, p115, and TM9SF3 in excavates (Naegleria gruberi in particular) provides evidence that they were present in the LECA regardless of uncertainty in the rooting of the eukaryotic tree [73, 74, 75].
With respect to established TGN compartments, the only inferred LECA golgin at the TGN is a GRIP domain-containing golgin, which acts to receive vesicles from endosomes. The presence of a GRIP domain in proteins across eukaryotic diversity, and the localization of these GRIP domain-containing proteins at the TGN in yeast, plants, and trypanosomes [52, 54, 72] suggests some conserved TGN function from the LECA. The previously identified conservation of Arl1 in eukaryotes, including the representatives of the Excavata, is consistent with conserved function of GRIP golgins . However, the lack of clear conservation of multiple TGN golgins suggests that vesicle traffic to the trans-Golgi in non-metazoan cells, and in the LECA, involves fewer specialized tethers and possibly fewer types of transport vesicles. This could also be reflective of the variation of TGN organelles across eukaryotes.
Previous reconstruction of trafficking pathways as present in the LECA, for example, via analysis of COPI, COPII, Retromer, and AP1,4 complexes, as well as Golgi-specific SNARE proteins [78, 79], had suggested potential differentiation of Golgi compartments to some degree. However, these did not indicate whether the ancestral Golgi was a single compartment with specialized domains or was composed of differentiated cisternae. The presence of at least eight ancient proteins implicated in Golgi structure at cis-Golgi, cisternal rims, or trans-Golgi/TGN, along with conservation of several functional motifs that mediate interactions with binding partners (e.g., Rab6, Arl1, Arf) also reconstructed as present in the LECA, shows that the LECA Golgi was much more complicated than it has been previously possible to infer (Fig. 5). Conservation of golgin-84 and TMF is particularly relevant, as they are specific to intra-Golgi vesicle traffic, which would arguably be unnecessary if Golgi cisternae were not differentiated.
Golgi stacking is likely an ancient, emergent property
Our analyses also speak to the cell biological question of how Golgi stacking takes place today which, despite its importance and apparent conservation of the stacked morphology of the organelle, remains a matter of significant debate . The predominant paradigm is that one or more Golgi-localized proteins are necessary for the morphology. Given the presence of Golgi stacking across eukaryotes, such a protein could well be predicted to be universal. However, it is not known which proteins, if any, may be necessary for a conserved pan-eukaryotic mechanism of stacking.
By contrast with this paradigm, other suggestions have been put forward to explain Golgi stacking as a morphological property based on several combined factors. This idea has most explicitly been laid out by the “cisternal adhesion” model of Lee et al. , whereby one or more proteins with adhesive functions have a stacking effect when present in sufficient quantities. Stacking could also involve regulation of membrane flux through the Golgi, with insufficient input or replenishment as compared to output, causing dissolution of stacks . A model of additive effects of redundant proteins or membrane flux is also consistent with the phenotypes observed in knockouts of retromer components that result in depleted retrograde trafficking from the endosomes to the TGN and fragmentation of the Golgi [81, 82]. The idea that properties of organelles, including Golgi stacking, are dependent on systems-level properties is gaining traction as a viable alternative to exclusively genetic explanations . We collectively denote these hypotheses as Golgi stacking being an emergent property. Overall, the question of how the hallmark morphology of the organelle is established and maintained remains open to debate.
Under the paradigm of a protein with a conserved necessary function in Golgi stacking, such a protein would likely be present in all genomes of organisms showing Golgi stacking, and likely absent from the genomes of those organisms without (i.e., the taxonomic distribution of stacking factors should match that of Golgi stacking). Such a pattern of presence directly correlating with function has been observed for protein complexes responsible for cristae formation in mitochondria , and this phylogenetic screening approach has successfully identified proteins involved in flagellar function [85, 86]. The evolutionary analyses performed here across 75 taxa with stacked Golgi and 12 without showed that none of the 27 putative stacking factors that we examined matched this pattern.
There are several caveats to our results. First, individual false positives, or false negatives, are always possible in comparative genomic analyses. Nonetheless, we have used the most accurate homology searching methods, examined datasets of alternate protein models for genomes when relevant and have manually curated the gene assignments. Second, it is conceivable that a universal and necessary stacking gene could exist that possesses multiple functions and so had lost the relevant Golgi function in organisms with unstacked Golgi. However, the fact that every candidate protein examined was apparently absent in multiple genomes of organisms that possess Golgi stacks renders this possibility incompatible with our observations. Finally, it is possible that an as-yet unreported, necessary stacking factor protein may exist, for which we did not search. Proteomics technology allowing distinction between the proteomes of organelles with similar densities, such as the plant ER and Golgi, and even the unique proteomes of organelle sub-compartments  may identify previously uncharacterized Golgi proteins that could be candidates for such a necessary stacking factor.
However, accepting these caveats, our results are inconsistent with the hypothesis that any one of the proteins participates in a pan-eukaryotic mechanism of Golgi stacking; this does not discount the importance of lineage-specific functions. Nonetheless, our data are most consistent with Golgi stacking being dependent on an additive, redundant function of non-homologous proteins, i.e., the emergent property hypotheses. An emergent property could rely on ancient redundant proteins, or could rely upon recently evolved, lineage-specific ones that replace ancient factors. With 14 recently evolved proteins identified within the Holozoa (Fig. 3), it is tempting to speculate that additional lineage-specific proteins are also present in other eukaryotes and may have stacking functions. The presence of a kinetoplastid-specific protein (TbG63) is consistent with this scenario, and searches for lineage-specific membrane-trafficking factors associated with clathrin-mediated endocytosis  and the sortilin system  have certainly been fruitful and illuminating. This will be exciting to pursue in order to understand the mechanisms of Golgi trafficking and stacking, particularly as more genetic and molecular biological tools become available for non-opisthokont model organisms.
Overall, our data do not rule out the existence of a widely conserved necessary stacking factor, but rather support the idea that Golgi stacking as an emergent property needs to be more extensively explored. This may well be the key to understanding one of the most prominent eukaryotic cellular features.
The cisternal stacking of the Golgi and the separation into cis-, medial- and trans-Golgi compartments is one of the most recognizable aspects of the eukaryotic cell. Our results have allowed insight into both the underlying cell biology and evolution of this prominent eukaryotic feature. At least 10 proteins implicated in Golgi structure have been reconstructed as ancient factors contributing to a differentiated Golgi organelle in the ancestor of eukaryotes over a billion years ago.
M. balamuthi strain (ATCC 30984) was maintained axenically in PYGC medium at 24 °C in 50 mL culture tissue flask . For immunofluorescence microscopy, M. balamuthi cells were fixed in 1% formaldehyde for 30 min, washed, and treated in 1% Triton TX-100 for 10 min. Fixed cells were stained using polyclonal rat anti COPI-β subunit, rabbit anti PDI, rabbit anti MDH  Abs, and monoclonal mouse α tubulin (Sigma) Ab. Alexa Fluor 488 (or 594) donkey anti rabbit, Alexa Fluor 594 (or 488) donkey anti rat, and Alexa Fluor 594 donkey anti mouse Abs (Life Technologies) were used as secondary antibodies. Structured illumination microscopy (SIM) was performed using a commercial 3D N-SIM microscope (inverted Nikon Eclipse Ti-E, Nikon) equipped with a Nikon CFI SR Apo TIRF objective (100× oil, NA 1.49). A structured illumination pattern projected into the sample plane was created on a diffraction grating block (100 EX V-R 3D-SIM) for laser wavelengths 488 and 561 nm. Excitation and emission light was separated by filter cubes with appropriate filter sets SIM488 (ex. 470–490, em. 500–545), and SIM561 (556–566, 570–640). Emission light was projected through a 2.5× relay lens onto the chip of an EM CCD camera (AndoriXon Ultra DU897, 10 MHz at 14-bit, 512 × 512 pixels). Three-color z-stacks (z-step: 120 nm) were acquired in NIS-Elements AR software (Laboratory Imaging). Laser intensity, EM gain, and camera exposure time were set independently for each excitation wavelength. The intensity of fluorescence signal was held within the linear range of the camera. Fifteen images (three rotations and five phase shifts) were recorded for every plane and color. SIM data were processed in NIS-Elements AR. Before sample measurement, the symmetry of point spread function was checked with 100 nm red fluorescent beads (580/605, carboxylate-modified microspheres, Life Technologies) mounted in Prolong Diamond Antiface Mountant (Life Technologies), and optimized by adjusting objective correction collar. The signal for 4,6-diamidine-2-phenylindole dihydrochloride (DAPI) was observed in wide-field mode.
Preparation of antibodies
To obtain complete and partial recombinant PDI and COPI-β proteins, respectively, the corresponding gene sequences were amplified by PCR (Primers: COPI-β forward: CATATGAAGAACCTCGAGCACAGG, COPI-β reverse: AAGCTTCGCGTCGGCCTTGA; PDI forward: CATATGAAGTGGCAGTACATCG, PDI reverse: AAGCTTGAGCTCCTTCTTCTCCCC) using M. balamuthi cDNA as template. The PCR products were subcloned into the pET42b+ vector (Novagen), and expressed with a 6xHis tag in Escherichia coli BL21 (DE3). The proteins were purified by affinity chromatography under denaturing conditions according to the manufacturer’s protocol (Qiagen) and used to immunize rats (COPI-β) or rabbits (PDI).
The genomic databases used for bioinformatics searches are listed in Additional file 10: Table S4. Of note, both the filtered and unfiltered gene model databases at JGI were searched (unfiltered datasets include any redundant gene models for the same gene loci). Additionally, the draft genome of M. balamuthi, produced as part of an ongoing project, was searched for conserved Golgi marker and putative stacking factor genes. The draft genome sequence is available at http://www.ebi.ac.uk/ena/data/view/CBKX00000000 (deposited January 22, 2015). The identified gene sequences are detailed and made available in Additional file 3: Table S2.
Basic Local Alignment Search Tool (BLAST 2.2.29+)  was used to search for homologues of proteins of interest in M. balamuthi-predicted proteins. A bidirectional best-hit criterion was applied with an E-value cut-off of 0.05 for both forward and reverse searches. Additionally, identified sequences were required to retrieve the original query in the reverse search with an E-value of at least two orders of magnitude lower than other sequences. Initial queries are either from the H. sapiens or S. cerevisiae genomes, or are from other eukaryotes as identified in previous studies [81, 93, 94, 95], and multiple queries were used.
For searches to identify orthologues of Golgi structure-associated proteins of interest, a multi-phase approach was taken. BLAST was run locally to search protein sequence databases from a large sampling of eukaryotes (Additional file 10: Table S4). To identify highly similar homologues, reciprocal best hit BLASTP searches were performed using H. sapiens query sequences and with the following criteria: E-value of 1 × 10–20 or lower for forward search, E-value of 0.05 or lower for reverse search, and a minimum E-value difference of two orders of magnitude, in the reverse BLAST results, between the hit(s) corresponding to the original query and the first negative hit.
HMMER 3.1b1 was then used to perform searches in the same protein sequence databases (http://hmmer.org) . For this, positive hits from BLAST searches were used to build initial Hidden Markov Models (HMMs). Sequences were aligned using MUSCLE v3.8.31  with default parameters. For these searches, the following criteria were applied to define positive hits: E-value of 1 × 10–10 or lower for forward (HMMer) search and E-value of 0.05 or lower for reverse (BLASTP) search. After each HMMer search, positive hits, if identified, were aligned and viewed manually before inclusion in HMMs for subsequent searches. This process was repeated until no more positive hits were identified. An exception to these methods was made in the case of the GRIP domain-containing proteins in taxa outside of Metazoa, which were identified using HMMs including only the subsequence of proteins corresponding to the GRIP domain, because no proteins with sequence similarity to individual human GRIP containing proteins outside the GRIP domain were identified outside metazoan taxa. In addition to the above methods, for these non-metazoan GRIP golgins, due to the short length and high sequence conservation of the GRIP domain, a bit score of 25 was used as a cutoff to identify positive hits, and criteria based on reverse search results were not applied. Results of the final searches, including accessions and E-values, are summarized in Additional file 6: Table S3. Alignments used for constructing HMMs are found in Additional file 9.
Finally, false negatives could be due to the divergence of a candidate from the experimentally validated H. sapiens query. In order to mitigate this possibility, HMMer searches were repeated with the same E-value cutoffs, but using protein databases of different taxa for reciprocal BLAST analysis. These taxa were selected from those taxa for which positive hits were validated in the previous HMMer searches, and which are included in the same supergroup as the taxa queried. For example, a CASP orthologue was identified in Neospora caninum using the closely related taxon Toxoplasma gondii for reverse BLAST searches, but not using H. sapiens (Additional file 6: Table S3). Additionally, BLAST was used to search nucleotide scaffold sequences in the case of one protein of interest (Sec16) in Pichia pastoris because it could not be found in the protein sequence database for this organism, and the protein database for the very closely related yeast Komagataella phaffii (which does contain a Sec16 sequence) was also included in the analyses.
For phylogenetic analyses, sequences were aligned using MUSCLE v3.8.31  with default parameters, and manually trimmed to retain only regions of clear homology. Alignments used for phylogenetic analyses are found in Additional file 11 and Additional file 12. RAxML version 8.2.8  was used for maximum likelihood analysis. For RAxML analyses, the PROTGAMMALG4X model was used, and 100 non-parametric bootstraps were performed using the default faster hill climbing method (–f b, –b, –N 100). MrBayes version 3.2.6  was used for Bayesian analysis. For MrBayes analyses, over four million Markov chain Monte Carlo generations were run under the Mixed model with a burnin of 25% to average standard deviations of splits frequencies of 0.01 or lower, indicating convergence. Both RAxML and MrBayes analyses were run using the CIPRES webservice . In the case of the GRASP proteins, several consecutive analyses were required with removal of divergent sequences to resolve phylogenetic relationships.
The authors would like to thank Christen M. Klinger for collaboration on informatics workflows used for running homology searches and all members of the Dacks lab, past and present, for helpful discussion. We also want to thank Drs Aaron Turkewitz, Paul Melançon, Alan Warren and Frances Brodsky for helpful discussion.
LDB is supported by a Postgraduate Scholarship-Doctoral from the Natural Sciences and Engineering Research Council of Canada (NSERC). MA was supported by an Alberta Innovates Technology Futures Postdoctoral Fellowship. Work in the Dacks lab is supported by NSERC Discovery grant (RES0021028 ) and JBD is the Canada Research Chair (Tier II) in Evolutionary Cell Biology. JT is supported by Czech Science Foundation (16-06123S), BIOCEV (CZ.1.05/1.1.00/02.0109), and LQ1604 NPU II provided by MEYS CR. We acknowledge the Imaging Methods Core Facility at BIOCEV supported by the Czech-BioImaging RI project (LM2015062 funded by MEYS CR).
Availability of data and materials
All data generated or analyzed during this study are included in this published article and its supplementary information files.
EN and JT designed, performed and interpreted the experimental work on M. balamuthi, and also wrote the relevant manuscript sections. MA and LDB performed genomic analysis of M. balamuthi. LDB performed all molecular evolutionary analyses. LDB and JBD designed the molecular evolutionary analyses, interpreted the results, and wrote the manuscript. All authors edited the manuscript and approved of its content prior to publication.
Ethics approval and consent to participate
Consent for publication
The authors declare that they have no competing interests.
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
- 5.Witkos TM, Lowe M. The Golgin family of coiled-coil tethering proteins. Front Cell Dev Biol Front Cell Dev Biol. 2016;1:863389–6.Google Scholar
- 6.Gillingham AK. At the ends of their tethers! How coiled-coil proteins capture vesicles at the Golgi. Biochem Soc Trans. 2017. https://doi.org/10.1042/BST20170188.
- 11.Eme L, Sharpe SC, Brown MW, Roger AJ. On the age of eukaryotes: evaluating evidence from fossils and molecular clocks. Cold Spring Harb Lab Press. 2014;6(8).Google Scholar
- 20.Marti M, Li Y, Schraner EM, Wild P, Köhler P, Hehl AB. The secretory apparatus of an ancient eukaryote: protein sorting to separate export pathways occurs before formation of transient Golgi-like compartments. Mol Biol Cell. 2003;14:1433–47. https://doi.org/10.1091/mbc.E02-08-0467.CrossRefPubMedPubMedCentralGoogle Scholar
- 22.Ghosh SK, Field J, Frisardi M, Rosenthal B, Mai Z, Rogers R, et al. Chitinase secretion by encysting Entamoeba invadens and transfected Entamoeba histolytica trophozoites: Localization of secretory vesicles, endoplasmic reticulum, and Golgi apparatus. Infect Immun. 1999;67:3073–81.PubMedPubMedCentralGoogle Scholar
- 25.Dacks JB, Davis LAM, Sjögren ÅM, Andersson JO, Roger AJ, Doolittle WF. Evidence for Golgi bodies in proposed “Golgi-lacking” lineages. Proc R Soc B Biol Sci. 2003;270(SUPPL):2.Google Scholar
- 75.Burki F, Kaplan M, Tikhonenkov DV, Zlatogursky V, Minh BQ, Radaykina LV, et al. Untangling the early diversification of eukaryotes: a phylogenomic study of the evolutionary origins of Centrohelida, Haptophyta and Cryptista. Proc R Soc B Biol Sci. 2016;283:20152802. https://doi.org/10.1098/rspb.2015.2802.CrossRefGoogle Scholar
- 83.Mani S, Thattai M. Stacking the odds for Golgi cisternal maturation. Elife. 2016;5. https://doi.org/10.7554/eLife.16231.
- 87.Parsons HT, Lilley KS. Mass spectrometry approaches to study plant endomembrane trafficking. Semin Cell Dev Biol. 2017. https://doi.org/10.1016/j.semcdb.2017.10.014.
- 91.Nývltová E, Šuták R, Harant K, Šedinová M, Hrdy I, Paces J, et al. NIF-type iron-sulfur cluster assembly system is duplicated and distributed in the mitochondria and cytosol of Mastigamoeba balamuthi. Proc Natl Acad Sci U S A. 2013;110:7371–6. https://doi.org/10.1073/pnas.1219590110.CrossRefPubMedPubMedCentralGoogle Scholar
Open AccessThis article is distributed under the terms of the Creative Commons Attribution 4.0 International License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated.