An iterative computational design approach to increase the thermal endurance of a mesophilic enzyme
Strategies for maximizing the microbial production of bio-based chemicals and fuels include eliminating branched points to streamline metabolic pathways. While this is often achieved by removing key enzymes, the introduction of nonnative enzymes can provide metabolic shortcuts, bypassing branched points to decrease the production of undesired side-products. Pyruvate decarboxylase (PDC) can provide such a shortcut in industrially promising thermophilic organisms; yet to date, this enzyme has not been found in any thermophilic organism. Incorporating nonnative enzymes into host organisms can be challenging in cases such as this, where the enzyme has evolved in a very different environment from that of the host.
In this study, we use computational protein design to engineer the Zymomonas mobilis PDC to resist thermal denaturation at the growth temperature of a thermophilic host. We generate thirteen PDC variants using the Rosetta protein design software. We measure thermal stability of the wild-type PDC and PDC variants using circular dichroism. We then measure and compare enzyme endurance for wild-type PDC with the PDC variants at an elevated temperature of 60 °C (thermal endurance) using differential interference contrast imaging.
We find that increases in melting temperature (Tm) do not directly correlate with increases in thermal endurance at 60 °C. We also do not find evidence that any individual mutation or design approach is the major contributor to the most thermostable PDC variant. Rather, remarkable cooperativity among sixteen thermostabilizing mutations is key to rationally designing a PDC with significantly enhanced thermal endurance. These results suggest a generalizable iterative computational protein design approach to improve thermal stability and endurance of target enzymes.
KeywordsComputational protein design Biofuels Thermal stability Pyruvate decarboxylase
Advanced biofuels can meet up to three billion barrels of liquid fuel demand on a yearly basis, but face high costs and process complexity as obstacles to commercial implementation [1, 2]. Consolidated bioprocessing (CBP) offers a way to address both cost and process complexity by combining cellulase production, enzymatic digestion, and fermentation into a one-pot reaction . The challenge of CBP is to engineer a single microbe that has the capability of deconstructing cellulosic biomass and fermenting the released sugars to biofuels or bio-based chemicals at sufficiently low cost. Promising CBP microbes are able to effectively deconstruct plant biomass to elemental sugars [4, 5, 6], but still require optimization to increase the fermentation of those sugars to products such as bioethanol [7, 8, 9].
Metabolic pathway engineering of CBP microbes has focused on achieving chemical outputs near theoretical yields. Streamlining metabolic pathways by removing enzymes at branched points is a common approach to increase chemical yields. Alternatively, branch points can be bypassed entirely through the introduction of nonnative enzymes. Introducing a nonnative enzyme can be difficult, however, if the enzyme has not evolved to function in an environment similar to that of the host organism. In these cases, the hope is that protein engineering can be used to rationally engineer enzymes, increasing the range of tools available for metabolic pathway engineering.
Incorporation of the Z. mobilis PDC and alcohol dehydrogenase (ADH) into the cellulolytic mesophile, Clostridium cellulolyticum, resulted in a 53% increase in the production of ethanol and a 48% decrease in the production of lactate . Clostridium thermocellum is a highly cellulolytic thermophile that has emerged as one of the most promising CBP microbes . To date, however, PDC has not been identified in any thermophilic organism, challenging the incorporation of PDC into microbes such as C. thermocellum. The potential for using enzymes like PDC to increase yields of renewable fuels and chemicals demonstrates the value of combining computational protein design and metabolic engineering .
Protein design tools enhance our ability to manipulate the physicochemical properties of target enzymes. While a majority of randomly selected mutations are deleterious, substitutions identified using computational approaches have a much higher rate of success. In fact, previous computational protein design results demonstrate that multiple independent mutations can be combined to efficiently enhance the thermostability of a target protein without having to screen each substitution individually [14, 15]. Although there are examples of proteins that have been made more thermostable by computational protein design methods, many previous examples have been smaller monomeric proteins [16, 17]. Engineering a larger tetrameric protein like PDC poses additional design challenges. For example, large, multidomain proteins likely have complex folding trajectories and therefor may not have energy minima corresponding to single stable conformations . Thus, larger proteins may require stabilization in more than one structural region in order to observe significant increases in thermal stability.
Engineering an enzyme to function in a thermophilic organism requires increasing the thermal endurance of the enzyme at the growth temperature of the host and may not directly correlate with increases in melting temperature. C. thermocellum, for example, grows at 60 °C , which is well above the growth temperature of Z. mobilis (maximum growth temperature 42 °C)  but just below the melting temperature (Tm) of the Z. mobilis PDC (62 °C) (Additional file 1: Figure S1A). Changes in ellipticity at 60 °C indicate PDC is within a structurally unstable temperature. When seeking to engineer a mesophilic enzyme for use in a thermophilic host organism it is unknown what the target melting temperature should be to achieve an extended enzyme lifetime, or thermal endurance, at the target temperature.
Here we engineer PDC enzymes with increased thermal stability using the Rosetta protein modeling software . We screen the PDC designs using circular dichroism to identify designs with increased resistance to thermal denaturation. We then monitor the lifespan of the enzymes at 60 °C using phase-contract microscopy and find that while we are able to engineer enzymes with significantly longer life-spans, increases in melting temperature do not correlate to increases in endurance at elevated temperature.
In silico design
Single substitutions were identified with in silico site saturation mutagenesis, using the Rosetta protein modeling software . The in silico experiments used either a fixed-backbone approximation, where the backbone coordinates from the X-ray crystal structure were held fixed in the simulation, or a minimization of backbone and side-chain torsion angles prior to evaluating amino acid identities at each position.
Previous work comparing thermophilic and mesophilic enzymes showed that the backbone of structurally equivalent clusters can accommodate alternate sequence combinations by adjusting an average of 3.5 Å for each Cα atom . Here, clusters of interacting residues were determined using a distance cutoff, where residues were considered interacting when any side-chain heavy-atoms were within 4 Å. Two clusters were identified for design targets based on poor side-chain packing, G6, I97, I166, A171 and H21, M42, Q44, Y46. These two clusters were designed by iterating between sequence design and minimization of backbone and side-chain torsion angles.
The PDC dimer displays symmetry at the interface resulting in a design constraint where any mutation made to one monomer must be mirrored in the bound partner. The symmetric design protocol simultaneously models the effects of an amino acid substitution at both positions at the interface .
Engineering a protein surface to have a high net charge can increase solubility and enhance thermal stability [24, 26]. Designing the PDC complex on the hypothesis that supercharging the surface can stabilize the interactions between domains, we identified substitutions to impart a high net charge while modeling the energetic consequences of the substitutions.
We selected PDC1.01 and PDC1.10 as the foundations for the second round of protein design based on the observed enhancements in thermal stability. We identified four additional mutations by selecting the next set of substitutions that would remove glycine residues (G491A, G515A, G516A, and G540A; Additional file 1: Table S1). We targeted the identification of additional glycine-removing mutations due to the successful thermal stabilization of PDC1.01, which contained two such mutations. These cases where a substitution removed a glycine residue underwent additional evaluation. Glycine, which has only a hydrogen atom at its side-chain position, can accommodate phi–psi backbone angles not accessible to any other natural amino acid. For this reason, any in silico mutations replacing a glycine residue underwent a filter based on Rosetta energy scores. The Rosetta energy term, p_aa_pp, gives the probability of finding an amino acid at given phi and psi dihedral angles. In silico mutations that removed a glycine residue were only considered if the p_aa_pp energy score was below 0.2. Additionally, glycine is the smallest amino acid. Mutations that passed the above filter were then sorted by the Lennard–Jones repulsive energy term (LJrep) to ensure significant atomic clashes were not present that might indicate the protein region could not accommodate the substitution (Additional file 1: Table S1). Glycine residues within 6 Å of the dimer active sites were not considered for redesign. We additionally avoided mutations introducing cysteine residues, as cysteine residues can participate in redox reactions .
A solvent exposed mutation in PDC1.01, A189K, was identified as a potentially beneficial single mutation in an in silico point mutation scan, but places a positively charged residue at the PDC surface and thus does not fit the goal of the negatively supercharged surface for PDC1.10. This mutation was subsequently removed in PDC2.02. A version of PDC2.02 that included the A189K mutation was characterized and evaluated using CD to confirm the mutation does not alter thermal stability based on observed changes in molar ellipticity (Additional file 1: Figure S3).
Construction, cloning, and expression of PDC variants
DNA sequences for each PDC variant were codon optimized and cloned into a pET22b(+) vector (GenScript, Piscataway, NJ). The sequence for a hexahistidine tag was placed at the C terminus of the constructs. PDC protein variants were expressed either for 4 h at 37 °C or overnight at 25 °C with 0.1 mM IPTG in the BL21 (DE3) strain of Escherichia coli. Proteins were concentrated using Vivaspin spin columns with a molecular weight cutoff of 10,000 Da (GE Healthcare Life Sciences, Pittsburgh).
Protein purification. The frozen cell pellets were thawed at room temperature with equal volume of buffer A (50 mM Tris pH 7.5, 100 mM NaCl, 10 mM imidazole, 0.1 mM thiamine pyrophosphate (TPP), 0.5 mM dithiothreitol (DTT) and 1 mM MgCl2) and lysed with lysozyme and sonication. One mg/mL lysozyme (Hampton Research, Aliso Viejo, CA), 1.0 U/mL Pierce Universal Nuclease (Thermo Scientific, Rockford, IL) and EDTA-free protease inhibitor (Thermo Scientific, Rockford, IL) according to manufacturer instructions were added in the lysis mixture and incubated for 30 min at room temperature with occasional vortexing. Sonication was done at room temperature for 2 min using a Branson 5510 water bath sonicator (Branson Ultrasonics Corporation, Danbury, CT). Cell debris was removed by centrifugation at 15,000×g for 15 min. The supernatant was loaded onto an eight mL HisPur Cobalt column (Thermo Scientific, Rockford, IL) using an Akta FPLC system (GE Life Sciences, Piscataway, NJ) with buffer A (50 mM Tris pH 7.5, 100 mM NaCl, 20 mM imidazole, 0.1 mM TPP, 0.5 mM DTT, and 1 mM MgCl2). After loading and washing the unbound proteins from the column, PDC samples were eluted using 100% of Buffer B (50 mM Tris pH 7.5, 100 mM NaCl, 250 mM imidazole, 0.1 mM TPP, 0.5 mM DTT, and 1 mM MgCl2). Final purification was performed by size-exclusion chromatography using a HiLoad Superdex 200 (26/60) column (GE Healthcare, Piscataway, New Jersey, USA) in buffer C (20 mM Tris pH 7.5, 100 mM NaCl, 0.1 mM TPP, 0.5 mM DTT, and 1 mM MgCl2).
Differential scanning calorimetry
Protein samples were measured over a temperature range of 10–90 °C and at a scan rate of 60 °C/h. No feedback mode was used for each DSC experiment. Buffer baseline scans were established before each protein run by loading both the sample and reference cells with buffer and performing 3 or more heating and cooling cycles until the deviation in scans was less than 0.01 mcal/min. Buffer was then removed from the sample cell and replaced with the protein sample during the temperature range of 25–15 °C during the cool down cycle. All protein samples were diluted to 0.2 mg/mL in 20 mM TRIS pH 7.5, 100 mM MnCl, 1 mM MgCl2, 0.5 mM DTT, 0.1 mM TPP buffer and run at that concentration in order to minimize post-unfolding aggregation.
DSC data on each sample was analyzed by the Origin 7.0 software  coupled to a DSC data analysis module provided with the Microcal VP-DSC. Sample scans were buffer subtracted, normalized to molar concentration and corrected by baseline correction options in the DSC data analysis module prior to least squares analysis using a non-two-state model option. The fitted sample curves produced the melting temperature (Tm), enthalpy of unfolding (∆Hcal), and van’t Hoff enthalpy (∆HvH) for each transition.
Circular dichroism spectroscopy and thermal melts
Circular dichroism (CD) spectra and thermal melt measurements were performed on a Chirascan-plus spectrometer (Leatherhead, Surrey, UK) using a 0.5-mm path-length quartz cuvette. WT and mutant PDC proteins were buffer exchanged into a potassium phosphate buffer (10 mM potassium phosphate, 100 mM NaCl, 1 mM MgCl2, 0.1 mM TPP, 0.5 mM DTT, pH 7.5) and prepared to a final concentration of 0.2 mg/mL. CD spectra were measured at the stated temperatures with a step size of 0.25 nm. The thermal melts were performed in continuous ramp mode at a rate of 2 °C/min while measuring CD at 222 nm. The Tm of each protein was determined from the first derivative of the thermal melt curves using Prism 6 (GraphPad, La Jolla, CA). Each thermal melt experiment was performed in triplicate.
ThermoFluor high-throughput protein stability assay
The high-throughput ThermoFluor assay was used to evaluate the effects of the glycine-to-alanine substitutions G491A, G515A, G516A, and G540A. For the ThermoFluor assay, a hydrophobic dye, SYPRO Orange, binds to exposed hydrophobic regions of the protein. As proteins begin denaturation upon heating, increasing amounts of hydrophobic regions are exposed resulting in an increased signal. The assay is performed in an RT-PCR machine using 96-well plates, allowing the simultaneous characterization of many protein variants. This assay has been used to rapidly screen protein mutants generated from computational design approaches [29, 30], and thus allowed us to evaluate the threshold for selecting these mutations for combinatorial libraries.
SYPRO® Orange Protein Gel Stain, supplied at 5000× concentrate in dimethyl sulfoxide (Thermo Scientific, Waltham, MA), was diluted to 200× in buffer (20 mM Tris pH 7.5, 100 mM NaCl, 0.1 mM TPP, 0.5 mM DTT, and 1 mM MgCl2). Wild-type PDC and variants were diluted to 5 μM in buffer, with protein concentrations determined by measuring absorbance at 280 nm. Extinction coefficients were calculated using the method described by Gill and von Hipple . 50 μL samples were made by combining 45 μL of protein with 5 μL of 200× SYPRO® Orange stain. Protein variants were measured in triplicate, with 50 μL samples placed in Hard-Shell® 96-well PCR plates with clear wells (BioRad, Hercules, CA). Plates were covered with MicroAmp® optical adhesive film (Thermo Scientific, Rockford, IL) to prevent sample evaporation. Spectra were obtained on a BioRad Real-Time C1000 Touch Thermal Cycler (BioRad, Hercules, CA). Thermal denaturations were done by increasing temperature from 25 to 95 °C at a rate of 1 °C/min, taking a plate read every minute using the FRET scan mode. Circular dichroism and ThermoFluor spectra were generated using IGOR Pro (WaveMetrics Inc., Lake Oswego, OR).
PDC activity assay
PDC activity was measured using an assay where decarboxylation is coupled with alcohol dehydrogenase , and the conversion of NADH to NAD+ by the alcohol dehydrogenase was monitored for 5 min with a Varian Cary 400 (Agilent Technologies, Santa Clara, CA) temperature-controlled spectrophotometer at 25 °C. Four cuvettes, a blank and a triplicate of one sample of interest, were prepared and measured at a time. Each cuvette contained one mL of reaction mix and 20 µL of PDC with a suitable dilution in buffer D (20 mM BIS–Tris pH 6.5, 100 mM NaCl, 1 mM MgCl2, 0.1 mM TPP, 0.5 mM DTT), mixed using a 1 mL pipette (all samples in triplicate). Reaction mix contained 20 mM Na pyruvate (Fisher Scientific, Fair Lawn, New Jersey), 0.288 mM NADH (Sigma Chemical CO, St. Louis, MO) and 40 U yeast alcohol dehydrogenase (MP Biomedicals LLC, Solon, OH) in buffer D. Blank contained only the reaction mix. The Protein concentration was determined using the Bradford protein reagent with bovine serum albumin as the standard (BioRad, Hercules, CA).
PDC2.03 crystals were initially obtained with sitting drop vapor diffusion using a 96-well plate with Grid Screen Salt HT from Hampton Research (Aliso Viejo, CA). Fifty microliter of well solution was added to the reservoir, and drops were made with 0.2 µL of well solution and 0.2 µL of protein solution using a Phoenix crystallization robot (Art Robbins Instruments, Sunnyvale, CA). The crystals were grown in 0.1 M MES monohydrate pH 6.0 and 2.4 M ammonium sulfate at 20 °C. The protein solutions contained 6 mg/mL of protein in 20 mM Tris pH 7.5, 100 mM NaCl, 1 mM MgCl2, 0.5 mM DTT, and 0.1 mM TPP.
Data collection and processing
The PDC2.03 crystals were flash frozen in a nitrogen gas stream at 100 K before home source data collection using an in-house Bruker X8 MicroStar X-ray generator with Helios mirrors and Bruker Platinum 135 CCD detector. Data were indexed and processed with the Bruker Suite of programs version 2014.9 (Bruker AXS, Madison, WI).
Structure solution and refinement
Intensities were converted into structure factors, and 5% of the reflections were flagged for Rfree calculations using programs F2MTZ, Truncate, CAD, and Unique from the CCP4 package of programs . The program MOLREP  version 11.4.06 was used for molecular replacement using wild-type PDC (PDB code 2WVG ) as the search model. Refinement and manual correction was performed using REFMAC5  version 5.8.0155, and Coot  version 0.8.6. The MOLPROBITY method  was used to analyze the Ramachandran plot, and root-mean-square deviations (RMSD) of bond lengths and angles were calculated from ideal values of Engh and Huber stereo chemical parameters 47 . Wilson B-factor was calculated using CTRUNCATE version 1.15.10 . The data collection and refinement statistics are shown in Additional file 1: Table S2.
Programs Coot 45, PyMOL (http://www.pymol.org) and ICM (http://www.molsoft.com) were used for comparing and analyzing structures. Figures 2a, b, 5 were created using PyMOL. The root-mean-square deviation (RMSD) between the monomeric unit of PDC and PDC2.03 was computed using PyMol (The PyMOL MolecularGraphics System, Version 184.108.40.206 Schrödinger, LLC.)
Pyruvate decarboxylase (PDC) protein preps were diluted in buffer to 1 mg/mL concentration. 7.5 µL of protein solution was placed between two glass coverslips separated by a 0.15 mm deep SecureSeal imaging spacer (Grace Bio-labs, Bend, OR). Heating was controlled using a Linkam FTIR600 temperature-controlled microscope stage (Linkam Scientific Instruments, UK) and heated from 24 °C to either 60 or 120 °C at a ramp rate of 1 °C/min. The optics were set up for bright field differential interference contrast imaging on a NikonE800 microscope (Nikon, Tokyo, Japan), using a 20× 0.75 NA PlanApo ELWD objective. Images were captured every 30 s over the 3 h using a SPOT RTKE CCD camera (Diagnostic Instruments, Sterling Heights, MI) as TIFF stacks. TIFF stacks were analyzed using FIJI (ImageJ).
Computational design of a thermostable PDC
PDC self-associates to form a 260 kDa dimer of dimers, with two active sites buried at each dimer interface (Fig. 2a) . A thermostable and active PDC will likely require stabilization of the monomer to resist denaturation at high temperatures and also stabilization of the interactions between the monomers to maintain the active complex. In addition, the monomeric unit of PDC is composed of three distinct structural regions (Fig. 2b). Our protein-engineering goal was to identify thermostabilizing mutations located in each structural region to address the distinct components and interactions of the PDC complex.
Designing thermostabilizing mutations for the PDC homotetrameric complex
Activity at 25 °C
G224A, V374I, G540A
F55W, G224A, V374I
F55W, G224A, V374I, G540A
G6A, I97V, I166F, A171F
H21I, M42F, Q44V, Y46F
A77V, L78F, S79M, A80M, A83V, I84V, G85V, G86M, M127V, A128M, Y163F
A77V, L78F, S79M, A80M, D82V, A83V, I84V, G85V, G86M, M127V, A128M
S2R, N41K, A207K, Q333K, A357K, T379R, N402K, V524K, A527K, A557K
S2D, L38D, A189K, A207E, Q333E, A357D, A376D, A519E, A527D, K553E
G109A, G224A, V374I, G491A, G515A, G516A, G540A
S2D, L38D, A207E, G224A, Q333E, A357D, V374I, A376D, A519E, A527D, G540A, K553E
S2D, L38D, G109A, A207E, G224A, Q333E, A357D, V374I, A376D, G491A, G515A, G516A, A519E, A527D, K553E, G540A
The mutations display features associated with increased thermal stability, including increased buried hydrophobic surface area , enhanced quality of side-chain packing , increased surface charge of the tetrameric complex , decreased conformational flexibility and improved packing at the symmetric dimer interface . A summary of the designs is shown in Table 1. Our approach utilizes an iterative workflow, where the most promising designs are selected after experimental characterization, then combined, and/or further enhanced with additional mutations (Fig. 2c). The iterative computational design approach is key to identifying thermostabilizing mutations that work together to stabilize any structurally weak regions.
Identification of PDC designs with enhanced thermostability
To evaluate whether the PDC designs resulted in an enhancement in thermal stability, genes encoding wild-type PDC and variants listed in Table 1 were expressed in Escherichia coli. Enzymatic activity for all PDC enzymes was determined, with all successfully expressed PDC enzymes showing similar levels of activity. Pyruvate decarboxylation is measured by coupling PDC to an NADH-dependent alcohol dehydrogenase (ADH). The oxidation of NADH by ADH upon the conversion of acetaldehyde to ethanol results in a colorimetric change. NADH is not stable at elevated temperatures, thus activity was determined at 25 °C . Any designs that did not produce a detectable level of soluble protein in the first attempt were discarded, and activity for these designs was labeled “ND”, or not determined, in Table 1.
Differential scanning calorimetry (DSC) was initially used to evaluate the thermal stability of the PDC variants. DSC scans displayed multiple peaks, possibly due to the multimeric attribute of PDC, challenging the comparison of some PDC variants (Additional file 1: Figure S2). DSC results served to eliminate a few underperforming designs (PDC1.02, PDC1.04, PDC1.03, PDC1.07), but we instead relied on circular dichroism (CD) to characterize the remaining variants. PDC was shown to have irreversible unfolding, thus the melting temperatures reported here for PDC variants indicate relative thermal stabilities.
Three tiers of design were attempted for the second round of PDC engineering. First, PDC1.01 was further improved by the addition of four substitutions, generating PDC2.01. Second, PDC1.01 and PDC1.10 were combined to evaluate the additivity of these two successful designs, generating PDC2.02. In the third design, the same four substitutions were combined with PDC1.01 and PDC1.10, generating PDC2.03. The third approach was the most thorough test of both the accuracy of our computational predictions and the combinability of the designs.
The thermal stability of PDC2.01, containing the four substitutions and the parent design PDC1.01, was evaluated using the high-throughput ThermoFluor assay (Additional file 1: Figure S1B, A, respectively). PDC2.01 exhibited a melting temperature nearly identical to PDC1.01, displaying no improvement in thermal stability. Nearly all of the pairwise combinations displayed a Tm within one or two degrees of the parent PDC1.01. The most significant increase in Tm was seen with the addition of G491A and G516A, resulting in an increase of 4 °C.
PDC2.02 and PDC2.03 were evaluated by measuring molar ellipticity from 195 to 260 nm, as done with the parent design, PDC1.10 (Fig. 3c). PDC2.02 resists thermal denaturation, as seen with PDC1.01. PDC2.02 also loses some secondary structure at 60 °C, as seen with PDC1.01. Thus, while PDC2.02 does not fully denature up to 90 °C, it also does not display an apparent increase in thermal stability compared to PDC1.01 based on changes in molar ellipticity.
A significant increase in thermal stability is observed with PDC2.03, which combines PDC1.01, PDC1.10 and the additional four substitutions. PDC2.03 does not show loss of secondary structure until 90 °C, well above the target growth temperature of 60 °C, at which point small changes in molar ellipticity are observed (Fig. 3d). Importantly, PDC2.03 differs from PDC2.02 by only four substitutions. The combined ThermoFluor and CD results indicate, therefore, the same four mutations have no apparent thermostabilizing effect in the context of PDC2.01 while significantly increasing thermal stability in the context of PDC2.03.
Measuring the structural endurance of PDC variants
The aim of this work was to develop an iterative computational protein design protocol to rationally engineer a mesophilic enzyme, PDC from Z. mobilis, to function like a thermophilic enzyme. The maximum growth temperature for Z. mobilis is 42 °C , well below the optimum growth temperature of 60 °C for C. thermocellum . While our target temperature of 60 °C is below the Tm of wild-type PDC (Additional file 1: Figure S1), where half the protein has been thermally inactivated, changes in ellipticity indicate PDC is at or approaching a structurally unstable temperature.
Seeking an approach capable of making time-resolved measurements of PDC variants at the target temperature of 60 °C, we evaluated variants using differential interference contrast (DIC) microscopy during a heat and hold experiment. This approach is similar to optical density and static light scattering experiments, where blockage or scattering of light indicates changes in the size of protein species, generally due to aggregation [42, 43]. In the approach used here to evaluate the structural endurance of PDC variants, we combined measured changes in birefringence with precise temperature control to evaluate protein aggregation upon thermal denaturation.
Wild-type PDC entered phase 2 immediately upon reaching 60 °C, remaining in phase 2 for approximately 25 min before reaching phase 3. PDC1.01, which exhibited a four degree increase in Tm based on ellipticity measurements, showed no improvement over wild-type PDC when held at 60 °C. PDC1.10, which resisted denaturation and precipitation in CD experiments to at least 90 °C, similarly showed little to no improvement over wild-type PDC when held at 60 °C. PDC1.10 results suggest that the loss of molar ellipticity observed at 60 °C in CD experiments indicates a structural instability and does not represent a stable conformation.
All second round designs (PDC2.01, PDC2.02 and PDC2.03) outperform the first round designs, (PDC1.01 and PDC1.10). PDC2.01 enters phase 2 within minutes after reaching 60 °C but remains in phase 2 for approximately 35 additional minutes before entering phase 3. PDC2.02 and PDC1.10, which displayed nearly identical changes in molar ellipticity, have markedly different behavior when monitored at 60 °C. PDC2.02 remains in phase 1 for 27 min at 60 °C before entering phase 2. PDC2.02 then lasts another 90 min before entering phase 3, outperforming the parent designs PDC1.01 and PDC1.10.
PDC2.03, which displayed no loss of molar ellipticity until 90 °C when monitored by CD, lasted 60 min at 60 °C before entering phase 2, and never entered phase 3 for the duration of the experiment. Thus, the four additional mutations (G491A, G515A, G516A, and G540A) displayed manifestly different stabilizing effects depending on whether they were evaluated in the context of design PDC2.01 or PDC2.03.
Structural evaluation of PDC2.03
To determine whether PDC2.03 retained the backbone conformation from the design runs, we elucidated the X-ray crystal structure of the PDC variant (PDB code 5TMA). The structure of PDC2.03 was refined to a resolution of 1.67 Å with R and Rfree of 0.170 and 0.203, respectively. There are two molecules in the asymmetric unit in complex with TPP and magnesium. Each monomer has three domains with an open α/β topology  with several ethylene glycol and sulfate molecules on the surface.
The backbone root-mean-square deviation (RMSD) between chain A of the wild-type PDB and PDC2.03 is 0.19 Å (Fig. 5c). The largest conformational difference, located in the upper left of Fig. 5c, is located at a crystal contact point and distant from both the active sites and any of the 16 mutations. Thus, the mutations impart significant enhancement in thermal stability without unintended structural alterations.
In this study, we used iterative computational protein design to rationally engineer the mesophilic enzyme, pyruvate decarboxylase, to exhibit enhanced thermal endurance at the growth temperature of a thermophilic organism. The target temperature of 60 °C is below the Tm of the wild-type PDC, at 62 °C, but is in a thermally unstable region as determined by changes in ellipticity (Additional file 1: Figure S1A). We identified a series of thermostabilizing mutations, increasing the Tm of PDC by 4 °C in one case (PDC1.01) and generating a variant that resists thermal denaturation to at least 90 °C (PDC1.10) in another. While increases in structural endurance at elevated temperature are not achieved by any single design, the combination of designs resulted in a PDC variant with a large increase in thermal endurance at 60 °C.
We find, however, that increases in Tm do not directly correlate with enhanced thermal endurance. Designs PDC1.10 and PDC2.02, for example, exhibit no observable difference in thermal stability based on measured changes in molar ellipticity, but display strikingly different thermal endurance; PDC2.02 survives approximately twice as long at 60 °C compared to PDC1.10. Similarly, PDC1.10 shows significant thermal stabilization compared to wild-type PDC, resisting thermal denaturation to at least 90 °C, yet survives only minutes longer at 60 °C.
The explanation for these observed incongruities may have to do with cooperativity between the mutations. Fersht introduced the concept of a double-mutant cycle to determine whether two residues are energetically coupled, either through a direct or indirect interaction . Based on this analysis, two residues are coupled if the change in free energy for the double mutant is not equal to the sum of the changes in free energy for each mutation individually. This concept has been expanded to include changes in free energy associated with structural or functional properties upon a double mutation . Here we have generated PDC variants which all included multiple mutations. However, the energetic nonadditivity expected between mutations of coupled residues is clearly observed when comparing the thermal endurance of PDC variants (Fig. 4). PDC2.02, which resulted from the combination of PDC1.01 and PDC1.10, displays an increase in structural endurance that far exceeds what would be achieved if the mutations were additive. Similarly, PDC2.03 survives much longer at 60 °C than would be expected if the residues in the parent designs, PDC1.10 and PDC2.01, were not energetically coupled. None of the sixteen mutations that comprise PDC2.03 are directly interacting and are generally structurally removed from one another. Thus, the observed energetic coupling is likely due to structural stabilizations in interacting regions of the PDC.
Further, the glycine-to-alanine mutations (G109A, G491A, G515A, and G516A) included in PDC2.01 and PDC2.03 display context-dependent behavior. These four mutations do not increase thermal stability when added to PDC1.01 to generate PDC2.01, and similarly do not result in a significant increase in thermal endurance. However, these same four mutations increase both thermal stability and thermal endurance when added to PDC2.02 to generate PDC2.03. On the one hand, these results suggest that the high-throughput thermal stability screens may fail to identify thermal stabilizing mutations if they are not evaluated in the favorable context of a more thermostable protein. However, the context dependence of these mutations also demonstrates the importance of cooperativity, such that even mutations that by themselves yield only the smallest gains in stability can ultimately make a significant contribution when part of a larger design.
These results suggest stabilizing all regions of a protein at or above the target temperature is critical to achieve the desired enhancement in thermal endurance. PDC1.10 and PDC2.02 both exhibit some loss of molar ellipticity at 60 °C, the temperature at which we evaluate thermal endurance by DIC. The improvement in endurance for PDC2.02 compared to PDC1.10, which is not observed as an improvement in stability based on the thermal melt, may therefore be the result of structural stabilization near 60 °C. In addition, PDC2.03 exhibits the highest degree of thermal endurance at 60 °C and also shows no loss of molar ellipticity until 90 °C. Both the context dependence of the four glycine-to-alanine mutations as well as the pronounced cooperativity observed in the combined mutations of PDC2.03 may be a reflection of the thermal stabilization of structurally cooperative regions of the multimeric PDC.
Much of protein design and structure prediction is predicated on the hypothesis that the native structure for a given protein is the conformation with the lowest Gibbs free energy. However, there is significant evidence that for many proteins the active and native conformation may not be the lowest energy conformation, but instead kinetic energy barriers drive the folding trajectories to achieve the observed native conformations . For proteins that fall into such kinetically driven folding trajectories, perturbing physiological conditions such as temperature or pH can alter protein folding and subsequently alter the sampled conformations. Thus, a critical next step and test of our ability to computationally design thermostable enzymes is to evaluate protein folding under elevated temperatures.
DWS performed in silico modeling, protein expression, and ThermoFluor stability assays, assisted in protein purification, wrote the manuscript. NK performed circular dichroism experiments. BSD performed differential interference contrast microscopy. MA performed protein purification, activity assays, and X-ray crystallography. VVL performed X-ray crystallography. DC assisted in protein expression. NSS assisted in protein purification and ThermoFluor stability assays. HY assisted in circular dichroism experiments, and analysis. AM assisted in activity assays. MEH assisted in protein purification and activity assays. AMG assisted in experimental design and development of the project idea and protein expression. YJB assisted in development of the project idea and protein purification. All authors read and approved the final manuscript.
We would like to thank Shaun Bevers for his help with the differential scanning calorimetry experiments.
The authors declare that they have no competing interests.
Availability of data and materials
The X-ray crystal structure of PDC2.03 generated during the current study is available in the Protein Data Bank [PDB] repository with the PDB ID 5TMA, [https://www.rcsb.org/structure/5tma].
Consent for publication
Ethics approval and consent to participate
This work was supported by the BioEnergy Science Center (BESC). BESC is a U.S. Department of Energy Bioenergy Research Center supported by the Office of Biological and Environmental Research in the U.S. DOE Office of Science. The research was also supported by the National Institutes of Health: NIH R01GM103843 (HY), F30CA180249 (NK).
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
- 16.de Souza AR, de Araujo GC, Zanphorlin LM, Ruller R, Franco FC, Torres FA, Mertens JA, Bowman MJ, Gomes E, Da Silva R. Engineering increased thermostability in the GH-10 endo-1,4-beta-xylanase from Thermoascus aurantiacus CBMAI 756. Int J Biol Macromol. 2016;93(Pt A):20–6.CrossRefPubMedGoogle Scholar
- 28.Edwards PM. Origin 7.0: scientific graphing and data analysis software. J Chem Inf Model. 2002;42(5):1270–1.Google Scholar
- 44.Tomazic SJ, Klibanov AM. Mechanisms of irreversible thermal inactivation of Bacillus alpha-amylases. J Biol Chem. 1988;263(7):3086–91.Google Scholar
Open AccessThis article is distributed under the terms of the Creative Commons Attribution 4.0 International License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated.