Crystal structure of the YffB protein from Pseudomonas aeruginosa suggests a glutathione-dependent thiol reductase function
The yffB (PA3664) gene of Pseudomonas aeruginosa encodes an uncharacterized protein of 13 kDa molecular weight with a marginal sequence similarity to arsenate reductase from Escherichia coli. The crystal structure determination of YffB was undertaken as part of a structural genomics effort in order to assist with the functional assignment of the protein.
The structure was determined at 1.0 Å resolution by single-wavelength anomalous diffraction. The fold is very similar to that of arsenate reductase, which is an extension of the thioredoxin fold.
Given the conservation of the functionally important residues and the ability to bind glutathione, YffB is likely to function as a GSH-dependent thiol reductase.
KeywordsSeMet Selenomethionine MALDI Mass Spectrometry Arsenate Reductase arsC Gene
The yffB (PA3664) gene of Pseudomonas aeruginosa encodes an uncharacterized protein of 13 kDa molecular weight. Based on the amino acid sequence analysis, YffB and its homologs have been assigned to the family of arsenate reductase (AR) and related proteins (Pfam entry PF03960) . AR participates in arsenic detoxification by catalyzing the reduction of arsenate [the oxyanion of As(V)] to arsenite [the oxyanion of As(III)], which is then exported through a specific transport system . There are two different types of bacterial ARs. AR of Gram-negative bacteria has a distinct HX3CX3R catalytic sequence motif, belongs to the thioredoxin (Trx) structural superfamily, and is coupled to the glutathione (GSH) and glutaredoxin (Grx) system for its enzyme activity [3, 4]. AR of Gram-positive bacteria also has a redox active cysteine residue but within a CX5R sequence motif embedded in a fold typical for low molecular weight protein tyrosine phosphatases, and requires Trx and Trx reductase for enzyme activity [5, 6, 7].
Besides the Trx-like AR, the PF03960 family includes a number of uncharacterized proteins whose function seems unlikely to be arsenate reductase. Members of the family are widely represented in bacteria, but not in archaea or eukaryotes. The pattern of amino acid distribution along the polypeptide chain suggests that these proteins may have a common fold.
The crystal structure determination of YffB was undertaken as part of a structural genomics effort  in order to assist with the functional assignment of the protein. The project was focused on the so-called hypothetical proteins from Haemophilus influenzae. The YffB protein from P. aeruginosa has emerged as an ortholog of HI0103 to increase chances for successful crystallization. YffB was cloned, expressed, and the crystal structure determined at 1.0 Å resolution. The protein fold appeared to be similar to that of Escherichia coli AR. Analysis of the structure suggests that YffB may function as a thiol reductase.
Results and discussion
The atomic model of YffB contains all residues but the N-terminal methionine, which has probably been posttranslationally cleaved. The molecular weight of the SeMet protein measured by matrix-assisted laser-desorption ionization (MALDI) mass spectrometry was close to the calculated value of 13,121 Da for residues 2–115. An addition of one selenomethionine would have increased the molecular weight by 178 Da.
The α/β-domain is characteristic of a superfamily of the Trx-like proteins . Common to all of them, a cis-proline residue is located at the N-terminus of strand β3. This residue is important for the integrity of the active site, as the cis-peptide bond promotes a turn of the polypeptide chain. The corresponding Pro93 in YffB is also in the cis-conformation.
The overall structural similarity to the members of the Trx superfamily allows the identification of the putative active site of YffB in the loop between the first β-strand and the following α-helix. A cysteine residue at the N-terminus of the α-helix acts as a catalytic nucleophile . Its pK is lowered by the basic environment and the dipole of the helix . In ArsC the thiol group of the cysteine attacks the arsenic of the substrate to form a covalent intermediate . The arsenite ion is released upon binding of GSH, which is then reduced by Grx . Unlike many Trx-like proteins, the catalytic cysteine in ArsC does not form an internal disulfide in the oxidized state. YffB also has only one cysteine residue at the active site (Cys11), which further emphasizes its similarity to ArsC and suggests that it functions in a GSH-dependent manner.
Despite the structural similarity to ArsC, the two proteins share only 16% identical residues. However, most of the residues involved in substrate binding and catalysis are conserved in these proteins. Three invariant arginine residues, Arg60, Arg94, and Arg107, bind the substrate and stabilize the reaction intermediate in ArsC . They may also enhance the nucleophilicity of the active cysteine. Arg94 together with the ensuing cis-Pro95 is conserved in all AR-related proteins. Lys91 of YffB, despite its different location in the sequence, is spatially equivalent to Arg60 of ArsC and therefore can take part in substrate binding. The most important difference with respect to ArsC is the substitution of Gly for Arg107 that leaves the binding site without the positively charged anchor and also makes Cys11 more accessible for bulky compounds. This substitution probably reflects a difference in substrate specificity between ArsC and YffB.
One particular consequence of this Arg to Gly replacement might be the ability of YffB to bind GSH, whereas ArsC cannot bind GSH in the absence of arsenate . The binding of GSH to YffB was detected by MALDI mass spectrometry using an oxidized form of glutathione (GSSG). The mass increase of about 300 Da indicated one GSH molecule bound per protomer. This result supports the contention that GSH is probably involved in the functional cycle of YffB.
YffB has a very polarized distribution of charges over the surface of the molecule. The active site area is thronged with basic residues, as it is in ArsC, whereas the opposite side of the molecule is predominantly negatively charged (Fig. 1B). The positive electrostatic potential would certainly favor binding of anions.
Given the structural similarity to the Trx-like proteins and particularly to ArsC, and the conservation of the functionally important residues, YffB is likely to function as a thiol reductase. The nature of the substrate remains to be established in further biochemical and biophysical studies. These studies will be facilitated by the three-dimensional structure of the protein.
Cloning, expression and purification
The yffB (PA3664) gene from Pseudomonas aeruginosa PAO1 was amplified using PfuTurbo DNA polymerase (Stratagene), genomic DNA (ATCC 47085D), and the following 5'- and 3'-end primers.
Forward: 5'-CACCCTGGTGCCGCGCGGCAGC CATATG ACCTACGTTCTCTACGGCATCA-3'.
The sequence encoding the thrombin cleavage site is underlined, and the Nde I restriction site is shown in italic. The PCR product was introduced into a pET100/D-TOPO expression vector by the TOPO directional cloning procedure (Invitrogen). Recombinant plasmids were isolated from the E. coli TOP10 strain. The expression construct for production of the native protein without a His-tag was prepared by digestion with Nde I and self ligation. For production of the selenomethionine (SeMet) protein, E. coli strain B834 (DE3) was transformed with the recombinant plasmid, and cells were grown in a minimal medium supplemented with 100 μg/mL ampicillin and 40 μg/mL SeMet until the A600 reached 0.8. At this point the cells were induced with 1 mM isopropyl β-D-thiogalactoside and harvested after 3 h.
The SeMet protein was purified by column chromatography in three steps. The cell extract was applied to a Q Sepharose HP (Pharmacia) column equilibrated with 20 mM HEPES (pH 6.7), 50 mM NaCl, and 0.5 mM EDTA. About half of the protein bound to the column and was eluted in a 50–500 mM NaCl gradient. After dialysis in 20 mM HEPES (pH 6.7) and 0.5 mM EDTA, the fractions containing the protein were applied to a Source 15S (Pharmacia) column, and eluted with a 0–450 mM NaCl gradient. The protein was concentrated to 4 mg/ml, applied to a Sephacryl S100 (Pharmacia) gel filtration column, and eluted in 20 mM HEPES (pH 7.5), 100 mM NaCl, and 0.25 mM EDTA. According to the SDS gel, the protein was at least 95% pure. For crystallization, the protein was concentrated to 15 mg/ml.
Crystallization and structure determination
YffB crystals were grown by the vapor diffusion hanging drop method at room temperature from 0.1 M CHES, pH 10, 26% polyethylene glycol 3350, and 5% isopropanol. They belong to the space group C2 with unit cell parameters: a = 87.45 Å, b = 43.25 Å, c = 29.06 Å, β = 93.5°. There is one protein molecule in the asymmetric unit with the solvent content of 40%. For X-ray data collection, the crystals were soaked in the mother liquor supplemented with 15% polyethylene glycol 400 and flash-frozen in liquid propane.
X-ray data and refinement statistics.
Number of unique reflections2
Fraction of refls with I>3σ (%)
Rfree (2% data)
Number of protein non-hydrogen atoms
Number of water molecules
Mean B-factor from the model (Å2)
Mean B-factor from Wilson plot (Å2)
RMSD in bonds (Å)
RMSD in angles (°)
RMSD in main-chain B factors (Å2)
The atomic coordinates of YffB and structure factors were deposited in the Protein Data Bank under the accession code 1RW1.
Binding of glutathione was detected by MALDI mass spectrometry using a Voyager spectrometer (Applied Biosystems, Foster City, CA). The SeMet protein (10 μM) was incubated for 1 h at room temperature with 1 mM GSSG in 20 mM HEPES buffer, pH 7.T he sample was mixed 1:1 with matrix solution (10 mg/mL 3,5-dimethoxy-4-hydroxycinnamic acid, 50% aqueous acetonitrile, and 0.2% trifluoroacetic acid), deposited onto a golden plate, and allowed to dry at room temperature. Bovine myoglobin was used for molecular mass calibration.
This work was supported by the National Institutes of Health grant No. P01-GM57890.
Certain commercial materials, instruments, and equipment are identified in this manuscript in order to specify the experimental procedure as completely as possible. In no case does such identification imply a recommendation or endorsement by the National Institute of Standards and Technology nor does it imply that the materials, instruments, or equipment identified is necessarily the best available for the purpose.
- 8.Eisenstein E, Gilliland GL, Herzberg O, Moult J, Orban J, Poljak RJ, Banerjei L, Richardson D, Howard AJ: Biological function made crystal clear – annotation of hypothetical proteins via structural genomics. Curr Opin Biotechnology 2000, 11: 25–30. 10.1016/S0958-1669(99)00063-4CrossRefGoogle Scholar
This article is published under license to BioMed Central Ltd. This is an Open Access article: verbatim copying and redistribution of this article are permitted in all media for any purpose, provided this notice is preserved along with the article's original URL.