|Home | About | Journals | Submit | Contact Us | Français|
To optimize the in vivo folding of proteins, we linked protein stability to antibiotic resistance, thereby forcing bacteria to effectively fold and stabilize proteins. When we challenged Escherichia coli to stabilize a very unstable periplasmic protein, it massively overproduced a periplasmic protein called Spy, which increases the steady-state levels of a set of unstable protein mutants up to 700-fold. In vitro studies demonstrate that the Spy protein is an effective ATP-independent chaperone that suppresses protein aggregation and aids protein refolding. Our strategy opens up new routes for chaperone discovery and the custom tailoring of the in vivo folding environment. Spy forms thin, apparently flexible cradle-shaped dimers. Spy is unlike the structure of any previously solved chaperone, making it the prototypical member of a new class of small chaperones that facilitate protein refolding in the absence of energy cofactors.
The folding of many proteins is assisted by molecular chaperones and other folding helpers in the cell1. Many chaperones act by inhibiting off-pathway events, such as aggregation, and serve a broad range of substrates in a stoichiometric manner. In vitro assays for chaperone activity are thus almost by necessity relatively insensitive, making these assays more useful in studying previously identified chaperones than in identifying new ones in crude cell lysates. Instead of being discovered directly for their capacity to assist in vivo protein folding processes, many chaperones were initially characterized because of their induction by stress conditions2. The lack of a sensitive and general in vivo assay for chaperone activity led us to wonder how complete the list of known chaperones is.
Previously, we developed a genetic system that directly links increased protein stability to increased antibiotic resistance3 and thereby provides a selectable and quantitative in vivo measure of protein stability3. Our selection system makes use of a sandwich fusion between β-lactamase and unstable proteins, which effectively links the stability of the inserted proteins to the penicillin resistance of the host strain. We showed that β-lactamase tolerates the insertion of a well-folded protein in a surface loop (at position 197) and still retains enzymatic activity (Fig. 1). However, insertion of unstable proteins at this site negatively affects β-lactamase activity and decreases penicillin V resistance (PenVR) in vivo3. This strategy allowed us to select for stabilized protein variants of a well characterized protein, immunity protein 7 (Im7)3.
Now, we hypothesized that a similar strategy could even be used to select for host variants that alter the in vivo folding environment of individual proteins by enhancing protein folding or stability, either specifically or generally. Because our system functions in the periplasm, it may also provide an opportunity to explore protein folding in what has generally been considered to be a relatively chaperone-poor environment and, in doing so, uncover new chaperones. In selecting for host variants that stabilized a very unstable Im7 mutant, we discovered variants that massively overproduce a previously poorly characterized protein called Spy. We found Spy to be the founding member of a new class of chaperones that have a novel cradle-shaped structure and function as ATP-independent folding chaperones in the periplasm of Escherichia coli.
We designed a dual selection system by inserting the same unstable test protein Im7-L53A I54A into the middle of β-lactamase, which encodes PenVR, and DsbA which encodes cadmium resistance (CdCl2R)4, as sandwich fusions (Fig. 1). Our overall experimental scheme is illustrated in Fig. 2. The two resistance markers operate via very different mechanisms4, 5 and thus provide an independent measure of the stability of the inserted protein. In strains that co-express both fusions, we reasoned that host mutations that simultaneously increased PenVR and CdCl2R should positively affect the one thing that these two constructs apparently have in common, namely the stability of the inserted protein. Insertion of Im7 variants into DsbA after residue Thr99, a site that can tolerate protein insertions (Fig. 1 and Supplementary Fig. 1), revealed an excellent correlation between CdCl2R and Im7 variant stability (Fig. 1c). CdCl2R also correlated well with PenVR when the same Im7 variants were inserted into β-lactamase (Fig. 1d). The highest resistance to both antimicrobials came from the most stable Im7 variants. These results suggested that host variants that stabilize Im7 should increase both PenVR and CdCl2R and thus should be easily selected for on this basis.
We searched for host mutations that enable the proper folding of Im7-L53A I54A, a very unstable Im7 variant6. Using ethyl methanesulfonate (EMS), we randomly mutagenized strain SQ1306, which contains sandwich fusions between this unstable Im7 variant and both DsbA and β-lactamase. We selected for variants with enhanced PenVR and then screened those for enhanced CdCl2R.
Plate screens revealed that about 13% (35 of 263) of the strains that had gained PenVR had simultaneously acquired resistance to CdCl2. Ten independently isolated mutant strains that were resistant to both PenV and CdCl2 (EMS1–EMS10, Supplementary Table 1a) were selected for further analysis. Eight of these contained substantially elevated levels of the β-lactamase-Im7 fusion protein, as determined by quantitative Western blots (Supplementary Fig. 2). Since EMS4 and EMS9 contained the highest amounts of the sandwich fusion protein and exhibited resistance to high concentrations of both PenV and cadmium they were chosen for further analysis. To confirm these host variants had improved the folding and consequently increased the protein levels of the Im7 protein itself, we transformed EMS4 and EMS9 with plasmids encoding Im7 and three different destabilized Im7 mutants in the absence of the fusion. Both EMS4 and EMS9 strain backgrounds accumulated large amounts of Im7 proteins in the periplasm, in contrast to the parental PenVS/CdCl2S strain SQ1306 (Fig 3a). Quantification using an Agilent bioanalyzer showed that Im7 proteins made up 7-10% of the periplasmic content in EMS4 and EMS9, a 34- to 92-fold increase over the wild-type strain (Fig. 3a, Tables 1 and Supplementary Table 1b). The Im7 proteins were extracted from the periplasm in the absence of detergent, indicating that they accumulate in a soluble form.
In examining the periplasmic extracts of the Im7 overexpressing strains, it was impossible to ignore the strong induction of a 15.9 kD protein, which was identified by mass spectrometry analysis as Spy (Spheroplast protein Y)7. The Spy protein accounted for up to 48% of total periplasmic content in eight of the ten tested PenVR/CdCl2R EMS mutant strains (Fig. 3, Supplementary Fig. 2 and Supplementary Table 1b), suggesting that Spy overexpression might be involved in the stabilization of our Im7 variants.
Spy expression is under the control of the Bae and Cpx periplasmic stress response systems7-9, which are induced by a variety of stress conditions known to cause protein unfolding and aggregation2, 10. To identify the mutation(s) that caused the massive up-regulation of Spy expression in EMS4 and EMS9, we sequenced the spy gene, its regulators cpxARP and baeRS, and a number of other candidate genes, picked because they encode periplasmic chaperones, proteases, stress response regulators or multidrug exporters, surA, skp, fkpA, prc, degP, ptr, ompP, ompT, rcsABCDF and mdtABCD and their upstream regulator sequences. We discovered that both EMS4 and EMS9 contained mutations in baeS but not in any of the other sequenced genes or regulatory sequences (Supplementary Table 1c). We then sequenced the baeS gene in all other independently isolated PenVR/CdCl2R strains (EMS1–EMS10) and determined that it was mutated in all strains except EMS7, a strain that failed to overproduce Spy or Im7. A total of six different mutations were found in baeS (Supplementary Table 1c).
BaeS is a putative histidine kinase that together with the proposed response regulator BaeR makes up the two-component BaeSR envelope stress response regulation system2, 7-9; this system has previously been shown to regulate spy and a few other periplasmic stress genes. We picked one baeS mutation, baeS-R416C, for further analysis. We established that the baeS-R416C mutation was necessary and sufficient for enhanced Im7 expression (Supplementary Fig. 3). Transcriptional analysis of EMS4 revealed not only a massive up-regulation of spy mRNA but also a significant induction of other known downstream targets of BaeSR (Supplementary Table 1d), suggesting that the baeS-R416C mutation caused the constitutive activation of the BaeSR envelope stress response.
To determine if Spy overproduction alone is responsible for the enhanced levels of the unstable Im7 protein, or whether other BaeSR-regulated proteins are involved as well, we overproduced Spy from the pTrc promoter to levels similar to those seen in the EMS4 mutant, in a baeSR null background. We found that for the various destabilized Im7 mutants, overexpression of Spy, even in the absence of a functional BaeSR system, led to soluble Im7 levels that were very similar to those observed in EMS4 by SDS-PAGE analysis (Fig. 3b). Quantification of Im7 and Spy levels showed that upon Spy overexpression, Im7 levels increased 100- to 700-fold (Table 1). The overexpression of downstream targets of BaeSR other than spy apparently does not contribute to the observed PenV resistant phenotype, as their individual deletions had no effect on PenV resistance (Supplementary Fig. 3). Based on these results, we concluded that Spy overproduction is necessary and sufficient to increase the levels of soluble periplasmic Im7. Strains deficient in the protease DegP exhibited increased levels of Im7 (Supplementary Fig. 4a), so it appears that Spy is acting at least in part to protect Im7 from proteolysis.
Although sequences homologous to Spy are present in a wide variety of enterobacteria, protobacteria and some cyanobacteria (Supplementary Fig. 5), very little was previously known about Spy function7. Deletion of spy was reported to cause slight induction of degP and rpoH, two genes under the control of rpoE, a stress response system involved in outer membrane protein biogenesis leading to the suggestion that Spy may also be involved in this process11. Using quantitative reverse transcription polymerase chain reaction (qRT-PCR), we were unable to detect significant induction of these or other periplasmic stress regulated genes upon deletion of spy in our strain background (Supplementary Table 1d), suggesting that spy deletion does not cause significant defects in membrane integrity.
Our finding that Spy overexpression leads to the accumulation of an otherwise highly unstable protein instead suggested that Spy might function as a chaperone that facilitates protein folding in the bacterial periplasm. To assess its chaperone activity, we purified Spy and analyzed its influence on the aggregation of a number of substrate proteins in vitro (Fig. 4a). We first tested the effect of Spy on the aggregation of thermally denatured malate dehydrogenase (MDH) and found that addition of increasing amounts of Spy significantly reduced protein aggregation. Even sub-stoichiometric quantities of Spy effectively inhibited the aggregation process, suggesting that Spy is a highly efficient chaperone. Analysis of the effects of Spy on urea-denatured MDH revealed similar results and showed that Spy effectively prevents MDH from aggregating. We found that Spy, which is strongly induced by the protein denaturant ethanol9, protects MDH from ethanol-mediated aggregation. We also observed a strong suppression of aggregation of chemically denatured aldolase and glyceraldehyde-3-phosphate dehydrogenase (GAPDH) by Spy. Given that heat, urea and ethanol use different mechanisms to unfold proteins, we concluded that Spy must have a general affinity for a wide range of different protein unfolding intermediates. Combined, our results strongly suggested that we identified a new, general chaperone in the periplasm of E. coli.
Most known ATP-dependent chaperones, such as the DnaK and GroEL systems, function as folding chaperones12. They use cycles of ATP binding and hydrolysis to regulate substrate binding and release, thus facilitating protein folding12. In contrast, most known ATP-independent chaperones function as holding chaperones, which prevent protein aggregation but usually lack the ability to support protein folding13. To test if Spy can support protein folding, even though it is localized to the ATP-devoid environment of the bacterial periplasm, we analyzed its influence on the refolding yield of chemically and thermally unfolded proteins. We found that Spy significantly increased the refolding yield of a number of substrates (Fig. 4b). Because the assays were performed in the absence of any cofactors, these results strongly suggest that Spy has intrinsic protein folding capacity, providing an excellent explanation of how Spy overexpression is sufficient to substantially increase the amount of folded Im7 protein in vivo.
qRT-PCR-based measurements indicated that spy mRNA is induced nearly 500-fold in response to tannin treatment14. The gene for IbpB, an E. coli small heat shock protein homolog also involved in inhibiting protein aggregation, is induced 48-fold by tannins, making it second only to spy in induction by tannins14. Tannins, which have long been used to tan leather, are the fourth most abundant component of vascular plant tissue and are synthesized by plants as a protection against bacterial and fungal infections15. Tannins are thought to be responsible for the astringent taste of many human food substances including cheap red wine, strong tea, and unripe fruit16. Some forage crops contain up to 25% tannins by dry weight. E. coli found in the gut of herbivores is thus exposed to high concentrations of tannins, driving it to develop tannin resistance17. Tannins can have human disease-related antimicrobial effects18. The tannins present in cranberry juice, for instance, act as potent inhibitors of the attachment of pathogenic E. coli to the uroepithelium and are thought to explain the effectiveness of cranberry juice in preventing urinary tract infections19. Only small quantities of tannins are required to aggregate proteins, a feature that may be responsible for their antimicrobial activity20. Their astringent taste in foods may be due to the precipitation of mouth proteins21. The astringent taste of strong tea is often reduced by addition of milk, which drives co-precipitation of the tannins with the disordered protein casein that is present in milk21. Tannic acid has also been reported to have potent anti-amyloidogenic activity22-24.
We tested the effects of Spy on tannin-mediated inactivation of the E. coli membrane protein DsbB (Fig. 5a), aldolase (Fig. 5b) and the E. coli periplasmic protein alkaline phosphatase (Fig. 5c). Spy protected all three proteins from tannic acid-induced activity loss. Consistent with this in vitro result, we discovered that spy null mutants are highly sensitive to tannins (Fig. 5d) and show decreased alkaline phosphatase activity in vivo (Supplementary Fig. 2). baeSR mutants have been reported to be tannin sensitive14, and we found that most, but not all, of the tannin sensitivity could be attributed to induction of Spy (Fig. 5d). The antimicrobial action of tannins can substantially alter the microbial content of the gut; indeed, the relatively high tannin resistance of enterobacteria, which we show here is mediated at least in part by Spy, appears to be responsible for allowing the population of fecal enterobacteria to increase up to 19-fold in rats fed a high tannin diet17. We conclude that the induction of Spy is likely to be involved in protecting cells from tannin-induced protein aggregation and inactivation in vitro and in vivo.
The Spy protein is 29% identical to CpxP, an inhibitory component of the CpxRA regulatory system25. CpxP binds to the periplasmic domain of CpxA, inhibiting its autokinase activity. The presence of unfolded proteins causes the release of CpxP, thereby activating the Cpx response25, 26. These observations suggested that CpxP might be acting as one of the very few known periplasmic chaperones, targeting itself and its unfolded protein cargo to the protease DegP for degradation27, 28. We found that CpxP has weak chaperone activity in vitro and CpxP overproduction causes the accumulation of Im7 in otherwise wild-type strains (Supplementary Fig. 4). To assess whether Spy might be involved in regulating BaeSR or the other periplasmic stress response systems, we carried out qRT-PCR in strains either lacking or overexpressing spy (Supplementary Table 1d). We found no notable influence on the expression of baeS, cpx, rpoE, rpoH, rcs or psp regulated genes, suggesting that Spy, unlike CpxP, may not be playing a major regulatory role. Instead, our results strongly suggest that Spy functions directly as a molecular chaperone.
To gain insights into the mechanism of Spy's chaperone action, we crystallized and determined the three-dimensional structure of His-tagged Spy (Table 2). The crystal structure shows that Spy molecules associate into tightly bound dimers. Size exclusion chromatography and analytical ultracentrifugation (Supplementary Fig. 6) confirmed this oligomerization state and revealed that Spy is also dimeric in solution. Each Spy monomer consists of four α-helices (α1–α4). The 28 N-terminal residues and 14 C-terminal residues are disordered in the crystal. Helices α1, α2 and α3 fold into a hairpin, with α1 and α2 forming one arm and α3 the other arm (Fig. 6a and Supplementary Fig. 6). Helix α3 has nine turns and is bent in the middle by ~30° due to partial unwinding at Met85–Glu86. The N-terminal half of α3 is parallel to α2, whereas its C-terminal half is parallel to α1. Helix α4 runs antiparallel to α3 and is inclined to it by ~45°. The first six ordered N-terminal residues, Phe29–Leu34, assume an extended conformation and follow along helix α4. Following the submission of our work, the crystal structure of Spy was determined as part of a high-throughput effort29; as expected, this structure is very similar to ours. The Spy dimer is formed through the antiparallel coiled-coiled interaction. The shape of the dimer is rather unusual, with one surface highly concave and the other convex, reminiscent of a cradle. The bottom of this cradle is formed by helices α3, the sides by the connection between helices α1 and α2 and the tips by helices α1 and α4. The cradle is extremely thin in cross section; its average thickness is 9.2 Å (Supplementary Fig. 6). The contacts between the two monomers are extensive, burying a surface of ~1850 Å2 per monomer upon dimerization and suggesting high dimeric stability. Although the concave surface has an overall positive charge29, it is also lined with a number conserved apolar side chains, localized in two clusters that are exposed as hydrophobic patches (Fig. 6b,c).
To investigate the interaction between Spy and protein substrates, we labeled Spy with two different environmentally sensitive probes, acrylodan and 4-(N-(iodoacetoxy)ethyl-N-methyl)amino-7-nitrobenz-2-oxa-1,3-diazole (IANBD). Probe attachment sites were made by substituting cysteines at six different positions; these included residues exposed to both the concave (R89C, H96C) and convex sides of the cradle (E59C, K77C), as well as residues within the structurally disordered N and C termini (H24C and A128C) (Fig. 6b). We then determined the influence of substrate addition on the fluorescence of the probe-labeled Spy variants. Changes in the microenvironment near the probes caused by substrate addition could reflect either direct binding to the labeled region of the probe or structural rearrangements within Spy caused by substrate binding.
The Spy variants retained substantial ability to inhibit the aggregation of aldolase, showing that they are at least partially active (Supplementary Table 1e and Supplementary Fig. 7a). For the interaction experiments we used casein as a Spy substrate because unlike most of our other Spy substrates it is intrinsically disordered but soluble; Im7-L53A I54A was also chosen because it is an excellent Spy substrate in vivo. Im7-L53A I54A in vitro is trapped as a partially unfolded but soluble intermediate3. Analytical gel filtration revealed the presence of an apparently stable complex between Spy and casein (Fig. 7a). As shown in Fig. 7b,c, equimolar addition of casein or Im7 significantly decreased Spy-mediated refolding of MDH. We interpreted these data as likely due to direct competition for the same binding site on Spy. We then measured changes in the fluorescence emission spectra of the various labeled Spy variants upon addition of equimolar quantities of casein or Im7-L53A I54A, thereby monitoring potential environmental changes in the vicinity of the labeled residues upon substrate binding. As shown in Fig. 7d and Supplementary Fig. 7b, the fluorescence of acrylodan attached via H24C, E59C, R89C, H96C and A128C significantly increased and blue-shifted with casein addition, suggesting that the region near these residues becomes more hydrophobic in the presence of casein. Acrylodan-labeled H24C and A128C Spy variants also showed fluorescence increases and slight blue shifts upon Im7-L53A I54A addition (Fig. 7e and Supplementary Fig. 7c). IANBD is generally not as sensitive as acrylodan in its ability to reflect changes in the fluorophore environment. Nevertheless, IANBD-labeled H24C, H96C and A128C Spy variants exhibited significantly decreased fluorescence upon casein binding (Supplementary Fig. 7d). Because the fluorescence of IANBD is quenched in hydrophobic environments30, these results suggest that these residues become more hydrophobic in character with casein binding. In contrast, the fluorescence of acrylodan attached via K77C is decreased with casein addition (Fig. 7d), indicating that the region near this residue probably becomes more hydrophilic with substrate binding. That nearly all our Spy mutants exhibit significant changes in their environment upon substrate addition suggests that substrate binding occurs over large regions of Spy (including both the concave and convex sides of the cradle). Alternatively, major structural rearrangements or higher-order oligomerization reactions could occur within Spy upon substrate binding or both.
It may seem surprising that a chaperone as effective as Spy has remained unstudied in an organism as well characterized as E. coli. However, because chaperones are usually effective only in stoichiometric quantities, chaperone assays are generally not sensitive enough to enable their purification from crude lysates by activity. Instead, chaperones have often been first identified because their genes are induced in protein unfolding conditions2. Our approach of linking folding to selectable markers opens up the possibility of directing evolution to alter the in vivo folding environment to specifically enhance the folding of a given unstable protein and provides a new route for chaperone discovery.
Spy has a unique cradle shape that is unlike any other chaperone whose structure is known. Spy lacks a significant globular core and would therefore be expected to display higher flexibility than the average globular protein. The molecule averages 9.2 Å in thickness, less than the 12Å diameter of a single α–helix. Its thinness places a disproportionate number of side chains on the protein surface. The highest backbone temperature factors are observed in surfaces and bumps extending from the concave side of the cradle, particularly in the connectors between helices α1 and α2, which form the sides of the cradle, suggesting the possibility for bending and twisting of the molecule (Fig. 6d). Spy's shape combined with its apparent flexibility leads us to propose a model for Spy action that involves the shielding of aggregation-sensitive regions on substrate proteins, which are revealed upon partial unfolding by interaction with Spy. Spy's apparently highly flexible nature may allow the chaperone to accommodate a variety of partially unfolded protein substrates. The extremely high flexibility of another periplasmic chaperone HdeA, for instance, appears to allow it to bind numerous substrate proteins and prevent aggregation31.
Addition of 4 mM tannic acid results in the massive induction of Spy so that it comprises ~25% of periplasmic proteins (Supplementary Fig. 2d), similar to the induction seen in our baeS constitutive EMS strains (Supplementary Table 1b). Of E. coli's ~4280 genes, spy is one of those most strongly induced by butanol32, 33, a protein unfolding agent34, leading to Spy comprising ~20% of periplasmic proteins (Supplementary Fig. 2d). Ethanol, another well known protein denaturant and potent inducer of the heat shock response35, also strongly induces Spy so that it makes up ~5% of periplasmic proteins9 (Supplementary Fig. 2d). Spy is also strongly induced by other conditions that induce protein unfolding, such as overproduction of miss-folded PapG and NlpE7. Thus, strong induction of Spy is physiological, not something that only occurs in our BaeS constitutive mutants. That Spy is induced so strongly by protein unfolding or precipitation agents that it becomes up to 25% of the periplasmic contents implies a pressing need for it to respond to unfolding conditions. Other chaperones also attain high percentages after stresses that unfold proteins. GroEL, for instance, becomes 12% of the total cellular protein during growth at 46 °C36, but the abundance of Spy is more extreme, particularly when calculated on a molar basis. The total concentration of protein in cells is ~300 g l−1 37, allowing one to calculate the molar concentration of the 31 kDa Spy dimer in the periplasm to be up to 2.4 mM. GroEL functions as a 790 kDa 14-mer, so its molar concentration in the cell when 12% of total cellular contents is only about 0.05 mM or about 50-fold less abundant on a molar basis than the physiological levels of induced Spy. Even though Im7 can attain ~10% of the cell protein upon Im7 overproduction, Spy is even more abundant; upon Spy overproduction, approximately two Spy dimers are present in the periplasm for every Im7 monomer.
The extreme abundance of Spy leads us to propose a speculative model for Spy action in which it protects proteins in vivo by binding to aggregation prone regions exposed on the surface of unstable proteins, coating these regions (or possibly even the entire unstable protein) with a thin layer that inhibits proteolysis and/or aggregation. Upon Spy overproduction, the levels of unstable Im7 mutants go from barely detectable to up to 10% of periplasmic extracts (Supplementary Table 1d), likely because Spy is very effective in inhibiting in vivo proteolysis and/or aggregation. Changes in the fluorescence of environmentally sensitive probes labeled with acrylodan or IANBD are consistent with either Spy binding to substrates via large parts of the molecule or major rearrangements occurring in the Spy structure upon substrate binding or both. Defining the precise substrate binding site, the stoichiometry of its interaction with substrates, and its precise mechanism of action, including the possible involvement of co-chaperones, awaits future experimentation.
Although suppression of protein aggregation does not require energy, release and refolding of bound substrate proteins usually does38. ATP is absent in the periplasm and in our assays was not required for the refolding function of Spy, indicating that the chaperone activity of Spy is ATP-independent. Thus, Spy appears to be one of the very few chaperones that actively support protein refolding in the absence of any obvious energy source, suggesting that Spy uses a mechanism to control substrate binding and release that is different from that of previously characterized chaperones. How Spy enables refolding in the absence of energy cofactors is a provocative question for future research. At this point, we can only speculate that the apparently highly dynamic nature of the Spy structure may permit structural fluctuations that not only allow it to mold to various proteins but also enable it to release them.
Bacteria containing the plasmids pBR322blaGS linker Im7 L53A I54A and pBAD33dsbAGS linker Im7 L53A I54A, which encode sandwich fusions of Im7 L53A I54A with β-lactamase and dsbA respectively, were subject to EMS mutagenesis39, and mutants resistant to 1500 μg ml−1 penicillin and 0.5 mM CdCl2 were selected. Western blotting of total cell extracts was used to detect the levels of β-lactamase-Im7 fusion protein present in these mutant strains. Following transformation with plasmids that encode wild-type Im7 and various destabilized Im7 mutants expressed under the pTrc promoter, periplasmic extracts were prepared as previously described40, and the pattern of soluble periplasmic protein expression was examined on SDS gels (Invitrogen). The level of Im7 and Spy present in these extracts was quantified using a Bioanalyzer 2100 (Agilent) as described below.
The amount of the sandwich fusion protein β-lactamase-Im7 L53A I54A in SQ1306 and EMS1–EMS10 was quantified by Western blot with anti-β-lactamase antibody using whole cell lysate as described previously3 with minor modifications. To quantify plasmid-encoded Im7 (wild-type and variants) expressed in the absence of the fusion, cells were grown to mid-log phase in LB medium at 37 °C. Im7 protein expression was induced with 2 mM IPTG for 2 h. Addition of IPTG also induces the expression of Spy when the pTrc-spy plasmid is present. Periplasmic extractions were prepared as described previously40, and the proteins were separated using a Bioanalyzer 2100 (Agilent) with the high sensitivity protein 250 kit and the conditions specified by the manufacturer. To visualize the tiny signal corresponding to Im7 present in the absence of Spy expression, a pico labeling protocol was applied (Agilent technical note, publication number: 5990-3703EN). Prior to labeling, periplasmic extractions were diluted to different extents to ensure the linear relationship between Im7 amount and signal. Protein ratios were determined by integrating the trough-to-trough peak area. The high sensitivity of the 250 kit and the pico labeling protocol allowed the visualization and quantification of the very small amount of Im7 present in the absence of Spy, an amount that was not detectable on either Coomassie blue-stained SDS-PAGE gels or on the Agilent bioanalyzer using either the 80 or 230 protein kits. Although excellent for determining the ratio of a single defined protein present in different sample preparations, we reasoned that differences in labeling efficiencies between different proteins would make this protocol suboptimal for percentages of the total protein. Instead, the percentage of Spy and Im7 was quantified in overproducing strains using the protein 80 kit (Agilent) without labeling. Areas corresponding to Spy or Im7 peaks were compared to the total area of all proteins present in the periplasmic extract. In the absence of Spy, the Im7 peak was not detectable; therefore, the percentage of Im7 was calculated by dividing the Im7 percentage in the presence of Spy overexpression with the fold increase of Im7 level. For example, Im7 is 10% of total periplasm in SQ1405. Im7 expression increased 34-fold in SQ1405 compared to SQ1413. The percentage of Im7 in SQ1413 was therefore calculated to be 0.3%. The percentage of Spy present in periplasmic extracts in different EMS strains was quantified similarly with the protein 80 kit.
Initial crystallization conditions of the mature form of Spy (residues 1–138) with N-terminal His6-tag were identified using the AmSO4 suite (Qiagen Inc., Canada). Spy contains 161 residues of which 23 constitute the signal peptide that directs Spy to the periplasm. Our residue numbers refer to the mature, periplasmic form of Spy, which contains 138 residues; the N-terminal His6-tag is left un-numbered. The SeMet-substituted protein was crystallized under the same conditions. The best diffracting crystals were obtained by mixing 1 μl protein in the final buffer with 1 μl of reservoir solution containing 0.3 M CdCl2 and 2.4 M AmSO4 under the vapor diffusion hanging drop method. Prior to data collection, the crystals were cryoprotected in paratone and flash frozen in liquid nitrogen.
The diffraction data were collected from a single crystal at the CMCF-1 beamline at the Canadian Light Source (CLS), Saskatoon, Saskatchewan at the Se absorption edge (wavelength of 0.9792 Å) to 2.6 Å resolution at 100 K. Data were processed and scaled using DENZO and SCALEPACK in the HKL2000 suite41.
The structure was solved by the single anomalous diffraction method. Of the 22 expected Se atoms in the asymmetric unit, 8 were located and refined using autoSHARP42. These sites were used to obtain preliminary phases. The model was built manually in Coot43 using the Se sites as reference points. Several cycles of refinement using REFMAC544 followed by model rebuilding were carried out. The final refinement was performed with PHENIX45 and included the TLS model for thermal motions. The His6 tag and the first 28 residues as well as the last 14 C-terminal residues are disordered. The final model includes the central 96 residues for each monomer. Of these, 11 long side-chains in molecule A and 15 in molecule B were only partially modeled. The pertinent data are shown in Table 2. The model has good geometry as analyzed by Molprobity46, 96.9% (309/319) of all residues are in the favored region and 100% of all residues are in the allowed region.
A detailed description of all other methods is given in Supplementary Methods.
We thank G. Morgan and S. Radford for communicating results prior to publication, and for the kind gift of Im7-L53A I54A protein, T. Franzmann for performing the ultracentrifugation experiments, D. Reichmann, C. Cremers, and A. Malik for advice, M. Lei and Y. Chen for providing vector and reagents for Spy purification, and C. Munger for initial purification and crystallization of Spy. Protein identification was performed by the Protein Structure Facility at the University of Michigan. Diffraction data were collected at the CMCF-1 beamline at the Canadian Light Source, Saskatoon, Saskatchewan, Canada. We would like to thank S. Labiuk for data collection. The Howard Hughes Medical Institute funded this work. M.C. acknowledges financial support from CIHR grant GSP-48370.
Accession Code: The coordinates and structure factors have been deposited in the Protein Databank with accession code 3O39.
Author Contributions: J.C.A.B. designed the study and wrote the manuscript with contributions from M.C and S.Q.. S.Q., P.K., T.T., N.K., K.R., R.S., J.P., S.H. and G.R. performed the experiments and collected and analyzed the data. J.C.A.B., X.U. and M.C. further analyzed the data. L.F. and U.J. provided technical support and conceptual advice.
Competing Financial Interests: The authors declare no competing financial interests.
Note: Supplementary Information is available on the Nature Structural & Molecular Biology website.