|Home | About | Journals | Submit | Contact Us | Français|
Clostridium difficile is a leading cause of nosocomial infections. The major virulence factors of this pathogen are the multidomain toxins TcdA and TcdB. These toxins contain a cysteine protease domain (CPD) that autoproteolytically releases a cytotoxic effector domain upon binding intracellular inositol hexakisphosphate. Currently there are no known inhibitors of this protease. Here we describe the rational design of covalent small molecule inhibitors of TcdB CPD. We identified compounds that inactivate TcdB holotoxin function in cells and solved the structure of inhibitor-bound protease to 2.0Å. This structure reveals the molecular basis of CPD substrate recognition and informed the synthesis of activity-based probes for this enzyme. The inhibitors presented here will guide the development of therapeutics targeting C. difficile, and the probes will serve as tools for studying the unique activation mechanism of bacterial toxin CPDs.
The Gram-positive anaerobic bacterium Clostridium difficile is a major cause of hospital-acquired diarrhea and the severe gastrointestinal illness pseudomembraneous colitis (Kelly and LaMont, 2008; Rupnik, et al., 2009). Although infection rates have risen dramatically in the last decade, there is currently a lack of therapeutics to treat C. difficile infection (Halsey, 2008; Kelly and LaMont, 2008). This is in large part due to the organism’s resistance to most classes of antibiotics. A viable strategy for combating C. difficile and other prominent bacterial pathogens is to target virulence factors instead of essential enzymes (Clatworthy, et al., 2007; Puri and Bogyo, 2009). This method limits the selective pressure on the organism to develop resistance to treatment, extending the effective lifespan of the drug. The large glucosylating toxins TcdA and TcdB are ideal targets for this approach because they are the primary virulence factors of C. difficile (Genth, et al., 2008; Jank and Aktories, 2008). TcdB in particular has been shown to be critical for virulence and is found in all clinical isolates (Lyras, et al., 2009; Rupnik, et al., 2009).
Both TcdA and TcdB cause cell death through an orchestrated sequence of events (Jank and Aktories, 2008). These multi-domain toxin proteins first enter cells by triggering receptor-mediated endocytosis (Frisch, et al., 2003; Rolfe and Song, 1993); acidification of toxin-containing endosomal compartments subsequently initiates translocation of the N-terminal cytotoxic glucosyltransferase domain and presumably the cysteine protease domain (CPD) into the cytosol (Just, et al., 1995; Pfeifer, et al., 2003; Qa’Dan, et al., 2000). The CPD is activated by the eukaryotic-specific small molecule inositol hexakisphosphate (InsP6) (Egerer, et al., 2007; Reineke, et al., 2007). This activation catalyzes the autoproteolytic release of the toxin’s cytotoxic glucosyltransferase domain from the endosomal membrane (Egerer, et al., 2007; Pfeifer, et al., 2003). The liberated effector domain then monoglucosylates small Rho family GTPases (Just, et al., 1995), resulting in loss of cell-cell junctions and ultimately cell death (Genth, et al., 2008; Gerhard, et al., 2008; Qa’Dan, et al., 2002).
CPD-mediated autoprocessing of TcdB is a critical step during target cell intoxication. Genetic inactivation of the CPD has been shown to reduce the overall function of TcdB in target cells (Egerer, et al., 2007). A homologous CPD also autoproteolytically regulates the Multifunctional Autoprocessing RTX (MARTX) toxins (Prochazkova, et al., 2009; Sheahan, et al., 2007; Shen, et al., 2009), an otherwise unrelated family of toxins produced by Gram-negative bacteria (Satchell, 2007). Structural analyses of the CPD of both families of toxins have demonstrated that the protease is allosterically regulated by the small molecule InsP6 (Lupardus, et al., 2008; Prochazkova, et al., 2009; Pruitt, et al., 2009). These analyses have also revealed that the CPD is a clan CD protease whose closest known structural homolog is human caspase-7 (Lupardus, et al., 2008). Despite their disparate mechanism of activation, V. cholerae MARTX CPD exhibits similarities in substrate recognition to the caspases (Shen, et al., 2009), except that the CPD cleaves exclusively after a leucine instead of an aspartate residue. In contrast, the molecular details of TcdB CPD substrate recognition remain uncharacterized.
In this study we used a combination of chemical synthesis and structural analyses to probe the substrate recognition and inhibitor sensitivity of the TcdB cysteine protease domain. By screening a focused library of substrate-based CPD inhibitors, we identified several compounds capable of blocking holotoxin function in cell culture. We also solved the structure of TcdB CPD bound to one of these inhibitors. Combined with the structure-activity relationship series derived from our inhibitor analyses, these results provide a foundation for the development of therapeutics targeting this important virulence factor. We further used this information to develop activity-based probes (ABPs) specific for TcdB CPD that will permit the molecular dissection of its unique allosteric activation mechanism. The information presented here may also be valuable for the study of protease domains in other bacterial toxins.
The use of peptide-based inhibitors is an effective strategy for selectively inactivating proteases through mimicry of natural substrates (Berger, et al., 2006; Kato, et al., 2005; Powers, et al., 2002). Given the importance of the CPD in regulating C. difficile glucosylating toxin function (Egerer, et al., 2007; Reineke, et al., 2007), we sought to identify inhibitors of the TcdB CPD protease. We first tested whether inhibitors specific for a related CPD found in V. cholerae MARTX (MARTXVc) toxin (Shen, et al., 2009) could also inhibit TcdB CPD function (Figure 1). These inhibitors contain tripeptide sequences coupled to either an aza-epoxide or acyloxymethyl ketone (AOMK) reactive electrophile (Figure 1A and S1). Inhibitor potency against TcdB CPD was determined using a gel-based autocleavage assay, in which inhibitor concentration was varied in the presence of the activator inositol hexakisphosphate (InsP6). The autocleavage substrate TcdB(1-804) used in this assay consists of TcdB’s N-terminal 804 amino acids and contains the gIucosyltransferase and CPD domains and the natural autoprocessing site. The aza-epoxide MARTXVc CPD inhibitors were only weakly inhibitory, with both Cbz-LLaL-EP and the related Cbz-EAaL-EP exhibiting observed IC50’s greater than 100 μM (Figure 1B). In contrast, the Cbz-EAL-AOMK inhibitor was significantly more potent, exhibiting an observed IC50 of 7.2 ± 0.7 μM. Because the primary difference between the Cbz-EAaL-EP and Cbz-EAL-AOMK inhibitors is the electrophilic reactive group, we reasoned that the AOMK electrophile is more optimal for TcdB CPD inhibition. Therefore, we synthesized a focused library of covalent AOMK inhibitors based on the natural substrate cleavage sequence of the C. difficile TcdB cysteine protease domain (CPD) (Figure 2A). These inhibitors consist of dipeptide or tripeptide sequences coupled to the dimethylbenzoic acid AOMK, an electrophilic group that has been described as optimal for targeting the structurally related clan CD proteases, the caspases (Kato, et al., 2005; Thornberry, et al., 1994). For all inhibitors, the P1 position (the residue N-terminal to the scissile bond) was held constant as leucine, since this is the primary substrate specificity determinant of bacterial CPDs (Egerer, et al., 2007; Prochazkova, et al., 2009; Shen, et al., 2009). The P2 and P3 positions, as well as the N-terminal capping group, were varied in the library.
We first probed TcdB CPD P2 specificity using a diverse set of carboxybenzyl (Cbz) capped dipeptide compounds (Figure 2B). Acidic and branched aliphatic amino acids were poorly tolerated in the P2 position, with calculated IC50’s for Cbz-EL-AOMK and Cbz-LL-AOMK of 30 and 100 μM, respectively. Inhibitors containing smaller residues such as alanine and serine were more potent, with IC50’s for Cbz-AL-AOMK and Cbz-SL-AOMK of 1.64 ± 0.08 μM and 1.55 ± 0.26 μM, respectively. Unexpectedly, compounds with basic residues in the P2 position were also potent, with an IC50 of 1.06 ± 0.07 μM observed for Cbz-KL-AOMK. We examined whether the enhanced potency of Cbz-KL-AOMK was specific to this residue or whether other amino acids with a positive charge could recapitulate this effect. Consistent with the latter interpretation, compounds with arginine and ornithine in the P2 position were also potent inhibitors (Figure 2B).
We next assessed the contribution of the carboxybenzyl (Cbz) cap to inhibitor recognition by synthesizing acetyl (Ac) and hydroxyphenyl acetyl (Hpa) capped analogs of Cbz-KL-AOMK and Cbz-SL-AOMK. While the smaller Ac group decreased potency compared to the Cbz cap for both compounds, the Hpa cap increased potency, resulting in an observed IC50 of 0.71 ± 0.05 μM for Hpa-SL-AOMK (Figure 2B). These results indicate that hydrophobic bulk in the dipeptide cap/P3 binding position contributes to TcdB CPD inhibitor potency. Notably, the most potent compound (Hpa-SL-AOMK) contains amino acids found in the natural TcdB CPD substrate (Figure 2A).
Based on these results, we surveyed the P3 specificity of the protease domain using the same technique. We synthesized a focused library of acetyl-capped tripeptide AOMK inhibitors containing the natural leucine and serine residues fixed in the P1 and P2 positions, respectively. The P3 position was varied using a diverse set of amino acids (Figure 2B). Additionally, given the favorable contribution of aromatic bulk in the dipeptide cap site, three non-natural amino acids with an aromatic side chain were included in the P3 position (Figure 2, Figure S1B). Ac-capped tripeptide inhibitors containing small or basic residues in the P3 position were the most potent, while an acidic residue in the P3 position was poorly tolerated (Ac-ESL-AOMK, Figure 2B). More diversity was tolerated in the P3 position of inhibitors containing the natural SL cleavage sequence in the P2/P1 positions. Bulky residues such as leucine were tolerated, with an IC50 of 4.95 ± 0.37 μM observed for Ac-LSL-AOMK. Homophenylalanine (hPhe) was the most potent of the inhibitors containing aromatic P3 residues, with an observed IC50 of 3.32 ± 0.26 μM. This is possibly because its methylene extension at the β-carbon position affords more flexibility in assuming productive interactions with the protease domain (Figure S1B). Conversely, aromatic bulk in the P4 position decreased inhibitor potency, since the Cbz-capped GSL-AOMK inhibitor was ~3-fold less potent than the analogous inhibitor carrying the smaller Ac cap (4.46 ± 0.47 μM vs. 1.78 ± 0.16 μM). None of the compounds in the P3 library, however, were more potent than the most potent dipeptide inhibitor, Hpa-SL-AOMK.
In order to rationalize the results of the SAR analyses and to gain structural insight into substrate recognition by TcdB CPD, we co-crystallized and solved the structure of InsP6-bound CPD covalently inhibited with the Ac-GSL-AOMK inhibitor at 2.0 Å (Table 1). Although this inhibitor was not the most potent compound identified in the screen, it reflects the natural substrate cleavage site and exhibited improved solubility over dipeptide compounds carrying aromatic caps. The overall structure of inhibitor-bound, activated TcdB CPD (Figure 3A and B) is similar to the previously solved InsP6-bound structure of TcdA CPD, which shares ~60% identity with TcdB CPD (rsmd of ~1Å) (Pruitt, et al., 2009).
As with most proteases, the substrate binding pocket of TcdB CPD can be subdivided into multiple subsites. The catalytic residues are positioned between the S1 and S1’ subsites; the numbering of the subsites reflects the corresponding substrate residue recognized, with the prime subsites interacting with substrate residues C-terminal to the scissile bond. The most striking feature of the inhibitor structure is the insertion of the P1 leucine within a deep hydrophobic S1 pocket (Figure 3C). Eight residues, seven of which are nonpolar, are within Van der Waals (4.4 Å) bonding distance of the P1 leucine sidechain. Ile589 and Ala593 are contributed by helix 1 (Figure S2); Ile651 and Gly652 are contributed by strand D; Leu696 and Gly697 are contributed by strand E; and Val744 is contributed by strand G1 of the β-flap, a functional region that is involved in both InsP6 recognition and substrate binding (Lupardus, et al., 2008; Prochazkova, et al., 2009; Pruitt, et al., 2009). Ile746 from the G1 strand of the β-flap forms the distal side of the P1 pocket away from the active site, yet falls just outside van der Waals bonding distance in our structure. It likely also contributes to P1 recognition in vivo, since the inhibitor may be pulled in slightly towards the catalytic cysteine due to the covalent nature of the modification. Since most of these residues are also conserved in the related MARTXVc CPD, which binds to and cleaves after a P1 leucine (Prochazkova, et al., 2009; Shen, et al., 2009), bacterial CPDs would appear to share a common mechanism for recognition of this primary substrate specificity determinant.
Whereas many residues in the S1 subsite participate in recognition of the P1 Leu, residues in the other subsites make minimal interactions with the inhibitor (Figure 3C). In the S2 subsite, the main chain carbonyl of Val744 forms a hydrogen bond with the P2 serine backbone amide of the inhibitor, while on the prime side the main chains of Gly654 and Cys698, which form the “oxyanion hole,” hydrogen bond with the carbonyl formed after inhibitor reaction (Figure 3C). The P2 serine sidechain points towards Glu743; in contrast, the P3 glycine is oriented such that the P3 sidechain would be exposed to solvent. Thus, the inhibitor structure of TcdB CPD reveals the mechanistic basis for substrate recognition.
To gain additional insight into the inhibitor specificity requirements of the active site, we used docking simulations to replace the Ac-GSL-AOMK in the TcdB CPD crystal structure with other compounds from the focused library. The structure of the modeled inhibitor and all sidechains within 4.5 Å were energy minimized using the default parameter of the Molecular Operating Environment (MOE) software. The docking simulations suggest an explanation for why compounds containing basic amino acids such as lysine or arginine in the P2 position were particularly potent. In these analyses, the positively charged P2 residue forms an electrostatic interaction with the acidic Glu743 residue, which helps to form the S2 pocket (Figure 4). This negatively charged subsite likely also explains why the Cbz-EL-AOMK was not as potent, as this inhibitor cannot form the same favorable electrostatic interactions during binding (Figure 4). Instead, the acidic glutamate of the inhibitor is predicted to interact with Arg745; this interaction, however, may be unfavorable given that Arg745 likely stabilizes the activated conformation of the CPD through a π-cation interaction with the highly conserved Trp761 (Pruitt, et al., 2009).
The docking studies also rationalize the increased potency of the dipeptide inhibitors capped with bulky Cbz or Hpa groups. These caps are predicted to fit into a hydrophobic groove formed between Ile746 and Ile589 such that the hydroxyl of the Hpa cap serves as the donor in a hydrogen bond with Ile746 (Figure 4). This may help favorably orient the inhibitors during binding and subsequent reaction with the catalytic Cys698.
In order to determine if the inhibitors identified in vitro were functional in cells, we assessed their ability to block the intoxication of primary human foreskin fibroblasts (HFFs) by TcdB holotoxin. Pre-treating cells with the most potent inhibitors prevented the cytopathic effects induced by recombinant TcdB holotoxin (Figure 5A). Cell rounding was quantified for select compounds by counting the number of rounded cells visible per field upon inhibitor titration (Figure 5B). The results corroborate the potency rankings observed in the initial autocleavage screen. Two of the most potent compounds, Hpa-SL-AOMK and Hpa-KL-AOMK, both prevented cell rounding. Hpa-SL-AOMK completely inhibited toxin function by 100 μM, with an observed IC50 of approximately 20 μM, while the Hpa-KL-AOMK was slightly less potent. The difference in potency between Hpa-SL-AOMK and Hpa-KL-AOMK may be due to the increased cell permeability of the P2 serine relative to the positively charged lysine. Cell permeability of the inhibitors is likely to determine their potency because CPD inhibition can only occur after toxin entry into cells, when the CPD can become activated by InsP6. In contrast, the negative controls Cbz-EL-AOMK and the epoxide-based pan-cathepsin inhibitor JPM-OEt (Bogyo, et al., 2000; Greenbaum, et al., 2000) both failed to inhibit toxin function just as they poorly inhibit CPD activity (Figure 2B and S1). Surprisingly, Cbz-KL-AOMK and to a lesser extent Cbz-SL-AOMK were found to be cytotoxic (Figure S3). Given that no toxicity was observed for the Hpa-capped analogs, these findings implicate the carboxybenzyl cap in affecting cell viability.
We confirmed that inhibition of the TcdB CPD directly prevented holotoxin effector domain activity by monitoring glucosylation of its cellular target Rac1, a Rho GTPase (Genth, et al., 2006; Yang, et al., 2008). Addition of wildtype toxin to cells resulted in complete glucosylation of Rac1, while pretreatment with either Hpa-SL-AOMK or Hpa-KL-AOMK was protective (Figure 5C). Importantly, inhibition of cathepsins using JPM-OEt had no effect on toxin function; JPM-OEt was used as a control since P1 Leu AOMKs have previously been shown to weakly cross-react with the cathepsins (Kato, et al., 2005).
The activation of the CPD by the eukaryotic-specific small molecule InsP6 is a critical step in regulating the function and trafficking of C. difficile glucosylating toxins (Egerer and Satchell, 2010; Shen, 2010). In order to facilitate more detailed studies of this important process, we created fluorescently-tagged and biotin-tagged probe versions of the SL-AOMK inhibitor (AWP19 and AWP15, respectively) to visualize toxin activation (Figure 6A). Both AWP19 and AWP15 covalently label the recombinantly produced TcdB(1-804) autocleavage substrate in response to small molecule activation of the protease by InsP6 (Figure 6B). This substrate represents the toxin region predicted to translocate into the cytosol during intoxication. Notably, the cleavage product TcdB(544-804) formed after proteolytic cleavage was more active than the full-length substrate, as shown by the increase in activity-based probe (ABP) labeling relative to the amount of protein present (Figure 6B and C). This observation suggests that the isolated CPD domain TcdB(544-804) is either more accessible or more reactive with the probe than the full-length substrate. As expected, the ABPs failed to label the catalytically inactive C698A mutant of TcdB(1-804) in the presence or absence of InsP6.
We also determined that the ABPs could label functional recombinant holotoxin, which was either produced in E. coli or in the Bacillus megaterium expression system (Pruitt, et al., 2010; Yang, et al., 2008) (Figure 6C). The B. megaterium system produces a His6-tagged TcdB holotoxin that is more pure than the native toxin purified from C. difficile culture supernatants (Genth, et al., 2006; Yang, et al., 2008). Both probes could sensitively detect active CPD within the toxin, since the cleaved TcdB(544-2366)-His6 fragment could be labeled by the probe even though it is not detectable by either Coomassie or anti-His6 antibody-conjugated horseradish peroxidase. Titration of probe labeling confirmed the potency of the probes, with AWP19 labeling active holotoxin at probe concentrations below 50 nM (Figure S4). This level of sensitivity will be valuable for tracking CPD activation during toxin trafficking, as it permits the labeling of ensembles of toxin while minimizing the risk of completely inhibiting toxin function.
Monitoring toxin trafficking is further enabled by the cell permeability of AWP19’s near-infrared fluorescent cyanine 5 fluorescent tag, which allows the probe to be used in intact cells. In order to check for off-targets of this probe, we incubated both primary HFFs and the RAW macrophage cell line with AWP19. In both cell types, the only off-target the probe labeled was cathepsin B, as confirmed by comparison with a pan-cathepsin ABP (Figure S5) (Paulick, 2010). Similarly, pre-treatment of both cell types with Cbz-SL-AOMK prevented cathepsin B labeling by the pan-cathepsin probe, but not cathepsin L or X labeling. Thus, cathepsin B would appear to be the primary off-target TcdB CPD inhibitors. Nevertheless, cathepsin B inhibition was not sufficient to reduce TcdB toxin function, since the pan-cathepsin inhibitor JPM had no effect on toxin-induced cell rounding or Rac1 glucosylation (Figure 5). This result strongly suggests that the observed reduction in TcdB glucosylating activity in target cells upon treatment with CPD inhibitors is due to inhibition of CPD function.
The rising rate of C. difficile infections necessitates the development of new classes of therapeutics to combat this pathogen. Because of its natural antibiotic resistance, there has been increased focus on targeting the glucosylating toxins TcdA and TcdB for direct therapeutic intervention, as they are the primary mediators of C. difficile pathogenesis (Halsey, 2008; Kelly and LaMont, 2008; Rupnik, et al., 2009). In this study we present the first validation that the TcdB cysteine protease domain (CPD) is a druggable target. Although inhibition of this protease active site is difficult because the small molecule is competing with an intramolecular autoproteolytic event, our findings are encouraging for the development of competitive inhibitors for the TcdB CPD. The most potent compound in our library is the 499 Da capped dipeptide inhibitor Hpa-SL-AOMK (Figure 2), which is within the size constraints generally accepted for therapeutics (Lipinski, 2000). In addition, the minimal interaction between the protease and inhibitor peptide backbone (Figure 4) suggests that inhibitors with non-peptidic scaffolds can be developed to bypass the pharmacokinetic shortfalls of peptidic compounds.
Our rational approach to probing the inhibitor sensitivity of the CPD active site using structural analysis and a focused library of substrate-based compounds yielded multiple inhibitors capable of blocking holotoxin function (Figure 5). These analyses produced the unexpected observation that inhibitors containing basic P2 or P3 residues and bulky hydrophobic N-terminal caps could potently block TcdB autoprocessing. This SAR profile provides a starting point for the development of compounds suitable for therapeutic applications. We note that relying solely on substrate specificity profiling for these domains may not have produced such promising results, since potent substrates do not always translate into viable inhibitors (Drag, et al., 2010). Furthermore, fluorogenic substrate cleavage assays lack sensitivity in detecting bacterial CPD activity because these autoprocessing enzymes exhibit poor transcleavage efficiency (Babe and Craik, 1997; Lupardus, et al., 2008). Our attempts to develop optimized substrates for TcdB CPD produced substrates with poor Km values (~1 mM, data not shown). The approach described here may also prove applicable to other protease domain-containing bacterial toxins (Lebrun, et al., 2009).
Information about the inhibitor sensitivity profile of TcdB CPD was bolstered by our crystal structure of the enzyme bound to the Ac-GSL-AOMK inhibitor (Figure 3). This structure permitted the molecular docking studies that helped rationalize the increased potency of inhibitors with basic P2 residues and bulky hydrophobic N-terminal caps (Figure 4). Furthermore, given the overall similarity between the inhibitor structure of InsP6-bound TcdB CPD presented here and InsP6-bound TcdA CPD (Pruitt, et al., 2009), many of our findings may be translatable to inhibiting this closely related toxin. Our most efficacious inhibitor Hpa-SL-AOMK will likely exhibit similar potency against TcdA due to its identical P1/P2 substrate sequence (Figure 2A), increasing its value as a C. difficile virulence-targeting agent.
This crystal structure also permits comparisons to be made with the inhibitor-bound structure of the related CPD of Vibrio cholerae MARTX toxin (Shen, et al., 2009). Both inhibitor structures reveal that the primary substrate specificity determinant for these proteases is the P1 leucine, and residues involved in recognizing this leucine are conserved in both proteases. The inhibitor structures differ in that MARTXVc CPD makes more backbone interactions with its inhibitor than TcdB CPD and in the S1′ subsite, the region that recognizes residues C-terminal to the scissile bond. For MARTXVc CPD, this region is relatively flat and featureless, whereas in TcdB CPD, Asp656 and Glu657 directly extend into this region and may thus occlude substrate or inhibitor binding (Figure S6). This acidic extension may explain why such a large difference in potency was observed between the Cbz-EAL aza-epoxide and AOMK derivatives for TcdB CPD (Figure 1), while both warheads inhibit MARTXVc CPD to a similar extent (Shen, et al., 2009).
The CPDs of both the C. difficile large gluosylating and MARTX toxin families appears to “sense” the eukaryotic cell environment and activate toxin function accordingly (Egerer and Satchell, 2010; Shen, 2010). However, this unique allosteric activation process is difficult to study using traditional biochemical approaches because it is post-translationally regulated. Our activity-based probes (ABPs) overcome this problem because they afford the direct visualization of CPD activation by InsP6 in complex mixtures and possibly in vivo. Furthermore, the probes provide a robust read-out for CPD activity that could be used in screening applications for competitive (Schneider and Craik, 2009) or allosteric inhibitors (Lee and Craik, 2009) of the CPD, in lieu of fluorogenic substrates (which react poorly with the protease) or autocleavage assays (which are less sensitive).
Because of the covalent nature of these probes, they provide a direct readout of when the toxin has encountered InsP6. This is valuable within the context of studies directed at dissecting toxin trafficking and the molecular mechanisms underlying the allosteric regulation of toxin function by InsP6 (Giesemann, et al., 2008; Jank and Aktories, 2008). For example, it is notable that the cleaved form of TcdB CPD is more effectively labeled by the ABP than the full-length protein (Figure 6). This may imply that the CPD is held in an inhibitory conformation within the native holotoxin and that autoproteolytic cleavage relieves this inhibition. Alternatively, the active site may be more accessible to the ABP following cleavage, which would suggest that the conformation of full-length TcdB in vitro occludes substrate binding. C. difficile glucosylating toxins undergo significant conformational rearrangements during the pH-dependent toxin translocation process (Pruitt, et al., 2010; Qa’Dan, et al., 2000). Based on our observation that cleaved TcdB toxin (aa 544-2366) was also more readily labeled by the probe in the presence of InsP6 (Figure 6C), it is tempting to speculate that the CPD is subject to additional regulation at the level of toxin conformation. Further studies using these promising tools will provide a more detailed understanding of the regulation of the CPD by InsP6 and in the context of the full-length toxin.
The large glucosylating toxins TcdA and TcdB are the primary virulence factors of the antibiotic-resistant bacterium Clostridium difficile. These toxins are autoproteolytically activated by an internal cysteine protease domain (CPD) in a step that is critical for toxin function. We synthesized a focused library of substrate-based compounds in order to determine a structure-activity relationship for TcdB CPD inhibitors and then gained further insight by co-crystallizing the domain with one of these inhibitors. This rational approach yielded compounds potent enough to inhibit toxin function in cell culture, validating the clostridial glucosylating toxins as druggable targets. Our results provide a promising starting point for the development of therapeutics that minimize the selective pressure on C. difficile to develop resistance. We also used the inhibitor data to develop covalent activity-based probes (ABPs) that can directly measure the allosteric activation of the protease by the small molecule inositol hexakisphosphate. Because these ABPs monitor the post-translational activation of the toxin, they will be useful in studies directed at understanding this unique regulatory mechanism in both biochemical and cell-based assays.
The aza-leu epoxide inhibitors were synthesized in solution using standard chemistry as previously described (Asgian, et al., 2002). The AOMK inhibitors were synthesized using solid-phase synthesis as previously described (Kato, et al., 2005). The activity-based probe AWP19 was synthesized by combining H2N-aminohexanoic-SL-AOMK (1 equiv.) with Cy5-NHS (0.9 equiv.) and DIEA (5 equiv.) in DMSO for one hour and then purifying directly by HPLC. The identity and purity of all compounds was characterized using HR-MS and LCMS.
An overnight culture of pET28a-TcdB1–804 was diluted 1:500 into 4 L 2YT media and grown shaking at 37ºC. When an OD600 of 0.6–0.9 was reached, IPTG was added to 250 μM, and cultures were grown for 3 hr at 225 rpm at 30ºC. Cultures were pelleted, resuspended in 60 mL lysis buffer [500 mM NaCl, 50 mM Tris-HCl, pH 7.5, 15 mM imidazole, 10% glycerol] and flash frozen in liquid nitrogen. Lysates were thawed, then lysed by sonication and cleared by centrifugation at 15,000 × g for 30 minutes. C-terminally His6-tagged TcdB(1-804) was affinity purified by incubating the lysates in batch with 2.0 mL Ni-NTA Agarose beads (Qiagen) with shaking for 3 hr at 4ºC. The binding reaction was pelleted at 1,500 × g, and the pelleted Ni-NTA agarose beads were washed 3 x with lysis buffer. His6-tagged CPD was eluted from the beads by the addition of 400 μL high imidazole elution buffer [500 mM NaCl, 50 mM Tris-HCl, pH 7.5, 175 mM imidazole, 10% glycerol]. The elution was repeated three times; the eluate was pooled, buffer exchanged in gel filtration buffer [200 mM NaCl, 10 mM Tris pH 7.5, 5% glycerol), and concentrated to 750 μL. The concentrated prep was pelleted at 13,000 × g for 10 min at 4 °C prior to loading on a HiPrep S200 16/60 Sephacryl column (GE Healthcare). Purified His6-tagged CPD was concentrated and stored at −20ºC in gel filtration buffer.
Recombinant TcdB(1-804) was diluted to a final concentration of 0.5 μM in assay buffer [60 mM NaCl, 20 mM Tris pH 7.5, 250 mM sucrose] in a 96-well plate. 0.5 μL of a 100X inhibitor stock in DMSO was then added to each well in triplicate, and the samples were incubated at 37 °C for 30 minutes. Inositol hexakisphosphate (0.5 μL, InsP6, Calbiochem) was then added to a final concentration of 25 μM, and the reaction was incubated at 37 °C for 1 hour. Samples were then diluted in 4X SDS-PAGE loading buffer and resolved by SDS-PAGE on 12% gels. Cleavage reactions were visualized by Coomassie staining and quantified using the program ImageJ (http://rsb.info.nih.gov/ij/). For each sample, the amount of autocleaved protein relative to the total protein amount was plotted versus the concentration of inhibitor and globally fit using the sigmoidal function in KaleidaGraph.
An overnight culture of pET22b-TcdB544–797 was diluted 1:500 into 3 L 2YT media and grown shaking at 37ºC. When an OD600 of 0.6–0.9 was reached, IPTG was added to 250 μM, and cultures were grown for 12–16 hr (225 rpm) at 18–20ºC. Cultures were pelleted, resuspended in 50 mL lysis buffer [500 mM NaCl, 50 mM Tris-HCl, pH 7.5, 15 mM imidazole, 10% glycerol] and flash frozen in liquid nitrogen. Lysates were thawed, then lysed by sonication and cleared by centrifugation at 15,000 × g for 30 minutes. TcdB544–797 was purified as described above except that it was concentrated to 1 mM, and the gel filtration buffer was 150 mM NaCl, 10 mM Tris pH 7.5. Gel filtration purified TcdB(544-797) was treated with 2 mM InsP6 and 2 mM Ac-GSL-AOMK (inhibitor stock was at 200 mM in DMSO). The inhibitor was added slowly due to poor solubility; reaction with the protease improved inhibitor solubility. The reaction was allowed to proceed for 1 hr at room temperature after which excess inhibitor was pelleted by centrifuging the reaction at 13,000 × g for 10 min at 4 °C. Crystallization screening was carried out using the sitting drop vapor-diffusion method and initial hits were observed in 0.1 M Tris HCl pH 8.0 and 30% PEG2000 MME as the precipitant. Crystals used for data collection were grown in 1 μL drops by mixing equal volumes of protein with mother liquor and appeared only after ~45–60 days. Crystals were cryoprotected in 45% PEG2000 MME, flash frozen in liquid nitrogen, and data collected under cryo-cooled conditions at 100 K at beamline 8.2.1 at the Advanced Light Source (UC-Berkeley). Diffraction data were processed using MOSFLM (Leslie) and SCALA (Potterton, et al., 2003), and processing statistics are listed in Table 1.
Initial phases were obtained by molecular replacement with PHASER (McCoy, et al., 2007) using the structure of the TcdA CPD (PDB ID 3HO6) as a search model (Pruitt, et al., 2009). Using the molecular replacement phases, the TcdB CPD was built by ARP/wARP (Perrakis, et al., 1999) to approximately 85% completeness, followed by rounds of model building and adjustment with COOT (Emsley and Cowtan 2004) and refinement with PHENIX (Adams, et al., 2002). Restraints for the Ac-GSL molecule were obtained from the PRODRG server (Schuttelkopf and van Aalten, 2004). The final model underwent restrained and translation/libration/screw refinement in REFMAC5 (Murshudov, et al., 1997), resulting in a final R/Rfree values of 18.8% and 23.3%. Ramachandran analysis of model geometry by Molprobity (http://molprobity.biochem.duke.edu/) indicates that 99.0% of residues reside in the most favorable regions, with none in the disallowed regions. Refinement statistics can be found in Table 1. Structural figures were prepared with Pymol (DeLano, 2002). The final model contains two TcdB CPD molecules in the asymmetric unit (ASU), with each bound to one InsP6 molecule, one Ac-GSL inhibitor molecule, two sodium ions and a calcium ion bridging the crystal contact between the molecules in the ASU. Chain A of the Tcd CPD is used for all figures in the paper.
The different homology models of TcdB CPD were built from the crystal structure of the protease bound of Ac-GSL-AOMK using the default parameters of the Molecular Operating Environment (MOE) software. In each model, the covalent linkage to Cys698 and the position of leucine in the P1 pocket were initially fixed as those of the crystal structure. The N-terminal cap and P2 side chain were manually built into the active site of the protease. Energy minimization was performed first on the modified region of the inhibitor and all side chains within 4.5 Å, and second on the entire inhibitor and all side chains within 4.5 Å.
Primary human foreskin fibroblasts (HFFs) were seeded in 96-well plates at a density of 1–2 × 104 cells/well in DMEM supplemented with 10% FBS. Prior to the assay the cells were washed three times with 100 μL DMEM alone. One microliter of a 100X inhibitor stock in DMSO was then added to each well in triplicate, and the cells were incubated at 37 °C for 30 minutes. Recombinant TcdB holotoxin expressed in E. coli was then added to the cells at a final concentration of 0.3 pM. The samples were incubated at 37 °C for 2 hours and then imaged using a 20X objective on an inverted microscope. Four fields were imaged per well, and the average number of rounded cells per field was calculated.
HFF cells were seeded into 24-well treated plates (7.5 × 105) and grown to 100% confluency overnight, washed once with pre-warmed DMEM, and left in 0.25 mL DMEM per well. The indicated concentration of inhibitor was added as a 1:100 dilution from a DMSO stock and incubated for 30 min. Recombinant TcdB holotoxin purified from E. coli was added to the cells (0.3 pM) and incubated for 90 min at 37 °C. The media was removed and then the cells were lysed in 25 μL 1 X FSB by scraping the cells in concentric circles. Cell lysates were boiled for 5 min at 95 °C, and 15 μL of lysate was resolved on a 14% SDS-PAGE gel and transferred to nitrocellulose. Unglucosylated Rac1 was detected using a 1:1000 dilution of mAb102 (Millipore) and a 1:5000 dilution of anti-mouse IgG HRP (Biorad). Actin was simultaneously visualized using a polyclonal anti-actin antibody at 1:2000 dilution (Sigma), and a 1:5000 dilution of anti-rabbit IgG HRP (Biorad).
Overnight cultures of pET28a-TcdB wildtype or C698A holotoxin were diluted 1:500 into 3 L 2YT media and grown shaking at 37ºC. C-terminally His6-tagged holotoxin was purified as described for His6-tagged TcdB(1-804) with the exception that β-mercaptoethanol was added to the lysis buffer at 2 mM.
Wildtype and C698A TcdB 1-804 (0.5 μM) were incubated with 10 μM of the indicated probe. InsP6 was added to a final concentration of 25 μM (1:100 dilution) in a total volume of 50 μL where indicated, and probe labeling proceeded for 1 hr at 37 °C. Fifteen microliters of 4X final sample buffer was added and then the sample was boiled for 3 min at 95 °C. The samples were resolved on a 10% gel by SDS-PAGE. For AWP19 labeling, fluorescent labeling was visualized using a fluorescent scanner followed by Coomassie staining. For AWP15 staining, the sample was loaded in duplicate, and one set was visualized by Coomassie staining (5 μL sample loaded), while the other set was transferred to nitrocellulose (2.5 μL sample loaded) and blotted using streptavidin-HRP (Sigma) at 1:3000.
For labeling of TcdB holotoxin purified from E. coli, wildtype and C698A TcdB was diluted to 0.3 μM in assay buffer and then the indicated probe was added to 10 μM (1:100 dilution from DMSO stock) in a total volume of 15 μL. InsP6 was added at a final concentration of 25 μM where indicated. For labeling of TcdB holotoxin purified from B. megaterium (a gift from R. Pruitt and D.B. Lacy), 0.5 μM of toxin was diluted in CPD buffer and then 10 μM of the indicated probe was added (1:100 dilution from DMSO stock). InsP6 was then added at a final concentration of 25 μM. Alternatively, InsP6 (25 μM) was added to the B. megaterium produced toxin (0.5 μM) and incubated for 1 hr at 37 °C and then the indicated probe was added at 10 μM final concentration. Following probe addition, the labeling reactions were allowed to proceed for 1 hr at 37 °C after which 5 μL of 4 X FSB was added, and the samples were boiled for 3 min at 95 °C. For AWP19 labeled samples, 15 μL was resolved on a 10% gel and then visualized by fluorescence scanning followed by Coomassie staining. For AWP15 labeled samples, the samples (either 5 or 2.5 μL) were loaded in duplicate and resolved on a 10% gel, then transferred to nitrocellulose. The membranes were probed with Streptavidin-HRP (Sigma) at 1:3000 (5 μL sample loaded) or with an anti-His antibody (Pierce) and anti-rabbit IgG HRP (Biorad) at 1:10,000 (2.5 μL sample loaded).
We thank D. Borden Lacy and Rory Pruitt (Vanderbilt University) for the generous gift of TcdB toxin produced in B. megaterium and Elaine Hamm and Jimmy Ballard (Oklahoma Health Sciences University) for Clostridium difficile VPI 10463 genomic DNA. A.W.P. is supported by an NSF Graduate Research Fellowship. P.J.L. is a Damon Runyon Fellow, supported by the Damon Runyon Cancer Research Foundation. K.C.G. is supported by the Keck Foundation and the Howard Hughes Medical Institute. M.B. is supported by the Burroughs Wellcome Foundation and NIH grants R01EB005011 and R01AI078947. A.S. is supported by an NIH National Institutes of General Medical Sciences 1-K99-GM092934-01. Coordinates and structure factors have been deposited in the Protein Data Bank (www.rcsb.org) under accession number XXXX.
Publisher's Disclaimer: This is a PDF file of an unedited manuscript that has been accepted for publication. As a service to our customers we are providing this early version of the manuscript. The manuscript will undergo copyediting, typesetting, and review of the resulting proof before it is published in its final citable form. Please note that during the production process errors may be discovered which could affect the content, and all legal disclaimers that apply to the journal pertain.