|Home | About | Journals | Submit | Contact Us | Français|
M. tuberculosis ArfA (Rv0899) is a membrane protein encoded by an operon that is required for supporting bacterial growth in acidic environments. Its C-terminal domain (C domain) shares significant sequence homology with the OmpA-like family of peptidoglycan-binding domains, suggesting that its physiological function in acid stress protection may be related to its interaction with the mycobacterial cell wall. Previously, we showed that ArfA forms three independently structured modules and we reported the structure of its central domain (B domain). Here we describe the high-resolution structure and dynamics of the C domain, we identify ArfA as a peptidoglycan-binding protein, and elucidate the molecular basis for its specific recognition of diaminopimelate (DAP) type peptidoglycan. The C domain of ArfA adopts the characteristic fold of the OmpA-like family. It exhibits pH-dependent conformational dynamics (with significant hereogeneity at neutral pH and a more ordered structure at acidic pH), which could be related to its acid-stress response. The C domain associates tightly with polymeric peptidoglycan isolated from M. tuberculosis and also associates with a soluble peptide intermediate of peptidoglycan biosynthesis. This enabled us to characterize the peptidoglycan binding site where five highly conserved ArfA residues, including two key arginines, establish the specificity for DAP- but not Lys-type peptidoglycan. ArfA is the first peptidoglycan-binding protein to be identified in M. tuberculosis. Its functions in acid stress protection and peptidoglycan binding suggest a link between the acid stress response and the physico-chemical properties of the mycobacterial cell wall.
Mycobacterium tuberculosis, the causative agent of tuberculosis, infects approximately one third of the world’s population and kills two million people each year. Its high resistance to many drugs is partly due to the nature of its cell envelope, a highly complex structure composed of lipids, polysaccharides, and proteins, that protects the organism from external stress and harmful compounds 1; 2. The mycobacterial envelope consists of an inner lipid bilayer plasma membrane and a cell wall, formed by a peptidoglycan-arabinogalactan polymeric network occupying the periplasmic space and an outer lipid membrane enriched in mycolic acids covalently linked to the arabinogalactan layer 3; 4; it is further enclosed by an exterior capsular layer of glycans, lipids and proteins 5. Most antibiotics developed for M. tuberculosis target the cell envelope, therefore identifying its components and understanding how they interact to provide a mechanically strong, highly impermeable barrier, that also maintains communication with the outside world, is important for identifying new drug targets 1; 6.
Rv0899 (also known as OmpATb) has been proposed to function as an outer membrane porin that protects M. tuberculosis against acid stress, based on its sequence homology with E. coli OmpA, which forms a membrane-spanning β-barrel in the bacterial outer membrane 7; 8; 9; 10. However, the three-dimensional structure of Rv0899 does not contain a transmembrane β-barrel and is not compatible with porin function 11; 12. Indeed, recent studies showed that Rv0899 is not a porin 13. Rather, the rv0899 gene is part of an operon, including rv0900 and rv0901, also encoding membrane proteins 13; 14, that is required for fast ammonia secretion, rapid pH neutralization and growth of M. tuberculosis in acidic environments 13. Genes with sequences and genetic context similar to rv0899, rv0900 and rv0901, are widespread in bacteria that specialize in nitrogen fixation or metabolism, adaptation to nutrient poor environments, and/or establishing symbiosis with the host organism 15, further supporting their function in M. tuberculosis ammonia secretion.
In light of these findings, we propose to name the rv0899-rv0901 operon arf (ammonia release facilitator)* and its three genes, arfA, arfB, and arfC. In mycobacteria, the arf operon is found exclusively in organisms associated with tuberculosis (M. tuberculosis, M. bovis) and other diseases (M. marinum, M. ulcerans, M. kansasii), suggesting that it may have a role in pathogenicity and be an attractive candidate for the development of chemotherapeutic agents. However, the mechanism whereby ArfA (Rv0899), ArfB, (Rv0900) and ArfC (Rv0901) contribute to physiological function is not known.
Previously, we showed that ArfA forms three independently structured domains and reported the high-resolution structure of its central domain 12. The N-terminus (M domain; residues 1–80) includes a membrane-anchoring sequence of 20 hydrophobic amino acids (residues 28–50) that is required for membrane translocation but is not cleaved 10. The central region (B domain; residues 81–195) contains two consecutive repeats of the BON (Bacterial OsmY and Nodulation) domain (pfam04972), a conserved, putative lipid binding sequence that is found in some bacterial osmotic shock protection proteins and nodulation specificity proteins 16. Finally, the C-terminal region (C domain; residues 196–326) shares significant sequence homology with the OmpA-like superfamily (pfam00691) of peptidoglycan-binding proteins, including the C-terminus of the outer membrane protein OmpA from E. coli. OmpA-like domains are ubiquitous in Gram negative bacteria and also found in some Gram positive bacteria 17, however ArfA appears to be the only OmpA-like protein of M. tuberculosis.
Owing to its homology with OmpA, most functional studies have focused on establishing the activity of ArfA as a porin, while its potential interaction with the peptidoglycan layer has not received attention. However, the physiological functions of ArfA in facilitating bacterial growth at acidic pH may well be related to its interactions with the cell wall. Indeed, the cell envelope plays an important role in bacterial adaptation and survival, and recent studies of M. tuberculosis acid stress response genes indicate that the same mechanisms involved in acid resistance are also involved in cell wall functions and provide protection against other forms of extracellular stress (reviewed by Ehrt and coworkers 18).
The peptidoglycan layer is a complex polymer of cross-linked sugar and polypeptide units, that provides both the strength and plasticity required to maintain cell shape, withstand the high osmotic pressure of the protoplast, and enable bacterial growth 1; 2; 19; 20. It also regulates molecular diffusion across the cell, participates in the regulation of cell division, and provides an anchoring scaffold for cell-envelope lipids, proteins and polysaccharides. In this study, we identify ArfA as the first known peptidoglycan-binding protein of M. tuberculosis, and describe both the high-resolution solution structure and dynamics of its C domain (ArfA-c) as well as the binding site responsible for the specific recognition of the peptide stem of M. tuberculosis peptidoglycan. The function of ArfA in acid adaption may be related to its peptidoglycan-binding ability, conferring structural strength to the bacterial cell envelope under acid or other stress conditions.
The C domain of wild-type ArfA folds into four β-strands and four α-helices, arranged in the topological order αβαβαβαβ (Fig. 1a). Three parallel (β1, β2, β3) and one antiparallel (β4) β-strands form a four-stranded β-sheet (β1–β4-β2–β3) that packs against three α-helices (α1, α2, α3), while a fourth helix (α4) extends from the N-terminus of β4. A disulfide bond between C208 and C250 connects the N-terminus of α1 to the C-terminus of α2. All ArfA orthologs from mycobacteria, as well as from Kribbella and α-proteobacteria, have Cys residues at similar positions, suggesting that the disulfide bond is conserved in all their structures 15.
The disulfide bond stabilizes the structure (see below) but is not required to maintain the overall fold, and the protein can be reversibly reduced and oxidized by adding reducing DTT (dithiotheritol) or oxidizing GSSG (glutathione disulfide). Indeed, the 1H/15N NMR spectra obtained after treatment with DTT or GSSG show only minor changes in peaks from residues located near C208 and C250, while peaks from the rest of the protein are not affected (Fig. S1a)§. The structure is held together by a network of hydrophobic contacts among side chains in α1, α2 and β4 (L211, I215, V243, L247, Ile323 and V325) and is further stabilized by a hydrogen bond between the backbone amide of V325 and the side chain carbonyl of Q212.
The absence of NMR peaks from residues G226 to E238 in the loop linking β1 to α2 suggests that the loop undergoes conformational exchange between two or more states on an intermediate timescale relative to NMR, and precluded complete structure determination. Although several of these peaks manifest upon acidification (Fig. S1b), they remain significantly broadened, and no 1H/1H NOEs can be observed to connect residues G226 to E238 with other polypeptide sites, resulting in significant conformational heterogeneity in the β1-α2 loop. This is in contrast to a recent structural study, which also noted the pH dependence of these peaks but, nevertheless, reported a hybrid pH7.2 / pH3.5 solution structure similar to ours, albeit with a fully ordered loop 11.
Upon closer examination of wild-type ArfA we noted that a negatively charged residue (D236) at the start of helix α2 clashes with several hydrophobic residues in the β1-α2 loop (L232), in α2 (L240, A244), in α3 (V281, L285), and in β1 (I223, F225). Indeed, when we mutated D236 to Ala, all of the β1-α2 loop peaks were present at neutral pH (Fig. S2a), and several NOEs connecting the loop to α3 and to the β-sheet could be measured, enabling us to determine a high resolution structure that is very well defined over the entire length of the protein sequence (Fig. 1b). In the structure of the D236A mutant, A236 packs comfortably against L232, I233, L240 and V281, confirming that the negative charge of D236 disrupts this hydrophobic cluster in the wild-type protein at neutral pH, leading to conformational heterogeneity in the β1-α2 loop. Furthermore, the N-terminus of α2 in the mutant is longer by one turn and the β1-α2 loop forms a flap, hinged at the C-terminus of β1 and the N-terminus of α2, that folds over the β-sheet and closes at α3 (Figs. 1b, 1c).
To determine whether the β1-α2 loop adopts a similar, albeit dynamically disordered, conformation in wild-type ArfA at pH7, we mutated L232, in the middle of the loop, to Gly, and examined its effects on other structured regions of the protein by mapping the peak changes in the 1H/15N NMR spectrum of ArfA(L232G). The L232G mutation in the loop causes several significant (≥ 0.03 ppm) changes in peaks from residues in α3 and in the β-sheet (Fig. S2b). In ArfA(D236A), the loop packs against the β-sheet and the presence of strong NOEs establishes close contacts between the methyl protons of L232 and the amide protons of A224 and V281. Since the peaks from A224 and V281 are significantly perturbed by the L232G mutation, we conclude that the loop of wild-type ArfA-c is not dissociated from the rest of the polypeptide at pH7 but maintains close contact with the globular structure, notwithstanding a significant degree of conformational exchange.
To examine backbone dynamics, we measured heteronuclear 1H/15N NOEs for both wild-type and D236A mutant ArfA-c, at neutral and acidic pH (Fig. 1d). The dynamics of isolated ArfA-c are similar to those observed in the connected B and C domains 12. For both wild-type protein and D236A mutant, negative NOEs, reflecting rapid (ps-ns) backbone motions, are observed at the N-terminus corresponding to the flexible inter-domain linker. Beyond the N-terminus, both wild-type ArfA-c at acidic pH, and D236A mutant at neutral pH, exhibit very similar, positive values (~0.8) of the 1H/15N NOE, reflecting a rigid backbone for the entire protein sequence. Slightly, but visibly lower 1H/15N NOEs for residues in the loops (β1-α2; β2-α3; β3-α4) indicate the presence of some restricted motions and may reflect the importance of the loops in protein/ligand interactions.
To further dissect the stabilizing effects of acidic pH and of the C208–C250 disulfide bond, we performed DSF (differential scanning fluorimetry) studies comparing the thermal stability of wild-type and mutant ArfA, under various conditions of pH and oxidation state. After treatment with DTT, ArfA-c undergoes thermal unfolding at 60.1°C, while disulfide bond formation by oxidation with GSSG has a significant stabilizing effect and increases the melting temperature to 63.2°C (Fig. 2c). A similar effect is observed for the joint B and C domains (ArfA-bc) where the disulfide bond provides about 1°C of stabilization (Fig. 2a), while the melting temperature of ArfA-b, which does not contain Cys, is unaffected by oxidation state (Fig. 2b). Notably, ArfA-b has a significantly higher melting temperature (67.4°C) than either the C or combined BC domains, reflecting its overall greater content of secondary structure relative to loops 12.
Lowering the pH from 7 to 4 further increases the temperature of thermal unfolding of ArfA-c by 2°C (from 63.2°C to 65.0°C; Fig. 2d). The effect of pH is mirrored by the D236A mutation, which increases the unfolding temperature of ArfA-c at pH7 by a similar amount (from 63.2°C to 65.2°C; Fig. 2d). We conclude that the stabilizing effect of acidic pH on the conformational exchange of the β1-α2 loop of ArfA-c is primarily through neutralization of the negative charge of D236, which enables α2 and the loop to pack against α3 and the β-sheet. Whether this pH-dependent conformational heterogeneity correlates with the pH-dependent physiological function of ArfA is not known.
While the structure of M. tuberculosis ArfA-b was unprecedented in the database, ArfA-c shares the same βαβαβαβ core structure and significant amino acid conservation with other domains of the OmpA-like family (Fig. S3), some of which have been shown to bind peptidoglycan 21; 22; 23. To determine whether ArfA also shares this function we tested its ability to associate with intact peptidoglycan isolated from M. tuberculosis.
The peptidoglycan of M. tuberculosis (Fig. 3a) is composed of linear chains of GlcNAc (N-acetyl-D-glucosamine) and Mur (muramic acid) that can be both N-acetylated (MurNAc) and N-acylated with glycolic acid (MurNGlyc) 1; 2; 19; 20. The sugars are substituted with a heavily cross-linked peptide, containing L-Ala, D-γ-Glu, m-DAP (meso-diaminopimelate), and D-Ala, similar to the composition found Gram-negative bacteria (e.g. E. coli) and Gram-positive Bacillus, but distinct from the peptidoglycan that is more typically found in Gram-positive bacteria, where m-DAP is usually replaced by L-Lys. OmpA-like proteins have been identified exclusively in organisms with DAP-type peptidoglycan 17, suggesting specificity for DAP in their structures.
Incubation of soluble ArfA polypeptides with insoluble polymeric M. tuberculosis peptidoglycan caused a significant amount of ArfA-bc, ArfA-c and ArfA-c(D236A) to separate with the insoluble fraction after centrifugation, while all three proteins remained in the supernatant in the absence of peptidoglycan (Fig. 4a). In contrast, ArfA-b2 (residues 73–197) remained primarily in the supernatant both in the presence and absence of peptidoglycan. We conclude that ArfA binds polymeric M. tuberculosis peptidoglycan via its C domain. No appreciable difference (within the limits of this pull-down assay) was detected between the peptidoglycan-binding abilities of wild-type ArfA-c and ArfA-c(D236A), indicating that the mutant is active in this respect and that conformational heterogeneity in the β1-α2 loop does not affect peptidoglycan-binding.
To further characterize the peptidoglycan binding site of ArfA we examined its interaction with a soluble intermediate of peptidoglycan biosynthesis, UMDP (also known as Park's nucleotide, Fig. 3b), similar to the peptide found in association with H. influenzae Pal 21. Addition of UMDP to 15N-labeled ArfA-c(D236A) produced several peak frequency and intensity changes in the 1H/15N NMR spectrum of the protein (Figs. 4b). The changes are specific as they map to a surface cavity formed by residues in the β1-α2 and β2-α3 loops, the N-terminus of helix α3 and the C-terminus of helix α4 (Fig. 5), similar to the binding site identified for Pal 21. Similar effects of UMDP were observed for wild-type ArfA-c at both pH7 and pH4. This is consistent with the observation that both wild-type and D236A mutant ArfA-c bind intact peptidoglycan (Fig. 4a), indicating that the acid-dependent structural heterogeneity of the β1-α2 loop does not affect peptidoglycan binding by ArfA. The most striking peak perturbations are observed for residues with high sequence conservation in the OmpA-like family (D228, A230, T261, D262, N263, T264, G265, N270, R277, I280, T297, S302, R319). Notably, mutations of residues corresponding to T261, D262, R277, and R319 all render the OmpA-like protein MotB nonfunctional 24, underscoring their importance.
The interaction is specific for m-DAP at position 3 of the peptide stem, since addition of the L-Lys analog UMKP, more typical of Gram-positive bacteria but absent from M. tuberculosis, had no effect on the spectrum of ArfA-c (Fig. S4b), and neither did addition of the dipeptide GMAG, lacking m-DAP (S4c). Thus, the presence of MurNAc alone is insufficient for binding while m-DAP is required, although we cannot exclude the possibility that MurNGlyc, which is abundant in M. tuberculosis, may be important. Finally, addition of UMDP had no effect on the spectrum of ArfA-b (Fig. S4d), confirming that ArfA binds peptidoglycan solely through its C domain.
Unlike the Pal study, where NOEs connecting the peptide to the protein enabled the structure of the complex to be determined 21, the interaction of ArfA-c with UMDP did not yield any intermolecular NOEs. However, a number of key experimental observations, described below, enabled us to construct a structural model that sheds light on peptidoglycan recognition by ArfA (Fig. 6).
Although the affinity of ArfA-c for polymeric M. tuberculosis peptidoglycan is strong (enough to precipitate the protein), its binding affinity for monomeric UMDP is weak, and a dose-response experiment, performed by mapping NMR peak changes upon titration of UMDP into the protein, yielded a Kd in the range of 1 mM. This high-specificity, low-affinity interaction suggests that electrostatics (hydrogen bonds, charge-charge contacts) play an important role in mediating UMDP recognition. A prominent surface electrostatics feature of ArfA-c is the presence of two juxtaposed clusters of positive charge at or near the peptidoglycan binding site, formed by R277, R319 and R320 (Fig. 5d). In the free protein, these arginines participate in electrostatic interactions within a hydrophobic environment, reflected in the presence of side chain guanidinium NHε peaks, with characteristic downfield chemical shift (e.g. 8.06 ppm for R320), in the NMR spectrum. Indeed, the R320 guanidinium group has polar interactions with E322 and N304, and its hydrocarbon side chain is stabilized by hydrophobic contacts with the side chains of I306 and Y260, all established by NOE connections. The R277 side chain is stabilized by hydrophobic contacts with the aromatic ring of F225, with L271 and I280, all residues with very high OmpA-like conservation. The heteronuclear 1H/15N NOEs measured for the guanidinium NHε peaks of these arginines (0.58 for R277; 0.62 for R319; 0.86 for R320) reflect a rather rigid side chain for R320, but significantly more flexibility for R277 and R319.
The R277 and R319 side chains are positioned to facilitate the formation of a hydrogen bond network that includes their guanidinium groups as well as the hydroxyl of T261, the carboxylate of D262, and the carboxamide of N270. These residues all have NMR peaks that exhibit significant changes in the presence of UMDP: the side chain NHε peaks from R277 and R319, but not R320, undergo substantial shifts, and the side chain NH2 peak of N270 is broadened beyond detection, suggesting a direct interaction with the peptide (Fig. 5a). Furthermore, the backbone amide peaks of N270, T261 and D262 are all affected by UMDP. The five T261, D262, N270, R277 and R319 side chains all converge at the peptidoglycan binding site where they provide an effective recognition site for m-DAP (Fig. 6), possibly facilitated by the side chain flexibility of R277 and R319. Peptidoglycan recognition proteins of the immune system rely on a similar, buried electrostatic interaction between conserved Arg and the m-DAP carboxyl group to select for Dap-type peptidoglycan 25, suggesting that this may be a conserved interaction in a variety of protein folds.
To confirm the identity of the peptidoglycan recognition site, we generated a mutant of ArfA-c(D236A) where R277 was replaced by Glu, and tested its ability to bind UMDP by NMR spectroscopy. Although the R277E mutation alone caused several peak changes compared to the spectrum of ArfA-c(D236A), the perturbations were largely localized to sites near residue 277, the peak movements could be easily tracked, and the overall appearance of the spectrum was maintained, indicating that the overall fold of the protein is preserved. Most importantly, no changes were observed in the spectrum of the R277E mutant upon addition of UMDP (Fig. S5), indicating that the protein has lost its ability to bind peptidoglycan. Thus, we conclude that R277 is a key residue for the recognition of peptidoglycan by ArfA.
Our model indicates that the side chain of m-DAP is stabilized by charge-charge interactions between its electronegative carbonyl group with the R277 and R319 guanidinium groups and with the N270 carboxamide, and between its electropositive amino group with the D262 carboxyl and the T261 hydroxyl. These residues are strictly conserved in the OmpA-like family and our data show that they constitute the specific recognition site for DAP-type peptidoglycan (Fig. 6b). For example, in the structure of H. influenzae Pal, D71 (corresponding to ArfA D262) forms an electrostatic interaction with the side chain NH3 group of m-DAP. Our model also indicates that UMDP could further interact with ArfA-c through contacts of its γ-Glu3 and Ala2 backbone amides with the side chain hydroxyl of S266, of its MurNac OH4 with the backbone carbonyl of G265, of its MurNac O3 with the amide proton of E267, and of its MurNac NH2 with the E267 carboxyl.
Saturation transfer difference NMR experiments show that the amide proton of S302, located in the β3-α4 loop, is in some proximity to the methyl protons of the UDMP MurNAc group (Fig. S6). The structural position of S302 is identical to that of S208 in H. pylori MotB, where it appears to stabilize the interaction with a single MurNAc sugar molecule, by forming hydrogen bonds between its side chain hydroxyl group and the O1 atom of MurNAc and between its backbone amide NH and the acetamido oxygen atom (O7) of MurNAc 22. S302 may assist the interaction of ArfA with peptidoglycan in a similarly way. A Ser or other hydroxyl-bearing residue at this position is conserved in many of the OmpA-like sequences with the exception of YiaD, where it is replaced by Pro. In all sequences, this residue is preceded by a perfectly conserved Gly, which could confer structural flexibility in the loop to further assist interaction with the peptidoglycan.
The highly specific albeit weak interaction of ArfA-c with UMDP is reminiscent of the immune response peptidoglycan recognition proteins, which achieve selective recognition of peptidoglycan type by specifically discriminating for either DAP or Lys in the third position of the peptide stem, and for the degree and type of peptide stem cross-linking 26. We note that UMDP is chemically and structurally different from polymeric M. tuberculosis peptidoglycan, whose affinity for ArfA is sufficiently strong to precipitate the protein (Fig. 4a). The interaction of ArfA with polymeric peptidoglycan may be strengthened by the highly cross-linked peptidoglycan structure, which could provide additional binding sites for ArfA (e.g. at peptide stem cross-links), cause an effective increase in protein concentration by sterically constraining its diffusion, and reduce the flexibility and conformational space of the peptidoglycan peptide stem. These effects could be particularly important for M. tuberculosis where 75% of peptidoglycan is cross-linked, compared to the 20 to 50% that is cross-linked in E. coli (reviewed by Rubin, Jackson, Brennan and coworkers 1; 2). The interaction may also be strengthened by the formation of ArfA dimers in vivo, as proposed for Pal 27. For example, MotB, and to a weaker extent RmpM, crystallize as dimers 22; 23, and the B domain of ArfA has been proposed to oligomerize 11. Pal and ArfA are monomeric in solution, both in their free and UMDP-bound states. However, the molecular surface of ArfA C domain displays a distinct electrostatic polarity (electropositive on the side where α3 packs against β3 and negative on the opposite side where α1 packs against α2), and it is tempting to speculate that this charge juxtaposition may assist the assembly of stacked, neighboring C domains on the peptidoglycan polymer as a means of strengthening the cell wall.
ArfA is the first peptidoglycan-binding protein to be identified in M. tuberculosis. Its other established roles in fast ammonia secretion and rapid pH neutralization 13 are consistent with the presence of ArfA-like proteins in bacteria active in nitrogen fixation and/or nitrogen metabolism 15. Interestingly, M. tuberculosis is known to generate substantial quantities of ammonia, which inhibits phagosome fusion in infected macrophages 28. Taken together, the peptidoglycan binding activity of ArfA and its function in supporting growth of M. tuberculosis at low pH indicate a possible link between the physico-chemical properties of the mycobacterial cell envelope and the acid stress response, as previously noted for other stress response proteins 18.
ArfA could contribute structural strength to the bacterial cell wall during acid or other stress conditions. It could achieve this directly, by providing a physical link between the peptidoglycan layer and the bacterial outer or inner membrane, or it could help guide nascent peptidoglycan peptide stems in their structurally correct position for polymerization, to ensure solid construction of the cell wall. This could be important for maintenance and determination of the mycobacterial cell shape, as suggested for other cytoskeletal bacterial proteins 1. Additional studies examining the relationship between the peptidogycan binding and pH neutralization activities of ArfA will shed light on the acid stress response mechanism of M. tuberculosis.
ArfA is reported to reside in the outer membrane of M. tuberculosis and to be exposed to the cell surface 5; 7; 10; 13; 29; 30. However, the finding that ArfA binds M. tuberculosis peptidoglycan through its C-terminus appears to contradict its surface accessibility, since the only possible architectures allowing for an interaction with peptidoglycan would involve insertion of the ArfA N-terminal transmembrane helix across either the outer or inner membranes, placing the B and C domains into the periplasmic space. It is well known that both periplasmic and inner membrane proteins can fractionate with the bacterial cell wall due to their significant interactions with the outer membrane or with the peptidoglycan layer 31; 32. The interaction of ArfA with peptidoglycan would certainly strengthen its association with the cell wall and potentially cause it to separate with it in fractionation experiments, or to be shed with it under conditions of bacterial stress. On the other hand, ArfA could occupy distinct areas of the cell, as recently reported for the E. coli major lipoprotein Lpp, which appears to localize to different subcellular compartments depending on whether its C-terminus is covalently linked to peptidoglycan or not 33. The structure and peptidoglycan recognition of Lpp are completely different from those of ArfA. However, in the case of ArfA, the existence of two protein populations, one located in the periplasm and the other on the mycobacterial cell surface, could reconcile the surface accessibility of ArfA with its peptidoglycan binding activity. Understanding the cellular localization of ArfA, its relationships with the other arf-encoded proteins, ArfB and ArfC, and the function of its pH-dependent dynamics, will shed light on the physiological function of the arf operon and could advance the development of new drugs for the treatment of tuberculosis.
Protein preparation, and characterization were as described previously 12. All proteins were monomeric by size exclusion chromatography or analytical ultracentrifugation. The ArfA polypeptides used in this study were: ArfA-bc (residues 73–326); ArfA-b1 (residues 73–220); ArfA-b2 (residues 73–197); and ArfA-c (residues 196–326). Mutations (D236A; L232G; R277E) were introduced by site-directed mutagenesis using the QuikChange Lightning kit (Stratagene). Purified protein was stored at 4°C as a 40 µM solution in buffer A (25 mM Na/PO4, pH 6.8, 1 mM EDTA). Formation of the C208–C250 disulfide bond was induced by adding 50 mM oxidized glutathione (GSSG) to the protein in 50 mM Tris/HCl, 150 mM NaCl, pH 8.0, and incubating it for 1 h at room temperature. Reduction of the disulfide bond was obtained by adding 100 mM DTT to the protein in degassed buffer and incubating overnight at 4°C. For NMR experiments the protein solution was concentrated by ultrafiltration to 0.2–1.0 mM.
Solutions containing 15 µM protein and Sypro orange dye (500 fold dilution of the 5000X stock solution supplied by Invitrogen) were mixed to a final volume of 15 µL, loaded into a 96-well plate (Roche), and sealed with sealing foil to prevent evaporation and contamination. Protein melting experiments were performed using a LightCycler 480 instrument (Roche). The temperature was increased from 20°C to 95°C, with a linear gradient of 0.03°C per second. Fluorescence intensity was measured at every 0.3°C increment using excitation and emission wavelengths of 465 nm and 580 nm, respectively. The data were processed using the Roche Protein Melt Tool, using the first derivative method to smooth the raw data by converting each data point into a moving average of itself and 19 additional adjacent points, for an average of 20 points, as described by the manufacturer. Fluorescence intensity, as well as the change in intensity divided by the change in temperature, were plotted as a function of temperature.
Purified peptidoglycan from Mycobacterium tuberculosis, strain H37Rv, was obtained through the NIH Biodefense and Emerging Infections Research Resources Repository supported by the NIH, NIAID (BEI #: NR-14853). Purified protein (1 µL of 40 µM stock) was added to peptidoglycan (40 µg), resuspended in 10 µM of buffer A, and incubated for 2 hours at room temperature, as described 34. The mixture was centrifuged in a microfuge (14,000 rpm, 15 min) to separate the soluble and insoluble fractions. After removing the supernatant, the pellet was washed with fresh buffer and the mixture was centrifuged again. This procedure was repeated twice. The resulting supernatant and pellet fractions were analyzed with SDS-PAGE with coomassie blue staining.
NMR chemical shift mapping experiments were performed using water-soluble peptidoglycan peptides. The peptides UMDP (uridine-5'-diphosphate–MurNAc–L-Ala–D-γ-Glu–m-DAP–D-Ala– D-Ala) and UMKP (uridine-5'-diphosphate–MurNAc–L-Ala–D-γ-Glu–L-Lys–D-Ala–D-Ala) were obtained from the UK Bacterial Cell Wall Biosynthesis Network, at Warwick University. The GMAG (GlcNAc–MurNAc–L-Ala–D-γ-Glu-NH2) dipeptide was obtained from Calbiochem.
Since recombinant Pal, expressed with an E. coli pET vector, co-purified with endogenous E. coli UMDP 21, we tested whether this might also be the case for recombinant ArfA-c. To this end, we denatured purified ArfA-c in 8 M urea and subjected it to size exclusion chromatography in 8 M urea to remove any potentially associated molecules. We then refolded the eluted protein in urea-free buffer and examined its 1H/15N HSQC spectrum. As previously noted for ArfA-b 12, the native fold of ArfA-c could be readily restored after removal of denaturant by dialysis. No difference was detected between the spectra obtained before or after denaturation and refolding, indicating that ArfA-c, unlike Pal, does not co-purify with E. coli UMDP.
NMR experiments were performed at 25°C or 40°C on a Bruker AVANCE 600 MHz spectrometer or a Bruker AVANCE 800 MHz spectrometer, each equipped with a cryoprobe. The pulse sequences are described in detail in the literature 35; 36; 37; 38; 39; 40; 41; 42; 43; 44. The NMR data were processed and analyzed using NMRPipe 45, NMRViewJ 46 and Sparky 47.
Resonance assignments for UMDP were obtained by means of 2D TOCSY with a 60 ms mixing time, and by comparison with previous published assignments for UMKP 48. The backbone 1H, 15N, and 13C chemical shifts were resolved using two-dimensional 1H/15N- and 1H/13C-HSQC experiments, and assigned using three-dimensional 1H/15N/13C HNCA and HNCACB experiment. Assignments of the side chain resonances were obtained with three-dimensional HCC(CO)NH, CC(CO)NH, HCCH-COSY/TOCSY and 13C-NOESY-HSQC experiments. Homonuclear 1H NOEs were measured from three-dimensional 15N- and 13C-edited NOESY-HSQC spectra, obtained with mixing times of 100 ms. 1H/15N Residual dipolar couplings (RDCs) were measured using in-phase/anti-phase 15N/1H HSQC experiments 41 with samples in magnetically oriented Pf1 phage (10 mg/ml). Heteronuclear 1H/15N NOE measurements were made using difference experiments with and without 5 s of saturation of the 1H resonances between scans 42. For saturation transfer difference experiments 43; 44, selective radiofrequency saturation was obtained by applying a train of frequency-selective 50 ms Gaussian pulses, separated by a 1 ms delay. Two experiments, each with on- or off-resonance saturation, were performed in an interleaved fashion, and the difference spectrum was obtained by subtracting the resulting free induction decays. Water suppression was achieved by excitation sculpting 49.
Structure calculations were performed using XPLOR-NIH version 2.27 50. Structures were visualized with PyMol 51. Electrostatic potentials were calculated in units of kT/e (k=Boltzmann constant; T=absolute temperature; e=proton charge) using the programs PDB2PQR 52 and APBS 53. A total of 400 structures were generated from extended random coil coordinates, as described previously 12. Structures were calculated using restraints measured at pH7 and 25°C, including: backbone ϕ and ψ dihedral angles derived from analysis of HA, N, C, CA, and CB chemical shifts with the database program TALOS+ 54; distances derived from NOEs; and orientations derived from 1H/15N RDCs, analyzed with the XPLOR-NIH script CalcTensor and with REDCAT 55. The 40 structures with lowest energies were subjected to a final refinement in water using the explicit solvent refinement module available in XPLOR-NIH 56. Structure calculations for wild-type ArfA-c included two single long distance restraints (4.2 ± 2.4 Å) between the methyl protons of L232 and the amide protons of A224 and V281 to loosely constrain the β1-α2 loop, as suggested by the results of the L232G mutation.
The 20 lowest energy structures were evaluated with PROCHECK 57. They have no dihedral angle violations greater than 5°, no bond angle violations greater than 5°, no NOE distance restraint violations greater than 0.5 Å, and no bond distance violations greater than 0.05 Å. Only 1.2% (wild type) or 0.9% (D236A mutant) of the residues had ϕ/ψ dihedral angles in generously allowed and/or disallowed regions of the Ramachandran plot; these residues are located in the termini outside the structured core of the protein or in connecting loops. Statistics for the resulting family of 20 structures are reported in Table S1.
To model the complex of ArfA-c with MDP (MurNAc–L-Ala–D-γ-Glu–m-DAP–D-Ala–D-Ala), we first generated an extended MDP peptide and subjected it to Cartesian coordinate Powell minimization, then we optimized the interface between ArfA-c(D236A) and MDP using semi-rigid body simulated annealing 58. In this step, the MDP peptide was flexible, ArfA-c side chain atoms (from CB onward) with NMR peaks affected by addition of UMDP were allowed to rotate and translate, and all other ArfA-c atoms were held fixed. The starting position of MDP relative to ArfA-c was guided by the chemical shift mapping and STD results of ArfA-c(D236A) with UMDP, and by the published complex of Pal with UMDP 21. The energy function included distance restraints derived from chemical shift mapping and STD (Table S2), as well as terms for covalent geometry, Van der Waals contacts, and the torsion angle database Rama potential 59. The 10 lowest energy structures out of 100 calculated structures were selected and analyzed.
Atomic coordinates, resonance assignments and NMR restraints have been deposited in the Protein Data Bank (PDB) for wild-type ArfA-c (PDB: 2LCA) and ArfA-c(D236A) (PDB: 2LBT). NMR assignments have been deposited in the Biological Magnetic Resonance Bank (BMRB: 17575).
We thank Stanley Opella for providing purified Pf1 bacteriophage, Jasmina Rakic and Yuan Yang for their assistance with protein preparation, and Houhui Song for preparing the original wild-type ArfA expression plasmid. This research was supported by a grant from the National Institutes of Health (AI074805). It utilized the NIH-supported Biodefense and Emerging Infections Research Resources Repository (NIAID), and NIH-supported NMR Facilities at the Sanford Burnham Medical Research Institute (CA030199) and at the University of California San Diego (S10RR23773).
Publisher's Disclaimer: This is a PDF file of an unedited manuscript that has been accepted for publication. As a service to our customers we are providing this early version of the manuscript. The manuscript will undergo copyediting, typesetting, and review of the resulting proof before it is published in its final citable form. Please note that during the production process errors may be discovered which could affect the content, and all legal disclaimers that apply to the journal pertain.
*Abbreviations: ArfA, ammonia release facilitator; BON, Bacterial OsmY and Nodulation; DFS, Differential Scanning Fluorimetry; DTT, dithiotheritol; m-DAP, meso-diaminopimelate; GSSG, glutathione disulfide; GlcNAc, N-acetyl-D-glucosamine; Mur, muramic acid; MurNAc, N-acetylmuramic acid; MurNGlyc, N-glycolylmuramic acid; MDP, MurNAc–L-Ala–D-γ-Glu–m-DAP–D-Ala–D-Ala; UMDP, uridine-5'-diphosphate–MurNAc–L-Ala–D-γGlu–m-DAP–DAla–D-Ala; UMKP, uridine-5'-diphosphate–MurNAc–L-Ala–D-γ-Glu–L-Lys–D-Ala–D-Ala; GMAG, GlcNAc–MurNAc–L-Ala–D-γ-Glu-NH2
§Supplemental information related to this article can be found online.