We first built a dataset of proteins containing S-nitrosylated Cys residues based on literature reports (Supplementary information, Table S1). We only considered proteins with established crystal or NMR structures and proteins that could be modeled by standard homology modeling approaches using the Swiss Model server, and used non-redundant proteins with less than 50% sequence identity to any other protein in the dataset. We also required the proteins in our dataset to have established positions for the modifiable sites (i.e., NO-Cys site experimentally demonstrated). Finally, proteins found to contain NO-Cys based on experiments that employed NO-sylating agents at concentrations in the mM range were excluded, as high levels of these agents could result in artifacts (i.e., NO-Cys sites that are not biologically relevant or do not occur under physiological conditions). This restriction, even if it may exclude some naturally occurring NO-Cys sites, was used to limit false positives. Nevertheless, a potential weakness of the published studies is that they employed exogenous NO or S-nitrosylating agents to discover proteins and their sites of modification, and that many of these sites have not been confirmed in vivo. In view of the absence of endogenous S-nitrosocysteine proteomes, we focused on the data generated by in vitro studies.
Our searches resulted in a set of 55 proteins from various organisms containing 70 NO-Cys sites (Table S1), including Trx1, cyclic nucleotide gated channel alpha 2 (CNG2), mitogen-activated protein kinase kinase kinase 5 (ASK1), mitogen-activated protein kinase (JNK1), dimethylarginine dimethylaminohydrolase (DDAH), Bcl2, FADD-like apoptosis regulator isoform 1 (FLIP), caspase 3, caspase 1, calpain 2, GST theta (GSTt), GST pi (GSTp), serine/threonine protein phosphatase (Ser/Thr protein phosphatase), 14-3-3 protein zeta/delta, malate dehydrogenase, pyruvate kinase, triosephosphate isomerase, vesicle-fusing ATPase C-terminal domain (NSF C-terminal domain), vesicle-fusing ATPase N-terminal domain (NSF N-terminal domain), ADP/ATP translocase, sodium/potassium-transporting ATPase, tubulin alpha, GST mu (GSTm), semaphorin 4D, glucokinase, annexin 6, S-adenosylmethionine synthetase (MAT), matrix metalloproteinase 9 (MMP9), iron-responsive element binding protein (IRP2), inhibitor of nuclear factor kappa-B kinase (IKKB), peroxiredoxin 2 (PRX2), oxidative stress transcriptional regulator (OXYR), Ras p21, creatine kinase, GAPDH, vinculin, T-complex protein 1, Ras-related protein Rab, peptidylprolyl isomerase, myosin heavy chain 9, stress 70 protein (GRP75), COPA protein, annexin 2, annexin 11, elongation factor alpha (EF1A), protein tyrosine phosphatase (PTP), syntaxin 1A, myeloid differentiation primary response protein (MyD88), hemoglobin subunit beta, tubulin beta, histone deacetylase 2, Parkinson’s disease protein 7 (DJ1), dynamin-2, ISG15 ubiquitin-like modifier (ISG15), and G-protein coupled receptor kinase 2 (GRK2). The majority of these proteins contained one reported nitrosylation site, but several had multiple modification sites (e.g., tubulin alpha and beta, GAPDH), as reported in detail in Table S1.
As our goal was to analyze NO-Cys sites for general features, we employed a broad set of proteins without separating them by physiological role or by regulatory, signaling and stress functions. In addition, we did not consider subsets of NO-Cys sites based on the chemical form and concentration of nitrosylating agents or on the biochemical pathways involved. As we discuss in more detail later in the text, the use of these parameters would require significantly more confirmed NO-Cys sites. Also, given the scope of our work (i.e., distinguishing common features of NO-Cys sites and utilizing this information for predictive purposes), choosing a broad protein dataset was important; subsets of proteins in our dataset could nevertheless be defined based on the parameters discussed above and utilized in future studies.
NO-Cys pKa, exposure and conservation
We first tested our dataset for the parameters most often discussed in regard to posttranslational Cys modifications: pKa, exposure, and conservation. Using the program PropKa, we computed pKa values of Cys residues that are targets for S-nitrosylation and found them to be higher (average 9.1) than those of redox and non-redox catalytic Cys residues (average pKa 5.5, calculated with the same program), but consistent with the average pKa of Cys residues in proteins.
Calculated pKa and sulfur atom exposure of known NO-Cys sites
We next assessed Cys exposure (i.e., S atom exposure). While some NO-Cys sites were exposed, others were not. Approximately 48% of NO-Cys sites had exposure higher than 1 Å² when a 1.4 Å probe (to mimic the water molecule) was employed, and 65% when a 1.2 Å probe was used (to account for the slightly smaller NO molecule). Thus, although some enrichment in sulfur exposure was detected for NO-Cys, about 35% of sulfur atoms of NO-Cys sites were predicted to be buried (i.e., with exposure values ≤1.0 Å²) and not accessible even to small molecular probes.
We further analyzed conservation of NO-modified Cys in proteins in our dataset. A PSI-BLAST search of the NCBI non-redundant database revealed an average conservation of 62% for NO-Cys sites, but this parameter also varied greatly among proteins in the dataset (Figure S2 and Table S1). Indeed, NO-Cys sites can be based on both highly conserved (e.g., PRX2, GAPDH) and poorly conserved (e.g., Cys 50 in GSTt, Cys 73 in TRX1) Cys residues, reflecting the low significance of the average value (standard deviation 34%). Thus, Cys conservation does not define NO-Cys sites either.
Sequence analysis: amino acid composition and hydrophobicity
To further analyze common features of NO-Cys sites, we focused on sequence analysis of Cys residues in the dataset. Amino acid composition of NO-Cys-flanking sequences is shown in . The data obtained for 70 NO-Cys sites were compared to those of a reference set of Cys residues made up of 1,000 randomly chosen eukaryotic proteins from PDB. An overrepresentation of negatively charged residues was detected for NO-Cys sites, including aspartate in positions −1 and +1, and glutamate in position −3. The presence of a nearby (at the sequence level) acidic residue is regarded as one of the features of a putative motif for NO-Cys, and in at least one case (KCNQ1 channel, NO-Cys 445), the presence of the acidic amino acid was found to be necessary for NO-sylation of Cys [25]. However, in our analysis, even if the flanking acidic residues were over-represented, they did not represent a feature characteristic of all or even a majority of NO-Cys sites. This observation does not exclude the possibility that subsets of NO-Cys sites, such as those exclusively involved in signaling or derived from a particular group of organisms, could be defined by the presence of an acidic residue flanking NO-Cys. This feature, however, does not apply to the entire group of NO-Cys sites. In this regard, an important case study is the ryanodine receptor with its NO-Cys 3635. This protein contains 12 Cys residues flanked by an acidic residue; however, its only nitrosylated Cys is not flanked by one (i.e., the flanking residues are Ala 3634 and Phe 3636).
Amino acid composition of NO-Cys sites
We discuss in detail the actual tertiary structure of NO-Cys-flanking acidic residues later in the text, while describing the occurrence and distribution of a potential revised acid-base motif for all proteins in our dataset. Additional sequence features of NO-Cys sites included a rare occurrence of an additional Cys nearby (a 13-residue window was considered; particularly striking is the lack of a second Cys in positions +2 and −2, which are also the positions particularly enriched for Cys residues that flank catalytic redox and metal-binding Cys) and under-representation of leucine, especially in positions −1 and +6. However, attempts to build an NO-Cys profile and a signature sequence derived from it revealed a lack of statistical significance, indicating that these factors could not be employed for prediction of NO-Cys sites. These observations are in line with previous reports [14] and suggest that sequence analysis of NO-Cys sites is not sufficiently robust for reliable identification of S-nitrosylated Cys residues.
We analyzed the sequences flanking NO-Cys for hydrophobicity as defined by the Kyte-Doolittle scale. The average hydrophobicity of NO-Cys sites was slightly increased (an average value of 0.03 ± 0.7), but considerable variation existed among NO-Cys sites. Moreover, a hydrophobic environment is a feature commonly found around Cys residues. In fact, the set of randomly chosen Cys residues (Random Cys) showed an average value of 0.09 ± 0.8. Thus, the sequences flanking NO-Cys do not significantly vary from those containing random Cys, and do not permit differentiation of NO-Cys and other Cys sites.
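The windowed hydrophobicity calculation described above can be sketched in Python as follows. This is a minimal illustration rather than the code used in this work: the function name `window_hydropathy`, the 13-residue window (`flank=6`), and the inclusion of the Cys itself in the average are all assumptions made for the example. Only the Kyte-Doolittle values themselves are standard.

```python
# Kyte-Doolittle hydropathy values, one-letter amino acid codes.
KD = {
    "A": 1.8, "R": -4.5, "N": -3.5, "D": -3.5, "C": 2.5,
    "Q": -3.5, "E": -3.5, "G": -0.4, "H": -3.2, "I": 4.5,
    "L": 3.8, "K": -3.9, "M": 1.9, "F": 2.8, "P": -1.6,
    "S": -0.8, "T": -0.7, "W": -0.9, "Y": -1.3, "V": 4.2,
}


def window_hydropathy(seq, cys_index, flank=6):
    """Mean Kyte-Doolittle value of the window centered on seq[cys_index].

    flank=6 gives a 13-residue window (an assumption, not a value taken
    from the text). The window is truncated at the sequence ends and
    letters that are not standard amino acids are skipped.
    """
    if seq[cys_index] != "C":
        raise ValueError("cys_index does not point at a cysteine")
    start = max(0, cys_index - flank)
    window = seq[start:cys_index + flank + 1]
    values = [KD[aa] for aa in window if aa in KD]
    return sum(values) / len(values)
```

Averaging the returned values over all NO-Cys sites, and separately over a reference set of Cys residues, would give the kind of mean and spread compared in the text.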
Hydrophobicity analysis of NO-Cys sites
Structure-based analysis: amino acid composition and hydrophobicity
We further performed structural profile analyses of NO-Cys-containing proteins using a previously described method [26]. Structural profile analyses build a signature sequence of amino acids located around a specified residue: in our work, segments of amino acids located within 8 Å of each NO-Cys were extracted from the structure and combined into single contiguous sequences, generating structural profiles. However, this approach also did not yield common features among proteins containing NO-Cys (Figure S3). It should be noted that the structural profile analysis performs well for catalytic Cys in proteins [23], and in fact has been designed for this purpose. Evolutionary constraints act to a greater extent on active sites than on post-translational modification sites (the former often being significantly more conserved).
As a next step, we analyzed the amino acid composition and hydrophobic content of the regions surrounding NO-Cys. For this purpose, we considered regions that fall inside spheres with defined radii (4 to 10 Å) and centered at the sulfur atom of each NO-Cys. As a reference, we employed an unbiased set of representative structures from the Random Cys dataset. Hydrophobicity in the NO-Cys set was only slightly higher (average Kyte-Doolittle value of 0.4 ± 0.5 for the 6 Å region), with significant variation among NO-Cys sites. These values were not significantly different from those of the Random Cys set (0.5 ± 0.5). Further analyses at other distances (i.e., 4 Å, 8 Å, 10 Å) showed similar behavior.
Turning our attention to the amino acid composition, we analyzed the average frequency of each residue at various distances (4 to 10 Å) from the modified Cys. Once again, the low occurrence of other Cys residues near NO-Cys was the most obvious feature (with an average occurrence for the closest distance (4 Å) of 1.1% ± 1.9% for NO-Cys, compared with 11% ± 7.5% for the reference set). In addition, slight but insignificant overrepresentation of negatively charged residues and leucine was observed, in accordance with the sequence analysis. Interestingly, low occurrence of His was evident from the structure-based approach, whereas this feature was not detected at the sequence level. This trend, even if intriguing, is, however, not statistically significant. Altogether, the results indicated considerable heterogeneity within the NO-Cys set.
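The sphere-based composition analysis can be illustrated with a short Python sketch. It is not the implementation used here. The data layout (a list of `(residue_name, atom_coordinates)` tuples), the function names, and the rule that a residue belongs to the sphere when at least one of its atoms lies within the radius are assumptions made for the example.

```python
import math


def residues_within(sulfur, residues, radius):
    """Names of residues with at least one atom within `radius` Å of `sulfur`.

    residues: list of (name, [(x, y, z), ...]) tuples (hypothetical layout).
    """
    return [
        name
        for name, atoms in residues
        if any(math.dist(sulfur, atom) <= radius for atom in atoms)
    ]


def composition(names):
    """Fraction of each residue type in a list of residue names."""
    counts = {}
    for name in names:
        counts[name] = counts.get(name, 0) + 1
    total = len(names)
    return {name: count / total for name, count in counts.items()}


def composition_by_radius(sulfur, residues, radii=(4.0, 6.0, 8.0, 10.0)):
    """Residue composition inside spheres of increasing radius."""
    return {r: composition(residues_within(sulfur, residues, r)) for r in radii}
```

Running `composition_by_radius` for every NO-Cys sulfur and for every reference Cys would yield per-radius frequencies that can then be averaged and compared between the two sets.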
However, at a closer look, only 4% of NO-Cys sites in our dataset (3 out of 70) did not have any charged residues within 6 Å from NO-Cys. In contrast, about one fourth (26%) of random Cys were characterized by the absence of net charge in close proximity (6 Å) to the Cys. Among the closest charged residues located near NO-Cys, 82% had a positive charge (and 18% negative), whereas the values for the Random Cys reference set were 64% and 36%, respectively.
With regard to the concomitant occurrence of at least one negatively and one positively charged residue (i.e., the acid-base motif) within the 6 Å region from NO-Cys, only 26% of NO-Cys and 19% of random Cys satisfied this requirement. These observations are important because they indicate that the acid-base motif, often referred to in the literature as a characteristic feature of NO-Cys sites, does not actually define these modification sites, at least within 6 Å of NO-Cys.
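A test for the acid-base motif at a given distance can be sketched as below. This is a hedged illustration only: the function name, the input layout, and the use of any atom of a residue to decide whether it lies within the cutoff are assumptions. The residue classes follow the charged residues considered in this work (Asp and Glu as acidic; Lys, Arg and His as basic).

```python
import math

ACIDIC = {"ASP", "GLU"}
BASIC = {"LYS", "ARG", "HIS"}


def has_acid_base_motif(sulfur, residues, radius=6.0):
    """True if at least one acidic and one basic residue lie within `radius` Å.

    residues: list of (name, [(x, y, z), ...]) tuples (hypothetical layout).
    A residue counts as near when any of its atoms is within the cutoff,
    which is an assumption of this sketch.
    """
    near = {
        name
        for name, atoms in residues
        if any(math.dist(sulfur, atom) <= radius for atom in atoms)
    }
    return bool(near & ACIDIC) and bool(near & BASIC)
```

Calling the function with `radius=8.0` corresponds to the longer-distance variant of the same test.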
We also considered the possibility that the previously reported features, even if unreliable descriptors of NO-Cys sites on their own, may be useful for predictive purposes when used in combination. We carried out principal component analysis (PCA) that included Cys exposure, pKa, hydrophobicity, conservation and sequence features (detailed in Table S1); based on the results of PCA, however, we did not find any combination of parameters capable of describing the NO-Cys sites as a whole. Taken together, our data indicate significant NO-Cys heterogeneity in all parameters considered, which is consistent with the idea that other mechanisms (or additional components) are behind specific S-nitrosylation. Thus, besides the direct nitrosylation chemistry (i.e., direct reaction of Cys with NO), various trans-nitrosylation processes might potentially account for many of the observed NO-Cys sites. For example, caspase 3 does not undergo nitrosylation in the absence of TRX1 [28]. The interaction between these proteins is driven by two essential and oppositely charged amino acids (Glu 70, Lys 72), which are exposed to solvent in the molecular surface proximal to the NO-Cys 73 site (the actual trans-nitrosylating residue) of TRX1; however, this acid-base motif does not point toward the sulfur atom, and while the basic residue is within 6 Å (with some atoms, but not the charged one), the acidic Glu 70 residue is outside this range. It should be mentioned that TRX1 may have a more complex but still specific interaction with caspase 3 [29] due to NO transfer in both directions between these proteins. Given the potential importance of mandatory protein-protein interactions and the occurrence of the acid-base motif, we further examined these features in more detail.
A distant acid-base motif: occurrence and distribution
When longer distances (up to 8 Å) were considered, approximately 90% of NO-Cys sites had both positively and negatively charged residues (i.e., there was at least one acidic and one basic residue within 8 Å). However, this feature was again insufficient to define the NO-Cys set: 85% of control proteins also had such ionizable residues within 8 Å. To address the relationship between NO-Cys and charged residues, we evaluated their positioning with respect to the modifiable Cys sulfur atoms by examining whether these residues point their charged functional groups (GrF) toward the NO-Cys sulfur. Indeed, if the negatively and positively charged residues are involved in the reaction between Cys and NO, one would expect their actual active atoms (i.e., atoms of GrF) to be located in proximity to the sulfur atom.
We calculated the distances between each charged atom (Lys, Arg, His, Glu and Asp within 8 Å from NO-Cys) and the sulfur atom (S) of Cys; for simplicity, we further refer to this measurement as the GrF-S distance. These distances were compared with those of the approximate center of mass (CM) of each residue (S-CM distance). When the GrF-S distance was smaller than the S-CM distance, the charged functional group was considered as pointing toward the NO-Cys sulfur atom. On the other hand, when GrF-S was greater than S-CM (i.e., the center of mass was closer to S than GrF), the charged functional group was considered to point outward.
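The orientation criterion above (GrF-S distance compared with S-CM distance) can be written as a small Python sketch. It is illustrative only: the function names are hypothetical, the center of mass is approximated by the unweighted centroid of the residue atoms, and the GrF-S distance is taken as the shortest distance from the sulfur to any charged atom. Both choices are assumptions, since the text specifies only an approximate center of mass.

```python
import math


def centroid(points):
    """Unweighted geometric center of a list of (x, y, z) points."""
    n = len(points)
    return tuple(sum(p[i] for p in points) / n for i in range(3))


def points_toward(sulfur, residue_atoms, charged_atoms):
    """True if the charged group is closer to the sulfur than the residue center.

    residue_atoms: coordinates of all atoms of the charged residue.
    charged_atoms: coordinates of the atoms carrying the charge (GrF).
    """
    grf_s = min(math.dist(sulfur, atom) for atom in charged_atoms)
    s_cm = math.dist(sulfur, centroid(residue_atoms))
    return grf_s < s_cm
```

A residue for which `points_toward` returns `False` has its center closer to the sulfur than its charged group, which corresponds to the outward-pointing case.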
First, we took a closer look at the positioning of negatively charged residues relative to the Cys, as these residues were more frequently found in the sequence analysis discussed before. All NO-Cys sites flanked by an acidic residue (i.e., with Asp or Glu in positions +1 or −1 in the sequence) were separately analyzed, and, remarkably, in the vast majority of these cases the acidic residue had the charged atoms pointing away from the sulfur atoms. This observation is illustrated in Figure S4. These data are in agreement with the previously discussed results: acidic residues, even when they are near the NO-Cys in the sequence, tend to stay distant from the reactive sulfur atom in the structure.
Recently, human DPR-1 was found to possess a single NO-modification site, Cys 644, which upon modification led to a series of toxic cellular events associated with Alzheimer’s disease [31]. In addition, this NO-Cys site had flanking acidic residues (Asp 643 and Glu 645). We modeled the structure of human DPR-1 and analyzed the orientation of these residues; once again both acidic residues flanking the NO-Cys site were clearly pointing outward with respect to Cys 644 (Figure S4, panel R).
Considering the NO-Cys dataset as a whole, only a few proteins (DDAH Cys 249, creatine kinase Cys 283, Ras p21 Cys 118, MAT Cys 121, histone deacetylase 2 Cys 274) had positively and negatively charged groups pointing toward the Cys (~7% of the dataset). In addition, both GrF were closer than 6 Å to the NO-Cys sulfur atom in only three cases: MAT (positive GrF at 5.3 Å and negative GrF at 5 Å), Ras p21 (positive GrF at 5 Å, negative GrF at 5.8 Å) and histone deacetylase (positive GrF at 3.9 Å and negative GrF at 3.1 Å).
When all other non-modifiable Cys (i.e., all Cys in the dataset that are not known to be S-nitrosylated) were used as a reference set, both types of charged functional groups pointed toward the Cys in 24% of control Cys (Figure S5). These data suggested that the presence of charged residues (i.e., the acid-base motif) that are slightly more distant with regard to NO-Cys may play a role other than a direct acid-base motif-dependent Cys activation.
Analysis of the exposed 8 Å region
In the following text, we refer to the region composed of exposed amino acids within 8 Å from NO-Cys sulfur atoms as the exposed 8 Å region. To define this region, we considered exposed residues with at least one of their atoms accessible to the solvent (area greater than 1.0 Å², a permissive cut-off value, which should not exclude a priori less polar residues). shows the comparison of amino acid composition of the exposed 8 Å region and the whole 8 Å region for NO-Cys and control proteins. An increase in both positively and negatively charged residues (Arg, Lys, Glu and Asp; but not His) clearly characterized the exposed regions of NO-Cys sites. At the same time, other polar but non-charged residues were not over-represented in the NO-Cys exposed region. The complete list of amino acids composing the exposed 8 Å region is given in Supplementary Information, Table S2.
Amino acid composition of structural regions around NO-Cys sites
Charged residues often play important roles in proteins. Besides participation in active sites (e.g., acid-base catalysis) and metal binding, they usually exert strong influence on the electrostatic potential distribution of proteins. This feature is often crucial for protein-protein interactions, which is one of the main reasons why the distribution of the electrostatic potential on a protein molecular surface or solvent-accessible area (e.g., the Connolly surface) is a common analysis in structural biology. Among the charged residues responsible for this function, His is used less frequently (in part because of its higher hydrophobicity), instead more often playing a role in chemical plasticity due to its imidazole ring (metal binding, π-stacking interaction, etc.). As seen in , the unchanged occurrence of His and the higher representation of all other charged residues in regions proximal to NO-Cys support the idea that the latter amino acids may be involved in protein-protein interactions that occur in proximity of NO-Cys sites. Interestingly, when the exposed 8 Å region was considered, the average conservation of all residues involved increased to 69.5 ± 16%, which was higher than the conservation of NO-Cys itself (Table S2), again suggesting an important role for this region. These results support the idea that the acid-base motif, found in proximity to the molecular surface, could play a role in protein-protein interactions, possibly extending the previous findings for TRX1 and caspase 3 (i.e., exposed acid-base motif being in proximity but not too close to NO-Cys) to a larger set of proteins.
Deviating NO-Cys: NO-Cys lacking the acid-base motif in the exposed 8 Å region
As discussed above, some NO-Cys sites in our dataset lacked the acid-base motif, including GSTp (Cys 48), GSTt (Cys 50), Ser/Thr phosphatase (Cys 228), malate dehydrogenase (Cys 137), GAPDH (Cys 150), PRX2 (Cys 121), and tubulin beta (Cys 12). Additionally, histone deacetylase with its Cys 262 had the acid-base motif within 8 Å, but its only basic residue (His 282) was not exposed. Thus, 8 out of 70 NO-Cys sites did not have a solvent-exposed acid-base motif. Searching for common features among these Cys, we found that while they were exposed, the values of the other parameters were highly variable: pKa ranged from 2 (PRX2) to 11 (tubulin beta), conservation ranged from less than 1% (GSTt) to more than 95% (Ser/Thr phosphatase, GAPDH, PRX2, tubulin beta), and proximity to other Cys thiols also varied significantly. We further discuss each of these proteins in detail.
Direct reactivity with NO
When considering potential mechanisms of S-nitrosylation, besides the case-specific protein-protein interaction, the most obvious alternatives are (i) direct S-nitrosylation; and (ii) trans-nitrosylation via amino-acid derivatives, such as S-nitrosoglutathione (GSNO). We first tested the possibility of direct NO/Cys reaction employing the chemical-physical features thought to be necessary for an effective reaction [18], i.e., that the NO/Cys reaction may take place in small and slightly hydrophobic pockets (i.e., clusters of spatially related amino acids, defining a partially solvent accessible region), with concomitant presence of a basic residue with its functional atom(s) in proximity to the modifiable Cys sulfur atom.
We employed a simple algorithm, which analyzed the NO-Cys sites in the dataset in the following steps: (i) analysis of protein pockets with Castp (http://sts-fw.bioengr.uic.edu/castp/); (ii) analysis of NO-Cys belonging to one or more pockets; (iii) analysis for occurrence of basic residues in the same pocket (if there was more than one potential pocket, each was evaluated separately) at a distance less than 8 Å from the sulfur atom; and (iv) calculation of the hydrophobicity index for the pocket according to the Kyte-Doolittle scale. These criteria were meant to detect NO-Cys sites characterized by direct nitrosylation. We tested all proteins in our dataset, and the positives included GSTp, malate dehydrogenase, GSTm, IKKB, vesicle-fusing ATPase, myosin heavy chain 9, Stress-70 protein, and MMP9. Interestingly, two proteins, GSTp and malate dehydrogenase, were those with deviating NO-Cys (absence of the exposed acid-base motif). Therefore, for direct S-nitrosylation, the presence of both acidic and basic residues near the NO-Cys does not appear to be a necessary feature.
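Steps (iii) and (iv) of this screen can be sketched in Python once the pocket residues are known (pocket detection itself is left to an external tool and is not reproduced here). The sketch is an illustration under stated assumptions: the input layout, the function names, and the cutoff `min_hydropathy=0.0` used to express "slightly hydrophobic" are hypothetical, and the hydrophobicity index is taken as the plain mean of Kyte-Doolittle values over the pocket residues. That last assumption is at least consistent with the GSTt pocket described later in the text, for which the mean over the six listed residues is 0.4. The coordinates in `gstt_pocket` are placeholders chosen only so that the Lys charged atom sits 4.8 Å from a sulfur at the origin.

```python
import math

# Kyte-Doolittle hydropathy values keyed by three-letter residue name.
KD = {
    "ALA": 1.8, "ARG": -4.5, "ASN": -3.5, "ASP": -3.5, "CYS": 2.5,
    "GLN": -3.5, "GLU": -3.5, "GLY": -0.4, "HIS": -3.2, "ILE": 4.5,
    "LEU": 3.8, "LYS": -3.9, "MET": 1.9, "PHE": 2.8, "PRO": -1.6,
    "SER": -0.8, "THR": -0.7, "TRP": -0.9, "TYR": -1.3, "VAL": 4.2,
}
BASIC = {"LYS", "ARG", "HIS"}


def pocket_hydropathy(pocket):
    """Mean Kyte-Doolittle value over the residues lining a pocket."""
    return sum(KD[name] for name, _ in pocket) / len(pocket)


def is_direct_candidate(sulfur, pocket, max_dist=8.0, min_hydropathy=0.0):
    """Check a pocket containing the Cys for a nearby base and hydrophobicity.

    pocket: list of (residue_name, charged_atom_coords) tuples, including
    the Cys itself; the coordinate list is empty for uncharged residues.
    min_hydropathy=0.0 is an assumed cutoff, not a value from the text.
    """
    has_base = any(
        name in BASIC and any(math.dist(sulfur, atom) < max_dist for atom in atoms)
        for name, atoms in pocket
    )
    return has_base and pocket_hydropathy(pocket) > min_hydropathy


# Pocket residues as listed for the GSTt dimer; coordinates are placeholders.
gstt_pocket = [
    ("CYS", []), ("VAL", []), ("LEU", []), ("THR", []),
    ("GLU", [(9.0, 0.0, 0.0)]), ("LYS", [(4.8, 0.0, 0.0)]),
]
```

When a Cys belongs to more than one pocket, each pocket would be passed to `is_direct_candidate` separately, in line with step (iii).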
Properties of NO-Cys sites relevant for direct S-nitrosylation
Docking analysis: NO-Cys and GSNO
To computationally investigate trans-nitrosylation via GSNO, we carried out docking calculations for all NO-Cys-containing proteins in our dataset. In a recent study, influences exerted by the NO group on Cys residues were investigated [32] and new force field parameters specific for this modification determined, providing a robust starting point for a variety of structure-based investigations of NO modification sites. We implemented these parameters for NO-Cys in docking calculations; in particular, we transferred the QM-theory level partial charges in the modified site of interest (i.e., Cys-NO side chain of GSNO, as detailed in Experimental Procedures). In our analysis, we required the binding positions to be characterized by an overall favorable energy (i.e., energy < −1 kcal/mol) and by a distance lower than 3.0 Å between the two sulfur atoms or between the Cys sulfur and the nitrogen of the NO substituent.
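The two acceptance criteria for a docking pose can be expressed as a simple filter. The Python sketch below is illustrative only: the function names and the tuple layout `(energy, s_s_dist, s_n_dist)` are hypothetical, and the pose energies and distances are assumed to come from whatever docking program is used.

```python
def is_reactive_pose(energy, s_s_dist, s_n_dist, max_energy=-1.0, max_dist=3.0):
    """Apply the energy and distance criteria to one docking pose.

    energy: binding energy in kcal/mol.
    s_s_dist: distance (Å) between the Cys sulfur and the GSNO sulfur.
    s_n_dist: distance (Å) between the Cys sulfur and the NO nitrogen.
    """
    close_enough = s_s_dist < max_dist or s_n_dist < max_dist
    return energy < max_energy and close_enough


def best_reactive_pose(poses):
    """Lowest-energy pose that passes the filter, or None if there is none.

    poses: iterable of (energy, s_s_dist, s_n_dist) tuples.
    """
    reactive = [pose for pose in poses if is_reactive_pose(*pose)]
    return min(reactive, default=None)
```

For instance, a pose with an energy of −5.4 kcal/mol, an S-S distance of 2.4 Å and an S-N distance of 2.8 Å passes both criteria.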
First, we discuss the data for the models lacking the acid-base motif in the exposed 8 Å region, i.e., GAPDH, tubulin beta, histone deacetylase, PRX2, GSTp, GSTt, Ser/Thr phosphatase and malate dehydrogenase. These proteins, except GSTs and malate dehydrogenase, showed good affinity for GSNO with more than one binding position (the best are shown in and further information is provided in ). In the case of malate dehydrogenase and GSTp, GSNO docking did not yield reasonable structural models, suggesting that GSNO is unlikely to serve as the nitrosylating agent. However, these proteins were found as positive candidates for the direct NO reactivity test described above.
GSTt dimerization interface region
Proteins with predicted reactivity toward GSNO
An interesting case was the analysis of Ser/Thr phosphatase: docking GSNO to the reported NO-Cys site (Cys 228) did not yield a reliable model. Instead, a good affinity was found between GSNO and a nearby Cys 256 (4 Å from Cys 228), as shown in and . In the best model (with an energy of −5.4 kcal/mol), the GSNO sulfur atom was within 2.4 Å of the Cys 256 sulfur (and 2.8 Å from the NO nitrogen), suggesting that Cys 256 (rather than 228) was a good NO-Cys candidate, via interaction with GSNO. Thus, it is possible that Cys 256 may act as a donor in the trans-nitrosylation reaction between Cys 228 (the reported final acceptor) and Cys 256. This possibility raises an important question: could Cys residues in the same protein be engaged in trans-nitrosylation? If so, the low occurrence of nearby Cys (especially within 6 Å) could be explained by the need to avoid promiscuous and uncontrolled NO transfer within proteins whose functionality is affected by NO-Cys modification. It would be important to address this question in further direct experiments. In this regard, Ser/Thr phosphatase may be an excellent model protein for such a study.
Docking calculations for proteins in the NO-Cys dataset
As to the other NO-Cys-containing proteins in the dataset, we found that only a few showed good affinity toward GSNO. The other positive probes included OxyR (Cys 113), MAT (Cys 121), creatine kinase (Cys 283), calpain 2 (Cys 301), tubulin alpha (Cys 347), myosin heavy chain 9 (Cys 91), JNK1 (Cys 116), TRX1 (Cys 73), and ISG15 (Cys 76). Additionally, sodium/potassium-transporting ATPase showed a reactive Cys 42, which was not previously reported to be S-nitrosylated, but which was close to the known NO-Cys site (Cys 49, with an S-S distance of 4.3 Å from Cys 42), similar to what was described for Ser/Thr phosphatase.
We found good affinity of TRX1 for GSNO with preferential binding in the exposed region between Cys 32 (catalytic Cys) and Cys 73. Indeed, in our calculations (based on the reduced form of TRX1), only the latter was found in a position such that its sulfur atom could react with the NO group of the substrate (Figure S6). These observations are in line with previous experimental studies [33]. We did not find good candidates for the Cys 69-GSNO complex (Figure S6), even though Cys 69 is also a potential target of S-nitrosylation via GSNO [35]. Finally, our docking data indicated that the acid-base motif did not correlate with the affinity toward GSNO: 5 out of 8 “deviating” NO-Cys sites (i.e., without the acid-base motif) were predicted as reactive toward GSNO. Thus, similarly to what was found for direct reactivity with NO, GSNO affinity also did not require the presence of both basic and acidic residues in proximity of the modifiable Cys.
Overall, 7 out of 8 proteins lacking the exposed distant acid-base motif in proximity of NO-Cys were found to be good candidates for reaction with GSNO (5 proteins) or NO (the other 2 proteins). Altogether, our results support the hypothesis that the acid-base motif may play an indirect role in the NO/Cys reaction: its exposure, its positioning (with functional groups distal to NO-Cys) and its lack of correlation with GSNO and NO reactivity point directly to a role in defining the molecular surface near NO-Cys (e.g., electrostatic potential and ionic interactions).
However, while the role of the acid could be explained in this way, the role of basic residues may be more complex. Hypothetically, their presence can be linked to thiolate stabilization, protein-protein interactions, or both. Additionally, it has to be considered that both acidic and basic residues may contribute, together with the other nearby residues (e.g., Ser and Thr), to interactions (electrostatic, H-bond) with GSNO. However, this contribution varies case by case, and from our docking analysis appears not to be a generalizable feature of NO-Cys reactivity with GSNO.
Effects of protein-protein interactions on Cys reactivity
These considerations do not explain the case of the single remaining deviating protein, GSTt, which lacked the exposed acid-base motif and did not show reactivity with GSNO or NO. Cytosolic GSTs, a family of multifunctional enzymes, naturally form homodimers [36]. The importance of their subunit interface region is well understood and fundamental to the function of these proteins [37]. Two types of interaction emerged as critical for GST dimerization: hydrophobic contacts (particularly in the Alpha, Mu and Pi classes) and electrostatic interactions driven by class-specific patterns of charged amino acids exposed in the contact region [38]. We modeled a rat GSTt dimer based on the crystal structure of the human protein (1LJR, which could not be directly used due to significant differences with the rat protein, including a Cys50Ser mutation). In the dimer structure, Cys 50 was well exposed (>10 Å²) and found in proximity to the dimerization interface. Moreover, its reactivity was significantly enhanced, with a calculated pKa of 6.9 for the dimeric form of the protein (8.2 for the monomeric form).
We tested the GSTt dimer model for (i) direct S-nitrosylation; and (ii) GSNO docking assay. Its NO-Cys (Cys 50) was predicted with the sulfur atom in a well-defined pocket (Cys 50, Val 63, Leu 64, Thr 65 of one monomer, and Glu 97, Lys 149 of the other monomer), which was slightly hydrophobic (0.4 is the overall Kyte-Doolittle score). Moreover, the basic residue had its charged atom at 4.8 Å from the sulfur (Lys depicted in stick representation). Thus, Cys 50 was a potential candidate for direct S-nitrosylation.
As to the docking assay, GSNO showed affinity for Cys 50 of the dimeric GSTt, in clear contrast with the zero affinity found for the monomeric protein. However, no strictly reactive positions were found for the first 10 ranked docking models (Figure S7). In turn, GSH consistently docked, in both monomeric and dimeric forms of GSTt, close to its natural binding site (near Gln 12, Figure S7). Thus, in the case of this protein, the NO modification appears to exert an effect on GSH properties, leading to an increased affinity of Cys 50 toward GSNO. For comparison purposes, we conducted the same docking analysis for the modeled dimer of GSTp, and the results were clearly different: neither GSH nor GSNO arrived closer than 10 Å to Cys 48. Instead, both substrates docked in the natural GSH-binding pocket.
Thus, dimerized GSTt showed clear enhancement in Cys 50 reactivity (for all tested parameters), transforming a previously apparently inert Cys into a reactive residue. In particular, Cys 50 in the dimer appeared to form a suitable site for direct S-nitrosylation. This could be taken as a general case of how protein-protein interactions could change the character of Cys in an interface region, enhancing its reactivity. This switch in Cys reactivity upon interaction with another protein may be crucial in the interprotein NO transfer process, and also could explain why many reported NO-Cys sites eluded efficient predictions based on standard reactivity parameters (e.g., exposure, pKa).
Topology and relative mobility of NO-Cys
Finally, notable observations were obtained from the analysis of B factors (). In a clear pattern shared by all modified Cys residues, nitrosylated sites were located in portions of proteins with lower-than-average mobility (i.e., the B factor of the modified Cys was lower than the average B factor of the whole protein) ().
B factor analysis of NO-Cys sites
However, mobility increased progressively from the NO-Cys to its proximal (6 Å) region and then to the whole protein. Thus, our analysis revealed a clear trend wherein both the modified Cys and, to a lesser extent, its proximally located residues showed lower mobility (as indicated by the experimentally derived B factors) than other regions of the same proteins (). Importantly, this pattern was absent in the control set of proteins (). These data suggest that S-nitrosylation occurs on structured Cys residues, wherein the local effect of NO modification could influence to a greater extent the regions distant from the modified site.
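The three-level comparison described above (modified Cys, proximal region, whole protein) can be sketched as follows. This is a minimal illustration rather than the procedure used in the study: it assumes the proximal region consists of all atoms of other residues lying within 6 Å of any atom of the Cys, and it uses invented toy coordinates and B factors in place of a real structure. The function name `b_factor_profile` is hypothetical.

```python
import math
from statistics import mean


def b_factor_profile(atoms, cys_resid, radius=6.0):
    """Mean B factor of a Cys, of its proximal shell, and of the whole protein.

    `atoms` is a list of (residue_number, (x, y, z), b_factor) tuples.
    The proximal shell is taken as atoms of other residues within `radius`
    angstroms of any Cys atom (an assumed definition).
    """
    cys_atoms = [a for a in atoms if a[0] == cys_resid]
    if not cys_atoms:
        raise ValueError(f"residue {cys_resid} not found")

    def near_cys(coord):
        return any(math.dist(coord, c[1]) <= radius for c in cys_atoms)

    proximal = [a for a in atoms if a[0] != cys_resid and near_cys(a[1])]
    if not proximal:
        raise ValueError("no atoms found within the proximal radius")

    return {
        "cys": mean(a[2] for a in cys_atoms),
        "proximal": mean(a[2] for a in proximal),
        "protein": mean(a[2] for a in atoms),
    }


# Toy data (not taken from any real structure).
toy_atoms = [
    (50, (0.0, 0.0, 0.0), 12.0),    # Cys atom
    (50, (1.5, 0.0, 0.0), 14.0),    # Cys atom
    (63, (4.0, 0.0, 0.0), 18.0),    # within 6 angstroms of the Cys
    (64, (0.0, 5.0, 0.0), 20.0),    # within 6 angstroms of the Cys
    (120, (20.0, 0.0, 0.0), 35.0),  # distant
    (121, (25.0, 0.0, 0.0), 41.0),  # distant
]

profile = b_factor_profile(toy_atoms, cys_resid=50)
print(profile)
```

With real data, the atom list would be read from the deposited structure, and the ordering cys < proximal < protein would correspond to the trend reported for the NO-Cys dataset.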
Effects of Cys nitrosylation on electrostatic potential distribution
We analyzed available crystallographic structures of proteins containing NO-Cys, including human Trx1 (PDB 2HXK), blackfin tuna myoglobin (2NRM), human hemoglobin (1BUW) and human PTP1B (3EU0), and compared them with the corresponding non-modified structures (see , for details on PDB codes). Upon modification, these proteins showed moderate structural rearrangements. When NO-sylated and non-modified proteins were superimposed, the all-atom root mean square deviation (rmsd) ranged from 0.8 Å for Trx1 to 1.3 Å for hemoglobin, with 1.1 Å for myoglobin and 0.9 Å for PTP1B (). Interestingly, the average displacement differed among sets of amino acids grouped by their physico-chemical properties (i.e., basic, acidic, polar uncharged, and apolar residues, as defined by DeepView 4.0). Charged residues clearly showed larger displacement than other categories of amino acids (). In three proteins (myoglobin, Trx1 and hemoglobin), basic residues were the most displaced, whereas acidic residues were the most displaced in PTP1B. A common trend was apparent wherein, following S-nitrosylation, the structural rearrangement affected charged residues to a greater extent. Of particular interest was the observation that many highly displaced charged residues were exposed (present on the molecular surface) and were located far (more than 8 Å) from the Cys modification site. For example, upon nitrosylation of Cys 10 of myoglobin, a loop (Lys 73 to His 78) that was laterally placed with respect to the N-terminal α-helix containing the NO-Cys showed complete repositioning. This movement corresponded to an rmsd of 4.4 Å, which was significantly higher than the average for the whole protein () and also higher than the displacement of the NO-Cys region itself (rmsd of 1.6 Å for the α-helical region spanning Asp 4 to Ala 15). The loop contained three positively charged residues (Lys 73, Lys 75 and His 78), each located more than 15 Å from the modification site.
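The per-category displacement analysis can be sketched in a few lines. The example below assumes that the two structures have already been superimposed and that atoms are matched by residue name, residue number and atom name. The grouping of residues into the four categories is an assumption made for illustration and may differ in detail from the DeepView 4.0 definitions; the coordinates are invented toy values, and the function name `rmsd_by_class` is hypothetical.

```python
import math
from collections import defaultdict

# Assumed grouping of residues into physico-chemical categories.
RESIDUE_CLASS = {
    "ARG": "basic", "LYS": "basic", "HIS": "basic",
    "ASP": "acidic", "GLU": "acidic",
    "SER": "polar", "THR": "polar", "ASN": "polar", "GLN": "polar",
    "TYR": "polar", "CYS": "polar",
    "ALA": "apolar", "VAL": "apolar", "LEU": "apolar", "ILE": "apolar",
    "MET": "apolar", "PHE": "apolar", "TRP": "apolar", "PRO": "apolar",
    "GLY": "apolar",
}


def rmsd_by_class(reference, modified):
    """Root mean square deviation per residue category.

    Both arguments map (residue_name, residue_number, atom_name) to an
    (x, y, z) tuple. Structures are assumed to be superimposed already;
    atoms missing from either structure are skipped.
    """
    squared = defaultdict(list)
    for key, ref_xyz in reference.items():
        category = RESIDUE_CLASS.get(key[0])
        if category is None or key not in modified:
            continue
        squared[category].append(math.dist(ref_xyz, modified[key]) ** 2)
    return {cat: math.sqrt(sum(v) / len(v)) for cat, v in squared.items()}


# Toy coordinates (not taken from any deposited structure).
unmodified = {
    ("LYS", 73, "NZ"): (0.0, 0.0, 0.0),
    ("GLU", 10, "OE1"): (5.0, 5.0, 5.0),
    ("LEU", 20, "CD1"): (1.0, 1.0, 1.0),
}
nitrosylated = {
    ("LYS", 73, "NZ"): (3.0, 4.0, 0.0),   # displaced by 5 angstroms
    ("GLU", 10, "OE1"): (5.0, 5.0, 6.0),  # displaced by 1 angstrom
    ("LEU", 20, "CD1"): (1.0, 1.0, 1.0),  # unchanged
}

displacement = rmsd_by_class(unmodified, nitrosylated)
print(displacement)
```

Restricting the input dictionaries to a subset of residues (for instance a single loop) gives a region-specific rmsd in the same way.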
As a consequence of the movement of exposed charged residues, the molecular surface was locally rearranged, and this led to a change in its electrostatic properties (i.e., when charged residues moved on the surface, both the surface and the relative positions of charged atoms changed, thus affecting the surface electrostatic potential distribution), as shown in . In at least three cases (myoglobin, Trx1 and hemoglobin), a marked redistribution of the electrostatic potential was observed. In the case of PTP1B, which was curiously the protein showing the lowest displacement of basic residues (), the redistribution following S-nitrosylation of its catalytic Cys was smaller though still detectable. Given the low number of available paired experimental structures with and without NO-Cys, it may be difficult to draw general conclusions. However, from the analysis of this limited protein dataset, it seems that an important effect of S-nitrosylation is to modify the electrostatic properties of molecular surfaces by triggering the movement of exposed charged residues. This hypothesis fits well with sophisticated theoretical calculations on NO-Cys in proteins, showing that nitrosylation causes substantial charge redistribution in Cys side chain atoms [32]. Intriguingly, in this scenario an additional role for acidic and basic residues might be to enhance the electrostatic perturbation introduced by S-nitrosylation through a mechanism wherein the message propagates from a receiver (Cys) to peripheral parts of the protein.
Average displacement of amino acids following S-nitrosylation
Effect of S-nitrosylation on electrostatic potential distribution