MUTYH is a unique glycosylase that catalyzes the removal of an undamaged adenine that has been misincorporated opposite the lesion OG. In 2002, MUTYH was thrust into the limelight when inherited mutations in the gene were correlated with colorectal cancer [
7]. MUTYH-associated polyposis (MAP) is the only documented form of cancer that originates from inherited biallelic mutations in a base excision repair gene [
3]. The initial mutations in
MUTYH that were uncovered encoded the variants Y165C and G382D MUTYH [
7]. Subsequent work has uncovered a plethora of other missense mutations, as well as truncating and splice site mutations associated with MAP; however, these two original mutations presently represent 70–80% of the known mutations in
MUTYH [
8]. Though the clinical data related to MAP has been accumulating rapidly, the amount of functional data on MUTYH and variants is limited. Moreover, the reported results from enzyme assays of MUTYH variants are conflicting and provide a murky picture of the relationship between the glycosylase activity and pathogenicity of a given MUTYH variant [
14,
15,
44–
46]. We believe the conflicting results are a consequence of the variable amounts of active MUTYH obtained from bacterial overexpression that complicate the analysis of the enzymatic activity. The aim of this study was to provide an approach for characterizing the glycosylase activity of MUTYH and variants associated with MAP. We have analyzed the kinetics of adenine removal from an OG:A-containing substrate with several common MAP variants, namely P391L, Q324R, Y165C, and G382D MUTYH. These results, in combination with the ability of the variant MUTYH enzymes to suppress DNA mutations in rifampicin resistance assays, provide new insight into the factors influencing the ability of MUTYH and variants to prevent mutations associated with OG.
In order to obtain enough high quality protein to analyze the adenine glycosylase activity of MUTYH
in vitro, we have overexpressed MUTYH as a fusion-protein with the maltose-binding protein at the N-terminus in
E. coli. The expression as an MBP-fusion coupled with the modified MUTYH gene that contains optimal Arg codons resulted in enough active MUTYH protein to use the same approach used previously to analyze the adenine removal activity of
E. coli MutY. A critical aspect of this approach is measurement of the burst amplitude from pre-steady state reactions of MUTYH with an OG:A substrate to determine the amount of
active enzyme. The challenges associated with expression and isolation of MUTYH from bacteria results in significant fractions of the overexpressed protein that may be misfolded and therefore inactive. Indeed, we observe considerable variations in active fraction with different preparations of WT MUTYH. However, using the active site concentration allows for correction between preparations and reproducible determination of the kinetic rate constants
k2 and
k3 that correspond to steps involved in the base excision process and product release, respectively. The rate constant
k3 is similar for MUTYH to
E. coli MutY under the same conditions indicating a similarly high affinity for the DNA product [
41]. Interestingly, the rate constant
k2 for the intrinsic base excision step measured for WT MUTYH with an OG:A substrate is similar to the corresponding rate constant with the murine Mutyh measured previously [
30]. However, WT MUTYH showed minimal activity toward the G:A-containing substrate under the assay conditions approximating physiological salt (150 mM NaCl). This is consistent with results obtained with
E. coli MutY, where there is a 70-fold lower
k2 rate constant for the adenine glycosylase activity for the G:A containing duplex substrate compared to the OG:A counterpart at higher salt concentrations [
41]. In the same study, when the same sequence context for G:A was evaluated in a cellular repair assay, nominal MutY-mediated repair at the G:A bp was observed, while the OG:A mismatch was completely repaired. Such results are consistent with the supposition that OG:A mismatches are the most physiologically relevant substrates for both the bacterial and mammalian forms of this BER glycosylase [
33,
47].
Using the same kinetic approach, the adenine glycosylase activity of Y165C, G382D, Q324R, and P391L MUTYH variants was examined. Single-turnover experiments were used to determine the rate constant
k2 for these variants (). Our results show a 3-fold reduction in rate for each variant compared to WT MUTYH. This is quite different from previous reported results for Y165C and G382D MUTYH [
14,
15,
44]. For example, the results of Wooden
et al. with cell extracts containing Y165C and G382D MUTYH expressed as an N-terminal GST fusion in a CC104
mutY- strain indicated that the variant enzymes were completely devoid of adenine glycosylase activity [
14]. A more recent study reported measurements of the rate constant
k2 with purified GST-fusion constructs of Y165C and G382D MUTYH expressed in a bacterial strain harboring the tRNAs for the rare Arg codons [
15]. The authors concluded that the glycosylase activity of Y165C MUTYH was severely compromised while that of G382D MUTYH was only slightly compromised compared to the WT MUTYH enzyme [
15]. We attribute the differences in the previous results to those reported herein to the lack of consideration and correction for active enzyme fraction. We have observed different active enzyme fractions from different preparations of the same enzyme form, including WT MUTYH. In fact, in preliminary work using an expression vector without the Arg codon changes encoding His
6-tagged WT MUTYH, some enzyme preparations exhibited barely detectable adenine removal activity. Correction for active site concentration is critical, since differences in active site fraction among different preparations of the same enzyme can be as large as those between different variant enzymes. In the study of Wooden
et al., the reaction of WT MUTYH with the OG:A substrate was not complete within the 30 minute incubation-time analyzed indicating a low amount of active enzyme in the reaction [
14]. In the later study by Ali
et al. [
15], the reaction of the OG:A substrate with WT GST-MUTYH was shown to go to completion indicating a higher active enzyme concentration by expression in
E. coli harboring the rare Arg tRNAs; however, in the determinations of the rate constant
k2, the reactions were performed using the same concentration of total enzyme rather than active enzyme. This type of analysis is misleading since different amounts of active protein may lead to differences in the observed rates. In such cases, attributing these differences to the mutation of a specific amino acid would misrepresent the pathogenicity of a given variant. Moreover, ensuring that the observed rate approximates the rate constant
k2 requires that the active enzyme concentration be above
Kd and the DNA concentration. Appropriate analysis of the kinetic parameters allows for accurate assessment of the extent with which a modification in an amino acid affects a specific aspect of the mismatch recognition or base excision process. Indeed, such knowledge may also allow for more accurate prediction of the potential outcomes of amino acid changes at other positions in MUTYH. It should also be noted that reproducibly lower amounts of active enzyme fraction for a given variant compared to the WT form may be due to the amino acid change that alters the proper folded conformation required for catalysis; therefore, this is an additional piece of information on the consequences of an amino acid variation in MUTYH.
The kinetic results from adenine glycosylase assays of the Y165C and G382D MUTYH variants in this study are consistent with previous work from our laboratory on the bacterial [
10,
11] and murine Mutyh [
13] enzymes, consistent with the high degree of sequence conservation between MutY enzymes from bacteria to humans. The major difference is that previous functional work on
E. coli MutY showed a more dramatic decrease in adenine glycosylase activity caused by the Tyr to Cys change [
11]. In addition, the affects on the activity and binding are consistent with the location of the corresponding Tyr and Gly residues in the
Bs MutY-DNA structure [
12] that place both residues in close proximity to OG. Thus, the structural and functional data are consistent with an increased frequency of G to T transversions in a variety of genes, most conspicuously
APC, leading to MAP.
P391L MUTYH is a variant that has been reported in several MAP cases [
8,
26–
28]. Pro 391 in MUTYH corresponds to Pro 269 in
Bs MutY and is highly conserved across species (). The P391L variant is particularly interesting since the Pro residue is located quite remotely from the OG and adenine binding sites as well as the DNA backbone (). This residue is located in the C-terminal domain, which makes extensive contacts with the OG-containing strand and interfaces with the catalytic domain of
Bs MutY [
12]. Indeed, functional studies have established that this domain plays an important role in the recognition of OG within OG:A mismatches [
48,
49]. Recently studies using a cell-based repair assay showed that identification of OG rather than the adenine base has a greater influence on location and repair of the mismatch by MutY [
41]. Though seemingly far away from the target OG:A bp in the
Bs MutY-DNA structure, Pro 269 resides near several other hydrophobic residues and together these residues form a hydrophobic core within the C-terminal domain [
12]. Moreover, Pro 269 is involved in a turn region between two β-sheets that are strongly packed with other portions of the C-terminal domain. Disruption of the intricate contacts within this domain by introduction of the large leucine residue might result in destabilization or altered flexibility of this domain that ultimately hamper the enzyme’s ability to recognize OG [
26].
The region of MUTYH corresponding to Gln 324 is located in a region not found in the bacterial enzymes that represents an extension of the N-terminal domain before the flexible linker between the two domains (). This location makes it difficult to make structural prediction on the possible consequences to DNA mismatch binding and catalysis caused by mutations at this position. Interestingly, the codon for Gln 324 is frequently mutated. The commonly observed polymorph Q324H MUTYH [
8] was shown by Ali
et al. to be slightly less active than the wild type protein [
15], while Shinmura
et al. reported that the activity of this enzyme form is identical to the WT protein [
50]. In the case of the mutation of Gln to Arg in Q324R MUTYH, we observe that the rate constant
k2 for adenine removal from OG:A substrates is reduced three-fold compared to the WT protein. Western blot analysis with whole cell lysates also shows that the bacterial expression levels are comparable to the WT protein (data not shown). Q324R MUTYH is also able to suppress the mutation frequency to WT levels in the rifampicin assays in the absence of endogenous MutY. Thus, in this case, though this variant showed a reduced rate for adenine excision in the glycosylase assays, the enzyme activity was sufficient to mediate repair and prevent mutations in a cellular context. This suggests that enough active enzyme with a sufficient rate of adenine excision was present, providing full complementation for the bacterial enzyme. The idea that some reduction in adenine removal activity may be tolerated is consistent with recent work in our laboratory that showed that the modified substrate base pair, OG:Z3 (where Z3 = 3-deaza-2′-deoxyadenosine), was fully repaired to G:C in
E. coli despite having a 100-fold reduction in the measured
k2 for base removal
in vitro with MutY [
41]. However, it was noted that MutY exhibits high affinity for OG:Z3 base pairs suggesting that a reduction in catalytic adenine removal may be more readily tolerated in terms of overall repair if the mismatch is effectively recognized and intercepted prior to replication.
Despite having similar reductions in the rate of adenine removal as Q324R MUTYH compared to WT MUTYH (), the Y165C, G382D, and P391L MUTYH variants are all less able to suppress mutations as determined in the rifampicin resistance assays. In the presence of G382D MUTYH, the mutation frequency is increased 4-fold over the WT enzyme and is only 2-fold reduced compared to the expression vector control lacking
MUTYH. The presence of Y165C or P391L MUTYH provided mutation frequency values that are greater than the vector control indicating an absence of any ability to prevent DNA mutations. The compromised ability to prevent mutations of these three variants could be a result of the reduced stability and the active fraction of these variants in
E. coli. Western blot analyses indicated that total amounts of the variant proteins were similar to the WT MUTYH enzyme; this is an advantage of using the MBP tag since its presence equalizes protein expression compared to other tag constructs. However, the concentration of active protein for the Y165C, P391L and G382D MUTYH proteins measured via the glycosylase reaction are typically less than the WT protein. Reduced amounts of active protein coupled with slower adenine removal capacity would effectively further reduce OG:A repair and increase the mutation frequency. It is also possible that the intrinsic activity and active fraction of these variants may be further compromised in the cellular environment. Indeed, we previously observed that the rate constant
k2 from adenine glycosylase assays of both the Y150C and G365D murine Mutyh variants was reduced by the presence of the AP endonuclease while the corresponding
k2 measurements with WT murine Mutyh enzyme was not affected [
13]. This suggested that other proteins may be able to compete with the variants, but not the WT enzyme, for the DNA substrate, thus magnifying any defects in activity in a cellular setting. Interestingly, our previous work showed that Y82C MutY was more severely compromised in terms of catalysis of adenine removal compared to G253D MutY [
10]. In addition, the rifampicin assays herein show that the mutation frequency was reduced, albeit modestly (only 2-fold), in the presence of G382D MUTYH, while no mutation suppression was observed with Y165C MUTYH. These results suggest that some specific amino acid substitutions may be more sensitive and consequences due to the presence of the variant MUTYH may be more readily revealed in cellular assays than
in vitro. Notably, a recent clinical study showed that the presence of the Y165C MUTYH variant is associated with an earlier onset and a more severe CRC phenotype than G382D MUTYH [
51].
P391L MUTYH resulted in mutation frequency results similar to Y165C MUTYH. With this variant the results are not as easily rationalized as those for Y165C MUTYH. For example, the analogous
E. coli enzyme (P262L MutY) was found to be identical to the WT MutY in a variety of aspects, including kinetics, mismatch affinity, stability, and ability to suppress DNA mutations [
52]. A potential explanation of the seemingly conflicting results with P391L MUTYH may be a result of magnification of defects in locating the mismatch in a large DNA substrate in a cellular context that are not apparent in the bacterial enzyme or the
in vitro experiments with a short defined duplex containing a single lesion. In addition, the results can not simply be explained by instability of the enzyme leading to its degradation and absence, since Western blot analyses showed that the level of total protein in cell lysates was similar for P391L and WT MUTYH. However, active fraction on average is reduced for P391L MUTYH and may be magnified in the cellular context. A distinct feature of the kinetics of P391L MUTYH is the considerably reduced rate constant
k3 indicating a product release rate that is even slower than WT MUTYH. In a cellular context, MUTYH not only initiates BER but also makes sure that the DNA intermediate is passed on to the next protein in the pathway (APE1) [
53,
54]. Though this is a bacterial assay, the fact that MUTYH is able to complement for MutY suggests that initiation of BER and coordination with downstream BER enzymes is still maintained. The magnitude of the rate of product release may be quite important to ensure a “safe transition” between the glycosylase and the AP endonuclease. If the release is too fast, this may release the toxic and mutagenic OG:AP site product intermediate prior to full BER. Failure to release the product at some point may also be problematic since this may also thwart full repair of the mismatched site to a G:C base pair.
Clearly many unanswered questions remain as to the extent that defects in the catalytic rates of adenine glycosylase activity translate to reduction in repair of OG:A mismatches and increases in mutation frequency in
APC. In the case of Q324R MUTYH, the fact that this enzyme was able to complement for
E. coli MutY in the rifampicin assay suggests that this enzyme is functional to reduce mutations despite its reduced glycosylase activity. Together these results suggest that this variant may be less deleterious compared to the other variants that show defects in both types of assays. However, caution should be taken when translating these results to a clinical outcome since other factors may also modulate the activity of the MUTYH protein in colonic cells. For example, post-translational modifications should be considered with eukaryotic enzymes. Indeed, previous work has suggested that MUTYH is phosphorylated [
44,
55]. In addition, MUTYH has been shown to interact with several proteins associated with the repair and replication machinery [
56]. In considering these issues, it is possible that the activity of variants such as Q324R and G382D MUTYH may be more reduced in human cells. Deficiency in the glycosylase activity of MUTYH may only be one of the pieces in the context of a larger repair puzzle. Hence, in order gain a better understanding of the potential pathogenic properties of MUTYH variants, several different types of analyses should be undertaken. For example, determining the effects of the mutations on stability, protein expression levels, recognition and binding of the OG:A mismatch, rate of base removal, interaction with other BER and replication machinery proteins, and cellular mutation frequency, would be particularly illuminating. These results can be correlated with clinical data on the phenotypic presentation associated with a given variant. One might expect that variants that exhibit defects in a variety of assays will be the most strongly correlated with detrimental effects to human health. This is an exceptional example of a cancer mechanism where functional data from biochemical and cellular experiments coupled with structural insight from X-ray crystallography can be correlated with clinical data on patients to provide a comprehensive picture of the disease mechanism.