|Home | About | Journals | Submit | Contact Us | Français|
The fidelity of DNA synthesis by A-family DNA polymerases ranges from very accurate for bacterial, bacteriophage and mitochondrial family members to very low for certain eukaryotic homologues. The latter include Pol ν which, among all A-family polymerases, is uniquely prone to misincorporate dTTP opposite template G in a highly sequence-dependent manner. Here we present a kinetic analysis of this unusual error specificity, in four different sequence contexts and in comparison to Pol ν’s more accurate Family A homologue, the Klenow fragment of E. coli DNA polymerase I. The kinetic data strongly correlate with rates of stable misincorporation during gap-filling DNA synthesis. The lower fidelity of Pol ν compared to Klenow fragment can be attributed primarily to a much lower catalytic efficiency for correct dNTP incorporation, whereas both enzymes have similar kinetic parameters for G-dTTP misinsertion. The major contributor to sequence-dependent differences in Pol ν error rates is the reaction rate, kpol. In the sequence context where fidelity is highest, kpol for correct G-dCTP incorporation by Pol ν is ~ 15-fold faster than kpol for G-dTTP misinsertion. However, in sequence contexts where the error rate is higher, kpol is the same for both correct and mismatched dNTPs, implying that the transition state does not provide additional discrimination against misinsertion. The results suggest that Pol ν may be fine-tuned to function when high enzyme activity is not a priority and may even be disadvantageous, and that the relaxed active-site specificity towards the G-dTTP mispair may be associated with its cellular function(s).
Cells rely on a variety of processes to ensure faithful DNA replication and repair and DNA polymerases play key roles in these processes. The human genome encodes at least 15 DNA polymerases (reviewed in (1)). Three of these enzymes, DNA polymerases γ (Pol γ)1, θ (Pol θ) and ν (Pol ν), are members of the A-family of DNA polymerases because they share homology with Escherichia coli DNA polymerase I, encoded by the polA gene. The focus of this study is Pol ν, encoded by the human POLN gene (2). Although the biological role of Pol ν is uncertain, one clue to its possible function is its homology to Drosophila melanogaster Mus308, a Family A DNA polymerase with a helicase domain at the N-terminus (3). mus308 mutants are hypersensitive to DNA crosslinking agents (nitrogen mustard and cisplatin) but not to MMS (4), implicating Mus308 in the repair of highly toxic interstrand cross-links (2, 4, 5). POLN-knockdown cells are sensitive to various DNA-damaging agents (6, 7) including DNA cross-linking agents, indicating that Pol ν may be involved in repairing lesions generated by these agents. There is also functional and physical evidence for the interaction of Pol ν with proteins involved in the Fanconi-anemia pathway that is involved in repair of cross-linked DNA (7). Consistent with possible roles in cross-link repair, Pol ν can bypass a psoralen DNA interstrand crosslink (6) and DNA interstrand cross-links with linkages through the DNA major groove (8). It can also bypass a subclass of thymine glycol lesions (9). Pol ν is a processive enzyme and performs efficient strand-displacement synthesis, and these properties may be important for its biological function.
Several Family A DNA polymerases, including E. coli Pol I (10), T7 DNA polymerase (11, 12) and Pol γ (13-15), have intrinsic 3′ to 5′ exonuclease activity that can edit the occasional mismatches they create, thereby enhancing the fidelity with which they synthesize DNA (16, 17). Other Family A DNA polymerases, such as Taq DNA polymerase (18, 19) and Bst DNA polymerase (20), lack intrinsic 3′ to 5′ exonuclease activity and therefore cannot proofread their mistakes. Nonetheless, they are among the most accurate of the naturally exonuclease-deficient polymerases (18-23). Pol θ and Pol ν comprise yet a third subtype of Family A polymerase. They too lack 3′ to 5′ exonuclease activity, but they have low nucleotide selectivity. For example, compared to the exonuclease-deficient form of the Klenow fragment of E. coli Pol I, human Pol θ has much lower selectivity and forms a variety of different single base mismatches at high rates (24). Human Pol ν also has low nucleotide selectivity, but its specificity is much more biased, specifically for misinsertion (9) and stable misincorporation of dTTP opposite template G (25). Moreover, when stable misincorporation of dTTP was monitored opposite a large number of template guanines, the site-to-site variation in error rate for the G-dTTP mismatch was more than 30-fold (25).
Previous studies of error specificity have provided clues to the biological functions of DNA polymerases. For example, studies of the error specificity of human Pol η (26) eventually led to the conclusion that it is responsible for base substitutions at A-T base pairs during somatic hypermutation of immunoglobulin genes (27-29). Also, studies of the error specificity of yeast replicases (30-32) have led to a better understanding of leading and lagging strand DNA replication in vivo. It is our hope that the unusual error specificity of human Pol ν will likewise result in a better understanding of its biological function(s). In the present study we have focused particularly on the sequence dependence of misincorporation, a well documented phenomenon which remains poorly understood. Pol ν’s preference for forming G-dTTP mispairs in a highly sequence-dependent manner allowed us to investigate the kinetic parameters for correct and incorrect dNTP insertion and mismatch extension at hotspots and coldspots. Examining the kinetic parameters for correct and incorrect dNTP insertion by Pol ν in four different sequence contexts, we have found that the the error rate for stable misincorporation of dTTP opposite template G correlates strongly with the kinetic parameters for correct dNTP insertion, as has been previously noted for a variety of DNA polymerases (33, 34).
Materials for the M13mp2 fidelity assay were from sources described previously (35). Ultrapure dNTPs were purchased from Amersham Biosciences (GE Healthcare). Pol ν Fragment 1 (E175-G863, 77.1 kDa, Pol ν-77) was expressed and purified as previously described (9, 36). Exonuclease-deficient (D424A) Klenow fragment was purified according to our published procedures (37, 38). DNA oligonucleotides were synthesized by the Keck Biotechnology Resource Laboratory at Yale Medical School and were purified by denaturing gel electrophoresis as described previously (39).
The fidelity assay scores errors generated in the lacZ α-complementation gene in M13mp2 during synthesis to fill a 407-nucleotide gap. Reaction mixtures (25 μl) to fill the gap contained 0.2 nM M13mp2 gapped DNA substrate, 20 mM Tris-HCl (pH 7.5 or 8.8), 8 mM magnesium acetate, 2 mM dithiothreitol, 80 μg of bovine serum albumin, 4% (v/v) glycerol and 1 mM each of dATP, dGTP, dCTP and dTTP. Reactions were initiated by adding Pol ν-77 (200 nM), incubated at 37°C for 30 min, and terminated by adding EDTA to 20 mM. Half of the reaction mixture was mixed with SDS buffer (20 mM Tris-HCl, pH 8.0, 5 mM EDTA, 5% SDS, 0.5% bromophenol blue and 20% glycerol) and analyzed by agarose gel electrophoresis. The results indicated that the gap was completely filled for reactions performed at both pH values (data not shown, but for a typical result, see (25)). Aliquots of the remaining DNA products were then used to determine lacZ mutant frequencies for the purpose of obtaining error rates, as described (25). Correct synthesis produces M13mp2 DNA that yields dark blue M13 plaques when introduced into an E. coli α-complementation strain and plated on indicator plates. Polymerase errors are scored as light blue or colorless M13 plaques. DNA from independent mutant clones was sequenced to define the sequence changes in lacZ, and this information was used to calculate error rates as described in (35). Error rates for individual types of mutation were calculated according to Equation: ER=((Ni/N)×MF)/(D×0.6) where Ni is the number of mutations of a particular type, N is the total number of mutants analyzed, MF is frequency of lacZ mutants, D is the number of detectable sites for the particular type of mutation, and 0.6 is the probability of expressing a mutant lacZ allele in E. coli (35).
Table 1 lists the sequences of the duplex DNA substrates used in this study. Primer strands, 5′-labeled with 32P, were annealed to a 1.5-fold molar excess of the appropriate template strand. Primer extension reactions were carried out at room temperature (20-22 °C) using a rapid-quench-flow instrument (KinTek Corp., Model RQF-3) for fast reactions and manual quenching when the reaction was sufficiently slow. In either case, the reaction was initiated by mixing equal volumes of a polymerase-DNA mix and a dNTP solution, both in Pol ν reaction buffer, 20 mM Tris-HCl, pH 7.5, 8 mM Mg(OAc)2, 2 mM DTT, 4 % (v/v) glycerol, and 80 μg/ml bovine serum albumin. The final reaction mixture contained 5 nM primer-template duplex, 7 nM Pol ν-77 and varying dNTP concentrations. The preparation of Pol ν-77 used in the majority of the kinetics measurements was ~ 23% active, and therefore the concentrations listed above gave burst kinetics (40, 41) with a burst amplitude of ~ 0.3. Reactions were quenched at appropriate time intervals using excess EDTA and were fractionated on denaturing polyacrylamide-urea gels and quantitated on a Fuji FLA 5100 scanner. The fraction of labeled DNA converted to product was plotted as a function of reaction time and fitted to the burst equation: A(1 – exp(−kobst)) + ksst .
Burst rate constants (kobs, corresponding to the rate constant for the first turnover) were determined at a series of dNTP concentrations and kpol and Kd were determined from a plot of kobs vs. dNTP concentration fitted to the hyperbolic equation: kobs = kpol[dNTP]/(Kd + [dNTP]). In a few instances, where lack of sufficient Pol ν precluded use of the rapid-quench instrument, rate measurements were carried out at very low dNTP concentrations using manual quenching. At low dNTP concentrations, the hyperbolic equation reduces to kobs = kpol[dNTP]/Kd, allowing the ratio kpol/Kd to be obtained directly from the slope of the plot of kobs against dNTP concentration.
Pol ν-77, at a series of concentrations from 0.05 to 100 nM, was incubated with 32P-labeled duplex DNA in Pol ν reaction buffer, plus an additional 10% glycerol, for 10 min at 22 °C. The samples were loaded onto an 8% polyacrylamide gel, poured and run in 50 mM Tris borate, 2 mM MgCl2 and 0.1 mM EDTA. The samples were loaded with the gel running at 300 V, and then electrophoresis was continued at 150 V for 3 to 4 h at 4 °C. The gel was dried down and radioactivity was quantitated on a Fuji FLA 5100 scanner. Two types of information were obtained from plots of the fraction of bound DNA as a function of the concentration of Pol ν-77, as illustrated in Supplementary Figure S1. With DNA at 1 nM, which is > KD, the fraction of DNA bound to Pol ν-77 is given by f[Pol]/[DNA], so that the slope of the graph allowed us to determine f, the fraction of the added Pol ν-77 that is active in binding DNA. At a DNA concentration below KD, the fraction of bound DNA approximates to the hyperbolic relationship [Pol]/([Pol]+ KD) allowing calculation of KD.
Our earlier study highlighted the tendency of full-length human Pol ν to generate G to A substitutions via misincorporation of dTTP opposite template guanine (25). We began this study by confirming this error signature for a slightly abbreviated derivative of Pol ν (the eventual goal being structural studies), designated Pol ν-77 (36). We used the M13mp2 forward mutation assay, which detects a wide range of sequence changes in the lacZ gene in a variety of sequence contexts (35). The DNA products of gap-filling synthesis at pH 8.8 yielded an average lacZ mutant frequency of 15% (Table 2), comparable to the 18% value reported previously for the full-length protein (25, 36). In addition to similar error rates, the truncated and full-length Pol ν proteins have similar processivity (compare Figure 2 in reference (25) and Figure 4 in (36)). We also examined the fidelity at pH 7.5, the pH used in our kinetic studies below. Pol ν is more accurate at pH 7.5, as revealed by a 4-fold lower average mutant frequency of 3.5% (Table 2). Higher accuracy at lower pH is also a characteristic of other Family A DNA polymerases, including Taq polymerase (19) and the large Klenow fragment of E. coli pol I (42). When 204 independent lacZ mutants were sequenced from reactions performed at pH 7.5, a variety of single base changes were observed (Table 2). These were distributed throughout the lacZ target sequence (Figure 1), in a pattern similar to that seen earlier with full-length human Pol ν at pH 8.8 (25). Substitutions predominated, and the vast majority were G to A changes, yielding an average error rate of 22 × 10−4 for stable misincorporation of dTTP opposite the 22 template guanines where this event results in a change in plaque color (35). The second most common error was the mismatch involving the same two bases but differing with respect to the templating versus incoming base, i.e., the T-dGTP mismatch (average error rate of 2.0 × 10−4, Table 2). We have previously compared Pol ν specific error rates to the error rates of other DNA polymerases (see Table 2 in (25)).
As before (25), G to A mutation rates varied by sequence context. The highest mutation rate for a phenotypically detectable site was observed at template guanine number +165 (designated the G(165) hotspot), where 38 substitutions were observed among 204 sequenced lacZ mutants. This corresponds to an error rate of 1.1%; because of this high error rate, the G(165) hotspot sequence was chosen for our kinetic analysis. Additionally, we measured dNTP insertion kinetics at the neighboring guanine, G(164), which is also a hotspot. Although substitutions at G(164) are phenotypically silent, two G to A substitutions were observed as silent hitchhikers among 204 sequenced lacZ mutants, corresponding to an error rate of 1.6%, similar to the error rate at G(165) 2. For comparison, we analyzed Pol ν kinetics at a mutational coldspot, G(−66), selected because only three G to A changes were detected at this phenotypically-detectable position, corresponding to an error rate of 0.09%. dNTP misinsertion. The design of the kinetics experiments was influenced by the properties of the Pol ν-77 preparations. Purification of Pol ν-77 is challenging and the purified protein (36) was typically not fully active, as shown by active-site titration using the measurement of burst amplitudes in primer extension reactions (Figure 2A, B). An alternative method using DNA binding to assess the concentration of active Pol ν-77 is illustrated in Supplementary Figure S1. Measurements by the two methods were in good agreement. The Pol ν-77 preparation used in the majority of the kinetics experiments contained 23% active polymerase. As described in Methods, we measured reaction rates under burst conditions, with the concentration of active Pol ν-77 about 3-fold less than the concentration of the DNA substrate. This had the advantage of conserving our stocks of enzyme and also avoided the problem that high concentrations of some Pol ν-77 preparations inhibited the reaction.
We measured misinsertion by Pol ν-77 on several different DNA substrates (Table 1). The 13/19-mer duplex is a DNA substrate that we have used routinely in recent kinetic studies of Klenow fragment (38, 43). We compared misinsertions at a template T and a template G in this sequence. The results with Klenow fragment (Table 3, line 2 of each section) were in good agreement with data previously obtained on a different DNA sequence (reproduced in Table 3, line 1 of each section) (44). As described above, three other DNA substrates were derived from the lacZ target sequence, in order to study dTTP misinsertion opposite template guanines in three different sequence contexts: the G(164) and G(165) hotspots and the G(−66) coldspot. In each case, we measured the rates of correct and mismatched dNTP incorporation at a series of dNTP concentrations and calculated kpol and KD(dNTP). Figure 2C and D shows an example of the data obtained for T misinsertion opposite G(164). The misinsertion data obtained for Pol ν-77, with selected Klenow fragment misinsertions for comparison, are shown in Table 3. The final column of Table 3 compares the efficiencies of correct and incorrect dNTP addition on each substrate, giving the selectivity in favor of the correct complementary dNTP.
The misinsertion data for the 13/19-mer substrate fit well with the overall mutational data for both Klenow fragment and Pol ν. For example, Klenow fragment has 3-fold lower discrimination against T-dGTP errors (1.3 × 10−4, bottom of Table 3) compared to G-dTTP errors (3.4 × 10−4, top of Table 3), consistent with its higher G-dTTP error rate in the M13mp2 forward mutation assay (10). For both T-dGTP and G-dTTP errors, the individual kinetic constants for Klenow fragment show that the selectivity in favor of the correct base pair is made up of ~ 10-fold discrimination in dNTP binding and ~ 103-fold discrimination in the rate of misincorporation, with the kinetic differences between correctly paired and mispaired substrates slightly larger in the case of G-dTTP errors. For Pol ν, the most frequent error in the M13mp2 assay is G-dTTP, whose rate is 11-fold higher than that of T-dGTP (Table 2). Again, this is reflected in the kinetic data, where we observe a 13-fold difference in discrimination against the two mismatches (selectivity values of 38 for G-dTTP versus 510 for T-dGTP, third lines in top and bottom sections of Table 3).
What accounts for the low fidelity of Pol ν for G-dTTP errors? Both Klenow fragment and Pol ν have similar kinetic constants for misinsertion of dTTP opposite template G (Table 3, top right). However, correct dCTP insertion by Pol ν opposite template G is ~ 103-fold less efficient than is correct dCTP insertion catalyzed by Klenow fragment (Table 3, top left). The difference in the efficiency of correct incorporation therefore translates directly to ~ 103-fold lower selectivity of Pol ν against G-dTTP mismatches, compared with Klenow fragment. By contrast, at template T, dGTP misinsertion by Pol ν is ~ 260-fold less efficient than that of Klenow fragment (Table 3, lower right, compare 950 vs. 3.7). This difference moderates the effect of the large (6,300-fold) difference in the efficiency of correct (T-dATP) incorporation (Table 3, lower left, compare 1.2 × 107 to 1.9 × 103), resulting in ~ 24-fold lower selectivity of Pol ν against T-dGTP errors.
The kinetic data for G-dTTP misinsertion by Pol ν on the lacZ sequences also parallel the mutational data. Error rates in the M13mp2 fidelity assay are high at template G(165) and G(164) and lower at template G(−66). Likewise, selectivity against dTTP misinsertion (Table 3) is at least 20-fold higher opposite template G(−66) (selectivity factor of 520) than opposite templates G(165) and G(164) (selectivity factors of 22 and 25, respectively). The different selectivity values derive from some very interesting sequence-dependent differences in individual kinetic constants. First, the binding constants for both correct dCTP and incorrect dTTP are 15- to 20-fold weaker at the G(−66) position as compared to the G(164) and G(165) positions. At G(−66), where the error rate is lowest, incorporation of correct dCTP is ≈ 16-fold faster than misinsertion of dTTP (Table 3, kpol of 0.28 and 0.018, respectively). In contrast, at the G(164) and G(165) hotspots, the kpol values for correct dCTP incorporation are slower than at G(−66), and similar to those for misinsertion of dTTP at these same hotspot sequences. Therefore, the lower fidelity of Pol ν for G-dTTP mismatches in one sequence context as compared to another can be attributed largely to differences in the kinetics of correct G-dCTP incorporation, particularly the rate of dCTP addition, since the ratio of dCTP and dTTP binding constants is very similar at all three sequences. The kinetic constants for the template G in the 13/19-mer DNA substrate resemble to some extent those of the G(−66) sequence, with weak binding constants and a fast rate of G-dCTP incorporation.
To examine whether the mispair specificity of Pol ν operates at the level of mismatch extension, we measured the rates of extension of DNA substrates having a terminal (primer)C-G(template) basepair or a T-G mismatch (Table 4). On the 14/19-mer extension substrate, both Klenow fragment and Pol ν showed a similar discrimination between C-G and T-G termini, suggesting that mismatch extension does not account for the difference in mutational specificity of these two DNA polymerases. However, the discrimination by Pol ν against extension of the T-G mismatch was modulated by sequence context, with ~ 8-fold less discrimination at G(165) as compared to G(−66). The difference was primarily due to a faster rate of mispair extension at (G165); the rates of extension of a C-G terminus were similar. Gel mobility shift assays (Figure 3) indicated that binding of Pol ν to the T-G mismatch-extension substrates corresponding to the G(165) and G(−66) template-primers was indistinguishable (KD = 0.1 nM, in terms of fully active Pol ν). Therefore the behavior of Pol ν-77 on the two substrates does not result from a difference in binding affinity. Likewise, there was no difference in the DNA binding behavior of Pol ν when comparing the corresponding insertion substrates and C-G extension substrates at the same two sequences (data not shown).
Relative to many DNA polymerases that commit T-dGTP errors more frequently than any other base substitution (e.g., Klenow fragment (10), E. coli Pol III (45), Pol θ (24), Pol γ (14, 46), Pol η (26), Pol τ (47), Pol α (48), Pol ε (49), and Pol δ (50)), Pol ν is unusual in preferentially generating a mismatch with the same base composition but the opposite symmetry, i.e., dTTP opposite template G. Not only does Pol ν make this error at much higher rates than for the 11 other single base-base mismatches, it does so in a highly sequence context-dependent manner (25). The kinetic studies described here examine the binding and rate constants responsible for the unusual properties of Pol ν, in comparison with Klenow fragment, a prototypical A-family polymerase. We first consider results using a common template-primer. We then consider explanations for the site-to-site variations for Pol ν.
A major factor in the low fidelity of Pol ν for both errors is its > 103-fold lower efficiency of correct dNTP incorporation, compared to Klenow fragment, illustrating the general observation that differences in polymerase fidelity are frequently dominated by differences in the kinetics of correct base insertion (33, 34). The differences in selectivity of Pol ν and Klenow fragment against particular errors can be understood in terms of the kinetics of the relevant misinsertion reactions. On the 13/19-mer substrate, respresenting a generic DNA sequence not chosen for its hotspot or coldspot behavior, T-dGTP misinsertion by Pol ν is ~ 260-fold less efficient than that of Klenow fragment and this counteracts the 6,300-fold difference in the efficiency of correct (T-dATP) incorporation, resulting in ~ 24-fold lower selectivity of Pol ν against T-dGTP errors. By contrast, the kinetic parameters for G-dTTP misinsertion in the same sequence context are almost identical for the two enzymes, so that the entire 1,000-fold difference in correct (G-dCTP) incorporation is manifested in a 1,000-fold lower selectivity of Pol ν against G-dTTP errors, consistent with the abundance of these errors in synthesis by Pol ν. The similar kinetic parameters for G-dTTP misinsertion by Pol ν and Klenow fragment imply similar ground-state and transition-state energetics for this particular misinsertion in the Klenow fragment and Pol ν active sites, even though the energetics (and the corresponding kinetic parameters) of correct additions are substantially different.
The kinetic comparisons of the lacZ hotspot and coldspot sequences reveal that the efficiency of G-dTTP misinsertions by Pol ν varies over a range of nearly 100-fold. In contrast, the efficiency (kpol/Kd) of correct G-dCTP insertions changes very little (~ 3-fold), though the individual kpol and Kd values vary in interesting ways. Nucleotide binding is stabilized by more than 10-fold in the two hottest sequence contexts compared with the cold sequence. However, this does not contribute quantitatively to fidelity because the changes are of similar magnitude for both G-dCTP and G-dTTP insertions, indicating that ground-state binding of the ternary complex provides the same degree of selectivity against the G-dTTP nascent mispair at both hotspot and coldspot sequences. Instead, the important contributor to hotspot behavior is the rate constant, kpol. On the hotspot DNA, kpol is almost the same for correct and mismatched dNTPs, implying that the hotspot transition state fails to provide additional discrimination beyond what was already present in the ground state. In contrast, the coldspot substrate has ~ 15-fold faster kpol for correct G-dCTP incorporation compared with G-dTTP misinsertion. Thus, the tighter substrate binding at the hotspot template position appears to correlate with a less-optimal transition state for G-dCTP addition (slower reaction rate than at the coldspot). Conversely, correct G-dCTP incorporation on the coldspot template appears to use a weaker dNTP binding site and a more favorable transition state geometry, thus maintaining higher selectivity against the G-dTTP error. A similar correlation between tight dNTP binding and slow incorporation was observed in an earlier study of a Klenow fragment mutator mutant, and attributed to altered active-site geometry (44).
Compared to Klenow fragment, Pol ν exhibits stronger DNA binding, less efficient incorporation of correct base pairs, an active site favorable to G-dTTP nascent mispairs, and a strong influence of sequence context on the G-dTTP nascent mispair. Structural studies of Pol ν, which remain an elusive goal, will be essential for understanding differences between Klenow fragment and Pol ν and how they contribute to the kinetic properties reported here. Absent such information, sequence alignments of Pol ν with bacterial Family A DNA polymerases (51) and structural information on other Family A polymerases (23, 52-54) suggest regions of Pol ν that may contribute to its unique properties. One feature that may be relevant to the DNA binding and processivity of Pol ν is a 6-amino acid insertion predicted to be at the tip of the thumb subdomain, adjacent to conserved sequences already implicated in DNA binding in Klenow fragment (55). A strong interaction with DNA may be necessary for Pol ν’s proposed role in lesion bypass, especially when lesions compromise DNA interactions around the active site. Sequence alignments reveal that Pol ν also differs from more accurate Family A members in the O-helix (Figure 4), which makes several important interactions with the nascent basepair (Figure 4A) (9, 25). One example is Lys679 of Pol ν, which aligns with Ala759 of Klenow fragment and Thr664 of Klentaq (Figure 4B). Changing Lys679 of Pol ν to the alanine or threonine present in the more accurate polymerases did indeed increase the fidelity of Pol ν (51). Nonetheless, these changes did not alter the preferential formation of G-dTTP mismatches or the sequence dependence of Pol ν, indicating that such preferences depend on other enzyme-substrate interactions.
In a more global attempt to identify differences in the sequence of the O-helix of Pol ν that may affect specificity, we replaced residues in the O-helix of Klentaq with the corresponding sequence (residues 675-687: EQTKKVVYAVVYG) from Pol ν. The resulting chimera was active in gap filling synthesis, but the accuracy of this reaction was much higher than that of Pol ν and similar to that of the parental Klentaq enzyme (data not shown). Further experiments (our unpublished data) targeted two additional regions where Pol ν deviates from the A-family consensus (Figure 4). In the bacterial A-family polymerases, the N-terminus of the N-helix has the highly conserved motif D(I/V)H, where the histidine side chain interacts with the dNTP β-phosphate. The corresponding sequence is DVF in the Pol ν group. Mutation of Klenow fragment towards Pol ν consensus (H734F) caused a substantial drop in polymerase activity; however, the relative rates of T-dGTP versus G-dTTP misinsertions, and the selectivity against the G-dTTP mispair, were the same as in wild-type Klenow fragment. A similar result was obtained by changing the N-terminus of the Klenow fragment Q-helix (motif 6 of Patel et al. in (23)) towards Pol ν consensus (replacing residues 841 to 849 of Klenow fragment, RAAINAPMQ, with RQAINFVVQ). Although these experiments did not pinpoint a sequence motif responsible for the characteristic Pol ν error signature, they do suggest that the Klenow fragment and Klentaq active site sequence motifs are optimized for efficiency, so that mutations away from consensus tend to be associated with decreased polymerase activity. Pol ν, on the other hand, may have been fine-tuned by evolution for some specialized purpose where maximum enzyme activity is not a priority. Similarly, the unusual error specificity of Pol ν and its particular DNA sequence specificities may be a consequence of the relaxed active-site specificity associated with its primary function within the cell.
We thank Katarzyna Bebenek and Nigel Grindley for helpful comments on the manuscript, Komli Kofi Atsina and Ruoting (Valerie) Gong for Klenow fragment kinetic data quoted in tables, and Christina Grindley for making the Pol ν-like Klenow fragment mutants.
†This work was supported by Project Z01 ES065070 (to T.A.K.) from the Division of Intramural Research of the NIH, NIEHS, and by NIH grant GM28550 (to C.M.J.).
2It is also possible that the hotspot for single deletions observed at G164-165 (Figure 1) results from a misinsertion-misalignment event initiated by a G-dTTP misinsertion at G(164), making the G-dTTP error rate at G(164) higher than that calculated from the observed base-substitution mutations.
Supporting Information Available
Additional gel mobility shift data for Pol ν is presented in Figure S1. This material is available free of charge via the Internet at http://pubs.acs.org.