Cells possess many DNA polymerases (DNAPs); e.g., human cells, yeast (
S. cerevisiae) and
E. coli have at least fifteen, eight and five, respectively [
1–
3]. Some DNAPs are present to do translesion synthesis (TLS) past DNA lesions/adducts that result from radiation and chemical insults [
1–
9]. Most lesion-bypass DNAPs are in the Y-Family [
1–
9], where humans have three members (hDNAPs η, ι and κ), yeast has one (scDNAP η) and
E. coli has two (ecDNAPs IV and V).
Y-Family DNAPs have a conserved ~350aa core, which includes the polymerase active site (representative
Refs.: [10–
18]). As with all DNA polymerases, Y-Family members resemble a right-hand with thumb, palm and finger domains, although their “stubby” fingers and thumb result in more solvent accessible surface around the template/dNTP-binding pocket [
7]—presumably to accommodate the bypass of bulky and/or deforming DNA adducts/lesions, which protrude into these open spaces. Y-Family DNAPs grip DNA with an additional domain [
7,
10,
11,
15], usually called the “little finger”. Steps in the mechanism of Y-Family DNAPs have been proposed for both protein structural changes based on a series of X-ray structures [
19,
20] and for chemical catalysis based on theoretical studies [
21].
Our work has focused on benzo[a]pyrene (B[a]P), which is a well-studied DNA damaging agent that is a potent mutagen/carcinogen and an example of a polycyclic aromatic hydrocarbon (PAH), a class of ubiquitous environmental substances produced by incomplete combustion [
22,
23]. PAHs in general and B[a]P in particular induce the kinds of mutations thought to be relevant to carcinogenesis and may be important in human cancer (representative
Ref.: [24]). B[a]P mutational spectra were established with the major metabolite that reacts with DNA, namely (+)-
anti-B[a]PDE, in
E. coli [
25], yeast [
26,
27] and mammalian (CHO) cells ([
28] and references therein). Mutagenesis has also been studied with [+ta]-B[a]P-N
2-dG (+BP, ), the major adduct of (+)-
anti-B[a]PDE, and G → T mutations predominate in most cases ([
29], and references therein).
DNAPs IV and V of
E. coli are both involved in translesion synthesis (TLS) with B[a]P-N
2-dG adducts, although evidence suggests they play very different roles. In studies with purified proteins, DNAP IV inserted dCTP (>99%) opposite both +BP and [-ta]-B[a]P-N
2-dG (−BP) in a 5′-CGA sequence, while DNAP V inserted dATP (>99%) [
30], and this tendency is evident in
E. coli where DNAP V inserts dATP opposite +BP in the G → T pathway [
31], while in the non-mutagenic pathway DNAP IV does dCTP insertion opposite +BP [
32–
36], −BP [
34] and other N
2-dG adducts [
35,
36]. Regarding the non-mutagenic pathway, only DNAP IV is required for efficient TLS with −BP [
34], while DNAP V is required in addition to DNAP IV with +BP [
32–
36]. Why are two DNAPs required for non-mutagenic TLS with +BP: certain lesions need one DNAP for insertion and a second for extension [
37,
38]. Thus, if DNAP IV does dCTP insertion [
30,
32–
36], then DNAP V must do extension, which is sensible given kinetic findings with purified proteins, where DNAP V can be up to ~1500 times better than DNAP IV at the step directly following adduct-G:C formation (i.e., extension) in the case of +BP compared to −BP (discussed in greater detail in
Ref. [34]). Finally, random mutagenesis studies with [+
anti]-B[a]PDE suggest that most G → T mutations with B[a]P-adducts require SOS-inducible Y-Family DNAPs, although a minor non-SOS-inducible G → T pathway does exist (discussed in
Ref. [34]), which has been studied with +BP in a 5′-GGA sequence [
32].
The study of
E. coli's Y-Family DNAPs may provide insights about Y-Family DNAPs in general. Human DNAP κ was originally discovered because its sequence closely resembles
E. coli DNAP IV [
39–
41], and dNTP insertion opposite a variety of adducts/lesions, including +BP, is remarkably similar for the DNAP IV/κ pair (), suggesting they are functional orthologs (discussed in
Ref. [42]). This notion was substantiated when the identical mutation in a conserved residue (the “steric gate”, which excludes rNTPs) in the active site of DNAP IV and DNAP κ had a similar effect on lesion bypass vs. normal replication both
in vitro and in cells [
35]. DNAPs IV and κ have been shown to accurately bypass a variety of N
2-dG-adducts [
32–
36], including those formed via various cellular trioses [
36], which may be the main cellular rationale for the genesis of the IV/κ-class.
E. coli DNAP V and human DNAP η are also functional orthologs, based on their similarity of dNTP insertion opposite a variety of adducts/lesions ( [
42]), and a strong case has been made that the main cellular rationale for the V/η-class is TLS of UV-induced CPDs (discussed in
Ref. [43]).
| Table 1Dominant dNTP insertions opposite various DNA adducts/lesions by E. coli DNAPs IV and V and human DNAPs κ and η.a. |
There must be structural reasons why the insertion preference opposite adducts/lesions is different for the DNAP IV/κ pair vs. the DNAP V/η pair ( [
42]). However, no X-ray structures exist for UmuC (the polymerase subunit of DNAP V), DNAP IV or hDNAP η, which prompted us to build models taking a homology modeling approach [
42]. Analysis of X-ray structures, modeled structures and sequence alignment suggests that Y-Family DNAPs lend themselves to accurate homology modeling [
42,
43]. An X-ray structure does exist for hDNAP κ with DNA [
18].
Herein a molecular modeling study is presented that suggests a hypothesis for why the IV/κ-class correctly inserts dCTP opposite B[a]P-N
2-dG adducts. To form an adduct-dG:dCTP base pair, the B[a]P moiety must be in the developing minor groove, since the adduction site (N
2-dG) is in the minor groove in a Watson–Crick base pair. On the minor groove side, Y-Family DNAPs have an opening (or gap) next to the active site between the fingers and little finger domains. This opening looks like an elliptical hole of varying size in Dpo4 [
11–
14], Dbh [
10], hDNAP ι [
17] and in our models of DNAP IV and UmuC(V) (see below), while it looks like a slot in hDNAP κ [
18]. The character of this opening can be analyzed based on analogy to a “chimney,” where a cluster of nearby amino acids can be thought of as a “flue” that either plugs the chimney leaving a small opening, or does not plug the chimney leaving a large opening. Next to the flue is a single amino acid, which can be thought of as a “flue-handle” that controls whether the flue is open or closed. Three regions of the protein make up the chimney as shown in for our model of DNAP IV: an upper lip (turquoise), a lower lip (dark blue) and a left lip (blue) [
43]. It is not unreasonable to think that chimney opening size and shape might influence dNTP insertional mechanism given that the bulky B[a]P moiety (red, ) must fit in it.
Herein we show that DNAP IV has a large chimney opening, which allows +BP to pair with dCTP, when dCTP adopts the canonical shape observed in all other families of DNAPs. In contrast, our molecular models suggest that UmuC(V) has a small chimney opening, which forces +BP downward in the active site into a position where catalysis is less likely to be facile.
Secondarily, we reflect on the conundrum: if the small chimney opening of DNAP V enforces a structure that seems unlikely to be active, then how might DNAP V insert dATP opposite +BP? Dpo4 is the best studied Y-Family DNAP in terms of structure (representative
Refs.: [11–
14]). Though the dNTPs in the active site of many Dpo4 structures adopt the canonical “chair-like” dNTP shape (, green), which is similar to the shape of dNTPs in the active sites of other families of DNAPs (, insert), a second non-canonical “goat-tail-like” shape has also been observed (, yellow), as noted by others [
13]. (For simplicity we refer to the “chair-like” shape as S1-dNTP and the “goat-tail-like” shape as S2-dNTP.) Based on X-ray structures, modeled structures, sequence alignment and experimental results, a logical framework emerges from the hypothesis that the
syn-adenine orientation of S2-dNTP pairs with adduct-dG, since S2-dNTP sits lower down in the active site and permits +BP to fit under the small chimney of UmuC(V). This hypothesis depends on the non-canonical S2-dNTP shape being active, which is considered in Section 4. The goal of the modeling work herein was to probe the insertional differences between the V/η-class and IV/κ-class of Y-family DNAPs and develop hypotheses, which we intend to test experimentally.