Overall structure of the single-stranded -10 element/σ2 complex
The oligonucleotide sequence used for cocrystallization () is based on the conserved motif discovered in ssDNA aptamers selected for binding to free
Taq σ
A in vitro (
Feklistov et al., 2006). The ssDNA oligonucleotides bound
Taq σ
A2-3 () with a dissociation constant (K
d) more than three orders of magnitude lower than a control, anticonsensus sequence (
Figure S1A). The most detailed model (with the
T-12A-11C-10A-9A-8T-7 –10 element; ) was refined at 2.1 Å-resolution to an
R/Rfree of 0.194/0.237 (
Table S1,
Figure S1B). The structure with the
T-12A-11T-10A-9A-8T-7 –10 element was essentially identical (root-mean-square deviation of 0.165 Å over all atoms). Although biochemical analysis of the crystal contents established that σ
A3 was present in the crystals, electron density for the domain was absent and it was presumed disordered.
The ssDNA is draped across a highly conserved, positively-charged surface of σ
A2 (,
S1C), with a 90° turn in the DNA backbone between the -11 and –10 positions (). Extensive interactions occur between the protein and the DNA backbone of every nucleotide from –12 to –6 (). Base-specific interactions occur primarily with
T-12,
A-11,
A-8, and
T-7, especially
A-11 and
T-7, which are notably flipped out of the ssDNA base-stack and entirely buried in protein pockets ().
The ssDNA/protein complex results in the burial of 1,096 Å
2 total molecular surface area. On the DNA, the interactions occur almost entirely within the –10 element (92% of the buried surface area). The DNA interacts with residues from all of the σ conserved regions (1.2, 2.1, 2.2, 2.3, and 2.4; ; (
Lonetto et al., 1992), with the bulk of the interactions occurring within σ regions 2.3 (73% of the buried surface area) and 2.1 (23%).
Based on previous studies, we expected to observe interactions between the downstream discriminator element (G
-6G
-5G
-4) and residues of σ
A region 1.2 (
Feklistov et al., 2006;
Haugen et al., 2006;
Haugen et al., 2008). In fact, the bases of the discriminator element do not interact with the protein, but peel away and participate in crystal contacts with GGG motifs from symmetry-related complexes, forming an unexpected G-quadruplex structure that is unlikely to be relevant to σ factor function but plays a critical role in crystal packing (
Figure S1D). Nearby, the σ
A structure features a shallow, positively charged channel that likely accommodates the GGG motif in the physiological complex (
Figure S1C; (
Haugen et al., 2008).
Recognition of T-12 consistent with base-paired -12 position in RPo
T is strongly favored at –12, the upstream position of the –10 element (). In general, promoter mutations that substitute
T-12 with another base weaken promoter activity/binding, and mutations that substitute another base with T strengthen promoter activity/binding (
Moyle et al., 1991). Optimal binding of RNAP holoenzyme to fork-junction promoter probes, which have a mostly single-stranded –10 element, required the base-pair at –12 (
Guo and Gralla, 1998), suggesting that the –12 position normally remains base-paired even in RP
o (). Nevertheless,
T-12 can also be recognized in the context of ssDNA (
Feklistov et al., 2006;
Roberts and Roberts, 1996;
Sevostyanova et al., 2007).
The three 5'-nucleotides of the ssDNA (5’T-14G-13T-12) hang off the edge of the protein structure, with only T-12 making significant interactions with the protein (, ). The three nucleotides maintain a structure similar to one strand of a B-form double-helix, except that the base of G-13 is in the syn conformation. Modeling in a B-form double-helix reveals that there is ample space for the paired t-strand (-14 to –12), but continuation of the double-helix downstream to –11 is blocked by the invariant W-dyad (W256/W257) and other elements of the protein (, ).
T-12 is propped against the W-dyad, with W256 making extensive van der Waals interactions primarily with the T-12 deoxyribose moiety (as well as with the base), and W257 making an ‘edge-on’ van der Waals interaction with the T-12 pyrimidine ring (). The T-12 5'-phosphate [-12(P)] is held by polar interactions with R246. These interactions would likely occur regardless of the identity of the –12 base. Explaining the preference for T at this position, R237, reaches over from the σ region 2.2 α-helix and forms a hydrogen-bond (H-bond) with the O4 atom of T-12 [T-12(O4)], and the aliphatic side-chain of K241 makes van der Waals contact with the T-12(C5-methyl) ().
The primary role of σ
2 in -10 element recognition was first uncovered in genetic screens for σ mutants that suppressed single-base substitutions in the -10 element. Specifically, it was shown that substitutions at the position corresponding to
E. coli σ
70 Q437 (
Taq σ
A Q260), which is absolutely conserved among Group I σ's (
Campbell et al., 2002;
Gruber and Bryant, 1997); ), to H or R allows efficient transcription from mutant promoters having a T to C substitution at position -12 (
Kenney et al., 1989;
Waldburger et al., 1990).
In the
Taq σ
A2/-10 element DNA structure, electron density maps show clear evidence for two, roughly equally populated conformations of Q260. In either conformation, however, the shortest distance between any atom of Q260 and
T-12 is 6.3 Å – too far for the genetic results to be explained by a direct interaction between Q260 and the
T-12 base (). However, modeling of the t-strand A base-paired to
T-12 places the major-groove edge of this base within H-bonding distance of Q260 (). Surveys of protein/DNA interactions (
Hoffman et al., 2004;
Luscombe et al., 2001) point to a strong preference for Q to interact with the major-groove edge of A, while H and R strongly prefer G (), which corroborates our modeling and explains previous genetic results.
Q260 and other σ region 2.4 residues implicated in -10 element recognition lie on a long α-helix that is roughly perpendicular to the trajectory of the promoter DNA double-helix (
Murakami et al., 2002). Because of this, amino acid side chains of σ do not appear to be able to establish sequence-specific interactions with the double-stranded -10 element (as in RP
c) due to the depth of the major groove. Structural modeling () suggests that Q260 can recognize the major-groove edge of the -12 bp only when the -11 position and downstream are strand-separated, allowing the -12 position of the resulting upstream fork-junction to move closer to the σ region 2.4 α-helix.
A-11 is flipped out of the DNA base-stack and buried in a complementary hydrophobic pocket of σ2
The most highly conserved position of the –10 element is
A-11 (). Only a few per-cent of σ
A promoters have a base other than A at this position. Mutations from the consensus A; i) often completely inactivate promoters (
Lee et al., 2004;
Lim et al., 2001), ii) cause severe defects in RP
o stability (
Fenton and Gralla, 2001,
2003a,
b;
Guo and Gralla, 1998;
Lim et al., 2001;
Matlock and Heyduk, 2000;
Schroeder et al., 2007), and iii) cause defects in binding to nt-strand ssDNA oligonucleotides (
Roberts and Roberts, 1996);
Figure S1A).
In addition to interacting with the
T-12 backbone, the first W of the invariant W-dyad, W256, also interacts with the
A-11 backbone and occupies the space where the
A-11 ribose moiety would be if the DNA double-helix extended downstream from –12 (, , ). This necessitates a flip of the entire
A-11 nucleotide which, in turn, removes the
A-11 base from the upstream base-stack formed by T
-14G
-13T-12 (). Instead, the
A-11 base is completely buried in a hydrophobic protein pocket (, ; (
Tsujikawa et al., 2002).
The
A-11 pocket is perfectly shaped to fit an A and would poorly accommodate any other base, explaining the high conservation of
A-11 in the –10 element () and the severe effect of promoter mutations at this position. On one face of the
A-11 base (the front, or downstream face), Y253 makes a π-stack, as predicted by (
Schroeder et al., 2007). On the opposite face, R246 stacks on the base, forming a cation-π interaction (
Wintjens et al., 2000)(, , ). The position of the R246 side-chain is stabilized through a polar interaction with the -12(P) (, ). In the absence of DNA, the R246 side chain is free to swing into an open configuration, allowing the
A-11 base to slip into the pocket.
Studies using nucleotide analogs in place of
A-11 revealed a strict requirement for a purine base with no side groups at the N1 and C2 positions (
Lee et al., 2004;
Matlock and Heyduk, 2000). Furthermore, methylation of
A-11 at N3 interfered with holoenzyme binding (
Johnsrud, 1978;
Siebenlist et al., 1980). In the structure, the back wall of the
A-11 pocket forms a tight steric fit with the base that is only possible if the N1, C2, and N3 positions are unsubstituted (). F242 makes a hydrophobic contact with
A-11(C2), while the polypeptide backbone between residues 241 to 243 makes several H-bonds with the
A-11 base. The
A-11 pocket is topped by F248, which makes van der Waals contact with
A-11(N3) (). The side-chain of E243 does not contact the
A-11 base, but appears to play an important role by forming the bottom part of the pocket and by making polar interactions with R246 to help stabilize its position ().
(C/T)-10A–9A–8 interact with σ2 primarily through the DNA backbone
The path of the DNA backbone wraps around the surface of the protein with a 90° turn between the –10(P) and –9(P) (). T252 interacts with –9(P), and serves as a fulcrum of the DNA backbone turn (, ). In this way, T252 plays a critical role. Of all the highly conserved residues of σ region 2.3 (), T252 is the least tolerant to substitution (
Waldburger and Susskind, 1994). Changes at this position yield the most severe promoter melting defects
in vitro (
Schroeder et al., 2008), and the only substitution that yields functional σ
in vivo is highly conservative S (
Waldburger and Susskind, 1994). Comparison of promoter binding vs. melting activities of the
E. coli σ
70 T429A substitution (corresponding to
Taq σ
A T252A) suggests that T252 exerts its critical role at the strand separation step, after formation of RP
c (
Schroeder et al., 2008), consistent with the structure.
After A-11, the most highly conserved position of the –10 element, the next three nucleotides, T-10A-9A-8, are the least conserved (), and promoter mutations at these positions generally have less effect on promoter activity than mutations at the –12, -11, or –7 positions. In line with these observations, the –10/-9/-8 nucleotides are primarily bound through extensive interactions with the DNA sugar-phosphate backbone (, ). The three bases are stacked together and point away from the protein; only the base of A-8 makes van der Waals contact with T255, and also water-mediated contacts to the R259 side-chain and T252 main-chain ().
The –9(P) and –8(P) make extensive polar contacts with protein side-chains and main-chain, whereas the -7(P) does not interact with the protein (,
S2). This explains chemical probing results, which found that ethylation of the –9(P) or –8(P), but not other phosphates in the –10 element, interfered with promoter binding by RNAP (
Johnsrud, 1978;
Siebenlist et al., 1980).
T-7 is flipped out of the DNA base-stack and buried in a hydrophilic pocket of σ2
The downstream position of the –10 element,
T-7, is almost as highly conserved as
A-11 (). Promoter mutations at this position also generally have severe consequences for promoter activity (
Moyle et al., 1991). The base-stack of
C-10A-9A-8 is prevented from continuing in the downstream direction by T255 and R259 (). Instead, the entire
T-7 nucleotide is flipped out of the base stack (as predicted by (
Schneider, 2001) and buried in another protein pocket formed by residues from conserved regions 1.2, 2.1, and 2.3 of σ (). Unlike the
A-11 protein pocket, the
T-7 pocket is i) spacious compared with the size of the base, and ii) hydrophilic in nature. The
T-7 pocket accommodates well-ordered water molecules that participate in the recognition of the base (, ). Every potential interacting moiety of the
T-7 base is recognized by the protein.
Although the
T-7 pocket is relatively spacious compared to the pyrimidine base, purine bases cannot be accommodated in the pocket. The spatial arrangement of H-bond donors and acceptors of a pyrimidine C
-7 are not compatible with the
T-7 pocket, and a favorable hydrophobic van der Waals interaction would be lost due to the absence of the C5-methyl (
Figure S2A).
A salt bridge between R208 and the –6(P) is the final biologically relevant DNA/protein contact (). Downstream, G
-6G
-5G
-4 turn away from the protein to form the intermolecular G-quartet structure that participates in crystal packing (,
S1D).
Recognition of the -10 element sequence is coupled with strand separation
It has been established that σ
2 sequence-specifically recognizes the nt-strand of the -10 element in RP
o (
Marr and Roberts, 1997;
Roberts and Roberts, 1996;
Savinkova et al., 1988), mediated by universally conserved aromatic residues of σregion 2.3 (; (
Juang and Helmann, 1994). The role, if any, of sequence-specific recognition of the duplex -10 element in RP
c is less clear, due to the transient nature of this intermediate. Current thinking posits that the -10 element (or at least its upstream part) may be recognized sequence-specifically in dsDNA form (i.e. in RP
c) by residues of σ region 2.4, while upon strand separation and RP
o formation, residues of σ region 2.3 recognize the nt-strand bases of the -10 element (reviewed in: (
deHaseth et al., 1998;
Helmann and deHaseth, 1999;
Hook-Barnard and Hinton, 2007). In contrast to this view, our structural modeling suggests that sequence-specific interactions between σ
2 and the duplex -10 element are unlikely to form prior to strand-separation beginning at
A-11: we hypothesize that recognition of the -10 element sequence only occurs when strand separation is initiated, as the
A-11 and
T-7 bases are captured in their σ
2 pockets (–).
To test our hypothesis, we investigated the binding of dsDNA containing -10 element sequences to
E. coli RNAP holoenzyme compared with DNA lacking the -10 element [anti(-10) DNA] under conditions favoring RP
c (4°C). To monitor DNA binding, we employed a recently reported ‘RNAP beacon assay’ (
Mekler et al., 2011), which takes advantage of the sensitivity of light emission from a fluorophore attached on the σ–surface near the cluster of aromatic residues implicated in -10 element recognition. In free RNAP, the probe fluorescence is quenched due to photoinduced electron transfer from the cluster. Upon binding of -10 element DNA, the contacts between the aromatic residues and the fluorophore become disrupted, resulting in increased fluorescence signal. The assay is ideal for our purposes since it reports on specific σ
2/-10 element interactions while being ‘blind’ to non-specific protein/DNA binding elsewhere on the holoenzyme that can mask weak, specific interactions in conventional binding assays.
To focus on RNAP/-10 element interactions and avoid the contribution of other promoter elements to binding affinity, we chose a dsDNA fragment (-22 to +4) based on the
lacUV5 promoter (), for which a stable RP
c has been reported at low temperatures (
Spassky et al., 1985). We observed specific binding of the dsDNA fragment (K
d = 1.0 ± 0.4 μM at 4°C; ). A single-base substitution at the -11 position (A to G) resulted in a significant drop of affinity (K
d = 33.7 ± 14.9 μM), almost to the level of the anti(-10) sequence (K
d = 48.9 ± 21.1 μM; ). In the two extremes, the observed specific interaction could be between RNAP holoenzyme and the fully duplex DNA fragment (as postulated in RP
c) or, alternatively, the -10 element may be bound to RNAP holoenzyme with the
A-11 and
T-7 bases in their respective σ
2 pockets (). To distinguish between these two scenarios, we introduced modified bases at the -11 and -7 positions of the -10 element duplex designed to prevent binding of the bases in their σ
2 pockets but preserve the recognition surfaces of the dsDNA, and measured their effects on binding.
According to available RP
c models (
Murakami et al., 2002;
Shultzaberger et al., 2006), the (A/T)
-11 base pair of the duplex -10 element is exposed to σ
2 via its major groove (). With 2,6-diaminopurine (diAP) in place of
A-11, the major groove profile and overall geometry of the -11 base pair remain intact (; (
Cheong et al., 1988), but binding of the base in its σ
2 pocket (, ) is compromised due to steric clash of the exocyclic amine at the 2-position (
Figure S2B; (
Lee et al., 2004). The diAP
-11 incorporated into the duplex -10 element fragment caused a 14-fold decrease of binding (). We presumed that the residual binding of (diAP/T)
-11 dsDNA [compared to anti(-10)] was due to recognition of the -10-like sequence on the bottom strand, where 4 bases out of 6 match the consensus (). Neutralizing this second -10 element with a 2-aminopurine (2AP) modification in the bottom strand opposite
T-7 resulted in a loss of binding nearly to the level of the anti(-10) DNA (). The (T/2AP)
-7 modification by itself does not affect the recognition of the nt-strand -10 element (). Introduction of diAP into each of three possible positions of a -35 element-containing duplex promoter fragment (-41 to -12) had no effect on binding (
Figure S2C), ruling out possible effects of diAP on DNA helix geometry that could affect the putative -10 element dsDNA mode of binding.
Introducing modified bases into the t-strand would not be expected to affect the binding of the nt-strand A-11 in its σ2 pocket. Indeed, 5-methyl isocytosine (MeiC) or 3-nitropyrrole (3-NP) opposite A-11 alters the minor (MeiC, 3-NP) and major (3-NP) grooves of the base pair, but these modifications have no significant effect on DNA binding ().
In RP
c, the (T/A)
-7 bp is expected to face σ
2 via its minor groove (
Murakami et al., 2002;
Shultzaberger et al., 2006); ). Replacing the (T/A)
-7 bp with C/H (H, hypoxanthine) preserves the disposition of functional groups within the minor groove, but prevents binding of the -7 nt-base (,
S2A). This modification resulted in a 30-fold increase in the K
d (). Introducing 2AP or even a universal base (5-nitroindole; 5-NI) in the t-strand opposite
T-7 had no effect on dsDNA fragment binding even though these modifications significantly alter both major and minor groove profiles of the dsDNA ().
Finally, (H/C)-11 and (2-sT/A)-7 (2-sT, 2-thiothymidine) modifications address the unlikely cases where, in RPc, the (A/T)-11 bp faces σ2 from its minor groove and the (T/A)-7 bp faces σ2 from its major groove (). Again, even though these alterations preserve the respective dsDNA grooves that could be facing the σ surface, these modifications to the nt-stand A-11 and T-7 bases compromise the fit in their σ2 pockets and result in loss of binding affinity ().
In summary, we find that modifications to the A-11 or T-7 bases of the nt-strand of the -10 element expected to disrupt ssDNA binding () compromise binding of the dsDNA fragment (marked red in ). At the same time, dramatic alterations to the major and minor groove structures of the dsDNA do not significantly affect binding, as long as the A-11 or T-7 bases remain intact (marked green in ). In combination, these results can only be explained if the critical nt-strand A-11 and T-7 bases are bound by σ2 in the single-stranded state and not in the context of fully closed dsDNA. We conclude that the specific binding observed for the unmodified duplex -10 element fragment is due to recognition of the A-11 and T-7 bases in their σ2 pockets, and that within the limits of detection of our assay, sequence-specific recognition of the duplex -10 element does not occur.