Allosteric mechanisms have evolved to modulate protein function. Allostery can enable a protein to integrate and respond to multiple signals. For example, nuclear hormone receptors, such as the glucocorticoid receptor (GR), use hormones as allosteric effectors of their transcriptional regulatory activity (
1); additional inputs, such as phosphorylation, also affect GR function (
2).
Upon hormone binding, GR associates with high affinity to genomic GR binding sequences (GBSs), typically imperfect palindromic, hexameric half sites separated by 3–base pair (bp) spacers (
3). Within these 15-bp GBSs, five positions are nearly invariant, whereas the remainder can be altered with little effect on GR binding (
4) and vary substantially among all functional GBSs (
5). In contrast, certain GBSs linked to target genes are highly conserved across species (
5). Similarly, single nucleotide differences in NF-κB binding sequences determine cofactor specificity for NF-κB dimers (
6). Thus, the precise nucleotide sequence of a regulatory factor binding site, in addition to guiding the factor to specific genomic loci, may also specify the mode of transcriptional regulation.
To examine GBS effects on GR activity, we constructed luciferase reporters, each containing a single 15-bp core GBS (). The GBSs, differing by as little as 1 bp, were derived from endogenous target genes, matched the GR consensus motif (
5), or were reportedly responsive (
7). The reporters displayed comparable basal activities (
fig. S1A), whereas induction by dexamethasone (dex), a synthetic glucocorticoid, varied from ~twofold (Pal, GilZ) to ~20-fold (TAT) ().
However, gel shift assays revealed no correlation between in vitro GBS affinities and in vivo transcriptional activities ( and
fig. S1C) for a GR fragment with DNA binding affinities comparable to that of full-length GR (
8,
9). For example, TAT was 2- to 10-fold more active than the other GBSs but bound comparably to those with lower activities, whereas GBSs with similar transcriptional activities (Pal, GilZ) displayed different affinities. Moreover, merely reversing the orientation of asymmetric GBSs relative to the transcription start site, which presumably does not alter their affinities, altered their regulatory activities (
fig. S2). These data suggest that GBS half sites confer unique function to the associated monomer.
Next, we tested the effects of mutating each of three GR surfaces implicated in transcriptional activation: (i) activation function 1 (AF1), (ii) AF2, and (iii) the dimerization region (Dim) (
10). Wild-type GR or point mutants that abrogate each activity were cotransfected with GBS reporter plasmids into U2OS cells, which lack endogenous GR expression. Similar to endogenous target genes (
10), we found a GBS-specific usage of GR surfaces (). For example, GBSs that differed by 1 bp (Cgt, Sgk) differed in their dependence on the Dim domain, and GBSs with identical half sites but different spacer sequences (FKBP5, Pal) used all three surfaces differently. Pal and GilZ used different patterns of GR surfaces to arrive at their final indistinguishable activities. Furthermore, for a ~1-kb genomic fragment that recapitulated the domain utilization of endogenous GilZ [see
supporting online material (SOM) text] (
10,
11), changing only GBS sequences changed domain utilization, suggesting that GBSs direct GR activity, even in the context of large composite elements found at endogenous genes (
fig. S3).
To assess GBS specific actions of GR cofactors, we knocked down expression of SWI/SNF subunit Brahma (Brm) (
12) and coactivator-associated arginine methyltransferase 1 (CARM1) (
13). Knock-down of CARM1 reduced activation for each GBS tested (), whereas Brm knock-down had effects ranging from only ~10% reduction on the GilZ GBS to ~60% at FKBP5 (). Thus, the influence of GBS sequences on GR, in turn, alters the composition or function of cognate regulatory complexes.
To test the structural basis for these sequence specific changes, we examined the GR–DNA binding domain (GR-DBD) () in complex with various GBSs by x-ray crystallography. Each GBS was bound asymmetrically by a GR-DBD dimer, with crystal packing contacts between dimerization domain residues on both monomers and a surface of one monomer, denoted here as chain A (
fig. S4). By convention, we place chain A in contact with the conserved AGAACA half site and chain B in contact with the variable half site. The chain A crystal packing surface included parts of the DNA recognition helix, the Dim domain, and the “lever arm” (residues 469 to 474) (
9,
14) loop connecting these two motifs (). In contrast to a described structure (a palindrome with a non-canonical 4-bp spacer) (
15), the 3-bp spacer allowed both recognition helices to make specific major groove contacts (
figs. S4 and S5). The contact made by R466 (
16) was invariant, whereas V462, and more profoundly K461, were GBS-specific (
fig. S5). However, the lack of fixed DNA orientation (
17) precluded definitive attribution of side chain contacts to individual bases.
We also observed a C-terminal helix (H3; residues 509 to 515) that, as with progesterone receptor (PR) (
18), made a nonspecific minor groove backbone contact 3 bp upstream of the GBS motif mediated by R510 (
fig. S6). Distinct from PR, H3 of GR laid across the minor groove, presenting five lysines following R510 as potential sources of interaction. The R510A mutation reduced GR-DBD affinity for DNA (~threefold) and K514A (~twofold), whereas transcriptional activation was unaffected by the K514A mutation and increased by R510A, reaffirming that regulatory activity is not determined solely by affinity (
fig. S6).
A comparison of 13 GR-DBD:GBS structures revealed that they were virtually superimposable, except for the lever arm. In every complex, H472 within the lever arm adopted one of two distinct orientations ( and
fig. S7, A and B). In chain A of each structure, H472 packed identically in the core of the protein fold; in chain B, H472 was flipped out, and the lever arm conformation was more heterogeneous, particularly between complexes with a different spacer length (). The chain B lever arm of four different GR-DBD:GBS complexes crystallized under the same conditions refined to a similar predominant conformation, albeit with higher B factors, indicating less well-defined structure. Composite omit maps (for an explanation, see SOM text) revealed additional electron density specifically at the lever arm, indicating discrete alternate conformations (). The densities were distinct for each complex, suggesting that DNA sequence directs distinct changes in the lever arm.
The conformation of the lever arm appeared to be influenced by DNA backbone contacts at one or both ends. At one end, Y474 formed a weak hydrogen bond to the DNA backbone at GilZ but not the other GBSs (
fig. S8). At the other end, E469 contacted the DNA backbone through K465. GilZ displayed the strongest interactions at these two sites and no alternate conformations, whereas for the other GBSs, the anchoring was weaker and exhibited alternate conformations (). Thus, the conformation of the lever arm is specified by DNA topology, which, in turn, is determined by sequence. Crystallization of GR-DBD:GilZ complexes under a range of conditions revealed substantial lever arm heterogeneity, whereas GR-DBD:Sgk complexes were invariant under the crystallization conditions tested (
fig. S7, C and D). Together, these structures reveal that the lever arm is conformationally sensitive, responding to small environmental shifts, including DNA sequence.
To assess the role of the lever arm in modulating GR activity, we studied GRγ, a splice variant that inserts an arginine in the lever arm () (
19). Consistent with the initial description of GRγ (
19), reporter assays revealed reduced activity for each GBS (). Relative to the predominant GRα isoform, GRγ displayed normal DNA binding affinity (
fig. S9) and near-normal repression of osteocalcin (
20), suggesting that the lever arm selectively affects transcriptional activation. We then tested the role of the lever arm on activity at endogenous genes by comparing U2OS cells stably expressing GRα (
21) or GRγ. The two isoforms were generally similar, but transcriptional regulation was distinct at a subset of target genes ( and
fig. S10). Transcriptional activation of FKBP5 by GRγ was equivalent to GRα, whereas SDPR was elevated and BIRC3 was decreased by GRγ. Chromatin immunoprecipitation showed comparable GRα and GRγ occupancy () at these genes (
5), indicating that the lever arm modulates events subsequent to GR:GBS binding to produce gene-specific, probably GBS-specific, regulation.
We crystallized GRγ-DBD:GBS complexes and found that the structures were similar to those with GRα, except within the lever arm (). Base-specific and DNA backbone contacts were maintained (
fig. S11), reflecting the similar affinities of GRα and GRγ. Thus, the conformational differences in the lever arm have functional consequences.
Mutational analysis of the lever arm revealed that E469 was an important residue for mediating activation () and that the H472A mutation produced increased activity, whereas H472R impaired GR activation. Several lever arm mutations had GBS-specific effects (): N473A selectively reduced activation at Pal, whereas G470A reduced activation at Pal and Tat. Thus, different lever arm conformations may produce multiple functionally distinct interaction surfaces.
In chain A, the packing of H472 occurred through interactions with residues within the DBD core that are GR-specific within the nuclear hormone receptor family (): Y497, L501, and the carbonyl adjacent to V468 (). GRα complexes with 16- and 18-bp GBSs displayed different crystal packing, yet all exhibited the packed chain A and flipped out chain B (
fig. S12). Disruption of chain A packing by a Y497L mutation (
22) or by the R insertion in GRγ affected receptor activity in certain contexts, suggesting that packing of GRα chain A confers functional consequences quite distinct from those seen if both chains are flipped, as in GRγ and perhaps in other nuclear hormone receptors.
Protein functions have evolved commonly to be modulated by cellular signals. Based on our results, we propose that DNA sequences serve as one such signal, functioning as allosteric ligands that direct the activity of GR and probably other transcriptional regulators (
23). Evidence for transduction of structural changes from DNA to other nuclear receptor domains has been described: DNA binding by GR induces secondary structure in its AF1 domain (
24), and the estrogen receptor (ER) AF2 domain interacts with different cofactor peptides when ER is bound to different sequences (
25,
26). We propose that conformational changes in the lever arm amplify signals at the reading head and transmit them to other domains. Our studies with the GR-DBD are an important first step establishing that distinct binding site sequences induce subtle structural differences that are propagated and functionally amplified in the context of the full-length receptor. Studies of other transcriptional regulatory factors imply that interpretation of sequence may be a general property of DNA binding proteins (
6,
23,
27).