The recognition of short DNA motifs by transcription factors is a key step in the regulation of transcription. However, the low information content of most of the TFBS predicts a promiscuous binding that is in contrast with the observed specificity. This apparent paradox raises the question of how transcription factor selectivity is achieved. Although several mechanisms including chromatin accessibility and TF cooperation, have been shown to restrict the target space for a given TF, a complete explanation is still lacking in most of the cases. In this scenario, HIF transcription factors are not an exception. HIF heterodimers bind to the RCGTG motif (
4,
6,
9), which is present in almost every gene in the human genome, yet hypoxia results in the regulation of a few hundred genes only and, accordingly, HIF only binds to a subset of the potential binding sites (
5,
6,
9,
10). It has been recently demonstrated that HIF binds preferentially to RCGTG motifs present in the promoters of genes actively transcribed under normoxic conditions (
8). Although this restriction results in a large reduction in the number of potential targets, it does not fully explain HIF selectivity as many genes transcribed under normoxia and containing RCGTG motifs are not induced by hypoxia. With the aim of gaining insight into the mechanisms of HIF target selectivity, we investigated the
GYS1/RUVBL2 genes as a particularly striking example of differential regulation. Given their close proximity and the location of the functional HRE between both genes, we had expected their coordinated regulation by HIF. However, our results showed that while
GYS1 was induced by hypoxia,
RUVBL2 levels remained unchanged (). The basal level of mRNA, RNA polymerase II binding and promoter activity ( and
Supplementary Figure S1) ruled out the accessibility of chromatin as an explanation for the lack of induction of
RUVBL2 by hypoxia. Detailed analysis of the promoter activity of different fragments derived from the
GYS1/RUVBL2 intergenic region revealed the existence of an inhibitory region between the HRE and the
RUVBL2 gene (). Finally, specific EBAs performed
in vitro (), as well as
in vivo () demonstrated the existence of an insulator element that could explain the lack of effect of the HRE over the
RUVBL2 promoter and thus, the differential regulation of
GYS1 and
RUVBL2 genes by hypoxia.
Insulators have been shown to play a key role in the differential patterns of gene expression during development and cell-lineage specification (
23). However, to our knowledge, this is the first example of an enhancer blocking element contributing to the selectivity of a transcription factor acutely induced by environmental factors, such as HIF. Although further work is required to determine whether this is a general mechanism contributing to HIF specificity, our working hypothesis is that this mechanism could be particularly relevant in cases of bidirectional promoters, such as the one described herein, where chromatin accessibility and DNA methylation is likely to be similar across the intergenic region. In support to this possibility, we have found at least one further example of bidirectional promoters differentially regulated by hypoxia,
BCKDHA and
EXOSC5. Preliminary results indicate that, in spite of similar basal transcription,
BCKDHA, but not
EXOSC5, is induced by hypoxia (
Supplementary Figure S3).
Another issue raised by our results relates to the molecular identity of the insulator element located upstream of the
RUVBL2 gene. In vertebrates, several regulatory elements including CTCF binding motifs (
36–38), repetitive elements, [such as ALUs (
39), SINE B2 (
29) and SINE B1 (
30)] and scaffold/matrix-attachment regions [S/MARs; (
40,
41)], have been shown to function as insulators (
25,
42). Among them, the most widespread and well characterized are CTCF-binding elements (
43). We have not found obvious CTCF binding motif within the −202 to −30 region upstream of
RUVBL2, and published ChIP data shows CTCF binding to the GYS1 TSS region, but not to the region between the HRE and
RUVBL2 gene (
Supplementary Figure S4A, CTCF ChIP signal track). On the other hand, there are no repetitive elements within this genomic region (
Supplementary Figure S4A, repeats tracks) arguing against the existence of a SINE element that could explain the observed enhancer blocking activity. Finally, although we found no locally high proportion of A/T nucleotides, typically associated with S/MARs elements (
41) in the region upstream of
RUVBL2 (
Supplementary Figure S4A, GC percent track), computer prediction of S/MAR sites, found a significant signal in the upstream region of
RUVBL2 (
Supplementary Figure S4B). Thus, the insulator activity described herein could be mediated by S/MAR elements. However, as this evidence has been obtained through
in silico approaches, further work is required to identify the minimal region required for the enhancer blocking function, including its molecular identity, underlying mechanism and the associated potential
trans-acting factors. This is yet another perfect example demonstrating the diversity of mechanisms, most of them not known to date, that cells are using to organize functional insulator elements (
25).
The mechanism by which enhancer blockers prevent the activity of upstream enhancers is unclear, although our current understanding is that insulators will probably not be using unique mechanisms but, rather, adaptations of pre-existing ones already in place for the normal regulation of gene expression (
23). Several of the proposed models invoke the generation of chromatin loops that segregate enhancer-sensitive and resistant promoters in distinct domains (
44,
45). In the case described herein, the model is further complicated by the short distance existing between the
cis-elements involved. A piece of information that could shed light into the mechanism is the intriguing observation that the inclusion of the uG region seems to abolish the enhancer blocking effect of uR in EBA (B, compare constructs 1R/1F with 3R/3F). Although we cannot currently explain this behavior, it would suggest that the EBA of uR can be modulated by elements located in its proximity. In this regard, it has been previously found that the EBA of the ‘gypsy’ element is affected by the number of copies of this element. When two copies, instead of one, are located between the enhancer and promoter, its blocking effect is abolished (
46). However, the enhancer blocking effect is restored by the insertion of a third copy (
47), depending on the order and distance of the insulator elements, indicating that complex protein–protein interactions are responsible for these unexpected effects and underlying a major role of insulators in whole nuclear genome organization (
24,
48). Thus, it is possible that the impairment of the EBA of uR by uG can be reverted by other
cis-elements, present in the native genomic context, but not included in this set of constructs. A further possibility is that the HRE-containing block could enhance the transcription from the minimal CMV promoter. If this is the case, the enhancer blocking element within uR would be located upstream of the HRE element in the construct 1R and would be thus, unable to prevent its action upon the minimal CMV promoter. Regardless of the specific mechanism by which uR exerts its effect, our data clearly demonstrates that it contains an EBA.
Reprogramming of cellular metabolism, in particular glucose metabolism, is central in the cellular adaptation to hypoxia. The hypoxic induction of
GYS1, encoding for an isoform of glycogen synthase, is part of this reprogramming (
27). On the other hand, the existence of an enhancer blocking element between the HRE and the
RUVBL2 promoter raises the question of why
RUVBL2 expression has to be shielded from the HIF-mediated induction.
RUVBL2 gene encodes for Reptin, an AAA+ ATPase that forms part of chromatin remodeling complexes. Interestingly, it has been recently shown that hypoxia leads to reptin methylation and that, upon this post-translational modification, it is able to repress HIF-mediated transcription (
49). Thus, it is plausible that
RUVBL2 expression might not be induced by hypoxia so as not to upset the balance between methylated/unmethylated reptin that could lead to premature termination of HIF-mediated transcription. However, this hypothesis does not provide an explanation for the close proximity of these genes. The conservation of the
GYS1/RUVBL2 genomic arrangement from opossums to humans suggests a selective pressure to maintain both genes in close proximity. Previous studies have shown that a substantial proportion of mammalian genes is arranged in a divergent head-to-head structure and controlled by bidirectional promoters (
50), so that the pair of genes tend to be co-expressed (
50,
51). The need for co-regulation of the pair of genes under the control of a bidirectional promoter could explain the selective pressure that keeps them in close proximity. However, in the case of
GYS1/RUVBL2, co-regulation of both genes is an unlikely reason for their close proximity as the existence of the insulator element would prevent the action of flanking
cis-elements on the opposite promoter.
In summary, we have identified an insulator, acting as a functional enhancer blocking element, that explains the differential response of GYS1 and RUVBL2 genes to hypoxia. To our knowledge, this is the first report describing a role for this type of genetic elements in dictating the specificity of acutely induced transcription factors in response to environmental, as opposed to developmental, signals. Importantly, the generalization of this model adds to the arsenal of strategies, including chromatin accessibility and combinatorial assembly of TFs, that are employed by eukaryotes to ensure a highly specific gene expression based on an otherwise promiscuous set of cis-regulatory elements.