Long intergenic noncoding RNAs (lincRNAs) regulate dosage compensation, imprinting, and developmental gene expression by establishing chromatin domains in an allele- and cell-type specific manner [reviewed by (1
)]. LincRNAs are intimately associated with chromatin remodeling complexes (3
), but molecular mechanisms of their functions are still lacking. Post-translational modifications of histones recruit DNA binding proteins and chromatin remodeling machinery, and are often coupled for combinatorial control [reviewed by (8
)]. For instance in embryonic stem cells, many genes encoding developmental regulators, such as the HOX
, are transcriptionally silent but possess bivalent histone H3 lysine 4 (H3K4) and lysine 27 (H3K27) methylation, which are resolved into univalent H3K4 or H3K27 methylation domains upon differentiation (9
). Here we show that a lincRNA can coordinate histone modifications by binding to multiple histone modification enzymes.
The lincRNA HOTAIR is transcribed from the HOXC
locus and targets Polycomb Repressive Complex 2 (PRC2, comprised of H3K27 methylase EZH2, SUZ12, and EED) to silence HOXD
and select genes on other chromosomes (7
). The genomic regions flanking HOXD
are also bound by CoREST/REST repressor complexes (12
), which contain LSD1 (KDM1/BHC110), a demethylase that mediates enzymatic demethylation of H3K4me2 (13
) and that is required for proper repression of Hox
genes in Drosophila
). We therefore hypothesized that HOTAIR may coordinately interact with both PRC2 and LSD1. Immunoprecipitation (IP) of either endogenous LSD1 or FLAG-tagged LSD1 from primary foreskin fibroblasts or HeLa cells specifically retrieved endogenous HOTAIR RNA with comparable enrichment to EZH2 IP, the positive control ( and fig. S1A). IP of three other chromatin proteins did not retrieve HOTAIR (fig. S1A), and LSD1, EZH2, or FLAG-LSD1 IP did not retrieve U1 RNA, a nuclear ncRNA that served as a negative control. Purified biotinylated HOTAIR RNA, but not GFP RNA or an antisense HOTAIR fragment, specifically retrieved EZH2, SUZ12, and LSD1 from HeLa cell nuclear extract ( and fig. S1B). LSD1 forms a complex with CoREST (15
), which can bridge LSD1 to the neuronal gene silencer REST (16
). REST is believed to mediate silencing through two distinct effector arms: one via LSD1-CoREST, and separately via the adaptor protein CDYL and the H3K9 KMT G9a (17
). HOTAIR specifically bound to CoREST and REST, but not CDYL or G9a, nor to the putative PRC1 subunit YY1 (). Further, biotinylated HOTAIR bound to purified PRC2 and LSD1 complexes in vitro ( and fig. S1C). These results suggest that HOTAIR directly interacts with PRC2 and LSD1 complexes.
Fig. 1 5′ domain of HOTAIR binds PRC2 and 3′ domain of HOTAIR binds LSD1. (A) LSD1 IP specifically retrieves HOTAIR RNA. Data (mean ± SD, n = 3) is relative to mock-IP (IgG or FLAG). ND, not-detectable. (B) In vitro transcribed (IVT), (more ...)
Using a series of HOTAIR deletion mutants, the PRC2 binding activity mapped to nucleotides 1 to 300 of HOTAIR, while the LSD1 complex binding activity mapped to nucleotides 1500 to 2146 (). Deletion mutants that retained nucleotides 1 to 300 bound EZH2 or SUZ12 with equal efficiency as full length HOTAIR, and deletion mutants that retained nucleotides 1500 to 2146 retained LSD1 binding activity. Thus, HOTAIR is a modular bifunctional RNA that has distinct binding domains for PRC2 and LSD1 complexes. Computational analysis and RNA footprinting showed that the PRC2 and LSD1 binding domains of HOTAIR are likely to possess extensive but distinct secondary structures (fig. S2).
The presence of independent binding sites for PCR2 and LSD1 on HOTAIR suggests that HOTAIR may bridge PRC2 and LSD1 complexes. EZH2 IP retrieved LSD1, and conversely LSD1 IP retrieved EZH2 from foreskin fibroblasts (). We estimate that less than 5% of the two complexes physically interact with each other, consistent with prior purification results that isolated PRC2 and CoREST-LSD1 as separate complexes (18
). RNAi of HOTAIR or RNase treatment of the IP abrogated the interaction between EZH2 and LSD1, suggesting that HOTAIR is required to bridge this interaction ( and fig. S3). Wild-type HeLa cells or HeLa cells stably expressing FLAG-LSD1 (FL-HeLa) expressed ~ten-fold less HOTAIR than foreskin fibroblasts and showed undetectable endogenous interaction between PRC2 and LSD1. Enforced expression of HOTAIR in FL-HeLa cells to a level comparable to foreskin fibroblasts allowed robust interaction between PRC2 and LSD1 (). Gel filtration chromatography confirmed that HOTAIR expression shifts PRC2 subunits into a higher molecular weight complex coincident with the LSD1 complex, suggesting the formation of a higher ordered complex comprised of HOTAIR, PRC2, and LSD1 complexes in HOTAIR overexpressing cells (fig. S4). Moreover, expression of each HOTAIR mutant that lacked the ability to bind either PRC2 or LSD1 in vitro failed to induce PRC2-LSD1 interaction in cells ( and fig. S3C).
Fig. 2 HOTAIR is necessary and sufficient for interaction between EZH2 and LSD1. (A) In foreskin fibroblasts, EZH2 interacts with LSD1 (lanes 1, 4). Knockdown of HOTAIR (lanes 3, 6), but not GFP (lanes 2, 5), abolishes this interaction. HOTAIR levels (mean ± (more ...)
HOTAIR-mediated bridging of PRC2 and LSD1 complexes also enables their coordinate binding to target genes on chromatin. HOTAIR is required for H3K27 methylation and transcriptional silencing across the HOXD
). Therefore, we mapped PRC2 (as indicated by SUZ12) and LSD1 occupancy across the HOX
loci and on promoters genome-wide by chromatin IP followed by microarray analysis (ChIP-chip), in primary foreskin fibroblasts after control RNAi or HOTAIR knock down ( and figs. S5 to S7). HOTAIR knockdown decreased SUZ12 and LSD1 occupancy in a similar pattern across HOXD
(, and fig. S6; R
= 0.59, p
-test). Coordinate loss of SUZ12 and LSD1 occupancy caused by HOTAIR knockdown were concentrated in proximal promoters of HOXD
genes (). These regions correspondingly lost H3K27me3 and gained H3K4me2, the respective histone methylation products of PRC2 and LSD1 complexes (, and figs. S6 and S7; R
= 0.40, p
-test). The loss of H3K27me3 occurred across broad domains encompassing multiple HOXD
genes and intergenic regions, while the gain of H3K4me2 was concentrated near the transcriptional start sites of HOXD
). Multiple independent siRNAs targeting HOTAIR gave the same results.
Fig. 3 HOTAIR coordinates localization of PRC2 and LSD1 genome-wide. (A) Changes in mRNA and occupancy of H3K4me2, H3K27me3, LSD1, and SUZ12 across HOXD locus after RNAi of HOTAIR in foreskin fibroblasts. Yellow boxes indicate regions of notable correlation (more ...)
Examining human promoters genome-wide, ChIP-chip analysis showed that PRC2 and LSD1 occupied 4740 and 2116 gene promoters, respectively (). Nearly one third of LSD1 occupied promoters, comprised of 721 genes, were also occupied by SUZ12, revealing a significant overlap (257 overlap expected by chance alone. p = 3.4 × 10−164, hypergeometric distribution). Among these 721 genes co-occupied by SUZ12 and LSD1, the distances between the binding sites of SUZ12 and LSD1 were predominantly less than 500 base pairs, which is the fragmentation size of chromatin in our ChIP assay and the limit of resolution (fig. S8A).
HOTAIR knockdown led to concordant loss of SUZ12 and LSD1 occupancy in 289 of the 721 genes normally co-occupied by SUZ12 and LSD1 (almost 40%) ( and table S1). Additional genes showed more exclusive loss of LSD1 occupancy (33%) or SUZ12 occupancy (16%), suggesting that HOTAIR may be involved in other LSD1- or SUZ12-dependent pathways. ChIP followed by qPCR confirmed the requirement of HOTAIR for PRC2 and LSD1 localization for all six genes tested (fig. S8C). HOTAIR knockdown did not change the chromatin occupancy by PRC2 and LSD1 at hundreds of other genes, nor did it affect the protein or mRNA level of the subunits of PRC2 or LSD1 complexes ( and fig. S9, A to C). The functional consequence of coordinate targeting of PRC2 and LSD1 by HOTAIR is gene repression: genes co-occupied by SUZ12-LSD1 in a HOTAIR-dependent manner are also significantly induced upon HOTAIR knockdown as measured by microarray or qRT-PCR [p
< 0.05, Gene Set Enrichment Analysis (20
); and fig. S8D]. These results suggest that a single lincRNA—HOTAIR—may be required to target both PRC2 and LSD1 to hundreds of genes across the genome in order to coordinate histone modifications for gene silencing.
Both PRC2 and LSD1 can bind multiple proteins that are thought to provide DNA target specificity (16
). A possible consequence of the HOTAIR-mediated bridging is that PRC2 may be recruited to LSD1-CoREST-REST binding sites, and conversely, LSD1 may be recruited to PRC2 binding sites. Prior genome-scale mapping studies of PRC2 already identified the REST motif as one of the most enriched DNA sequence motifs within PRC2 binding sites but with no mechanistic explanation (22
). We searched for enriched sequence motifs in SUZ12 binding sites lost upon HOTAIR knock down (“HOT-S sites” for short) and identified several enriched motifs (23
), including a motif that corresponds to the right half of the canonical REST motif (p
= 1.05 × 10−12
; and fig. S10). REST is able to bind only one half site of the canonical REST motif (24
), and genes containing HOT-S sites are enriched for experimentally measured REST occupancy (p
< 1.27 × 10−16
, hypergeometric distribution; fig. S9D and table S2) (24
). The most significantly enriched motif in LSD1 binding sites that are lost upon HOTAIR knockdown (termed “HOT-L sites”) is a CG-rich motif (p
= 3.66 × 10−10
; and fig. S10), which is important for PRC2 binding (22
). Thus, the enrichment of the CG-rich motif may reflect the HOTAIR-dependent recruitment of LSD1 complexes to PRC2 bound sites, which are often in CpG islands. We examined the gain of SUZ12 and LSD1 occupancy on chromatin when HOTAIR is overexpressed in primary lung fibroblasts, which do not express endogenous HOTAIR. HOTAIR overexpression caused ectopic occupancy of LSD1 and SUZ12 that significantly overlapped (p
< 7.31 × 10−95
). Further, motif analysis of the ectopically gained binding sites recovered an almost identical CG-rich motif (p
= 7.9 × 10−37
; ), suggesting that this motif is involved in HOTAIR target selection. Nonetheless, the REST half site and the CG-rich motif are currently not sufficient for de novo prediction of all HOTAIR-dependent genes, suggesting that additional motifs, binding partners, and/or motif arrangements may be important.
Fig. 4 HOTAIR-dependent SUZ12 and LSD1 binding motifs. (A) SUZ12 occupancy sites lost upon HOTAIR knockdown (HOT-S sites) are enriched for a DNA motif very similar to the right half of canonical REST motif. (B) LSD1 occupancy sites lost upon HOTAIR knockdown (more ...)
In this report, we demonstrate that the lincRNA HOTAIR can link a histone methylase and a demethylase by acting as a modular scaffold (fig. S11). Other lincRNAs may also contain multiple binding sites for distinct protein complexes that direct specific combinations of histone modifications on target gene chromatin. Some lincRNAs may be “tethers” that recruit several chromatin modifications to their sites of synthesis (2
), while other lincRNAs can act on distantly located genes as “guides” to affect their chromatin states (2
). Based on their dynamic patterns of expression (27
), specific lincRNAs can potentially direct complex patterns of chromatin states at specific genes in a spatially and temporally organized manner during development and disease states.