Our failure using Mfold
6 to identify dsRNA structures similar to the SBS of ARF1 mRNA within the 3'UTRs of other SMD targets led us to notice that two well-characterized SMD targets – plasminogen activator inhibitor type 1 (SERPINE1) mRNA and hypothetical protein FLJ21870 mRNA
1,2 – contain a single 3'UTR Alu element. We also found that ~13% of the ~1.6% of protein-encoding transcripts in human epithelial HeLa cells that are upregulated at least 1.8-fold upon STAU1 downregulation in three independently performed microarray analyses
2 contain a single 3'UTR Alu element (
Supplementary Table 1). This percentage is higher than the ~4% of HeLa-cell protein-encoding transcripts that contain one or more 3'UTR Alu elements
7, indicating that 3'UTR Alu elements are enriched in SMD targets relative to the bulk of cellular mRNAs.
Alu elements are the most prominent repeats in the human genome: they constitute more than 10% of DNA sequences, are present at up to 1.4 million copies per cell, and share a 300-nucleotide consensus sequence of appreciable similarity among subfamilies
8. To date, Alu elements have been documented to be
cis-effectors of protein-encoding gene expression by influencing transcription initiation or elongation, alternative splicing, A-to-I editing or translation initiation
3,5,9. Since ncRNAs that perfectly base-pair with mRNA can function in
trans to generate endogenous siRNAs
4, it seemed possible that imperfect base-pairing between the Alu element of a ncRNA and the Alu element of an mRNA 3'UTR could create an SBS so as to regulate mRNA decay. We focused on mRNAs that contain a single 3'UTR Alu-element to avoid the possibility of intramolecular base-pairing between inverted Alu elements, which could result in A-to-I editing and nuclear retention
10.
Analysis of Antisense ncRNA Pipeline
11,12 identified 378 lncRNAs that contain a single Alu element (
Supplementary Table 2). Among them, the Alu element of lncRNA_AF087999 (NCBI) has the potential to base-pair with the Alu element within SERPINE1 and FLJ21870 3'UTRs (;
Supplementary Fig. 1a) with ΔG values of, respectively, −151.7 kcal/mol and −182.1 kcal/mol (
Supplementary Table 2; where −151.7 kcal/mol defined the most stable duplex predicted to form between SERPINE1 mRNA and any of the 378 lncRNAs). lncRNA_AF087999, which for reasons that follow is designated ½-sbsRNA1, derives from chromosome 11. RT-semiquantitative (sq)PCR (
Supplementary Fig. 2a) demonstrated that ½-sbsRNA1 is detected in cytoplasmic but not nuclear HeLa-cell fractions and is polyadenylated (
Supplementary Fig. 2b,c). Downregulating the cellular abundance of the two major isoforms of STAU1 to <10% of normal (see, e.g., below) did not affect either the cellular distribution or the abundance of ½-sbsRNA1 (
Supplementary Fig. 2b). ½-sbsRNA1 is present in every human tissue that was examined (
Supplementary Fig. 2d). ½-sbsRNA1 is not a substrate for Dicer or AGO2 (
Supplementary Fig. 2e) and thus is distinct from the lncRNAs that generate endogenous siRNAs.
Two forms of ½-sbsRNA1 have been reported (NCBI). They differ at their 5' end but share a common Alu element and a common 3' end that contains a putative polyadenylation signal (AUUAAA) situated 13 nucleotides upstream of a poly(A) tract. RNase protection assays confirmed the presence of one short (S) and one long (L) form of ½-sbsRNA1 that have a different 5' end and a relative abundance in HeLa cells of 3:1 (
Supplementary Fig. 3a). Primer extension (
Supplementary Fig. 3b) and RT-sqPCR (
Supplementary Fig. 3c) mapped the 5' end of ½-sbsRNA1(S) to a C residue. Therefore, ½-sbsRNA1(S) consists of 688 nucleotides excluding the poly(A) tract (
Supplementary Fig. 3d). While some transcripts that are annotated as ncRNAs may be translated
4, data indicate that ½-sbsRNA1(S) is not translated (
Supplementary Fig. 4).
Remarkably, not only STAU1 siRNA but also ½-sbsRNA1 siRNA increased the levels of SERPINE1 and FLJ21870 mRNAs to 2-to-4.5-fold above normal (;
Supplementary Fig. 5;
Supplementary Fig. 6a;
Supplementary Table 3). Furthermore, experiments that employed cycloheximide indicated that the ½-sbsRNA1-mediated reduction in SERPINE1 and FLJ21870 mRNA abundance depends on translation (
Supplementary Fig. 6b), as does SMD
13. The reduction in SERPINE1 and FLJ21870 mRNA abundance is attributable to their respective 3'UTR sequences since ½-sbsRNA1 siRNA also increased the levels of FLUC-SERPINE1 3'UTR and FLUC-FLJ21870 3'UTR reporter mRNAs relative to FLUC-No SBS mRNA (;
Supplementary Fig. 5;
Supplementary Table 3). The ½-sbsRNA1 siRNA-mediated increase in the abundance of SERPINE1 or FLJ21870 mRNA was reversed by co-expressing ½-sbsRNA1(S)
R, which is resistant to siRNA (
Supplementary Fig. 6c), arguing against siRNA-mediated off-target effects. Significantly, ½-sbsRNA1 siRNA did not affect the expression of other FLUC reporter mRNAs that contain the 3'UTR of SMD targets not predicted to base-pair with ½-sbsRNA1 (
Supplementary Fig. 7).
If ½-sbsRNA1 were to create an SBS by base-pairing with the 3'UTR of SERPINE1 or FLJ21870 mRNA, then it should be possible to co-immunoprecipitate complexes of the lncRNA and each mRNA. To test this possibility, lysates of HeLa cells that transiently expressed (i) ½-sbsRNA1(S)-MS2bs, which contains 12 copies of the MS2 coat protein binding site (MS2bs)
14 upstream of the lncRNA polyadenylation signal or, as a negative control, ½-sbsRNA1(S) or FLUC-MS2bs mRNA () and (ii) FLAG-MS2-hMGFP, which consists of FLAG-tagged MS2 coat protein fused to hMGFP, were immunoprecipitated using anti-FLAG. As expected, prior to IP ½-sbsRNA1(S) as well as ½-sbsRNA1(S)-MS2bs decreased the abundance of SERPINE1 and FLJ21870 mRNAs but not SMD targets that encode interleukin 7 receptor (IL7R), CUG domain-containing protein 1 (CDCP1) or methylthioadenosine phosphorylase (MTAP) (; see below). In support of our hypothesis that ½-sbsRNA1 creates an SBS with partially complementary mRNA sequences, using lysates of cells expressing ½-sbsRNA1(S)-MS2bs, the anti-FLAG IP of FLAG-MS2-hMGFP bound to ½-sbsRNA1(S)-MS2bs co-immunoprecipitated endogenous STAU1, SERPINE1 mRNA and FLJ21870 mRNA as well as the UPF1 SMD factor (). In contrast, irrelevant proteins, such as Calnexin, the dsRNA binding protein ILF3
15, the single-stranded RNA binding protein FMR1
16, and mRNAs that are not predicted to base-pair with ½-sbsRNA1, such as those encoding SMG7, IL7R, CDCP1 or MTAP, were not co-immunoprecipitated (). STAU1 siRNA reduced the co-IP of the SERPINE1 mRNA as well as FLJ21870 mRNA with ½-sbsRNA1(S)-MS2bs to, respectively, ~19% or ~15% of normal (;
Supplementary Fig. 5), indicating that STAU1 stabilizes the duplex formed between SERPINE1 or FLJ21870 mRNA and ½-sbsRNA1.
As additional evidence that ½-sbsRNA1 creates an SBS by base-pairing with the SERPINE1 or FLJ21870 3'UTR, only STAU1-HA
3 but not ILF3 or FMR1 co-immunoprecipitated with ½-sbsRNA1 (
Supplementary Fig. 8).
To determine if ½-sbsRNA1 is required for the co-IP of STAU1 with SERPINE1 or FLJ21870 mRNA, HeLa cells that transiently expressed STAU1-HA
3 and Control siRNA or ½-sbsRNA1 siRNA in the presence or absence of ½-sbsRNA1(S)
R were immunoprecipitated using anti-HA. Compared to Control siRNA, ½-sbsRNA1 siRNA, which reduced the level of ½-sbsRNA1 to ~50% of normal, reduced by ~2-fold the co-IP of STAU1-HA
3 with SERPINE1 or FLJ21870 mRNA (;
Supplementary Fig. 5). In contrast, restoring the level of ½-sbsRNA1 to ~100% of normal by expressing ½-sbsRNA1 siRNA together with ½-sbsRNA1(S)
R restored the co-IP of STAU1-HA
3 with SERPINE1 or FLJ21870 mRNA to near normal (;
Supplementary Fig. 5). As expected, the level of IL7R mRNA, which binds STAU1
2 but does not contain sequences complementary to ½-sbsRNA1, was unaffected by any condition either before or after IP (;
Supplementary Fig. 5).
We conclude that the SMD of SERPINE1 or FLJ21870 mRNA involves base-pairing between their 3'UTR Alu element and the Alu element within ½-sbsRNA1. Base-pairing creates an SBS that is stabilized by STAU1. Furthermore, the level of STAU1 and, thus, the efficiency of SMD does not alter the level of ½-sbsRNA1. Our finding that downregulating SERPINE1 or FLJ21870 mRNA to 50% and 25% of normal, respectively, failed to detectably decrease the co-IP of STAU1-HA
3 with ½-sbsRNA1 (
Supplementary Fig. 9) indicates that ½-sbsRNA1 may bind to more than SERPINE1 and FLJ21870 mRNAs to recruit STAU1 if not trigger SMD.
The presence of UPF1 in the anti-FLAG IP of FLAG-MS2-hMGFP () is consistent with the idea that STAU1 that is bound to a ½-sbsRNA1-created SBS associates with UPF1, analogously to how STAU1 that is bound to the ARF1 SBS associates with UPF12,
13. Furthermore, downregulating UPF1, like downregulating STAU1, increases the abundance of SERPINE1 mRNA, FLJ21870 mRNA and FLUCSERPINE1 3'UTR mRNA by increasing mRNA half-life
1,2. To test for UPF1 function in conjunction with ½-sbsRNA1, we analyzed the effects of various siRNAs on the production of FLUC-SERPINE1 3'UTR mRNA in which the 3'UTR was intact, precisely lacked the region that was partially complementary to ½-sbsRNA1, or contained solely this region (). Relative to Control siRNA, STAU1 siRNA, UPF1 siRNA or ½-sbsRNA1 siRNA did not affect the level of FLUC-SERPINE1 3'UTR mRNA that lacked the ½-sbsRNA1 binding site (BS; ;
Supplementary Fig. 5), but each siRNA increased the levels of FLUC-SERPINE1 3'UTR mRNA and FLUC mRNA that contained only the ½-sbsRNA1-BS (;
Supplementary Fig. 5). We conclude that, as indicated by its name, ½-sbsRNA1 base-pairs with the 3'UTR of SERPINE1 mRNA and, by analogy, FLJ21870 mRNA so as to recruit STAU1 and its binding partner UPF1 in a way that triggers a reduction in mRNA abundance. Consistent with previous studies of SMD
2,13, the STAU1- and ½-sbsRNA1-mediated reduction in mRNA abundance is due to a decrease in mRNA half-life (
Supplementary Fig. 10). With regard to function, scrape injury repair assays revealed that ½-sbsRNA1 contributes toward reducing cell migration by targeting SERPINE1 and RAB11FIP1 mRNAs for SMD (
Supplementary Fig. 11).
Characterizing seven other lncRNAs that contain a single Alu element and consist of <1000 nucleotides (
Supplementary Table 2) confirmed that they, too, are largely cytoplasmic and polyadenylated (
Supplementary Fig. 2b,c; data not shown) and have the potential to base-pair with the single Alu element within at least one mRNA 3'UTR (;
Supplementary Fig. 1b,c,d;
Supplementary Table 2; data not shown). Individually downregulating three of these lncRNAs – lncRNA_BC058830 (½-sbsRNA2), lncRNA_AF075069 (½-sbsRNA3) or lncRNA_BC009800 (½-sbsRNA4) – upregulated those tested mRNAs that (i) contain a partially complementary Alu element and (ii) are upregulated upon STAU1 or UPF1 downregulation; each lncRNA failed to upregulate mRNAs that lack a partially complementary Alu element (;
Supplementary Fig. 5; data not shown). While ½-sbsRNA2 targeted the 3'UTR Alu element of CDCP1 mRNA (;
Supplementary Fig. 5;
Supplementary Table 2, where ΔG = −153.7 kcal/mol), ½-sbsRNA3 and ½-sbsRNA4 targeted the 3'UTR Alu element of MTAP mRNA (;
Supplementary Fig. 5;
Supplementary Table 2, where ΔG = −203.1 and −264.2 kcal/mol, respectively). Furthermore, none of the three lncRNAs downregulated SERPINE1 mRNA (;
Supplementary Fig. 5;
Supplementary Table 2, where ΔG = 0, −66.4 and −108.2 kcal/mol, respectively) but two downregulated FLJ21870 mRNA ~2-fold (;
Supplementary Fig. 5;
Supplementary Table 2, where ΔG = −261.9 and −444.2 kcal/mol).
These findings illustrate the potentially complex network of regulatory events that are controlled by lncRNA–mRNA duplexes that bind STAU1 and is reminiscent of the web of regulatory mechanisms that are mediated by miRNAs
17. Notably, both CDCP1 mRNA and MTAP mRNA were upregulated at least 2-fold upon STAU1 downregulation in experiments reported here (), and indeed CDCP1 mRNA is among those mRNAs that were upregulated minimally 1.8-fold upon STAU1 downregulation
2(
Supplementary Table 1). However, since MTAP mRNA was upregulated only ~1.5-fold
2, it is not included in
Supplementary Table 1. Thus, this Table must be considered as providing only a partial list of mRNAs that are modulated by one or more ½-sbsRNAs. Conceivably, the degree of modulation could vary in different cell types (
Supplementary Fig. 2d) or developmental stages depending on the abundance of the ½-sbsRNA(s) and on proteins that inhibit or enhance base-pairing.
It is important to note that ΔG values are not in themselves absolute predictors of SBS function. For example, while ½-sbsRNA2 is predicted to base-pair with the 3'UTR Alu element of BAG5 mRNA with a ΔG of – 416 kcal/mol, BAG5 mRNA is not targeted for SMD in HeLa cells (
Supplementary Fig. 12). The 3'UTR Alu element of BAG5 mRNA may be physically inaccessible to base-pairing with ½-sbsRNA2. Nevertheless, base-pairing per se may not be sufficient for SBS function since converting the 100-nt apex of the intramolecular ARF1 SBS to a 4-nt loop that is not predicted to disrupt the adjacent 19-bp stem of the ARF1 SBS reduces STAU1 binding
in vivo by 50%
2.
Here, we report an unforeseen role for some of the lncRNAs that contain Alu elements: the creation of SBSs by intermolecular base-pairing with an Alu element within the 3'UTR of one or more mRNAs. We conclude that SBSs can form either through intramolecular base-pairing, as exemplified by the ARF1 SBS, or intermolecular base-pairing between a ½-SBS within an mRNA 3'UTR and a complementary ½-sbsRNA in the form of a largely cytoplasmic lncRNA ().
There are estimated to be tens of thousands of human lncRNAs that have little or no ability to direct protein synthesis and that are distinct from rRNAs, tRNAs, snRNAs, snoRNAs, small interfering RNAs or microRNAs
18. Thus, the paradigm that partially complementary ncRNA–mRNA duplexes can form SBSs may extend to the creation of binding sites for other dsRNA binding proteins. Since only 23% of lncRNAs were found to contain one or more Alu elements, lncRNA–mRNA duplexes that do not involve Alu elements could expand the number of ncRNAs that regulate gene expression via SMD or a different dsRNA binding protein-dependent pathway.