Bidirectional promoters produce non-coding transcripts in many organisms and the mechanisms are likely to be conserved, so studies in yeast should be highly relevant to all eukaryotes.
46 Here we explore the differences between two subclasses of ncRNAs that arise from divergent transcription at yeast mRNA promoters. Although the term CUT was originally applied generally to small, unannotated non-coding RNAs, Xu et al.
14 operationally divided these into transcripts that were only apparent in an
rrp6Δ background (CUTs) and those that were present even with a fully functional nuclear exosome (SUTs). In agreement, we see that CUTs are strongly stabilized in cells containing mutations of the nuclear exosome components Rrp6 or Rrp47. The Nrd1/Sen1 early termination pathway is also critical, consistent with the idea that CUTs use the same pathway as snoRNAs for their termination and processing. Interestingly, mammalian PROMPTs and plant UNTs, non-coding transcripts associated with promoters, are also degraded by the nuclear exosome and it will be interesting to see whether some yet to be defined early termination pathways target this degradation pathway to these transcripts.
20,21We find that the SUTs analyzed here are also degraded by the nuclear exosome, but still exist at significant levels even in the presence of wild-type Rrp6 and Rrp47. This result suggests that either there are two populations of the SUT, only one of which is intrinsically susceptible to nuclear degradation, or that the SUT is not efficiently recognized by the nuclear exosome and thereby kinetically bypasses this degradation pathway. Either possibility could be explained by inefficient use of the Nrd1/Sen1 early termination pathway, which is physically coupled to exosome recruitment. So far, no obvious correlation of fewer Nrd1 and Nab3 binding sites or longer transcript size has been connected with SUT identity, although this is also true of mRNAs. It is possible that SUTs contain stronger polyadenylation sequences, which would be consistent with the effect of the
rna14-1 mutant on their levels. SUTs are clearly exported from the nucleus, as they are subject to cytoplasmic degradation mediated by a combination of NMD factors, decapping enzyme, and the Xrn1 5′ to 3′ exonuclease.
39 Just as mutants in nuclear exosome components revealed a hidden layer of the transcriptome,
14,16,21,29 it is likely that analysis of
xrn1 mutants would uncover even more cryptic transcription. Intriguingly, although both the nuclear exosome and cytoplasmic pathways target SUTs, the steady state levels of these transcripts in wild-type cells suggest they have half-lives closer to that of mRNAs.
Our results are consistent with and help explain some previous results from other labs that raised questions about the metabolism of CUTs. Like SUTs, a non-coding antisense transcript arising from the long terminal repeat of the yeast Ty1 retrotransposon (designated RTL) is polyadenylated and degraded by the Xrn1 cytoplasmic degradation pathway.
24 Srg1 and several other short, cryptic transcripts are also susceptible to cytoplasmic decay and several were shown to associate with ribosomes.
23,24 Although these RNAs were referred to as CUTs at the time the papers were published, by comparison to the transcripts analyzed here they would now be more accurately classified as SUTs.
These and our results indicate that multiple sequential or overlapping degradation pathways can act upon non-coding transcripts in yeast and that the functional distinction between CUTs and SUTs is largely due to which termination and degradation pathways predominate. Plasticity in choice of 3′ end formation pathway has been previously been reported for other non-coding transcript classes.
47,48 While CUTs use the snoRNA pathway, SUTs may more closely resemble coding mRNAs. Indeed, it is interesting to speculate that these cryptic transcripts provide raw material for evolution of new coding transcripts. Although CUTs and SUTs are usually referred to as non-coding, many contain small open reading frames that could encode peptides. Some of these transcripts have been found associated with polyribosomes and depend on translation for degradation via the NMD pathway.
23,39 Examples of small peptides with striking biological functions include the wide range of toxic peptides found in Conus snails and more recently a regulator of Drosophila embryogenesis.
49,50One unexpected discovery of our RNA analysis was the existence of 3′ extended transcripts emanating from the CUTs and SUTs. While these are readily apparent by northern blotting, they would likely be obscured during RNA microarray analysis because of their lower levels and overlap with adjacent gene transcripts. In addition, polyadenylated RNA is used for tilling array analysis but eCUTs and eSUTs may be under-represented in this RNA fraction.
14 The eCUTs appear to arise from read-through of Nrd1-dependent termination, since their levels increase dramatically in the
nrd1 mutant. At least in some cases, this also appears true of the eSUTs. The eCUTs and eSUTs levels increased in mutants in NMD and cytoplasmic 5′ to 3′ decay factors, indicating they are transported out of the nucleus and may even associate with ribosomes. These RNA species will need to be further characterized in future studies, since preliminary analysis raises questions about how they are terminated and about their 3′ end formation. Based on size, they do not appear to simply be bicistronic read-through transcripts that terminate at the next annotated polyA site. Also remaining to be explored is whether these long non-coding
S. cerevisiae transcripts play a role in regulating adjacent genes, either by reading into them,
47 through other mechanisms that could include chromatin effects
12,51 or action in cis.
26,52 Long non-coding transcripts in other organisms play important roles in a range of processes, including X-chromosome inactivation,
53 regulation of HOX gene expression,
8,54 mediating p53 responses associated with cancer,
55 roles in
S. pombe centromeric heterochromatin formation
5 and flowering time regulation in Arabidopsis.
3 Given the compact genome of
S. cerevisiae, eCUTs and eSUTs have great potential to be components of gene regulatory circuits.