One of the most interesting findings from a number of recent high-throughput genomic analyses is the identification and isolation of a large class of previously uncharacterized non-protein-coding RNAs (ncRNAs) (). These transcripts were found to be expressed in regions not previously annotated as containing an actively expressed genic unit [
9,
44]. This class of RNAs seems to represent a major component of the transcriptome that has been suggested to play a novel role in epigenetic gene regulation [
25]. Furthermore, a large fraction of these epigenetic regulatory ncRNAs are transcribed by RNA Polymerase II (RNAPII) [
45]. One model for gene silencing mediated by this new class of RNAs suggests that expression of the ncRNA from the same or opposite strand could interfere with expression of a target gene, such as the case for the
SRG1 ncRNA and its target
SER3 () [
46], the
Tsix ncRNA [
47] and its ncRNA target
Xist, and the
Air ncRNA [
45] and its target
Igf2r.
Interestingly, the previously mentioned gene expression studies of DNA methyltransferase (
met1 and
drm1 drm2 cmt3) mutants also identified a large number of ncRNAs whose expression levels increased in these plants [
19••]. Therefore, genomewide gene expression data in combination with mapping of the sites of DNA methylation suggested that the increased expression of these ncRNAs may indeed be a by-product of the loss of DNA methylation, which when present would silence expression from these loci (). Furthermore, a recent report by Kapranov
et al. [
48••] demonstrated that within mammalian genomes there are numerous, small ncRNAs whose expression originates in sites of high-density DNA methylation (CpG islands), suggesting the existence of ncRNAs in mammals that may also be regulated by DNA methylation.
A number of the novel transcripts found in the
Arabidopsis genome seem to be repeated many of times in the genome (multi-copy elements), thereby suggesting they may represent unannotated transposons (
19••). Furthermore, this class of novel ncRNAs have been suggested to silence neighboring genes through there expression by 1) RNA interference (RNAi) [
47,
49] and 2) occlusion of promoter binding by RNA polymease (RNAP) and other transcription factors () [
46,
50]. Additionally, others of these novel ncRNAs were found to be single or low copy genic units within the genome (
19••). Many of the ncRNAs in this class have no significant sequence homology to the genomes of other organisms, thus suggesting that the
Arabidopsis genome contains a large number of fast-evolving ncRNAs whose expression is controlled epigenetically by DNA methylation. Indeed, a very large fraction of ncRNAs are poorly conserved, which has led to the speculation that they are not functional [
51]. However, this observation is not necessarily associated to a lack of functionality [
52], seeing that the DNA sequences of known functional ncRNAs such as
Xist and
Air are poorly conserved [
53]. Additionally, an independent study on expression of intergenic sequences in human and chimpanzee showed that conservation of expression of ncRNAs in equivalent genomic positions, but not conservation of the RNA sequences [
54]. Therefore, these unique ncRNAs may have originated from evolutionary responses to the growth environment. For example, since plants are sessile organisms, they may have evolved novel genes/processes to rapidly regulate/respond to environmental changes. Interestingly, previous whole-genome gene expression studies using
Arabidopsis culture cells by Yamada
et al. [
9] revealed that many pseudogenes, transposons, and ncRNAs are over-expressed in these de-differentiated cells, similar to the transcriptome profile witnessed for
met1 mutant plants. Together, these results suggest that DNA methylation likely plays a role in cellular de-differentiation and regeneration through its regulation of gene expression from putative pseudogenes, transposons, and ncRNAs that litter the
Arabidopsis genome.