Eukaryotic rRNAs contain a large number of nucleotide modifications including 2'-O-methylation and pseudouridylation which are directed by box C/D snoRNAs and box H/ACA snoRNAs, respectively [
1,
2]. These modifications are usually found in the conserved core regions of rRNAs, and they play important roles in ribosome function [
3]. SnoRNAs are among the most numerous and functionally diverse non-coding RNAs currently known [
4,
5], existing widely in eukaryotes including human [
6-
8], plants [
9-
11], yeasts [
12-
15] and protists [
16-
19], as well as in Archaea [
20]. This indicates that they are ancient molecules that arose over 2-3 billion years ago [
21]. In addition to guiding the posttranscriptional modifications of rRNAs in eukaryotes and Archaea, snoRNAs also interact with spliceosomal RNAs [
22], tRNAs [
23,
24], SL RNAs in trypanosomes [
25], and at least one brain-specific mRNA in mammals [
26]. Recently, snoRNA U50 was found to be a candidate tumor suppressor gene in prostate cancer [
27]. The existence of substantial numbers of orphan snoRNAs indicates that snoRNAs also participate in diverse biological processes that remain to be identified [
4].
SnoRNAs exhibit canonical sequence motifs and structural features. Box C/D snoRNAs carry the conserved box C (RUGAUGA, where R can be any purine) and D (CUGA) motifs near their 5' and 3' termini, respectively. Additionally, the variants of the C and D boxes, designated C' and D' box, are usually present internally [
28]. Box H/ACA snoRNAs possess a hairpin-hinge-hairpin-tail secondary structure and two conserved sequence motifs, box H and box ACA. The hinge region contains the H box (ANANNA) and the tail consists of the ACA box located 3 nt before the 3' end [
29,
30]. The snoRNAs exert their functions by base-pairing with their targets and recruit related proteins to the sites of modification [
31]. Box C/D snoRNAs can form 10-21 basepairs (bp) with multiple cellular RNAs. The methylated nucleotide in the target RNA is usually positioned 5 nt upstream of the D or D' box of the snoRNAs, the so called "D/D'+5" rule [
6]. In box H/ACA snoRNAs, two short antisense sequences in one or both of the two hairpin domains form 9-13 basepairs with rRNA sequences that flank the target uridine to be converted to pseudouridine. The pseudouridine is always located 14 to 16 nt upstream from the H box or the ACA box of the snoRNA [
29,
30]. These structural and functional features of box C/D and H/ACA snoRNAs provide the parameters for identifying snoRNAs and their function.
The genomic organization of snoRNA genes displays great diversity in different organisms. In vertebrates, almost all snoRNA genes are located in the introns of host genes, with a few exceptions, such as U3 which are independently transcribed [
4]. In plants and trypanosomatids, snoRNA genes are present in polycistronic clusters which encode both C/D and H/ACA snoRNAs [
9,
17]. A particular genomic organization, the intronic gene cluster, was first found in rice and then in
Drosophila melanogaster [
32,
33]. Moreover, a unique genomic organization (dicistronic tRNA-snoRNA genes) has been characterized exclusively in plants [
34]. The genomic organization of snoRNA genes differs largely in fungi. In the budding yeast
Saccharomyces cerevisiae, apart from seven intronic snoRNA genes, the majority of snoRNA are encoded by independent genes as well as five polycistronic snoRNA gene clusters [
12]. In contrast, most box C/D snoRNA genes from the fission yeast
Schizosaccharomyces pombe are intron-encoded. This raises the question about the genomic organization and expression modes of snoRNA genes in the fungus kingdom. It is well known that multicellular fungi dominate the population of fungi. However, little is known about snoRNAs in multicellular fungi. It was thus of interest to determine the snoRNA genes from a multicellular fungi to shed light on these characteristics.
Neurospora crassa is a filamentous fungus sharing key components with animal cells in cellular physiology and genetics, contributing to the fundamental understanding of the genome defense system, DNA methylation, post-transcriptional gene silencing, cellular differentiation and development [
35]. As a model eukaryote, the genome of
N. crassa has been completely sequenced [
36]. However, only four box C/D snoRNAs, snR39, snR52, snR60, snR61 (Rfam) were annotated in
N. crassa . Recently, we identified three U3 snoRNA genes from
N. crassa; each of them is independently transcribed and contains a small intron [
37](Table ). It is evident that the majority of the
N. crassa snoRNAs remain to be identified. Meanwhile, a comparative genome analysis between yeast and multicellular fungi will provide insights into the evolution of snoRNA genes in the fungus kingdom. In this study, by combining computational and experimental methods, an extensive analysis of snoRNA genes from
N. crassa was performed. Here, we present the first comprehensive list of two major families of snoRNAs from
N. crassa , and further discuss the characteristics and evolutionary significance of the snoRNA genes.
| Table 1Box C/D snoRNAs identified in N. crassa |