Bacterial pre-rRNA processing is carried out by a defined set of nucleases (
3-
5,
43,
52). Key among this set is RNase III, initially isolated by its ability to bind and cleave duplex RNA (
47,
48). RNase III generates the immediate precursors to the mature 16S and 23S rRNAs from the primary transcripts by cleaving within two extended RNA duplexes formed by long-range interactions that pair the termini of each rRNA (
7,
63). These long-range interactions provide a simple method of coordinating the processing events at both ends of the transcript. In eukaryotes, pre-rRNA processing is more complex and requires many more small nucleolar RNAs (snoRNAs) and protein components with overlapping functions (
13,
15,
16,
41,
46). For example, the removal of the 5′ external-transcribed spacer requires 4 snoRNAs (U3, snR30, U14, and snR10) and about 64 snoRNAs are required for rRNA modifications (
24,
57). snoRNAs are divided into two major subclasses: the first includes box C/D snoRNAs that function mostly as a guide for the methylation of rRNA (
6,
20,
21,
55), while the second includes H/ACA snoRNAs that guide RNA pseudouridine formation (
25,
39,
59). Most mammalian snoRNAs are encoded within intron sequences and are processed from either unspliced precursors or lariat species (
18,
19,
64). In
Saccharomyces cerevisiae, most snoRNAs are transcribed either as independent units or as a part of polycistronic transcript, while only 7 of the 66 known snoRNAs are located in the introns of mRNAs (
14,
44,
53). Several polycistronic snoRNAs, and a few monocistronic ones, are processed by Rnt1p, the orthologue of the bacterial RNase III (
30), which is also required for the processing of the pre-rRNA's 3′ end (
2,
10,
11,
23,
33). Following processing by Rnt1p, the RNAs are trimmed by exonucleases producing the mature ends (
22,
56).
Unlike other RNase IIIs, Rnt1p recognizes substrates with conserved stem-loop structures and has a low affinity for generic RNA duplexes (
27). Most Rnt1p substrates exhibit a conserved AGNN tetraloop structure (
9,
28,
32,
61). Rnt1p cleaves at a fixed distance from the conserved loop, generating a product with staggered ends (
28). Mutations (
28), chemical protection assays (
27), chemical interference (
9), and nuclear magnetic resonance analysis (
28) indicate that Rnt1p binding and cleavage are regulated by reactivity epitopes grouped into three boxes (see Fig. ). These are the initial binding and positioning box (IBPB), located at the tetraloop; the binding stability box (BSB), located adjacent to the tetraloop; and the cleavage efficiency box (CEB), located near the cleavage site. Alteration of the sequences of both the IBPB and the BSB inhibits cleavage and reduces binding, while alteration of the CEB sequence inhibits cleavage without affecting the binding efficiency. Thus, despite the lack of universally conserved residues, the nucleotide composition of the reactivity epitopes contributes to substrate selectivity. The second nucleotide of the IBPB is believed to be universally conserved, and changing it to any nucleotide other than G reduces binding to known substrates and blocks cleavage (
9,
27,
28,
38). The solution structure of the Rnt1p/substrate complex indicates that the enzyme interacts with the minor groove adjacent to the 3′ end of the tetraloop, suggesting that substrate recognition depends on the shape of the groove (
60). However, accurate identification of the universal features of Rnt1p substrates requires the identification of a large set of substrates that allows statistical analysis of the cleavage signals.
In this study, we searched for new Rnt1p substrates by examining the expression profiles of all known snoRNAs in the presence and absence of Rnt1p. In parallel, we developed a program that identifies potential Rnt1p cleavage signals near known snoRNA sequences. All newly identified substrates were tested for cleavage in vitro, and their contributions to snoRNA processing were verified in vivo. Our combined in silico and in vitro approach identified all known substrates of Rnt1p and revealed seven new snoRNA substrates. In general, monitoring the expression of snoRNAs was most effective when Rnt1p cleavage was not redundant with other processing events that could lead to the maturation of the snoRNA in question. In contrast, the in silico screen was most effective in identifying snoRNAs that harbored conserved processing signals, regardless of their processing pathway in vivo.