The bioinformatic prediction of mRNA targets for sequence-specific RNA binding proteins continues to be a significant challenge. In most cases, biologically relevant motifs are hard to define, in part due to the unknown impact of secondary structure. This is confounded by the fact that in vivo assays to validate predictions are often not trivial. One approach to identify targets is to use genome-wide detection of mRNAs that directly associate with an RNA-binding protein. This approach was used with success
[15] to identify putative Pum-associated mRNAs from ovaries and early embryos. In this study, we have taken a different approach to identify neuronal targets that might underlie Pum's role in memory. We took advantage of: (1) the availability of well characterized structural and functional information about Pum-HD:RNA interactions; (2) several conserved NRE elements that had been described for the
hb and
bcd genes; (3) the availability of a robust in vivo functional assay
[1], and (4) in vivo imaging of one target gene's expression to validate our predictions. We have identified a group of putative neuronal targets of Pum, including
dlg1 and
Ace, both of which are also induced during memory consolidation. In the case of
dlg1, the identified NRE appears capable of functioning both in a heterologous in vivo context of the early embryo and an endogenous one in the adult brain ( and ).
Our results also suggest that the binding specificity of Pum is conserved between
Drosophila and mammals, as previously noted in Wang et al.
[17], which is consistent with the observations that human Pum2 binds to the
Drosophila NRE sequence
[28],
[29]. First, NRE_M10, which is based on assumptions derived from the human Pum-RNA crystal structure, performed best among the three motif models constructed with known Pum targets in flies. Second, a motif derived from mouse PUM2 SELEX data, MmSelex_M8 (“Conservation of Pum binding specificity between fly and mouse” in
Text S1 and
Figure S2), fit well with the
Drosophila Pum binding data from EMSA. Furthermore, this conservation of Pum binding specificity may be extended to non-mammalian vertebrates, as
Xenopus Pum has been shown to bind
Drosophila hb NRE
[18],
[30]. In fact, the RNA-binding domain of
Drosophila Pum is very similar to that in human, mouse and
Xenopus (amino acid identity ≥78%).
The fact that prediction scores of NRE_M10 and MmSelex_M8 are well correlated with in vitro binding data demonstrates the validity of these two models for Pum binding site prediction. The predicted hits by these two models in the synaptic gene set are significantly higher than random (
Figure S3), further demonstrating their validity and also suggesting that a number of synaptic genes are likely regulated by Pum. In the case of
dlg1, our in vivo evidence indicates that the predicted NRE can function, not only in context of the
hb 3′UTR, but also in CNS while Pum is over-expressed.
Comparing our synaptic gene set with the pulled-down targets from Gerber et al.
[15], 27 (18%) genes are in the adult specific target list. Only one gene overlaps with the embryo specific targets, presumably because the embryo specific target list is much smaller. Our predicted Pum targets using NRE_M10 and mmSelex_M8 are significantly enriched with experimentally pulled-down targets (36% and 30%, respectively, see
Figure S5 for more details). Although our NRE models, NRE_M10 and mmSelex_M8 were constructed from a very limited number of training sequences, the motif patterns match closely with the consensus Pum binding site published in Gerber et al.
[31], especially in the 8-nt core motif. These all validate the effectiveness of our method. Of course, further improvement can be made with more high confidence training sequences.
Studies in diverse organisms strongly indicate that sequences around BoxB play a major role in binding to Puf proteins
[15],
[17],
[18],
[31] although BoxA may affect the binding affinity to some extent
[32]. Interestingly, the binding specificities appear to vary among Puf family members even though their RNA-binding domains are highly conserved. For example, Puf3, Puf4 and Puf5 in yeast appear to recognize similar motifs but in different lengths
[31]. A recent finding by Opperman et al.
[33] shed a light on this. It is indicated that small structural difference in the RNA-binding domain may require extra spacer nucleotides in the binding site. This BoxB related motif, hallmarked with UGUA tetranucleotide, may represent the most prevalent binding sites for Pum or even Puf family proteins. However, other types of binding sites may also exist as we will discuss below.
Notably, Pum binds to a 142-nt RNA harboring
CycB TCE with a lower affinity than
hb NRE under our experimental conditions.
CycB TCE was initially proposed due to its resemblance to
bcd and
hb NRE, and was required for translational repression control
[22]. This cis-acting element was able to bind GST-Pum
[5],
[34], but not the purified Pum RBD or native embryonic extracts
[5],
[34]. Indeed,
CycB TCE has a lower score according to our matrix. A new element downstream of TCE has recently been proposed and been shown to bind to Pum in gel mobility shift experiments and, when substituted for the native
hb NRE in a chimeric
hb mRNA, was able to mediate
CycB-like regulation on
hb mRNA
[5],
[34]. Intriguingly, our matrix also predicts a Pum-binding site with high score (
ATTGTGCAAA, nts 561–570 of 3′UTR of
CycB mRNA) in the RNA fragment used in these experiments. Our predicted site is close to the NRE element proposed by Kadyrova and colleagues, but not the same. Further work needs to be done to address this discrepancy. It is also worth mentioning that there are several significant differences between regulation of
CycB mRNA and
hb/
bcd mRNAs
[5],
[34]. In contrast with
bcd and
hb, for example, regulation of
CycB is Brat-independent. Kadyrova et al.
[5] have demonstrated that in the case of
CycB, Pum binding seems important only to recruit Nanos, because artificially tethering Nanos to the 3′UTR bypasses the requirement for Pum binding. This is in contrast to Pum's regulation of
hb. Thus it seems that there are significant differences between the Pum-binding sites in
CycB mRNA and those in
hb and
bcd mRNAs, as proposed previously
[8]. Related to that, in the minimal 51 nt eIF-4E 3′UTR sequence bound by Pum
[8], only one binding site is predicted by NRE_M8 with a score just above the cutoff value 7.5, suggesting the Pum binding to eIF-4E 3′UTR may be also different from
hb and
bcd. Discovery of additional Pum targets from a variety of cell types and biological contexts may uncover the relationship between NRE sequence and regulatory mechanism.
To our knowledge, this is the first study to characterize and predict Pum-binding sites with a PWM approach, which is typically more sensitive and more precise than consensus methods
[21]. Our in vitro binding assay of Pum on a subset of the predicted targets provides a measure of validation of our motif models. Like Pum, two of these targets,
Ace and
dlg1, also appear to be transcriptionally induced after spaced training relative to massed training, suggesting that these are relevant targets for memory formation. We do not know why both a translational repressor and its putative targets are transcriptionally induced. It may be that transcripts are increased on a cell-wide level, while translation is spatially regulated within neurons. In the case of
dlg1, our in vivo evidence supports the conclusion that the predicted NRE can mediate Pum-dependent repression both when it is in the context of the
hb 3′UTR and in the endogenous
dlg1 transcript in the CNS. Thus, our findings directly predict that
dlg1 is a synaptic target of Pum.
Dlg is the sole
Drosophila member of a family of membrane-associated guanylate kinases (MAGUKs) that in mammals have been shown to play a key role in assembling the post-synaptic density in glutamatergic synapses. In
Drosophila, Dlg expression is both pre- and post-synaptic at Type I boutons at the NMJ, and mutants exhibit post-synaptic structural defects as well as increased transmitter release
[35],
[36]. Dlg is thought to play a key role in clustering GluRIIB receptors at the NMJ
[37] as well as Shaker K
+ channels throughout the CNS
[26].
Like Dlg, Pum also appears to have both pre- and post-synaptic effects at the NMJ and is co-localized with Dlg at Type I boutons
[8]. In addition to morphological effects on synapse structure, Pum appears to regulate excitability via an effect on expression of
para Na
+ channels
[9],
[11],
[38]. The regulation of
para may be direct, or may depend upon Pum's putative role in regulating translation of eIF-4E
[8]. Pum expression itself is activity-induced and is induced by behavioral training that results in long-term memory
[9],
[12]. Thus, one reasonable hypothesis is that activity-dependent increases in Pum expression play a homeostatic role by reducing excitability via repression of
para [38].
para is in our list of synaptic genes, yet our models did not predict any Pum binding sites in its 3′UTR. That is not surprising since Mee et al.
[9] reported NRE-like sequence located in its 5′UTR. Therefore, a different mechanism may be involved in the regulation of
para by Pum.
Our findings suggest that an additional role of Pum is direct regulation of dlg1 expression, thereby antagonizing the effects of Dlg on neuronal structure and/or function. We do not yet know whether other classic factors (Nanos and Brat) that cooperate with Pum in early embryos are also required in the translational control of Dlg in neurons. Further investigation also will be required to separate the roles of Pum in neuronal development and memory formation. Ultimate confirmation that Pum-dependent repression of dlg1 and the other predicted NRE-containing genes underlies Pum's role in neuronal structure, function and memory will also require additional examination.