|Home | About | Journals | Submit | Contact Us | Français|
Genuine La proteins contain two RNA binding motifs, a La motif (LAM) followed by a RNA recognition motif (RRM), arranged in a unique way to bind RNA. These proteins interact with an extensive variety of cellular RNAs and exhibit activities in two broad categories: i) to promote the metabolism of nascent pol III transcripts, including precursor-tRNAs, by binding to their common, UUU-3’OH containing ends, and ii) to modulate the translation of certain mRNAs involving an unknown binding mechanism. Characterization of several La-RNA crystal structures as well as biochemical studies reveal insight into their unique two-motif domain architecture and how the LAM recognizes UUU-3’OH while the RRM binds other parts of a pre-tRNA. Recent studies of members of distinct families of conserved La-related proteins (LARPs) indicate that some of these harbor activity related to genuine La proteins, suggesting that their UUU-3’OH binding mode has been appropriated for the assembly and regulation of a specific snRNP (e.g., 7SK snRNA assembly by hLARP7/PIP7S). Analyses of other LARP family members (i.e., hLARP4, hLARP6) suggest more diverged RNA binding modes and specialization for cytoplasmic mRNA-related functions. Thus it appears that while genuine La proteins exhibit broad general involvement in both snRNA-related and mRNA-related functions, different LARP families may have evolved specialized activities in either snRNA or mRNA related functions. In this review, we summarize recent progress that has led to greater understanding of the structure and function of La proteins and their roles in tRNA processing and RNP assembly dynamics, as well as progress on the different LARPs.
La proteins are factors found in nearly all eukaryotes examined  and have important functions in RNA metabolism. Human La (hLa, also called Sjogren’s Syndrome Antigen B, or SS-B) was first identified as an autoantigen in patients suffering from autoimmune disorders such as systemic lupus erythematosus, Sjogren’s syndrome and neonatal lupus [2, 3]. Although it remains unclear why La is targeted as an autoantigen in some persons, its phosphorylation status, subcellular localization and susceptibility to proteases in the context of apoptosis are believed to contribute to its antigenicity in the susceptible population [4–7]. La is an essential protein in all examined metazoans, including Drosophila, mice, as well as in Trypanosoma and Arabidopsis [8–11], but can be deleted in budding and fission yeast [12, 13], in which much of the work describing fundamental La function has been done. Clues to its cellular functions came from early experiments indicating that La preferentially associates with RNAs ending in stretches of uridylate residues, as are found at the ends of all nascent RNA polymerase III (pol III) transcripts due to their d(T)n transcription termination signals [14–16].
For the largest class of pol III transcripts, the precursor tRNAs, the uridylate-containing 3’ ends are removed during processing, and consistent with the rapid turnover of most La ribonucleoproteins (RNPs) , La occupancy is not observed on the mature RNAs. Pol III transcripts bound by La include pre-tRNAs, pre-5S rRNA, pre-U6 snRNA, the nascent forms of 7SL SRP RNA, 7SK RNA, Y RNAs, Alu, B1, B2 and some other cellular RNAs [13, 18–26] as well as the pol III transcribed viral RNAs EBER I & II and adenoviral VA RNAs [27–29]. Its presence in a human pol III holoenzyme, as well as chromatin immunoprecipitation with pol III transcribed genes in yeast and human, suggests that La is involved in pol III activity and is positioned to be the first factor to bind nascent pol III transcripts [30–32]. In addition to Pol III transcripts, La has also been found to influence the metabolism of short-lived intermediates of the Pol II transcribed U1, U2, U3, U4 and U5 snRNAs, in yeast [33–35], and associate with a U1 precursor in human cells . Close examination revealed that these intermediates also end in UU-3’OH as a result of post-transcriptional processing [37–39]. For RNA targets terminating in UU-3’OH, the simplest and most fundamental role of La proteins is to bind and promote their maturation, in large part by protecting them from 3’ exonucleolytic degradation or excessive trimming. As will be discussed in upcoming sections, more complex functions have also been described for human and yeast La proteins, both for nascent snRNAs, and for other RNAs.
While function in pre-tRNA processing has been well documented and described mechanistically, La has also been strongly implicated in the translation and/or metabolism of many UUU-3’OH-lacking cellular and viral mRNAs, apparently involving a broad range of activities [40–58]. While understanding the roles of La-mediated activities in mRNA metabolism is an important subject, the diversity of data suggests that it may function differently for different mRNAs, and a review of each would be beyond the scope of this review. However, we believe it is worthwhile to note that the mRNAs whose translation is modulated by La protein contain atypical translational initiation contexts, including viral and cellular mRNAs with internal ribosome entry sites (IRESs) , 5’ terminal oligopyrimidine tracts (5’TOPs) [51, 60, 61] or upstream open reading frames (uORFs) . While many positive effects of La on translation have been reported, its effects can also be negative, and in some cases the outcome is dependent on the conditions of the experiment and the presence of poly-A binding protein (PABP) [51, 62–64]. In general, most mRNAs for which La positively effects translation have lengthy and complex 5’ UTRs (see refs above), whereas studies of specific mRNAs whose outcomes revealed a negative effect on translation were on mRNAs with short presumably nonstructured 5’ UTRs such as the very short 5' UTR containing 5'TOP mRNAs [51, 62–64]. In this regard we note evidence that suggests that the nonphosphorylated (cytoplasmic) isoform of the human La protein can recognize the 5’ GpppN cap (and related structures) present on cellular mRNAs . This leads us to speculate as to how both the negative and positive influences on translation may be effected by this cryptic cap-binding activity of La. Binding to the caps of 5' TOP mRNAs by nonphospho-La may inhibit cap-mediated translation initiation. The same activity on complex, e.g., IRES mRNAs can conceivably shift the balance toward internal initiation. This would not necessarily exclude a more direct role for La at the IRES element (HCV mRNA) itself or just upstream of the AUG, as for MDM2 mRNA .
La proteins have a conserved architecture, with an N-terminal La domain consisting of a La motif (LAM; amino acids 6–100, hLa numbering ) and an RNA recognition motif (RRM1; 106–195) separated by a short linker (Fig. 1A). Since the original description of the La motif in genuine La proteins, it has also been found in a number of other La-related proteins (LARPs) that segregate into distinct families conserved throughout evolution. Four LARP families have been discerned in addition to La, LARPs 1, 4, 6 and 7. Although earlier gene annotation also noted LARPs 2 and 5, further comparative analyses argue that these should be referred to as LARPs 1b and 4b respectively since they are highly sequence related to their main families .
As will be discussed later in the review, it is hypothesized that the LARP families have evolved distinct and conserved functions compared to genuine La proteins, despite their retention of the La motif critical for classical La function, as described below. Structural studies of the LAM and RRM from human La revealed that the LAM harbors a winged-helix type fold [66–68], common in DNA binding transcription factors and some RNA-binding proteins [69–71], while RRM1 adopts a classic β1α1β2β3α2β4 topology similar to that found in other members of the abundant class of RRM proteins . Since earlier studies established that both the LAM and RRM1 were required for recognition of UUU-3’OH by La [73, 74], it was expected that their typical nucleic acid interaction surfaces would be involved in La-RNA target interactions . On the contrary, a co-crystal structure containing the La domain from human La (hLa) and a short RNA ending in UUU-3’OH revealed that this mode of binding surprisingly occurs through contacts other than to the expected surfaces (described in more detail in  and below), but nonetheless confirming the requirement for both motifs for RNA binding while leaving the function of the expected nucleic acid interaction surfaces unclear. This mode of UUU-3’OH recognition was confirmed and expanded upon by additional cocrystal structures [76, 77].
Although many RNA binding proteins utilize more than one RNA binding motif for target RNA binding, the way that La does this is more complex than usual. For example, while poly-A binding protein and sex lethal each use the typical RNA binding surfaces of adjacent RRMs to contact a contiguous stretch of RNA, La uses two spatially separate regions of its RRM1 to form parts of two RNA binding surfaces that contact different regions of an RNA. Thus, for La, a single RRM1 contributes to two separate RNA binding surfaces, one of which is also formed in part by the juxtaposed LAM (Fig. 1B). As will be discussed below, arrangement of the LAM and RRM1 in hLa supports formation of the two RNA binding surfaces, mediated in part by contacts to the RNA segment that bridges the RRM1 and LAM, and in part by two additional direct contacts between the LAM and the RRM1.
The best-characterized substrates for La binding are pre-tRNAs, and La binding to these and other pol III transcripts is dependent of the length of the 3' terminal oligo(U) tract. Human La protein binds best to RNAs that contain 3 or more terminal Us [14–16], whereas a direct comparison indicates that S. pombe La clearly requires 4 or more terminal Us for efficient binding . As the S. pombe pol III requires at least 5T residues as a minimal terminator whereas human pol III requires only 4 , these co-variations suggest that binding of nascent pol III RNAs by La protein is functionally linked to the pol III termination system. Moreover, transcription termination by pol III on a particular gene produces a heterogeneous set of transcripts that differ in the length of their 3' oligo-U tracts. Typically, the most abundant RNA species end with one less rU residue than the minimal number of dT residues required for termination, and as alluded to above, it is this nascent RNA species that can be efficiently engaged by La protein. However a substantial fraction of the nascent pol III transcripts from the same gene ends with fewer U residues than required for efficient La binding [15, 78]. By following the phenotype of a functional suppressor-tRNA in S. pombe, Huang et al. observed that this 3' end heterogeneity leads nascent transcripts that differ in their La binding to two different posttranscriptional tRNA maturation pathways . Moreover, the extent to which the pol III subunit, Rpc11p, executes its 3' exonuclease activity during the transcription termination process can be a determinant of terminal oligo-U length and therefore the degree to which the nascent transcripts are engaged by La  (Fig. 2). One consequence of this heterogeneity is that the La-associated pre-tRNAs and the La-independent pre-tRNAs are differentially sensitive to the 3'-5' exosome-TRAMP nuclear surveillance system [80, 81] (see below).
Studies in budding and fission yeast indicate that in the absence of La most pre-tRNAs follow a variant tRNA processing pathway (Fig. 2). During pre-tRNA processing, La binds and protects the 3’ ends of pre-tRNAs from at least two exonucleases, the 3’ exonuclease Rex1p (pathway 1 in Fig. 2) and for defective pre-tRNAs, the nuclear exosome-Rrp6 (pathway 4 in Fig. 2) [80, 81]. After processing of the 5’ leader by RNAse P, the 3' end-protected trailer is processed by the endonuclease RNAse Z (pathways 1 & 3 in Fig. 2) precisely after the discrimator base, the first unpaired nucleotide following the acceptor stem [82–86], leading to dissociation of La from the tRNA and addition of CCA to the new 3' end . In the absence of La, and for those pre-tRNAs that do not engage La even when it is present  (pathway 2 in Fig. 2), the order of pre-tRNA processing may be altered such that exonucleolytic trimming of 3’ trailers precedes 5’ leader cleavage by RNAse P [13, 18, 89].
Yet, precisely how La and other factors contribute to the 3’ end metabolism of all pre-tRNAs in the cell is not fully clear. For example, as noted above although La is required for normal 3’ endonucleolytic processing of many pre-tRNAs in S. cerevisiae , some pre-tRNAs do not use the La-dependent pathway even when it is present . Rex1p [81, 90] is a 3' riboexonuclease that functionally processes pre-tRNAs in the absence of La and presumably also those pre-tRNAs that undergo 3' exonucleolytic processing even when La is present. For those pre-tRNAs that do use it, La appears to promote endonucleolytic processing by blocking access of the 3’ ends to the processing exonucleases, i.e., Rex1p [81, 90]. Accordingly, since La is nonessential in yeast, and in its absence, pre-tRNAs gain access to Rex1p for productive processing, it might have been expected that in cells lacking La, the 3’ endonuclease, RNAse Z, would be nonessential. Intriguingly, this is not the case in fission yeast, S. pombe . It remains to be determined if this is also true in S. cerevisiae and also if it reflects the possibility that RNAse Z may be required for an essential yeast function other than pre-tRNA processing..
It has been shown that La deletion is synthetically lethal with yeast mutations that disrupt the secondary structure of essential tRNAs [18, 92]. This leads to susceptibility of the structurally-impaired pre-tRNA to degradation, in this case, by a second class of 3’ exonuclease distinct from Rex1p, the surveillance activities of the TRAMP polyadenylation complex and the nuclear exosome which degrades aberrant RNAs [93, 94]. Furthermore, ectopic La can complement the deletion of tRNA modification enzymes that catalyze addition of modifications that presumably stabilize tRNA structure [93, 95, 96]. The absence of proper modification can sensitize some pre-tRNAs to degradation by the surveillance exosome, which can be averted by the 3’ end binding activity of La . In the case of pre-tRNAiMet, failure to be methylated on A58 by the tRNA methyltransferase TRM6-TRM61, leads to nuclear degradation of the hypomodified precursor tRNA; remarkably however, overexpression of La can avert this degradation, permitting the hypomethylated but otherwise mature tRNAiMet to persist and be exported to the cytoplasm where it apparently functions in translation . This latter observation suggests that a critical function of methylation of A58 of tRNAiMet is to avoid decay of the pre-tRNAiMet in the nucleus.
Some functions of La in pre-tRNA processing reflect a La activity beyond simple 3’end protection . Several lines of evidence indicate that La exhibits a complex function related to RNA structure/folding. In addition to the capacity of La to complement the loss of tRNA modifications thought to stabilize tRNA structure, La mutants have been described which harbor wild-type 3’ end binding and protection activity but are still incapable of rescuing the maturation of certain structurally impaired pre-tRNAs, even when Rrp6, a 3' exonuclease component of the nuclear surveillance exosome, is deleted . These results are consistent with functions for La in pre-tRNA processing that are more complex than simple UUU-3’OH binding and protection from the 3’ surveillance exonucleases . Indeed, the 3' end binding activity and the second, more complex activity which allows La to function in the rescue of structurally-impaired pre-tRNAs, have been localized to the two different RNA binding surfaces of the LAM and RRM1 of the human and S. pombe La proteins [80, 87].
Attempts to reconcile simple vs. complex functions in RNA metabolism have become a critical theme in the study of the mechanisms that control La function: simple functions related to UUU-3’OH mediated RNA binding and 3’ end protection, vs. complex La functions that involve enhancing the folding and/or stability of various RNA targets via poorly understood binding mode(s) and mechanisms. Although this has begun to be addressed in the context of pre-tRNA processing [80, 87], understanding precisely how the coordinated use of the separate RNA binding surfaces of the LAM and RRM1 contribute to other activities such as in mRNA-related functions, during RNA chaperone activity, and for La binding to its more stably associated pol III transcripts such as cellular Y RNAs, as well as viral encoded EBER and VA1 RNAs remains a challenge.
As described in detail below, recent crystallographic and biochemical studies have advanced understanding of the variations of La-RNA binding modes and associated functions significantly. Another advance related to these outstanding issues involves the increasingly studied LARPs [21, 97–105]. Just as genuine La proteins combine various mechanisms of RNA binding to provide complex functions, it would appear that an emerging theme in LARP function might also involve variations on similar modes to provide distinct functions on their own sets of RNA targets. The diversity of function of La proteins and LARPs, and their expected shared and divergent manners of RNA recognition, will be the topic of the remainder of this review article.
The C-terminal domain of La proteins is more divergent, and increases significantly in size from yeast to higher organisms (Fig. 1A). In higher eukaryotes, a second RRM of unknown function (RRM2 or RRMc) is often found following RRM1, although whether this RRM in human La binds RNA is questionable since it has chemical features unusual for a RRM and its β-sheet surface is obscured by an α-helix that constitutes a nuclear retention element .
La shuttles between the nucleus and cytoplasm , and the C-terminal domain contains several determinants important for La intracellular trafficking, including a nuclear localization signal (NLS) found at the end of all La proteins examined except S. cerevisiae in which it is located more centrally in the protein [108, 109], and a nuclear retention element [110, 111] which masks a conserved nuclear export element in RRM1  (Fig. 1A). Disruption of these trafficking elements causes mislocalization of La and malfunction in tRNA maturation, and in some cases leads to disorder of some of the pre-tRNA processing steps [110, 112].
In addition to direct effects on tRNA processing, the intracellular trafficking of La is an important determinant of the types of RNA substrates it encounters for binding . This is noteworthy since several conditions have been described including viral infection that lead to a redistribution of La to the cytoplasm where it is presumably used for viral replication  (more extensively reviewed in ).
Another motif in the hLa CTD has been implicated in RNA binding. In higher organisms, a highly charged short-basic motif (SBM) of about forty amino acids that includes a Walker A like motif immediately following the NRE appears to be a RNA binding element that increases the overall affinity and stability of La-RNA interactions whose binding activity can be inhibited by phosphorylation at ser-366. The unphosphorylated SBM inhibits 5’-processing of pre-tRNAs by RNAse P presumably by stabilizing and/or reorganizing pre-tRNA binding to La [73, 89, 114]. This inhibition of 5' processing is relieved by phosphorylation of S366 by protein kinase CK2 , a pathway that is faithfully executed upon hLa expression in yeast .
Interestingly, phospho-specific antibodies have localized S366-phosphorylated hLa to the nucleoplasm, the locale of the immature pol III transcripts that comprise the primary binding targets of La, while non-phosphorylated hLa is concentrated in the nucleolus and also accumulates in the cytoplasm where it is found associated with certain transcripts including the 5' terminal oligopyrimidine (5'TOP) mRNAs [60, 115]. Thus, the highest affinity ligands for La are UUU-3'OH bearing pre-tRNAs (Kd 2–10 nM) and other snRNAs which are most abundant in the nucleus, and their processing is facilitated by phospho-La. Although the higher-affinity RNA binding isoform, nonphospho-La resides in the cytoplasm (and nucleolus), there are far fewer UUU-3'OH bearing RNAs in this compartment and nonphospho-La uses its higher affinity binding mode to associate with its next best high affinity ligands, the 5' terminal oligopyrimidine (5'TOP) mRNAs . Accordingly, the relative concentrations of hLa S366 isoforms in different subcellular compartments, in conjunction with the relative concentrations of specific competing RNA ligands in these compartments, would appear to determine the differential association of nonphospho-La and phospho-La with their respective classes of different RNAs . While phosphorylation of S366 has been described only for hLa, other compensatory modifications could confer comparable discriminatory functions in other species .
Several other phosphorylation sites have also been mapped to La  and one of these, T302, promotes nuclear export and polysome association of mouse La in response to the mitogen activated kinase, Akt . Retrograde transport of La in sensory neuron axons is controlled by sumoylation of residue K41 .
Yeast La proteins are also phosphorylated at multiple sites including a site for the master control kinase, Sch9, dependent on TAP42 and in response to rapamycin, although no connection to RNA metabolism has yet been established [119, 120]. Since no single phosphorylation site appears to have been conserved across widely diverged species (unpublished observation), it seems that different organisms use La phosphorylation to different degrees and this may coincide with species-specific recruitment of different La kinases, as observed for mouse and human La .
The importance of C-terminal determinants in RNA target discrimination has also been revealed by a study of the LARP7 family member hLARP7/PIP7S ( described in detail in a section below). While La and hLARP7/PIP7S have highly homologous N-terminal domains, especially in the residues implicated in UUU-3’OH recognition (later section), hLARP7/PIP7S effectively discriminates against pre-tRNAs to preferentially bind its natural RNA target, the UUU-3’OH containing pol III transcript, 7SK snRNA . This is reflected in the lack of pre-tRNAs as compared to 7SK snRNA bound to PIPS7 in human cells as monitored by immunoprecipitation  (R.Y. & R.M., unpublished). We note that differences in the precise subcellular localization of the La and LARP7 proteins within the nucleus may contribute to targeting their different ligands toward this discrimination although this remains to be determined. Nonetheless, while full-length hLARP7/PIP7S is incapable of functioning in a pre-tRNA-mediated suppression assay in S. pombe, as compared to wild-type S. pombe or human La which are highly active, deletion of the C-terminus of PIP7S, however, activates it for tRNA–mediated suppression. This reflects significant RNA 3' end protection activity by the hLARP7 La domain, and is consistent with its C-terminal region playing an important function in preventing PIP7S from interacting with pre-tRNAs . Given the divergence of C-terminal domains between genuine La proteins and the LARPs, it is likely that this region in these factors play important roles in RNA target discrimination.
The recent publication of multiple high-resolution La-domain/UUU-3’OH co-crystal structures has greatly enhanced our understanding of this mode of RNA recognition [76, 77]. Although the structure of a LARP-RNA interaction has yet to be solved, it is tempting to speculate that the conserved juxtaposition of the LAM and RRM, as well as high conservation by the LARPs of residues important in UUU-3’OH recognition by La, may reflect similarities in RNA binding modes, perhaps most especially for LARP7 and the p65- and p43 LARPs since these show the highest homology to the La proteins , and appear to exhibit UUU-3’OH recognition.
The greatest degree of homology between these various LARP classes of proteins occurs in the LAM, reminiscent of the high degree of homology in this motif in genuine La proteins across species (e.g., Fig. 3A). The RRM found adjacent to the LAM in all La proteins is also present in a high proportion of LARPs. RRMs in many proteins contain aromatic residues in their RNP-1 and RNP-2 motifs (short sequence tracts on the β-3 and β-1 strands respectively of the RRM β-sheet surface) that stack on RNA target bases during canonical RNA binding [72, 121]. As alluded to above, although the canonical RRM β-sheet surface is not used by La for UUU-3'OH binding, it is used for an additional independent mode of RNA binding . For hLa, the LAM and RRM1 bind different regions of a pre-tRNA . The same may be true for LARPs, although it is also plausible that the LAM and RRM of LARPs may be arranged to form a contiguous RNA binding site. However, it is entirely conceivable that the paucity of intermotif contacts between the LAM and RRM1 of La protein, and the apparent reliance of the orientation of these motifs on the UUU-3’OH RNA itself, may allow arrangements of the LAM and RRM1 that are significantly different from that observed in the hLa-UUU-3’OH structures. Moreover, the observation that the RNA ligand mediates much of the LAM-RRM1 interface in the hLa structures suggests that different RNA targets may program different orientations of the LAM and RRM even for La or a LARP.
Notable is the conservation of RRM1 RNP-1/2 motifs among genuine La proteins and some LARPs despite the lack of RNP-RNA contacts in the La-UUU-3’OH co-crystals. In any case, evidence suggests that the RRM may be used to different degrees by different LARPs since point-mutation of hLa F155 (RNP-1) decreases hLa binding to the respiratory syncytial virus leader RNA  but not pre-tRNA , and mutation of LARP7 Y127 (RNP-2) has a significant effect on LARP7 binding to 7SK RNA (see below,). While some LARPs contain RRMs of sufficient homology to be recognized computationally as such, others lack the aromatic residues in the RRM RNP-1 and/or RNP-2 [72, 121] even though they are predicted to adopt the β1α1β2β3α2β4 RRM fold . It might therefore be expected that La proteins and LARPs might utilize residues other than, or in addition to, those in the RNP-1/2 motifs for RRM-mediated RNA recognition.
Recent work  suggests that during pre-tRNA processing, the loop connecting the β2 and β3 strands of hLa RRM1 (loop-3) (Fig. 1B) makes UUU-3’OH independent contacts to the main body of tRNA using an RNA binding mode distinct from the typically observed RNP-mediated RRM mechanism, in that for La binding to pre-tRNA, loop-3 appears to confer substantially more affinity than the aromatic residues in RNP-1 and −2. This is not unprecedented, as many RRMs depend on the loops between their β-strands and α-helices for RNA recognition, sometimes exclusively . This alternative binding mode leaves open the possibility that the RRM-like motifs in the more divergent LARPs might still engage RNA more similarly to La than might otherwise be anticipated. The loop-3 dependent mechanism has been hypothesized to play an important function in tRNA precursor-product discrimination by La before and after endonucleolytic pre-tRNA 3’ trailer processing: specifically, La preferentially binds intact pre-tRNAs using tRNA contacts to RRM1 and UUU-3’OH contacts to the LAM, compared to single site lower affinity binding to the tRNA and trailer after 3’ endonucleolytic processing. The resulting lower affinity for the products of pre-tRNA 3’ processing would allow La to preferentially dissociate from these and recycle onto new tRNA precursors . Consistent with this RRM1 binding mode playing important functions in La activities that are more complex than simple 3’ trailer binding and protection, residues in loop-3 are also important for a characterized RNA chaperone activity of hLa [87, 123].
Sequence conservation of the RRM1 loop-3 is very high within the genuine La protein family and within individual LARP families, yet is the most highly divergent portion of RRM1 across LARP families and between these and La proteins [1, 87]. RRM1 loop-3 is therefore an attractive candidate for a determinant of RNA specificity and functional divergence across La and LARP families, in addition to that conferred by the LAM (see later section). An appealing model for the importance of a UUU-3’OH independent binding mode in LARP function relates to the interaction of hLARP7/PIP7S to its 7SK snRNA target. La binds stably to pre-tRNA as a result of combined use of two separate binding surfaces. Contrary to the dynamic turnover of pre-tRNA-La RNPs that occurs rapidly during tRNA processing, the hLARP7/PIP7S-7SK snRNA interaction, which would appear to be more stable, may be so because 7SK snRNA undergoes little if any RNA 3' oligo(U) end cleavage/modification as part of its posttranscriptional processing. It is tempting to postulate that the LARP7–7SK snRNA interaction is analogous to the stable interaction of La with the pre-tRNA prior to 3' trailer cleavage; because 7SK RNA does not undergo 3' processing, the UUU-3’OH dependent and UUU-3’OH independent binding modes would be expected to retain their additive effect resulting in a long-lived hLARP7/PIP7S-7SK snRNA interaction. However, as noted above, the C-terminal domain of hLARP7/PIP7S also contributes to 7SK binding. Nevertheless, the remodeling of the 7SK snRNP that is associated with physiological responses to regulate P-TEFb activity [21, 103, 104] likely uses additional mechanisms distinct from those observed for La-pre-tRNA RNPs.
Binding of the LARP7 family members p43 and p65 to their UUU-3’OH-containing telomerase RNAs, described in more detail in a later section, may employ a similar additive use of their LAMs and RRM1s, in addition to their C-terminal regions ().
During UUU-3’OH recognition by La, most contacts between the terminal uridylates and the La domain are made to highly conserved residues on the LAM (Fig. 3A, B). Of particular interest are Q20 (hLa numbering), which makes uridylate-specific contacts to the penultimate base (U-2) of the RNA; F35 and F55, which stack on the base and sugar of the last uridine (U-1), respectively; Y23, which in addition to making the only direct hydrogen bond protein-protein contact between the LAM and RRM1 (to N139), stacks on U-2; and D33, which by making bidentate contacts to the 2’-OH and 3’-OH groups of the terminal sugar residue, allows La to discriminate between DNA ends and RNA ends, and to discriminate between 3’OH-containing transcripts and 3’ phosphate-containing RNAs which result from cleavage by some ribonucleases. The only UUU-3'OH contact to RRM1 is an uridylate specific contact between the O4 of U-2 and the peptide backbone of I140. As diagrammed, the conservation of residues implicated in RNA binding varies across representative LARPs from different families, with the LARP7 family (including Tth p65 and Eae p43) having the greatest homology and LARP4 and 4b having the least (Fig. 3A). With the high conservation of these residues by La and the LARP7 family members hLARP7/PIP7S, p65 and p43, it would seem likely that LARP7 members engage their pol III synthesized targets (telomerase RNA is a pol III transcript in ciliates) via the same UUU-3’OH mediated mechanism.
One unanticipated result of the assignment of residues important for La binding to UUU-3’OH was the surprising lack of apparent RNA sequence specificity . These studies revealed an exact sequence requirement only for the penultimate (U-2) uridylate of the UUU’3-OH motif, with mutations in the RNA at the terminal (U-1) or other 3’ uridylates accompanied by only moderate drops in affinity. Conversely, mutation of individual La residues revealed by the structure as important for RNA binding resulted in significant to catastrophic decreases in affinity . The data are consistent with these La residues fulfilling important functions in RNA binding without necessitating a strict requirement for RNA sequence identity. Indeed a high degree of the amino acid side chains in the UUU-3’OH binding pocket provide aromatic groups that stack with the RNA. Consistent with this high degree of apparent flexibility, more recent structures that capture interactions between the La domain and other UUU-3’OH containing ligands have revealed a significant degree of plasticity in binding . These structures compared the nature of La-UUU-3’OH binding to several small RNA targets varying in length, composition and the number of terminal uridylates. It was found that the orientations of the RNAs ending in three versus four uridylates were qualitatively distinct: even though the identity of amino acids and functional groups involved in the uracil specific contacts were maintained, the identity of which uracil (i.e., U-4 vs. U-3) took part in these contacts varied. These studies also showed divergence in the path taken by the 5’ ends of the RNAs upstream of the conserved uridylate contacts, especially as compared to the original crystal structure  (for direct comparison see Supplementary Fig. 8 in .
These data raise the possibility that the potential to use a variety of RNA-binding mechanisms may be an inherent feature of the presumed dynamic nature of the LAM-RRM1 arrangement to accommodate various RNA targets. Consistent with this, the high-resolution co-crystal structures reveal a surprising paucity of contacts restricting the spatial orientation of the RRM1 with respect to the bound RNA and the LAM. As noted above, these consist of a hydrogen bond between the bound U-2 RNA and RRM1 residue I140, a salt bridge between LAM residue R57 and RRM1 residue D125, and a hydrogen bond between the hydroxyl of Y23 of the LAM and RRM1 residue N139. Strikingly, even this Y23-N139 intermotif protein-protein contact is poorly conserved in genuine La proteins (Fig. 3A), as Y23 is replaced by phenylalanine in yeast La, a residue which could still stack on U-2 of the bound RNA, but with the loss of the tyrosine-specific hydroxyl functional group would lose its ability to contact RRM1 by this mechanism. It is intriguing that this replacement is associated with a requirement by Sla1p for more terminal uridylates than human La for optimal binding . This raises the possibility that the additional U residue recognized by Sla1p  mediates an additional LAM-RRM1 interface interaction, i.e., similar to the role of U-2, in the absence of Y23.
Consistent with a lack of spatial restriction between the LAM and RRM1, NMR studies have found that these motifs, although tethered to each other by the linker, tumble independently of one another in the absence of RNA . Taken together, the information from several structural studies are consistent with common themes that include binding site plasticity as a means to accommodate a variety of RNA ligands.
The modality of La-RNA binding as illustrated by differences in the existing crystal structures may provide insight into the expected divergence of RNA target recognition by the LARPs. The significant but non-absolute conservation of residues important for UUU-3’OH recognition in some of the LARPs might mean these same residues also play important functions in RNA binding, without necessitating a concomitant conservation of RNA target identity. At one end of this proposed spectrum is the highly homologous LARP7 family which also binds to RNAs terminating in UUU-3’OH, such as hLARP7/PIP7S binding to the 7SK snRNA and p43 and p65 binding the ciliate pol III transcribed telomerase RNAs. As discussed previously, hLARP7 also depends on specificity determinants in its C-terminal sequences to bind 7SK and prevent non-productive interactions with RNAs normally bound by genuine La. At the other end of the spectrum is LARPs 4 (& 4b) which have significantly less conservation in the LAM, and exhibit RNA sequence binding characteristics distinct from hLa (R. M. and S. Gaidamakov, unpublished observations), and one of these, LARP4, appears to function in mRNA metabolism (R.Y and R.M, submitted). These observations are consistent with a model in which the various LARP families have evolved specialized roles involving the UUU-3’OH dependent or UUU-3’OH independent functions of La proteins. It will be interesting to evaluate how the variations in amino acids important in La-UUU-3’OH recognition correlate with target discrimination by the LARPs. Determination of which binding modes provide substrate specificity across LARP families should prove to be of primary importance for this emerging field.
While the function of La in the metabolism of nascent pol III transcripts has been extensively studied, the study of LARPs is still in its infancy. LARPs can be broadly defined as factors containing a high degree of homology with genuine La proteins in the LAM-RRM arrangement of their La domain, but have taken on independent functions from those associated with La. A recent phylogenetic analysis of LAM containing proteins reveals that these are found in all eukaryotes, with the exception of some protists . A subset of these includes factors that contain, like genuine La proteins, an RRM or RRM-like motif (RRM-L) immediately following the LAM (i.e., containing a La domain), and these factors segregate into conserved families across evolution. One significant facet of this analysis was the discovery that La proteins and LARPs are totally absent from the archaea, suggesting these proteins evolved early after the archaeal/eukaryal radiation. While investigating the function of these factors is still in the initial stages, an emerging theme appears to be the apparent appropriation of La-like mechanisms for RNA binding interactions. As noted above this is most apparent for the LARP7 family members.
A previously underappreciated element of LARP conservation is the predominant co-evolution of an RRM or RRM-L immediately following the LAM. Early study of LARP function focused on the budding yeast proteins Sro9p and Slr1p, which unlike genuine La contain a LAM but no adjacent RRM [98–100]. The more recent phylogenetic analysis of LARPs indicates that this RRM-less architecture of LAM-containing proteins is the exception rather than the rule; out of 134 examined LAM containing sequences, 113 contain an RRM (or RRM-L) immediately following the LAM, similar to what is always found for genuine La proteins . With respect to RRM1 conservation, according to Bousquet-Antonelli and Deragon, only LARP7 contains a canonical RRM1 as is found in genuine La proteins, while the other LARP families contain variations of RRM-L . It is noteworthy that the degree of divergence from the canonical RRM varies for the different LARPs. For example, we found that while the RRM of LARP4 was routinely predicted as such by the NCBI BLAST server, the RRMs of LARPs 1 and 6 were not (unpublished observations) but were found using special methods . While the preponderance of RRM or RRM-L motifs in LARPs is high, their sequence conservation varies such that they segregate LARPs into distinct groups, previously noted by a numbered classification of RRM-L motifs (Figure 1) . Sequence alignment of these indicate high similarity within families but significant divergence across them, suggesting that these LARP families likely have distinct functions that are broadly conserved across species, some of which have been investigated experimentally in a variety of model organisms. To date, the best-characterized family is LARP7, which includes the human protein hLARP7/PIP7S and its homologs in numerous species  as well as the telomerase subunits p43 and p65, of two ciliate species.
Human PIP7S, also called hLARP7 and HDCMA18, is an RNA binding protein with high homology to human La in its La domain but with a divergent C-terminal region. First identified in a comprehensive screen of genes showing microsatellite instability in human gastric tumors, LARP7/HDCMA18 was the second most susceptible gene to acquire a frameshift mutation in a short homopolymeric dN tract that leads to stop codons truncating RRM2 . An independent analysis of factors associated with various transcription machineries identified hLARP7/PIP7S as a P-TEFb interacting partner, although no function for hLARP7/PIP7S was apparent at that time . P-TEFb, comprised of the Cdk9 kinase and cyclin T1, is recruited to sites of active transcription and phosphorylates the CTD of RNA polymerase II, relieving a block to transcriptional elongation . Additional analyses revealed that hLARP7/PIP7S specifically binds to and enhances the stability of the 7SK snRNA [21, 103, 104], which along with the HEXIM1 & 2 proteins and the 5' methylphosphate capping enzyme, MeCPE, forms a complex (the 7SK snRNP) that sequesters P-TEFb and inhibits both its activity and the capping activity of the MeCPE [129–132]. siRNA-mediated knockdown of hLARP7/PIP7S resulted in decreased levels of 7SK RNA and the 7SK snRNP, resulting in an increase in P-TEFb dependent transcription with consequent biological effects [21, 103, 104].
Consistent with a role as a tumor-suppressor, hLARP7/PIP7S knockdown was also shown to increase P-TEFb occupancy at two tumor-related genes and was sufficient to cause the transformation of a cell line used to study carcinogenesis . In retrospect the tumor suppressor activity of hLARP7/PIP7S is consistent with the LARP7 Drosophila ortholog multisex-combs (mxc) encoding a tumor-suppressor  although it wasn’t noted to be homologous to La protein. Recently, a Drosophila homolog of 7SK RNA was identified , and HEXIM, 7SK and LARP7 homologs have also been identified in many metazoa ; this presumably reflects the evolutionary conservation of a P-TEFb or like activity that is controlled by 7SK RNA and LARP7. This leads to a very interesting question: what advantages are afforded through the regulation of P-TEFb activity by the regulated assembly and dissassembly of a RNP (see below), as opposed to the more commonly observed and typically simpler protein-mediated regulation?
A kinetic coupling model of transcription suggests that the elongation rate of pol II can determine the efficiency with which some splice acceptor or donor sites are recognized during cotranscriptional splicing. Moreover, recruitment of splicing factors to the CTD of pol II can affect alternative splicing. Consistent with this, hLARP7-mediated alterations of P-TEFb activity induces gene-specific alternative splicing .
The La domains of hLARP7/PIP7S and human La are highly homologous, and hLARP7/PIP7S shares with La all the residues previously implicated in specific UUU-3’OH recognition [76, 77] (Fig. 3A, B). As 7SK RNA is a pol III transcript ending in UUU-3’OH, it seems likely that hLARP7/PIP7S would use a binding mode similar to La for the engagement of the 7SK 3’ end. In support of this, replacement of the LAM (where the majority of UUU-3’OH specific contacts are made) from hLARP7/PIP7S with that of La results in only a modest decrease in 7SK RNA binding or 7SK snRNP formation in vivo . Consistent with this, other regions provide RNA substrate specificity and these differ for La and hLARP7/PIP7S. First, the Y127D mutation in hLARP7/PIP7S, located in the RRM1 of the La domain, significantly reduces the affinity for 7SK RNA, while (a less drastic) mutation of the equivalent residue in human La (Y114A) has little effect on pre-tRNA binding [21, 80]. More significantly, and consistent with a 7SK specificity determinant residing in the RRM of the La domain of hLARP7/PIP7S, replacing the hLARP7/PIP7S RRM1 with the hLa RRM1 led to significant decreases in 7SK RNA binding and 7SK snRNP formation in vivo . While the loop-3 of the RRM1 of hLa provides significant affinity for pre-tRNA, it remains unclear if this region of hLARP7/PIP7S is important for 7SK RNA binding. Because of the basic nature of loop-3 of hLa RRM1 it is tempting to speculate that the hLa RRM1 exhibits general affinity for RNA, perhaps recognizing RNA backbone or other features of structure with little requirement for sequence specificity, consistent with its ability to engage a wide variety of different pre-tRNA sequences as well as other RNAs, whereas the RRM1 of hLARP7/PIP7S is likely to have adopted a sequence-specific interaction with 7SK RNA.
As alluded to above, the function of the C-terminus of hLARP7/PIP7S in RNA recognition appears to be complex, since deletion of the hLARP7/PIP7S C-terminal region (or transformation-associated frameshift mutations producing premature stop codons upstream of RRM2) results in significant defects in 7SK RNA binding and 7SK snRNP formation in vivo , while other studies suggest more modest defects by the N-terminal La domain in in vitro 7SK RNA binding assays . Furthermore, the La domain of hLARP7/PIP7S, but not the full-length protein, is capable of partially complementing La 3’ end protection function in pre-tRNA maturation, as monitored using a red-white screen for tRNA-mediated suppression in S. pombe . In sum, the data suggest that the La domain of both La and hLARP7/PIP7S provide specific and high-affinity binding to UUU-3’OH, but the LARP7 C-terminus enhances binding to 7SK RNA while limiting binding to UUU-3'OH containing RNAs other than 7SK RNA. An attractive model is that the decreases in 7SK RNA levels observed in the context of C-terminally truncated hLARP7/PIP7S in vivo could be due to increased hLARP7/PIP7S occupancy by other UUU-3’OH containing RNAs which can now better compete for binding. The C-terminal region of hLARP7/PIP7S has also been shown to be important for interaction with CDK9 in the absence of 7SK RNA .
It had been known that a small fraction (2–5%) of 7SK snRNA was associated with La protein, consistent with its transcription by pol III and the idea that La is only transiently associated with nascent pol III RNAs that then go on to become part of more mature RNPs [20, 136]. Older as well as recent work from several laboratories now provide the beginnings of a model for the transfer of 7SK snRNA from La during the dynamic assembly of the 7SK snRNP. La can bind to the UUU-3'OH of nascent 7SK RNA. The short basic motif (SBM) of human La that includes a Walker A like motif independently recognizes the 5' pppG end of nascent pol III transcripts . However, a small number of pol III transcripts, including 7SK snRNA, U6 snRNA and B2 SINE RNAs, are modified on their 5' ends by the 5’ methyl capping enzyme, MeCPE to produce the monomethyl phosphate cap, CH3pppG . The 5' monomethyl cap of 7SK RNA decreases the affinity for La protein  and has also been shown to stabilize the 7SK RNA in an in vivo system . Moreover, MeCPE appears to remain associated with 7SK snRNA. Thus it would appear that hLARP7/PIP7S and the MeCPE may displace La from nascent 7SK RNA to form a RNP in which the RNA 5' and 3' ends are stabilized against exonucleolytic attack by MeCPE and LARP7/PIP7S respectively . Although the kinetic components of the 7SK snRNP assembly pathway remain to be determined, it would be useful to determine if LARP7/PIP7S plays a role in the 7SK RNP assembly and/or disassembly dynamics that contribute to regulating P-TEFb activity in a manner similar to what has been described for LARP-p65 during hierarchical assembly of the telomerase RNP .
Other well studied LARPs that segregate phylogenetically into the LARP7 family  include the ciliate homologs p43 (Eupliotes aediculatus) and p65 (Tetrahymena thermophila), which for p65 has been shown to assist in the correct folding of the telomerase RNA and hierarchal assembly of the RNP [140–144]. Unlike in yeast and vertebrates, where telomerase RNA is transcribed by pol II, in ciliates telomerase RNA is synthesized by pol III and retains its terminal uridylates in the telomerase RNP [145–148]. In the absence of p43 or p65, binding of the telomerase reverse transcriptase (TERT) to telomerase RNA and the associated telomerase activity is compromised. While this suggests that these LARPs may undertake a molecular chaperone function for the telomerase RNA similar to that which has been hypothesized for La proteins [75, 87, 92, 123, 149], the role that p65 plays in the assembly is different from that provided by general chaperone activity . Affinity, competition and footprinting experiments indicate that while p43 and p65 do have higher affinity for telomerase substrates ending in UUU-3’OH, this is not the exclusive or even principal binding determinant [141, 150, 151]. Other telomerase RNA binding determinants, in particular the stem-loop preceding the terminal uridylates, have been mapped [124, 141, 151], and similar to human hLARP7/PIP7S-7SK RNA, domains in the C-terminal region have importance in the mechanism of telomerase RNA binding. Indeed, it appears the C-terminal region of p65 is critical for any interaction with the telomerase RNA [124, 151]. This significant involvement of the C-terminal region of LARP7 family members may have special functional relevance to the nature of their RNA target interactions: human LARP7 binding to 7SK RNA and p43/p65 binding to telomerase RNA constitute stable interactions, in contrast to La, where the N-terminal La domain is thought to be largely sufficient for its more transient interactions with immature pol III transcripts that must undergo extensive processing and RNP remodeling that includes dissociation from La [73, 87, 149, 152]. However, with regard to the possibility of a molecular handoff of nascent pol III transcribed telomerase RNA during ciliate telomerase RNP assembly, we note that no genuine La protein sequence has been found in the as yet incomplete Euplotes database . Moreover, while four LAM-containing sequences have been found in Tetrahymena, none of these grouped with genuine La proteins .
LARP6 family members are characterized by having a more central, RRM-L3 containing La domain and a newly described motif of unknown function at the extreme C-terminus called the LSA motif . The first studied member of the LARP6 family is the moth Manduca sexta homolog Acheron. Originally identified in a screen for factors induced during apoptosis of intersegmental muscles during M. sexta development , Acheron shares homology with the human protein hLARP6 . Mammalian LARP6 has been implicated in cellular differentiation of myotubes and has been shown to interact with the developmental transcription factor CASK-C, a novel CASK/Lin-2 family member [56, 153]. Thus, studies of the LARP6 members of moth, fish and mammals indicate that LARP6 interacts with transcription factors, and functions upstream of the transcription factor MyoD, to control muscle development [56, 102, 153]. As a transcription factor-associated protein, LARP6 may function in the control of mRNA synthesis, and this could conceivably be related to the way LARP7 controls P-TEFb activity, although no small RNA target (i.e., akin to 7SK snRNA) has been identified for LARP6.
More recently, human LARP6 has been shown to specifically bind with high affinity (Kd 1.4 nM) to a very highly conserved stem-loop motif in the 5’UTR of α1 collagen mRNAs and is hypothesized to regulate their localized translation, suggesting a function for LARP6 in mRNA metabolism  distinct from that proposed for Acheron. Additional data showed that hLARP6 was found associated with collagen related mRNAs that share a similar stem-loop motif . Overexpression of hLARP6 led to redistribution of collagen mRNAs off of polyribosomes , similar to 5’TOP mRNAs upon overexpression of hLa . Knockdown of hLARP6 also led to decreased collagen synthesis suggesting that hLARP6 levels should be precisely controlled to regulate its effects on translation .
Like LARP7, LARP6 has conserved all of the amino acid side chains involved in UUU-3'OH recognition by genuine La proteins (Fig 3A & B). Indeed, mutating a single U to A in the single stranded bulge of the α1 collagen mRNA stem-loop abolished hLARP6 binding . However, it is not yet clear if this uracil-dependent hLARP6 binding to the α1 collagen mRNA stem-loop is mediated by the LAM binding pocket. It would be important to determine this because if so it suggests that the LAM RNA binding pocket of LARPs can bind RNAs that do not end in 3’OH. This is a significant issue because the LARPs contain the hLa equivalent of D33 which recognizes the 3’OH and discriminates against 3’ phosphate. By some estimates, it is thought that in its 3'OH binding mode, the LAM binding pocket of hLa can not accommodate a 3' phosphate. Although hLa binding to 5' TOP mRNAs suggests that some rearrangement may occur to accommodate a 3' phosphate, this is speculative.
The LARP1 family  is more generally divergent from genuine La proteins than the LARP6 and 7 families, although this is not reflected in Fig. 3A. LARP1 members lack a typical RRM after the LAM. Instead, many LARP1 family members are predicted to contain an RRM-L5 that adopts the RRM fold but lacks the RNP-1 and RNP-2 consensus sequences, though some members of this family may have no recognizable RRM adjacent to the LAM . Furthermore, members of this family may or may not harbor a C-terminal DM15 motif whose function is unknown. As indicated in Fig. 3A, the amino acids in the LAM that are used by genuine La proteins for UUU-3'OH binding are conserved in hLARP1.
Both LARP1 and LARP1b are related sequences throughout their length . The hLARP1 and hLARP1b amino acid sequences are 59% identical and 73% similar. Yet, each appears to have been inherited as separate lineages from an ancient genome since both are found in a wide variety of organisms.
The first characterized member of the LARP1 family was the D. melanogaster protein dlarp, a Hox target shown to be important in spermatogenesis and later found to bind PABP [155–157]. Another characterized member is LARP1 from C. elegans. Like human La, the La domain of LARP1 has the ability to bind poly (U) and poly (G) . Surprisingly, the C-terminal region of the protein (which includes the DM15 motif) was also reported to have this ability in isolation from the La domain. LARP1 null worms are viable, but display defects in oogenesis. Deletion of C. elegans LARP1 resulted in an increase in the steady state levels of certain mRNAs. Although it was not determined whether the increase in mRNA levels in the larp-1 deletion mutant was due to alterations in transcription or decay, LARP1 protein appeared to colocalize with P bodies which are sites of mRNA degradation, and this led to the suggestion that LARP1 functions to lower mRNA levels by promoting mRNA decay . However, the loci observed were not distinguished from other loci such as stress granules which are linked to distinct effects on mRNA metabolism. These results are somewhat reminiscent of observations made for the distantly related S. cerevisiae LARPs Sro9p and Slr1p, which bind mRNAs on translating ribosomes, and data that localize Sro9p to punctate bodies in the cytoplasm . Curiously, overexpression of Sro9p led to increases in mRNA levels that were in part due to augmented transcription by pol II and in part to increased mRNA stability . Consistent with a possible link to the LARP1 family, Sro9p and Slr1p also display a domain architecture found in certain LARP1 family members, specifically, the presence of a more C-terminally located LAM without an adjacent RRM . While the present data point to a role for the LARP1 family in mRNA metabolism, questions about the nature of this function and possible mechanisms for target RNA binding and recognition are still unclear.
The LARP4 family is notably distinguished from La proteins and other LARP families by the lack of conservation of 2 or 3 of the key side chains in the LAM that are used by La proteins to recognize UUU-3'OH (Fig. 3A). In humans (and many other organisms) two LARP4 family members are found (hLARP4 and hLARP4b, previously called hLARP5) , which share 37% identity and 53% similarity throughout their amino acid sequence. Accordingly, the gene for each appears to have been inherited as separate lineages from an ancient genome since both are found in a wide variety of organisms, sometimes on different chromosomes.
Unlike other LARPs, LARP4 and LARP4b diverge in several of the amino acids critical for canonical UUU-3’OH recognition, suggesting that they may have distinct functions and RNA targets. Their divergence from genuine La and each other is most notable at position 24 equivalent to hLa proteins where C replaces the invariant aromatic (usually F), as well as at position 55 where M (LARP4) or L (LARP4b) replace the invariant F, and position 57 where E (LARP4) or H (LARP4b) replace either K or R (Fig. 3A). Moreover, LARP4b bears T in place of Q20, striking because Q20 is one of the few side chains in hLa that makes specific contacts to the key U-2 base of UUU-3'OH. These observations indicate that the LARP4 family is the most diverged from genuine La proteins and other LARPs in the RNA binding pockets of their LAMs.
We are aware of only one functional study of a LARP4 family member, human LARP4 (R.Y. and R.M., submitted), which suggests that hLARP4 is a cytoplasmic, polyribosome-associated protein that interacts with poly-A binding protein (PABP). LARP4 overexpression stabilizes the mRNAs whose levels it affects as determined by actinomycin D treatment followed by half-life analyses (R.Y. and R.M., submitted). Thus, LARP4 appears to act as a positive factor to promote mRNA stability, in contrast to what is suspected for LARP1, i.e., that it promotes mRNA decay. However, notable similarities for LARP4 and LARP1 exist. Drosophila LARP1 was also found to be associated with PABP in vivo, also in a manner resistant to RNAse . The Yang et al. study shows that LARP4 uses a highly conserved PAM2 motif  to interact with PABP as do several other PABP-associating proteins that are involved in translation and/or posttranscriptional mRNA metabolism. The presence of a highly conserved PAM2 in LARP4 and 4b family members suggests that these are constituents of a larger system of PABP-interacting proteins that compete for the PAM2 interaction site on PABP. We could not identify a candidate PAM2 sequence in LARP1 (unpublished observation), and the mechanism by which LARP1 associates with PABP remains to be determined.
In summary, most LARPs have conserved the LAM-RRM arrangement and exhibit varying degrees of conservation of the amino acid side chains in the LAM that are involved in UUU-3'OH binding by genuine La proteins. However, as noted for human La, these amino acid side chains make surprisingly few contacts that are U-specific, and this appears to account for the ability of human La to recognize sequences other than U at the −1 and −3 position with minimal loss of affinity. On the basis of these observations together with the observed plasticity of the human La UUU-3’OH binding pocket, there would appear to be great potential for plasticity of the LAM binding pockets of LARPs, including the ability to bind sequences other than UUU-3'OH. Nonetheless, based on the degree to which these amino acids have been conserved by LARP7 family members which have been well documented to recognize UUU-3’OH on their RNA ligands, it is tempting to speculate that sequence specificity of the UUU-3’OH binding pockets of LARPs may be dictated by their conservation of these amino acid side chains. A single U to A mutation in the highly conserved collagen mRNA 5’ UTR abolishes binding by hLARP6 in vitro suggesting that U recognition is maintained by the LAM of LARP6, which has also conserved all of these key amino acids, in a manner similar to hLa although mutagenesis of hLARP6 residues will be needed to confirm this. A corollary to this view is that some LARPs, including those with the lowest conservation of these amino acids, such as LARPs 4 and 4b may be expected to recognize RNA sequence other than UUU-3'OH.
Moreover, the phylogenetic analysis presented in Fig. 3A, which reveals lack of conservation of Y at position 23 of genuine La proteins, the side chain that makes one of only two protein-protein interactions between the LAM and RRM1 and thereby contributes to fixation of the LAM and RRM orientations relative to each other, suggests additional plasticity in how the LAM and RRM1 may be arranged to work together during RNA binding. For hLa protein, the independent nature of the LAM and RRM1 binding sites is illustrated by their differential sensitivity to magnesium ions . The relative contributions of the LAM and RRM1 to RNA binding may vary for the different LARPs. For example, for La protein, the LAM is used for sequence-specific binding whereas the RRM1 appears to be an ancillary binding site that can accommodate different RNAs. It is suspected that for some LARPs the RRM1 may dominate over the LAM in sequence specificity and/or affinity. Future biochemical and structural studies are expected to contribute largely to our understanding of these interesting proteins.
Finally, a theme from emerging studies seems to be that some LARPs are involved in RNA stability, as is the case for genuine La proteins which protect their ligands against 3’ end-mediated decay. Thus, LARP7 would appear to protect 7SK snRNA from 3’ end digestion whereas LARPs −1, −4 and −5 may be involved in mRNA stability or other aspects of mRNA metabolism although many details and confirmatory studies will be necessary to shed more light on this.
We are grateful to Sasi Conte, Stephen Curry, Cécile Bousquet-Antonelli and Ying Huang for critical comments and discussion, and to Qiang Zhou and Cécile Bousquet-Antonelli for sharing data prior to publication. This work was supported by the National Science and Engineering Council of Canada (M.B.) and the Intramural Research Program of the NICHD, NIH (R.M and R.Y.)
Publisher's Disclaimer: This is a PDF file of an unedited manuscript that has been accepted for publication. As a service to our customers we are providing this early version of the manuscript. The manuscript will undergo copyediting, typesetting, and review of the resulting proof before it is published in its final citable form. Please note that during the production process errors may be discovered which could affect the content, and all legal disclaimers that apply to the journal pertain.