The SR proteins are a family of conserved splicing factors that consist of either one or two N-terminal RNA recognition motifs (RRM) and a C-terminal arginine- and serine-rich (RS) domain
[1],
[2]. SR proteins promote constitutive and alternative splicing through multiple modes
[3], some of which are presumed to require their RS domains. Exonic splicing enhancers (ESEs) are degenerate 6–8 nucleotide motifs that promote exon inclusion, in many cases through the action of SR proteins
[4]–
[9]. SR proteins bind to ESEs via their RRM(s)
[10], whereas their RS domains are thought to function as protein-protein interaction modules that facilitate exon inclusion by recruiting components of the basal splicing machinery to the flanking 5′ and 3′ splice sites early in splice-site recognition
[11]. In yeast two-hybrid and Far Western assays, the SR protein SF2/ASF was shown to interact with itself and with the U1-snRNP-specific protein U1-70K and the small subunit of the U2AF heterodimer, U2AF
35; these protein-protein interactions required the RS domains of each protein
[12],
[13]. Subsequently it was proposed that SR proteins can promote recruitment of the U1 snRNP to the 5′ splice site through SR protein RS-domain-mediated interactions with U1-70K
[14]. However, the RS domain of SF2/ASF alone is unable to interact with U1-70K
in vitro [15]. Enhancer-bound SR proteins are also thought to escort U2AF
65 to the 3′ splice site polypyrimidine tract through RS-domain-mediated recruitment of U2AF
35 [16],
[17],
[18]. A role for SR proteins in bringing U2AF
65 to the polypyrimidine tract is supported by several experiments in which improving this tract can relieve the requirement for an ESE for pre-mRNAs with enhancer-dependent introns
[19],
[20],
[21]. However, other experiments failed to detect changes in U2AF recruitment in the presence versus in the absence of an ESE
[22],
[23], calling into question the hypothesis that an ESE-bound RS domain is required for recruitment of U2AF
65. Although the aforementioned functions of SR proteins are assumed to occur via RS-domain-mediated protein-protein interactions, it has not yet been demonstrated that such interactions occur in the context of a functional spliceosome
[24].
A second mode by which SR proteins promote exon inclusion is by antagonizing the negative regulation conferred by exonic splicing silencers (ESSs), pre-mRNA regulatory elements that inhibit exon inclusion in both constitutive and alternative splicing
[25]. Although the mechanisms by which SR proteins counteract the effects of splicing silencers are not well understood
[4], their RS domains are not always required for this function, as SF2/ASF lacking its RS domain can act from the position of an HIV
tat exon 3 ESE to antagonize an ESS present in the same exon
[26].
A third mechanism by which SR proteins have been reported to promote splicing is by engaging in transient RS domain-pre-mRNA contacts during the course of splicing. An ESE-bound RS domain can interact directly with the branchpoint of an IgM substrate in the pre-spliceosomal A complex
[27]. The RS domains of SR proteins bound to ESEs can also act as protein-RNA interaction modules to promote base-pairing of pre-mRNA and U5 and U6 snRNAs during the course of pre-mRNA splicing
[28]. However, although it is clear that an RS domain recruited to the ESE position can function as a splicing activator, an RS domain tethered to the position of the ESE is not always required, as splicing of the ESE-dependent substrate dsx lacking its ESE can also be accomplished simply by the addition of an excess of free RS domain to nuclear extract
[27], consistent with the hypothesis that the function of an SR protein may be merely to recruit any RS domain to the vicinity of the splicing signal. On the other hand, adding an RS domain peptide to nuclear extract is insufficient to promote exon inclusion for
BRCA1 pre-mRNA exon 18 lacking a functional ESE, whereas recruitment of a synthetic RS domain to the mutated ESE rescues inclusion of this exon
[29].
The RS domains of SR proteins are conserved, and the serine residues within these domains are targets of phosphorylation by multiple kinases, including SRPK1
[30] and SRPK2
[31], Clk/Sty
[32], and DNA topoisomerase I
[33]. Phosphorylation of RS domains influences the subcellular localization of SR proteins
[32],
[34],
[35],
[36]. The phosphorylation state of the RS domain has a significant influence on SR protein function, as both hyper- and hypophosphorylated SR proteins are unable to support splicing
[37],
[38],
[39]. SR protein RS domains were at one time thought to be indispensable for constitutive splicing
in vitro, yet dispensable for concentration-dependent effects on alternative splice-site selection
[40],
[41]. However, we subsequently found that the RS domain of SF2/ASF is not required for constitutive splicing of several pre-mRNAs
in vitro, including tat23, an ESE-dependent pre-mRNA known to be regulated by an ESS
[42]. Thus, pre-mRNAs could be classified as either RS-domain-dependent or RS-domain-independent, based on their ability to be spliced with an SR protein lacking its RS domain (“ΔRS”). RS-domain-dependence was found to be related to the strength of the 3′ splice site and the requirement for U2AF
35 [42]. IgM M1-M2 was identified as an RS-domain-dependent pre-mRNA, congruent with at least some previous reports that it is U2AF
35-dependent and possesses relatively weak polypyrimidine-tract and branchpoint sequences
[43],
[44].
IgM M1-M2 has been used by several laboratories as a model substrate to explore the role(s) of ESEs in promoting pre-mRNA splicing. However, the functions of the ESE-bound SR protein in the context of the RS-domain-dependent IgM M1-M2 pre-mRNA have been controversial, and there are several competing models for the mechanism by which SF2/ASF promotes splicing at the ESE position in this substrate. In the recruitment model, SF2/ASF binds via its RS domain to U2AF
35 to indirectly recruit U2AF
65 to the polypyrimidine tract
[21], whereas in the antagonism model, the sole function of enhancer-bound SF2/ASF is to prevent PTB from binding to a downstream ESS
[45]. Some experiments with the IgM M1-M2 substrate strongly support the model for SR protein function in which an ESE-bound RS domain recruits U2AF
35 and U2AF
65 to the polypyrimidine tract
[21]. However, other experiments detected no difference in U2AF
35 occupancy on IgM M1-M2 in the presence and absence of the ESE
[23].
The discovery that many but not all substrates could be spliced with SF2/ASF lacking its RS domain
[42] suggested that SR protein functions might be subdivided into RS-domain-dependent and RS-domain-independent categories. We prepared various fragments of SF2/ASF for structural and functional studies, including versions lacking the C-terminal RS domain and/or an N-terminal extension that precedes RRM1. N-terminal and C-terminal extensions of RRMs have been demonstrated to regulate nucleic acid binding in other splicing factors
[46], and we noted that SF2/ASF and some other SR proteins have N-terminal RRM extensions. We had previously characterized an N-terminally His-tagged SF2/ASF lacking the RS domain as unable to complement S100 for constitutive splicing
[40], but omitting this N-terminal tag allowed the same protein to support splicing of some pre-mRNAs
[42]. These precedents suggested that the natural N-terminus of SF2/ASF may influence its activity, and we therefore investigated whether the N-terminal extension preceding RRM1 had any influence on the splicing activity of ΔRS with the RS-domain-dependent substrate IgM M1-M2. Deletion of the N-terminus from ΔRS revealed that the RS domain is not required for splicing of IgM M1-M2, lending further support to our previous finding that the RS domain of SF2/ASF is sometimes dispensable for splicing
in vitro, and calling for a reevaluation of traditional models of SR protein function.