Exonic splicing enhancers (ESEs) are pre-mRNA cis-acting elements required for splice-site recognition. We previously developed a web-based program called ESEfinder that scores any sequence for the presence of ESE motifs recognized by the human SR proteins SF2/ASF, SRp40, SRp55 and SC35. Using ESEfinder, we have undertaken a large-scale analysis of ESE motif distribution in human protein-coding genes. Significantly higher frequencies of ESE motifs were observed in constitutive internal protein-coding exons, compared both with their flanking intronic regions and with pseudo exons. Statistical analysis of ESE motif frequency distributions revealed a complex relationship between splice-site strength and increased or decreased frequencies of particular SR protein motifs. Comparison of constitutively and alternatively spliced exons demonstrated slightly weaker splice-site scores, as well as significantly fewer ESE motifs, in the alternatively spliced group. Our results underline the importance of ESE-mediated SR protein function in the process of exon definition, in the context of both constitutive splicing and regulated alternative splicing.
doi:10.1093/nar/gki810
PMCID: PMC1201331
PMID: 16147989
Human immunodeficiency virus type 1 (HIV-1) exonic splicing silencers (ESSs) inhibit production of certain spliced viral RNAs by repressing alternative splicing of the viral precursor RNA. Several HIV-1 ESSs interfere with spliceosome assembly by binding cellular hnRNP A/B proteins. Here, we have further characterized the mechanism of splicing repression using a representative HIV-1 hnRNP A/B-dependent ESS, ESSV, which regulates splicing at the vpr 3′ splice site. We show that hnRNP A/B proteins bound to ESSV are necessary to inhibit E complex assembly by competing with the binding of U2AF65 to the polypyrimidine tracts of repressed 3′ splice sites. We further show evidence suggesting that U1 snRNP binds the 5′ splice site despite an almost complete block of splicing by ESSV. Possible splicing-independent functions of U1 snRNP-5′ splice site interactions during virus replication are discussed.
doi:10.1128/MCB.23.23.8762-8772.2003
PMCID: PMC262674
PMID: 14612416
Cryptic splice sites are used only when use of a natural splice site is disrupted by mutation. To determine the features that distinguish authentic from cryptic 5′ splice sites (5′ss), we systematically analyzed a set of 76 cryptic 5′ss derived from 46 human genes. These cryptic 5′ss have a similar frequency distribution in exons and introns, and are usually located close to the authentic 5′ss. Statistical analysis of the strengths of the 5′ss using the Shapiro and Senapathy matrix revealed that authentic 5′ss have significantly higher score values than cryptic 5′ss, which in turn have higher values than the mutant ones. β-Globin provides an interesting exception to this rule, so we chose it for detailed experimental analysis in vitro. We found that the sequences of the β-globin authentic and cryptic 5′ss, but not their surrounding context, determine the correct 5′ss choice, although their respective scores do not reflect this functional difference. Our analysis provides a statistical basis to explain the competitive advantage of authentic over cryptic 5′ss in most cases, and should facilitate the development of tools to reliably predict the effect of disease-associated 5′ss-disrupting mutations at the mRNA level.
doi:10.1093/nar/gkg830
PMCID: PMC275472
PMID: 14576320
Point mutations frequently cause genetic diseases by disrupting the correct pattern of pre-mRNA splicing. The effect of a point mutation within a coding sequence is traditionally attributed to the deduced change in the corresponding amino acid. However, some point mutations can have much more severe effects on the structure of the encoded protein, for example when they inactivate an exonic splicing enhancer (ESE), thereby resulting in exon skipping. ESEs also appear to be especially important in exons that normally undergo alternative splicing. Different classes of ESE consensus motifs have been described, but they are not always easily identified. ESEfinder (http://exon.cshl.edu/ESE/) is a web-based resource that facilitates rapid analysis of exon sequences to identify putative ESEs responsive to the human SR proteins SF2/ASF, SC35, SRp40 and SRp55, and to predict whether exonic mutations disrupt such elements.
PMCID: PMC169022
PMID: 12824367
Splicing factors of the SR protein family share a modular structure consisting of one or two RNA recognition motifs (RRMs) and a C-terminal RS domain rich in arginine and serine residues. The RS domain, which is extensively phosphorylated, promotes protein-protein interactions and directs subcellular localization and—in certain situations—nucleocytoplasmic shuttling of individual SR proteins. We analyzed mutant versions of human SF2/ASF in which the natural RS repeats were replaced by RD or RE repeats and compared the splicing and subcellular localization properties of these proteins to those of SF2/ASF lacking the entire RS domain or possessing a minimal RS domain consisting of 10 consecutive RS dipeptides (RS10). In vitro splicing of a pre-mRNA that requires an RS domain could take place when the mutant RD, RE, or RS10 domain replaced the natural domain. The RS10 version of SF2/ASF shuttled between the nucleus and the cytoplasm in the same manner as the wild-type protein, suggesting that a tract of consecutive RS dipeptides, in conjunction with the RRMs of SF2/ASF, is necessary and sufficient to direct nucleocytoplasmic shuttling. However, the SR protein SC35 has two long stretches of RS repeats, yet it is not a shuttling protein. We demonstrate the presence of a dominant nuclear retention signal in the RS domain of SC35.
doi:10.1128/MCB.22.19.6871-6882.2002
PMCID: PMC134038
PMID: 12215544
The RNA-recognition motif (RRM) is a common and evolutionarily conserved RNA-binding module. Crystallographic and solution structural studies have shown that RRMs adopt a compact α/β structure, in which four antiparallel β-strands form the major RNA-binding surface. Conserved aromatic residues in the RRM are located on the surface of the β-sheet and are important for RNA binding. To further our understanding of the structural basis of RRM-nucleic acid interaction, we carried out a high resolution analysis of UP1, the N-terminal, two-RRM domain of heterogeneous nuclear ribonucleoprotein A1 (hnRNP A1), whose structure was previously solved at 1.75–1.9 Å resolution. The two RRMs of hnRNP A1 are closely related but have distinct functions in regulating alternative pre-mRNA splice site selection. Our present 1.1 Å resolution crystal structure reveals that two conserved solvent-exposed phenylalanines in the first RRM have alternative side chain conformations. These conformations are spatially correlated, as the individual amino acids cannot adopt each of the observed conformations independently. These phenylalanines are critical for nucleic acid binding and the observed alternative side chain conformations may serve as a mechanism for regulating nucleic acid binding by RRM-containing proteins.
PMCID: PMC101846
PMID: 11917013
The synthesis of human immunodeficiency virus type 1 (HIV-1) mRNAs is a complex process by which more than 30 different mRNA species are produced by alternative splicing of a single primary RNA transcript. HIV-1 splice sites are used with significantly different efficiencies, resulting in different levels of mRNA species in infected cells. Splicing of Tat mRNA, which is present at relatively low levels in infected cells, is repressed by the presence of exonic splicing silencers (ESS) within the two tat coding exons (ESS2 and ESS3). These ESS elements contain the consensus sequence PyUAG. Here we show that the efficiency of splicing at 3′ splice site A2, which is used to generate Vpr mRNA, is also regulated by the presence of an ESS (ESSV), which has sequence homology to ESS2 and ESS3. Mutagenesis of the three PyUAG motifs within ESSV increases splicing at splice site A2, resulting in increased Vpr mRNA levels and reduced skipping of the noncoding exon flanked by A2 and D3. The increase in Vpr mRNA levels and the reduced skipping also occur when splice site D3 is mutated toward the consensus sequence. By in vitro splicing assays, we show that ESSV represses splicing when placed downstream of a heterologous splice site. A1, A1B, A2, and B1 hnRNPs preferentially bind to ESSV RNA compared to ESSV mutant RNA. Each of these proteins, when added back to HeLa cell nuclear extracts depleted of ESSV-binding factors, is able to restore splicing repression. The results suggest that coordinate repression of HIV-1 RNA splicing is mediated by members of the hnRNP A/B protein family.
doi:10.1128/JVI.75.18.8487-8497.2001
PMCID: PMC115094
PMID: 11507194
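The PyUAG consensus mentioned in the abstract above can be located in a sequence with a simple pattern search. The following is a minimal sketch, not the authors' method: it assumes "Py" means a pyrimidine (C or U in RNA), and the function name `find_pyuag` and the toy sequence are hypothetical, chosen only for illustration.

```python
import re

# Py = pyrimidine, i.e. C or U in RNA, so PyUAG expands to CUAG or UUAG.
PYUAG = re.compile(r"[CU]UAG")


def find_pyuag(sequence: str) -> list[tuple[int, str]]:
    """Return (0-based position, matched motif) for every PyUAG occurrence."""
    # Accept DNA-style input by converting T to U before searching.
    rna = sequence.upper().replace("T", "U")
    return [(m.start(), m.group()) for m in PYUAG.finditer(rna)]


# Toy sequence (hypothetical, not taken from HIV-1).
toy = "GGCUAGAAUUAGCCAUAGG"
for position, motif in find_pyuag(toy):
    print(position, motif)
```

In the toy sequence, AUAG is not reported because its first base is a purine. A match only says that the consensus is present; whether a given site acts as a silencer is an experimental question.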
The first component known to recognize and discriminate among potential 5′ splice sites (5′SSs) in pre-mRNA is the U1 snRNP. However, the relative levels of U1 snRNP binding to alternative 5′SSs do not necessarily determine the splicing outcome. Strikingly, SF2/ASF, one of the essential SR protein-splicing factors, causes a dose-dependent shift in splicing to a downstream (intron-proximal) site, and yet it increases U1 snRNP binding at upstream and downstream sites simultaneously. We show here that hnRNP A1, which shifts splicing towards an upstream 5′SS, causes reduced U1 snRNP binding at both sites. Nonetheless, the importance of U1 snRNP binding is shown by proportionality between the level of U1 snRNP binding to the downstream site and its use in splicing. With purified components, hnRNP A1 reduces U1 snRNP binding to 5′SSs by binding cooperatively and indiscriminately to the pre-mRNA. Mutations in hnRNP A1 and SF2/ASF show that the opposite effects of the proteins on 5′SS choice are correlated with their effects on U1 snRNP binding. Cross-linking experiments show that SF2/ASF and hnRNP A1 compete to bind pre-mRNA, and we conclude that this competition is the basis of their functional antagonism; SF2/ASF enhances U1 snRNP binding at all 5′SSs, the rise in simultaneous occupancy causing a shift in splicing towards the downstream site, whereas hnRNP A1 interferes with U1 snRNP binding such that 5′SS occupancy is lower and the affinities of U1 snRNP for the individual sites determine the site of splicing.
PMCID: PMC102138
PMID: 11046128
Individual members of the serine-arginine (SR) and heterogeneous nuclear ribonucleoprotein (hnRNP) A/B families of proteins have antagonistic effects in regulating alternative splicing. Although hnRNP A1 accumulates predominantly in the nucleus, it shuttles continuously between the nucleus and the cytoplasm. Some but not all SR proteins also undergo nucleo-cytoplasmic shuttling, which is affected by phosphorylation of their serine/arginine (RS)–rich domain. The signaling mechanisms that control the subcellular localization of these proteins are unknown. We show that exposure of NIH-3T3 and SV-40 transformed green monkey kidney (COS) cells to stress stimuli such as osmotic shock or UVC irradiation, but not to mitogenic activators such as PDGF or EGF, results in a marked cytoplasmic accumulation of hnRNP A1, concomitant with an increase in its phosphorylation. These effects are mediated by the MKK3/6-p38 pathway, and moreover, p38 activation is necessary and sufficient for the induction of hnRNP A1 cytoplasmic accumulation. The stress-induced increase in the cytoplasmic levels of hnRNP A/B proteins and the concomitant decrease in their nuclear abundance are paralleled by changes in the alternative splicing pattern of an adenovirus E1A pre-mRNA splicing reporter. These results suggest the intriguing possibility that signaling mechanisms regulate pre-mRNA splicing in vivo by influencing the subcellular distribution of splicing factors.
PMCID: PMC2175157
PMID: 10769024
alternative splicing; hnRNP A1; signal transduction; p38 kinase; stress signaling
Exonic splicing enhancers (ESEs) are important cis elements required for exon inclusion. Using an in vitro functional selection and amplification procedure, we have identified a novel ESE motif recognized by the human SR protein SC35 under splicing conditions. The selected sequences are functional and specific: they promote splicing in nuclear extract or in S100 extract complemented by SC35 but not by SF2/ASF. They can also function in a different exonic context from the one used for the selection procedure. The selected sequences share one or two close matches to a short and highly degenerate octamer consensus, GRYYcSYR. A score matrix was generated from the selected sequences according to the nucleotide frequency at each position of their best match to the consensus motif. The SC35 score matrix, along with our previously reported SF2/ASF score matrix, was used to search the sequences of two well-characterized splicing substrates derived from the mouse immunoglobulin M (IgM) and human immunodeficiency virus tat genes. Multiple SC35 high-score motifs, but only two widely separated SF2/ASF motifs, were found in the IgM C4 exon, which can be spliced in S100 extract complemented by SC35. In contrast, multiple high-score motifs for both SF2/ASF and SC35 were found in a variant of the Tat T3 exon (lacking an SC35-specific silencer) whose splicing can be complemented by either SF2/ASF or SC35. The motif score matrix can help locate SC35-specific enhancers in natural exon sequences.
PMCID: PMC85223
PMID: 10629063
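The score-matrix idea in the abstract above (nucleotide frequency at each position of aligned motif matches, then a search of exon sequences for high-score motifs) can be sketched as follows. This is an illustration under stated assumptions, not the published SC35 matrix: the four octamers are invented to fit the GRYYcSYR consensus, and the log-odds form, the uniform background, the pseudocount, and the threshold are all assumptions. The names `build_score_matrix` and `scan` are hypothetical.

```python
import math

BASES = "ACGU"

# Hypothetical aligned octamers that fit GRYYcSYR; NOT the selected sequences.
TOY_MOTIFS = ["GACCCGCA", "GGUUCCUG", "GACUCGUA", "GGCCCCCG"]


def build_score_matrix(aligned, pseudocount=0.5):
    """Build per-position log2-odds scores from equal-length aligned motifs.

    Assumes a uniform background (0.25 per base); the pseudocount keeps
    unobserved bases from producing log(0).
    """
    length = len(aligned[0])
    if any(len(motif) != length for motif in aligned):
        raise ValueError("all motifs must have the same length")
    total = len(aligned) + pseudocount * len(BASES)
    matrix = []
    for pos in range(length):
        column = [motif[pos] for motif in aligned]
        matrix.append({
            base: math.log2(((column.count(base) + pseudocount) / total) / 0.25)
            for base in BASES
        })
    return matrix


def scan(sequence, matrix, threshold):
    """Return (position, window, score) for every window scoring >= threshold."""
    rna = sequence.upper().replace("T", "U")  # accept DNA-style input
    width = len(matrix)
    hits = []
    for start in range(len(rna) - width + 1):
        window = rna[start:start + width]
        score = sum(matrix[offset][base] for offset, base in enumerate(window))
        if score >= threshold:
            hits.append((start, window, round(score, 2)))
    return hits


matrix = build_score_matrix(TOY_MOTIFS)
# The threshold of 5.0 is arbitrary and chosen only for this toy example.
print(scan("AAGACCCGCAUU", matrix, threshold=5.0))
```

With real data, the matrix would be built from the functionally selected sequences and the threshold calibrated against them; the sequence is assumed to contain only A, C, G, and U (or T).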
PMCID: PMC84117
PMID: 10207048
Expression of most RNA polymerase II transcripts requires the coordinated execution of transcription, splicing, and 3′ processing. We have previously shown that upon transcriptional activation of a gene in vivo, pre-mRNA splicing factors are recruited from nuclear speckles, in which they are concentrated, to sites of transcription (Misteli, T., J.F. Cáceres, and D.L. Spector. 1997. Nature. 387:523–527). This recruitment process appears to spatially coordinate transcription and pre-mRNA splicing within the cell nucleus. Here we have investigated the molecular basis for recruitment by analyzing the recruitment properties of mutant splicing factors. We show that multiple protein domains are required for efficient recruitment of SR proteins from nuclear speckles to nascent RNA. The two types of modular domains found in the splicing factor SF2/ASF exert distinct functions in this process. In living cells, the RS domain functions in the dissociation of the protein from speckles, and phosphorylation of serine residues in the RS domain is a prerequisite for this event. The RNA binding domains play a role in the association of splicing factors with the target RNA. These observations identify a novel in vivo role for the RS domain of SR proteins and suggest a model in which protein phosphorylation is instrumental for the recruitment of these proteins to active sites of transcription in vivo.
PMCID: PMC2132840
PMID: 9786943
nucleus; phosphorylation; pre-mRNA splicing; recruitment; transcription
We report striking differences in the substrate specificities of two human SR proteins, SF2/ASF and SC35, in constitutive splicing. β-Globin pre-mRNA (exons 1 and 2) is spliced indiscriminately with either SR protein. Human immunodeficiency virus tat pre-mRNA (exons 2 and 3) and immunoglobulin μ-chain (IgM) pre-mRNA (exons C3 and C4) are preferentially spliced with SF2/ASF and SC35, respectively. Using in vitro splicing with mutated or chimeric derivatives of the tat and IgM pre-mRNAs, we defined specific combinations of segments in the downstream exons, which mediate either positive or negative effects to confer SR protein specificity. A series of recombinant chimeric proteins consisting of domains of SF2/ASF and SC35 in various combinations was used to localize trans-acting domains responsible for substrate specificity. The RS domains of SF2/ASF and SC35 can be exchanged without effect on substrate specificity. The RNA recognition motifs (RRMs) of SF2/ASF are active only in the context of a two-RRM structure, and RRM2 has a dominant role in substrate specificity. In contrast, the single RRM of SC35 can function alone, but its substrate specificity can be influenced by the presence of an additional RRM. The RRMs behave as modules that, when present in different combinations, can have positive, neutral, or negative effects on splicing, depending upon the specific substrate. We conclude that SR protein-specific recognition of specific positive and negative pre-mRNA exonic elements via one or more RRMs is a crucial determinant of the substrate specificity of SR proteins in constitutive splicing.
PMCID: PMC83978
PMID: 10022872
SR proteins are required for constitutive pre-mRNA splicing and also regulate alternative splice site selection in a concentration-dependent manner. They have a modular structure that consists of one or two RNA-recognition motifs (RRMs) and a COOH-terminal arginine/serine-rich domain (RS domain). We have analyzed the role of the individual domains of these closely related proteins in cellular distribution, subnuclear localization, and regulation of alternative splicing in vivo. We observed striking differences in the localization signals present in several human SR proteins. In contrast to earlier studies of RS domains in the Drosophila suppressor-of-white-apricot (SWAP) and Transformer (Tra) alternative splicing factors, we found that the RS domain of SF2/ASF is neither necessary nor sufficient for targeting to the nuclear speckles. Although this RS domain is a nuclear localization signal, subnuclear targeting to the speckles requires at least two of the three constituent domains of SF2/ASF, which contain additive and redundant signals. In contrast, in two SR proteins that have a single RRM (SC35 and SRp20), the RS domain is both necessary and sufficient as a targeting signal to the speckles. We also show that RRM2 of SF2/ASF plays an important role in alternative splicing specificity: deletion of this domain results in a protein that, although active in alternative splicing, has altered specificity in 5′ splice site selection. These results demonstrate the modularity of SR proteins and the importance of individual domains for their cellular localization and alternative splicing function in vivo.
PMCID: PMC2138183
PMID: 9230067