|Home | About | Journals | Submit | Contact Us | Français|
Correct interpretation of the coding capacity of RNA polymerase II transcribed eukaryotic genes is determined by the recognition and removal of intronic sequences of pre-mRNAs by the spliceosome. Our current knowledge on dynamic assembly and subunit interactions of the spliceosome mostly derived from the characterization of yeast, Drosophila, and human spliceosomal complexes formed on model pre-mRNA templates in cell extracts. In addition to sequential structural rearrangements catalyzed by ATP-dependent DExH/D-box RNA helicases, catalytic activation of the spliceosome is critically dependent on its association with the NineTeen Complex (NTC) named after its core E3 ubiquitin ligase subunit PRP19. NTC, isolated recently from Arabidopsis, occurs in a complex with the essential RNA helicase and GTPase subunits of the U5 small nuclear RNA particle that are required for both transesterification reactions of splicing. A compilation of mass spectrometry data available on the composition of NTC and spliceosome complexes purified from different organisms indicates that about half of their conserved homologs are encoded by duplicated genes in Arabidopsis. Thus, while mutations of single genes encoding essential spliceosome and NTC components lead to cell death in other organisms, differential regulation of some of their functionally redundant Arabidopsis homologs permits the isolation of partial loss of function mutations. Non-lethal pleiotropic defects of these mutations provide a unique means for studying the roles of NTC in co-transcriptional assembly of the spliceosome and its crosstalk with DNA repair and cell death signaling pathways.
Intron sequences of RNA polymerase II (RNAPII) transcribed nuclear pre-mRNAs in eukaryotes are removed by the spliceosome to produce mature mRNAs. The spliceosome is composed of U-rich U1, U2, U4, U5, and U6 small nuclear snRNAs, which form snRNP particles with specific sets of proteins. Spliceosomal snRNPs undergo in cell extracts timely ordered assembly on model pre-mRNA templates and recruit numerous stage specific auxiliary regulatory factors, the complexity of which increases in metazoan’s evolution (Wahl et al., 2009). Except for plants, where thus far no suitable in vitro splicing assay is available, past studies of in vitro spliceosome assembly have generated a wealth of mass spectrometry, RNA cross-linking, and crystallographic data on basic functions, binding specificities, and interactions of core spliceosomal proteins. Other studies have independently identified the functions of numerous auxiliary factors that co-purify with different snRNPs. The emerging regulatory interactions reveal a huge complexity and gene/intron specific variation of in vivo spliceosome interactions with the RNAPII transcription initiation, capping, elongation, and polyadenylation/cleavage complexes, as well as subunits of the exon junction, pre-mRNA splicing and retention (RES), mRNA export (THO/TREX), exosome, microRNA processing, nuclear pore, nonsense-mediated decay (NMD), and chromatin remodeling/modifying complexes. In fact, accumulating data on transcript specific splicing kinetics and alternative splicing demonstrate that these interactions govern co-transcriptional assembly, selectivity, and progressivity of the spliceosome, which are also remarkably dependent on changes in epigenetic modifications of histones during different phases of transcription (Oesterreich et al., 2011). Co-transcriptional coupling and regulation of the spliceosome activity is further indicated by the fact that introns are removed from the majority of pre-mRNAs before completion of transcription in yeast and mammalian cells. In general, splicing of introns close to the 5′-ends of transcripts is completed earlier compared to the removal of 3′ introns. Due to different sequence features of introns, their splicing does not however follow always the order of their transcription. Some “difficult” introns might remain completely or partially unspliced upon transcription termination. Partially spliced pre-mRNAs accumulate together with hyperphosphorylated RNAPII, spliceosome components, and auxiliary factors in nuclear speckles, especially in differentiated cells types, where their processing occurs post-transcriptionally (Han et al., 2011). Thus, while spliceosome assembly appears to be regulated co-transcriptionally, splicing per se does not necessarily depend on active transcription.
As in other metazoans, the majority of introns are recognized by the U2-type spliceosome in Arabidopsis. Regulatory features and targets of Arabidopsis U12-specific minor spliceosome have been reviewed recently (Simpson and Brown, 2008). Here we provide a compilation of proteins identified by mass spectrometry in 27 purified yeast, Drosophila, and human U2-type spliceosome complexes and annotation of corresponding Arabidopsis homologs (Table S1 in Supplementary Material). Yeast spliceosome complexes are composed of 50–60 core snRNP subunits and about hundred additional splicing-related factors, most of which are conserved in metazoans (Fabrizio et al., 2009). In comparison, the total number of proteins identified in purified Drosophila and human spliceosomal complexes is about 260 and 400, respectively (Rappsilber et al., 2002; Herold et al., 2009; Will and Lührmann, 2011). Based on comparative sequence analysis, Wang and Brendel (2004) predicted 395 genes encoding splicing-related proteins in Arabidopsis, including all putative RNA-binding factors identified earlier by Lorkovic and Barta (2002). Our compilation, based on comparative analysis of mass spectrometry data using the TAIR10 version of reannotated genome sequence, indicates a conservation of about 430 spliceosomal factors in Arabidopsis (Table S1 in Supplementary Material). However, in some cases the relationships between yeast, Arabidopsis, and other metazoan homologs are suggested only by conservation of certain functionally important domains. Thus, verification of composition of Arabidopsis core snRNPs and auxiliary factors of various spliceosomal complexes awaits further analysis of their subunits by mass spectrometry.
In yeast and metazoans, the U1, 2, 4, and 5 snRNAs associate through their U-rich PuAU4–6GPu motives to a heptamer of Sm proteins, while U6 is bound to a similar Sm-like LSm complex (Tharun, 2009; Weber et al., 2010). In Arabidopsis all seven Sm proteins and LSm1, 3, and 6 are encoded by duplicated genes (Figure (Figure1).1). In addition to their U6 snRNP-specific roles, the LSm proteins target aberrant transcripts for decapping and 5′ to 3′ decay. Mutation of Arabidopsis LSm4 (emb1644) causes embryonic lethality, while the lsm5/sad1 point mutation results in a partial loss of function conferring supersensitivity to abscisic acid (ABA) and drought (Xiong et al., 2001).
Subunits of snRNPs show weaker sequence conservation but similar redundancy in Arabidopsis. From the U1 snRNP-associated proteins, several are encoded by at least two genes, while Arabidopsis lacks Prp42 and Snu56 homologs (Figure (Figure1).1). Similarly, the U2AF35, U2AF65, and polypyrimidin tract-binding (PTB) proteins from the splice-site selecting U2 components, as well as numerous U2 snRNP subunits and U2-related factors are encoded by duplicated genes in Arabidopsis. Spliceosome association of other factors that share only domain homology with the U2 snRNP components is nonetheless uncertain and should be confirmed by further proteomics analyses. Genetic studies indicate that inactivation of PRP39a component of U1 snRNP confers late flowering by upregulating the transcription of FLC flowering time repressor (Wang et al., 2007). In contrast, the elf9 mutation of Arabidopsis ortholog of tat-SF1 U2 snRNP component upregulates transcription of a defective splice-isoform of suppressor of overexpression of CO1 (SOC1) causing early flowering (Song et al., 2009). On the other hand, the U2 SPF45 subunit is implicated in DNA recombination and repair both in Arabidopsis and Drosophila, where it is an interacting partner of RAD201, a member of the RecA/Rad51 family (Chaouki and Salz, 2006).
In the case of U5 snRNP, there are multiple Arabidopsis homologs of Prp8 and Brr2 RNA helicases and Snu114 GTPase that play important roles in both activation and dissociation of the spliceosome (Wahl et al., 2009). Nonetheless, analysis of the Arabidopsis NineTeen Complex (NTC) complex (Monaghan et al., 2009) and genetic data suggest that BRR2 is represented probably by only a single locus, while PRP8 and SNU114 are encoded by two differentially regulated genes. The prp8a/sus2 mutation results in a cell division defect of embryonic suspensor, while the emb1507 embryo lethal mutation is located in the BRR2 gene. Different genetic screens led to the identification of maternal effect of embryo arrest 5 (mee5), clotho (clo1), and gametophytic factor 1 (gfa1) mutations of SNU114, which is required for specification of egg cell fate and floral organ number and identity (Liu et al., 2009; Yagi et al., 2009). Similarly, the atropos (ato) mutation of U2 snRNP SF3a60/PRP9 gene compromises egg and central cell fate, and SNU114 is necessary for tissue specific expression of Lachesis (LIS) that encodes the U4/U6 snRNP subunit PRP4 (Moll et al., 2008). A common consequence of all these mutations is the abortion of gametophyte. A potential link to cell death regulation is suggested by high level induction of Yellow-Leaf-Specific Gene 8 (U5-15/YLS8) during late leaf senescence (Yoshida et al., 2001). Expression of the U5 snRNA subunit gene U5-102/PRP6 is cold stimulated and its stabilized 1 (sta1)/emb2770 mutation results in defective splicing and stabilization of the cold-induced COR15A and other unstable transcripts leading to a range of pleiotropic developmental and stress response defects (Lee et al., 2006). Like the ELF9/tat-SF1 subunit of U2 snRNP, PRP6 might function in interaction with the exosome and exon junction complex (EJC) implicated in the recruitment of NMD regulators (Gehring et al., 2005).
The U4/U6 subunits PRP3 and Tri-20 are represented by two Arabidopsis homologs, while U4/U6-15.5/SNU13 is likely encoded by three genes as the Tri65 subunit of U4/U6.U5 tri-snRNP. Mutation of U4/U6 subunits Prp24 and Prp31 lead to embryo lethality (emb140 and emb1220), while inactivation of the RS-domain of Tri120/SNU66 by the meristem defective (mdf) mutation stimulates ectopic meristem formation in vegetative tissues but inhibits root and shoot meristem activities. The mdf mutation reduces the transcript levels of PIN2 and PIN4 auxin influx carriers, as well as those of Plethora, Scarecrow, and Shortroot in the root and Wuschel (WUS) in the shoot meristem (Casson et al., 2009), likely due to splicing defects and destabilization of these transcripts. The mdf mutation is allelic with defectively organized tributaries 2 (dot2) that causes altered vein differentiation pattern in juvenile leaves consistently with defects of auxin transport (Petricka et al., 2008).
The U1 and U2 snRNPs recognize the 5′and 3′ splice-sites (5′SS and 3′SS) and conserved branch sites (BS: AG) of introns. BS is followed by a polypyrimidine tract (PPT) upstream of the 3′SS in metazoans. BS and PPT are poorly conserved in plants (Brown and Simpson, 1998; Simpson et al., 2002), although orthologs of BS-interacting U2AF and PTB proteins are present in Arabidopsis. In vitro assembly studies indicate that the U1 snRNP binds first through U1-C and U1-70K to the 5′SS. Subsequently, PPT and 3′SS are bound respectively by the U2 auxiliary factors U2AF65 and U2AF35 that interact with SF1 at the BP, as well as with a range of SR, hnRNP, Transformer (Tra), and EJC proteins that recognize exonic and intronic splicing enhancer and silencer sequences, contributing to the definition of exons’ positions (for review see Will and Lührmann, 2011). The SR-related Arabidopsis SR45a/Tra-2b factor interacts, for example, with U1-70K and U2F35a to assist splice-site selection, as well as with PRP38 during spliceosome activation (Tanabe et al., 2009). From the 19 Arabidopsis SR-proteins classified into seven subfamilies (Barta et al., 2010), yeast and human orthologs of SR1, SC35, RS33, and RSZ33, as well as the SR-related proteins Tra-1A, B1/2, and SRm160 were identified in purified spliceosomal complexes (Table S1 in Supplementary Material). Transcription and alternative splicing of SR-protein genes is regulated by a multitude of stress and hormonal stimuli, and their known mutations result in pleiotropic regulatory defects (Reddy, 2007). The activity and stability of SR-proteins is regulated by phosphorylation including the Lammer/CLK, SRPK1, and SRPK2 kinases families, as well as by several PRMT arginine methylases that also recognize other classes (e.g., Sm, Lsm, hnRNP, etc.) of spliceosomal proteins (Fluhr, 2008). Arabidopsis PRMT5 was recently demonstrated to methylate several Sm and LSm factors (Deng et al., 2010). The prmt5 mutation results in defective splicing of FLK/hnRNP-E pre-mRNA and late flowering by increasing the FLC transcript level, as well as alters 5′SS recognition leading to aberrant processing of pre-mRNAs encoding components of the circadian clock (Sanchez et al., 2010).
Interaction of U1 with the U2AF-recruited U2 snRNP is stabilized by the ATP-dependent DExH/D-box RNA helicase Prp5. Displacement of SF1 by the SF3b14a subunit of PPT-binding U2-SF3a/b complex is stimulated by Prp5 leading to the formation of prespliceosome complex A (Behzadnia et al., 2007; Figure Figure2).2). Prp5 also facilitates annealing the U2 snRNA with BS, which bulges out an A residue of the intron for the first transesterification reaction. Subsequent recruitment of the U4/U6.U5 tri-snRNP results in the assembly of U1/2/4/5/6 penta-snRNP in the precatalytic complex B (Deckert et al., 2006). Penta-snRNP can be purified from yeast but it is inactive and requires additional factors, in particular the NTC, to form an activated BACT complex (Stevens et al., 2002).
During complex B to BACT transition, interaction of U1 and U1-C with the 5′SS is interrupted by the Prp28/U5-100 helicase, which is activated by the U4/U6.U5 tri-snRNP-associated kinase SRPK2. Subsequently, unwinding the based-paired U4/U6 snRNAs by the Brr2/U5-200 helicase facilitates U6 snRNA interactions with the 5′SS and U2 snRNA. This stimulates the release of U1 and U4 snRNPs, U6 specific Lsm, and Prp24 proteins, as well as the formation of an intramolecular stem-loop (ISL) in U6, which represents the metal-binding catalytic center of the spliceosome. Brr2 is controlled by the interacting Snu114 GTPase and Prp8 U5 subunit. Removal of U2-associated SF3a/b proteins by the Prp2 helicase exposes BS in the remodeled catalytically active complex B*. During step I of splicing, the 2′-OH of BS adenosine residue attacks and forms a covalent bond with the U6 ISL-cleaved 5′SS yielding the 5′-exon and lariat intron-3′exon intermediates.
Catalytic activation of the spliceosome critically depends on its association with the NTC before or during unwinding of the U4 and U6 snRNAs. The NTC regulates the interactions of U5 and U6 with the pre-mRNA before and after step I, as well as formation of the spliceosome’s catalytic center (Chan et al., 2003; Chan and Cheng, 2005). Nucleophilic attack of 3′-OH of 5′ exon at the 3′SS in step II requires further structural rearrangements by multiple factors leading to the formation of complex C. Prp8, anchoring the 5′exon and lariat intron-3′ exon intermediates (Grainger and Beggs, 2005), the Prp16 ATPase, and the NTC subunit Isy1 are involved in monitoring completion of step I and displacing U6 from the 5′SS to liberate it for the catalytic step II. The Prp18 helicase and loop 1 of U5 snRNA juxtapose for ligation of the 5′ exon and 3′SS bound by the interacting NTC subunits Slu7 and Prp22 helicase (Smith et al., 2008). Next, Prp22 deposited downstream of the exon–exon junction disrupts the interaction of Prp8 and U5 with exon sequences, releasing the spliced mRNA from complex C. Dissociation of U2, U5, and U6 is catalyzed by the Prp43 helicase, which is encoded by three candidates genes in Arabidopsis, while Brr2 and Snu114 are thought to unwind and separate the U2 and U6 components of the post-splicing complex (Valadkhan and Jaladat, 2010).
NTC is a regulatory non-snRNP complex that is essential for catalytic activation of the spliceosome (Hogg et al., 2010). When isolated under stringent conditions from in vitro assembly reactions, yeast NTC contains eight core subunits (Prp19, Cef1/Ccd5, Snt309/Spf27, Syf1, Syf2, Syf3/Clf1, Isy1, and Ntc20 listed in Figure Figure3;3; Fabrizio et al., 2009). In contrast, tandem affinity purification of NTC by its tagged Cef1/Cdc5 subunit from budding and fission yeast cell extracts indicates a conservation of at least 26 NTC-associated proteins (Ohi et al., 2002). Recent mass spectrometry analysis of NTC complexes purified by the help of TAP-tagged Prp19, Prp17, and Cwc2 subunits also shows that the eight core subunits are associated in vivo with several Sm proteins, components of the U2 and U5 snRNPs, and at least 30 non-snRNP proteins (Ren et al., 2011). Many of the latter NTC-associated factors were previously found to co-purify with spliceosome A, B, B*, or C, as well as with the RES and THO/TREX complexes. The human core NTC assembled on RNA templates in vitro and purified at high stringency shares only PRP19, CDC5L, and SPF27 with the yeast NTC. The human NTC core carries PRL1/PRLG1 and AD002/HSPC148 orthologs of NTC-associated yeast proteins Prp46 and Cwc15, respectively, and two human specific subunits CTNNBL1/NAP and HSP73 (Grote et al., 2010; Figure Figure33).
Detailed interaction studies between the NTC subunits demonstrate that the PRP19 WD-40-protein forms a homotetrameric platform, in which its U-box domain interacts with SPF27 and the C-terminus of CDC5. Due to heterogeneity of N-terminal sequences of Prp46/PRL1 orthologs, the N-terminus of yeast Prp46 interacts only with Cef1/Cdc5, while N-terminus of human PRL1 binds to PRP19, which might explain why yeast Prp46/PRL1 dissociates from the NTC core under stringent conditions (Ohi and Gould, 2002; Grote et al., 2010). Arabidopsis PRP19, CDC5, and PRL1 show analogous interactions as their human orthologs (Palma et al., 2007). Comparison of mass spectrometry data (Table S1 in Supplementary Material) indicates that nearly all components of larger NTC complexes purified by TAP-tagging from yeast are present in different yeast, Drosophila, and human spliceosome complexes isolated either by immunoprecipitation or in vitro assembly on model pre-mRNA templates (Ajuh et al., 2000; Makarov et al., 2002; Stevens et al., 2002; Zhou et al., 2002; Makarova et al., 2004; Deckert et al., 2006; Behzadnia et al., 2007; Bessonov et al., 2008, 2010; Fabrizio et al., 2009; Grillari et al., 2009; Herold et al., 2009). Remarkably, several components of the NTC core and some associated factors were also detected in the human prespliceosome complex A (Behzadnia et al., 2007). Nonetheless, semi-quantitative mass spectrometry and western blotting studies indicate that the representation of NTC components increases from complex A/B to C dramatically (Makarova et al., 2004; Agafonov et al., 2011).
Arabidopsis NTC purified recently by its tagged SPF27/MOS4 subunit (Monaghan et al., 2009) contains six subunit orthologs of the yeast NTC core [CDC5/Cef1 (MAC1), Prp19a/b (MAC3A/B), SPF27 (MOS4), SYF1 (MAC9), SYF3/CRN1c (MAC10), and ISY1 (MAC8) listed in Figure Figure3].3]. In addition, PRL1 (MAC2) is present in the Arabidopsis complex as in the human NTC core. Intriguingly, Arabidopsis NTC carries two PRL19 paralogs encoded by similarly regulated duplicated genes in Arabidopsis. This suggests that they might form a heterotetramer in analogy to yeast and human NTCs. Compared to yeast, the Arabidopsis complex lacks Syf2, although it has a plant homolog, and also Ntc20, which is a unique component of yeast NTC. Although encoded in the Arabidopsis genome, homologs of human AD002/HSPC148, CTNNBL1/NAP, and HSP73 NTC subunits are also missing from the complex. By contrast, several proteins found in association with the NTC core in yeast and humans, including PRP45/SKIP (MAC6), ECM2-1a/b (MAC5A/B), and PRP17/CDC40 (MAC12), and the complex BACT and C specific factors PRP2a (MAC15), and Aquarius (AQR/MAC7) are parts of the Arabidopsis NTC. More remarkably, U5-220/PRP8a (MAC16), U5-200-2a/BRR2 (MAC14), U5-116-1a/SNU114 (MAC11), and the PRP8-binding WD-40 factor U5-40 (MAC17), which are essential for spliceosome activation, co-purify with NTC from Arabidopsis. Composition of Arabidopsis NTC thus resembles to that of the human 35S U5 snRNA complex characterized earlier by Makarov et al. (2002). While the human 35S U5 snRNP and affinity purified yeast NTCs contain a full set of Sm proteins and other NTC-associated factors that are likely part of functional NTC in vivo, their corresponding homologs are absent from the Arabidopsis NTC.
Precursor RNA processing 19 (Prp19) was first identified in yeast and found in stable association with the spliceosome after dissociation of U1 and U4 snRNPs. The yeast prp19 mutation blocks step I of splicing and results in a temperature-dependent growth defect, likely due to defective splicing of α-tubulin and Bub1 pre-mRNAs (Tarn et al., 1993; Song et al., 2011). Inactivation of mice Prp19/SNEV leads to early embryonic lethality in the blastocyst stage (Fortschegger et al., 2007). Recently, a prp19a prp19b (mac3a mac4b, modifier of SNC1) double T-DNA insertion mutation was reported to suppress innate immunity conferred by a dominant mutation of suppressor of NPR1-1 constitutive 1 (SNC1) without affecting normal plant development (Monaghan et al., 2009). However, other combinations of prp19a and prp19b mutations cannot be isolated in homozygous form, which suggests that PRP19 is also an essential gene in Arabidopsis.
Prp19 carries an N-terminal U-box/RING-finger domain, which confers E3 ubiquitin ligase activity and mediates the formation of ubiquitin K63 chain on the U4 snRNP subunit Ppr3 by enhancing its interaction with Prp8 in the U4/U6.U5 tri-snRNP. Ubiquitinated Prp3 is recognized by the U4/U6 snRNP subunit Prp24/SART3 and deubiquitinated by Usp4 facilitating Brr2/Snu114-mediated dissociation of U4 components before spliceosome activation (Song et al., 2011). While K63 ubiquitin chain formation does not target proteins for proteasomal degradation, intriguingly both yeast and human Prp19 interact in vivo with the β7 catalytic subunit of 20S proteasome suggesting that PRP19 could mediate degradation of some NTC-associated factors (Löscher et al., 2005; Sihn et al., 2007). The central coiled-coil domain of Prp19 is required for its tetramerization, which is essential for NTC and spliceosome assembly (Ohi et al., 2005; Grote et al., 2010). Prp19 tetramerization creates a central stalk of U-boxes and exposes the periferial WD-40 domains for interaction with yeast Cwc2 (i.e., a binding partner of ubiquitin conjugase Ubc3) and Prp17 (Ren et al., 2011). Mapping of NTC subunit interactions shows that the central U-box domain is required for binding Snt309/SPF27, as well as the C-terminus of Cef1/CDC5 and N-terminus of human PRL1 (Chen et al., 2002; Ohi and Gould, 2002; Grote et al., 2010). The presence of PRP19, CDC5, PRL1, and SPF27 in the Arabidopsis NTC suggests that these molecular interactions are likely conserved. Yeast prp19 and snt309 mutants accumulate free U4 snRNA, while U6 snRNA is depleted in the snt309 background. Suppression of growth defect of snt309 mutant by overexpression of U6 snRNA indicates that NTC also controls recycling of spliceosomal snRNPs (Chen et al., 2006).
Human Snt309 is named Breast Cancer Amplified Sequence 2 (BCAS2) and involved as transcriptional co-activator of the estrogen receptor in the regulation of p53-dependent induction of apoptosis (Kuo et al., 2009). Mutation of Arabidopsis Snt309/BCAS2 homolog MOS4 suppresses constitutive activation of innate immunity pathway in the snc1 mutant similarly to mutations of other plant NTC subunits (Palma et al., 2007).
Snt309/BCAS2 is required for stable interaction of Prp19 with CDC5, a member of the Myb transcription factor family. CDC5 shows specific DNA-binding in Arabidopsis, while its mutations leads to accumulation of retained introns in partially spliced pre-mRNAs and G2 cell cycle arrest in yeast (Hirayama and Shinozaki, 1996; Burns et al., 1999). Cell cycle arrest of the yeast cdc5 mutant is suppressed by removal of the intron from the α-tubulin (Tub1) gene (Burns et al., 2002). Cell cycle dependent phosphorylation of CDC5 appears to be an important modulator of NTC-mediated activation of the spliceosome (Gräub et al., 2008). Human CDC5 interacts with NIPP1 (nuclear inhibitor of protein phosphatase 1 absent from Arabidopsis), which recognizes the SF3b155 subunit of U2-SF3a/b complex that becomes hyperphosphorylated during the first splicing reaction. Interaction of CDC5 with NIPP1 probably targets a PP1 phosphatase to the SF1a/b complex leading to its destabilization, which is required for the second step of splicing (Tanuma et al., 2008). Furthermore, human CDC5 directly interacts with hLodestar/HuF2, which is an SF2-like ATP-dependent helicase subunit of SWI/SNF chromatin remodeling complex implicated in transcription termination (Leonard et al., 2003).
C-terminus of CDC5 binds the C-terminal WD-40 domain of Prp46/PRL1 in yeast and human NTCs (Ajuh et al., 2001; Ohi and Gould, 2002). PRL1 encoded by the Pleiotropic Regulatory Locus 1 was first identified in Arabidopsis by a T-DNA insertion mutation causing highly pleiotropic defects characterized by altered regulation of root and leaf development, flowering time, sugar, cold, ethylene, cytokinin, and auxin responses (Németh et al., 1998). Arabidopsis has a PRL1 paralog, PRL2, which is transcribed at low levels in vegetative tissues but shows much higher expression compared to PRL1 during embryogenesis and seed development. Low PRL2 expression in the prl1 mutant thus appears to be sufficient for maintaining plant viability but leads to highly pleiotropic defects. Nuclear import of PRL1 is aided by its interaction with a specific member of the α-importin family, ATHKAP2/IMPA-3 (Németh et al., 1998). Inactivation of IMPA-3 in the Arabidopsis mos6 mutant suppresses constitutive activation of innate immunity by the snc1 mutation similarly to the mos4 (snt309), cdc5 (mac1), and prl1 (mac2) mutations (Palma et al., 2005, 2007). In comparison, human CDC5 is imported into the nucleus by the CTNNBL1 (catenin-β-like 1) armadillo repeat protein, which remains in CDC5L-bound form in the human core NTC and mediates interaction with the PRP31 subunit of U4/U6 snRNP (Grote et al., 2010). Temperature sensitive mutations of fission yeast Cwf1/CDC5 and Prp5/PRL1 show strong negative genetic interactions (i.e., synthetic lethality at permissive temperature) suggesting overlapping functions (McDonald et al., 1999). Removal of the C-terminus of CDC5 results in dissociation of PRL1 from the yeast NTC (Ohi and Gould, 2002) but it is yet unclear whether this leads to overall destabilization of the complex. Virus-induced gene silencing of Arabidopsis CDC5 is reported to activate early senescence causing accelerated cell death independently of salicylic acid (SA) signaling (Lin et al., 2007a). According to Palma et al. (2007), however, the cdc5, prl1, and mos4 mutations stimulate SA production but show npr1-independent activation of the PR genes. RNAi-silencing of CDC5 was also suggested to result in a G2/M cell cycle defect by affecting Shoot Meristemless (STM) and WUS transcription and meristem development. Furthermore, the GABI_278B09 cdc5 T-DNA insertion mutation was found to cause embryo lethality by Lin et al. (2007b). The latter phenotypic trait is however probably caused by an unrelated mutation, since Palma et al. (2007) found that removal of C-terminal PRL1-binding domain from the CDC5 coding sequence by the same T-DNA insertion mutation results in viable plants showing a partial loss of function phenotype similar to that of the prl1 mutant.
Arabidopsis PRL1 interacts with and inhibits in vitro the activity of Snf1-related protein kinase AKIN10 (Bhalerao et al., 1999), a functional homolog of yeast Snf1 that represents a regulatory partner of GCN5 histone acetylase in the yeast RNAPII SAGA co-activator complex (Lo et al., 2001; Liu et al., 2010). Whereas association of SnRK1 with SAGA in plants awaits further confirmation, SnRK1 AKIN10 was found to interact in vivo with the common SKP1 subunits of SCF (Skp1–cullin1–F-box) E3 ubiquitin ligases by targeting them to the α4/PAD1 subunit of 20S proteasome catalytic cylinder (Farrás et al., 2001). Arabidopsis α4/PAD1 is an ortholog of human proteasome subunit α7, which is the closest neighbor of β7 subunit targeted by Prp19 (Löscher et al., 2005). Recently, PRL1 was found to function as substrate receptor subunit of a Cul4–DDB1 ubiquitin ligase, and the prl1 mutation was reported to stabilize SnRK1 AKIN10. It is remarkable that in addition to PRL1, WD-40 repeats of spliceosome-associated THOC6 and FY subunits of the THO/TREX mRNA export, and polyadenylation/cleavage complexes also contain DDB1-binding DWD motives are detected in association with Cul4 and DDB1 in Arabidopsis (Lee et al., 2008). Knockout of mouse PRLG1/PRL1 results in early embryonic lethality as inactivation of mouse PRP19/Pso4 (Löscher et al., 2005; Kleinridders et al., 2009), whereas no homozygous prl1, prl2 and cdc5 null mutants can be obtained in Arabidopsis. These results are consistent with the observations of Ohi et al. (2002) indicating that Cef1/Cdc5, Prp46/PRL1, and Prp19 are essential genes in yeast. In contrast, the yeast snt309 and Arabidopsis mos4 mutations do no cause lethality (Palma et al., 2007). Ecm2/Slt11 is also not essential in yeast but its mutation causes cold sensitivity (i.e., as the prl1 mutation in Arabidopsis; Németh et al., 1998) and impaired splicing (Xu and Friesen, 2001). However, a combination of mutations of corresponding Arabidopsis homologs ECM2A and B (MAC5A/B) is lethal (Monaghan et al., 2010). Ecm2, as the homologous human RBM22 zinc-finger RNA-binding protein is controlled by sumoylation and involved in U2/U6 helix II formation required for spliceosome activation. Ecm2/RBM22 interacts with the step II splicing factors Slu7, Prp16, and Prp17 and its inactivation in zebrafish leads to early embryo lethality as in Arabidopsis (He et al., 2009).
From the remaining conserved NTC components, Syf1/Ntc90/XAB2/MAC9 is essential in yeast and interacts with most NTC subunits, except Prp19, and SPF27/Ntc25. Syf1 is required for recruitment of the coiled-coil factor Yju2/Cwc16/CCDC130, which is transiently associated with NTC before step I of splicing together with Cwc22, and facilitates the first catalytic reaction after Prp2-mediated remodeling of the spliceosome (Chang et al., 2009). Cwc22 is essential for Prp2-mediated release of U2-SF3a/b complex before step I of splicing (Yeh et al., 2011). Prp45/SKIP/MAC6 is essential in budding and fission yeast, shows genetic interactions with step II splicing factors and is required for recruitment of Prp22 helicase mediating spliceosome dissociation (Gahura et al., 2009). In addition, Prp45 interacts in fission yeast with the splice-site selecting U2AF35 factor suggesting an early role in spliceosome assembly (Ambrozková et al., 2001). Transcription of Arabidopsis SKIP/Prp45 is stimulated by ABA, salt, and osmotic stress and its overexpression confers salt and osmotic stress tolerance (Lim et al., 2010).
The Syf3/CRN1/MAC10 tetratricopeptide repeat (TRP) protein is one of the earliest acting NTC proteins, which is essential for loading the U4/U6.U5 tri-snRNP during complex A to B transition. In Drosophila, mutation of CRN1/Syf3 (crooked neck) causes early embryonic lethality (Chung et al., 2002). Isy1/MAC8 interacts with the Prp16 ATPase required for proper recognition of 3′SS and implicated in remodeling of U6. Isy1 also interacts with Cwc21/SRm300, as well a with the trimethylguanosine synthase capping enzyme during early stage of spliceosome assembly (Villa and Guthrie, 2005; Hausmann et al., 2008; Khanna et al., 2009). The PRL1-binding non-essential second step factor Prp17/CDC40/MAC17 is a WD-40-protein, which functions together with Prp16, Prp8, and the SR protein kinase Sky1 in the recognition of 3′SS. Prp17 is essential for splicing of introns longer than 200 nucleotides and shows co-immunoprecipitation with lariat intron–exon intermediates (Dagher and Fu, 2001; Sapra et al., 2008). Finally, from the conserved NTC-associated proteins Aquarius/AQR/MAC7 is found to bind intron sequences in the vicinity of BP and it is identified by the Arabidopsis by the embryo defective 2765 mutation (Hirose et al., 2006). The key step II RNA helicase Prp2 is represented by three potential homologs in Arabidopsis, from which the eps3 mutation of PRP2/MAC5 confers enhanced gene silencing, probably due to destabilization of aberrantly spliced pre-mRNAs and subsequent generation of derived siRNAs (Herr et al., 2006). The maternal embryo effect 29 (mee29) mutation in its paralog Prp2b results in deficient gametophyte/embryo development (Pagnussat et al., 2005).
Taken together, the available genetic data indicate that mutations of essential Arabidopsis NTC components result in lethality. In those cases, where one of the duplicated genes is preferentially expressed during embryo and seed development whereas the other shows higher expression in vegetative tissues, the corresponding single gene mutations yield embryo lethality and pleiotropic effects in seedlings, respectively. Although not all NTC mutants were characterized so far for their pleiotropic defects in such a detail as prl1, all studied NTC deficiencies appear to function as mos suppressors of snc1-induced innate immunity, which is coupled to the induction of cell death in pathogen infected tissues. Recently, the prl1 mutation was also demonstrated to suppress the induction of cell death by singlet oxygen generated through the accumulation of free protochlorophyllide in the fluorescent in blue light (flu) mutant (Baruah et al., 2009). Furthermore, the Arabidopsis mos2 mutation, which has not been connected so far to NTC, is located in a gene encoding a spliceosome complex C specific factor Spp2/GPKOW, which is a binding partner of yeast Prp2 and human protein kinase Cβ2 that shows interaction with PRL1 in human cells (Roy et al., 1995; Németh et al., 1998; Zhang et al., 2005a; Aksaas et al., 2011). Another candidate MOS factor is the complex C specific protein RUVBL1, a chromatin remodeling ATPase inhibitor of apoptosis (Taniue et al., 2011). The Arabidopsis RUVBL1 homolog, TIP49A/RIN1, is an interacting partner of RPM1 receptor required for disease resistance against Pseudomonas syringae pv. maculicola (Holt et al., 2002). It is also apparent that some pathogen produced effector molecules could directly modulate the activity of the spliceosome. Thus, a natural antitumor compound isolated from Pseudomonas functions as spliceostatin A that specifically targets the U2 snRNP SF3b complex and thereby inhibits splicing and nuclear retention of unspliced pre-mRNAs (Kaida et al., 2007).
In addition to controlling spliceosome assembly and activation, NTC is emerging to play a central role in the regulation of DNA damage responses, which is consistent with the effects of NTC subunit mutations observed in Arabidopsis. In yeast, Prp19 was identified as Pso4, mutation of which causes hypersensitivity to the DNA cross-linking agent psoralen, as well as to a broad range of other DNA damaging agents (see for review Legerski, 2009). Expression of human PRP19/Pso4/SNEV is induced by DNA damage and its down-regulation by RNAi results in the accumulation of double-stranded DNA breaks and apoptosis (Mahajan and Mitchell, 2003). Using a DNA interstand cross-link repair assay, Pso4 was purified together with the NTC subunits CDC5, PRL1, and SPF27, from which CDC5 directly interacts with the Werner syndrome DNA helicase (WRN) in complex with the replication protein A (RPA, Zhang et al., 2005b). RPA is essential for stabilization of single-stranded gap structure arising from dual incision of damaged DNA strand, while WRN assists DNA repair replication.
DNA damage causing an arrest of transcription elongation and DNA replication leads to activation of the DNA repair checkpoint, which prevents cells entering or leaving the S-phase. Mutations inactivating the regulators of this checkpoint lead to persistent activation of p53 tumor suppressor stimulating cell death and apoptosis in mammals. Activation of transcription-independent global genome and transcription-coupled nucleotide excision repair (GG-NER and TC-NER) pathways is mediated by different sensory but common executing components (Lagerwerf et al., 2011). In GG-NER, various DNA lesions are recognized by the UV-DDB (Cul4–DDB1–DDB2) and XPC–Rad23B complexes. In TC-NER, the hyperphosphorylated RNAPII stalled by the DNA damage is recognized the Cockayne Syndrome CSB chromatin remodeling ATPase, which subsequently recruits the Cul4–DDB1–CSA E3 ubiquitin ligase and GCN5/p300 histone acetylase complexes. Remarkably, DDB2 and CSA/ERCC8 are WD-40 proteins that in analogy to PRL1 are recruited to DDB1. This suggests that PRL1 in the Pso4 complex might perform a similar substrate receptor function although its target is unknown so far. Cul4–DDB1–DDB2 ubiquitinates histones H3 and H4 contributing to opening the damaged chromatin site. On the other hand, the CSA WD-40 protein is a substrate receptor of CSB, which is being released and degraded before subsequent loading of the TFIIH RNAPII transcription initiation/DNA repair complex to the damaged site. In the case of TC-NER, loading of TFIIH requires the recognition of CSA/CSB by the XAB2/Syf1 NTC subunit, as well as interaction of the XAB2 complex with the transcription elongation factor TFIIS and high mobility group chromatin factor HMGN1 (Fousteri et al., 2006).
The human XAB2 complex contains Aquarius, PRP19, CCDC16/ZNF830 (a component of spliceosome complex B, Table S1 in Supplementary Material), ISY1 and the NTC-associated PPIE peptidyl-prolyl cis–trans isomerase (Kuraoka et al., 2008). Differences between subunit composition of the Pso4 and XAB2 complexes is probably explained by the observation that in response to DNA damage PRP19 undergoes self-ubiquitination, which stimulates the release of CDC5 and PRL1 (Lu and Legerski, 2007). Whether this event leads to switching the role of NTC between splicing and DNA repair remains however to be determined.
Mutations inactivating essential components of the DNA repair pathways lead to prolonged presence of the ATR checkpoint kinase at the damaged DNA sites. Chronic activation of ATR, as well as inhibition of transcription elongation, causes a dramatic increase in phosphorylated forms of p53 and histone H2AX, and ubiquitination of H2A (Lagerwerf et al., 2011). Although not detected in either Pso4 or XAB2 complex, the NTC subunit Syf2 is found in association with the MCM3 and PCNA DNA replication factors and is required for activation of ATR (Chu et al., 2006). The yeast NTC subunit Cfl1/CRN1 is similarly found in a complex with the DNA replication factor Orc2 and its mutation results in delayed entry into the S-phase, indicating a direct role in DNA replication (Zhu et al., 2002). Human CDC5 interacts with and phosphorylated by ATR, and is required for the activation of components of downstream S-phase checkpoint pathway blocking cell cycle progression (Zhang et al., 2009). Knockouts of mouse and zebrafish PRLG1/PRL1 result in nuclear to cytoplasmic translocation of CDC5, stimulation of p53 and histone H2AX phosphorylation, and induction of apoptosis (Kleinridders et al., 2009). This indicates that, despite their lack in the XAB2 complex, both PRL1 and CDC5 are essential for activation of the DNA repair checkpoint and their absence results in stimulation of default cell death pathway in mammals.
The human NTC subunit BCAS2/SPF27 (MOS4) directly interacts with and inhibits the activity of p53. As the prl1 mutation, inactivation of BCAS2 stimulates nuclear retention and phosphorylation of p53 leading to apoptosis, while in p53 mutant cells permits normal G2/M cell cycle arrest (Kuo et al., 2009). Another NTC subunit, SKIP/Prp45 is specifically required for proper splicing of the cell cycle arrest factor p21Cip1. Thus, inactivation of SKIP leads to deregulation of the cell cycle and activation of p53 even in the absence of DNA damage (Chen et al., 2011). Remarkably, PRP19/SNEV/PSO4 overexpression confers tolerance to reactive oxygen species and DNA damaging agents prolonging the life span of human cells (Voglauer et al., 2006). Whether this overexpression effect is due to accumulation of free PRP19, which is not incorporated into NTC, remains a question to answer. In any case, in Arabidopsis that lacks a functional p53 homolog the effects of NTC mutations are just opposite compared to yeast and mammals, as instead of stimulating apoptosis they function as suppressors of induced cell death. It is thus important to clarify how these mutations affect the DNA repair pathways and how do they modulate splicing of downstream effectors of signaling pathways that are activated by the snc1 and flu mutations of pathogen and oxidative stress response pathways.
While splicing defects caused by the Arabidopsis NTC mutations are poorly studied so far, it was observed early on that the Arabidopsis prl1 mutation results in both up and down-regulation of transcription of many stress regulated genes, which is also observable in nuclear run-on transcription assays (Németh et al., 1998). The conclusion that NTC plays a direct role in the regulation of RNAPII transcription and simultaneous co-transcriptional spliceosome assembly is supported now by an overwhelming amount of data (see for reviews e.g., Perales and Bentley, 2009; Muñoz et al., 2010; Oesterreich et al., 2011).
From the core NTC subunits, BCAS2/Snt309, and SKIP/Prp45 are directly recruited to promoters of RNAPII transcribed genes. BCAS2/Snt309 is a transcription co-activator of human estrogen receptor, while SKIP/Prp45 is a common co-activator of several nuclear receptors and a binding partner of multiple histone deacetylases and nuclear co-repressors. SKIP plays a pivotal role in the regulation of transcription in response to Notch and Wnt/β-catenin signaling, and acts as an inhibitor of NADH+-dependent SIRT deacetylase by controlling senescence and aging in conjunction with the retinoic acid receptor (Zhou and Hayward, 2001; Kang et al., 2010; Wang et al., 2010).
Other NTC components are recruited to RNAPII by their contacts with initiation and elongation factors, and spliceosome components. The carboxy-terminal Y1S2P3T4S5P6S7 heptapeptide repeat domain of RNAPII largest subunit (RNAPII CTD) serves as platform for binding and assembly of these regulatory factors (Buratowski, 2009). During transcription initiation, the RNAPII CTD undergoes S5/S7 phosphorylation by the TFIIH-associated Kin28/CDK7 kinases (Akhtar et al., 2009). Phosphorylation of the S5 CTD-residue mediates the recruitment of the capping enzyme, which interacts with the NTC component Isy1 (Hausmann et al., 2008). Subsequent formation of the cap-binding complex CBC and interaction of the U1 snRNP subunit Prp40 with the phosphorylated CTD mediates RNAPII loading of the U1 snRNP through the CBC-binding of U1 subunit Luc7. At the same time, PRP40 recruits the NTC subunit Clf1/Syf3 (Fortes et al., 1999). Arabidopsis PRP40 homologs show similar recognition of RNAPII CTD (Kang et al., 2009) and mutations of the CBC subunits result in the accumulation of retained introns in pre-mRNAs (Laubinger et al., 2008). As U1 snRNP interaction can also be observed with the RNAPII on intronless pre-mRNAs, the U1 snRNP-5′SS commitment complex is only stabilized when the branch site is bound by the U2AF complex. Interaction of the NTC subunit Syf3/Clf1 with U2AF65 and recognition of BP-bound factor SF1 by Prp40 forms a platform for further loading of the U2 snRNP (Chung et al., 2011). In addition, U2AF65 and its interacting SR protein partner SC35 directly bind to RNAPII CTD, and thus play a role in co-transcriptional assembly of the spliceosome (Spiluttini et al., 2010).
Recently, U2AF65 was reported to co-purify with the NTC components PRP19, CDC5, PRL1, and SPF27 and shown to facilitate RNAPII CTD-dependent NTC-mediated activation of splicing (David et al., 2011). The U2 subunit SF3b130 and Cul4–DDB1 are associated with GCN5/p300 histone acetylases of yeast and human SAGA/STAGA RNAPII co-activator complexes. Similarly to activation of NTC-modulated DNA repair, histone acetylation is essential for stabilization of the U2 snRNP in the prespliceosome (Martinez et al., 2001; Gunderson and Johnson, 2009).
Phosphoserine-5 mark of the RNAPII CTD also provides a signal for recruitment of SET1-type histone methyltransferases (Buratowski, 2009). H3K4-trimethyl histone marks deposited by Set1 are recognized by the human SAGA-associated chromodomain protein CHD1, which specifically interacts with the SF3a subcomplex of U2 snRNP (Sims et al., 2007). The U2 snRNP SF3b complex recruits the NTC-associated factor Bud31 and the Ist3 subunit of mRNA retention RES complex, which remain then associated with the spliceosome (Wang et al., 2005). The U2-associated human factor tat-SF1 together with the cap-binding complex plays a role in recruitment of the CDK9/P-TEFb kinase, which stimulates transcription elongation by phosphorylating the S2 residues of RNAPII CTD.
P-TEFb directly interacts with human SKIP and Menin H3K4 methyltransferase (Brès et al., 2009), as well as with exonic splicing enhancer- and silencer-binding SR-proteins, such as SF2/ASF and SRp20. In complex with the HP1 adaptor protein, SRp20 is involved in the recognition of histone H3K9 trimethylation and regulation of alternative splicing (for review see Lenasi and Barboric, 2010). In addition, P-TEFb-mediated phosphorylation of RNAPII CTD S2 residue results in the recruitment of Set2-type histone methyltransferases that deposit H3K36-trimethyl histone marks preferentially on exon sequences. The H3K36me3 chromatin mark is recognized by the adaptor protein MRG15, which in complex with the PPT-binding protein PTB acts as important regulator of alternative splicing (see for review Luco et al., 2011). Interaction of NTC with RNAPII also critically affects transcriptional elongation by recruiting components of the THO/TREX complex required for the formation and nuclear export of messenger ribonucleoproteins (mRNPs, Chanarat et al., 2011).
In this review, we took a glimpse at molecular mechanisms underlying the emerging central regulatory functions of NTC in coordination of spliceosome assembly/activation with transcription, DNA repair/replication, and stress responses related to aging and cell death. In particular, we wished to highlight recent advances in understanding the regulatory functions of Arabidopsis NTC and spliceosome components embedded in a frame of current knowledge derived from similar yeast, Drosophila, and human studies. In the absence of corresponding Arabidopsis data on NTC connections to the regulation of mitotic and meiotic recombination, chromosome segregation, chromatin remodeling, temperature sensing, Polycomb and small RNA-mediated gene silencing, nuclear mRNA export, nonsense mediated decay, and several other essential processes, these subjects remained to be covered by a next timely overview. Compilation of available data on yeast, Drosophila, and human NTC and spliceosome-associated proteins and their conserved plant homologs in the Supplement intends to assist further research to fill this gap.
The authors declare that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.
The Supplementary Material for this article can be found online at http://www.frontiersin.org/Plant_Physiology/10.3389/fpls.2012.00009/abstract
Compilation of mass spectrometry data on protein composition of purified yeast, Drosophila, human, and Arabidopsis spliceosome and NTC complexes. Phosphorylated human proteins reported by Agafonov et al. (2011) are highlighted in red, Arabidopsis factors with indentified regulatory functions are indicated in bold. Core subunits of yeast, human, and Arabidopsis NTC complexes are shaded in gray. References describing the purification of various spliceosomal and NTC complexes are given at the end of the Table corresponding to numbers in bracket indicated in the heading. Brackets within the table mark unique fission yeast factors, which have either Drosophila or human or Arabidopsis homologs. For each factor, the corresponding NCBI gene accession number is depicted and in all cases frequently used alternative designations are listed.
We apologize for those many colleagues whose work we could not cite due to space limitation. Our studies related to this review were funded by the DFG grant KO 1438/12-1 within the Arabidopsis Functional Genomics Network (AFGN).