|Home | About | Journals | Submit | Contact Us | Français|
Export of messenger RNA (mRNA) from the nucleus to the cytoplasm is a critical step in the gene expression pathway of eukaryotic cells. Here, we report the functional and structural characterization of the mammalian TREX-2 complex and show how it links transcription/processing with nuclear mRNA export. Mammalian TREX-2 is based on a germinal-centre associated nuclear protein (GANP) scaffold to which ENY2, PCID2 and centrins bind and depletion of any of these components inhibits mRNA export. The crystal structure of the GANP:ENY2 complex shows that two ENY2 chains interact directly with GANP, but they have different orientations from those observed on yeast Sac3. GANP is required to recruit ENY2 to nuclear pore complexes (NPCs), but ENY2 is not necessary to recruit GANP, which requires both its CID and MCM3AP domains, together with nucleoporin Nup153. GANP and ENY2 associate with RNA polymerase II and inhibition of mRNA processing redistributes GANP from NPCs into nuclear foci indicating that mammalian TREX-2 is associated with transcription. Thus, we implicate TREX-2 as an integral component of the mammalian mRNA export machinery where it links transcription and nuclear export by facilitating the transfer of mature mRNPs from the nuclear interior to NPCs.
The export of mature messenger RNA (mRNA) to the cytoplasm represents the culmination of the nuclear phase of the gene expression pathway (1–4). Transport through nuclear pore complexes (NPCs) is mediated primarily by the NXF1 export factor, but this process has to be tightly coupled to earlier steps in the pathway to ensure that they have been completed successfully (1–5). In mammalian cells, GANP (germinal-centre associated nuclear protein) integrates transcription/processing with nuclear export by facilitating the movement to nuclear pores of mRNPs generated deep in the nucleus (6). GANP binds directly to NXF1 through its N-terminal domain and is thought to shuttle between NPCs and nuclear processing centres to which it becomes relocated when transcription is inhibited (6). GANP is the mammalian orthologue of the yeast Sac3 protein that forms the scaffold of the TREX-2 transcription-export complex that also contains Thp1, Sem1, Sus1 and Cdc31 (7–12) (the corresponding mammalian proteins are PCID2, DSS1, ENY2 and centrins, respectively). The way in which GANP contributes to the gene expression pathway in metazoans is fundamentally different from the way in which its partial analogue, Sac3, contributes in budding yeast. Thus, in yeast, the Sac3-based TREX-2 complex is important in localizing a subset of actively transcribing genes to NPCs (11,13) to facilitate the export of their transcripts in a process referred to as ‘gene gating’ (14). However, in metazoa, GANP appears to function primarily in chaperoning mature mRNPs as they pass from processing centres to nuclear pores (6). Thus, in contrast to yeast, the majority of active genes in mammalian cells lie in transcription foci or ‘factories’ deep within the nucleus (15). Indeed, examination of the dynamics of single mRNPs in human U20S cells indicates that they accumulate in a particulate pattern in the nucleoplasm similar to that seen with the transcription factories (16,17). Importantly, the nuclear distribution of specific mRNPs is not influenced by the relative position of the transcription site in relation to the nuclear envelope (16,17). These fundamental differences may reflect the increased nuclear size, greater incidence of introns and genome complexity of metazoans.
In budding yeast, all the components of the TREX-2 complex are required for efficient mRNA nuclear export (7,8,12,18). We show here that although in human cells GANP forms an analogous TREX-2 complex, its contribution to the gene expression pathway and the integration of transcription/processing with nuclear export differs from that made by the analogous complex in yeast. Thus, although the human TREX-2 complex has a similar composition to its yeast analogue, there are differences in the way in which ENY2 is bound, and, importantly, the localization of the complex to NPCs requires the MCM3AP domain of GANP in addition to the presence of ENY2 and centrins. Furthermore, both GANP and ENY2 are associated with RNA polymerase II, and inhibition of mRNA processing redistributes GANP from NPCs into nuclear foci, suggesting that TREX-2 is associated with transcription.
Full-length ENY2, centrin-2 and centrin-3 were generated by PCR from I.M.A.G.E. cDNA clones (imaGenes) and cloned into pET30a (Novagen). Full-length PCID2 and DSS1 were cloned in to pRSFDuet-1 (Novagen). GANP fragments were generated by PCR from an I.M.A.G.E. cDNA clone and ligated into pGEXTEV (19), a modified version of pGEX-4T-1 (GE Healthcare; Chalfont St Giles, UK) in which the thrombin site has been replaced with a TEV protease site.
For crystallography, a GST-fusion with GANP residues 1162–1235 was co-expressed with untagged ENY2 in BL21 (DE3) CodonPlus RIL cells in 2xTY with conventional isopropyl-beta-d-thio-galactopyranoside (IPTG) induction overnight at 20°C. Cells were lysed by high-pressure cavitation (10–15k psi) in 50mM Tris–HCl (pH 8.0), 25% w/v sucrose, 1mM ethylene glycol tetra-acetic acid (EGTA) and 1mM phenyl methane sulphonyl fluoride. Complete EDTA-free protease inhibitor cocktail (Roche) was added to the lysed cells. Cells were clarified by centrifugation, filtered through a 0.22-µm membrane and bound to glutathione sepharose 4B resin (GE Healthcare) for 1h at 4°C. The resin was washed with 500ml of 50mM Tris–HCl (pH 8.0), 250mM NaCl, 1mM EGTA and 2mM DTT to remove non-specifically bound proteins, and the GANP:ENY2 complex released from the GST tag by overnight incubation with 100µg of His-TEV protease [S219V mutant (20)]. The complex was further purified on a HiLoad Superdex 75 26/60 prep-grade column (GE Healthcare) in 20mM Tris–HCl (pH 8.0), 50mM NaCl, 1mM EDTA and 1mM DTT. SeMet-labeled protein was expressed by IPTG induction in B834(DE3) cells as described (21) and complexes formed and purified as above, but in the presence of 5mM DTT.
Crystals of the native GANP(ENY2)2 complex were grown at 19°C by hanging drop vapor diffusion in 100mM MES pH 6.5, 11% w/v PEG8K and 0.2M NaOAc and were briefly exposed to mother liquor containing 25% w/v glycerol before flash cooling in liquid nitrogen before data collection. Crystals of the SeMet-containing complex were grown in 100mM MES, pH 6.5, 13% w/v PEG8K and 0.3M NaOAc by micro-seeding using a rabbit's whisker and crystals of native complex as a source of seeds. SeMet crystals were cryo-protected in mother liquor containing 20% w/v ethylene glycol before freezing. Crystallographic data were collected either in-house using a Rigaku FR-E+ SuperBright generator equipped with Osmic mirrors and a Mar DTB camera or at beamline I03 at the Diamond Light Source (Didcot, UK) (Table 1).
Crystals of the native GANP(ENY2)2 complex had P21 symmetry and diffracted to 2.1Å using synchrotron radiation. Molecular replacement using various copies of Sus1 from the Sac3:Sus1:Cdc31 complex (18) and the Sgf11:Sus1 complex (22) in PHASER (23) did not yield a satisfactory solution. Instead, high-multiplicity MAD data collected for the Se peak, inflection and remote were processed in PHENIX (24) to determine heavy atom positions and initial phases. The two copies of the complex in the asymmetric unit were built using BUCANEER (23) and ARP/warp (23) and refined using PHENIX (24) alternating with extensive manual rebuilding in COOT (25). This preliminary model was then refined against the 2.1Å native data to generate a final model with an R-factor of 25.7% (Rfree=22.0%), excellent geometry (Table 1) and a MolProbity (26) score of 1.41 (99th percentile).
All proteins were expressed in 2xTY medium with conventional IPTG induction overnight at 20°C. Untagged ENY2, untagged centrin 2/3 and GST-GANP 686-1003 were expressed individually. His-PCID2 and DSS1 were co-expressed from pRSFDuet-1. GST-GANP CID fragments were co-expressed with either untagged ENY2 or untagged centrins. For the GANP CID domain interaction studies, binding assays were carried out by immobilizing GST fusion protein containing complexes on glutathione sepharose 4B resin from clarified bacterial lysates. Cell lysates containing GST-GANP fragments co-expressed with either untagged ENY2 or centrin (as appropriate) were mixed with cell lysate containing untagged centrin or untagged ENY2, respectively, and bound to resin at 4°C for 1h. Resin was washed with 50mM Tris–HCl, pH 8.0, 200mM NaCl and 1mM DTT and samples were analysed by SDS–PAGE. For binding studies involving PCID2, DSS1 and GANP, ternary complex was formed by mixing clarified lysates containing co-expressed His-PCID2:DSS1 and GST-GANP 686-1003 and applying to Ni-NTA resin at 4°C for 1h. Resin was washed with 50mM Tris–HCl, pH 8.0, 300mM NaCl, 40mM imidazole and 5mM β-mercaptoethanol and samples analysed by SDS–PAGE.
For siRNA-mediated depletion, 106 HCT116 cells were transfected (Amaxa AG) with 10µl of 20µM siRNA. We used the following siRNAs: ENY2 siRNA4: AGAGUUGCUGAGAGCUAAA (Qiagen); ENY2 siRNA5-ON-TARGETplus SMART pool against E(Y)2: GGCACACUGUAAAGAGGUA, GAGCAGCGAUUAACCAAAA, AGAGAACGCCUCAAAGAGU, GCUGGAAGGAUCAGUUGAA, Dharmacon); Nup358 siRNA: SMART pool: GCGAAGUGAUGAUAUGUUU, CAACCACGUUAUUACUAA, CAGAACAACUUGCUAUUAG, GAAGGAAUGUUCAUCAGGA (Dharmacon); PCID2 siRNA: AUGCAGAUCAACAGUUGGUAA; centrin-2 siRNA: AAGCUCUUUGAUGAUGAUGAA and AAGCACAUGUAACUAGAUUUA (Qiagen); Nup153 siRNA: AAGCCCUUUAGCGGAUGCAAA (Qiagen); ALY siRNA: GGAACUCUUUGCUGAAUUU (Qiagen)). Cells were harvested either 48, 72 or 96h post-transfection. Efficiency of depletion was monitored by immunoblotting with the indicated antibodies.
Cells were fixed in 2 or 4% paraformaldehyde for 5min at room temperature and permeabilized in PBS, 0.1% Triton X-100 (Sigma) and 0.02% SDS for 10min at room temperature. After 30-min blocking (same buffer+1% BSA), coverslips were incubated with the appropriate primary and secondary antibodies (Molecular Probes) and examined using a Zeiss LSM510 Meta confocal microscope. For some experiments, cells were permeabilized before fixation with PBS, 0.1% Triton X-100 for 5min on ice. Scanning analysis of cells was performed using ImageJ software (NIH). All images used for comparative analysis were acquired using identical microscope settings. A line width of 20 was used, and pairs of cells with nuclei of same scan width as indicated by DAPI staining were used for analysis. All analyses are representative of the cell population.
RNA FISH was performed as previously described (6) using an oligo(dT) primer (Sigma). Immunoprecipitations were performed using nuclear extract as previously described (6). For immunoprecipitations with RNase-treated nuclear extract, 50µl of a 10mg/ml solution of RNase A (Sigma) was added to pre-cleared nuclear extract for 1h at 4°C, after which 5µg of antibody was added for 2h. Samples were incubated as before with 50µl of Protein A Sepharose (GE Healthcare) overnight with rotation, washed with lysis buffer four times and then eluted with 50µl SDS loading buffer. Antibodies used were anti-GANP (raised against recombinant GANP 1050–1250 aa purified in E. coli) (6), anti-ENY2 (raised against full-length recombinant His-ENY2), mAb414 (Babco), β-actin (Abcam), anti-centrin, which recognizes centrin-2 but not centrin-3 (Sigma), anti-centrin-3 (Abcam), anti-RNA polymerase II (CTD4H8) (Millipore) and GFP, which recognizes YFP moiety (Clontech).
Based on the Sac3 CID (Cdc31 interaction domain) motif structure (18), secondary structure predictions and sequence conservation, we identified human GANP residues 1162–1256 as a putative region to which ENY2 and centrins [homologues of Sus1 (10,11) and Cdc31 (7), respectively] might bind (Figure 1A). In humans, multiple centrins have been identified [reviewed in Ref. (27)]. Centrin-1 has tissue-specific expression, and so we focused on centrin-2 and centrin-3 as they are expressed ubiquitously, located in the nucleus and shared 53% sequence identity (27).
GST pulldown experiments confirmed that GANP residues 1162–1256 bound ENY2 and either centrin-2 or centrin-3 in vitro (Figure 1B). To explore the arrangement of the ENY2 and centrin-binding sites within the GANP CID, fragments of GANP encompassing predicted binding sites were tested for binding to ENY2 and centrin-2 or 3 (Figure 1B). The results indicate that the GANP CID domain contains two different sites for ENY2-binding (GANP residues 1162–1200 and 1207–1235) together with a single centrin-binding site (GANP residues 1236–1256).
GANP also interacts with both centrins in vivo as co-immunoprecipitation studies using nuclear extracts from HCT116 colon carcinoma cells and specific antibodies against either centrin-2 or centrin-3 (Supplementary Figure S1) showed that endogenous GANP bound centrin-2 or centrin-3 in vivo (Figure 1C). Importantly, the interaction between GANP and centrin-2 was insensitive to RNase treatment (Figure 1C). The interaction of ENY2 with GANP was confirmed in vivo (Figure 1D) using a fragment generated by the fusion of YFP to full-length ENY2 that bound to endogenous GANP, whereas YFP alone did not (Figure 1D). Additionally, co-immunoprecipitation studies using specific antibodies against either ENY2 or GANP showed that endogenous ENY2 bound GANP in vivo and endogenous GANP also bound ENY2 (Figure 1D).
In yeast, the complete TREX-2 complex contains Thp1 and Sem1 in addition to Sac3, Sus1 and Cdc31 (7,8,11,12). Human TREX-2 has a similar composition and GANP (Sac3 homologue), in addition to binding ENY2 (Sus1 homologue) and centrin-2 (Cdc31 homologue), also binds the human Thp1 homologue PCID2 and Sem1 homologue Dss1 (Figure 1E). Thus, GST pulldown experiments showed that GANP residues 686–1003 (encompassing the conserved Sac3 homology domain) bound PCID2 and DSS1 (Figure 1E). Taken together, these results suggest that both the CID motif and PCID2/Thp1 and DSS1/Sem1 binding is conserved from yeast to human cells.
The structural basis of the interactions between GANP and ENY2 was determined using X-ray crystallography. Despite exhaustive trials, we were unable to obtain crystals of a complex containing GANP, ENY2 and centrin-2 or -3; however, we did obtain crystals of GANP residues 1162–1235 complexed with ENY2 that had P21 symmetry and diffracted to 2.1Å resolution. Se MAD data was used to obtain phases, and the preliminary atomic model obtained was refined against the 2.1Å native data to yield a final structure that had an R factor of 25.7% (Rfree=22.0%) with excellent geometry (Table 1) and contained GANP residues 1162–1234 and two ENY2 chains in which there was electron density for residues 7–100. Figure 2A illustrates the structure in which GANP forms a 9.6-nm gently undulating α-helix to which two chains of ENY2 bind (ENY2A and ENY2B bind to GANP residues 1168–1196 and 1204–1232, respectively).
The ENY2 fold resembled that of its yeast orthologue, Sus1 (18) (Figure 2A), but the higher resolution obtained in the present study enabled structural features to be defined more confidently. The ENY2 fold was based on five α-helices separated by putative hinge regions that were arranged into an anti-parallel hairpin allowing ENY2 to wrap around the GANP helix (Figure 2A–C). Both ENY2-binding sites on GANP gave rise to a hydrophobic stripe that wound around the GANP helix (Figure 2C) and contained putative salt bridge-forming residues and cysteine residues that form putative hydrogen bonds with ENY2.
Notably, the crystal structure shows that the orientation of the two ENY2 molecules relative to one another was significantly different from the orientation of the two Sus1 molecules observed in the yeast Sac3 CID complex. In the GANP complex, the two ENY2 molecules have approximately equivalent azimuthal orientations (Figure 2A and B), whereas in the Sac3 complex the two Sus1 molecules are rotated by ~90° azimuthally (Figure 2A and B). This rotation comes about, because there is an additional residue in the alpha helix between the two ENY2 sites and GANP compared to the corresponding region in Sac3. As a result of this rotation, the exposure of the GANP helix in the complex is different from that seen for Sac3. In the Sac3 CID complex, a continuous face of the Sac3 helix is exposed (Figure 2A) as it traverses from Sus1A to Sus1B, whereas there is not a corresponding exposed face in the GANP complex. Instead, two opposing faces of the GANP helix are exposed (Figure 2A). Consequently, the Sus1/ENY2-binding regions of Sac3 and GANP have quite different surface topologies. Moreover, the different spatial positioning of the two ENY2 chains would prevent them from both binding to a single target molecule in the same way as the two Sus1 chains bound to Sac3 (Figure 2A).
Previously, GANP was shown to function in mammalian mRNA export (6). To test if its binding partners, ENY2 and centrin-2 also, are necessary for mRNA export in human cells, we depleted both using siRNA and assayed poly(A)+ RNA localization by RNA FISH. Two individual siRNAs (siENY2-4 and -5) against ENY2 and against centrin-2 significantly decreased the levels of RNA and protein for ENY2 and centrin-2, respectively (Figure 3A and B). Whereas most poly(A)+RNA was cytoplasmic in control cells, except for a few discrete foci in nuclei, ENY2 or centrin-2 depletion resulted in nuclear accumulation of poly(A)+ RNA (Figure 3C). Depletion of the human homologue of Thp1, PCID2 (Figure 3B), an additional component of the yeast TREX-2 complex (8,28) also resulted in a nuclear accumulation of poly(A)+ RNA (Figure 3C). Therefore, these results indicate that GANP, PCID2, ENY2 and centrin-2/3 all function in mRNA export in human cells, consistent with their forming a putative TREX-2 complex analogous to that present in yeast. Indeed, expression of ENY2 in yeast cells could partially rescue mutant phenotypes resulting from a deletion of Sus1, including growth at elevated temperature and synthetic lethality when combined with mutations in yeast Mex67 (NXF1 in humans) (Supplementary Figure S3).
Previously, GANP was shown to localize to both the nuclear envelope and nuclear interior (6). To map the region of GANP necessary for NPC localization, we expressed GANP fragments fused to GFP in HCT116 cells and observed their cellular location by fluorescence microscopy (Figure 4A). We first analysed the location of GANPΔCID-GFP, because, in yeast, the Sac3-CID region was shown to function in the localization of TREX-2 to NPCs (7,18). Surprisingly, GANPΔCID-GFP remained localized to the nuclear envelope, suggesting that the CID domain is not necessary for the nuclear envelope localization of GANP. Indeed, a construct containing only the CID domain fused to GFP localized throughout the cell, in both the cytoplasm and nucleus, with no obvious nuclear envelope staining (Figure 4A). In GANP, the CID domain lies immediately upstream of the MCM3AP domain, which is not readily apparent in Sac3 (Supplementary Figure S1). The MCM3AP protein is an inhibitor of DNA replication initiation (29,30) and can be transcribed independently of GANP even though the nucleotide sequence for MCM3AP is completely contained within the 3′ region of the sequence of GANP (31). Interestingly, GANPΔMCM3AP-GFP did not localize to the nuclear envelope, but was instead distributed throughout the nucleoplasm (Figure 4A), suggesting that the MCM3AP domain is necessary for the NPC localization of GANP. However, overexpressed MCM3AP localizes to the cytoplasm, and overexpression of MCM3 is required for the nuclear import of MCM3AP (29). This suggests that both the CID domain and the MCM3AP domain of GANP may be contributing to its NPC localization. Consistent with this hypothesis, a GFP fusion containing both the CID region and the MCM3AP region (CID+MCM3AP-GFP) was localized primarily to the nuclear envelope with weaker cytoplasmic staining (Figure 4A). When cells were permeabilized before fixation, CID+MCM3AP-GFP displayed strong nuclear envelope staining (Supplementary Figure S4). We conclude that the CID domain by itself is not sufficient for the nuclear envelope localization of GANP, but that the CID and MCM3AP domains together are necessary and sufficient for localizing GANP to the nuclear envelope.
We next determined the cellular localization of ENY2 and its role in GANP localization. Immunofluorescence of permeabilized HCT116 cells showed strong nuclear envelope staining of ENY2 that was abolished following siRNA-mediated depletion of ENY2 (Figure 4B). This staining co-localized with endogenous GANP staining. However, following depletion of ENY2, GANP nuclear envelope staining was still observed (Figure 4B). In contrast, depletion of GANP abolished both GANP and ENY2 nuclear envelope staining (Figure 4B). Additionally, depletion of PCID2, a component of the human TREX-2 complex, had no effect on the localization of GANP (Figure 5G). Thus, the localization of ENY2 to NPCs is dependent on GANP, whereas the localization of GANP to NPCs is independent of ENY2 and PCID2.
GANP was shown by co-immunoprecipitation to associate with Nup153 and Nup358, nucleoporins on the nuclear and cytoplasmic faces of the NPC, respectively (6). To further explore these interactions, we depleted Nup153 and Nup358 by siRNA. Interestingly, depletion of Nup153 (Figure 5B) redistributed GANP into nuclear foci, with less at the NPC (Figure 5A). In contrast, depletion of Nup358 had no effect on the localization of GANP (Figure 5A). This change in GANP localization following Nup153 depletion suggests that GANP may mediate transport of mRNPs from sites of transcription to NPCs for efficient mRNA export. In support of this hypothesis, GANP and ENY2 both co-immunoprecipitate with endogenous RNA polymerase II (Figure 1D and Supplementary Figure S5). We also used RNase digestion to demonstrate that these interactions were direct and not mediated by both proteins binding to a single mRNA chain. Thus, whereas RNase treatment resulted in ALY and centrin-2 (two proteins that do not interact directly) no longer being associated with each other (Figure 1C), instead, we observed an increase in the amount of RNA polymerase II co-immunoprecipitated with the GANP antibody from RNase-treated nuclear extract, even though an equivalent amount of GANP was immunoprecipitated from untreated nuclear extract (Figure 5D). A similar increase in the amount of GANP co-immunoprecipitated with the RNA polymerase II antibody following RNase treatment was also seen (Figure 5E). This suggests that TREX-2 may be associated with transcription. The TREX component ALY is recruited to mRNP complexes generated by splicing (32), and the TREX complex is recruited to the 5′-end of mRNA (33). Thus, recruitment of the TREX complex to sites of transcription during mRNP maturation is thought to be one of the earliest steps in mRNA export (34). Interestingly, inhibition of mRNA processing via depletion of the upstream mRNA export factor, ALY (Figure 5C), altered the localization of GANP and resulted in retention of mRNPs and GANP in punctate nuclear foci (Figure 5F and G), with a reduced level of GANP staining observed at the NPC. Taken together, our results implicate TREX-2 in the recruitment and transport of mRNPs from the nuclear interior to NPCs.
Our results have demonstrated that in mammalian cells, GANP, ENY2 and centrins form a complex that links transcription with nuclear mRNA export. Although an analogous complex is present in yeast, their function in the gene expression pathway differs in several important respects and underlies the differences that are inherent in these pathways between yeast and metazoans. Thus, although metazoans possess all the orthologous proteins corresponding to those found and participating in gene gating in yeast, this phenomenon has not been observed directly in metazoans. Instead, the majority of active genes in mammalian cells lie in transcription foci or ‘factories’ deep within the nucleus (15). Our findings implicate TREX-2 as a crucial factor linking these mRNP export complexes in the nuclear interior to NPCs.
Our immunofluorescence and co-immunoprecipitation data suggest that a proportion of TREX-2 is localized at or near sites of transcription mediated by RNA polymerase II deep in the nuclear interior of mammalian cells. Indeed, depletion of the mRNA export adaptor ALY resulted in the retention of mRNPs and GANP in punctate nuclear foci (Figure 5). Therefore, when proper mRNP assembly is inhibited, GANP's distribution is altered from predominantly NPC bound to predominantly in nuclear foci, suggesting that mammalian TREX-2 functions in a step downstream of transcription and splicing. These findings also indicate that there is a degree of co-operation between the TREX and TREX-2 complexes in mammalian cells. In mammals, TREX is recruited to mRNPs in a 5′ cap-dependent and splicing-dependent manner (3,9,33). The loss of the THO complex and UAP56 coupled with recruitment of the mRNA exporter NXF1 by ALY signals maturation of the mRNP particle to a transport competent state (3,9). We propose that it is at this stage of the gene expression pathway that the TREX-2 complex becomes attached to the mRNP particle through the interaction of the GANP component (residues 1–313; nucleoporin homology domain) with NXF1 (6) and that this attachment facilitates efficient delivery of mRNPs to NPCs.
We propose that upon reaching NPCs, GANP becomes attached via nucleoporin Nup153. In yeast, the Sac3 CID region and associated chains target Sac3 to NPCs. The different surface topologies of the Sac3 and GANP CID regions (brought about by the difference in the azimuthal orientation of the two ENY2/Sus1 chains on GANP/Sac3) suggest that these regions of GANP and Sac3 may differ in the way in which they interact with other components of the gene expression machinery. Additionally, there are likely differences in the way in which centrins bind GANP and Sac3. The crystal structure of a peptide from Xeroderma pigmentosum Group C protein bound to centrin-2 (35) suggests how centrin-2 might bind to the GANP CID (Supplementary Figure S2A). Consistent with the differences in topology of the GANP and Sac3 CID regions, we observe that the GANP CID region needs to be augmented by the MCM3AP domain to elicit NPC-binding activity (Figure 4), in contrast to yeast in which the Sac3 CID domain is sufficient for localization and in which Sac3 lacks an MCM3AP domain.
Our findings have implications for the potential role of mRNA export factors in cancer progression (36). GANP is upregulated in a variety of lymphomas (37) raising the important question of whether mRNA export derangements contribute to the development of cancer. Intriguingly, two recent studies have implicated components of TREX-2 in both embryonic stem cell specification in mammals (38) and in replication fork progression and stability during transcription in yeast (39). The structural and functional characterization of TREX-2 in mammalian cells thus provides a platform to understand its potential role in these processes in humans.
The coordinates and structure factors for the crystal structure of the GANP:ENY2 complex have been deposited in the Protein Data Bank under accession code 4DHX.
Supplementary Data are available at NAR Online: Supplementary Figures 1–5.
Medical Research Council (MRC) and Cancer Research UK; Wellcome Trust and Medical Research Council grant (U105178939 to M.S.) in part. Funding for open access charge: University of Cambridge, Medical Research Council, Wellcome Trust.
Conflict of interest statement. None declared.
V.O.W. acknowledges the support of Ashok Venkitaraman during the latter half of this study. S.L. was a recipient of an EMBO short-term fellowship to study in the lab of RAL.