To understand the structure and function of large molecular machines, accurate knowledge of their stoichiometry is essential. In this study, we developed an integrated targeted proteomics and super-resolution microscopy approach to determine the absolute stoichiometry of the human nuclear pore complex (NPC), possibly the largest eukaryotic protein complex. We show that the human NPC has a previously unanticipated stoichiometry that varies across cancer cell types, tissues and in disease. Using large-scale proteomics, we provide evidence that more than one third of the known, well-defined nuclear protein complexes display a similar cell type-specific variation of their subunit stoichiometry. Our data point to compositional rearrangement as a widespread mechanism for adapting the functions of molecular machines toward cell type-specific constraints and context-dependent needs, and highlight the need for deeper investigation of such structural variants.
Little is known about remodeling and composition of protein complexes across different cell types in higher eukaryotes. Previous studies have shown that protein complex composition can be subjected to temporal variation in yeast, for example, throughout the cell cycle (de Lichtenberg et al, 2005). To the best of our knowledge, only a handful of protein complexes have been shown to vary across different cell types so far (Noda et al, 2000; Murata et al, 2007; Liu et al, 2008; Wu et al, 2009), but whether protein complexes are rearranged as a function of cell type has never been addressed on a global scale. Here, we systematically study the exact compositions of nuclear pore complexes (NPCs) from several human tissue cultures and also quantify cell type-dependent variations of well-characterized nuclear protein complexes in general.
The NPC is one of the most intricate molecular machines of eukaryotic cells and conducts the transport of molecules in and out of the nucleus. It is built from ~30 nucleoporins (Nups) that assemble in multiple copies to form an eight-fold rotationally symmetric complex (Hoelz et al, 2011). NPCs possess different hierarchical levels of structural organization: Nups first assemble into hetero-oligomeric subcomplexes, which in turn, serve as modular building blocks to form larger structures. The best currently characterized modules are the so-called Nup107 and Nup93 subcomplexes, which are essential architectural elements of the NPC scaffold, and the Nup62 subcomplex that constitutes a major transporter module (Brohawn et al, 2009). Accurate knowledge of Nup copy numbers per NPC is a crucial prerequisite toward the generation of structural models that explain how subcomplexes are arranged in the higher order assembly and for a detailed understanding of the transport mechanism. Based on semi-quantitative investigations of Nup stoichiometry (Rout et al, 2000; Cronshaw et al, 2002), a head-to-tail arrangement of the yNup84 subcomplex (human homolog Nup107) has been proposed, which relies on 16 copies of this scaffold motif per NPC (2 copies per unit cell) (Alber et al, 2007b; Kampmann and Blobel, 2009). Alternatively, the fence pole (Debler et al, 2008) and the lattice model (Brohawn and Schwartz, 2009) have assumed four copies per unit cell. Since there is no consensus on the relative as well as absolute stoichiometry, those models disagree even on fundamental aspects, such as the overall size, the orientation of known subcomplexes and the total molecular weight. In this study, we used targeted proteomics and super-resolution microscopy to establish the abundance of all human Nups within the NPC on an absolute scale. 
In contrast to previous studies, our data imply a higher baseline of copy numbers and the existence of multiple distinct structural species of subcomplexes in situ. Furthermore, we found that the abundance of a large subset of peripheral Nups is rearranged as a function of the cell type, while the NPC scaffold structure remains steady. To investigate whether the compositional changes of the NPC are the rule or an exception, we used shotgun proteomics to analyze a set of well-defined nuclear protein complexes across five human cell lines. With only a few cell types at hand, we already found rearrangements for 38% of the studied complexes, implying that most molecular machines are likely to be fine-tuned at the quaternary structure level to comply with the particularities of different cell types.
To determine the exact stoichiometry of all human Nups when subcomplexes are assembled into NPCs, we used targeted mass spectrometry (multiple reaction monitoring, MRM) in combination with absolutely quantified (AQUA) internal standard peptides, as previously described (Gerber et al, 2003; Picotti et al, 2009). We quantified all Nups within nuclear envelopes (NEs) that were purified to homogeneity from HeLa cells using an improved sub-cellular fractionation procedure (Supplementary Materials and Methods). We performed a series of quality control experiments for our preparations to rule out leakage of NPC components during the isolation and ensure efficient removal of nucleoplasmic material following chromatin digestion (Supplementary Figure S1). For 29 out of 32 Nups, the absolute abundance was determined with high confidence, namely with two or more independent peptide measurements per protein (Figure 1; Supplementary Table S1). More than 80% of the selected assays displayed <20% variation across four biological replicates (Supplementary Figure S1F), corroborating the high quality of the data. Nup abundances span about one order of magnitude and, in most cases, occur as multiples of each other (Figure 1). Remarkably, our data reveal an unexpected compositional architecture of the NPC. Within subcomplexes that were biochemically defined in vitro (Siniossoglou et al, 2000; Kampmann and Blobel, 2009; Amlacher et al, 2011), Nups often do not occur iso-stoichiometrically in situ. These data indicate either the co-existence of the same subunit in different assemblies or the presence of hetero-oligomers that are not iso-stoichiometric. For example, two members of the Nup93 subcomplex, Nup205 and Nup53, have the same abundance as the majority of members of the Nup107 subcomplex. However, Nup188 occurs at half of this abundance and Nup93 as well as Nup155 are 1.5 times as abundant. 
Interestingly, the abundance of Nup93 equals the sum of Nup188 and Nup205 abundances supporting the existence of two independent structural species (Kosova et al, 1999; Theerthagiri et al, 2010). Here, we provide highly detailed stoichiometric information of all Nups, which is a key prerequisite for further structural investigations, in particular studies relying on the reconstitution of subcomplexes and structural modeling.
To translate the Nup stoichiometries (protein ratios) into copy numbers per NPC, previous studies relied on the assumption that, due to the eight-fold rotational symmetry, the least abundant Nups occur in eight copies per NPC (1 copy per asymmetric unit) (Cronshaw et al, 2002). It has been argued that Nups occur at three discrete abundance levels corresponding to 8, 16 and 32 copies per NPC (Alber et al, 2007a). As a consequence, 16 copies per NPC were assumed for the majority of scaffold Nups, which would correspond to 2 copies per asymmetric unit distributed pseudo-symmetrically across the NE plane. In contrast to the aforementioned studies, we observed two discrete populations of Nups that are less abundant than the scaffold level (two- and four-fold less abundant; Figure 1). If the least abundant Nups occur in eight copies per NPC as previously assumed, then this finding implies that at least 12 Nups (including, e.g., Nup107) occur in 32 copies per NPC, while at least 5 others (including, e.g., Elys) occur in 16 copies. Finally, Gle1 and Nlp1 are only a quarter as abundant as the scaffold Nups, which would correspond to 8 copies per NPC.
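The conversion from protein ratios to copy numbers described above can be sketched as follows. This is a minimal illustration, assuming the least abundant class is anchored at eight copies per NPC as in the earlier studies; the function name `copies_per_npc` and the relative abundance values are illustrative placeholders, not measured data.

```python
def copies_per_npc(relative_abundance, anchor_copies=8):
    """Scale relative abundances so the least abundant Nup gets anchor_copies."""
    lowest = min(relative_abundance.values())
    return {nup: round(anchor_copies * value / lowest)
            for nup, value in relative_abundance.items()}

# Illustrative ratios only: Gle1 at the lowest level, Elys two-fold and
# Nup107 four-fold more abundant, mirroring the levels described in the text.
ratios = {"Gle1": 1.0, "Elys": 2.0, "Nup107": 4.0}
copies = copies_per_npc(ratios)
print(copies)
```

Changing `anchor_copies` shows how strongly every derived copy number depends on the anchoring assumption, which is why an independent measurement of one Nup is needed.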
To directly measure the copy number of Nup107 per NPC, we used an integrated approach. First, we accurately counted nuclei isolated from HeLa cells by FACS using an internal fluorescent bead standard for calibration and cross-validated this method by an independent counting scheme (Supplementary Materials and Methods). Subsequently, we quantified the total copies of Nup107 contained in a defined number of nuclei by targeted MS. We estimated Nup107 to occur at ~125±6 thousand copies per nucleus. Combined with the average number of NPCs per nucleus, these data should ultimately reveal the copies of Nup107 per NPC. Next, we acquired a large set of cryo electron tomograms of NEs and determined the density of NPCs (Supplementary Figure S2A). To extrapolate the NPC density to the number of NPCs per nucleus, we measured the nucleus surface area using fluorescent dyes and confocal microscopy (Supplementary Figure S2B). By combining these measurements, we estimated an average number of 3376±1304 NPCs per nucleus, in agreement with previous reports (Maul et al, 1972; Ribbeck and Gorlich, 2001). Taken together with the number of molecules per nucleus, these data result in an average of 37±14 Nup107 copies per NPC.
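The arithmetic behind the final estimate can be reproduced with a short calculation. The sketch below assumes that the two uncertainties are independent and combines them by first-order propagation of relative errors; the text does not state how the reported ± values were derived, so this is one plausible reading rather than the authors' procedure.

```python
import math

def ratio_with_error(num, num_err, den, den_err):
    """Return num/den and its uncertainty, assuming independent errors."""
    ratio = num / den
    rel_err = math.sqrt((num_err / num) ** 2 + (den_err / den) ** 2)
    return ratio, ratio * rel_err

# Values from the text: Nup107 copies per nucleus and NPCs per nucleus.
copies, copies_err = ratio_with_error(125_000, 6_000, 3376, 1304)
print(f"{copies:.0f} ± {copies_err:.0f} Nup107 copies per NPC")
```

Under this assumption the uncertainty is dominated by the NPC count per nucleus (about 39% relative error) rather than by the MS measurement (about 5%).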
As an independent measurement, we used super-resolution microscopy to count the Nup107 protein copies per NPC. Fluorophore counting of the mEos2 protein by iterative photo-conversion with subsequent bleaching was previously established (Annibale et al, 2011; McKinney et al, 2009). To efficiently integrate mEos2 into NPCs, we engineered a stable cell line derived from human embryonic kidney cells (HEK293-Flp-In-T-REx) that co-expresses two microRNAs silencing endogenous Nup107 and a replacement gene encoding Nup107 N-terminally fused to mEos2. We observed efficient silencing of the endogenous Nup107 and co-occurring expression of the mEos2-tagged protein with a replacement efficiency of about 80–90% (Figure 2A). The use of genetically encoded fluorescent proteins has one major disadvantage: these proteins do not necessarily mature into functional fluorophores (Ulbrich and Isacoff, 2007). Since this maturation efficiency cannot be assessed easily, fluorophore counting can only establish a minimal number of mEos2-Nup107 fusion protein copies in the NPC. In other words, if the experimentally determined copy number is significantly larger than 16, then the NPC could contain 32, 64 or more, but not 8 or 16 copies of mEos2-Nup107 (Supplementary Figure S3; Supplementary Materials and Methods). To observe signals with high optical sectioning near the coverslip, we imaged purified nuclei that were allowed to settle on a coverslip using total internal reflection fluorescence (TIRF) microscopy. We acquired time-lapse movies of sequential photo-conversion/photo-bleaching of mEos2 molecules until complete bleaching was reached, as previously described (Betzig et al, 2006; Supplementary Movie S1). A clustering algorithm based on density (DBSCAN) (Ester et al, 1996) allowed us to select photo-conversion events corresponding to isolated, individual NPCs. 
The reconstructed images show individual NPCs as circular patterns (Figure 2B; Supplementary Figures S4A–D), which is to our knowledge the first time that the ring-like arrangement of the major scaffold component Nup107 was directly visualized by super-resolution microscopy. The number of photo-conversion events detected in each NPC was corrected for fluorophore blinking using the method developed by Annibale et al (2011). Although the approach employed here is likely to systematically underestimate the copy number of mEos2-Nup107 (as explained above), the majority of all measured events (90%) account for >16 copies per NPC (Figure 2C).
To validate our workflow and test if we could, in principle, resolve an overlay of multiple structural species of different copy numbers, we performed Monte-Carlo simulations considering three alternative scenarios, 16, 32 and 64 copies of Nup107 per NPC. These simulations take into account a stochastic NPC position within the NE, a stochastic labeling density across individual NPCs, a rotational distribution of fluorophores around the central channel as well as the photo-conversion, blinking and bleaching of mEos2. All input parameters are based on experimentally determined values (see also Supplementary Materials and Methods and Supplementary Figure S4E). The synthetic data recapitulate the experimental results very well (Supplementary Figures S4F–H), that is, they are most consistent with a scenario of 32 copies and argue for a monodisperse distribution of structural species.
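The logic of comparing alternative copy-number scenarios can be illustrated with a much simpler simulation than the one described above. The sketch below models only incomplete detection (for example, incomplete replacement and fluorophore maturation) as a single per-copy probability; it ignores NPC position, fluorophore geometry, blinking and bleaching. The value of `detection_probability` is a hypothetical placeholder, not a parameter from the study.

```python
import random

def simulate_counts(true_copies, detection_probability, n_pores, seed=0):
    """Draw the number of detected fluorophores per pore (binomial sampling)."""
    rng = random.Random(seed)
    return [sum(rng.random() < detection_probability for _ in range(true_copies))
            for _ in range(n_pores)]

def fraction_above(counts, threshold):
    """Fraction of pores with more detected fluorophores than threshold."""
    return sum(c > threshold for c in counts) / len(counts)

for scenario in (16, 32, 64):
    counts = simulate_counts(scenario, detection_probability=0.6, n_pores=2000)
    print(scenario, "copies ->", fraction_above(counts, 16), "of pores above 16")
```

Because detection can only lose copies in this model, a scenario with 16 true copies can never yield a pore with more than 16 counts. This is the reasoning by which a large fraction of pores above 16 counts argues against the 16-copy scenario.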
Finally, the sum of the molecular weights of all copies of all Nups in our study is 110±10 MDa when 32 copies of Nup107 are assumed (Supplementary Table S1). This is in excellent agreement with molecular weight measurements of the vertebrate NPC by scanning transmission electron microscopy (112±11 MDa) (Reichelt et al, 1990). We furthermore assessed the structural plausibility of the molecular mass by volumetric matching into the cryo electron microscopy map of the human NPC (Maimon et al, 2012; Supplementary Figure S2C). Although the calibration experiments described above are on their own less accurate than the stoichiometric measurements by MS, taken together, several independent lines of evidence suggest that 32 copies per NPC should be considered as an alternative for the major scaffold module.
To ensure that the established stoichiometry serves as a general principle for all human cells, but also to test for the opportunity of cell type-specific adjustments that might have a functional impact, we used our targeted proteomic assays to quantify NPC composition across various human tissue culture cells. First, we used gene expression data to screen for cell lines that displayed very pronounced variations in Nup mRNA expression levels and identified three human cancer-derived cell lines: K562, an erythroleukemic cell line; RKO, a colon carcinoma-derived cell line; and SK-MEL-5, a malignant melanoma-derived cell line (Supplementary Figure S5). To investigate whether the different mRNA profiles are reflected by Nup protein abundance and ultimately differential NPC composition, we compared nuclei isolated from these three cell lines as well as HeLa and non-tumor derived HEK293 cells by targeted proteomics. We consistently analyzed the profile of 29 Nups across the 5 cell lines in three biological replicates and identified significant changes in the abundance of 7 Nups across at least two cell types (Figure 3A). We observed cell-specific changes for two nucleoplasmic Nups (Nup50 and Tpr), two cytoplasmic Nups (Nup214 and Aladin), two out of the three transmembrane Nups (Gp210 and Pom121) and one scaffold Nup (Nup37). Our analysis thus implies dynamic rearrangements of NPC stoichiometry across cell types. This is primarily true for peripheral Nups, while the NPC scaffold structure remains static (Figure 3B). The observed differences prompted us to study Nups of varying abundance in a variety of human tissues. For five out of these seven Nups (Gp210, Pom121, Aladin, Nup50 and Tpr), the variation at the protein level could be correctly inferred from gene expression data (Figure 3A). Therefore, we analyzed mRNA expression profiles of these 5 Nups across 44 tissue types and 30 disease states (Figure 4). 
We detected significant tissue-specific expression in distinct cell types such as stem cells, brain-derived tissues and certain classes of immune cells as well as in several diseases including various cancers and inflammatory disorders (Figure 4).
To determine whether cell type-specific compositional changes, as observed in case of the NPC, are rare or represent a more generic principle underlying functional variations of molecular machines, we analyzed the expression profiles of the subunits of a defined set of well-characterized nuclear protein complexes by label-free shotgun proteomics. For this purpose, we used nuclear extracts from the same five cell lines that we used to investigate NPC composition. Since many protein complexes share components (Gavin et al, 2006; Krogan et al, 2006), we manually selected 34 complexes with minimal overlap that were detected with sufficient coverage across the 5 cell lines in our data set (see Materials and methods, Supplementary Figure S6 and Supplementary Table S2). As an intrinsic measure for protein complex integrity, we used coherence measurements of the abundance of their subunits as previously described (Wang et al, 2009) (Supplementary Figure S6D, inset).
To test whether the selected subset of well-defined complexes is subject to compositional variations, we implemented a two-step analysis procedure. We first analyzed the absolute abundance of each complex as a whole, which was often highly variable across the five cell lines (Figure 5A). Subsequently, we normalized the expression level of each subunit to the absolute abundance of the respective protein complex and used significance testing to identify subunits deviating from the expression profile of the other members of the same protein complex, which is indicative of a variable composition (Figure 5B; Supplementary Figure S6E). We validated our method using the NPC as a test case (Figures 5A and B) and found that it could to a large extent recapitulate the compositional variations that were accurately quantified by targeted proteomics beforehand (Figure 3). When we examined the data for significant, cell type-specific changes in the relative expression of other protein complex components, we found that the subunits of 21 out of 34 protein complexes (62%) displayed remarkably stable expression, indicating a conserved complex stoichiometry (Figure 5E). These cases include for example the cohesin complex and the core complexes of RNA polymerases I and II. For the remaining protein complexes (13 out of 34, 38%), the expression of specific subunits appeared to be more dynamic, that is, significantly changed in one or more of the analyzed cell lines, and decoupled from the expression of other components of the same complex. This indicates that these complexes might undergo compositional rearrangements in different cell types. This group contains not only the NPC (Figure 5B), but also the TREX complex that is involved in mRNP packaging and export (Figure 5C), and complexes involved in chromatin remodeling and histone modification such as NuRD and BAF complexes (Figure 5D). 
To exclude a potential bias due to the manual selection of protein complexes, we repeated our analysis using a recently published data set of soluble protein complexes that were biochemically defined in two of the cell lines used in our study (HEK293 and HeLa) (Havugimana et al, 2012) and obtained similar results (Supplementary Figure S6F). Since we tested only five cell lines, it is likely that many more stoichiometric variations will be found when more cell types are analyzed, that is, the 38% we observe here likely represents a lower limit.
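The two-step procedure described above can be sketched in a few lines. This is a simplified illustration, assuming log2-scale abundances and taking the complex abundance in each cell line as the median of its subunits (the Materials and methods mention a median-based normalization). The significance test is not reproduced here; the range of each normalized profile serves only as a crude stand-in, and the toy values are invented.

```python
from statistics import median

def normalize_to_complex(log2_abundance):
    """log2_abundance: {subunit: [value per cell line]} for one complex."""
    n_lines = len(next(iter(log2_abundance.values())))
    # Step 1: abundance of the complex as a whole, per cell line.
    complex_level = [median(values[i] for values in log2_abundance.values())
                     for i in range(n_lines)]
    # Step 2: subunit abundance relative to the complex, per cell line.
    normalized = {subunit: [v - c for v, c in zip(values, complex_level)]
                  for subunit, values in log2_abundance.items()}
    return complex_level, normalized

def profile_range(profile):
    """Spread of a normalized profile across cell lines."""
    return max(profile) - min(profile)

# Toy data: subunits A-C follow the complex, D deviates in the third cell line.
toy = {
    "A": [10.0, 11.0, 12.0],
    "B": [10.5, 11.5, 12.5],
    "C": [9.5, 10.5, 11.5],
    "D": [10.0, 11.0, 14.0],
}
level, norm = normalize_to_complex(toy)
most_variable = max(norm, key=lambda s: profile_range(norm[s]))
print(level, most_variable)
```

Subunits that track the complex end up with flat normalized profiles even when the complex itself changes strongly in abundance, so only decoupled subunits stand out.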
The newly established stoichiometry of the human NPC calls for a new generation of structural models that take this into account. The determined Nup abundances provide a direct insight into the modularity of the NPC by revealing specific stoichiometries of different subcomplexes within assembled pores. The stoichiometric abundances of the majority of the components of the Nup107 subcomplex are consistent with biochemical and structural data obtained for the homolog Nup84 subcomplex from yeast (Rout et al, 2000; Alber et al, 2007b; Kampmann and Blobel, 2009). However, two Nups, namely Seh1 and Elys, are sub-stoichiometric relative to the other components. Interestingly, Seh1 was located to the periphery of the yeast Nup84 subcomplex and does not directly contribute to the structural core of the complex (Fernandez-Martinez et al, 2012), as previously thought (Nagy et al, 2009). In case of the chromatin binding protein Elys (Rasala et al, 2006), the lower copy number might be explained by its potential asymmetric occurrence on the nuclear face of the NPC. Moreover, both human Seh1 and Elys have been shown to be more loosely associated with the Nup107 subcomplex than other components (Loiodice et al, 2004; Franz et al, 2007). Taken together, these data suggest that a second, distinct structural instance of the Nup107 subcomplex might exist in humans, which has not yet been biochemically characterized. The two subspecies might be associated with an alternative subset of peripheral components, a possibility that has been neglected by all previous structural models. Our comprehensive stoichiometry data provide not only a general framework for understanding Nup interactions but also hint at some heterogeneity of certain Nup populations, in particular peripheral ones. This information might also facilitate the in vitro reconstruction of subcomplexes that account for distinct native structural species. 
The presented experimental strategy is universal and paves the way toward elucidating the detailed architecture of other molecular machines.
Recently, it has been reported that temporal regulation of the transmembrane nucleoporin Gp210 occurs in myoblast and stem cells and that varying Gp210 expression is required for differentiation to myotubes and neural cells (D'Angelo et al, 2012). Here, we show that Gp210 expression also varies spatially between different human cancer cell lines, at both the transcript and protein level (Figure 3A). In addition, we identified six other, mostly peripheral, Nups with a similar dynamic behavior, while the expression of the majority of the scaffold Nups remains stable (Figure 3B), presumably due to their essential structural role as seen in gene silencing experiments (Boehmer et al, 2003; Walther et al, 2003; Hawryluk-Gara et al, 2008; Mitchell et al, 2010), or due to deleterious subunit dosage effects (Papp et al, 2003). At least five of the seven significantly regulated nucleoporins have been shown to be non-essential NPC components (Aladin, Nup214, Nup50, Tpr and Gp210) (Smitherman et al, 2000; Cronshaw and Matunis, 2003; Hase and Cordes, 2003; Antonin et al, 2005; Hutten and Kehlenbach, 2006), and six out of seven have a predicted peripheral localization (Figure 3B). Since protein and gene expression strongly indicate copy number variation across a number of cell types as well as healthy and pathological tissues, rearrangements of the NPC stoichiometry could influence several mechanisms that link Nups to the regulation of cell function. Stable integration of the relevant Nups into NPCs at the NE has been well established in terms of both localization and mean residence times (Rabut et al, 2004). Therefore, it is highly unlikely that these subunits are part of other protein assemblies and thus differentially expressed, except for Nup50 and Tpr for which we also detected significant nucleoplasmic pools of protein (see also Supplementary Materials and Methods). 
Different arrangements of peripheral Nups could affect nucleocytoplasmic transport by varying the docking sites available for transport factors, thus having an impact on processes such as translocation of signaling molecules (Xylourgidis et al, 2006) and mRNA export (Forler et al, 2004). In addition, changes in the expression of nucleoplasmic Nups could affect the spatial organization of chromatin in the proximity of the NE (Krull et al, 2010), regulate the activity of histone modifying enzymes (Kehat et al, 2011), and have an impact on transcription regulation via direct interaction with chromatin (Capelson et al, 2010; Kalverda et al, 2010). Fine-tuning of the function of the NPC, the machinery that indirectly influences gene expression by controlling the composition of the nuclear compartment and the export of regulatory and messenger RNAs, might thus be a more general mechanism for reprogramming the cell machinery according to the needs of individual cell types or even cell states, for example, during development or disease.
Shotgun proteomics confirmed the cell type-specific variation of NPCs and revealed that more than one third of the well-characterized molecular machines studied show a dynamic expression of at least one subunit across the five cell lines in which we studied nuclear pore composition. This finding indicates that these complexes might similarly undergo compositional rearrangements as a function of the cell type (Figure 5), although this remains to be ultimately proven by their biochemical isolation from the different cell types in the future. The characteristic changes in expression of components of the same protein complex might occur through one of at least three different mechanistic scenarios (Figure 6): (i) a change in the stoichiometry of the protein complex that alters its function, as in case of the NPC; (ii) a switch in subunit composition that is driven by the downregulation of one or more components coupled to a balancing expression of homologous proteins, as observed for the chromatin remodeling complex BAF (Lessard et al, 2007; Ho et al, 2009) and the proteasome (Noda et al, 2000; Murata et al, 2007); or (iii) an adjustment in expression as a result of the association of the subunit with another complex that varies in abundance (in case of overlapping complexes). Since we minimized subunit overlap, the majority of dynamic protein complexes studied would appear to follow one of the first two scenarios. Our data are generated from the combined bulk of nuclei/NEs and therefore represent an average over multiple single-cell species. It is thus possible that for a significant fraction of protein complexes, multiple distinct compositional states co-exist in a single cell with certain stoichiometries being more abundant than others. However, the observed differences are highly robust and reproducible and thus also cell type-specific. 
The compositional variations that we describe co-occur with remarkable changes in total complex abundance between different cells (as shown for the NPC in Figure 5). Both these mechanisms might therefore contribute to adapt the activity of protein complexes toward cell type-specific needs.
Since human tissue gene expression data also point to considerable spatial variation and many other functional states of cells are conceivable in which protein complex composition might be different (e.g., development or disease state), we believe that only a comparative study of homogeneous cell populations on a temporal and spatial scale will reveal the true extent of compositional rearrangements of molecular machines in multicellular organisms. The repertoire of context-dependent complex variants should be the starting point to uncover both the mechanisms leading to differential regulation and the functional consequences of varying quaternary structures.
Nuclei were isolated by hypotonic cell lysis using a Dounce homogenizer followed by centrifugation through a sucrose cushion. NEs were obtained by a combined DNase and RNase treatment of nuclei followed by an additional sucrose cushion step. Detailed procedures for the different cell lines and additional experiments performed to ensure the integrity of the obtained organelles are described in Supplementary Materials and Methods and Supplementary Figure S1.
Nups were quantified in nuclear or NE extracts using targeted proteomics in combination with the spike-in of isotopically labeled peptides used as internal standard. For absolute quantification, the abundance of each Nup was calculated from the median ratio between the intensities of its endogenous proteotypic peptides (PTPs, light) and the corresponding spiked-in AQUA peptides (heavy). A panel of 90 MRM assays for 76 PTPs was used for absolute quantification. For relative quantification of Nup abundances across cell lines, crude synthetic peptides were used as internal standard instead of AQUA peptides. Therefore, a larger panel of 142 MRM assays for 119 PTPs was used. A detailed description of MRM assays development, data acquisition and processing is available in Supplementary Materials and Methods and Supplementary Figures S1E–G. A list of the MRM assays employed is available in Supplementary Table S1. Raw MRM data and method files are available at http://www.peptideatlas.org/PASS/PASS00189 for absolute quantification, and http://www.peptideatlas.org/PASS/PASS00188 for relative quantification across cell lines.
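The median-ratio calculation for absolute quantification can be sketched as follows. This is a minimal illustration, assuming every AQUA peptide is spiked in at the same known amount; the function name, the intensity values and the spiked amount are hypothetical.

```python
from statistics import median

def absolute_amount(light_intensities, heavy_intensities, spiked_fmol):
    """Endogenous amount from the median light/heavy ratio of the peptides."""
    ratios = [light / heavy
              for light, heavy in zip(light_intensities, heavy_intensities)]
    return median(ratios) * spiked_fmol

# Three hypothetical proteotypic peptides of one Nup (arbitrary intensity units).
light = [2.0e5, 1.0e5, 3.0e5]
heavy = [4.0e5, 2.5e5, 5.0e5]
amount = absolute_amount(light, heavy, spiked_fmol=100.0)
print(amount, "fmol")
```

Using the median rather than the mean limits the influence of a single peptide with an aberrant ratio, for example one affected by incomplete digestion or interference.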
PALM measurements of purified nuclei in which native Nup107 was replaced with mEos2-Nup107 were acquired following Betzig et al (2006). A clustering algorithm based on density (DBSCAN) (Ester et al, 1996) was employed to select photo-conversion events corresponding to isolated NPCs. The counted number of photo-conversion events detected in each pore was then corrected for fluorophore blinking with the method proposed by Annibale et al (2011) in order to retrieve the number of fluorophores in each pore. More details can be found in Supplementary Materials and Methods and Supplementary Figures S3 and S4.
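For readers unfamiliar with density-based clustering, the grouping of localizations into individual pores can be sketched with a compact pure-Python DBSCAN. This is an illustrative implementation, not the authors' code; the coordinates (nominally in nm) and the `eps` and `min_samples` values are hypothetical, and the parameters actually used are given in the Supplementary Materials and Methods.

```python
import math

def dbscan(points, eps, min_samples):
    """Return one label per point: cluster index (0, 1, ...) or -1 for noise."""
    def neighbors(i):
        # All points within eps of point i, including i itself.
        return [j for j, q in enumerate(points) if math.dist(points[i], q) <= eps]

    labels = [None] * len(points)      # None means not yet visited
    cluster = -1
    for i in range(len(points)):
        if labels[i] is not None:
            continue
        seeds = neighbors(i)
        if len(seeds) < min_samples:
            labels[i] = -1             # provisional noise, may become a border point
            continue
        cluster += 1
        labels[i] = cluster
        queue = [j for j in seeds if j != i]
        while queue:
            j = queue.pop()
            if labels[j] == -1:
                labels[j] = cluster    # noise reachable from a core point: border
            if labels[j] is not None:
                continue
            labels[j] = cluster
            nbrs = neighbors(j)
            if len(nbrs) >= min_samples:
                queue.extend(nbrs)     # j is a core point: keep expanding
    return labels

# Two compact groups of localizations and one stray event.
localizations = [(0, 0), (0, 10), (10, 0), (5, 5),
                 (200, 200), (205, 200), (200, 205),
                 (500, 500)]
labels = dbscan(localizations, eps=20, min_samples=3)
print(labels)
```

Events labeled -1 are treated as noise, and counting the events per cluster gives the raw number of photo-conversion events per pore before blinking correction.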
The average copy number per nucleus of Nup107 was measured using targeted proteomics on HeLa nuclei that were FACS sorted (because nuclei are more reliably sorted than NEs; a nuclear protein pool of Nup107 was not detected). After sorting, the concentration of isolated nuclei was assessed by two independent methods using CountBright Absolute Counting Beads (Life Technologies), according to the manufacturer's instructions, and, in parallel, using a Neubauer chamber. In both cases, the measurements were performed in triplicate. We observed a high correlation between the two methods (data not shown). Next, for each biological replicate between 5 and 7 × 10⁵ counted nuclei were spiked with AQUA peptides at a concentration of 1 pmol per peptide per 1 × 10⁶ nuclei, digested as described in Supplementary Materials and Methods and measured in scheduled MRM mode. The Nup107 concentration was derived directly from the light to heavy ratios of two distinct PTPs and transformed into protein copy numbers using Avogadro's number. In order to estimate the average number of NPCs per nucleus, we measured NPC density using cryo electron microscopy and estimated nuclear surface area by membrane staining and fluorescence microscopy (Supplementary Figures S2A and B). Isolated NEs were deposited on the EM grid (Copper R2/1, Quantifoil Micro Tools GmbH, Jena, Germany), blotted and immediately plunge frozen. Tilt series of intact cryo-fixed NEs were collected in the range of ±60° with 3° increments on a Polara TEM (300 kV) (FEI, equipped with Gatan Camera 2k × 2k and energy filter). Tomograms were reconstructed from the tilt series using IMOD (Kremer et al, 1996). The number of NPCs per tomogram was counted manually. The surface area was estimated in each tomogram by fitting a surface onto the membrane curvature. 
For membrane staining, nuclei were incubated for 1 min on ice with 3 μg/μl of FM1-43FX (Life Technologies) in ice-cold PBS immediately before mounting the sample on glass coverslips in Mowiol. Confocal z-stacks were acquired on a Zeiss LSM780 using the 458 nm laser line for excitation. The surface was obtained by fitting the acquired z-stacks using Imaris software (Bitplane, Zürich, Switzerland) with automatic thresholding.
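The conversion of a light-to-heavy ratio into protein copies per nucleus, as described above for Nup107, amounts to a unit conversion with Avogadro's number. The sketch below assumes one endogenous peptide per protein molecule and complete digestion; the ratio of 0.2 is a hypothetical input chosen for illustration, while the spike amount follows the 1 pmol per 1 × 10⁶ nuclei stated in the text.

```python
AVOGADRO = 6.02214076e23  # molecules per mole

def copies_per_nucleus(light_to_heavy_ratio, spiked_pmol, n_nuclei):
    """Endogenous protein copies per nucleus from an AQUA spike-in measurement."""
    endogenous_mol = light_to_heavy_ratio * spiked_pmol * 1e-12
    return endogenous_mol * AVOGADRO / n_nuclei

# 6 x 10^5 nuclei spiked with 0.6 pmol of heavy peptide (1 pmol per 10^6 nuclei).
estimate = copies_per_nucleus(light_to_heavy_ratio=0.2, spiked_pmol=0.6,
                              n_nuclei=6e5)
print(f"{estimate:.3g} copies per nucleus")
```

At this spike level each nucleus is matched by roughly 6 × 10⁵ heavy peptide molecules, so a light-to-heavy ratio near 0.2 corresponds to the order of magnitude reported for Nup107.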
We acquired shotgun MS data from nuclei extracts of the five cell lines that we selected to investigate NPC composition. For each cell line, three biological replicates were analyzed and protein abundance scores based on PTP intensities were calculated as described in Supplementary Materials and Methods. In total, we quantified 1159 proteins using at least two PTPs consistently identified in at least two out of three replicates for each cell line. We assessed the consistency of our measurements using hierarchical clustering (Spearman correlation with average linkage) and found that biological replicates of cell lines clustered together, indicating high reproducibility (data not shown). Raw shotgun MS data are available at http://www.peptideatlas.org/PASS/PASS00190.
We selected 57 large protein complexes, composed of at least 5 proteins with predicted nuclear localization. A limitation of our approach lies in the analysis of subunits that are shared between different protein complexes. For these cases, a change in abundance of one out of a subset of protein complexes sharing a subunit could result in the false positive detection of dynamic stoichiometries for the others (since our second normalization step is based solely on the median abundance of the complex analyzed). In order to minimize redundancy, complexes were manually selected from minimal endogenous modules (MEMOs) described in a recent large-scale affinity-purification MS study (Malovannaya et al, 2011), entries from the CORUM database (Ruepp et al, 2010) and literature mining (Supplementary Figures S6A–D; Supplementary Table S2). Subsequently, the shotgun proteomics data were used to extract protein complexes having at least 50% of their components cross-quantified; in addition, a minimum of four quantified proteins was required. In total, we selected 34 protein complexes, totaling 274 quantified proteins, that were largely non-redundant. Less than 4% of the proteins were shared between two different protein complexes, with the remaining 96% being uniquely assigned to one protein complex (Supplementary Figure S6; Supplementary Table S2). We then designed a workflow to analyze subunit expression profiles in a complex-centered manner as explained in Figure 5 and Supplementary Figure S6E.
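The coverage criteria for retaining a complex (at least 50% of components quantified and at least four quantified proteins) can be written as a simple filter. The sketch below is illustrative; the function name and the protein identifiers are hypothetical.

```python
def passes_coverage(complex_members, quantified, min_fraction=0.5, min_count=4):
    """True if enough members of a complex were quantified across cell lines."""
    found = [protein for protein in complex_members if protein in quantified]
    return (len(found) >= min_count
            and len(found) / len(complex_members) >= min_fraction)

# Hypothetical identifiers: P1 to P4 are the only quantified proteins.
quantified = {"P1", "P2", "P3", "P4"}
eight_member = [f"P{i}" for i in range(1, 9)]    # 4 of 8 quantified
ten_member = [f"P{i}" for i in range(1, 11)]     # 4 of 10 quantified
five_member = ["P1", "P2", "P3", "P9", "P10"]    # 3 of 5 quantified

print(passes_coverage(eight_member, quantified),
      passes_coverage(ten_member, quantified),
      passes_coverage(five_member, quantified))
```

The two thresholds act together: the ten-member complex fails on fractional coverage, while the five-member complex fails on the absolute minimum despite 60% coverage.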
We gratefully acknowledge support from EMBL's proteomics, flow cytometry, advanced light microscopy core facilities and mechanical workshop, and particularly want to thank Toby Gibson, Holger Dinkel and Jeroen Krijgsveld. We thank Christine Köhler, Amanda DiGuilio and Lukas Reiter for technical support; Joseph Glavy, Ulrike Kutay, Alexander Schmidt, Iain Mattaj, Vera van Noort and Katja Beck for critical advice and reagents. AO was supported by postdoctoral fellowships from the Alexander von Humboldt Foundation and Marie Curie Actions. HKB was supported by postdoctoral fellowships from the Swiss National Science Foundation, the European Molecular Biology Organization and Marie Curie Actions. EAL acknowledges funding by the Emmy Noether program of the German Research Foundation. MB acknowledges funding by the European Research Council (Grant No. 309271/NPCAtlas).
Author contributions: AO designed and performed experiments, analyzed data and wrote the manuscript; NB and AA designed and performed experiments, and analyzed data; MI designed data analysis procedures, and analyzed data; CE performed experiments; HKB and LS performed experiments, and analyzed data; VS analyzed data; OR designed experiments; PB, EAL and MB coordinated the project, designed experiments, and wrote the manuscript.
OR and CE are employees of Biognosys AG.