Characterizing changes in protein-protein interactions associated with sequence variants (e.g. disease-associated mutations or splice forms) or following exposure to drugs, growth factors or hormones is critical to understanding how protein complexes are built, localized and regulated. Affinity purification (AP) coupled with mass spectrometry permits the analysis of protein interactions under near-physiological conditions, yet monitoring interaction changes requires the development of a robust and sensitive quantitative approach, especially for large-scale studies where cost and time are major considerations. To this end, we have coupled AP to data-independent mass spectrometric acquisition (SWATH), and implemented an automated data extraction and statistical analysis pipeline to score modulated interactions. Here, we use AP-SWATH to characterize changes in protein-protein interactions imparted by the HSP90 inhibitor NVP-AUY922 or melanoma-associated mutations in the human kinase CDK4. We show that AP-SWATH is a robust label-free approach to characterize such changes, and propose a scalable pipeline for systems biology studies.
Research advancing our understanding of Mycobacterium tuberculosis (Mtb) biology and complex host-Mtb interactions requires consistent and precise quantitative measurements of Mtb proteins. We describe the generation and validation of a compendium of assays to quantify 97% of the 4,012 annotated Mtb proteins by the targeted mass spectrometric method selected reaction monitoring (SRM). Furthermore, we estimate the absolute abundance for 55% of all Mtb proteins, revealing a dynamic range within the Mtb proteome of over four orders of magnitude, and identify previously un-annotated proteins. As an example of the assay library utility, we monitored the entire Mtb dormancy survival regulon (DosR), which is linked to anaerobic survival and Mtb persistence, and show its dynamic protein-level regulation during hypoxia. In conclusion, we present a publicly available research resource that supports the sensitive, precise, and reproducible quantification of virtually any Mtb protein by a robust and widely accessible mass spectrometric method.
Tight spatio-temporal signaling of cytoskeletal and adhesion dynamics is required for localized membrane protrusion that drives directed cell migration. Different ensembles of proteins are therefore likely to get recruited and phosphorylated in membrane protrusions in response to specific cues.
Here, we use an assay that allows to biochemically purify extending protrusions of cells migrating in response to three prototypical receptors: integrins, recepor tyrosine kinases and G-coupled protein receptors. Using quantitative proteomics and phospho-proteomics approaches, we provide evidence for the existence of cue-specific, spatially distinct protein networks in the different cell migration modes.
The integrated analysis of the large-scale experimental data with protein information from databases allows us to understand some emergent properties of spatial regulation of signaling during cell migration. This provides the cell migration community with a large-scale view of the distribution of proteins and phospho-proteins regulating directed cell migration.
Fibroblast; Directional cell migration; Signaling; Proteomics; Phosphorylation
This White Paper sets out a Life Sciences Grand Challenge for Proteomics Technologies to enhance our understanding of complex biological systems, link genomes with phenotypes, and bring broad benefits to the biosciences and the US economy. The paper is based on a workshop hosted by the National Institute of Standards and Technology (NIST) in Gaithersburg, MD, 14–15 February 2011, with participants from many federal R&D agencies and research communities, under the aegis of the US National Science and Technology Council (NSTC). Opportunities are identified for a coordinated R&D effort to achieve major technology-based goals and address societal challenges in health, agriculture, nutrition, energy, environment, national security, and economic development.
Complex systems; Democratization of proteomics; Economic growth; Grand challenges; Integration; Systems biology
Complete reference maps or datasets, like the genomic map of an organism, are highly beneficial tools for biological and biomedical research. Attempts to generate such reference datasets for a proteome so far failed to reach complete proteome coverage, with saturation apparent at approximately two thirds of the proteomes tested, even for the most thoroughly characterized proteomes. Here, we used a strategy based on high-throughput peptide synthesis and mass spectrometry to generate a close to complete reference map (97% of the genome-predicted proteins) of the S. cerevisiae proteome. We generated two versions of this mass spectrometric map one supporting discovery- (shotgun) and the other hypothesis-driven (targeted) proteomic measurements. The two versions of the map, therefore, constitute a complete set of proteomic assays to support most studies performed with contemporary proteomic technologies. The reference libraries can be browsed via a web-based repository and associated navigation tools. To demonstrate the utility of the reference libraries we applied them to a protein quantitative trait locus (pQTL) analysis, which requires measurement of the same peptides over a large number of samples with high precision. Protein measurements over a set of 78 S. cerevisiae strains revealed a complex relationship between independent genetic loci, impacting on the levels of related proteins. Our results suggest that selective pressure favors the acquisition of sets of polymorphisms that maintain the stoichiometry of protein complexes and pathways.
S. cerevisiae; selected reaction monitoring; SRM; MRM; spectral library; peptide library; mass spectrometric map; protein QTL
Bulk degradation of cytoplasmic material is mediated by a highly conserved intracellular trafficking pathway termed autophagy. This pathway is characterized by the formation of double-membrane vesicles termed autophagosomes engulfing the substrate and transporting it to the vacuole/lysosome for breakdown and recycling. The Atg1/ULK1 kinase is essential for this process; however, little is known about its targets and the means by which it controls autophagy. Here we have screened for Atg1 kinase substrates using consensus peptide arrays and identified three components of the autophagy machinery. The multimembrane-spanning protein Atg9 is a direct target of this kinase essential for autophagy. Phosphorylated Atg9 is then required for the efficient recruitment of Atg8 and Atg18 to the site of autophagosome formation and subsequent expansion of the isolation membrane, a prerequisite for a functioning autophagy pathway. These findings show that the Atg1 kinase acts early in autophagy by regulating the outgrowth of autophagosomal membranes.
•The Atg1 kinase phosphorylation consensus was identified on peptide arrays•Atg9 is a direct target of the Atg1/ULK1 kinase in vitro and in vivo•Atg9 phosphorylation recruits Atg18 and Atg8 to the PAS•Atg9 phosphorylation is required for isolation membrane expansion/autophagy function
Autophagy function is pivotal to cell health. Papinski et al. identify the phosphorylation consensus of the central kinase in this pathway, Atg1. The autophagy-related protein Atg9 is a direct target of Atg1. Atg9 phosphorylation by Atg1 is required for autophagosome formation. This finding sheds light on how Atg1 controls autophagy.
Pyrococcus Furiosus (Pfu) is an excellent organism to generate reference samples for proteomics labs because of its moderately sized genome and very little sequence duplication within the genome. We demonstrated a stable and consistent method to prepare proteins in bulk that eliminates growth and preparation as a source of uncertainty in the standard. We performed several proteomic studies in different laboratories using each laboratory's specific workflow as well as separate and integrated data analysis. This study demonstrated that a Pfu whole cell lysate provides suitable protein sample complexity to not only validate proteomic methods, work flows, and benchmark new instruments, but also to facilitate comparison of experimental data generated over time and across instruments or labs.
Pyrococcus furiosus (Pfu); Proteomics; Protein Complex Standard; MudPIT; OFFGEL electrophoresis; Directed LC-MS/MS
Affinity purification coupled with mass spectrometry (AP-MS) is now a widely used approach for the identification of protein-protein interactions. However, for any given protein of interest, determining which of the identified polypeptides represent bona fide interactors versus those that are background contaminants (e.g. proteins that interact with the solid-phase support, affinity reagent or epitope tag) is a challenging task. While the standard approach is to identify nonspecific interactions using one or more negative controls, most small-scale AP-MS studies do not capture a complete, accurate background protein set. Fortunately, negative controls are largely bait-independent. Hence, aggregating negative controls from multiple AP-MS studies can increase coverage and improve the characterization of background associated with a given experimental protocol. Here we present the Contaminant Repository for Affinity Purification (the CRAPome) and describe the use of this resource to score protein-protein interactions. The repository (currently available for Homo sapiens and Saccharomyces cerevisiae) and computational tools are freely available online at www.crapome.org.
Topoisomerase I (TOP1) inhibitors are an important class of anticancer drugs. The cytotoxicity of TOP1 inhibitors can be modulated by replication fork reversal, in a process that requires PARP activity. Whether regressed forks can efficiently restart and the factors required to restart fork progression after fork reversal are still unknown. Here we combined biochemical and electron microscopy approaches with single-molecule DNA fiber analysis, to identify a key role for human RECQ1 helicase in replication fork restart after TOP1 inhibition, not shared by other human RecQ proteins. We show that the poly(ADPribosyl)ation activity of PARP1 stabilizes forks in their regressed state by limiting their restart by RECQ1. These studies provide new mechanistic insights into the roles of RECQ1 and PARP in DNA replication and offer molecular perspectives to potentiate chemotherapeutic regimens based on TOP1 inhibition.
Adoption of targeted mass spectrometry (MS) approaches such as multiple reaction monitoring (MRM) to study biological and biomedical questions is well underway in the proteomics community. Successful application depends on the ability to generate reliable assays that uniquely and confidently identify target peptides in a sample. Unfortunately, there is a wide range of criteria being applied to say that an assay has been successfully developed. There is no consensus on what criteria are acceptable and little understanding of the impact of variable criteria on the quality of the results generated. Publications describing targeted MS assays for peptides frequently do not contain sufficient information for readers to establish confidence that the tests work as intended or to be able to apply the tests described in their own labs. Guidance must be developed so that targeted MS assays with established performance can be made widely distributed and applied by many labs worldwide. To begin to address the problems and their solutions, a workshop was held at the National Institutes of Health with representatives from the multiple communities developing and employing targeted MS assays. Participants discussed the analytical goals of their experiments and the experimental evidence needed to establish that the assays they develop work as intended and are achieving the required levels of performance. Using this “fit-for-purpose” approach, the group defined three tiers of assays distinguished by their performance and extent of analytical characterization. Computational and statistical tools useful for the analysis of targeted MS results were described. Participants also detailed the information that authors need to provide in their manuscripts to enable reviewers and readers to clearly understand what procedures were performed and to evaluate the reliability of the peptide or protein quantification measurements reported. This paper presents a summary of the meeting and recommendations.
Tissue homeostasis is controlled by signaling systems that coordinate cell proliferation, cell growth and cell shape upon changes in the cellular environment. Deregulation of these processes is associated with human cancer and can occur at multiple levels of the underlying signaling systems. To gain an integrated view on signaling modules controlling tissue growth, we analyzed the interaction proteome of the human Hippo pathway, an established growth regulatory signaling system. The resulting high‐resolution network model of 480 protein‐protein interactions among 270 network components suggests participation of Hippo pathway components in three distinct modules that all converge on the transcriptional co‐activator YAP1. One of the modules corresponds to the canonical Hippo kinase cassette whereas the other two both contain Hippo components in complexes with cell polarity proteins. Quantitative proteomic data suggests that complex formation with cell polarity proteins is dynamic and depends on the integrity of cell‐cell contacts. Collectively, our systematic analysis greatly enhances our insights into the biochemical landscape underlying human Hippo signaling and emphasizes multifaceted roles of cell polarity complexes in Hippo‐mediated tissue growth control.
AP‐MS; cell polarity; Hippo signaling; modularity; protein complex analysis
Application of a kinetic model of miRNA-mediated gene regulation to experimental data sets shows that the timescale of regulation is slower than previously assumed, due to bottlenecks imposed by miRNA turnover in the RNA-induced silencing complex and by slow protein decay.
A mathematical model links the dynamics of miRNA expression and loading into the Argonaute protein to the dynamics of miRNA targets.Loading of miRNAs into Argonaute and the slow decay of proteins impose two bottlenecks on the speed of miRNA-mediated regulation.Accelerated miRNA turnover is necessary for regulating target expression on the timescale of a day.
MiRNAs are post-transcriptional regulators that contribute to the establishment and maintenance of gene expression patterns. Although their biogenesis and decay appear to be under complex control, the implications of miRNA expression dynamics for the processes that they regulate are not well understood. We derived a mathematical model of miRNA-mediated gene regulation, inferred its parameters from experimental data sets, and found that the model describes well time-dependent changes in mRNA, protein and ribosome density levels measured upon miRNA transfection and induction. The inferred parameters indicate that the timescale of miRNA-dependent regulation is slower than initially thought. Delays in miRNA loading into Argonaute proteins and the slow decay of proteins relative to mRNAs can explain the typically small changes in protein levels observed upon miRNA transfection. For miRNAs to regulate protein expression on the timescale of a day, as miRNAs involved in cell-cycle regulation do, accelerated miRNA turnover is necessary.
gene expression regulation; kinetics; miRNAs; modeling; protein turnover
Deciphering physiological changes that mediate transition of Mycobacterium tuberculosis between replicating and nonreplicating states is essential to understanding how the pathogen can persist in an individual host for decades. We have combined RNA sequencing (RNA-seq) of 5′ triphosphate-enriched libraries with regular RNA-seq to characterize the architecture and expression of M. tuberculosis promoters. We identified over 4,000 transcriptional start sites (TSSs). Strikingly, for 26% of the genes with a primary TSS, the site of transcriptional initiation overlapped with the annotated start codon, generating leaderless transcripts lacking a 5′ UTR and, hence, the Shine-Dalgarno sequence commonly used to initiate ribosomal engagement in eubacteria. Genes encoding proteins with active growth functions were markedly depleted from the leaderless transcriptome, and there was a significant increase in the overall representation of leaderless mRNAs in a starvation model of growth arrest. The high percentage of leaderless genes may have particular importance in the physiology of nonreplicating M. tuberculosis.
•A resource for the identification of in vitro active promoters in M. tuberculosis•A quarter of all genes in M. tuberculosis are expressed as leaderless mRNAs•Leaderless mRNAs are differentially associated with toxin-antitoxin modules•Abundance of leaderless mRNAs increases during starvation-induced growth arrest
In this study, Cortes, Young, and colleagues report genome-wide mapping of transcriptional start sites combined with RNA sequencing and shotgun proteomics in the human pathogen Mycobacterium tuberculosis. A striking finding is a high proportion of genes expressed in the form of leaderless transcripts lacking the Shine-Dalgarno sequence conventionally required for translation. The distribution of functional gene classes between leaderless and Shine-Dalgarno transcriptomes suggests that changes in the specificity of translation may play a role in bacterial adaptation during infection.
Public repositories for proteomics data have accelerated proteomics research by enabling more efficient cross-analyses of datasets, supporting the creation of protein and peptide compendia of experimental results, supporting the development and testing of new software tools, and facilitating the manuscript review process. The repositories available to date have been designed to accommodate either shotgun experiments or generic proteomic data files. Here, we describe a new kind of proteomic data repository for the collection and representation of data from selected reaction monitoring (SRM) measurements. The PeptideAtlas SRM Experiment Library (PASSEL) allows researchers to easily submit proteomic data sets generated by SRM. The raw data are automatically processed in a uniform manner and the results are stored in a database, where they may be downloaded or browsed via a web interface that includes a chromatogram viewer. PASSEL enables cross-analysis of SRM data, supports optimization of SRM data collection, and facilitates the review process of SRM data. Further, PASSEL will help in the assessment of proteotypic peptide performance in a wide array of samples containing the same peptide, as well as across multiple experimental protocols.
data repository; MRM; software; SRM; targeted proteomics
Serum biomarkers can improve diagnosis and treatment of malignant pleural mesothelioma (MPM). However, the evaluation of potential new serum biomarker candidates is hampered by a lack of assay technologies for their clinical evaluation. Here we followed a hypothesis-driven targeted proteomics strategy for the identification and clinical evaluation of MPM candidate biomarkers in serum of patient cohorts.
Based on the hypothesis that cell surface exposed glycoproteins are prone to be released from tumor-cells to the circulatory system, we screened the surfaceome of model cell lines for potential MPM candidate biomarkers. Selected Reaction Monitoring (SRM) assay technology allowed for the direct evaluation of the newly identified candidates in serum. Our evaluation of 51 candidate biomarkers in the context of a training and an independent validation set revealed a reproducible glycopeptide signature of MPM in serum which complemented the MPM biomarker mesothelin.
Our study shows that SRM assay technology enables the direct clinical evaluation of protein-derived candidate biomarker panels for which clinically reliable ELISA’s currently do not exist.
Malignant pleural mesothelioma; Selected reaction monitoring; Surfaceome; Targeted proteomics; Serum biomarkers
Chemical cross-links identified by mass spectrometry generate distance restraints that reveal low-resolution structural information on proteins and protein complexes. The technology to reliably generate such data has become mature and robust enough to shift the focus to the question of how these distance restraints can be best integrated into molecular modeling calculations. Here, we introduce three workflows for incorporating distance restraints generated by chemical cross-linking and mass spectrometry into ROSETTA protocols for comparative and de novo modeling and protein-protein docking. We demonstrate that the cross-link validation and visualization software Xwalk facilitates successful cross-link data integration. Besides the protocols we introduce XLdb, a database of chemical cross-links from 14 different publications with 506 intra-protein and 62 inter-protein cross-links, where each cross-link can be mapped on an experimental structure from the Protein Data Bank. Finally, we demonstrate on a protein-protein docking reference data set the impact of virtual cross-links on protein docking calculations and show that an inter-protein cross-link can reduce on average the RMSD of a docking prediction by 5.0 Å. The methods and results presented here provide guidelines for the effective integration of chemical cross-link data in molecular modeling calculations and should advance the structural analysis of particularly large and transient protein complexes via hybrid structural biology methods.
The ATP-dependent chromatin-remodeling complex SWR1 exchanges a variant histone H2A.Z/H2B dimer for a canonical H2A/H2B dimer at nucleosomes flanking histone-depleted regions, such as promoters. This localization of H2A.Z is conserved throughout eukaryotes. SWR1 is a 1 megadalton complex containing 14 different polypeptides, including the AAA+ ATPases Rvb1 and Rvb2. Using electron microscopy, we obtained the three-dimensional structure of SWR1 and mapped its major functional components. Our data show that SWR1 contains a single heterohexameric Rvb1/Rvb2 ring that, together with the catalytic subunit Swr1, brackets two independently assembled multisubunit modules. We also show that SWR1 undergoes a large conformational change upon engaging a limited region of the nucleosome core particle. Our work suggests an important structural role for the Rvbs and a distinct substrate-handling mode by SWR1, thereby providing a structural framework for understanding the complex dimer-exchange reaction.
•SWR1 consists of four structurally discrete functional modules•Rvb1 and Rvb2 assemble into a single, heterohexameric ring in SWR1•SWR1 undergoes a large conformational change upon nucleosome binding•SWR1 forms limited contact with the nucleosome core particle
Structural analysis of the 14 subunit SWR1 chromatin remodeler identifies the orientation of its functional modules and reveals conformational changes induced by nucleosome binding.
The rigorous testing of hypotheses on suitable sample cohorts is a major limitation in translational research. This is particularly the case for the validation of protein biomarkers where the lack of accurate, reproducible and sensitive assays for most proteins has precluded the systematic assessment of hundreds of potential marker proteins described in the literature.
Here, we describe a high throughput method for the development and refinement of selected reaction monitoring (SRM) assays for human proteins. The method was applied to generate such assays for more than 1000 cancer-associated proteins, which are functionally related to candidate cancer driver mutations. We used the assays to determine the detectability of the target proteins in two clinically relevant samples, plasma and urine. 182 proteins were detected in depleted plasma, spanning five orders of magnitude in abundance and reaching below a concentration of 10 ng/mL. The narrower concentration range of proteins in urine allowed the detection of 408 proteins. Moreover, we demonstrate that these SRM assays allow the reproducible quantification of 34 biomarker candidates across 84 patient plasma samples. Through public access to the entire assay library, which will also be expandable in the future, researchers will be able to target their cancer-associated proteins of interest in any sample type using the detectability information in plasma and urine as a guide. The generated reference map of SRM assays for cancer-associated proteins is a valuable resource for accelerating and planning biomarker verification studies.
Quantitative measurement of proteins involved in insulin signaling and central metabolism in C57BL/6J and 129Sv mice subjected to a sustained high-fat diet reveals that the two strains diverge early in their response to the feeding regimen.
Quantitative targeted protein measurements were designed to quantify murine proteins covering the insulin-signaling pathway
and the lipid and carbohydrate metabolism and used to compare the differential effect of a persistent high-fat diet in C57BL/6J
and 129Sv mouse strains.Differential effect of a persistent high-fat diet were compared in C57BL/6J and 129Sv mouse strains.Differences in protein abundances suggest that peroxisomal β-oxidation is actively promoted in fatty C57BL/6J mice whereas
lipogenesis activation dominates the response of 129Sv mice.Most strain-specific changes were apparent early in the regimen when phenotypic changes were already set, but not yet very
pronounced and they allow a clear discrimination of the mouse strains at an early stage during the long-term high-fat diet.Persistent high-fat diet also alters the transient changes that normally occur in C57BL/6J and 129Sv mice in response to fasting
or food intake.
The metabolic syndrome is a collection of risk factors including obesity, insulin resistance and hepatic steatosis, which occur together and increase the risk of diseases such as diabetes, cardiovascular disease and cancer. In spite of intense research, the complex etiology of insulin resistance and its association with the accumulation of triacylglycerides in the liver and with hepatic steatosis remains not completely understood. Here, we performed quantitative measurements of 144 proteins involved in the insulin-signaling pathway and central metabolism in liver homogenates of two genetically well-defined mouse strains C57BL/6J and 129Sv that were subjected to a sustained high-fat diet. We used targeted mass spectrometry by selected reaction monitoring (SRM) to generate accurate and reproducible quantitation of the targeted proteins across 36 different samples (12 conditions and 3 biological replicates), generating one of the largest quantitative targeted proteomics data sets in mammalian tissues. Our results revealed rapid response to high-fat diet that diverged early in the feeding regimen, and evidenced a response to high-fat diet dominated by the activation of peroxisomal β-oxidation in C57BL/6J and by lipogenesis in 129Sv mice.
liver; metabolic syndrome; NAFLD; proteomics; SRM
Affinity purification coupled with mass spectrometry (AP-MS) is now a widely used approach for the identification of protein-protein interactions. However, for any given protein of interest, determining which of the identified polypeptides represent bona fide interactors versus those that are background contaminants (e.g. proteins that interact with the solid-phase support, affinity reagent or epitope tag) is a challenging task. While the standard approach is to identify nonspecific interactions using one or more negative controls, most small-scale AP-MS studies do not capture a complete, accurate background protein set. Fortunately, since negative controls are largely bait-independent, we reasoned that the negative controls generated by the proteomics research community could be developed as a resource for scoring AP-MS data.
Here we present the Contaminant Repository for Affinity Purification (The CRAPome), currently containing AP-MS data from 343 control purifications conducted by 11 different research groups (www.crapome.org). Users employ an intuitive graphical user interface to explore the database, by either querying one protein at a time, downloading background contaminant lists for selected experimental conditions, or uploading their own data (alongside their own negative controls when available) and performing data analysis. The CRAPome database scores contaminants vs. true interactors based on semi-quantitative mass spectrometry data (normalized spectral counts) embedded in most mass spectrometry experiments. The Significance Analysis of INTeractome (SAINT) scoring scheme, in addition to a simpler Fold Change calculation (FC score) are used to score user-supplied data and return a ranked list of putative interactors. We also describe database structure and composition, provide examples of the use of this resource to filter contaminants with properly chosen controls, and demonstrate the utility of the scoring scheme for identifying bona fide interaction partners. The CRAPome accommodates a variety of purification schemes and, while currently focused on human data, will be expanded to other species.
Metabolome, proteome and physiology measurements were combined with mathematical modeling to unravel the temporal regulation of the metabolic fluxes during the diauxic shift in Saccharomyces cerevisiae.
The diauxic shift involves three main events: a reduction in the glycolytic flux and the production of storage compounds before glucose depletion; the reversion of carbon flow through glycolysis and onset of the glyoxylate cycle operation upon glucose exhaustion; and the shutting down of the pentose phosphate (PP) pathway with a change in the source of NADPH regeneration.The redistribution of fluxes toward the production of storage compounds prior glucose depletion drives glycolytic reactions closer to equilibrium, which is essential for the reversion of fluxes upon glucose exhaustion.The onset of the glyoxylate cycle is quantitatively more important than the activation of the tricarboxylic acid cycle for growth on ethanol.Flux through the PP pathway is halted in the later stages of the adaptation and NADPH regeneration is taken over by NADP-dependent enzymes in the glyoxylate cycle and ethanol metabolism.
The diauxic shift in Saccharomyces cerevisiae is an ideal model to study how eukaryotic cells readjust their metabolism from glycolytic to gluconeogenic operation. In this work, we generated time-resolved physiological data, quantitative metabolome (69 intracellular metabolites) and proteome (72 enzymes) profiles. We found that the diauxic shift is accomplished by three key events that are temporally organized: (i) a reduction in the glycolytic flux and the production of storage compounds before glucose depletion, mediated by downregulation of phosphofructokinase and pyruvate kinase reactions; (ii) upon glucose exhaustion, the reversion of carbon flow through glycolysis and onset of the glyoxylate cycle operation triggered by an increased expression of the enzymes that catalyze the malate synthase and cytosolic citrate synthase reactions; and (iii) in the later stages of the adaptation, the shutting down of the pentose phosphate pathway with a change in NADPH regeneration. Moreover, we identified the transcription factors associated with the observed changes in protein abundances. Taken together, our results represent an important contribution toward a systems-level understanding of how this adaptation is realized.
diauxic shift; fluxome; metabolome; proteome; Saccharomyces cerevisiae
We report a high quality and system-wide proteome catalogue covering 71% (3,542 proteins) of the predicted genes of fission yeast, Schizosaccharomyces pombe, presenting the largest protein dataset to date for this important model organism. We obtained this high proteome and peptide (11.4 peptides/protein) coverage by a combination of extensive sample fractionation, high resolution Orbitrap mass spectrometry, and combined database searching using the iProphet software as part of the Trans-Proteomics Pipeline. All raw and processed data are made accessible in the S. pombe PeptideAtlas. The identified proteins showed no biases in functional properties and allowed global estimation of protein abundances. The high coverage of the PeptideAtlas allowed correlation with transcriptomic data in a system-wide manner indicating that post-transcriptional processes control the levels of at least half of all identified proteins. Interestingly, the correlation was not equally tight for all functional categories ranging from rs >0.80 for proteins involved in translation to rs <0.45 for signal transduction proteins. Moreover, many proteins involved in DNA damage repair could not be detected in the PeptideAtlas despite their high mRNA levels, strengthening the translation-on-demand hypothesis for members of this protein class. In summary, the extensive and publicly available S. pombe PeptideAtlas together with the generated proteotypic peptide spectral library will be a useful resource for future targeted, in-depth, and quantitative proteomic studies on this microorganism.