Cells respond to environmental stimuli via specialized signaling pathways. Concurrent stimuli trigger multiple pathways that integrate information, predominantly via protein phosphorylation. Budding yeast responds to NaCl and pheromone via two mitogen-activated protein kinase cascades, the high osmolarity, and the mating pathways, respectively. To investigate signal integration between these pathways, we quantified the time-resolved phosphorylation site dynamics after pathway co-stimulation. Using shotgun mass spectrometry, we quantified 2,536 phosphopeptides across 36 conditions. Our data indicate that NaCl and pheromone affect phosphorylation events within both pathways, which thus affect each other at more levels than anticipated, allowing for information exchange and signal integration. We observed a pheromone-induced down-regulation of Hog1 phosphorylation due to Gpd1, Ste20, Ptp2, Pbs2, and Ptc1. Distinct Ste20 and Pbs2 phosphosites responded differently to the two stimuli, suggesting these proteins as key mediators of the information exchange. A set of logic models was then used to assess the role of measured phosphopeptides in the crosstalk. Our results show that the integration of the response to different stimuli requires complex interconnections between signaling pathways.
cell signaling network; crosstalk; HOG pathway; pheromone pathway; phosphoproteomics
Tandem mass (MS/MS) spectra generated by collision-induced dissociation (CID) typically lack redundant peptide sequence information in the form of e.g. b- and y-ion series due to frequent use of sequence-specific endopeptidases cleaving C- or N-terminal to Arg or Lys residues.
Here we introduce arginyl-tRNA protein transferase (ATE, EC 22.214.171.124) for proteomics. ATE recognizes acidic amino acids or oxidized Cys at the N-terminus of a substrate peptide and conjugates an arginine from an aminoacylated tRNAArg onto the N-terminus of the substrate peptide. This enzymatic reaction is carried out under physiological conditions and, in combination with Lys-C/Asp-N double digest, results in arginylated peptides with basic amino acids on both termini.
We demonstrate that in vitro arginylation of peptides using yeast arginyl tRNA protein transferase 1 (yATE1) is a robust enzymatic reaction, specific to only modifying N-terminal acidic amino acids. Precursors originating from arginylated peptides generally have an increased protonation state compared with their non-arginylated forms. Furthermore, the product ion spectra of arginylated peptides show near complete 2× fragment ladders within the same MS/MS spectrum using commonly available electrospray ionization peptide fragmentation modes. Unexpectedly, arginylated peptides generate complete y- and c-ion series using electron transfer dissociation (ETD) despite having an internal proline residue.
We introduce a rapid enzymatic method to generate peptides flanked on either terminus by basic amino acids, resulting in a rich, redundant MS/MS fragment pattern. © 2014 The Authors. Rapid Communications in Mass Spectrometry published by John Wiley & Sons Ltd.
We describe a method that integrates data derived from different mass spectrometric (MS) techniques with a modelling strategy for structural characterization of protein assemblies. We encoded structural data derived from native MS, bottom-up proteomics, ion mobility-MS and chemical cross-linking MS into modelling restraints to compute the most likely structure of a protein assembly. We used the method to generate near-native models for three known structures and characterized an assembly intermediate of the proteasomal base.
T cell antigen receptor (TCR)-mediated T cell activation requires the interaction of dozens of proteins. We used quantitative mass spectrometry and activated primary CD4+ T cells from mice in which a tag for affinity purification was knocked into several genes to determine the composition and dynamics of multiprotein complexes forming around the kinase Zap70 and the adaptors Lat and SLP-76. Most of the 112 high confidence time-resolved protein interactions we observed were novel. The CD6 surface receptor was found capable of initiating its own signaling pathway by recruiting SLP-76 and Vav1, irrespective of the presence of Lat. Our findings provide a more complete model of TCR signaling in which CD6 constitutes a signaling hub contributing to TCR signal diversification.
Eukaryotic translation initiation requires the recruitment of the large, multiprotein eIF3 complex to the 40S ribosomal subunit. We present X-ray structures of all major components of the minimal, six-subunit Saccharomyces cerevisiae eIF3 core. These structures, together with electron microscopy reconstructions, cross-linking coupled to mass spectrometry, and integrative structure modeling, allowed us to position and orient all eIF3 components on the 40S⋅eIF1 complex, revealing an extended, modular arrangement of eIF3 subunits. Yeast eIF3 engages 40S in a clamp-like manner, fully encircling 40S to position key initiation factors on opposite ends of the mRNA channel, providing a platform for the recruitment, assembly, and regulation of the translation initiation machinery. The structures of eIF3 components reported here also have implications for understanding the architecture of the mammalian 43S preinitiation complex and the complex of eIF3, 40S, and the hepatitis C internal ribosomal entry site RNA.
•X-ray structures of major yeast eIF3 components and subcomplexes•Crosslinking coupled to mass-spectrometry analysis of 40S⋅eIF1⋅eIF3 complex•Integrative modeling reveals architecture of 40S⋅eIF1⋅eIF3 complex
A hybrid approach drawing on X-ray structures, crosslinking coupled to mass spectrometry, electron microscopy, and integrative modeling yields mechanistic insights into how eIF3 coordinates translation initiation.
The Cop9 signalosome complex (CSN) regulates the functional cycle of the major E3 ubiquitin ligase family, the cullin RING E3 ubiquitin ligases (CRLs). Activated CRLs are covalently modified by the ubiquitin-like protein Nedd8 (neural precursor cell expressed developmentally down-regulated protein 8). CSN serves an essential role in myriad cellular processes by reversing this modification through the isopeptidase activity of its CSN5 subunit. CSN5 alone is inactive due to an auto-inhibited conformation of its catalytic domain. Here we report the molecular basis of CSN5 catalytic domain activation and unravel a molecular hierarchy in CSN deneddylation activity. The association of CSN5 and CSN6 MPN (for Mpr1/Pad1 N-terminal) domains activates its isopeptidase activity. The CSN5/CSN6 module, however, is inefficient in CRL deneddylation, indicating a requirement of further elements in this reaction such as other CSN subunits. A hybrid molecular model of CSN5/CSN6 provides a structural framework to explain these functional observations. Docking this model into a published CSN electron density map and using distance constraints obtained from cross-linking coupled to mass-spectrometry, we find that the C-termini of the CSN subunits could form a helical bundle in the centre of the structure. They likely play a key scaffolding role in the spatial organization of CSN and precise positioning of the dimeric MPN catalytic core.
Characterizing changes in protein-protein interactions associated with sequence variants (e.g. disease-associated mutations or splice forms) or following exposure to drugs, growth factors or hormones is critical to understanding how protein complexes are built, localized and regulated. Affinity purification (AP) coupled with mass spectrometry permits the analysis of protein interactions under near-physiological conditions, yet monitoring interaction changes requires the development of a robust and sensitive quantitative approach, especially for large-scale studies where cost and time are major considerations. To this end, we have coupled AP to data-independent mass spectrometric acquisition (SWATH), and implemented an automated data extraction and statistical analysis pipeline to score modulated interactions. Here, we use AP-SWATH to characterize changes in protein-protein interactions imparted by the HSP90 inhibitor NVP-AUY922 or melanoma-associated mutations in the human kinase CDK4. We show that AP-SWATH is a robust label-free approach to characterize such changes, and propose a scalable pipeline for systems biology studies.
Research advancing our understanding of Mycobacterium tuberculosis (Mtb) biology and complex host-Mtb interactions requires consistent and precise quantitative measurements of Mtb proteins. We describe the generation and validation of a compendium of assays to quantify 97% of the 4,012 annotated Mtb proteins by the targeted mass spectrometric method selected reaction monitoring (SRM). Furthermore, we estimate the absolute abundance for 55% of all Mtb proteins, revealing a dynamic range within the Mtb proteome of over four orders of magnitude, and identify previously un-annotated proteins. As an example of the assay library utility, we monitored the entire Mtb dormancy survival regulon (DosR), which is linked to anaerobic survival and Mtb persistence, and show its dynamic protein-level regulation during hypoxia. In conclusion, we present a publicly available research resource that supports the sensitive, precise, and reproducible quantification of virtually any Mtb protein by a robust and widely accessible mass spectrometric method.
Tight spatio-temporal signaling of cytoskeletal and adhesion dynamics is required for localized membrane protrusion that drives directed cell migration. Different ensembles of proteins are therefore likely to get recruited and phosphorylated in membrane protrusions in response to specific cues.
Here, we use an assay that allows to biochemically purify extending protrusions of cells migrating in response to three prototypical receptors: integrins, recepor tyrosine kinases and G-coupled protein receptors. Using quantitative proteomics and phospho-proteomics approaches, we provide evidence for the existence of cue-specific, spatially distinct protein networks in the different cell migration modes.
The integrated analysis of the large-scale experimental data with protein information from databases allows us to understand some emergent properties of spatial regulation of signaling during cell migration. This provides the cell migration community with a large-scale view of the distribution of proteins and phospho-proteins regulating directed cell migration.
Fibroblast; Directional cell migration; Signaling; Proteomics; Phosphorylation
This White Paper sets out a Life Sciences Grand Challenge for Proteomics Technologies to enhance our understanding of complex biological systems, link genomes with phenotypes, and bring broad benefits to the biosciences and the US economy. The paper is based on a workshop hosted by the National Institute of Standards and Technology (NIST) in Gaithersburg, MD, 14–15 February 2011, with participants from many federal R&D agencies and research communities, under the aegis of the US National Science and Technology Council (NSTC). Opportunities are identified for a coordinated R&D effort to achieve major technology-based goals and address societal challenges in health, agriculture, nutrition, energy, environment, national security, and economic development.
Complex systems; Democratization of proteomics; Economic growth; Grand challenges; Integration; Systems biology
Quality control is increasingly recognized as a crucial aspect of mass spectrometry based proteomics. Several recent papers discuss relevant parameters for quality control and present applications to extract these from the instrumental raw data. What has been missing, however, is a standard data exchange format for reporting these performance metrics. We therefore developed the qcML format, an XML-based standard that follows the design principles of the related mzML, mzIdentML, mzQuantML, and TraML standards from the HUPO-PSI (Proteomics Standards Initiative). In addition to the XML format, we also provide tools for the calculation of a wide range of quality metrics as well as a database format and interconversion tools, so that existing LIMS systems can easily add relational storage of the quality control data to their existing schema. We here describe the qcML specification, along with possible use cases and an illustrative example of the subsequent analysis possibilities. All information about qcML is available at http://code.google.com/p/qcml.
Motivation: The determination of absolute quantities of proteins in biological samples is necessary for multiple types of scientific inquiry. While relative quantification has been commonly used in proteomics, few proteomic datasets measuring absolute protein quantities have been reported to date. Various technologies have been applied using different types of input data, e.g. ion intensities or spectral counts, as well as different absolute normalization strategies. To date, a user-friendly and transparent software supporting large-scale absolute protein quantification has been lacking.
Results: We present a bioinformatics tool, termed aLFQ, which supports the commonly used absolute label-free protein abundance estimation methods (TopN, iBAQ, APEX, NSAF and SCAMPI) for LC-MS/MS proteomics data, together with validation algorithms enabling automated data analysis and error estimation.
Availability and implementation:
aLFQ is written in R and freely available under the GPLv3 from CRAN (http://www.cran.r-project.org). Instructions and example data are provided in the R-package. The raw data can be obtained from the PeptideAtlas raw data repository (PASS00321).
Supplementary data are available at Bioinformatics online.
Complete reference maps or datasets, like the genomic map of an organism, are highly beneficial tools for biological and biomedical research. Attempts to generate such reference datasets for a proteome so far failed to reach complete proteome coverage, with saturation apparent at approximately two thirds of the proteomes tested, even for the most thoroughly characterized proteomes. Here, we used a strategy based on high-throughput peptide synthesis and mass spectrometry to generate a close to complete reference map (97% of the genome-predicted proteins) of the S. cerevisiae proteome. We generated two versions of this mass spectrometric map one supporting discovery- (shotgun) and the other hypothesis-driven (targeted) proteomic measurements. The two versions of the map, therefore, constitute a complete set of proteomic assays to support most studies performed with contemporary proteomic technologies. The reference libraries can be browsed via a web-based repository and associated navigation tools. To demonstrate the utility of the reference libraries we applied them to a protein quantitative trait locus (pQTL) analysis, which requires measurement of the same peptides over a large number of samples with high precision. Protein measurements over a set of 78 S. cerevisiae strains revealed a complex relationship between independent genetic loci, impacting on the levels of related proteins. Our results suggest that selective pressure favors the acquisition of sets of polymorphisms that maintain the stoichiometry of protein complexes and pathways.
S. cerevisiae; selected reaction monitoring; SRM; MRM; spectral library; peptide library; mass spectrometric map; protein QTL
Quantifying the similarity of spectra is an important task in various areas of spectroscopy, for example, to identify a compound by comparing sample spectra to those of reference standards. In mass spectrometry based discovery proteomics, spectral comparisons are used to infer the amino acid sequence of peptides. In targeted proteomics by selected reaction monitoring (SRM) or SWATH MS, predetermined sets of fragment ion signals integrated over chromatographic time are used to identify target peptides in complex samples. In both cases, confidence in peptide identification is directly related to the quality of spectral matches. In this study, we used sets of simulated spectra of well-controlled dissimilarity to benchmark different spectral comparison measures and to develop a robust scoring scheme that quantifies the similarity of fragment ion spectra. We applied the normalized spectral contrast angle score to quantify the similarity of spectra to objectively assess fragment ion variability of tandem mass spectrometric datasets, to evaluate portability of peptide fragment ion spectra for targeted mass spectrometry across different types of mass spectrometers and to discriminate target assays from decoys in targeted proteomics. Altogether, this study validates the use of the normalized spectral contrast angle as a sensitive spectral similarity measure for targeted proteomics, and more generally provides a methodology to assess the performance of spectral comparisons and to support the rational selection of the most appropriate similarity measure. The algorithms used in this study are made publicly available as an open source toolset with a graphical user interface.
Bulk degradation of cytoplasmic material is mediated by a highly conserved intracellular trafficking pathway termed autophagy. This pathway is characterized by the formation of double-membrane vesicles termed autophagosomes engulfing the substrate and transporting it to the vacuole/lysosome for breakdown and recycling. The Atg1/ULK1 kinase is essential for this process; however, little is known about its targets and the means by which it controls autophagy. Here we have screened for Atg1 kinase substrates using consensus peptide arrays and identified three components of the autophagy machinery. The multimembrane-spanning protein Atg9 is a direct target of this kinase essential for autophagy. Phosphorylated Atg9 is then required for the efficient recruitment of Atg8 and Atg18 to the site of autophagosome formation and subsequent expansion of the isolation membrane, a prerequisite for a functioning autophagy pathway. These findings show that the Atg1 kinase acts early in autophagy by regulating the outgrowth of autophagosomal membranes.
•The Atg1 kinase phosphorylation consensus was identified on peptide arrays•Atg9 is a direct target of the Atg1/ULK1 kinase in vitro and in vivo•Atg9 phosphorylation recruits Atg18 and Atg8 to the PAS•Atg9 phosphorylation is required for isolation membrane expansion/autophagy function
Autophagy function is pivotal to cell health. Papinski et al. identify the phosphorylation consensus of the central kinase in this pathway, Atg1. The autophagy-related protein Atg9 is a direct target of Atg1. Atg9 phosphorylation by Atg1 is required for autophagosome formation. This finding sheds light on how Atg1 controls autophagy.
Pyrococcus Furiosus (Pfu) is an excellent organism to generate reference samples for proteomics labs because of its moderately sized genome and very little sequence duplication within the genome. We demonstrated a stable and consistent method to prepare proteins in bulk that eliminates growth and preparation as a source of uncertainty in the standard. We performed several proteomic studies in different laboratories using each laboratory's specific workflow as well as separate and integrated data analysis. This study demonstrated that a Pfu whole cell lysate provides suitable protein sample complexity to not only validate proteomic methods, work flows, and benchmark new instruments, but also to facilitate comparison of experimental data generated over time and across instruments or labs.
Pyrococcus furiosus (Pfu); Proteomics; Protein Complex Standard; MudPIT; OFFGEL electrophoresis; Directed LC-MS/MS
Affinity purification coupled with mass spectrometry (AP-MS) is now a widely used approach for the identification of protein-protein interactions. However, for any given protein of interest, determining which of the identified polypeptides represent bona fide interactors versus those that are background contaminants (e.g. proteins that interact with the solid-phase support, affinity reagent or epitope tag) is a challenging task. While the standard approach is to identify nonspecific interactions using one or more negative controls, most small-scale AP-MS studies do not capture a complete, accurate background protein set. Fortunately, negative controls are largely bait-independent. Hence, aggregating negative controls from multiple AP-MS studies can increase coverage and improve the characterization of background associated with a given experimental protocol. Here we present the Contaminant Repository for Affinity Purification (the CRAPome) and describe the use of this resource to score protein-protein interactions. The repository (currently available for Homo sapiens and Saccharomyces cerevisiae) and computational tools are freely available online at www.crapome.org.
Topoisomerase I (TOP1) inhibitors are an important class of anticancer drugs. The cytotoxicity of TOP1 inhibitors can be modulated by replication fork reversal, in a process that requires PARP activity. Whether regressed forks can efficiently restart and the factors required to restart fork progression after fork reversal are still unknown. Here we combined biochemical and electron microscopy approaches with single-molecule DNA fiber analysis, to identify a key role for human RECQ1 helicase in replication fork restart after TOP1 inhibition, not shared by other human RecQ proteins. We show that the poly(ADPribosyl)ation activity of PARP1 stabilizes forks in their regressed state by limiting their restart by RECQ1. These studies provide new mechanistic insights into the roles of RECQ1 and PARP in DNA replication and offer molecular perspectives to potentiate chemotherapeutic regimens based on TOP1 inhibition.
Adoption of targeted mass spectrometry (MS) approaches such as multiple reaction monitoring (MRM) to study biological and biomedical questions is well underway in the proteomics community. Successful application depends on the ability to generate reliable assays that uniquely and confidently identify target peptides in a sample. Unfortunately, there is a wide range of criteria being applied to say that an assay has been successfully developed. There is no consensus on what criteria are acceptable and little understanding of the impact of variable criteria on the quality of the results generated. Publications describing targeted MS assays for peptides frequently do not contain sufficient information for readers to establish confidence that the tests work as intended or to be able to apply the tests described in their own labs. Guidance must be developed so that targeted MS assays with established performance can be made widely distributed and applied by many labs worldwide. To begin to address the problems and their solutions, a workshop was held at the National Institutes of Health with representatives from the multiple communities developing and employing targeted MS assays. Participants discussed the analytical goals of their experiments and the experimental evidence needed to establish that the assays they develop work as intended and are achieving the required levels of performance. Using this “fit-for-purpose” approach, the group defined three tiers of assays distinguished by their performance and extent of analytical characterization. Computational and statistical tools useful for the analysis of targeted MS results were described. Participants also detailed the information that authors need to provide in their manuscripts to enable reviewers and readers to clearly understand what procedures were performed and to evaluate the reliability of the peptide or protein quantification measurements reported. This paper presents a summary of the meeting and recommendations.
Tissue homeostasis is controlled by signaling systems that coordinate cell proliferation, cell growth and cell shape upon changes in the cellular environment. Deregulation of these processes is associated with human cancer and can occur at multiple levels of the underlying signaling systems. To gain an integrated view on signaling modules controlling tissue growth, we analyzed the interaction proteome of the human Hippo pathway, an established growth regulatory signaling system. The resulting high‐resolution network model of 480 protein‐protein interactions among 270 network components suggests participation of Hippo pathway components in three distinct modules that all converge on the transcriptional co‐activator YAP1. One of the modules corresponds to the canonical Hippo kinase cassette whereas the other two both contain Hippo components in complexes with cell polarity proteins. Quantitative proteomic data suggests that complex formation with cell polarity proteins is dynamic and depends on the integrity of cell‐cell contacts. Collectively, our systematic analysis greatly enhances our insights into the biochemical landscape underlying human Hippo signaling and emphasizes multifaceted roles of cell polarity complexes in Hippo‐mediated tissue growth control.
AP‐MS; cell polarity; Hippo signaling; modularity; protein complex analysis
Application of a kinetic model of miRNA-mediated gene regulation to experimental data sets shows that the timescale of regulation is slower than previously assumed, due to bottlenecks imposed by miRNA turnover in the RNA-induced silencing complex and by slow protein decay.
A mathematical model links the dynamics of miRNA expression and loading into the Argonaute protein to the dynamics of miRNA targets.Loading of miRNAs into Argonaute and the slow decay of proteins impose two bottlenecks on the speed of miRNA-mediated regulation.Accelerated miRNA turnover is necessary for regulating target expression on the timescale of a day.
MiRNAs are post-transcriptional regulators that contribute to the establishment and maintenance of gene expression patterns. Although their biogenesis and decay appear to be under complex control, the implications of miRNA expression dynamics for the processes that they regulate are not well understood. We derived a mathematical model of miRNA-mediated gene regulation, inferred its parameters from experimental data sets, and found that the model describes well time-dependent changes in mRNA, protein and ribosome density levels measured upon miRNA transfection and induction. The inferred parameters indicate that the timescale of miRNA-dependent regulation is slower than initially thought. Delays in miRNA loading into Argonaute proteins and the slow decay of proteins relative to mRNAs can explain the typically small changes in protein levels observed upon miRNA transfection. For miRNAs to regulate protein expression on the timescale of a day, as miRNAs involved in cell-cycle regulation do, accelerated miRNA turnover is necessary.
gene expression regulation; kinetics; miRNAs; modeling; protein turnover
Deciphering physiological changes that mediate transition of Mycobacterium tuberculosis between replicating and nonreplicating states is essential to understanding how the pathogen can persist in an individual host for decades. We have combined RNA sequencing (RNA-seq) of 5′ triphosphate-enriched libraries with regular RNA-seq to characterize the architecture and expression of M. tuberculosis promoters. We identified over 4,000 transcriptional start sites (TSSs). Strikingly, for 26% of the genes with a primary TSS, the site of transcriptional initiation overlapped with the annotated start codon, generating leaderless transcripts lacking a 5′ UTR and, hence, the Shine-Dalgarno sequence commonly used to initiate ribosomal engagement in eubacteria. Genes encoding proteins with active growth functions were markedly depleted from the leaderless transcriptome, and there was a significant increase in the overall representation of leaderless mRNAs in a starvation model of growth arrest. The high percentage of leaderless genes may have particular importance in the physiology of nonreplicating M. tuberculosis.
•A resource for the identification of in vitro active promoters in M. tuberculosis•A quarter of all genes in M. tuberculosis are expressed as leaderless mRNAs•Leaderless mRNAs are differentially associated with toxin-antitoxin modules•Abundance of leaderless mRNAs increases during starvation-induced growth arrest
In this study, Cortes, Young, and colleagues report genome-wide mapping of transcriptional start sites combined with RNA sequencing and shotgun proteomics in the human pathogen Mycobacterium tuberculosis. A striking finding is a high proportion of genes expressed in the form of leaderless transcripts lacking the Shine-Dalgarno sequence conventionally required for translation. The distribution of functional gene classes between leaderless and Shine-Dalgarno transcriptomes suggests that changes in the specificity of translation may play a role in bacterial adaptation during infection.
Public repositories for proteomics data have accelerated proteomics research by enabling more efficient cross-analyses of datasets, supporting the creation of protein and peptide compendia of experimental results, supporting the development and testing of new software tools, and facilitating the manuscript review process. The repositories available to date have been designed to accommodate either shotgun experiments or generic proteomic data files. Here, we describe a new kind of proteomic data repository for the collection and representation of data from selected reaction monitoring (SRM) measurements. The PeptideAtlas SRM Experiment Library (PASSEL) allows researchers to easily submit proteomic data sets generated by SRM. The raw data are automatically processed in a uniform manner and the results are stored in a database, where they may be downloaded or browsed via a web interface that includes a chromatogram viewer. PASSEL enables cross-analysis of SRM data, supports optimization of SRM data collection, and facilitates the review process of SRM data. Further, PASSEL will help in the assessment of proteotypic peptide performance in a wide array of samples containing the same peptide, as well as across multiple experimental protocols.
data repository; MRM; software; SRM; targeted proteomics