PMCCPMCCPMCC

Search tips
Search criteria 

Advanced

 
Logo of nihpaAbout Author manuscriptsSubmit a manuscriptHHS Public Access; Author Manuscript; Accepted for publication in peer reviewed journal;
 
Curr Opin Chem Biol. Author manuscript; available in PMC 2010 December 1.
Published in final edited form as:
PMCID: PMC2787889
NIHMSID: NIHMS137402

Methods for the proteomic identification of protease substrates

Summary

Proteolysis is a key regulatory post-translational modification in diverse cellular processes including programmed cell death, immune function, and development. Tracking proteolytic events has become a focus of researchers assessing the downstream consequences of protease activation. In this review we summarize unbiased methods for identifying protease substrates and tracking the extent of cleavage, a field termed “degradomics”. These include one- and two-dimensional gel-based methods for identifying protease substrates, N-terminal peptide identification methods for simultaneously identifying substrates and cleavage sites, and approaches for quantitation of cleavage events during endogenous proteolysis. Individual methods have identified more than 300 caspase-cleaved targets during apoptosis suggesting broad future applications for these technologies.

Introduction: Proteolysis in Cellular Signaling

Proteolysis is both a catabolic process for metabolite recycling and energy generation[1] and an essential post-translational modification (PTM) that regulates intracellular and extracellular signal transduction.[24] Like other PTMs, proteolysis regulates protein function more rapidly than is possible via transcription and translation. Thus, its use is pivotal in responsive biological proccesses, for example in coagulation and the inflammatory response.[57] Site-specific proteolysis can also rapidly degrade numerous proteins in a controlled fashion, resulting in widespread morphological changes (e.g. in apoptosis or development).[8] In studies of other PTMs, such as phosphorylation or glycosylation, systems-level protein profiling has linked physiological stimuli to site-specific protein modifications.[9,10] However, while these modifications can be targeted by affinity chromatography, proteolyzed proteins have no unique chemical handle enabling their enrichment from complex mixtures. This review focuses on approaches to track proteolysis in complex mixtures by defining the sequence specificity of proteases, identifying their protein targets and sites of cleavage, and measuring the kinetics and extent of cleavage events. Apoptosis and the proteolytic activity of the caspase family of proteases are model systems that will be used as a semi-universal metric to compare the techniques.

Protease target identification: gel-based methods

Key efforts towards defining proteolytic substrates focus on the determination of primary sequence specificities. Diverse methods including substrate phage display [11,12] and mRNA display [13] identify small numbers of optimized cleavage sequences, while chemically- and enzymatically-derived peptide libraries interrogate the amino acid preferences for proteases at each subsite.[1417] While highly successful, peptide-centric efforts suffer from an inability to define the roll of higher-order protein structure on substrate determination. To address these deficiencies, a number of gel-based techniques have been developed to identify proteases’ protein substrates.

Two-dimensional differential gel electrophoresis (2D-DiGE) employs orthogonal electrophoresis methods to separate complex mixtures into resolvable spots.[18] Comparative staining reveals differences between proteolyzed and control samples that can be analyzed by MS (Figure 1a). A variation on this approach, diagonal electrophoresis, uses two dimensions of SDS-PAGE with an intermediate in-gel protease treatment to identify protease targets.[19] Proteins that migrate off the diagonal in the second, post-proteolysis dimension are likely substrates of the digesting protease (Figure 1b). 2D gel methods have been applied to caspase substrate identification, identifying 41, 13, and 15 putative substrates of caspases-1, -3, and -7 (Table 1).[19,20]

Figure 1
Gel-based methods for identifying protease substrates
Table 1
Comparison of methods for identifying caspase substrates

An alternative 1D gel-based method, termed PROTOMAP, has recently been described to identify cleavages that occur during Jurkat cell apoptosis.[21,22] Apoptotic and control cell lysates were separated by SDS-PAGE in adjacent lanes, and the gel lanes were sliced into bands (Figure 1c). Proteins in each band were identified by LC-MS/MS following in-gel trypsin digestion, and quantified by spectral counting. Proteins from apoptotic cells that decreased in intensity or shifted from higher to lower apparent molecular weight (261 of 1648 total proteins) were presumed to be caspase substrates. A caspase cleavage-derived peptide was detected directly in approximately one quarter of the proteolyzed proteins, and by overlaying the peptides of cleaved fragments onto a sequence map the authors could approximate the cleavage site for many others. However, without direct observation of a caspase-cleaved peptide, definitive cleavage sites cannot be assigned, and the possibility exists that non-caspase proteases are responsible for some of the observed proteolysis. A notable advantage of PROTOMAP is that an essentially infinite number of samples can be compared, limited only by the size of the gel. The authors exploited this to monitor a time course of apoptotic cleavage, and identified both early and late cleavage events during apoptosis and the relative stabilities of the resulting proteolytic fragments. Unfortunately, this technique does not enrich for proteolyzed proteins, and thus preferentially samples highly abundant proteins. Consequently, extensive MS instrument time is required for comprehensive peptide identification from low amounts (~100µg) of sample. It remains unclear whether the technique can be applied to study sparse proteolysis within a high background of unmodified proteins.

N-terminal identifications: Negative selections

Several groups have taken advantage of the semi-unique nature of protein N-termini to identify both the proteolyzed protein and the site of cleavage. Full-length proteins have a limited number of reactive nitrogen atoms corresponding to lysine ε-amines and N-terminal α-amines. Upon digestion with a residue specific protease (e.g. trypsin), C-terminal and internal peptides gain an additional α-amine. Two teams of researchers have used this additional functionality to remove internal peptides and identify sites of proteolysis. One approach, termed COmbined FRActional DIagonal Chromatography (COFRADIC), treats full-length proteins with N-hydroxysuccinimide acetate to acetylate all lysines and N-termini (Figure 2a–b).[2328] Subsequently, the proteins are trypsinized and the N-termini of internal peptides are capped with trinitrobenzenesulfonyl chloride, greatly increasing their hydrophobicity. The N-terminal peptides elute far earlier than the modified internal peptides on RP-HPLC, enabling their selective analysis by MS. By using an isotopically enriched amine-blocking group (e.g. trideuteroacetate) it is possible to differentiate blocked N-termini from endogenous N-terminal acetylation.[23] COFRADIC has identified 58 caspase-cleaved peptides from apoptotic Jurkat T lymphocytes,[23] 11 putative caspase-1 and 9 putative caspase-7 cleaved peptides in a caspase-1 treated lysate,[27] and 585 direct human granzyme B substrates from a granzyme B-treated lysate.[26] In a similar approach, Beynon and colleagues incubated an acetylated and trypsinized cell lysate with amine-reactive N-hydroxysuccinimide-activated beads, to capture internal peptides (Figure 2b).[29,30] Each of these techniques identifies proteins based on a single peptide. Inappropriate lengths (i.e. peptides that are too long or short) or poor ionizability make ~50% of tryptic peptides unsuitable for unambiguous identification via MS and thus limit the number of possible substrate identifications. Furthermore, incomplete capture of internal peptides can lead to a high background of false positives. On average, there are ~25 internal peptides for each N-terminal peptide in an acetylated lysate, so even small inefficiencies in their removal are highly problematic.

Figure 2
Methods for isolating N-terminal Peptides

N-terminal identifications: Positive selections

An alternative to removing internal peptides for N-terminal peptide identification is to positively enrich for the peptides of interest. Salvesen and colleagues have developed a method for enrichment based on selective guanidinylation of lysine residues using O-methylisourea (Figure 2c). O-methylisourea inefficiently modifies protein α-amines, leaving them available for subsequent chemical biotinylation. After digestion with trypsin, N-terminal peptides can be isolated with immobilized streptavidin.[31] In contrast to the two previous methods for N-terminal modification, this approach discards endogenously N-acetylated N-termini (~80% of all N-termini)[32] and thus increases the sensitivity for detecting proteolytic cleavages. The authors used their technique to investigate mitochondrial transit peptides from yeast, mouse, and human cells. They found a total of 34 transit peptides from 27 proteins, only 10 of which had been previously annotated, demonstrating the utility of this method for N-terminal discovery. It should be noted that this method depends on highly efficient modification of lysine residues and highly selective biotinylation of the remaining α-amines. Biotinylation of serine, threonine, or histidine side chains could lead to false identifications. In principle, database matching in the MS-analysis should distinguish real N-terminal peptides from spurious identifications, but incomplete peptide coverage, particularly close to the N-terminus, can introduce false positives.

We have employed subtiligase, an engineered variant of the bacterial protease subtilisin BPN’, to selectively label N-terminal peptides in a single step without lysine derivatization. Created by mutating the catalytic serine to a cysteine and modifying the geometry of this active site residue with a second point mutation (P225A), subtiligase has negligible amidase activity but remains active as an esterase.[3335] The thioester-enzyme intermediate formed with peptide ester substrates during the catalytic cycle is slowly hydrolyzed by water, but can be rapidly intercepted by N-terminal amines, allowing transfer of the N-terminal portion of the ester onto free amines. We have never observed transfer onto peptide or protein side chains, suggesting an enzymatic specificity thus far unachievable via small molecule approaches. Identification of N-terminal peptides was accomplished by treating cell lysates with subtiligase and a biotinylated-peptide ester (Figure 2c).[36,37] The resulting peptide mixture was trypsinized, N-terminal peptides were captured on streptavidin beads, and the desired peptides were released by cleavage of a TEV protease site in the original biotinylated peptide. TEV protease cleavage leaves a dipeptide tag that can be used to confirm true positives. In typical runs, >95% of identified peptides contain this tag. Using this method on apoptotic Jurkat cells, we identified 333 caspase-like cleavage sites on 292 protein substrates.[36] The broad scope of these substrates enabled the discovery of unexpected caspase enzymatic activities- most notably cleavage within helices and systematic cleavage of proteins within protein-protein complexes. Though highly specific, the efficiency of subtiligase N-terminal labeling is low, and thus this technique requires large amounts of material (typically 50–100 mg of a complex mixture per experiment) to identify numerous substrates. However, the highly enriched set of N-termini that result can be routinely analyzed in a single day via LC-MS/MS. Like all N-terminal identification procedures, the identification of one peptide per protein systematically excludes proteins with N-terminal peptides not readily identifiable via MS.

Quantitation of proteolysis

To date, the primary focus of protease-substrate classification techniques has been on identifying substrates and their cleavage sites. Efforts to understand the dynamics and extent of proteolysis could make significant contributions to our understanding of proteolysis in vivo. Simple, two-plex-quantitation can be metabolically, chemically, or enzymatically introduced into these techniques to evaluate relative levels of proteolysis in two samples (Figures 3a–b).[2,23,26,28,38] TMT-[38] and iTRAQ-based[39] quantitation allows for relative comparison of up to eight samples, enabling more detailed kinetic measurements or observations of proteolytic efficiency across multiple biological conditions (Figure 3c).[40] Application of these methods to N-terminal peptides, however, monitors only the appearance of new N-termini, leaving the extent of cleavage undetermined. PROTOMAP and other gel-based methods can theoretically analyze every peptide in the protein and thus can monitor both appearance of new peptides and the extent of cleavage.

Figure 3
Methods for relative peptide quantification

Conclusion

Recent technological developments have revolutionized the process of protease substrate identification. These advances have provided facile means to categorize a protease’s sequence specificity, in vitro substrates, and the in vivo proteolysis that results from biological stimuli. The two main approaches, gel-based methods and N-terminal identification, provide complementary information: gel-based methods do not enrich for the peptides of interest and thus may miss low abundance substrates, while protein identification based on enrichment of single peptides is more sensitive, but inherently limited by the nature of MS-database matching. The introduction of MS-based quantitation into these analyses will enable comprehensive and high-throughput profiling of substrate proteolysis. These advances should reveal the full scope of proteolysis in both normal and disease states.

Acknowledgements

We would like to thank Juan Diaz, J. T. Koerber, Sami Mahrus, and David Wildes for their helpful comments on the manuscript. This work is funded by the National Institute of Health F32-AI077177-01 (NJA), and R01 GM081051 (JAW), and from the Hartwell Foundation (JAW).

Footnotes

Publisher's Disclaimer: This is a PDF file of an unedited manuscript that has been accepted for publication. As a service to our customers we are providing this early version of the manuscript. The manuscript will undergo copyediting, typesetting, and review of the resulting proof before it is published in its final citable form. Please note that during the production process errors may be discovered which could affect the content, and all legal disclaimers that apply to the journal pertain.

Contributor Information

Nicholas J. Agard, Department of Pharmaceutical Chemistry, University of California, San Francisco, UCSF MC 2552, 1700 4th St, San Francisco, CA 94158, Nicholas.Agard/at/ucsf.edu.

James A. Wells, Department of Pharmaceutical Chemistry, University of California, San Francisco, UCSF MC 2330, 1700 4th St, San Francisco, CA 94158, Jim.Wells/at/ucsf.edu.

References

1. Muntz K. Protein dynamics and proteolysis in plant vacuoles. J Exp Bot. 2007;58:2391–2407. [PubMed]
2. Doucet A, Butler GS, Rodriguez D, Prudova A, Overall CM. Metadegradomics: toward in vivo quantitative degradomics of proteolytic post-translational modifications of the cancer proteome. Mol Cell Proteomics. 2008;7:1925–1951. [PubMed]
3. Kopan R, Ilagan MX. The canonical Notch signaling pathway: unfolding the activation mechanism. Cell. 2009;137:216–233. [PMC free article] [PubMed]
4. Brasier AR. The NF-kappaB regulatory network. Cardiovasc Toxicol. 2006;6:111–130. [PubMed]
5. Roberts HR, Monroe DM, Hoffman M. Molecular biology and biochemistry of the coagulation factors and pathways of hemostasis. In: Lichtman MA, Kipps JA, Kaushansky K, Beutler E, Seligsohn U, editors. Williams Hematology. edn 7th. Prchal JT: McGraw-Hill; 2006. pp. 1665–1695.
6. Ting JP, Willingham SB, Bergstralh DT. NLRs at the intersection of cell death and immunity. Nat Rev Immunol. 2008;8:372–379. [PubMed]
7. Scheidereit C. IkappaB kinase complexes: gateways to NF-kappaB activation and transcription. Oncogene. 2006;25:6685–6705. [PubMed]
8. Yi CH, Yuan J. The Jekyll and Hyde functions of caspases. Dev Cell. 2009;16:21–34. [PMC free article] [PubMed]
9. Oppermann FS, Gnad F, Olsen JV, Hornberger R, Greff Z, Keri G, Mann M, Daub H. Large-scale proteomics analysis of the human kinome. Mol Cell Proteomics. 2009 [PubMed]
10. Wells L, Vosseller K, Cole RN, Cronshaw JM, Matunis MJ, Hart GW. Mapping sites of O-GlcNAc modification using affinity tags for serine and threonine post-translational modifications. Mol Cell Proteomics. 2002;1:791–804. [PubMed]
11. Matthews DJ, Wells JA. Substrate phage: selection of protease substrates by monovalent phage display. Science. 1993;260:1113–1117. [PubMed]
12. Hansen M, Wind T, Blouse GE, Christensen A, Petersen HH, Kjelgaard S, Mathiasen L, Holtet TL, Andreasen PA. A urokinase-type plasminogen activator-inhibiting cyclic peptide with an unusual P2 residue and an extended protease binding surface demonstrates new modalities for enzyme inhibition. J Biol Chem. 2005;280:38424–38437. [PubMed]
13. Baggio R, Burgstaller P, Hale SP, Putney AR, Lane M, Lipovsek D, Wright MC, Roberts RW, Liu R, Szostak JW, et al. Identification of epitope-like consensus motifs using mRNA display. J Mol Recognit. 2002;15:126–134. [PubMed]
14. Backes BJ, Harris JL, Leonetti F, Craik CS, Ellman JA. Synthesis of positional-scanning libraries of fluorogenic peptide substrates to define the extended substrate specificity of plasmin and thrombin. Nat Biotechnol. 2000;18:187–193. [PubMed]
15. Thornberry NA, Rano TA, Peterson EP, Rasper DM, Timkey T, Garcia-Calvo M, Houtzager VM, Nordstrom PA, Roy S, Vaillancourt JP, et al. A combinatorial approach defines specificities of members of the caspase family and granzyme B. Functional relationships established for key mediators of apoptosis. J Biol Chem. 1997;272:17907–17911. [PubMed]
16. Diamond SL. Methods for mapping protease specificity. Curr Opin Chem Biol. 2007;11:46–51. [PubMed]
17. Schilling O, Overall CM Proteome-derived, database-searchable peptide libraries for identifying protease cleavage sites. Nat Biotechnol. 2008;26:685–694. [PubMed]
The authors create a peptide library by digesting lysates with residue-specific proteases. Subsequent treatment with proteases of interest followed by MS analysis enables the rapid identification of protease sequence specificities.
18. Tonge R, Shaw J, Middleton B, Rowlinson R, Rayner S, Young J, Pognan F, Hawkins E, Currie I, Davison M. Validation and development of fluorescence two-dimensional differential gel electrophoresis proteomics technology. Proteomics. 2001;1:377–396. [PubMed]
19. Shao W, Yeretssian G, Doiron K, Hussain SN, Saleh M. The caspase-1 digestome identifies the glycolysis pathway as a target during infection and septic shock. J Biol Chem. 2007;282:36321–36329. [PubMed]
20. Jang M, Park BC, Kang S, Lee do H, Cho S, Lee SC, Bae KH, Park SG. Mining of caspase-7 substrates using a degradomic approach. Mol Cells. 2008;26:152–157. [PubMed]
21. Dix MM, Simon GM, Cravatt BF Global mapping of the topography and magnitude of proteolytic events in apoptosis. Cell. 2008;134:679–691. [PubMed]
The authors use 1D gels coupled with quantitative MS to track apoptosis in human Jurkat cells. The use of label-free quantitation enabled a detailed kinetic evaluation of proteolysis during apoptosis.
22. Simon G, Dix MM, Cravatt BF. Comparative assessment of large-scale proteomic studies of apoptotic proteolysis. ACS Chem Biol. 2009 [PMC free article] [PubMed]
23. Van Damme P, Martens L, Van Damme J, Hugelier K, Staes A, Vandekerckhove J, Gevaert K Caspase-specific and nonspecific in vivo protein processing during Fas-induced apoptosis. Nat Methods. 2005;2:771–777. [PubMed]
The authors present the first example of N-terminal peptide identification for tracking proteolysis during apoptosis. This paper demonstrated the utility of globally tracking N-terminal identifications during apoptosis and set the stage for evaluating numerous additional proteases.
24. Gevaert K, Goethals M, Martens L, Van Damme J, Staes A, Thomas GR, Vandekerckhove J. Exploring proteomes and analyzing protein processing by mass spectrometric identification of sorted N-terminal peptides. Nat Biotechnol. 2003;21:566–569. [PubMed]
25. Van Damme P, Impens F, Vandekerckhove J, Gevaert K. Protein processing characterized by a gel-free proteomics approach. Methods Mol Biol. 2008;484:245–262. [PubMed]
26. Van Damme P, Maurer-Stroh S, Plasman K, Van Durme J, Colaert N, Timmerman E, De Bock PJ, Goethals M, Rousseau F, Schymkowitz J, et al. Analysis of protein processing by N-terminal proteomics reveals novel species-specific substrate determinants of granzyme B orthologs. Mol Cell Proteomics. 2009;8:258–272. [PubMed]
27. Lamkanfi M, Kanneganti TD, Van Damme P, Vanden Berghe T, Vanoverberghe I, Vandekerckhove J, Vandenabeele P, Gevaert K, Nunez G. Targeted peptidecentric proteomics reveals caspase-7 as a substrate of the caspase-1 inflammasomes. Mol Cell Proteomics. 2008;7:2350–2363. [PubMed]
28. Staes A, Van Damme P, Helsens K, Demol H, Vandekerckhove J, Gevaert K. Improved recovery of proteome-informative, protein N-terminal peptides by combined fractional diagonal chromatography (COFRADIC) Proteomics. 2008;8:1362–1370. [PubMed]
29. McDonald L, Beynon RJ. Positional proteomics: preparation of amino-terminal peptides as a strategy for proteome simplification and characterization. Nat Protoc. 2006;1:1790–1798. [PubMed]
30. McDonald L, Robertson DH, Hurst JL, Beynon RJ. Positional proteomics: selective recovery and analysis of N-terminal proteolytic peptides. Nat Methods. 2005;2:955–957. [PubMed]
31. Timmer JC, Enoksson M, Wildfang E, Zhu W, Igarashi Y, Denault JB, Ma Y, Dummitt B, Chang YH, Mast AE, et al. Profiling constitutive proteolytic events in vivo. Biochem J. 2007;407:41–48. [PubMed]
32. Brown JL, Roberts WK. Evidence that approximately eighty per cent of the soluble proteins from Ehrlich ascites cells are Nalpha-acetylated. J Biol Chem. 1976;251:1009–1014. [PubMed]
33. Abrahmsen L, Tom J, Burnier J, Butcher KA, Kossiakoff A, Wells JA. Engineering subtilisin and its substrates for efficient ligation of peptide bonds in aqueous solution. Biochemistry. 1991;30:4151–4159. [PubMed]
34. Chang TK, Jackson DY, Burnier JP, Wells JA. Subtiligase: a tool for semisynthesis of proteins. Proc Natl Acad Sci USA. 1994;91:12544–12548. [PubMed]
35. Jackson DY, Burnier J, Quan C, Stanley M, Tom J, Wells JA. A designed peptide ligase for total synthesis of ribonuclease A with unnatural catalytic residues. Science. 1994;266:243–247. [PubMed]
36. Mahrus S, Trinidad JC, Barkan DT, Sali A, Burlingame AL, Wells JA Global sequencing of proteolytic cleavage sites in apoptosis by specific labeling of protein N termini. Cell. 2008;134:866–876. [PubMed]
Using enzymatic rather than chemical methods for N-terminal peptide identification, the authors discover the largest current collection apoptotic cleavages. These substrates reveal unexpected biology of the caspases including a preference for cleavage within protrein-protein complexes.
37. Yoshihara HA, Mahrus S, Wells JA. Tags for labeling protein N-termini with subtiligase for proteomics. Bioorg Med Chem Lett. 2008;18:6000–6003. [PMC free article] [PubMed]
38. Dayon L, Hainard A, Licker V, Turck N, Kuhn K, Hochstrasser DF, Burkhard PR, Sanchez JC. Relative quantification of proteins in human cerebrospinal fluids by MS/MS using 6-plex isobaric tags. Anal Chem. 2008;80:2921–2931. [PubMed]
39. Ross PL, Huang YN, Marchese JN, Williamson B, Parker K, Hattan S, Khainovski N, Pillai S, Dey S, Daniels S, et al. Multiplexed protein quantitation in Saccharomyces cerevisiae using amine-reactive isobaric tagging reagents. Mol Cell Proteomics. 2004;3:1154–1169. [PubMed]
40. Hardt M, Witkowska HE, Webb S, Thomas LR, Dixon SE, Hall SC, Fisher SJ. Assessing the effects of diurnal variation on the composition of human parotid saliva: quantitative analysis of native peptides using iTRAQ reagents. Anal Chem. 2005;77:4947–4954. [PubMed]