The data described provide a comprehensive resource for the family-wide active site specificity portrayal of the human matrix metalloproteinase family. We used the high-throughput proteomic technique PICS (Proteomic Identification of protease Cleavage Sites) to comprehensively assay 9 different MMPs. We identified more than 4300 peptide cleavage sites, spanning both the prime and non-prime sides of the scissile peptide bond allowing detailed subsite cooperativity analysis. The proteomic cleavage data were expanded by kinetic analysis using a set of 6 quenched-fluorescent peptide substrates designed using these results. These datasets represent one of the largest specificity profiling efforts with subsequent structural follow up for any protease family and put the spotlight on the specificity similarities and differences of the MMP family. A detailed analysis of this data may be found in Eckhard et al. (2015) . The raw mass spectrometry data and the corresponding metadata have been deposited in PRIDE/ProteomeXchange with the accession number PXD002265.
Matrix metalloproteinases; MMPs; PICS; Proteomics; Quenched fluorescence; Specificity profiling; Cleavage sites
The Data described here provide the in depth proteomic assessment of the human dental pulp proteome and N-terminome (Eckhard et al., 2015) . A total of 9 human dental pulps were processed and analyzed by the positional proteomics technique TAILS (Terminal Amine Isotopic Labeling of Substrates) N-terminomics. 38 liquid chromatography tandem mass spectrometry (LC-MS/MS) datasets were collected and analyzed using four database search engines in combination with statistical downstream evaluation, to yield the by far largest proteomic and N-terminomic dataset of any dental tissue to date. The raw mass spectrometry data and the corresponding metadata have been deposited in ProteomeXchange with the PXD identifier ; Supplementary Tables described in this article are available via Mendeley Data (10.17632/555j3kk4sw.1).
Human dental pulp; N-terminome; Proteome; TAILS N-terminomics; Tandem mass spectrometry
Type II transmembrane serine proteases (TTSPs) are a family of cell membrane tethered serine proteases with unclear roles as their cleavage site specificities and substrate degradomes have not been fully elucidated. Indeed just 52 cleavage sites are annotated in MEROPS, the database of proteases, their substrates and inhibitors.
To profile the active site specificities of the TTSPs, we applied Proteomic Identification of protease Cleavage Sites (PICS). Human proteome-derived database searchable peptide libraries were assayed with six human TTSPs (matriptase, matriptase-2, matriptase-3, HAT, DESC and hepsin) to simultaneously determine sequence preferences on the N-terminal non-prime (P) and C-terminal prime (P’) sides of the scissile bond. Prime-side cleavage products were isolated following biotinylation and identified by tandem mass spectrometry. The corresponding non-prime side sequences were derived from human proteome databases using bioinformatics. Sequencing of 2,405 individual cleaved peptides allowed for the development of the family consensus protease cleavage site specificity revealing a strong specificity for arginine in the P1 position and surprisingly a lysine in P1′ position. TTSP cleavage between R↓K was confirmed using synthetic peptides. By parsing through known substrates and known structures of TTSP catalytic domains, and by modeling the remainder, structural explanations for this strong specificity were derived.
Degradomics analysis of 2,405 cleavage sites revealed a similar and characteristic TTSP family specificity at the P1 and P1′ positions for arginine and lysine in unfolded peptides. The prime side is important for cleavage specificity, thus making these proteases unusual within the tryptic-enzyme class that generally has overriding non-prime side specificity.
Clostridial collagenases are among the most efficient degraders of collagen. Most clostridia are saprophytes and secrete proteases to utilize proteins in their environment as carbon sources; during anaerobic infections, collagenases play a crucial role in host colonization. Several medical and biotechnological applications have emerged utilizing their high collagenolytic efficiency. However, the contribution of the functionally most important peptidase domain to substrate specificity remains unresolved. We investigated the active site sequence specificity of the peptidase domains of collagenase G and H from Clostridium histolyticum and collagenase T from Clostridium tetani. Both prime and non-prime cleavage site specificity were simultaneously profiled using Proteomic Identification of protease Cleavage Sites (PICS), a mass spectrometry-based method utilizing database searchable proteome-derived peptide libraries. For each enzyme we identified > 100 unique-cleaved peptides, resulting in robust cleavage logos revealing collagen-like specificity patterns: a strong preference for glycine in P3 and P1′, proline at P2 and P2′, and a slightly looser specificity at P1, which in collagen is typically occupied by hydroxyproline. This specificity for the classic collagen motifs Gly-Pro-X and Gly-X-Hyp represents a remarkable adaptation considering the complex requirements for substrate unfolding and presentation that need to be fulfilled before a single collagen strand becomes accessible for cleavage.
We demonstrate the striking sequence specificity of a family of clostridial collagenases using proteome derived peptide libraries and PICS, Proteomic Identification of protease Cleavage Sites. In combination with the previously published crystal structures of these proteases, our results represent an important piece of the puzzle in understanding the complex mechanism underlying collagen hydrolysis, and pave the way for the rational design of specific test substrates and selective inhibitors.
This article is part of a Special Issue entitled: Can Proteomics Fill the Gap Between Genomics and Phenotypes?
•Active site specificity profiling of 3 clostridial collagenases—ColG and H from C. histolyticum, and ColT from C. tetani.•Their high sequence specificity to collagen-like sequence points towards a co-evolution with the mammalian substrate.•Significant differences to MMPs and a more promiscuous cleavage mechanism facilitating rapid collagenolysis were revealed.•Human proteome-derived peptide libraries & PICS are suitable for active site specificity profiling of pathogenic proteases.•Results pave the way for rational design of test substrates and selective inhibitors.
Clostridia; Collagenase; MMPs; PICS; Mass spectrometry
During inflammation, local tissue responses are augmented by complement and acute phase proteins that exude into the tissue because of increased blood vessel permeability mediated by bradykinin, which is proteolytically released from kininogen. Moreover, proteases govern inflammatory responses by processing extracellular matrix proteins and soluble bioactive mediators; however, the consequences of their complex interplay in inflamed mammalian tissues are largely unknown. We quantified changes in the proteome and the nature of protein N-termini (the N-terminome) and the altered abundance of murine proteases and inhibitors during skin inflammation. Through analysis of the N-terminome by iTRAQ-terminal amine isotopic labeling of substrates (TAILS), we identified cotranslational and posttranslational αN-acetylation motifs, quantitative increases in protein abundance, and qualitative changes in the proteolytic signature during inflammation. Of the proteins identified in normal skin, 50% were cleaved, which increased to 60% during inflammation caused by phorbol esters, including chemokines and complement in which we identified previously uncharacterized cleavage sites. In mice deficient in matrix metalloproteinase 2 (MMP2), exudation of serum proteins was diminished compared to that in wild-type mice, and their proteolytic networks differed. We found that the complement 1 (C1) inhibitor was a key regulator linking these inflammatory responses. Cleavage and inactivation of the C1 inhibitor by MMP2 increased complement activation and bradykinin generation by plasma kallikrein in wild-type mice, leading to increased vessel permeability during inflammation. Thus, our degradomics analysis dissected proteolysis in skin inflammation and demonstrated perturbance of the proteolytic signaling network and its functional consequences arising from lack of a single protease.
Background: Bacterial collagenases degrade collagen substrates with high efficiency yet varying specificity.
Results: The newly identified calcium site, aspartate switch, and conformational selectivity filter regulate substrate access to the active sites of these collagenases.
Conclusion: The unanticipated dynamics of the substrate recognition sites plus zinc occupancy combine to tune the enzymatic activity.
Significance: The crystal structures provide a rational framework to understand and optimize the isoform-dependent collagenase activities.
Clostridial collagenases are among the most efficient enzymes to degrade by far the most predominant protein in the biosphere. Here we present crystal structures of the peptidases of three clostridial collagenase isoforms (ColG, ColH, and ColT). The comparison of unliganded and liganded structures reveals a quaternary subdomain dynamics. In the unliganded ColH structure, this globular dynamics is modulated by an aspartate switch motion that binds to the catalytic zinc. We further identified a calcium binding site in proximity to the catalytic zinc. Both ions are required for full activity, explaining why calcium critically affects the enzymatic activity of clostridial collagenases. Our studies further reveal that loops close to the active site thus serve as characteristic substrate selectivity filter. These elements explain the distinct peptidolytic and collagenolytic activities of these enzymes and provide a rational framework to engineer collagenases with customized substrate specificity as well as for inhibitor design.
Protease; Protein Degradation; Protein Structure; Proteolytic Enzymes; X-ray Crystallography; Collagenase; Metal Regulation
The ability of pathogenesis-related proteins of family 10 to bind a broad spectrum of ligands is considered to play a key role for their physiological and pathological functions. In particular, Bet v 1, an archetypical allergen from birch pollen, is described as a highly promiscuous ligand acceptor. However, the detailed recognition mechanisms, including specificity factors discriminating binding properties of naturally occurring Bet v 1 variants, are poorly understood.
Here, we report crystal structures of Bet v 1 variants in complex with an array of ligands at a resolution of up to 1.2 Å. Residue 30 within the hydrophobic pocket not only discriminates in high and low IgE binding Bet v 1 isoforms but also induces a drastic change in the binding mode of the model ligand deoxycholate. Ternary crystal structure complexes of Bet v 1 with several ligands together with the fluorogenic reporter 1-anilino-8-naphthalene sulfonate explain anomalous fluorescence binding curves obtained from 1-anilino-8-naphthalene sulfonate displacement assays. The structures reveal key interaction residues such as Tyr83 and rationalize both the binding specificity and promiscuity of the so-called hydrophobic pocket in Bet v 1.
The intermolecular interactions of Bet v 1 reveal an unexpected complexity that will be indispensable to fully understand its roles within the physiological and allergenic context.
► Ligand binding to Bet v 1 may contribute to explain its allergenicity. ► High-resolution structures reveal the binding mode of diverse ligands to Bet v 1. ► Residue 30 starkly influences the binding properties of different Bet v 1 isoforms. ► Ternary complexes with diverse ligands explain anomalous fluorescence binding curves. ► Betv1 isoforms differ in ligand binding, which may translate into their allergenicity.
ANS, 1-anilino-8-naphthalene sulfonate; BRA, brassinolide; DXC, deoxycholate; iDXC, inner deoxycholate; oDXC, outer deoxycholate; LPS, lipopolysaccharide; MPD, 2-methyl-2,4-pentanediol; NDSB-256, non-detergent sulfobetaine 256; PR-10, pathogenesis-related protein 10; PDB, Protein Data Bank; molecular allergenicity; ANS displacement assay; structure–allergenicity relationship; binding specificity and promiscuity; dressed allergens
Collagen constitutes one third of the body protein in humans, reflecting its extraordinary role in health and disease. Of similar importance, therefore, are the idiosyncratic proteases that nature evolved for collagen remodeling. Intriguingly, the most efficient collagenases are those that enable clostridial bacteria to colonize their host tissues, but despite intense studies, the structural and mechanistic basis of these enzymes has remained elusive. Here we present the crystal structure of collagenase G from Clostridium histolyticum at 2.55 Å resolution. By combining the structural data with enzymatic and mutagenesis studies, we derive a conformational two-state model of bacterial collagenolysis, in which the recognition and unraveling of collagen microfibrils into triple helices as well as the unwinding of the latter go hand in hand with collagenase opening and closing.
The catalytic domain of collagenase G from C. histolyticum was expressed in E. coli BL21 (DE3) and purified using affinity and size-exclusion column-chromatographic methods. Crystals were obtained at 290 K by the sitting-drop vapour-diffusion method and diffraction data have been collected to 2.75 Å resolution.
The catalytic domain of collagenase G from Clostridium histolyticum has been cloned, recombinantly expressed in Escherichia coli and purified using affinity and size-exclusion column-chromatographic methods. Crystals of the catalytic domain were obtained from 0.12 M sodium citrate and 23%(v/v) PEG 3350 at 293 K. The crystals diffracted to 2.75 Å resolution using synchrotron radiation. The crystals belong to an orthorhombic space group, with unit-cell parameters a = 57, b = 109, c = 181 Å. This unit cell is consistent with the presence of one molecule per asymmetric unit and a solvent content of approximately 53%.
collagenase G; Clostridium histolyticum
Clostridial collagenases are foe and friend: on the one hand, these enzymes enable host infiltration and colonization by pathogenic clostridia, and on the other hand, they are valuable biotechnological tools due to their capacity to degrade various types of collagen and gelatine. However, the demand for high-grade preparations exceeds supply due to their pathogenic origin and the intricate purification of homogeneous isoforms. We present the establishment of an Escherichia coli expression system for a variety of constructs of collagenase G (ColG) and H (ColH) from Clostridium histolyticum and collagenase T (ColT) from Clostridium tetani, mimicking the isoforms in vivo. Based on a setup of five different expression strains and two expression vectors, 12 different constructs were expressed, and a flexible purification platform was established, consisting of various orthogonal chromatography steps adaptable to the individual needs of the respective variant. This fast, cost-effective, and easy-to-establish platform enabled us to obtain at least 10 mg of highly pure mono-isoformic protein per liter of culture, ideally suited for numerous sophisticated downstream applications. This production and purification platform paves the way for systematic screenings of recombinant collagenases to enlighten the biochemical function and to identify key residues and motifs in collagenolysis.
Clostridial collagenases; Expression; Purification; Platform