Aspartyl aminopeptidase (DNPEP), with specificity towards an acidic amino acid at the N-terminus, is the only mammalian member among the poorly understood M18 peptidases. DNPEP has been implicated in protein and peptide metabolism, as well as in the renin-angiotensin system in blood pressure regulation. Despite previous enzyme and substrate characterization, structural details of DNPEP regarding ligand recognition and catalytic mechanism remain to be delineated.
The crystal structure of human DNPEP complexed with zinc and a substrate analogue aspartate-β-hydroxamate reveals a dodecameric machinery built by domain-swapped dimers, in agreement with electron microscopy data. A structural comparison with bacterial homologues identifies unifying catalytic features among the poorly understood M18 enzymes. The bound ligands in the active site also reveal the coordination mode of the binuclear zinc centre and a substrate specificity pocket for acidic amino acids.
The DNPEP structure provides a molecular framework to understand its catalysis that is mediated by active site loop swapping, a mechanism likely adopted in other M18 and M42 metallopeptidases that form dodecameric complexes as a self-compartmentalization strategy. Small differences in the substrate binding pocket such as shape and positive charges, the latter conferred by a basic lysine residue, further provide the key to distinguishing substrate preference. Together, the structural knowledge will aid in the development of enzyme-/family-specific aminopeptidase inhibitors.
Aminopeptidases (APs) catalyze the sequential removal of amino acids from the unblocked N-termini of protein or peptide substrates, a process necessary for intracellular metabolism and implicated in several human diseases. Most APs are metalloproteases and are classified based on substrate preference towards an acidic, basic or neutral amino acid at the P1 position of the scissile peptide bond. Very few acidic APs are known to date, the most extensively studied being the membrane-bound glutamyl aminopeptidase (ENPEP, also known as aminopeptidase A; EC 3.4.11.7). ENPEP, a Ca2+-activated enzyme, is involved in the renin-angiotensin system (RAS) by catalysing the conversion of angiotensin II to angiotensin III, a key regulator of blood pressure [4,5]. A second, cytosolic acidic AP has been reported in yeast, fungi and mammals, and termed aspartyl aminopeptidase (DNPEP, also known as DAP; EC 3.4.11.21) due to its preference for aspartate over glutamate at the P1 position [6-8]. In mammals, DNPEP is preferentially expressed and has high enzymatic activity in neurons and neuroendocrine tissues [6,9,10]. Its reported conversion of angiotensin I to angiotensin 2–10, and of angiotensin II to angiotensin III in vitro, implicates a role in RAS and regulation of blood pressure. Moreover, a mild antagonist effect of DNPEP towards the bone morphogenetic protein signalling pathway has recently been reported.
DNPEP is the sole mammalian entry for the M18 metallopeptidase family, which contains ~600 putative members from bacteria and eukaryotes. The M18 family, together with the M20, M28 and M42 families, is classified into the metalloprotease H (MH) clan of proteases on the basis of active site sequence conservation according to the MEROPS database [12,13]. Only a handful of M18 enzymes have been biochemically characterized in any detail; these include yeast vacuole aminopeptidase I (API, also known as Lap4), with a broad substrate specificity for non-polar amino acids, as well as yeast yhr113w (also known as Ape4) and mammalian DNPEP, which prefer an acidic amino acid. These M18 enzymes have been shown to homo-oligomerize, reminiscent of the self-compartmentalization strategy in the well-characterized proteasomes, which confers specificity towards unfolded polypeptides rather than folded proteins. However, the reported dodecameric form in yeast Lap4 and Ape4 [7,14] contrasts with the proposed octameric form in DNPEP.
Little is known about the structure-function relationship of DNPEP and other M18 members, which contain a binuclear metal centre in the active site but lack the signature Zn2+-binding sequence motif (HExxH+E) found in other metalloproteases such as ENPEP. Although several conserved histidines essential for catalysis have been identified in human DNPEP, their roles are yet to be elucidated. In this study we determined the crystal structure of human DNPEP (hDNPEP) complexed with catalytic Zn2+ and substrate analogue L-aspartate-β-hydroxamate (ABH), and confirmed its dodecameric architecture by electron microscopy (EM). The bound ABH ligand highlights the importance of a domain-swapped loop in constructing the active site and provides a structural basis for hDNPEP’s catalytic mechanism and substrate specificity. By comparison with available bacterial M18 structures we further develop a family-wide description of this unannotated peptidase family and suggest unifying catalytic features across the MH clan.
The structure of the hDNPEP·Zn2+·ABH complex (Figure 1A), determined at 2.2 Å resolution, is homologous to four unpublished bacterial M18 homologues with undefined enzyme and substrate properties (DALI Z-scores ~40, rmsd 1.9-2.7 Å and sequence identity 23-35%). Superposition of the structures reveals a common two-domain architecture consisting of the proteolytic and dimerization domains (Figure 1A and C), with the active site located in a concave groove at the domain interface. The globular proteolytic domain (aa 7–98 and 249–468 in hDNPEP) features a core nine-stranded β-sheet sandwiched between several α-helices and has a small five-stranded β-subdomain resting on top (Figure 1A). This proteolytic domain is highly similar among all M18 structures (rmsd ~1.5 Å). The dimerization domain, contributed by the central polypeptide stretch (aa 99–248 in hDNPEP), sits on top of the proteolytic domain (Figure 1A). This butterfly-shaped domain is built of two orthogonal β-sheets (five- and three-stranded, respectively) that share two tilted strands, β5 and β6, and also includes an extended β8-β9 loop that is important for active site formation (see next sections). Variations in the dimerization domain are observed among M18 enzymes, particularly in the location and spatial orientation of helices α3 and α4 and the connecting loop α3-α4. In hDNPEP loop α3-α4 is longer than the bacterial equivalents (Figure 1B), although it is partially disordered in our structure.
Structural comparison of M18 hDNPEP with members of other MH clan families (M20, M28 and M42) (Figure 1D) shows that the proteolytic domains of all four families can be superimposed well (pairwise rmsd ~2.3 Å), particularly in the core β-sheet and the binuclear metal centre. This structural homology suggests an evolutionarily conserved strategy for metal coordination and metal-assisted catalysis. Away from the proteolytic domain, however, the four families diverge structurally in the dimerization domain, with M28 members lacking this domain altogether (Figure 1D, right), a fact that is reflected in their different oligomeric states. The hDNPEP dimerization domain exhibits closer topology and orientation to the dodecameric M42 enzymes (Figure 1D, bottom) [18-20], but has a distinct fold and tertiary arrangement compared to the counterpart domain in M20 members (Figure 1D, top), which are known monomers or dimers [21,22]. This observation suggests a closer structural relationship of M18 with M42 enzymes than with M20 or M28 members, a feature not apparent from sequence-based comparisons. This is further supported by M18 and M42 members sharing similar oligomeric assembly and active site architecture (see following sections).
Application of the crystallographic 432 symmetry to the hDNPEP monomer results in a tetrahedron-shaped dodecamer built from six homodimers, a quaternary arrangement similar to M42 enzymes [15,18,19]. Each dimer, with internal two-fold symmetry on both vertical and horizontal axes (Figure 2A), is formed by extensive contacts that involve the swapping of loop β8-β9 between the two subunits (Additional file 1, Figure S1). Mediated by four-fold symmetry, the six dimers assemble into a tetrahedron (Figure 2B), with each dimer constituting one edge (~118 Å) of the tetrahedron (Figure 2B, inset). The tetrahedron has a 50 Å-diameter internal cavity harbouring all twelve active sites, which is accessible to the exterior through four wide and four narrow channels situated on the three-fold axes. The entrances of the wide channels, a triangular pore of 28 Å per side, are located at the centre of the four tetrahedron facets (Figure 2B and C, blue asterisks), while the four narrow channels have their openings (9 Å per side) on the four tetrahedron vertices (Figure 2B and C, yellow asterisks).
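The bookkeeping of this assembly can be illustrated with a short calculation. The sketch below idealizes the dodecamer as a regular tetrahedron with the ~118 Å edge quoted above; the regular-tetrahedron idealization, the function name and the derived radii are illustrative assumptions and are not measurements taken from the crystal structure.

```python
import math

def regular_tetrahedron(edge):
    """Dimensions and element counts of an idealized regular tetrahedron."""
    return {
        "height": edge * math.sqrt(2.0 / 3.0),         # vertex to opposite face
        "circumradius": edge * math.sqrt(6.0) / 4.0,   # centre to vertex
        "inradius": edge * math.sqrt(6.0) / 12.0,      # centre to face
        "edges": 6,
        "faces": 4,
        "vertices": 4,
    }

# Edge length (in Angstrom) taken from the text; everything else is derived.
geom = regular_tetrahedron(118.0)

# One dimer per edge, so two protomers per edge.
protomers = 2 * geom["edges"]
# One wide channel per facet and one narrow channel per vertex.
wide_channels = geom["faces"]
narrow_channels = geom["vertices"]

print(f"protomers: {protomers}, wide: {wide_channels}, narrow: {narrow_channels}")
print(f"idealized centre-to-vertex distance: {geom['circumradius']:.1f} A")
```

The element counts of a tetrahedron (six edges, four faces, four vertices) map directly onto the six dimers, four wide channels and four narrow channels described in the text.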
The hDNPEP dodecamer contrasts with an octameric arrangement previously deduced from native PAGE analysis. As an independent verification we performed EM image analysis, which revealed one homogeneous particle population on micrographs (Figure 2D). The 2D classification shows characteristic patches of density surrounding a central hole, corresponding to the three-fold symmetrical view down a wide channel at a facet of the tetrahedron complex (Figure 2E). The tetrahedron shape and dimensions from the EM projection are in excellent agreement with the crystallographic dodecamer (Figure 2F), lending support to its physiological relevance. While the oligomeric state of the bacterial M18 homologues is not reported, their crystal structures suggest the formation of dodecameric tetrahedrons similar to hDNPEP (Additional file 1, Figure S2), pointing towards a common self-compartmentalization strategy for catalysis.
We next analysed the wide and narrow channels in hDNPEP, which represent the only access routes between the twelve active sites in the central chamber and the exterior. Both channels in M18 hDNPEP are remarkably similar in topology to those of the M42 dodecameric tetrahedrons. The wide channels, each formed from three dimers (Additional file 1, Figure S3), are 20 Å in width and 28 Å in length with a large concave surface at the entrance lined by positively charged residues (Figure 3A and B). This wide channel, with a positive electrostatic environment that would complement the acidic N-termini of substrates, likely functions as an entrance for unfolded peptide substrates. The transit function, as well as the electrostatic complementarity as a basis for substrate discrimination, has been proposed for M42 tetrahedron aminopeptidases [18,23,24]. Consistent with this theory, mutation of His363 (one of the residues lining the channel) to a non-polar residue has an adverse effect on the kinetic properties of hDNPEP.
The narrow channels (Figure 3C) are located at the interface of three monomers that are constituents of different dimers, giving rise to an inner helical bundle with a β-barrel-like outer casing (Figure 3D and Additional file 1, Figure S3). The essential nature of this channel has been demonstrated for some tetrahedron aminopeptidases. In hDNPEP, we observed water, glycerol molecules and a hydrated Mg2+ ion within this channel (Figure 3C and D), suggesting a possible route for small molecules such as cleaved amino acids to exit after hydrolysis. The narrow channel may also provide a path for the translocation of metal ions (e.g. catalytic zinc), mediated by layers of charged residues within the channel. However, to achieve either transit function, slight conformational changes may be required to open up the channel pore considering its narrow width (~3 Å) (Figure 3C).
The active site is defined by the bound substrate analogue ABH and two zinc ions (Zn1 and Zn2) (Figure 4A and B), the latter likely carried through protein expression and purification, and confirmed by the fluorescence absorption profile of the crystals (data not shown). Zn1 and Zn2, bridged by Asp264, are 3.4 Å apart, consistent with the distances observed in other binuclear metalloproteases. Zn1 is further coordinated by Glu302 and His440, and Zn2 by His94 and Asp346 (Figure 4C). These five metal-coordinating residues (His94, Asp264, Glu302, Asp346 and His440) form an ‘H.D.E.D.H’ signature strictly conserved among DNPEP paralogues and M18 members (Figure 1B), providing an explanation for the abolished hDNPEP activity upon mutation of His94 and His440.
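The coordination scheme just described can be written down as a small lookup table. The following sketch encodes only the ligand assignments stated in the text; the dictionary layout and helper names are hypothetical conveniences chosen for illustration.

```python
# Zinc ligands as listed in the text (hDNPEP residue numbering).
ZINC_LIGANDS = {
    "Zn1": ["Asp264", "Glu302", "His440"],
    "Zn2": ["His94", "Asp264", "Asp346"],
}

ONE_LETTER = {"His": "H", "Asp": "D", "Glu": "E"}

def residue_number(label):
    """Extract the sequence position from a label such as 'Asp264'."""
    return int(label[3:])

def metal_signature(ligands):
    """Join the unique ligand residues, ordered by sequence position."""
    unique = {res for site in ligands.values() for res in site}
    ordered = sorted(unique, key=residue_number)
    return ".".join(ONE_LETTER[res[:3]] for res in ordered)

def bridging_residues(ligands):
    """Residues that coordinate every metal site in the table."""
    sites = [set(site) for site in ligands.values()]
    return sorted(set.intersection(*sites))

print(metal_signature(ZINC_LIGANDS))    # H.D.E.D.H
print(bridging_residues(ZINC_LIGANDS))  # ['Asp264']
```

Ordering the five unique ligands by sequence position reproduces the ‘H.D.E.D.H’ signature, and the single residue shared by both sites is the bridging Asp264.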
Additional coordination to the binuclear zinc is provided by the bound ABH molecule, a competitive inhibitor of hDNPEP [6,17]. ABH binds to the active site with the hydroxamate moiety towards the binuclear metal centre to contribute its carbonyl and hydroxyl oxygen atoms for zinc coordination (Figure 4C), while its amino-acid backbone protrudes into a cavity often known as the P1 substrate pocket (Figure 4B). ABH engages in a number of direct or water-mediated hydrogen bonds to Glu301 and Asp346 via the hydroxamate moiety, and to Tyr381, Lys374 and His349 via the amino acid backbone (Figure 4D). Of particular interest is an interaction between the hydroxamate carbonyl oxygen and His170 from the opposing subunit (His170b) of a dimer (Figure 4D). His170b sits at the tip of the β8-β9 loop from the neighbouring subunit that crosses over to complete the active site (Figure 4B and Additional file 1, Figure S1). Such loop swapping to translocate a distant ligand-binding residue into the active site is crucial to hDNPEP catalysis, as evidenced by a complete loss of activity in a His170Phe mutant. This histidine residue is also conserved in M42 enzymes, although in the available M42 structures the equivalent loops are disordered or partially disordered. This disorder could be due to the lack of bound substrate/analogue, suggesting that a substrate-induced conformational reorientation is necessary to complete the catalytic centre. Conservation of this histidine therefore implies that the loop-swapped active site is a common structural feature among M18 and M42 dodecamers built from dimeric units.
A possible catalytic mechanism for M18 hDNPEP is proposed (Figure 4E), on the assumption that the hydroxylamine nitrogen and carbonyl oxygen of the ABH hydroxamate represent where the amine and carbonyl groups of the substrate peptide would be coordinated by Zn2 and Zn1, respectively. A nucleophilic water molecule could feasibly occupy the position of the ABH hydroxyl oxygen and would be activated by Glu301 to attack the scissile bond. His170b can function to bind the peptide carbonyl oxygen and stabilize the tetrahedral intermediate. This mechanism is consistent with that proposed for other metallopeptidases.
The bound ABH provides a template to build dipeptide models of Asp-Ala and Glu-Ala into the active site in order to rationalize hDNPEP substrate specificity. For both peptides, the Asp and Glu sidechains fit into the P1 substrate pocket without steric constraints, while the mainchain is modelled onto the hydroxamate group of ABH in a position optimal for hydrolysis. The P1 substrate pocket (Additional file 1, Figure S4A) is created by strand β15 and the β16-α12 and β17-α13 loops, with the β17-α13 loop lining the wall and restricting the dimensions of the pocket. This limited space disfavours bulky hydrophobic residues, as illustrated by a structural comparison with the P1 pockets in M28 neutral aminopeptidases, where the equivalent loop is displaced away from the P1 pocket, thereby generating a large cavity for bulky residues such as Phe and Met (Additional file 1, Figure S4B and C).
The modelled Asp and Glu sidechains can engage in slightly different interactions with hDNPEP (Additional file 1, Figure S5). While the Asp carboxylate feasibly interacts with Lys374 and forms water-mediated hydrogen bonds with His349, the longer Glu sidechain can further penetrate this cavity and interact directly with Lys374, His349 and the nearby Tyr381. Our substrate models suggest that the strict preference for an acidic amino acid at the P1 position is conferred by positively charged and polar residues, such as Lys374 and His349. The use of electrostatic complementarity for substrate selectivity has a precedent in the M42 peptidase SpPepA. Consistent with this strategy, mutation of His349 in hDNPEP has been shown to weaken substrate binding affinity. Furthermore, the conservation of Lys374 only in M18 members with acidic aminopeptidase activity (e.g. yeast Ape4), but not in M18 ‘promiscuous’ peptidases (e.g. yeast Lap4, where the equivalent is Ser) (Figure 1B), provides a structure-based criterion to classify putative M18 sequences into potential aspartyl aminopeptidases (Lys374 conserved) or non-aspartyl aminopeptidases (Lys374 not conserved), facilitating subsequent enzymatic characterization. Using this criterion we propose that the structurally characterized bacterial M18 members, where the equivalent Lys374 positions are substituted (Figure 5), are unlikely to be aspartyl aminopeptidases.
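The proposed classification rule is simple enough to express as a function. The sketch below assumes that a sequence alignment has already identified the residue equivalent to hDNPEP Lys374 in each putative M18 sequence; the function name and the input format (one-letter residue codes keyed by enzyme name) are hypothetical, and only the three assignments stated in the text are used as examples.

```python
def classify_m18(residue_at_k374):
    """Apply the Lys374 criterion to the one-letter residue aligned to hDNPEP Lys374.

    The result is a prediction to guide enzymatic characterization,
    not a measured substrate specificity.
    """
    if residue_at_k374.upper() == "K":
        return "potential aspartyl aminopeptidase"
    return "potential non-aspartyl aminopeptidase"

# Residues at the Lys374-equivalent position as stated in the text.
examples = {
    "hDNPEP": "K",      # Lys374 itself
    "yeast Ape4": "K",  # Lys conserved
    "yeast Lap4": "S",  # equivalent residue is Ser
}

for enzyme, residue in examples.items():
    print(f"{enzyme}: {classify_m18(residue)}")
```

A sequence with any residue other than Lys at this position, as reported for the structurally characterized bacterial M18 members, falls into the second group.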
In summary, we provide a structural annotation of the M18 metallopeptidase family, highlighting common catalytic residues and oligomeric properties. In particular, a loop-swapped active site utilizing a residue from an adjacent subunit for catalysis is likely a common characteristic among M18 and M42 dodecameric aminopeptidases. Furthermore, the bound substrate analogue in the active site provides insight into the reaction mechanism and substrate specificity for hDNPEP, facilitating the next steps in the development of family-specific small-molecule binders to further probe its cellular role in metabolic pathways and disease.
A DNA fragment encoding hDNPEP aa 1–468 (Uniprot ID: Q9ULA0) was sub-cloned into the pNIC-CTHF vector, incorporating a C-terminal His6-tag and TEV protease site. The recombinant protein was expressed in E. coli BL21(DE3)-R3 by induction with 0.1 mM IPTG overnight at 18°C. Cells were harvested and homogenized in lysis buffer (50 mM HEPES pH 7.5, 500 mM NaCl, 5 mM imidazole, 5% glycerol). Protein was purified by affinity (Ni-Sepharose) and size exclusion chromatography (Superdex 200). The affinity tag was removed by His-tagged TEV protease and the TEV-cleaved protein was passed over Ni-Sepharose resin. Purified protein was stored at −80°C in 10 mM HEPES, pH 7.5, 500 mM NaCl, 5% glycerol and 0.5 mM TCEP.
hDNPEP (10 mg/ml) was pre-incubated with 5 mM L-aspartate-β-hydroxamate (ABH) and crystallized by sitting drop vapour diffusion at 20°C in a 150-nl drop by mixing protein and reservoir solution (15% w/v PEG 3350, 0.25 M MgCl2 and 0.1 M Tris–HCl pH 8.0) in a 2:1 ratio. Crystals were cryo-protected with mother liquor supplemented with 25% glycerol and flash-cooled in liquid nitrogen. Diffraction data were collected at Diamond Light Source beamline I03, and processed and scaled with MOSFLM and SCALA from the CCP4 suite.
hDNPEP crystals belong to the F-centred cubic space group F432 with unit cell parameters a,b,c=224.6 Å and α,β,γ=90.0°. The asymmetric unit contains one hDNPEP protomer. The structure was solved by molecular replacement using PHASER and the Pseudomonas aeruginosa M18 structure (PDB id: 2IJZ) as search model. Density modification was performed using DM and improved phases were used for automated model building with ARP/wARP. The structure was refined using REFMAC and rebuilt with COOT. Residues 1–6 and 204–213 are disordered and not included in the final model. Data collection and refinement statistics are summarized in Table 1.
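The crystal packing implied by these parameters can be estimated with a short calculation. The sketch below uses the cell edge and asymmetric unit content stated above together with the 96 general positions of space group F432. The protomer mass is not given in the text, so it is approximated here from the 468-residue construct with a rule-of-thumb 110 Da per residue; this mass, and the Matthews solvent-content approximation (1 − 1.23/Vm), are assumptions introduced for illustration only.

```python
F432_GENERAL_POSITIONS = 96   # 24 rotations of point group 432 x 4 F-centring translations
AVG_RESIDUE_MASS_DA = 110.0   # rule-of-thumb assumption, not a value from the text

def matthews_coefficient(cell_edge, copies_per_asu, n_residues):
    """Crystal volume per unit of protein mass (A^3/Da) for a cubic F432 cell."""
    cell_volume = cell_edge ** 3
    molecules_per_cell = F432_GENERAL_POSITIONS * copies_per_asu
    approx_mass = n_residues * AVG_RESIDUE_MASS_DA
    return cell_volume / (molecules_per_cell * approx_mass)

def solvent_fraction(vm):
    """Matthews' approximation for the solvent fraction of a protein crystal."""
    return 1.0 - 1.23 / vm

protomers_per_cell = F432_GENERAL_POSITIONS * 1   # one protomer per asymmetric unit
dodecamers_per_cell = protomers_per_cell // 12

vm = matthews_coefficient(224.6, copies_per_asu=1, n_residues=468)
print(f"protomers per unit cell:  {protomers_per_cell}")
print(f"dodecamers per unit cell: {dodecamers_per_cell}")
print(f"approximate Vm: {vm:.2f} A^3/Da, solvent ~{solvent_fraction(vm):.0%}")
```

With one protomer per asymmetric unit, the 96 symmetry-related copies in the cell correspond to eight complete dodecamers, each generated entirely by crystallographic symmetry.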
hDNPEP at ~0.7 μM was applied to EM grids and stained with 2% uranyl acetate. Electron micrographs were recorded (×45,000) using a FEI-Philips CM120 EM. Images were digitized on a Nikon Super Coolscan 9000 (step size of 12.5 μm with a pixel size of 2.78 Å). The WEB and SPIDER software were used for image processing. 4,736 particles were windowed, subjected to reference-free alignment, and sorted into classes using the K-means clustering method. Manual fitting of the hDNPEP crystal structure into the 2D map was achieved using CHIMERA.
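The quoted pixel size follows directly from the scanner step size and the nominal magnification, as the short check below shows. The function name is a hypothetical helper; the calculation assumes that the nominal magnification applies without further calibration.

```python
def pixel_size_angstrom(scan_step_um, magnification):
    """Pixel size at the specimen level, in Angstrom (1 um = 10^4 A)."""
    return scan_step_um * 1e4 / magnification

# Values from the text: 12.5 um scanner step, x45,000 magnification.
pixel = pixel_size_angstrom(12.5, 45000)
print(f"pixel size: {pixel:.2f} A")  # 2.78 A

# Number of pixels spanned by the ~118 A tetrahedron edge in a projection.
edge_pixels = 118.0 / pixel
print(f"tetrahedron edge spans about {edge_pixels:.0f} pixels")
```

At this sampling, the ~118 Å edge of the dodecamer covers a few tens of pixels in each particle image.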
The atomic coordinates and structure factors have been deposited in the Protein Data Bank (http://www.rcsb.org/) with accession number 4DYO.
The authors declare that they have no competing interests.
KLK, UO and WWY designed the experiment. AC, ESP, ADR, FvD, CVB performed the experiment. AC, ADR, WWY analyzed the data. AC and WWY wrote the manuscript. All authors read and approved the final manuscript.
Figure S1. Domain swapping in the hDNPEP dimer. Figure S2. Tetrahedron complexes of available bacterial M18 structures. Figure S3. Architecture of the wide and narrow channels. Figure S4. hDNPEP P1 substrate pocket. Figure S5. Substrate peptide modelling into hDNPEP [34-37].
We thank staff at the Diamond Light Source for help with diffraction data collection. The Structural Genomics Consortium is a registered charity (number 1097737) that receives funds from the Canadian Institutes for Health Research, the Canadian Foundation for Innovation, Genome Canada through the Ontario Genomics Institute, GlaxoSmithKline, Karolinska Institutet, the Knut and Alice Wallenberg Foundation, the Ontario Innovation Trust, the Ontario Ministry for Research and Innovation, Merck & Co., Inc., the Novartis Research Foundation, the Swedish Agency for Innovation Systems, the Swedish Foundation for Strategic Research and the Wellcome Trust.