The expression system presented here was developed as part of an effort to elucidate the functional effect of the p.A16V mutation in human cationic trypsinogen, which has been reported to be associated with chronic pancreatitis by several studies (
1–
4). This variant alters the N-terminal amino acid residue of the mature, secreted trypsinogen protein. The amino-acid numbering starts with Met
1 of the pre-trypsinogen protein and the first 15 residues comprise the secretory signal peptide. It had previously been determined that other pancreatitis-associated mutations in human cationic trypsinogen increase the propensity of trypsinogen for autoactivation (
5,
6). We speculated that the p.A16V mutation might have a similar effect; however, functional characterization of the recombinantly expressed mutant trypsinogen required preparations with uniform, authentic N-termini.
For high-yield heterologous expression of human cationic trypsinogen in
Escherichia coli, an expression plasmid was constructed in which the secretory signal peptide of trypsinogen was deleted and the initiator methionine was placed immediately upstream of the mature protein. Trypsinogen expressed from this construct accumulated in the cytoplasm as inclusion bodies. The native N-terminal sequence of trypsinogen isolated from pancreatic juice is Ala
16-Pro
17-Phe
18. The expected N-terminal sequence of trypsinogen expressed in
E. coli is Met- Ala
16-Pro
17-Phe
18. However, we found that the N-terminal sequence of recombinant trypsinogen in the inclusion bodies was heterogeneous, consisting of ~30% Pro
17-Phe
18 and approximately ~70% Met-Ala
16-Pro
17-Phe
18. Apparently, part of the expressed trypsinogen was processed by methionine aminopeptidase and then by proline aminopeptidase (aminopeptidase P). Removal of the initiator methionine by methionine aminopeptidase is a well documented phenomenon in
E. coli (
7) which occurs if the second amino acid residue has a small, uncharged side chain. With high-level expression of heterologous proteins, the enzyme gets saturated and only a fraction of the proteins is processed. After cleavage of the initiator methionine, proteins can be subject to cleavage by aminopeptidase P, which cleaves the N-terminal amino acid of a protein if proline is in the penultimate position (
8).
To address the problem of unwanted N-terminal processing by aminopeptidases, we developed a novel expression system (
9). First, the gene of interest (in this case human cationic trypsinogen) was cloned in a C-terminal fusion with the
Synechocystis DnaB mini-intein (
10). In this fusion construct (
see .), translation was initiated by the start codon of the intein gene and the intein moiety was subsequently removed through intein self-cleavage (
11). To eliminate cleavage by aminopeptidase P, the fusion construct was expressed in the aminopeptidase P deficient
E. coli LG-3 strain. This strain was engineered by deleting the
pepP gene coding for aminopeptidase P from the
E. coli chromosome (
12,
13), using the recombination-based method described by Datsenko and Wanner (
14). The intein-trypsinogen fusion was expressed in LG-3 cells as inclusion bodies, solubilized with guanidine and re-natured
in vitro followed by affinity purification on immobilized ecotin (
15). Finally, we used MonoS cation-exchange chromatography to remove the small fraction of uncleaved intein fusion proteins and to obtain a pure trypsinogen preparation with uniform, authentic N-termini. The expression system described here can be useful for the heterologous expression of proteins whose N-terminal integrity is compromised by aminopeptidase activity.