We sought to develop a method that circumvents the limited specificity and loss of material associated with organelle purification in traditional MS proteomics. Our approach was to tag the proteome of interest with a chemical handle such as biotin while the cell was still alive, with all membranes, complexes, and spatial relationships preserved. We thus required a genetically targetable labeling enzyme that covalently tags its neighbors, but not more distant proteins, in living cells. One candidate is promiscuous biotin ligase (1
), but its labeling kinetics are extremely slow (requiring 24 hours (1
); Fig. S1
), and the proposed mechanism proceeds through a biotin-adenylate ester, which has a half-life of minutes, implying a large labeling radius. Horseradish peroxidase (HRP)-catalyzed nitrene generation is another possibility (4
), but we were unable to detect this labeling (Fig. S2
), and HRP is inactive when expressed in the mammalian cytosol (5
We recently introduced engineered ascorbate peroxidase (APEX) as a genetic tag for electron microscopy (EM) (5
). Unlike HRP, APEX is active within all cellular compartments. In addition to catalyzing the H2
-dependent polymerization of diaminobenzidine for EM contrast, APEX also oxidizes numerous phenol derivatives to phenoxyl radicals. Such radicals are short lived (<1 msec (6
)) have a small labeling radius (<20 nm (8
)) and can covalently react with electron-rich amino acids such as Tyr, Trp, His and Cys (10
). This chemistry forms the basis of tyramide signal amplification (14
) but it has not been extended to living cells.
To examine whether APEX could be employed for proteomic labeling (), we targeted APEX to the mitochondrial matrix of human embryonic kidney (HEK) cells, and initiated labeling by adding biotin-phenol and 1 mM H2
to the cell medium. Labeling was terminated after 1 minute by cell fixation or lysis. Imaging by confocal () or super-resolution STORM (15
) () microscopy showed that biotinylated proteins overlapped tightly with the mito-APEX construct. Streptavidin blot analysis of cell lysate showed that numerous endogenous proteins were biotinylated in an APEX- and H2
-dependent manner ( and S3
Labeling the mitochondrial matrix proteome in living HEK cells
Other constructs targeting APEX to different cellular regions were also analyzed to test the generality of the approach (Figs. S4–S5
). Seven different cytosol-facing APEX fusions each gave distinct “fingerprints” in a streptavidin blot analysis, suggesting that targeted APEX biotinylates only a subset of cytosolic proteins, likely those in its immediate vicinity. Additional experiments were performed to characterize the small-molecule specificity of APEX (Fig. S2
), the membrane permeability of the phenoxyl radical (Fig. S6
), and covalent adducts formed with amino acids in vitro
We used mitochondrial matrix-targeted APEX to perform a proteomic experiment. Though mitochondria have been extensively characterized by MS proteomics, all previous studies have used mitochondrial purification, which is associated with sample loss and contamination. This is why the most comprehensive inventory of mitochondrial proteins (16
) integrates MS proteomic data with GFP imaging and computational analysis. Furthermore, proteome-scale maps of the matrix subcompartment in mammalian cells contain only a small number of proteins (17
), representing very low coverage, likely because of the challenge of enriching for this subcompartment.
Endogenous proteins biotinylated by mito-APEX for 1 minute in live HEK cells as in were purified using streptavidin beads, digested to peptides, and identified by tandem MS. We used stable isotope labeling (SILAC (18
)) of experimental and control samples to distinguish between biotinylated proteins and non-specific binders (Fig. S8
). Two independent replicates were performed and each produced a bimodal distribution of proteins based on isotope ratio (Fig. S8C
). The high-ratio distributions were strongly enriched for mitochondrial proteins, so we separated these hits and intersected the results from both replicates to obtain a list of 495 proteins (Table S1
), which we call our “matrix proteome.” This list is expected to contain both soluble matrix proteins and inner mitochondrial membrane proteins that are exposed to the matrix space.
Crossing our matrix proteome with prior literature revealed that it was highly enriched for both mitochondrial and mitochondrial matrix proteins (). 464 proteins (94%) had prior mitochondrial annotation, leaving 31 “mitochondrial orphans” without any previously known connection to mitochondria (Table S2
). To further quantify the specificity of our matrix proteome, we examined the components of the electron transport chain () and the TOM/TIM/PAM protein import pathway (), because they are structurally and/or topologically well-characterized. Only subunits with exposure to the matrix space were detected in our matrix proteome, illustrating the specificity and membrane-impermeability of our tagging.
Specificity and depth of coverage of the mitochondrial matrix proteome
To analyze depth of coverage, we checked our matrix proteome for well-established groups of soluble matrix proteins (). 80–90% of the members of each group were detected. Nearly identical subsets of proteins were detected in each of the two replicates, suggesting that coverage was high, but only for ~85% of proteins. The proteins we consistently did not detect were not low-abundance proteins (Fig. S8F
), and they did not lack surface-exposed tyrosines. We hypothesize that these proteins were sterically buried in macromolecular complexes, making them inaccessible to the phenoxyl radical.
For a subset of proteins in our proteome, we detected directly biotinylated peptides (Fig. S9
and Table S4
). Tandem MS sequencing showed that the biotin-phenol was conjugated to tyrosine sidechains. In nearly all cases, the biotinylated tyrosine mapped to a surface-exposed site on a soluble protein, or a matrix-exposed site on a transmembrane protein.
Our matrix proteome of 495 proteins provides a number of interesting insights. First, the 31 mitochondrial orphans may be newly discovered mitochondrial proteins. We selected and imaged six of these at random and found complete or partial mitochondrial localization for all of them (Fig. S10
). Second, 240 proteins with unknown sub-mitochondrial localization can now be assigned by our data to the matrix compartment (Table S3
). Third, we detected six proteins previously assigned to the IMS or outer mitochondrial membrane (PPOX, CPOX, PNPT1, CHCHD3, COASY, and SAMM50). To determine if our detection of these proteins in the matrix was accurate, we performed EM imaging, taking advantage of APEX's additional functionality as an EM tag (5
). APEX fusions to five of the six proteins showed matrix staining by EM ( and S11
). We were unable to examine one protein, SAMM50, because APEX insertion at four different sites abolished mitochondrial targeting (data not shown).
Sub-mitochondrial localization of the heme biosynthesis enzymes CPOX and PPOX
PPOX and CPOX are particularly interesting in this group because they catalyze two of the later steps in heme biosynthesis (). Previous studies on purified mitochondria or mitoplasts treated with proteases or membrane-impermeant inhibitors have localized both enzymes to the IMS (19
). Structural analysis and modeling have suggested that PPOX docks to FECH (ferrochelatase), the final iron-inserting enzyme of heme biosynthesis, through the inner mitochondrial membrane (IMM) (23
) (). This model is inconsistent with our EM data because we found that both the C-terminus and amino acid 205 of PPOX localize to the matrix (). Our EM data on CPOX, on the other hand, are consistent with previous literature, because we found that residue 70 localizes to the matrix (explaining CPOX's detection in our matrix proteome), while the C-terminus and residue 120 flanking the active site localized to the IMS (). Our reassignment of PPOX from the IMS to the matrix has implications for the nature of its interactions with CPOX and FECH and the mechanism by which its heme precursor substrate is transported across the IMM.
In summary, we have developed a method for mapping the proteomic composition of cellular organelles, using a genetically-targetable peroxidase that catalyzes the generation of short-lived, highly-reactive, and membrane-impermeant radicals in live cells. With a temporal resolution of 1 minute, labeled proteins are harvested and identified by MS using well-established techniques. In addition to its simplicity, the method has no noticeable toxicity, requires far less material than conventional organellar proteomics, and takes hours to implement rather than days (as for subcellular fractionation). Our initial demonstration on the human mitochondrial matrix proteome shows that specificity is exceptionally high, because labeling is performed in living cells while membranes and other structures are still intact. A key feature of the method is that it provides insight into the topology of identified proteins. Depth of coverage is also high for the majority of proteins – likely those that are sterically accessible to the phenoxyl radical. Finally, the same peroxidase, APEX, can be used for both proteomic mapping and EM visualization (5