Determining the functions of post-translational modifications (PTMs)
1 is a grand challenge in post-genomic biology. Locating modified sites is often an essential first step toward ascertaining the biological roles of PTMs, and liquid chromatography coupled to tandem mass spectrometry (LC-MS/MS) has emerged as a choice technology for this purpose. For example, LC-MS/MS has been successfully utilized in recent proteomic surveys of phosphorylation (
1,
2), acetylation (
3,
4), ubiquitination (
5,
6), and glycosylation (
7–
10) sites. Modifications such as these can profoundly impact the molecular biology of a protein. Unambiguous assignment of PTM locations using existing LC-MS/MS methodology is sometimes difficult to achieve, particularly in cases involving low protein abundance or low site occupancy. Adding to the technical challenge of site mapping, PTMs such as glycosylation are known to reduce ionization efficiency and, therefore, detectability via MS (
11–
13). In highly complex proteomic samples, low intensity ions generated from glycopeptides may be overlooked because of instrument-dependent limitations on the rate at which ions can be selected for fragmentation during data-dependent acquisition (
14). Here, we describe a targeted proteomic approach, in which LC-MS/MS analysis is focused specifically on peptides bearing
N-glycosylation, a common PTM found in all major phylogenetic branches of life. In contrast to data-dependent methods that select a subset of relatively intense ions for fragmentation, our approach is specifically designed to provide intensity-independent fragmentation priority to peptides most likely to bear
N-glycans and boost confidence in correct PTM site assignment.
In eukaryotes,
N-glycosylation is a PTM frequently found on proteins that are translated into the endoplasmic reticulum (ER). Structurally conserved
N-glycan precursors are synthesized via the dolichol pathway and transferred onto nascent proteins; the glycosylated Asn residues are typically found within a subset of the polypeptide N
X(S/T) motifs, where
X denotes a non-proline residue (
15,
16). Biologically,
N-glycans contribute to the folding, trafficking, and thermodynamic stability of proteins exposed to ER, vacuolar, and Golgi lumens, as well as those destined for exposure to the extracellular milieu (
17). The
N-glycan precursors are enzymatically edited to yield a plethora of mature glycoforms with compositions largely dependent on cell type and protein localization. Sometimes, specific
N-glycan structures are required for protein function, whereas in other cases,
N-glycans primarily contribute to protein stability and solubility (
18). As a class, the
N-glycosylation modification is biologically essential; chemical or genetic disruption of
N-glycan biosynthesis is lethal, and aberrant
N-glycosylation is associated with several human disease states (
19,
20).
Many existing LC-MS/MS approaches for mapping
N-glycosites depend on enzymatic removal of the entire
N-glycan following stringent sample enrichment to remove nonglycosylated peptides prior to analysis (
9,
10). These methods rely on detection of a 0.98-Da mass increase resulting from the enzymatic deamination of glycosylated Asn residues by peptide:
N-glycosidase F (PNGase F). Enzymatic deamination is often performed in the presence of
18O-labeled water, imparting a 2.98-Da mass shift to the peptide to increase confidence in site assignment (
7). Unfortunately, complete removal of
N-glycans with PNGase F can lead to instances of incorrectly mapped glycosites. During the course of PNGase F treatment, spontaneous deamination of nonglycosylated Asn residues and other instances of PNGase F-independent incorporation of
18O can potentially yield false positives (
21–
23). This drawback has led to the development of alternative strategies utilizing partial rather than total removal of
N-glycans; for instance, treating samples with the enzyme endoglycosidase H preserves a single core GlcNAc residue, leaving direct evidence for
N-glycosylation intact (
23,
24). Unfortunately, the presence of even a single sugar residue on peptides is known to considerably suppress ionization efficiency (
8,
13), potentially biasing data-dependent LC-MS/MS data acquisition against glycopeptide ions. Despite this limitation, detection of the retained glycan by LC-MS/MS provides unequivocal evidence for glycosite assignment instead of indirect evidence that the
N-glycan modification once existed at a given site. Therefore, LC-MS/MS methods that could select low abundance ions for fragmentation could greatly benefit glycosite mapping.
Recently, we described a strategy for addressing the challenge of identifying low abundance species in complex mixtures based on a directed LC-MS/MS approach (
25). Termed isotopic signature transfer and mass pattern prediction (IsoStamp), the technique exploits the perturbing effects of a dibrominated chemical tag on the isotopic envelope of a peptide. Once covalently modified with the tag, dibrominated peptides can be readily detected with high sensitivity and fidelity in high resolution LC-MS data using a novel computational pattern-searching algorithm. Pattern identification allows targeted proteomic analyses of labeled peptides in complex biological samples; inclusion lists containing
m/
z values and retention times of ions bearing recoded envelopes are used to trigger fragmentation. Thus, isotopic pattern rather than ion abundance is used to drive this directed proteomic approach. The IsoStamp method was an extension of the concept of isotopic distribution encoding tagging reported by Goodlett
et al. (
26).
Although the isotopic signature of a halogenated tag can effectively highlight labeled peptides within a complex LC-MS data set, its utility is restricted to those situations in which the desired subset of peptides can be chemospecifically modified. Here, we demonstrate that it is possible to impart a similar perturbation to the isotopic envelope of a peptide without the requirement for chemical tagging. Instead, our approach metabolically embeds a dibromide-like isotopic signature directly into glycans. In this study, we mimic the dibromide isotopic signature with a stoichiometrically defined mixture of GlcNAc isotopologs, referred to as a GlcNAc isomix. The isomix is metabolically installed into structurally conserved N-glycan core positions, marking them with a uniquely identifiable isotopic signature. We employed the technique to map occupied N-glycosites on proteins from whole Saccharomyces cerevisiae lysates. Via preferential fragmentation of isotopically recoded glycopeptides, we identified numerous N-glycosites within the yeast proteome, nearly doubling the number that was previously known. The isomix method offers an enhanced level of confidence for mapping glycosylation sites that was not previously available to LC-MS/MS analyses because of the unique isotopic envelope of an isomix-containing peptide. Here, we showcase the utility of isotopic recoding by surveying metabolically labeled N-glycans in yeast, but we believe the technology is extendable to other PTMs and organisms through a variety of labeling strategies.