|Home | About | Journals | Submit | Contact Us | Français|
We are interested in asparagine-linked glycans (N-glycans) of Plasmodium falciparum and Toxoplasma gondii, because their N-glycan structures have been controversial and because we hypothesize that there might be selection against N-glycans in nucleus-encoded proteins that must pass through the endoplasmic reticulum (ER) prior to threading into the apicoplast. In support of our hypothesis, we observed the following. First, in protists with apicoplasts, there is extensive secondary loss of Alg enzymes that make lipid-linked precursors to N-glycans. Theileria makes no N-glycans, and Plasmodium makes a severely truncated N-glycan precursor composed of one or two GlcNAc residues. Second, secreted proteins of Toxoplasma, which uses its own 10-sugar precursor (Glc3Man5GlcNAc2) and the host 14-sugar precursor (Glc3Man9GlcNAc2) to make N-glycans, have very few sites for N glycosylation, and there is additional selection against N-glycan sites in its apicoplast-targeted proteins. Third, while the GlcNAc-binding Griffonia simplicifolia lectin II labels ER, rhoptries, and surface of plasmodia, there is no apicoplast labeling. Similarly, the antiretroviral lectin cyanovirin-N, which binds to N-glycans of Toxoplasma, labels ER and rhoptries, but there is no apicoplast labeling. We conclude that possible selection against N-glycans in protists with apicoplasts occurs by eliminating N-glycans (Theileria), reducing their length (Plasmodium), or reducing the number of N-glycan sites (Toxoplasma). In addition, occupation of N-glycan sites is markedly reduced in apicoplast proteins versus some secretory proteins in both Plasmodium and Toxoplasma.
Animals, fungi, and plants synthesize Asn-linked glycans (N-glycans) by means of a lipid-linked precursor containing 14 sugars (dolichol-PP-Glc3Man9GlcNAc2) (26). Recently we used bioinformatics and experimental methods to show that numerous protists are missing sets of glycosyltransferases (Alg1 to Alg14) and so make truncated N-glycan precursors containing 0 to 11 sugars (46). For example, Entamoeba histolytica, which causes dysentery, makes N-glycan precursors that contain seven sugars (Man5GlcNAc2) (33). Giardia lamblia, a cause of diarrhea, makes N-glycan precursors that contain just GlcNAc2 (41). N-glycan precursors may be identified by metabolic labeling with radiolabeled mannose (Entamoeba) or glucosamine (Giardia) (46). Unprocessed N-glycans of each protist may be recognized by wheat germ agglutinin 1 (WGA-1) (GlcNAc2 of Giardia) or by the antiretroviral lectin cyanovirin-N (Man5GlcNAc2 of Entamoeba) (2, 33, 41).
N-glycans are transferred from lipid-linked precursors to sequons (Asn-Xaa-Ser or Asn-Xaa-Thr, where Xaa cannot be Pro) on nascent peptides by an oligosaccharyltransferase (OST) (28). For the most part, transfer of N-glycans by the OST is during translocation, although there are human and Trypanosoma OSTs that transfer N-glycans after translocation (34, 45).
N-glycan-dependent quality control (QC) systems for protein folding and endoplasmic reticulum (ER)-associated degradation (ERAD), which are present in most eukaryotes, are missing from Giardia and a few other protists that make truncated N-glycans (5, 26, 53). There is positive Darwinian selection for sequons (sites of N-glycans) that contain Thr in secreted and membrane proteins of organisms that have N-glycan-dependent QC (12). This selection occurs for the most part by an increased probability that Asn and Thr will be present in sequons rather than elsewhere in secreted and membrane proteins. In contrast, there is no selection on sequons that contain Ser, and there is no selection on sequons in the secreted proteins of organisms that lack N-glycan-dependent QC.
For numerous reasons, we are interested in the N-glycans of Plasmodium falciparum and Toxoplasma gondii, which cause severe malaria and disseminated infections, respectively.
(i) There has been controversy for a long time as to whether Plasmodium makes N-glycans. While some investigators identified a 14-sugar Plasmodium N-glycan resembling that of the human host (29), others identified no N-glycans (6, 22).
(ii) There is also controversy concerning whether the N-glycans of Toxoplasma, after removal of Glc by glucosidases in the ER lumen, contain either 7 sugars (Man5GlcNAc2), like Entamoeba (32, 33), or 11 sugars (Man9GlcNAc2), like the human host (16, 19, 26). If it is Man5GlcNAc2, then Toxoplasma uses the dolichol-PP-linked glycan predicted by its set of Alg enzymes (32, 46). If it is Man9GlcNAc2, then Toxoplasma uses the dolichol-PP-linked glycan of the host cell (16, 19, 26).
(iii) Both Plasmodium and Toxoplasma are missing proteins involved in N-glycan-dependent QC of protein folding (5).
(iv) We hypothesize that there may be negative selection against N-glycans in Plasmodium and Toxoplasma, because the N-glycans added in the ER lumen during translocation will likely interfere with threading of nucleus-encoded apicoplast proteins into a nonphotosynthetic, chloroplast-derived organelle called the apicoplast (21, 35, 37, 48, 52, 54). Nucleus-encoded apicoplast proteins have a bipartite signal at the N terminus, which targets proteins first to the lumen of the ER and second to lumen of the apicoplast. This bipartite signal has been used in transformed plasmodia where green fluorescent protein (GFP) is targeted to the apicoplast with the bipartite signal of the acyl carrier protein (ACPleader-GFP), to the secretory system with the signal sequence only (ACPsignal-GFP), and to the cytosol with the organelle-targeting transit peptide only (ACPtransit-GFP) (55). Similar constructs have been used to characterize signals that target nucleus-encoded proteins of Toxoplasma to the apicoplast (11, 25).
Here we use a combination of bioinformatic, biochemical, and morphological methods to characterize the N-glycans of Plasmodium and Toxoplasma and to test our hypothesis that there is negative selection against N-glycans in protists with apicoplasts.
Predicted proteins of Plasmodium, Toxoplasma, Cryptosporidium, Eimeria, Babesia, Neospora, and Theileria, which are present in the NR database at NCBI and/or databases at GeneDB, PlasmoDB, ToxoDB, and CryptoDB, were searched using BLASTP and Saccharomyces Alg enzymes that make precursors to Asn-linked glycans and OST peptides that transfer the N-glycan to the nascent peptide (1, 3, 4, 7, 9, 15, 17, 18, 26, 28, 38, 46, 49). Because protein prediction is difficult for Toxoplasma and Eimeria, which contain many introns in their genes, TBASTN was also used to search the contigs and/or expressed sequence tags (ESTs) of these organisms (17, 49). Figure 1 was drawn based upon the predicted sets of N-glycans made by these apicomplexan parasites, rather than those that are experimentally determined (which might include host N-glycan precursors in the case of Toxoplasma) (16, 19).
Sequons (potential sites of N-linked glycosylation) were identified in secreted and membrane proteins, which contain an N-terminal signal peptide or transmembrane helices, using previously described methods (12, 27). About 500 predicted nucleus-encoded apicoplast proteins of Plasmodium were downloaded from the PlasmoDB website (4, 11), while 18 experimentally confirmed apicoplast proteins of Toxoplasma were obtained from Omar Harb (11, 25). Selection on sequons was determined by plotting the actual sequon density per 500 amino acids of secreted and membrane proteins on the y axis versus the expected sequon density of these same proteins based upon the concentrations of Asn, Ser, Thr, and Pro (see Fig. 2) (12).
The 3D7 (original genome project) strain of Plasmodium falciparum was grown in human red blood cells (RBCs), for the most part without synchronization (36). When necessary (for example, for metabolic labeling), plasmodia were synchronized with sorbitol. Transformed plasmodia with GFP targeted to the apicoplast (ACPleader-GFP), the parasitophorous vacuole (ACPsignal-GFP), or the cytosol (ACPtransit-GFP) were obtained from MR4 (55). Antibodies to Plasmodium proteins which are present in the ER (Pf39), food vacuole (DPAP1), and plasma membrane (MSP1-19) were obtained from MR4 (30, 51). An antibody to RhopH3 was a generous gift from J. F. Dubremetz (14).
The RH strain of Toxoplasma gondii was grown in human foreskin fibroblasts (44). A vector for Toxoplasma with yellow fluorescent protein (YFP) targeted to the apicoplast (ACP-YFP) was a generous gift from Boris Striepen, University of Georgia (35). Vectors for Toxoplasma with YFP targeted to the ER (P30-YFP-HDEL) and to the Golgi apparatus (GRASP55-YFP) were generous gifts from Graham Warren, Yale University (24, 39). Vectors were transiently transfected into Toxoplasma by electroporations. Antibodies to Toxoplasma proteins which are present in the rhoptry (ROP1; monoclonal antibody [MAb] Tg49) (47) and plasma membrane (SAG1; MAb DG52) (8) were generous gifts from Peter Bradley (UCLA) and Jeroen Saeij (MIT), respectively.
Plasmodia were labeled in vitro using methods described previously for labeling of glycosylphosphatidylinositol (GPI) anchors (22). RBCs were infected with Plasmodium, synchronized with sorbitol, and grown to a density of 12 to 14%. Depending upon when Plasmodium-infected RBCs were harvested, trophozoites, which are mid-stage, or schizonts, which are later stage, predominated. RBCs containing trophozoites and schizonts were radiolabeled with 500 μCi/ml [46-3H]Man or [28-3H]GlcN in a Glc-free medium for 15 min in a final volume of 250 μl. Infected RBCs were isolated on Percoll gradients, and dolichol-PP-glycans were extracted and separated by thin-layer chromatography (TLC). Controls included Giardia, which makes dolichol-PP-GlcNAc2; Saccharomyces, which makes dolichol-PP-Glc3Man9GlcNAc2; and uninfected RBCs, which cannot make N-glycans in the absence of the ER and Golgi. Alternatively, plasmodia were released from RBCs by saponin lysis, proteins were delipidated by extractions with chloroform-methanol-water, and N-glycans were released by treatment with 500 units of peptide:N-glycanase F (PNGaseF) (New England Biolabs) for 16 h at 37°C (46).
Size exclusion chromatography of radiolabeled glycans, after release from dolichol with mild acid or release from proteins with PNGaseF, was performed on a 1.5- by 100-cm Bio-Gel P-4 column with 0.1 M acetic acid–1% (vol/vol) n-butanol as the mobile phase. Internal standards for Plasmodium glycans were GlcNAc and GlcNAc2 (commercially available). Other standards from Saccharomyces Δalg3, Δalg5, and Δalg3 Δalg5 mutants included Glc3Man5GlcNAc2, Man9GlcNAc2, and Man5GlcNAc2, respectively. Radioactivity was measured by scintillation counting.
Late-stage plasmodia were released from RBCs with saponin, and washed plasmodia were treated with 0.1% Triton X-100 in the presence of EDTA-free Complete protease inhibitor cocktail (Roche). Insoluble material was removed by centrifugation (>12,000 × g), and soluble proteins were applied to a WGA-1-Sepharose column (EY Laboratories, Inc.) (56). Because there may be nonspecific binding to this column, Plasmodium glycoproteins were eluted with 50 mM diacetylchitotriose rather than with SDS. WGA-1-purified Plasmodium proteins were run on SDS-PAGE with a 4 to 20% gradient of acrylamide (Bio-Rad). In parallel lanes were Plasmodium proteins solubilized with 0.1% Triton X-100 but not applied to the WGA-1 column. Alternatively, Plasmodium proteins were denatured and incubated with 1,000 units PNGaseF for 4 h at 37°C in NEB G7 phosphate buffer prior to loading for SDS-PAGE. After SDS-PAGE, proteins either were fixed and stained with silver or Coomassie blue or were transferred to nitrocellulose membranes by electroporation, incubated with horseradish peroxidase (HRP)-conjugated GSL-II, and developed with ECL chemiluminescent substrate (Pierce).
Alternatively, WGA-1-bound Plasmodium glycoproteins were run 0.5 cm into SDS-PAGE, excised, digested with trypsin, and identified by liquid chromatography-tandem MS (LC-MS-MS), using methods that we have previously used to identify the cyst proteins of Giardia (41). Briefly, reversed-phase chromatography was carried out using a nano-high-pressure liquid chromatography (nano-HPLC) pump and autosampler (Surveyor and MicroAS; Thermo Finnigan, San Jose, CA) on a 10-cm by 100-μm (inner diameter) Magic C18 reversed-phase capillary column (Michrom, Auburn, CA) at the Boston University Proteomics Core Facility or at the MIT Center for Cancer Research Biopolymers Laboratory. Peptides were separated using gradients of 5% to 90% acetonitrile over 30 to 120 min in the presence of 0.5% acetic acid (57). Peptides were analyzed using an LTQ ProteomeX ion trap mass spectrometer (Thermo Finnigan, San Jose, CA), and mass spectra were compared to tryptic digests of predicted Plasmodium proteins using SEQUEST and theGPM software (www.thegpm.org). All searches were conducted on a reverse database to ensure that the false-positive rate for protein identification was kept below 2%. Tryptic peptides with a SEQUEST XCorr score of >1.5, 2.5, or 3.5 for Z = 1, 2, or 3, respectively, and a peptide score of <0.05 were considered a match. GPM-based searches were scored based on having a loge value greater than a cutoff threshold of 2% false positives, typically loge < −10. Proteins with one or more high-scoring fully tryptic peptides were considered present.
RBCs infected with plasmodia were washed once in phosphate-buffered saline (PBS), fixed for 30 min at room temperature (RT) in 4% paraformaldehyde and 0.0075% glutaraldehyde in PBS, and washed once in PBS. Except in those circumstances where we wished to label only the surface of infected RBCs, Triton X-100 was added to a final concentration of 0.1% to permeabilize the organisms for 10 min at RT. Cells were washed with PBS, and unreacted aldehydes were quenched with 0.1-mg/ml sodium borohydride in PBS for 10 min at RT. After washing with PBS, we incubated cells with 3% bovine serum albumin (BSA) in PBS for 30 min prior to staining with lectins or antibodies. To label the plasma membranes of plasmodia, RBCs which were infected with late-stage plasmodia were lysed with 0.15% saponin prior to fixing and labeling.
Ulex europaeus agglutinin 1 (UEA-1) was labeled with Alexafluor 488 (Molecular Probes), which uses the same filter sets as fluorescein but quenches less easily. Griffonia simplicifolia lectin (GSL-II) was labeled with Alexafluor 594, which uses the same filter sets as rhodamine. Plasmodium-infected RBCs were colabeled with 2 μg/ml each of GSL-II and UEA-1 in PBS plus 3% BSA for 60 min at RT and then washed twice in PBS. Alternatively, transformed plasmodia expressing GFP targeted to the cytosol, apicoplast, or parasitophorous vacuole were incubated with GSL-II. As a control for accessibility to exogenous probes, transformed plasmodia expressing GFP were incubated with anti-GFP antibodies labeled with Alexafluor 594 instead of GSL-II. In other experiments, nontransformed plasmodia were incubated with GSL-II and Alexafluor 488-labeled antibodies to the rhoptries, ER, food vacuole, or plasma membrane.
Toxoplasma tachyzoites released from human foreskin fibroblasts were fixed, permeabilized, and washed, as described for Plasmodium. Toxoplasma tachyzoites were incubated with Alexafluor-labeled cyanovirin-N, scytovirin, UEA-1, or Maclura pomifera agglutinin (MPA-1) (2). Alternatively, transformed Toxoplasma tachyzoites expressing GFP or YFP targeted to the apicoplast, ER, or Golgi were incubated with cyanovirin-N. As a control for accessibility to exogenous probes, transformed Toxoplasma tachyzoites expressing GFP were incubated with anti-GFP antibodies instead of cyanovirin-N. In other experiments, nontransformed Toxoplasma tachyzoites were labeled with cyanovirin-N and Alexafluor-labeled antibodies to rhoptries or ER.
The nuclei of labeled Plasmodium and Toxoplasma cells were labeled with 2 μg/ml DAPI (4′,6′-diamidino-2-phenylindole), and SlowFade antifade solution (Invitrogen) was added. Slides were examined by three-dimensional multiple-wavelength fluorescence microscopy using an Olympus IX70 microscope equipped for Deltavision deconvolution (Applied Precision). This system employs restorative as well as deconvolution techniques. Images were collected at 0.2-mm optical sections for the indicated wavelengths and were subsequently deconvolved using SoftWoRx (Applied Precision). Data were examined either as optical sections or as a projection of the entire stack.
High-mannose oligosaccharides, including biosynthetic Man5, were synthesized via a linear strategy as described previously (42). To facilitate carbohydrate microarray fabrications, a thiol-terminated ethylene glycol linker was synthetically incorporated on the reducing end of the mannosides in place of GlcNAc2 (2). Glycans were robotically printed onto maleimide-functionalized amine-coated slides using established protocols (2). The carbohydrate arrays were probed with Alexafluor-labeled cyanovirin, scytovirin, WGA-1, and concanavalin A. Alexafluor-labeled cyanovirin-N and scytovirin were used to label the surface of wild-type Saccharomyces, which makes N-glycans based upon a precursor composed of Man9GlcNAc2. These same lectins were applied to the Saccharomyces Δalg3 mutant, which makes N-glycans based upon a precursor composed of Man5GlcNAc2 (26). Yeast cells were examined with the deconvolving microscope and were analyzed with a BD FACSCalibur flow cytometer. In parallel, Toxoplasma cells labeled with cyanovirin-N and scytovirin were analyzed with the flow cytometer. The relative amounts of Man5GlcNAc2 and Man9GlcNAc2 in Toxoplasma N-glycans were estimated using the assumption that cyanovirin-N binds to both N-glycans, while scytovirin binds only to Man9GlcNAc2.
Because we studied two different organisms (Plasmodium and Toxoplasma) in parallel using multiple methods, a “road map” to Results is presented here. Bioinformatic predictions concerning the length of N-glycans made by each organism (see Fig. 1) and the density of sites of N-linked glycans (sequons) on secreted and membrane proteins (see Fig. 2) are presented first. Experimental demonstrations of the actual N-glycans made by Plasmodium (see Fig. 3 and and4)4) and Toxoplasma (see Fig. 5) are presented next. Finally, colocalizations of the N-glycans of Plasmodium and Toxoplasma with the apicoplast and secretory compartments of each organism (see Fig. 6 and and7,7, respectively) are presented.
When we previously examined Alg enzymes of diverse eukaryotes, the enzymes that add the second GlcNAc to the N-glycan precursor (Alg13 and Alg14) were not identified, and whole-genome sequences of Theileria, Babesia, Eimeria, and Neospora were not available (7, 10, 17, 26, 38, 46, 49). Here we found that secondary loss among Alg genes of apicomplexan parasites is even more extensive than previously demonstrated (Fig. 1) (46). While Alg enzymes of metazoans, plants, and most fungi predict an N-glycan precursor with 14 sugars (Glc3Man9GlcNAc2), Alg enzymes of Toxoplasma, Neospora, and Tetrahymena predict an N-glycan precursor composed of 10 sugars (Glc3Man5GlcNAc2). The predicted N-glycan precursors of Cryptosporidium and Eimeria consist of eight sugars (Glc1Man5GlcNAc2) and seven sugars (Man5GlcNAc2), respectively (1). Plasmodium spp. (including P. falciparum and P. vivax) and Babesia spp. have genes encoding Alg7, Alg13, and Alg14, as well as four OST peptides, and so they are predicted to make an N-glycan precursor with two sugars (GlcNAc2), the same as that of Giardia that we have recently extensively studied (9, 18, 41, 46). Theileria is missing all of the Alg enzymes and OST peptides and so makes no N-glycans (46). The predicted N-glycans of Plasmodium and Toxoplasma were experimentally tested as described below.
Relative to Plasmodium, Toxoplasma has greatly reduced densities of sites of N glycosylation (sequons) in its secreted and nucleus-encoded apicoplast proteins (Fig. 2) (11, 12, 25). Because of the reduced sequon density in Toxoplasma, nearly half of the apicoplast proteins lack sequons and so cannot contain N-glycans (see Fig. S1 in the supplemental material). In contrast, <10% of Plasmodium secreted and nucleus-encoded apicoplast proteins lack sequons, while ~20% of human secreted proteins lack sequons (12).
Sequon density is reduced in Toxoplasma secreted and apicoplast proteins by two different mechanisms. First, because Asn is encoded by AAT or AAC, codon usage bias has a strong effect on the density of N-glycan sites in secreted proteins (12). Toxoplasma, which is 41% AT in its coding sequences, has the lowest density of N-glycan sites in its secreted proteins of 33 eukaryotes examined (12). Plasmodium, which is 76% AT in its coding sequences, has the third-highest density of N-glycan sites in its secreted proteins (Fig. 2). Humans, which are 48% AT in their coding sequences, have an intermediate density of sequons in their secreted proteins (12).
Second, there is a decreased probability that Asn, Ser, and Thr will be present in sequons rather than elsewhere in apicoplast proteins of Toxoplasma. This type of negative selection is shown by plotting the actual density of sequons (y axis) versus the calculated density of sequons based upon the composition of Asn, Ser, Thr, and Pro in these proteins (x axis) (Fig. 2) (12). There is no selection on sequons in secreted proteins of Toxoplasma or secreted and apicoplast proteins of Plasmodium, so all of these points fall on the line with the slope of 1 where actual sequon density equals calculated sequon density (Fig. 2). Points below the line with slope of 1 (sequons of apicoplast proteins of Toxoplasma) reflect negative selection against sequons where actual sequon density is less than the calculated sequon density. In contrast, there is positive selection for sequons with Thr in human secreted proteins (Fig. 2), so this point falls above the line with the slope of 1 (the actual sequon density is twice the calculated sequon density) (12).
Because N-glycan synthesis by Plasmodium has been so controversial (6, 22, 29), we used multiple biochemical and morphological methods to determine whether asexual stages of Plasmodium falciparum grown in red blood cells (RBCs) make the predicted N-glycan (GlcNAc2). Evidence for the very short N-glycans of Plasmodium included the following.
(i) Plasmodia metabolically labeled with tritiated GlcN make lipid-linked N-glycan precursors and N-glycans composed of GlcNAc or GlcNAc2, depending upon the stage (Fig. 3A to C). In contrast, there is no labeling of N-glycan precursors or N-glycans with tritiated Man, which labels Plasmodium GPI anchors (data not shown) (6, 22). Using the same methods, Giardia N-glycans were found to be predominantly GlcNAc2 (46).
(ii) The GlcNAc-binding Griffonia simplicifolia lectin (GSL-II) binds to numerous Plasmodium glycoproteins, which have been separated on SDS-PAGE and blotted to polyvinylidene difluoride (PVDF) membranes (Fig. 3D and E) (59). Pretreatment of Plasmodium glycoproteins with peptide:N-glycanaseF (PNGaseF) substantially decreases the binding of the GSL-II.
(iv) Mass spectroscopy of Plasmodium proteins shows that WGA-1-affinity chromatography enriches secreted proteins, including those from rhoptries and the ER (Table 1) (57). Because the peptide coverage was limited, we were not able to identify glycopeptides containing N-linked GlcNAc.
(v) The GlcNAc-binding lectin GSL-II labels well N-glycans of Plasmodium, while the fucose-binding lectin Ulex europaeus agglutinin 1 (UEA-1) binds to RBCs (Fig. 4, rows A and B). GSL-II labeling of plasmodia is inhibited by excess diacetyl-chitobiose (GlcNAc2) (Fig. 4, row C), and there is no overlap between GSL-II binding and GFP targeted to the cytosol (see Fig. S2, row A, in the supplemental material). While GSL-II labeling of the surface of RBCs infected with plasmodia is very low or absent (data not shown), GSL-II labels the surface of trophozoites and schizonts, which have been released from RBCs with saponin and costained with antibodies to MSP-1 (see Fig. S2, row B). GSL-II, which also binds to N-glycans of Giardia (see Fig. S3A), was used because WGA-1 (used to affinity purify glycoproteins of Plasmodium and Giardia) (41) binds to sialic acid residues on RBCs (see Fig. S3B).
Using biochemical methods, Garénaux et al. (19) recently concluded that Toxoplasma uses its own N-glycan precursor (Glc3Man5GlcNAc2) as well as that of the host (Glc3Man9GlcNAc2) to make N-glycans. In contrast, other investigators identified N-glycans based upon a Glc3Man5GlcNAc2 precursor (32) or a Glc3Man9GlcNAc2 precursor (16). The goal here was to use antiretroviral lectins which bind to Man9GlcNAc2 only (scytovirin) or bind to both Man5GlcNAc2 and Man9GlcNAc2 (cyanovirin-N) (2) to localize N-glycans of Toxoplasma and so rule out the possibility that Toxoplasma N-glycans based upon a Glc3Man9GlcNAc2 precursor were the result of host glycoprotein contamination.
The specificity of the antiretroviral lectins (scytovirin and cyanovirin-N) for particular N-glycans was demonstrated in three ways. First, we prepared a carbohydrate microarray containing a series of synthetic high-mannose oligosaccharides, which were prepared with a chemical linker in place of the GlcNAc2 on the reducing end (2, 42). With the array, we confirmed that cyanovirin-N binds to Man5 and Man9, as well as to the D1 arm (present in Man5) and the D3 arm (absent in Man5) (Fig. 5A to C). Scytovirin, which binds to Man9 and the D3 arm, does not bind to either biosynthetic Man5 or the D1 arm (Fig. 5D). In contrast, the plant lectin concanavalin A, which has previously been used to visualize Toxoplasma glycoproteins with N-glycans (16, 32), binds to any mannose-containing glycan (Fig. 5E). Second, we used fluorescence microscopy to show that cyanovirin-N and scytovirin both bind to wild-type Saccharomyces, which builds its N-glycans on a precursor that contains nine Man residues (Fig. 5G) (26). In contrast, cyanovirin-N but not scytovirin binds to a Saccharomyces Δalg3 mutant, which builds its N-glycans on a precursor that contains five Man residues (Fig. 5H) (26). Third, we used flow cytometry to measure the relative binding of cyanovirin-N and scytovirin to wild-type Saccharomyces and to Saccharomyces Δalg3 mutants (Fig. 5J and K). As expected, we found that cyanovirin-N binds to both wild-type and Δalg3 Saccharomyces, while scytovirin binds only to wild-type Saccharomyces.
The presence of Toxoplasma N-glycans composed of Man9GlcNAc2, which derive from host cell N-glycan precursors, was confirmed in two ways (19). First, scytovirin (specific for Man9GlcNAc2) and cyanovirin-N (which binds to both Man5GlcNAc2 and Man9GlcNAc2) label the same apical structures in Toxoplasma, which were fixed and permeabilized (Fig. 5I). Second, although cyanovirin-N labels Toxoplasma more strongly than does scytovirin, the scytovirin labeling is well above background with no label, as measured by flow cytometry (Fig. 5L). A rough calculation made from the ratios of cyanovirin-N binding to scytovirin binding to Toxoplasma suggests that ~2/3 of the Toxoplasma N-glycans derive from the endogenous precursor (Glc3Man5GlcNAc2) and 1/3 derive from the host precursor (Glc3Man9GlcNAc2). In summary, the results here argue for the presence of N-glycans which derive from both protist and host precursors in apical glycoproteins of Toxoplasma, as recently suggested (19).
For colocalization studies with Plasmodium N-glycans (labeled red with GSL-II), either we examined transfected plasmodia expressing GFP targeted to various subcellular structures (55) or we labeled nontransformed plasmodia with antibodies to proteins associated with particular subcellular structures (Fig. 6). Both methods worked very well, and unsynchronized cultures were used to sample various stages of development. GSL-II labeling overlaps best with the binding of antibodies to rhoptry protein 3 (anti-RhopH3) (Fig. 6, row A) (14). Rhoptry proteins were also enriched after binding Plasmodium glycoproteins to the WGA-1 resin (Table 1).
GSL-II overlaps to a lesser degree with binding of antibodies to Pf39, which is a membrane-associated calcium-binding protein with a C-terminal ER retention signal (Fig. 6, row B) (51). In contrast, there is no overlap between GSL-II and GFP targeted to the apicoplasts (ACPleader-GFP) (Fig. 6, row C), parasitophorous vacuole (ACPsignal-GFP) (Fig. 6, row D), and cytosol (ACPtransit-GFP) (see Fig. S2A in the supplemental material) (55). Control antibodies to GFP label the apicoplasts and parasitophorous vacuole, showing that accessibility does not prevent labeling of these compartments by GSL-II (see Fig. S4A and S4B, respectively). For the most part, GSL-II does not overlap with antibodies to dipeptidyl aminopeptidase I (anti-DPAP1), which is a food vacuole protein (Fig. 6, row E) (30). We conclude that occupation of N-glycan sites is markedly reduced in apicoplast proteins compared to some secretory proteins in Plasmodium.
Cyanovirin-N, which binds N-glycans of Toxoplasma, colocalizes extensively but not exclusively with antibodies to rhoptry protein 1 (anti-ROP1; MAb Tg49) (Fig. 7, row A) (47). Cyanovirin-N also colocalizes in apical secretory organelles with the GalNAc-binding plant lectin Maclura pomifera agglutinin 1 (MPA-1), which binds to O-linked glycans of Toxoplasma, as well as the fucose-binding lectin UEA-1 (data not shown) (50). Cyanovirin-N weakly colocalizes with GFP targeted to the ER (P30-GFP-HDEL) (Fig. 7, row B) and with antibodies to surface antigen 1 (anti-SAG1; MAb DG52) (Fig. 7, row C) (8, 24). In contrast, there is no overlap between cyanovirin-N and yellow fluorescent protein fused to the acyl carrier protein (ACP-YFP), which labels apicoplasts of Toxoplasma (Fig. 7, row D) (35). There is also no labeling of the apicoplast with MPA-1 and UEA-1 (data not shown). Control antibodies to GFP label Toxoplasma apicoplasts, showing that lack of accessibility does not prevent labeling of the organelle by cyanovirin-N and the plant lectins (see Fig. S5B in the supplemental material). There is no overlap between the Golgi marker (GRASP55) and cyanovirin-N (Fig. 7, row E) (40). This result may be secondary to modifications of N-glycans in the Golgi apparatus of Toxoplasma which block cyanovirin-N binding. Finally, cyanovirin-N does not bind to cisternae of the inner membrane complex (23). We conclude that glycoproteins containing unmodified N-glycans (recognized by cyanovirin-N) are present in the apical secretory vesicles but absent from the apicoplast.
The biochemical methods used here showed that Plasmodium makes the N-glycan precursor predicted by its Alg enzymes (dolichol-PP-GlcNAc2), as well as dolichol-PP-GlcNAc (Fig. 1, ,3,3, and and4)4) (26, 46). We did not find any host (high-mannose) N-glycans associated with Plasmodium, as previously suggested (29). The very short Plasmodium N-glycans were likely missed by other investigators because they made the reasonable assumption that Plasmodium N-glycans contain mannose residues, as was the case for all N-glycans studied to that date (6, 22, 26, 46).
Because WGA-1, which was used to localize GlcNAc2 on N-glycans of Giardia (41), cross-reacts with sialic acid residues on RBC glycoproteins, we used GSL-II to localize Plasmodium N-glycans (Fig. 3, ,4,4, and and6;6; see Fig. S2 and S3 in the supplemental material) (59). As GSL-II, like WGA-1 and other GlcNAc-binding lectins, may have a weaker affinity for a single GlcNAc rather than GlcNAc polymers, we may have failed to localize some Plasmodium N-glycans containing a single GlcNAc. Still, GSL-II brightly labels numerous secretory compartments of plasmodia and is an excellent “counterstain” for organelles localized with GFP or antibodies (Fig. 6) (14, 21, 30, 51, 52). Indeed GSL-II labeling provides three-dimensional images of plasmodia using deconvolution methods, while interference contrast microscopy, which is commonly used in association with GFP or antiparasite antibodies, provides information in only two dimensions. GSL-II labeling of plasmodia, in conjunction with UEA-1 labeling of RBCs and DAPI labeling of RBCs (Fig. 4), could be used to facilitate diagnosis of malaria.
The antiretroviral lectins cyanovirin-N and scytovirin used here to localize Toxoplasma N-glycans have much greater specificity for high-mannose N-glycans than the plant lectin concanavalin A, which was previously used to localize Toxoplasma N-glycans (Fig. 5 and and7)7) (2, 16, 32). With scytovirin, we confirmed the remarkable finding of Garénaux et al. (19) that Toxoplasma N-glycans derive from precursors predicted by the parasite Alg enzymes (Man5GlcNAc2), as well as those of the host (Man9GlcNAc2) (26, 32, 46). While recent studies demonstrate membrane fusion between the host cell ER and the parasitophorous vacuole (20), it is not clear how lipid-pyrophosphate-linked N-glycan precursors move from the parasitophorous vacuole to the protist ER, where the OST adds N-glycans to nascent proteins (28, 45). Regardless, scytovirin should be a useful tool for studies of mechanisms of membrane fusion between the host ER and the parasitophorous vacuole.
The present experiments support our hypothesis that there is Darwinian selection against N-glycans in nucleus-encoded apicoplast proteins, which traverse through the ER prior to threading into the organelle (Fig. 8A). (11, 21, 25, 35, 37, 39, 48, 52, 55). Negative selection against N-glycans appears to occur by multiple mechanisms.
(i) There is extensive secondary loss of Alg enzymes that make N-glycan precursors in apicomplexans, which are transmitted via an arthropod vector. Plasmodium and Babesia are capable of making dolichol-PP-GlcNAc2, while Theileria makes no N-glycan at all (Fig. 1 and and8B8B).
(ii) In Plasmodium, early blood stage parasites make lipid-linked N-glycan precursors which are composed predominantly of a single GlcNAc, while later blood stage parasites make N-glycan precursors which are a mixture of GlcNAc and GlcNAc2 (Fig. 3). This result suggests that Plasmodium is underutilizing the few Alg enzymes that the organism has (Fig. 1).
(iii) The density of sequons (sites of N-linked glycosylation) in Toxoplasma secreted and apicoplast proteins is extremely low, which correlates with the marked GC-rich codon usage of this protist (Fig. 2 and and8C)8C) (12). In addition, there is decreased probability that Asn, Ser, and Thr will be in sequons rather than elsewhere in nucleus-encoded apicoplast proteins of Toxoplasma, so that nearly half of these proteins lack any sites of N-linked glycosylation. Positive selection for sequons with Thr in secreted proteins of eukaryotes, which use N-glycans for quality control of protein folding, also occurs by changes in the probability that Asn and Thr will be in sequons rather than elsewhere in the protein (12).
(iv) There appears to be a marked reduction in the occupation of N-glycan sites (sequons) in nucleus-encoded apicoplast proteins of Plasmodium and Toxoplasma in comparison to secreted proteins of each protist (Fig. 6, ,7,7, and and8D).8D). In the case of Plasmodium, nucleus-encoded apicoplast proteins have the same high density of sequons as secreted proteins (Fig. 2), so we had expected strong GSL-II labeling of the Plasmodium apicoplast. In the case of Toxoplasma, nucleus-encoded apicoplast proteins have about half the density of N-glycan sites as secreted proteins. While previous investigators concluded that concanavalin A colocalizes with the Toxoplasma apicoplast, the pictures shown suggest that concanavalin A stains the ER adjacent to the apicoplast (16).
(v) Our N-glycan occupancy results are consistent with recent studies of protein targeting to the apicoplast that (i) demonstrate a pore (Tic20 homolog) through which Toxoplasma apicoplast proteins are threaded (54) and (ii) suggest the possibility that apicoplast proteins are rapidly dislocated from the ER lumen by an ERAD-type mechanism prior to targeting to the organelle (13, 21).
(vi) How apicoplast-targeted proteins avoid N glycosylation is unclear. One possibility is that the nucleus-encoded apicoplast proteins are shuttled out of the ER and into the organelle while the proteins are being translocated and so before N-glycans are added (28, 39, 48). An alternative possibility is that oligosaccharyltransferases (OSTs) of apicomplexans do not transfer N-glycans to nascent proteins during translocation. Instead, parasite OSTs may transfer N-glycans to proteins after translocation into the ER, as has been recently described for some human and Trypanosoma OSTs (34, 45). This suggests the possibility that apicoplast proteins remain without N-glycans, because they are removed from the lumen of the ER prior to addition of the N-glycans by the posttranslational OST.
Alternative interpretations and limitations of the evidence are as follows.
(i) Our results show that apicomplexans transmitted by arthropods (Plasmodium, Babesia, and Theileria) have very short or no N-glycans, while apicomplexans which have an oocyst form and are transmitted orally (Toxoplasma, Neospora, Cryptosporidium, and Eimeria) have longer N-glycan precursors (Fig. 1). Because apicomplexans transmitted by an arthropod vector are phylogenetically more similar to each other than to apicomplexans with oocyst forms, it is not clear whether differences in N-glycan length are the result of differences in mode of transmission or the result of common ancestry (31).
(ii) Extensive secondary loss of Alg enzymes also occurs in Giardia, which makes the same very short N-glycans as Plasmodium, and in microsporidia, which make no N-glycans (12, 41, 46). Because these organisms, which are unrelated to Plasmodium and Theileria, do not have an apicoplast, it is likely that very short N-glycans have other benefits for parasitic protists. For example, the very short N-glycan GlcNAc2 is not immunogenic, while longer GlcNAc polymers, such as those present in chitin, are immunogenic (43).
(iii) A strong GC-rich codon bias also occurs in Leishmania, which does not have an apicoplast (12). While a high GC content is necessary for genome stability in thermophilic bacteria (58), the forces driving high GC content in other organisms are not well understood.
(iv) As discussed above, it is likely that the lectins used here do not identify all of the N-glycans present on glycoproteins of Plasmodium and Toxoplasma. In the case of plasmodia, there may be apicoplast and secreted proteins with N-glycans composed of a single GlcNAc that are missed by GSL-II. In the case of Toxoplasma, there may be complex N-glycans, which have been modified in the Golgi apparatus, that are not recognized by cyanovirin-N or scytovirin (2).
(v) In addition to apicoplasts, other secretory compartments of Plasmodium (e.g., the food vacuole, parasitophorous vacuole, and RBC surface) are not labeled with GSL-II (Fig. 6). Similarly, in addition to apicoplasts, other secretory compartments of Toxoplasma (the Golgi apparatus, inner membrane complex, and plasma membrane) are not labeled by cyanovirin-N (Fig. 7). These results suggest that there is a complicated relationship between N-linked glycosylation and protein targeting in these apicomplexans that we do not yet understand.
(vi) This relative lack of specificity with regard to selection against N-glycans in apicomplexans is in contrast to the great specificity of positive selection for sites of N-linked glycosylation (sequons) in the vast majority of eukaryotes (12). Positive selection for sequons occurs only in organisms with N-glycan-dependent QC of protein folding, is for secreted proteins but not cytosolic proteins, is for sequons with Thr but not those with Ser, and is based upon the increased probability that Asn and Thr are present in sequons rather than elsewhere in secreted proteins.
(vii) There is a built-in redundancy or inconsistency with regard to the apparent selection against N-glycans in these apicoplexans. If apicoplast proteins of both Plasmodium and Toxoplasma are selectively underglycosylated (Fig. 6, ,7,7, and and8D),8D), why is there also apparent selection against N-glycan length in Plasmodium (Fig. 1 and and8B)8B) and N-glycan density in Toxoplasma (Fig. 2 and and8C8C)?
Because of the alternative explanations of the data and because we do not have a mechanism for the reduced occupancy of N-glycan sites of Plasmodium and Toxoplasma, we think that these studies provide suggestive evidence for our hypothesis that there is selection against N-glycans in nucleus-encoded apicoplast proteins (Fig. 8A). A strength of the hypothesis is that it makes sense of disparate facts concerning these parasites, which are otherwise unexplained (e.g., secondary loss of Alg enzymes in apicomplexans [Fig. 1 and and8B]8B] and marked decrease in sites of N-linked glycosylation in Toxoplasma [Fig. 2 and and8C]).8C]). Experiments to test our hypothesis and to further explore the complex relationship between protein-targeting and glycosylation in these apicomplexans are in progress.
Recent work by Boris Striepen and colleagues (S. Agrawal, G. G. van Dooren, W. L. Beatty, and B. Striepen, J. Biol. Chem. 284:33683–33691, 2009) shows that an endosymbiont-derived endoplasmic reticulum-associated degradation(ERAD) system functions in the import of apicoplast proteins. Their results suggest that apicoplast proteins may by removed from the lumen of the ER prior to addition of N-glycans.
This work was supported by NIH grants AI048082 (to J.S.), AI057919 (to M.T.D.), and GM31318 (to P.W.R.). M.T.D. is a Burroughs Wellcome Fund Investigator in the Pathogenesis of Infectious Diseases. Support for D.M.R. was provided by the Training Program in Host Pathogen Interactions (T32 AI052070). Support for M.J.G. was provided by American Heart Association Scientist Development grant 0635480N and a March of Dimes Basil O'Connor Starter Scholar Research Award.
We thank Dick Cook of MIT for some mass spectroscopy data.
†Supplemental material for this article may be found at http://ec.asm.org/.
Published ahead of print on 25 September 2009.