The extracellular domain of Notch contains epidermal growth factor (EGF) repeats that are extensively modified with different O-linked glycans. O-Fucosylation is essential for receptor function, and elongation with N-acetylglucosamine, catalyzed by members of the Fringe family, modulates Notch activity. Only recently, genes encoding enzymes involved in the O-glucosylation pathway have been cloned. In the Drosophila mutant rumi, characterized by a mutation in the protein O-glucosyltransferase, Notch signaling is impaired in a temperature-dependent manner, and a mouse knock-out leads to embryonic lethality. We have previously identified two human genes, GXYLT1 and GXYLT2, encoding glucoside xylosyltransferases responsible for the transfer of xylose to O-linked glucose. The identity of the enzyme further elongating the glycan to generate the final trisaccharide xylose-xylose-glucose, however, remained unknown. Here, we describe that the human gene C3ORF21 encodes a UDP-xylose:α-xyloside α1,3-xylosyltransferase, acting on xylose-α1,3-glucoseβ1-containing acceptor structures. We have, therefore, renamed it XXYLT1 (xyloside xylosyltransferase 1). XXYLT1 cannot act on a synthetic acceptor containing an α-linked xylose alone, but requires the presence of the underlying glucose. Activity on Notch EGF repeats was proven by in vitro xylosylation of a mouse Notch1 fragment recombinantly produced in Sf9 insect cells, a bacterially expressed EGF repeat from mouse Notch2 modified in vitro by Rumi and Gxylt2 and in vivo by co-expression of the enzyme with the Notch1 fragment. The enzyme was shown to be a typical type II membrane-bound glycosyltransferase localized in the endoplasmic reticulum.
Notch signaling is based on the interaction between transmembrane ligands of the Delta and Jagged/Serrate family and the extracellular domain of Notch receptors. This interaction enables two subsequent proteolytic cleavages that release the intracellular domain of the Notch receptor. After relocation to the nucleus, it binds to nuclear factors, thereby promoting target gene expression. The extracellular domains of Notch receptors and their ligands are composed of tandem epidermal growth factor-like (EGF) repeats (reviewed in Ref. 1) that are modified by unusual O-linked carbohydrates. The EGF repeats harbor consensus sequences containing serine or threonine residues to which O-linked fucose (Fuc), glucose (Glc), or N-acetylglucosamine (GlcNAc) are transferred (2, 3). Modification of O-linked GlcNAc has not been identified yet (3). In contrast, both fucose and glucose are further elongated to form the final tetrasaccharide Siaα2,3/6Galβ1,4GlcNAcβ1,3Fucα1-O-Ser/Thr (2) and trisaccharide Xylα1,3Xylα1,3Glcβ1-O-Ser (2, 4–6), respectively. These O-glycans are found on a variety of other proteins including coagulation factor VII and IX and thrombospondin (7, 8), but are functionally most renowned as modulators of Notch signaling (9–11).
Mice and flies lacking the protein O-fucosyltransferase-1 (Pofut1)2 exhibit Notch loss-of-function phenotypes (12, 13). Although initial fucosylation seems to be a prerequisite for Notch receptor function, transfer of GlcNAc catalyzed by enzymes of the Fringe family results in altered receptor-ligand interactions. Signaling induced by the Notch ligand Delta is increased, whereas signaling induced by Jagged/Serrate is decreased (14–18) as a result of Fringe activity.
In contrast, the importance of the trisaccharide Xyl-Xyl-Glc-O-Ser is less well understood. The initiating protein O-glucosyltransferase (Poglut) has been identified only recently through the Drosophila mutant rumi (19). Rumi-deficient flies show a temperature-sensitive phenotype, which is most severe and equivalent to the complete loss of Notch signaling in flies grown at 28–30 °C. Experiments performed by Acar et al. (19) provide evidence that the proteolytic cleavage of the Notch receptor is impaired in rumi flies but that interaction with the ligand Delta is not influenced. Thus, O-glucosylation appears to play a role for conformational stabilization of Notch at high temperatures to enable the proteolytic cleavage necessary for signal transmission (9, 11, 19, 20). A mammalian ortholog of Rumi has been demonstrated to encode an active Poglut and results in an early lethal phenotype when knocked out in mice (21). In contrast to phenotypes of other global regulators of Notch (13, 22, 23), animals lacking Rumi showed more severe and complex phenotypes, suggesting that other essential proteins are targets for O-glucosylation (21). Very recently, Rumi was shown to function as a protein O-xylosyltransferase (Poxylt, utilizing UDP-Xyl) as well as a Poglut, and it was shown that the O-xylose can be extended to a trisaccharide (Xyl-Xyl-Xyl-O-Ser) on mouse Notch2 (24). Enzymatic activity of the two α1,3-xylosyltransferases involved in extension of O-glucose had been detected in bovine liver cells (25–27), but the identity of the encoding genes long remained unknown.
Recently, we have shown (28) that two members of the human glycosyltransferase 8 family (GT8) (29), GXYLT1 and GXYLT2 (glucoside-xylosyltransferase 1/2), are able to transfer the first α1,3-linked xylose to O-glucosylated mammalian Notch EGF repeats. The enzymes exhibit about 50% identity at amino acid sequence level with differences most apparent in the stem region, but no differences in their acceptor specificity could be observed so far. GXYLT1 and GXYLT2 are, however, not able to elongate substrates containing the disaccharide Xyl-Glc- (28), indicating the requirement for an additional xylosyltransferase. Here, we describe the identification of another human member of the GT8 family, only distantly related to the previously analyzed GT8 members, encoding a xyloside-xylosyltransferase (XXYLT), which further elongates the Xyl-Glc-O-Ser disaccharide on Notch EGF repeats.
Protein A fusion constructs of GXYLT1 and GLT8D1 for recombinant expression via the pFast Bac system (Invitrogen) were reported previously (28). To generate the equivalent plasmid of xyloside xylosyltransferase 1 (XXYLT1), the putative luminal C-terminal domain, starting from Ser-43, was amplified by PCR from human prostate Marathon-Ready cDNA (Clontech) using the primers 5′-atctgaattcaggccgggagaccttctc-3′ and 5′-atctgaattcctagtcctccgggatggga-3′. After EcoRI digestion, the sequence was cloned into the pFast Bac1 vector (Invitrogen), containing the honeybee melittin secretion sequence followed by the protein A coding sequence from pProtA (ProtA-XXYLT1). C-terminal Myc/His-tagged mouse Notch EGF1–5, encoding the first five EGF domains of mouse Notch1 cloned into pFast Bac1, was described previously (28, 30).
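The design of the two XXYLT1 primers quoted above can be checked with a short script. The sketch below only locates the EcoRI recognition sequence (GAATTC) in each primer and reverse-complements the gene-specific part of the reverse primer. The helper names are illustrative, and the reading of the final "tag" as a stop codon is an interpretation rather than a statement from the text.

```python
# EcoRI recognition sequence (GAATTC), written in lower case like the primers.
ECORI_SITE = "gaattc"

# Primer sequences copied from the text (5' -> 3').
FORWARD = "atctgaattcaggccgggagaccttctc"
REVERSE = "atctgaattcctagtcctccgggatggga"

# Translation table for complementing lower-case DNA.
COMPLEMENT = str.maketrans("acgt", "tgca")


def reverse_complement(seq: str) -> str:
    """Return the reverse complement of a lower-case DNA sequence."""
    return seq.translate(COMPLEMENT)[::-1]


def split_primer(primer: str, site: str = ECORI_SITE):
    """Split a primer into (5' overhang, restriction site, gene-specific part)."""
    start = primer.find(site)
    if start < 0:
        raise ValueError("restriction site not found in primer")
    end = start + len(site)
    return primer[:start], primer[start:end], primer[end:]


fwd_parts = split_primer(FORWARD)
rev_parts = split_primer(REVERSE)

# The reverse primer anneals to the coding strand, so its gene-specific part
# is reverse-complemented to read it in the sense orientation.
rev_sense = reverse_complement(rev_parts[2])

print("forward:", fwd_parts)
print("reverse:", rev_parts)
print("reverse primer, sense orientation:", rev_sense)
```

Both primers carry the same four-nucleotide overhang followed by the EcoRI site, which is consistent with cloning through a single EcoRI digestion. In sense orientation the reverse primer ends in "tag", which would fit a stop codon at the end of the amplified open reading frame.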
The cDNAs encoding the luminal domains of mouse Gxylt2, starting from Arg-26, or Xxylt1, starting from Ser-43, were subcloned into a pSecTag2 vector (Invitrogen) so that the recombinant proteins were expressed with a C-terminal Myc/His tag in mammalian cells and secreted into the culture media.
The full-length open reading frame of XXYLT1 (National Center for Biotechnology Information (NCBI) reference sequence NM_152531) was cloned into the pcDNA3 vector (Invitrogen) and modified with protein tags to generate the single-tagged XXYLT1-HA (XXYLT1-SRYPYDVPDYASL) and FLAG-XXYLT1 (MDYKDDDDKEF-XXYLT1) and the double-tagged FLAG-XXYLT1-HA (MDYKDDDDKEF-XXYLT1-SRYPYDVPDYASL) plasmids. Similar constructs (B4GALT1-HA and FLAG-B4GALT1-HA) were generated for the control gene, the human β1,4-galactosyltransferase 1 (B4GALT1; NCBI reference sequence NM_001497). To generate an appropriate ER marker, the Golgi-located pAcGFP1-Golgi Vector (Clontech), encoding a GFP fusion protein located in the Golgi apparatus, was modified by insertion of a C-terminal KDEL signal sequence. After PCR on the original plasmid, the modified insert was recloned via BamHI and XbaI restriction sites into the vector. The new construct GFP1-KDEL was tested by immunofluorescence studies in CHO cells. DNA sequences of the constructs were confirmed by sequencing. All the primers are available upon request.
Protein A fusion constructs of soluble secreted human GLT8D1, GXYLT1, and XXYLT1 were expressed in Sf9 cells by baculovirus infection (Bac-to-Bac®; Invitrogen) and purified by IgG-Sepharose-6 Fast Flow beads (GE Healthcare) as described (28). Soluble secreted mouse Gxylt2 and Xxylt1 were expressed in HEK293T cells and purified from the culture media by nickel-nitrilotriacetic acid affinity chromatography as described previously (24). Protein concentration was determined by Coomassie Blue staining using BSA or protein A as standard.
Assays were performed in a total volume of 50 μl in 100 mM MOPS, pH 7.5, 10 mM MnCl2, 10 mM ATP as described before (28). IgG bead-coupled protein A-fused enzymes were incubated for 1 h at 37 °C in the presence of radiolabeled donor sugars UDP-[6-3H]Gal, UDP-[1-3H]Glc (GE Healthcare), or UDP-[U-14C]Xyl (PerkinElmer Life Sciences) at 5 μM with specific activity 4 kBq/nmol for 3H-sugars and 0.75 kBq/nmol for [14C]Xyl, obtained by dilution with cold nucleotide sugars (Sigma and CarboSource Services) and a number of synthetic acceptors (Xyl-Xyl-Glc-R, Xyl-Glc-R, Glc-R (31), or para-nitrophenol (pNP)-linked carbohydrates) at a final concentration of 100 μM. To determine the amount of transferred radiolabeled sugars, donor and acceptor were separated via C18 columns (Sep-Pak® Vac 3cc; Waters Corp.), and samples were counted by liquid scintillation (LS 6500; Beckman Coulter). The measured enzymatic activity was expressed as nmol of xylose transferred per nmol of protein per hour, with the amount of protein estimated from Coomassie Blue staining using a protein A standard as reference. Note that this assay was set up to show specificity but does not allow drawing conclusions about enzymatic efficiency due to the approximately equimolar amount of UDP-Xyl and enzyme in the assay.
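The conversion from scintillation counts to the reported activity unit can be sketched as follows. The donor concentration, assay volume, incubation time, and specific activity are taken from the text. The counted radioactivity and the enzyme amount in the usage example are invented illustrative numbers, not data from this study, and the function names are hypothetical.

```python
def donor_nmol(conc_um: float, volume_ul: float) -> float:
    """Amount of donor in the assay: uM x ul gives pmol; divide by 1000 for nmol."""
    return conc_um * volume_ul / 1000.0


def bq_to_nmol(activity_bq: float, specific_activity_kbq_per_nmol: float) -> float:
    """Convert product radioactivity (Bq) to nmol of transferred sugar."""
    return activity_bq / (specific_activity_kbq_per_nmol * 1000.0)


def specific_transfer(product_nmol: float, enzyme_nmol: float, hours: float) -> float:
    """Activity as nmol of sugar transferred per nmol of protein per hour."""
    return product_nmol / enzyme_nmol / hours


# Values stated in the text for the [14C]Xyl assay.
total_donor = donor_nmol(conc_um=5.0, volume_ul=50.0)   # nmol of UDP-Xyl per assay
total_bq = total_donor * 0.75 * 1000.0                  # Bq of 14C in the assay

# Illustrative (assumed) measurement: 75 Bq recovered in the acceptor fraction,
# with 0.25 nmol of enzyme on the beads and a 1-h incubation.
product = bq_to_nmol(activity_bq=75.0, specific_activity_kbq_per_nmol=0.75)
activity = specific_transfer(product_nmol=product, enzyme_nmol=0.25, hours=1.0)

print(f"donor per assay: {total_donor} nmol ({total_bq} Bq)")
print(f"illustrative activity: {activity:.2f} nmol/nmol/h")
```

With only 0.25 nmol of donor per assay, an enzyme present at a similar molar amount cannot turn over more than about once, which is why the assay reports specificity rather than catalytic efficiency.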
A single 16th EGF repeat (EGF16) from mouse Notch2 was prepared and modified with Glc-O or Xyl-O by Rumi as described previously (24). In vitro xylosylation of the Glc-EGF repeat was performed using mouse Gxylt2 that was produced and purified by nickel affinity from medium of HEK293T cells. Typically, a 5-ml reaction mixture contained 50 mM HEPES, pH 7.0, 10 mM MnCl2, 5 μM Glc-EGF repeat, 200 μM UDP-Xyl, 0.5% Nonidet P-40, and ~10 μg of Gxylt2. Samples were incubated at 37 °C overnight, and the product was purified by reversed phase high performance liquid chromatography (HPLC) as described previously (24). Substrate specificity of XXYLT1 on these differently glycosylated EGF16 acceptors was tested using radiolabeled UDP-Xyl in a reaction volume of 10 μl containing 50 mM HEPES, pH 7.0, 10 mM MnCl2, 10 μM acceptor substrate, 10 μM UDP-[14C]Xyl (5 kBq/nmol; American Radiolabeled Chemicals), 0.5% Nonidet P-40, and 10 ng of the mouse Xxylt1 protein, purified as described above for Gxylt2. After 20 min of incubation at 37 °C, the reaction was stopped by adding 900 μl of 100 mM EDTA, pH 8.0. Samples were loaded onto C18 cartridges (100 mg, Agilent). After washing with 5 ml of H2O, the EGF repeats were eluted with 1 ml of 80% methanol. Incorporation of [14C]Xyl into the EGF repeats was determined by liquid scintillation. Reactions without substrates were used as background controls.
1 mg of synthetic acceptor Xyl-Glc-R was xylosylated in 500 μl of reaction buffer by 100 μl of bead-coupled XXYLT1 in the presence of equimolar amounts of cold UDP-Xyl under standard in vitro activity assay conditions. The sample was taken up in 100 μl of 25% aqueous acetonitrile and analyzed (20 μl) by reverse phase HPLC on a column (25 cm × 4.6 mm) with LC-8 (5 μm) sorbent with elution by 25% aqueous acetonitrile at a flow rate of 0.8 ml/min. The elution profile was compared with profiles of reference trisaccharide Xyl-Xyl-Glc-R and disaccharide Xyl-Glc-R acceptor (31).
For linkage confirmation in the product of in vitro xylosylation, 1H nuclear magnetic resonance (NMR) (600 MHz, 303 K, D2O, Bruker Avance 600), natural abundance 13C NMR (125 MHz, 303 K, D2O, Bruker Avance 600), and two-dimensional heteronuclear single quantum correlation (HSQC) NMR (600 MHz, 303 K, D2O, Bruker Avance 600) of the reaction mixture were acquired and compared with spectral data for the synthetic reference compounds (see Fig. 3 and supplemental Figs. S1 and S2 and Table S1).
C-terminal Myc/His-tagged mouse Notch EGF1–5 was expressed in 300 ml of Sf9 insect cells cultured in Insect-Xpress medium (Lonza) by baculovirus infection. After 72 h, the secreted protein was purified by nickel affinity chromatography (HisTrap HP, 1 ml; GE Healthcare) as described (28). Protein fractions reactive with anti-Myc were pooled and concentrated/desalted to 150 μl using an Amicon Ultra-4 centrifugal device (Millipore) and stored in 100 mM MOPS, pH 7, at −20 °C. SDS-PAGE followed by Coomassie Blue staining and immunoblotting with monoclonal antibody 9E10 (anti-Myc) confirmed the protein purification. Under standard activity assay conditions, 10 μl of purified Notch EGF1–5 was incubated in the presence of 100 μM cold UDP-Xyl as donor and 10 μl of bead-coupled enzyme for 4 h, separated by SDS-PAGE, and analyzed by mass spectrometry (MS).
Protein bands of Notch EGF1–5 were excised from gel and trypsin-digested, and peptides were recovered as described (32). Reverse phase chromatography using acetonitrile as eluent was performed on a Waters nanoACQUITY UPLC device equipped with an analytical column (Waters, BEH130 C18, 100 μm × 100 mm, 1.7-μm particle size) coupled online to an ESI Q-TOF Ultima (Waters). Spectra were recorded in positive ion mode, and peptides were automatically subjected to fragmentation (tandem mass spectrometry (MS/MS)). Unglycosylated or glycosylated versions of EGF16 from mouse Notch2 were analyzed by nano-LC-MS/MS as described previously (24).
Notch EGF1–5 was co-expressed with XXYLT1, or as control, the inactive enzyme GLT8D1 (28) in Sf9 cells by baculovirus infection for 72 h. Purification and analysis of peptide glycosylation of the Notch fragment were performed as specified above.
Localization studies were carried out in CHO cells. Cell transfection was performed on glass coverslips in 24-well plates using METAFECTENE (Biontex). After 24 h, cells were fixed using 4% PFA in PBS, permeabilized with 0.1% saponin in PBS, and stained with the respective primary (rabbit anti-HA (Sigma), mouse anti-FLAG M5 (Sigma), or rabbit α-mannosidase II (gift of Dr. K. Moremen, University of Georgia, Athens, GA)) and secondary antibodies (anti-mouse IgG-Cy3 (Sigma) and anti-rabbit IgG-Alexa Fluor 488 (Invitrogen)) using standard staining conditions. As an ER marker, the GFP1-KDEL construct was co-transfected. Images were acquired with a Zeiss Axiovert 200M.
CHO cells were grown in 75-cm² flasks and transfected using METAFECTENE (Biontex) with XXYLT1-HA, FLAG-XXYLT1-HA, B4GALT1-HA, or FLAG-B4GALT1-HA. After 24 h, cells were harvested and lysed in 50 mM Tris/HCl, pH 8.0, buffer containing 2 mM EDTA, 1 mM MgCl2, 1% Nonidet P-40, and protease inhibitor (Roche Applied Science). Proteins were immunoprecipitated using ~25 μg of mouse anti-HA 12CA5 antibody coupled to 50 μl of protein A-Sepharose™ CL-4B (GE Healthcare). Beads were washed according to the manufacturer's instructions and applied to SDS-PAGE. Protein expression was analyzed by Western blotting using primary antibodies rabbit anti-HA and rabbit anti-FLAG (both from Sigma) followed by visualization with rabbit IRDye 800CW (LI-COR) using the LI-COR Odyssey imager.
The human genes GXYLT1 and GXYLT2 transfer xylose in α1,3 to O-linked glucose (Fig. 1B). The gene responsible for the transfer of the second xylose remained unknown. Considering that both enzymatic reactions result in the addition of xylose in α1,3 linkage, the genes were anticipated to be homologous. Still, none of the four initially identified members of the GT8 family (GXYLT1, GXYLT2, GLT8D1, and GLT8D2) that were selected based on homology with UDP-Glc:glycoprotein glucosyltransferase (genes UGGT1 and UGGT2) had xylosyltransferase activity toward Xylα1,3Glcβ1 terminated acceptors (28). Using GXYLT1 in a position-specific iterated (PSI)-Blast, a fifth gene was identified, showing less than 20% overall identity at the amino acid level (Fig. 1A). This gene, named C3ORF21 (chromosome 3 open reading frame 21), possessed two conserved DXD motifs and an N-terminal signal sequence or membrane anchor. Therefore, its potential involvement in the synthesis of the Xylα1,3Xylα1,3Glcβ1-O oligosaccharide was investigated.
To test the enzymatic activity of the putative xylosyltransferase, renamed XXYLT1, the protein, lacking the predicted membrane-bound domain, was fused N-terminally to a protein A tag and cloned in a baculoviral vector with a signal sequence to promote secretion of the fusion protein. After expression in Sf9 insect cells, the protein was isolated from the culture medium by capture with IgG beads, binding the N-terminal protein A tag. Bead-coupled enzyme was used for in vitro glycosyltransferase activity assays with artificial synthetic compounds that mimic the natural acceptor structures (28, 31). Assays carried out with the radiolabeled donor substrate UDP-[14C]Xyl revealed xylosyltransferase activity of XXYLT1 with the Xyl-Glc-R synthetic acceptor (Fig. 2A). Marginally increased values, when compared with the negative control, were observed for the Xyl-Xyl-Glc-R acceptor. These can be interpreted as due to the presence of degradation products within the acceptor compound, rather than as resulting from generation of a product with three xylose residues. This conclusion is supported by the lack of any signal in the HPLC profile of Fig. 3B that could represent such a product. XXYLT1 showed a clear donor substrate specificity for UDP-Xyl (Fig. 2C). Interestingly, no enzymatic activity could be observed with a panel of sugars linked to pNP as acceptors (Fig. 2B), including α-linked Xyl-pNP, suggesting that the enzyme requires the disaccharide Xylα1,3Glcβ1 as minimal acceptor structure.
The reaction product generated by XXYLT1 was investigated by HPLC and NMR studies to confirm the structure of the formed trisaccharide, especially of the direction and configuration of the linkage between the two xylosyl units (Fig. 3). HPLC data of Xyl-Glc-R modified by XXYLT1 in the presence of UDP-Xyl, shown in Fig. 3B, indicated that the enzyme generated a product migrating at the same position as the synthetic reference trisaccharide Xyl-Xyl-Glc-R (Fig. 3C). Moreover, acquired 13C and 1H NMR spectra of the enzymatically modified disaccharide (Fig. 3B; supplemental Fig. S1B) are identical to the spectra of the synthetic reference trisaccharide (Fig. 3C; supplemental Fig. S1C). In particular, the characteristic low field (80.3 ppm) location of C3′ signal in the 13C NMR spectrum unequivocally shows that the xylosyl residue is linked through an α1,3 linkage, whereas the α-configuration of the terminal xylose unit is indicated by the coupling constant JH1,H2 of 3.7 Hz (supplemental Fig. S1; supplemental Table S1). Finally, the precise nature of the product was also confirmed by a two-dimensional HSQC NMR spectrum (supplemental Fig. S2).
To determine whether Notch is modified by XXYLT1, an in vitro assay was carried out to test a naturally occurring EGF acceptor. A mouse Notch fragment containing five EGF repeats with consensus sequences for O-glucosylation in repeats 2 and 4 (Notch EGF1–5) was expressed in Sf9 insect cells and purified by nickel-affinity chromatography. The purified acceptor presented both Glc-O and Xyl-Glc-O modifications in about a one-to-one ratio as shown by mass spectral analysis of the glycopeptide fragment from EGF4 (m/z 1166 and 1210, Fig. 4A). Incubation of this acceptor with GXYLT1 resulted in the expected conversion of Glc-O into Xyl-Glc-O, visible by the disappearance of the peak at m/z 1166 (Fig. 4B). In contrast, incubation of Notch EGF1–5 with XXYLT1 resulted in the de novo generation of a product at m/z 1254 (Fig. 4C) and loss of the Xyl-Glc-O peak at m/z 1210. The new peak at m/z 1254 demonstrated the addition of one pentose residue to the Xyl-Glc-O-containing peptide. Modification of Notch EGF1–5 with both enzymes, GXYLT1 and XXYLT1, resulted in a complete shift toward the formation of the trisaccharide Xyl-Xyl-Glc-O on EGF repeat 4 (Fig. 4D). The same was observed for the peptide of EGF repeat 2, but overlapping signals from another peptide obscured the result (data not shown).
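The spacing of the three glycopeptide peaks can be related to pentose addition with a small calculation. The sketch below uses the standard monoisotopic mass of a pentose residue (C5H8O4, 132.0423 Da) and the nominal m/z values quoted above. The charge state is not stated in the text, so the value derived here is an inference from the peak spacing, and the function names are illustrative.

```python
# Monoisotopic mass of one pentose residue (pentose minus water, C5H8O4), in Da.
PENTOSE_RESIDUE = 132.0423


def mz_shift(residue_mass: float, charge: int) -> float:
    """Expected m/z increase when one residue is added to an ion of given charge."""
    return residue_mass / charge


def infer_charge(mz_low: float, mz_high: float,
                 residue_mass: float = PENTOSE_RESIDUE) -> int:
    """Infer the charge state from the m/z spacing produced by one added residue."""
    return round(residue_mass / (mz_high - mz_low))


# Nominal m/z values of the EGF4 glycopeptide as reported in the text.
peaks = {"Glc-O": 1166, "Xyl-Glc-O": 1210, "Xyl-Xyl-Glc-O": 1254}

z_first = infer_charge(peaks["Glc-O"], peaks["Xyl-Glc-O"])
z_second = infer_charge(peaks["Xyl-Glc-O"], peaks["Xyl-Xyl-Glc-O"])
expected = mz_shift(PENTOSE_RESIDUE, z_first)

print(f"inferred charge states: {z_first}, {z_second}")
print(f"expected shift per pentose: {expected:.2f} m/z")
```

Under this reading, both steps of 44 m/z units correspond to one pentose added to a triply charged glycopeptide ion, in line with the assignment of m/z 1254 as the Xyl-Xyl-Glc-O form.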
Because Rumi was recently shown to be capable of transferring either xylose or glucose to certain EGF repeats (e.g. EGF16 from mouse Notch2), and Xyl-O can be further elongated to a Xyl-Xyl-Xyl-O trisaccharide (24), we wanted to test whether XXYLT1 can add xylose to an O-xylosylated EGF repeat. Bacterially expressed EGF16 (unglycosylated) was incubated with Rumi in the presence of UDP-Glc or UDP-Xyl to generate the O-glucosylated and O-xylosylated forms, respectively. Reverse phase HPLC analysis showed that the glycosylated EGF repeats eluted slightly earlier than the unglycosylated form (Fig. 5, A–C). Glc-O-EGF16 was converted to Xyl-Glc-O-EGF16 by incubation with mouse Gxylt2 and UDP-Xyl (Fig. 5D; all structures were confirmed by mass spectrometry, supplemental Fig. S4). Each of these EGF repeats was tested as a potential acceptor substrate for mouse Xxylt1, but only Xyl-Glc-O-EGF16 was converted by the enzyme (Fig. 5, E and F), showing that Xxylt1 is not able to elongate Xyl-O-EGF16.
In vivo enzymatic activity of XXYLT1 was tested by a co-expression approach. The MS data shown in Fig. 4A already indicated that insect cells endogenously express a glucoside xylosyltransferase, which partially xylosylates O-glucose of mouse Notch EGF repeats. However, so far, no trisaccharide modification has been detected in Sf9 cells. Upon co-transfection of Notch EGF1–5 and XXYLT1 in Sf9 cells, we could detect the trisaccharide modification at m/z 1254 (Fig. 6B). In comparison, samples obtained from co-expression of Notch EGF1–5 and the inactive enzyme GLT8D1 only showed Glc-O and Xyl-Glc-O modifications (Fig. 6A).
To determine the intracellular localization of the newly identified glycosyltransferase, we transiently expressed N-terminal FLAG-tagged and/or C-terminal HA-tagged XXYLT1 in CHO cells followed by immunofluorescence staining. Expression of the three constructs resulted in identical subcellular expression patterns, and signals acquired from the double-tagged FLAG-XXYLT1-HA construct clearly matched (Fig. 7, A and B). Co-localization of FLAG-XXYLT1-HA and an ER-retained GFP construct indicated that XXYLT1 is located in the ER (Fig. 7C). In contrast, no merge between the Golgi marker mannosidase II and XXYLT1 was found (Fig. 7D). The same construct of β4-galactosyltransferase (B4GALT1), which was used as a control, was always located, as expected, in the Golgi apparatus (Fig. 7E) (33).
Both Poglut/Rumi and Pofut1 are soluble enzymes in the ER (19, 34), whereas most other glycosyltransferases are type II membrane proteins in the Golgi (35). The question, therefore, arose whether XXYLT1 is a typical type II glycosyltransferase with a signal anchor or is cleaved after a signal sequence has directed the protein into the ER. Moreover, a prediction program (SignalP 3.0) (36) predicted the protein to have a cleaved signal sequence with a total probability of 0.78 (Fig. 7F). To answer this question, the double-tagged FLAG-XXYLT1-HA and C-terminally tagged XXYLT1-HA proteins were analyzed by Western blotting. As control, B4GALT1, which is an established non-cleaved type II transmembrane protein (33), was investigated in parallel. Two identical blots were generated and developed with either an anti-FLAG or an anti-HA antibody (Fig. 7G). In all cases, XXYLT1 behaved like B4GALT1. The double-tagged protein was visible as a single band, and no smaller products reacting only with the anti-HA antibody were observed. As the N-terminally fused FLAG tag might shield a possible cleavage signal at the N terminus, XXYLT1-HA was also investigated. The absence of the FLAG tag reduced the molecular mass by ~1.4 kDa. For both B4GALT1 and XXYLT1, a slightly shortened protein, corresponding to this difference, was now visible in the blot developed with the HA antibody. However, again, no smaller, potentially cleaved, products were observed. Protein bands running at higher molecular weight most likely represented dimers of XXYLT1. These data indicated that XXYLT1 most likely is a typical type II transmembrane protein, but localized in the ER.
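The ~1.4-kDa difference attributed to the FLAG tag can be reproduced from the tag sequence given for the FLAG-XXYLT1 construct (MDYKDDDDKEF). The sketch below sums standard average residue masses for this sequence. It assumes that all 11 residues are added in peptide linkage ahead of the XXYLT1 sequence, which is a simplification: the text does not state whether the initiator methionine of XXYLT1 itself is retained in the tagged construct. The function name is illustrative.

```python
# Average residue masses (amino acid minus water), in Da, for the residues
# that occur in the FLAG-containing N-terminal extension.
AVERAGE_RESIDUE_MASS = {
    "M": 131.1926,
    "D": 115.0886,
    "Y": 163.1760,
    "K": 128.1741,
    "E": 129.1155,
    "F": 147.1766,
}

# N-terminal extension of the FLAG-XXYLT1 construct as given in the text.
FLAG_EXTENSION = "MDYKDDDDKEF"


def added_mass_da(sequence: str) -> float:
    """Mass added by inserting these residues into a polypeptide chain.

    No water is added because the residues are counted in peptide linkage.
    """
    return sum(AVERAGE_RESIDUE_MASS[residue] for residue in sequence)


tag_mass_da = added_mass_da(FLAG_EXTENSION)
tag_mass_kda = tag_mass_da / 1000.0

print(f"FLAG extension: {len(FLAG_EXTENSION)} residues, about {tag_mass_kda:.2f} kDa")
```

The estimate of roughly 1.4 kDa agrees with the size shift described for the constructs lacking the FLAG tag.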
This study describes the identification of an α1,3-xylosyltransferase acting on the Xylα1,3Glc-O-linked glycan of Notch EGF domains, which was accordingly named XXYLT1. Like the α1,3-xylosyltransferases GXYLT1 and GXYLT2 involved in the preceding glycosylation step, XXYLT1 belongs to the glycosyltransferase family 8. However, although it catalyzes the transfer of a xylose residue in α1,3 linkage like the GXYLTs, XXYLT1 uses a different acceptor substrate and shows less than 20% identity to these enzymes. XXYLT1 has all the properties to encode the xylosyltransferase activity previously detected in HepG2 cells (26) and to be responsible for xylosylation of Notch and other EGF repeat-containing proteins. Formal proof that this is the only enzyme in the human genome with this activity, however, still has to be provided.
The glycosyltransferase family 8 comprises a broad range of glycosyltransferases, from both bacteria and eukaryotes, involved in the transfer of a variety of sugars. Mammalian members of this family have only been shown to transfer either glucose or xylose. Glucose is the donor sugar for glycogenin (37) (the autocatalytic precursor for glycogen) and for UDP-Glc:glycoprotein glucosyltransferase (38). GXYLT1, GXYLT2, and the newly identified XXYLT1 have now been shown to encode xylosyltransferases acting on O-glucosylated EGF repeats. However, the enzymatic activity of the remaining enzymes LARGE1, LARGE2, GLT8D1, and GLT8D2 is still unknown. LARGE proteins comprise two predicted catalytic domains of glycosyltransferases, one of the GT8 family and one of family 49 (39), and are known to act on dystroglycan (40).
Both GXYLT and XXYLT seem to have emerged at the same time with the appearance of metazoans. As for GXYLT1 and GXYLT2, there is no ortholog of XXYLT in the unicellular Monosiga brevicollis, which actually has a LARGE-like glycosyltransferase. However, clear orthologs of both GXYLT and XXYLT are found in most metazoans, including Drosophila melanogaster. The latter is rather surprising because we have not observed the product of XXYLT on Notch produced in Sf9 insect cells (28). Similarly, in O-linked glycans isolated from Drosophila, only the Xyl-Glc disaccharide was detected (41). The expression and activity of Drosophila XXYLT still have to be confirmed.
In contrast to GXYLT1 and GXYLT2, XXYLT1 showed no activity with xylose linked to pNP or with Xyl-O-EGF16, a structure recently demonstrated to be formed by Poglut/Rumi (24). The GXYLTs were able to use Glcβ-pNP as substrate, but XXYLT1 was only active with an acceptor containing Xylα1,3Glcβ1. This is actually a logical requirement to prevent the enzyme from using its transferred xylose in α1,3 linkage again as acceptor substrate. Indeed, no further extension beyond the trisaccharide is observed either naturally or in our in vitro assays. Whether the xylosyltransferases recognize the underlying EGF repeat is not known. Such recognition would be required to xylosylate EGF repeats differentially. At least in Drosophila, xylosylation appears to be EGF repeat-dependent (42), indicating that the Drosophila xylosyltransferase shows site specificity. On the other hand, mouse Notch1 overexpressed in tissue culture cells is mainly uniformly modified with the trisaccharide Xyl-Xyl-Glc (43).
Human XXYLT1 is predicted to have a cleavable signal sequence rather than an N-terminal signal anchor (Fig. 7). The same prediction is in fact made for many glycosyltransferases that are in reality non-cleaved type II transmembrane proteins of the Golgi (35). In contrast, Pofut1 and Poglut/Rumi, the enzymes responsible for initial O-fucosylation and O-glucosylation, are soluble proteins in the ER and have a cleavable signal sequence (19, 34). Our experiments clearly showed that XXYLT1 is not cleaved and behaves like the archetypal type II membrane-bound glycosyltransferase B4GALT1 (33). On the other hand, it does not show the typical Golgi localization of B4GALT1 but is localized primarily to the ER. To avoid influencing the subcellular localization by changing the N- or C-terminal sequence, we generated both N-terminally and C-terminally tagged constructs. These always showed the same localization, as did a double-tagged construct. We, thus, conclude from these experiments that XXYLT1 is a type II transmembrane protein, typical for glycosyltransferases, but is located in the ER. XXYLT1 behaves very similarly to COSMC in these respects. COSMC is an ER-localized chaperone specific for the Golgi glycosyltransferase T-synthase and is itself a type II transmembrane protein with homology to the same T-synthase. It resides in the ER due to formation of a homodimer by a disulfide bond within the transmembrane domain that is essential for ER retention (44). Indeed, we observed dimer formation for XXYLT1 (Fig. 7G), and two cysteine residues, potentially responsible for intermolecular disulfide bond formation, are present within the transmembrane domain. There are, therefore, strong indications that XXYLT1 is retained in the ER by the same mechanism as COSMC. Moreover, the xylosyltransferase has an AXXXAXXXA motif within the predicted transmembrane domain. AXXXA or GXXXG motifs in other proteins have been shown to promote membrane helix interactions (45, 46).
With the previous identification of the glucosyltransferase Rumi, the two GXYLTs, and now the cloning of XXYLT1, all genes encoding the enzymes required for the formation of the Xyl-Xyl-Glc- glycotope are currently known. These important steps will now allow investigating the function of these modifications.
UDP-Xyl isolation by the CarboSource Services at the Complex Carbohydrate Research Center, University of Georgia, Athens GA, was supported in part by National Science Foundation Research Coordination Networks Grant 0090281.
*This work was supported by funding from the Deutsche Forschungsgemeinschaft (DFG, German Research Foundation) for the Cluster of Excellence REBIRTH (From Regenerative Biology to Reconstructive Therapy) and by National Institutes of Health Grant GM61126 (to R. S. H.).
This article contains supplemental Figs. S1–S4 and Table S1.
2The abbreviations used are: