Search tips
Search criteria 


Logo of plospathPLoS PathogensSubmit to PLoSGet E-mail AlertsContact UsPublic Library of Science (PLoS)View this Article
PLoS Pathog. 2011 July; 7(7): e1002152.
Published online 2011 July 21. doi:  10.1371/journal.ppat.1002152
PMCID: PMC3141052

Crystallography of a Lewis-Binding Norovirus, Elucidation of Strain-Specificity to the Polymorphic Human Histo-Blood Group Antigens

Barbara Sherry, Editor


Noroviruses, an important cause of acute gastroenteritis in humans, recognize the histo-blood group antigens (HBGAs) as host susceptible factors in a strain-specific manner. The crystal structures of the HBGA-binding interfaces of two A/B/H-binding noroviruses, the prototype Norwalk virus (GI.1) and a predominant GII.4 strain (VA387), have been elucidated. In this study we determined the crystal structures of the P domain protein of the first Lewis-binding norovirus (VA207, GII.9) that has a distinct binding property from those of Norwalk virus and VA387. Co-crystallization of the VA207 P dimer with Ley or sialyl Lex tetrasaccharides showed that VA207 interacts with these antigens through a common site found on the VA387 P protein which is highly conserved among most GII noroviruses. However, the HBGA-binding site of VA207 targeted at the Lewis antigens through the α-1, 3 fucose (the Lewis epitope) as major and the β-N-acetyl glucosamine of the precursor as minor interacting sites. This completely differs from the binding mode of VA387 and Norwalk virus that target at the secretor epitopes. Binding pocket of VA207 is formed by seven amino acids, of which five residues build up the core structure that is essential for the basic binding function, while the other two are involved in strain-specificity. Our results elucidate for the first time the genetic and structural basis of strain-specificity by a direct comparison of two genetically related noroviruses in their interaction with different HBGAs. The results provide insight into the complex interaction between the diverse noroviruses and the polymorphic HBGAs and highlight the role of human HBGA as a critical factor in norovirus evolution.

Author Summary

The interactions of noroviruses with histo-blood group antigens (HBGAs) are diverse, in which strains in both genogroups I and II (GI and GII) recognizing either the secretor or the non-secretor (Lewis) HBGAs have been reported. The crystal structures of the HBGA binding interfaces of two secretor binders (Norwalk virus, GI.1 and VA387, GII.4) have been elucidated. In this study we determined the crystal structure of the HBGA-binding interface of the first Lewis-binder (VA207, GII.9) and compared it with those of the two secretor binders. VA207 binds to the Lewis antigens via the Lewis epitope (α-1, 3 fucose) as the major interacting residue, which is distinct from the two secretor binders that interact with the secretor antigens through the A or H epitope as a major interacting residue. In addition, precursor saccharide was involved in binding and has a role in strain-specificity of VA207. VA207 shares a conserved HBGA binding interfaces with VA387, suggesting a strong selection of human HBGAs in norovirus evolution. The distinct binding modes between these two GII strains suggest a potential host-driving force on the diversity of noroviruses by the polymorphic HBGAs. The crystal structures resolved in this study also would facilitate the antiviral drug design against noroviruses.


Noroviruses, a genus in the family Caliciviridae, are a group of small-round structured, nonenveloped RNA viruses causing epidemic acute gastroenteritis in all age in both developing and developed countries [1], [2], [3]. Structurally, norovirus possesses an outer protein capsid that encapsulates a single-stranded, positive sense RNA genome of ~7.5 kb. Norovirus capsid displays an icosahedral symmetry that is composed of 180 copies of a single major structural protein, the capsid protein (VP1), which organize into 90 dimers [4]. Each VP1 can be divided into two major domains, the N-terminal shell (S) and the C-terminal protruding (P) domains, linked by a short hinge. The S domain forms the interior shell; the P domain constitutes the arch-like P dimer of the capsid, while the hinge provides flexibility between the two major domains [4]. The P domain is further divided into P1 and P2 subdomains, forming the leg and the head of the protruding P dimer, respectively [4].

In vitro expression of full-length VP1 assembles empty virus-like particles (VLPs) that are structurally and antigenically indistinguishable from the authentic virus. The S and the P domains were found to be structurally and functionally independent. Expression of the S domain alone formed smooth, thin-layer S particles without the protruding structures [5], [6], while production of P domain in vitro resulted in P dimer, P particle, and small P particle [6], [7], [8], [9], [10], [11], [12], [13]. Functional analyses showed that the P dimer and the P particle, but not the S particle, bind to histo-blood group antigens (HBGAs), the host susceptible factors of noroviruses (see below), indicating that the P domain is the HBGA binding domain [3], [6], [8], [13], [14]. Since human noroviruses are not cultivatable in the laboratory, these VLPs and subviral particles are valuable tools for studying the virus-host interaction and as vaccine candidates for noroviruses. The P particles are particularly useful due to their simple procedures and high yields of production in a bacterial system.

Noroviruses recognize the HBGAs as important host susceptible factors [3], [14], [15]. HBGAs are the determinants of blood types that are complex carbohydrates presenting at the outermost ends of N- or O-linked glycans or glycolipids [16], [17] on the mucosal epithelia of intestinal and genitourinary tracts. They are also present as free oligosaccharides in biologic fluids such as blood, saliva and milk. Noroviruses interact with HBGAs in a strain-specific manner, in which all common HBGAs such as A, B, H, and Lewis antigens are involved in norovirus recognition. The carbohydrate binding interfaces of noroviruses have been elucidated by crystallography [18], [19], [20] followed by mutagenesis studies [10], [11], [21]. They are located at the distal end of the arch-like P dimer, corresponding to the outermost surface of the viral capsid.

The HBGA-binding sites of two noroviruses, Norwalk virus (GI.1) and VA387 (GII.4), each representing a major genogroup (GI and GII) of human noroviruses, have been elucidated [18], [19], [20]. The HBGA-binding sites of the two strains differ in their precise locations, structures, sequence compositions and binding modes to carbohydrates, although they share the same targets of A and H antigens [22], [23]. Further study showed that the HBGA-binding sites are conserved within but not between the two major genogroups of human noroviruses [11], [19], [20], although the two genogroups share the same ligand repertoire of human HBGAs. These data indicate that the HBGA binding is a prerequisite of norovirus infection and therefore important in the evolution of human noroviruses [11].

It has been noted for Norwalk virus (GI.1) that the same HBGA-binding interface can interact with different carbohydrate antigens by distinct bonds. For example, in the interaction with the type A trisaccharides, the α-GalNAc is the major contact, while in the case of the H pentasaccharide, both the β-Gal and the α-1, 2 Fuc are the major contacts [18], [20]. On the other hand, VA387 (GII.4) recognizes both the A and B trisaccarides through the same α-1, 2 Fuc as major contact [19]. These data indicate a great flexibility in the interaction between noroviruses and HBGAs, in which the same binding interface can bind to different HBGAs (Norwalk to A/H and VA387 to A/B antigen), while distinct binding interfaces are able to bind the same HBGAs (Norwalk and VA387 to A antigen). This structural information helps the explanation of the complex virus-host interaction and evolution of noroviruses.

Both Norwalk virus and VA387 [3], [14] recognize the secretor antigens (A/B/H), but not the non-secretor antigens. In this study, we performed a crystallographic study on a first non-secretor binding strain (VA207, GII.9) [23]. Our data indicated that VA207 uses the common genogroup II HBGA binding interface described for VA387 but interacts with a different set of HBGAs (Ley and sialyl Lex) through a completely different binding mode. Instead of targeting at the secretor epitopes of the HBGAs in secretor binders (VA387 and Norwalk virus), VA207 targets at the non-secretor (Lewis) epitope (the α-1, 3 fucose) as the major contact. Mutagenesis study identified amino acids that are responsible for the type-specificity of VA207. These data for the first time elucidate the genetic and structural basis of strain-specificity by a direct comparison of two genetically related noroviruses in their interaction with different HBGAs, highlighting the role of human HBGA as a critical factor in norovirus evolution. The structural data would help antiviral development for disease control and prevention of noroviruses.

Materials and Methods

Expression and purification of recombinant P protein for crystallization

The cDNA fragment encoding P domain of norovirus VA207 (GII.9) (amino acid 222–537, GenBank accession#: AY038599) was cloned into the expression vector pGEX-6P-1 (GE Healthcare Life Sciences) and was expressed as a GST fusion protein in E. coli BL21 DE3 at 16°C overnight induced with 0.5 mM Isopropyl β-D-1-Thiogalactopyranoside (IPTG). The recombinant protein was purified using glutathione-sepharose 4B (GE Healthcare Life Sciences) according to the manufacturer's protocol. The P protein was cleaved from the GST tag with Prescission Protease (GE Healthcare Life Sciences) at 4°C overnight. The eluted P protein was confirmed by Mass Spectrometry Fingerprint to bind to E. coli chaperone GroEL, which could be further separated from P protein by Mono Q anion ion exchange (GE Healthcare Life Sciences) at pH 7.3. P protein was eluted at ~300 mM NaCl while GroEL was eluted at ~500 mM NaCl. P protein was then dialysed against 20 mM HEPES (pH 7.3), 150 mM NaCl before crystallization.

Production of P particle for mutagenesis study

The expression construct of wild type P particles of VA207 were generated previously [11] by cloning the P protein-encoding sequences into the plasmid pGEX-4T-1 (GE Healthcare Life Sciences). Mutant P particles with single amino acid mutation at the HBGA binding interface were constructed by site-directed mutagenesis using the wild type constructs as templates. Site-directed mutagenesis was conducted using the QuickChange Site-Directed Mutagenesis Kit (Stratagene, La Jolla, CA) using following primer pairs: caggtgacgccacggcggcccatgaggcaag/cttgcctcatgggccgccgtggcgtcacctg (R346A), ctcaacctcaagcgcttttgaaacaaacc/ggtttgtttcaaaagcgcttgaggttgag (D374A), ccaataggtatcgccattgagggcaattct/agaattgccctcaatggcgatacctattgg(Y389A), ccaggagctagtgcccacacaaatggg/cccatttgtgtgggcactagctcctgg (G440A), ccaggagctagtggcgccacaaatggggagatg/catctccccatttgtggcgccactagctcctgg (H441A), ttcatcccaggagctgctggccacacaaatgg/ccatttgtgtggccagcagctcctgggatgaa (S439A), as described previously [6], [10], [11], [21]. The wild type and mutant P particles were expressed in E. coli BL21 DE3 and then purified using the Glutathione Sepharose 4 flow (GE Healthcare Life Sciences, Piscataway, NJ) according to the manufacturer's protocol as described elsewhere [6], [8], [9], [10], [11]. The P proteins were released from GST tag by thrombin (GE Healthcare Life Sciences) digestion. The formation of P particle was determined by gel filtration using a size-exclusion column Superdex 200 (GE Healthcare Life Sciences) powered by an AKTA-FPLC system (model 920, GE Healthcare Life Sciences) followed by SDS-PAGE electrophoresis, in which the P particles formed a peak at ~830 kDa. None of the designed single residue mutations in this study affected P particle formation and none of the mutants revealed any detectable reduction in reactivity to the hyperimmune serum against VLP compared with that of the wild type P particle.

Crystallization of P protein and its complexes with Ley and sialyl Lex tetrasaccharides

P protein was concentrated to 12 mg/ml. Native crystals were grown using the hanging drop vapor diffusion method by mixing 1.5 µl protein solution with an equal volume of reservoir solution containing 12% (w/v) polyethylene glycol (PEG) 3350 and 50 mM magnesium formate. For growth of complex crystals, the two tetrasaccharides, Lewis y {α-Fuc-(1→2)-β-Gal-(1→4)-[α-Fuc-(1→3)]-GlcNAc, Ley} and Sialyl Lewis x {α-NeuNAc-(2→3)-β-Gal-(1→4)[α-Fuc-(1→3)]-GlcNAc, SLex} (Sigma) were first separately dissolved in ddH2O to a concentration of 0.3 mM and mixed with P protein to final molar ratio of 60~100[ratio]1. The mixtures were incubated at 4°C for 2 h before mixed with reservoir solution containing 10–15% (w/v) PEG 3350 and 50 mM magnesium formate. The hanging drops were equilibrated over 500 µl reservoir solution, and crystals could be harvested in two weeks. The biosynthesis pathways of Ley and SLex are shown in Figure 1 with indications of sequences, structures, linkages, and nomenclatures.

Figure 1
Biosynthesis pathways of Lewis y and sialyl Lewis x starting from the type II precursor (Galβ1–4GlcNAcβ1-R).

Data collection and processing

X-ray diffraction data were collected from flash-cooled crystals using an in-house Rigaku FR-E rotating anode X-ray generator with an R-AXIS IV++ detector at a wavelength of 1.5418 Å for native P protein crystals and P protein complexed with Ley tetrasaccharide crystals. Diffraction data for P protein complexed with SLex tetrasaccharide crystals were collected at BL17A of the KEK Photon Factory (Japan) at a wavelength of 1.0000 Å. The cryoprotectant solution contained 12% (w/v) PEG 3350, 50 mM magnesium formate and 15% (v/v) glycerol. The data were processed, scaled and merged using the HKL2000 program package [24]. Statistics for data collection and processing are summarized in Table 1.

Table 1
Data collection statistics.

Structure determination and refinement

We used the crystal structure of VA387 P domain as the search model and the program Phaser [25] to solve the phases of the crystal structure of VA207 P protein. The amino acid sequence was then replaced with that of VA207 P domain, and manual adjustments were carried out with the program COOT [26] by the guidance of (2Fo-Fc) and (Fo-Fc) electron density maps, where Fo is the observed structure factor and Fc is the calculated structure factor. Further adjustment and refinement were carried out with the programs CNS [27], Refmac [28] and Phenix [29]. At the final stage of refinement, water molecules were added in (Fo-Fc) map at peaks (>3σ) where they could form good hydrogen bonds with nearby residues. Statistics for structure refinement are summarized in Table 2. The refined structures were validated with the program PROCHECK [30]. No residue was found in a disallowed region of the Ramachandran plot. The structure analysis was performed by programs EdPDB [31] and PyMOL (DeLano Scientific LLC ).

Table 2
Structure refinement statistics.

HBGA-binding assay

The synthetic oligosaccharide-based binding assays were performed basically as described elsewhere [22], [23]. The affinity-column purified P particles were diluted to 10 µg/ml as working solution. Synthetic oligosaccharides representing H1, H2, H3, A, B, Lea, Leb, Lex, and Ley were purchased from GlycoTech (Gaithersburg, MD), while the other two representing SLea and SLex were from The Consortium for Functional Glycomics (CFG, USA). The oligosaccharides were coated on 96-well microtiter plates (Dynex Immulon; Dynatech, Franklin, MA). After blocking by nonfat milk, P particles at 10 ng/µl were added. The bound P particle were detected using a rabbit anti-VA207 VLP antiserum (1[ratio]3300), followed by the addition of HRP-conjugated goat anti-rabbit IgG (ICN, Aurora, OH).

Protein data bank accession

The coordinates and structure factors for the VA207 P protein (3PUM) as well as its complexes with Ley tetrasaccharide (3PUN) and SLex tetrasaccharide (3PVD) have been deposited in the Protein Data Bank, Research Collaboratory for Structural Bioinformatics, Rutgers University, New Brunswick, NJ.


Crystallization of VA207 P protein

Norovirus VA207 (GII.9) has been demonstrated to recognize Ley, Lex and SLex antigens [23] and thus was chosen as a model to study the structure of the binding interface to Lewis antigens. The P domain of VA207 (amino acid 222 to 537) was expressed in E. coli and the HBGA binding profile of the resulting P protein was verified (data not shown) before being used in the crystallography study. The protein sample was concentrated to 12 mg/ml and the crystallization was performed in the presence or absence of the Ley or SLex tetrasaccharide. Structures of the native P protein and the P protein complexed with the oligosaccharide were solved with the molecular replacement method using diffraction data up to 2.2 and 2.0 angstroms (Å) resolution, respectively.

The overall structure of VA207 P protein protomer

The crystal of the native P protein of VA207 belongs to space group of P21212 and contains a homodimer of P protein in an asymmetric unit. Most residues from 226 to 527 could be modeled in the electron density map, while residues 295 to 297 and 392 to 394 could not be modeled due to lack of interpretable electron density map in these loop regions. However, in both complex structures with the Ley and SLex tetrasaccharides, the above missing residues could be modeled, suggesting that binding to either of the two tetrasaccharides stabilizes the two flexible loops.

The P protein protomer of VA207 shares similar structure features with that of VA387 (GII.4) [18], [19] and Norwalk virus (GI.1) [4], [18], [20] (Figures 2 and and3).3). It can also be divided into two subdomains, P1 and P2. P2 subdomain, spanning from residue 275 to 415, is an insertion into P1 subdomain, splitting the latter into two fragments (222 to 274 and 416 to 537, Figure 2). The P1 subdomain has a mixed α/β structure. Two twisted antiparallel β sheets (β1-β8-β10 and β14-β1-β8-β13-β12-β11-β15, where β1 and β8 strands are shared within the two β-sheets) and an α helix form a hydrophobic core (Figure 3). While P1 forms the legs, P2 constitutes the head of the P dimer with a β-barrel made of 6 antiparallel β strands folded as a Greek key topology. The structure of the P protein is very similar between VA207 and VA387, with a perfect match in their P1 subdomains. However, significant differences were observed in the four loops on the P2 subdomain connecting β2-β3, β4-β5, β6-β7 and β7-β8, as a result of sequence variations in the four loops between the two strains (Figure 2).

Figure 2
Structure-based sequence alignment between VA207 and VA387 P domains.
Figure 3
Overall structure of VA207 P protein monomer and comparison of VA207 and VA387 P dimers complexed with oligosaccharides.

Structure of VA207 P protein dimer

The P protein forms a homodimer along a non-crystallographic two-fold axis, and the dimerization is essential for forming a functional carbohydrate binding site on the outermost surface of the virus capsid (Figure 3) (see below). The P dimer has a dimension of 56 by 66 by 70 Å and contains a large buried surface area of 3, 300 Å2 (for two protomers) formed by both polar and non-polar residues. Both subdomains contributed to the dimer interface: the α helices in the P1 subdomain interacts with each other by hydrogen bonds and hydrophobic interaction, while strands β5 and β9 from the β barrel in P2 subdomain was bound mainly by hydrogen bonds. Similar to the protomers, the general structure of VA207 P dimer is very similar to that of VA387, particularly in the P1 subdomain region. However, clear difference can be seen on the top surface of the P dimer (Figures 3).

The carbohydrate-binding interface of VA207

Co-crystallization of the P protein with Ley and SLex tetrasaccharides has resulted in crystals in a new form of space group P212121 instead of the P21212 native form. In this new crystal form, an asymmetric unit also contains one homodimer along a noncrystallographic two-fold axis. The two complex structures were solved with the molecular replacement method using the native P protein dimer structure as the initial model. In both complex crystals the four saccharide rings could be clearly discerned from the (fo-fc) electron density map at a resolution of 2Å, and their structures were modeled accordingly (Figure 4). Torsion angles of glycosidic linkages for the two tetrasaccharides are listed in Table 3.

Figure 4
(Fo-Fc) omit electron density map of the Lewis y and 3′-sialyl Lewis x tetrasaccharide.
Table 3
Torsion angles of glycosidic linkages in Ley and SLex.

The carbohydrate binding site of VA207 is located at the distal surface of the P dimer (Figure 3), corresponding to the outermost surface of a norovirus capsid. Like VA387 [19] the ligand binding site is located in the interface between two P proteins, indicating both protomers contribute to the ligand binding. The binding pocket for the Ley and SLex tetrasaccharide is constituted by 7 amino acid residues, of which T345, R346, and D374 are from one protomer forming the “bottom” (T345 and R346) and a “wall” (D374), while G440, H441, S439 and Y389 are from the other protomer constituting the other “walls” of the pocket (Figures 5, ,66 and and77).

Figure 5
Extensive interaction network between VA207 P dimer and Lewis Y tetrasaccharide.
Figure 6
Extensive interaction network between VA207 P dimer and 3′ sialyl-Lewis X tetrasaccharide.
Figure 7
The crystal structures of the HBGA-binding interfaces of VA387 (GII.4) and VA207 (GII.9).

Extensive hydrogen bond networks were observed between these amino acid residues and the tetrasaccharides of the Lewis antigens. In particular, T345, R346, D374 and G440 interact with the α-1, 3 fucose (Lewis epitope), Y389 and S439 with the N-acetylglucosamine saccharide, one of the two precursor saccharides, and H441 with the α-1, 2 fucose (H epitope on Ley tetrasaccharide) via a water molecule as a bridge (Figures 5 and and6).6). It is noted that 5 of the 7 amino acid residues that participate directly in interaction of VA207 with the Ley antigen are conserved in VA387 (GII.4) [11], [19], supporting the notion that these two GII strains share a conserved genogroup II HBGA-binding interface as predicted previously [11]. However, unlike VA387 that recognizes A and B antigens through the α-1, 2 fucose (H epitope) as its major contact [19], VA207 interacts with the two Lewis antigens through the α-1, 3 fucose (Lewis epitope) as its main recognition moiety. Thus, a conserved carbohydrate binding interface can recognize different HBGAs with different modes among genetically related but distinct GII viruses.

The sialic acid did not participate in binding with VA207

The sialyl modified Lex was recently suggested to participate in binding to some GII noroviruses [32], [33], and our binding assays showed that VA207 binds strongly to the SLex (Figure 8A). Therefore, we particularly studied whether the sialic acid residue participates in binding of VA207 by co-crystallization of the P protein with SLex. Our data showed the sialic acid was far away from the P dimer surface (Figure 6) and there seemed no hydrogen bond or other direct interaction with the VA207 capsid. The number of hydrogen bonds between the SLex and P dimer was reduced to 9, which is one less than that in P dimer-Ley complex, due to the absence of the α-1, 2 fucose in the SLex tetrasaccharide (Figures 5 and and66).

Figure 8
HBGA-binding outcomes of various mutant P particles with single amino acid mutations in the HBGA-binding interface of VA207.

Both α-1, 3 and α-1, 2 fucoses are critical for interacting of VA207 with HBGAs

To further characterize the role of individual amino acids in the HBGA-binding interface, mutations were introduced to residues that form hydrogen bonds with carbohydrate ligands followed by binding assays of the mutant P particles [7], [8] to HBGA receptors. When single amino acid mutations were introduced to amino acids interacting with the α-1, 3 fucose- (R346A, D374A and G440A) and the α-1, 2 fucose (H441A) (Figures 5 and and6),6), all mutant P particles lost their binding completely (R346A, D374A and G440A) or nearly completely (H441A) to H3, Ley, Lex and SLex compared with the wild type P particles (Figure 8 A to E). Saliva-based binding assay confirmed the results (data not shown) [11]. These data indicated that both α-1, 3 and α-1, 2 fucoses are important in VA207 recognition of HBGAs, in which the interactions of the α-1, 3 fucose may be more important.

The N-acetylglucosamine binding site affects the binding specificity of VA207

When single mutations were introduced to the two amino acids that form the N-acetylglucosamine (GlcNAc) binding site (S439A and Y389A) observed for VA207, the resulting VA207 P particle mutants demonstrated a change in binding specificity to HBGAs (Figure 8, comparing A with F and G). While both mutants lost binding to the Ley and SLex, the Y389A mutant gained a strong binding to Leb (Figure 8F), and the S439A mutant retained binding to H type 3 antigen (Figure 8G). Since the GlcNAc of Leb, Ley/Lex, and H type 3 are from type I, II, and III precursors, respectively, with different linkages and side chains (Figure 9), our data indicated that this GlcNAc binding site determines the specificity of VA207 in a type-specific manner (see discussion).

Figure 9
The structures of the type I and type II disaccharide precursors of HBGAs and their possible steric locations interacting with the VA207 P dimer.


In this study we determined the atomic structure of the carbohydrate binding interface of the first Lewis-binding norovirus (VA207, GII.9) using X-ray crystallography followed by a mutagenesis study. As a member of GII noroviruses, VA207 is expected to share a conserved carbohydrate binding interface [11] with that of VA387 (GII.4) [10], [19]. This has been now demonstrated in this study by showing that the HBGA binding sites of VA207 and VA387 share highly conserved location, amino acid composition and overall structures of the binding pockets. We also elucidated the structural basis of the two strains in recognizing different sets of HBGAs through different interaction modes. VA207 recognizes the Lewis antigens through the α-1, 3 fucose (the Lewis epitope) as a major contact (Figures 5 and and6),6), while VA387 interacts with the secretor antigens through the α-1, 2 fucose (the H epitope) as the major binding target [10], [19]. These data provide structural evidence on how subtle changes in the HBGA binding interface affect binding specificity and revolution of genetic linkage of noroviruses under the selection by the polymorphic human HBGAs.

Genetically, VA207 and VA387 belong to two genotypes within the same genogroup II but share only 54% amino acid identity in the P domains. However, the P dimers of the two strains share very similar structures with a nearly perfect match between their P1 subdomains. The basic structures of the carbohydrate binding pockets of the two strains are also highly conserved, in which 4 of the 5 amino acids constituting the core structure of the binding pockets are identical between the two strains. These data indicate that the recognition of HBGAs is a prerequisite of norovirus infection, which plays an important role in norovirus evolution. In contrast, no such similarity is found with Norwalk virus (GI.1), confirming the genetic linkage between the two GII noroviruses. Major sequence variations between these two GII noroviruses have also been found at the top surface (Figure 7, top panel) of the P2 subdomains. This region is associated with the host immune selection in addition to HBGA recognition.

The conservation of the HBGA binding sites raises a question on how the strain-specificities of HBGA binding are determined. VA207 binds the Lex, Ley and H type 3 antigens, while VA387 has a broader binding spectrum to A, B, H type 3, Leb, and Ley [22], [23]. Our mutagenesis analysis on both VA387 [10] and VA207 (this report) indicated that the integrity of the core structure of the HBGA binding pocket is strictly required for the binding function. Introducing single mutations to any residue involved in the core structures have resulted in completely loss of binding to all HBGA types ([10], Figure 8 B to E). These core structures include those interacting to the α-1, 3 fucose of VA207 (this report), the α-1, 2 fucose of VA387 and Norwalk virus, and the GalNAc of Norwalk virus [10], [11], [18], [19], [20].

However, some residues outside the core structures also play a role in determining the binding specificity of noroviruses and mutations at these residues can change carbohydrate targets depending on the types of HBGAs being involved. For example, Q331 and K348 in VA387 interact with GalNAc of the A antigen and α-Gal of the B antigen [10], [11], while S439 and Y389 in VA207 interact with GlcNAc (Figures 4 and and5),5), a β-1, 4 linked saccharide in the type II precursor of Lex and Ley antigens. These residues are located in the similar positions outside the core structures of the binding pockets of the two GII strains (Figure 7), providing further evidence of selection of noroviruses by the host HBGAs. The demonstration of carbohydrate target switch helps the understanding of the complex interactions between the diverse noroviruses and polymorphic HBGAs of human hosts.

Additional strain-specificity of VA207 binding to HBGAs was observed through the mutagenesis study of S439 and Y389. Several hydrogen bonds between the GlcNAc of Ley and SLex and the side chains of S439 and Y389 were evident in crystallography studies (Figures 5 and and6).6). These bonds, together with those with the α-1, 3 and the α-1, 2 fucoses, form an interacting network responsible for the binding of VA207 to the Ley and the SLex antigens. A single mutation of Y389A lost the binding to the type II Lewis antigens (Lex, SLex, and Ley) but gained strong binding to the type I Lewis antigen (Leb), while no change in binding to the H type III antigens (Figure 8, comparing A with F). Similarly, the S439A mutant abolished the binding to Lex and Ley without altering the binding to H type III (Figure 8, comparing A with G). These results can be well explained by the structural difference between the type I (Lea and Leb) and type II (Lex and Ley) Lewis antigens. Both Lex and Ley are derivatives of the type II precursor (Galβ1-4GlcNAcβ-R, Figure 9B), of which the β-1, 4 GlcNAc interacts with S439/Y389 (Figures 5, ,6,6, and and9B),9B), whereas the Leb is a derivative of the type I precursor (Galβ1-3GlcNAcβ-R, Figure 9A) with a β-1, 3 GlcNAc. Thus, in Leb, the glycosidic linkages of the GlcNAc with α-Fuc and β-Gal should be switched compared with Ley, resulting in the GlcNAc residue flipping by about 180°around the axis which is across and perpendicular to the C1-O5 and C3-C4 bonds of the GlcNAc. Consequently, the –CH3OH group of the GlcNAc in Leb would collide with the side chain of S439 (1.3 Å, Figure 9A, right), making the Leb sterically difficult to bind to the wild type VA207. However, the explanation as to how the two mutations (Y389A and S439A) changed the binding specificity of VA207 P protein remains unidentified based on the current complex structures. Since a simple replacement of the Y389 and S439 with an alanine in the crystal structure cannot explain well the observed change of binding specificity, unknown conformational change in the vicinity of Y389 and S439 may occur due to the mutation, which remains to be defined. These data indicate that the precursor may be a determinant of the type-specific HBGA interaction of noroviruses.

The facts that VA207 binds strongly to Ley but weakly to Lex and that the Y389A mutant binds strongly to Leb but not to Lea suggest that the interaction between H441 and α-1, 2 fucose may be important for binding activity. H441 forms a water-mediated hydrogen bond with α-1, 2 fucose of Ley (Figure 5), most likely also of Leb in the Y389A mutant, which was absent in either Lea or Lex; this may be the reason of difference of binding between Lea/Lex and Leb/Ley. It may also explain why the H441A mutation abolishes all binding function. In addition, we noticed the presence of Van der Waals interaction between the side chain of H441 and 6-methyl group of the α-1, 3 fucose (Lewis epitope), which further emphasizes the importance of the H441 in the binding function of VA207. A similar structure formed by Y443 is also present in VA387 which interacts through Van der Waals force with the α-1, 2 fucose of the A/B antigens, and this hydrophobic interaction has been shown to be vital for the binding to HBGA receptors [19].

The role of sialic acid in binding of SLex to VA207 was suggested by our in vitro binding assays (Figure 8A). Similar binding activities of SLex to GII.4 viruses were also observed in two previous studies [32], [33]. However, our crystallography data excluded a direct participation of the sialic acid in binding to VA207 by the lack of direct interaction between the sialic acid and the P protein in the crystal structure. We hypothesize that the addition of the sialic acid residue may stabilize the structure of the SLex antigen and thus impact positively the binding to VA207 compared with Lex without sialic acid. Since VA387 is the only GII.4 virus with known crystal structure of the HBGA binding interface, we measured whether VA387 binds to SLea and SLex antigens. We did not observe binding of VA387 to either antigens. We also noticed weaker interactions of VA207 wild type and mutants with Lex and Lea than Leb and Ley (Figure 8), in which the occurrence of the α-1, 2 fucose (H epitope) in the Leb and Ley antigens clearly play a role. Further study is needed to clarify this issue.

The binding of VA207 to the H type 3 (Fucα1-2Galβ1-3GalNAc-R1, H3) antigen in addition to the two Lewis antigens (Lex and Ley) extends our understanding on flexibility of carbohydrate binding mode. Since the H antigen does not have an α-1, 3 fucose (Lewis epitope, [3], [14]), VA207 must recognize another saccharide as the major contact. The fact that single mutations at the core binding pocket abolish binding to all HBGAs indicates that VA207 must recognize the H type 3 antigen through the same binding interface. However, mutations of S439A and Y389A did not decrease the binding to H type 3 antigen, suggesting that this GlcNAc binding site is not required for binding to the H type 3. Thus, future study would be of significance to find out how the same binding interface of VA207 binds to the H type 3 antigen with a different binding mode in addition to the known binding to Lex and Ley antigens.

In summary, our study elucidates the structural basis of norovirus-Lewis antigen interaction at atomic resolution, which well explains the type-specific HBGA recognition. The structural and functional data generated in this study are valuable to understanding the complex interaction between diverse noroviruses and the polymorphic HBGAs of human hosts. Our current results also highlight the importance of human HBGAs as a critical factor in norovirus evolution, in which a functional carbohydrate binding interface is a prerequisite for norovirus survival. The high resolution pictures of the Lewis antigen binding interface illustrated in this study expand the foundation of our strategies to control and prevent norovirus-associated diseases.


The authors have declared that no competing interests exist.

The research described in this article was supported by "863" Project (China), NSFC (National Science Foundation) and Infectious Diseases Project (2006AA02A322, 30970568 and 2009ZX10603) to X.L. This study was also supported by the National Institute of Health, the National Institute of Allergy and Infectious diseases (R01 AI37093 and R01 AI055649) to X.J. In addition, an Institutional Clinical and Translational Science Award, NIH/NCRR Grant Number 1UL1RR026314-01 and the Infectious Disease Grant (Nipert Foundation) to M. T contributed to this research. The funders had no role in study design, data collection and analysis, decision to publish, or preparation of the manuscript.


1. Green K, Chanock R, Kapikian A. Human Calicivirus. In: Knipe DM, Howley PM, Griffin DE, Lamb RA, Martin MA et al, editors. Fields Virology. 4th ed. Philadelphia:Lippincott Williams & Wilkins; 2001. pp. 841–874.
2. Tan M, Farkas T, Jiang X. Molecular Pathogenesis of Human Norovirus. In: Yang DC, editor. RNA Virus: Host Gene Responses to Infection. 1 ed. Singapore:World Scientific; 2009. pp. 575–600.
3. Tan M, Jiang X. Norovirus-host interaction: implications for disease control and prevention. Expert Rev Mol Med. 2007;9:1–22. [PubMed]
4. Prasad BV, Hardy ME, Dokland T, Bella J, Rossmann MG, et al. X-ray crystallographic structure of the Norwalk virus capsid. Science. 1999;286:287–290. [PubMed]
5. Bertolotti-Ciarlet A, White LJ, Chen R, Prasad BV, Estes MK. Structural requirements for the assembly of Norwalk virus-like particles. J Virol. 2002;76:4044–4055. [PMC free article] [PubMed]
6. Tan M, Hegde RS, Jiang X. The P domain of norovirus capsid protein forms dimer and binds to histo-blood group antigen receptors. J Virol. 2004;78:6233–6242. [PMC free article] [PubMed]
7. Tan M, Fang P, Chachiyo T, Xia M, Huang P, et al. Noroviral P particle: Structure, function and applications in virus-host interaction. Virology. 2008;382:115–123. [PubMed]
8. Tan M, Jiang X. The p domain of norovirus capsid protein forms a subviral particle that binds to histo-blood group antigen receptors. J Virol. 2005;79:14017–14030. [PMC free article] [PubMed]
9. Tan M, Meller J, Jiang X. C-terminal arginine cluster is essential for receptor binding of norovirus capsid protein. J Virol. 2006;80:7322–7331. [PMC free article] [PubMed]
10. Tan M, Xia M, Cao S, Huang P, Farkas T, et al. Elucidation of strain-specific interaction of a GII-4 norovirus with HBGA receptors by site-directed mutagenesis study. Virology. 2008;379:324–334. [PubMed]
11. Tan M, Xia M, Chen Y, Bu W, Hegde RS, et al. Conservation of carbohydrate binding interfaces: evidence of human HBGA selection in norovirus evolution. PLoS One. 2009;4:e5058. [PMC free article] [PubMed]
12. Tan M, Huang P, Xia M, Fang PA, Zhong W, et al. Norovirus P particle, a novel platform for vaccine development and antibody production. J Virol. 2011;85:753–764. [PMC free article] [PubMed]
13. Tan M, Fang PA, Xia M, Chachiyo T, Jiang W, et al. Terminal modifications of norovirus P domain resulted in a new type of subviral particles, the small P particles. Virology. 2011;410:345–352. [PMC free article] [PubMed]
14. Tan M, Jiang X. Norovirus and its histo-blood group antigen receptors: an answer to a historical puzzle. Trends Microbiol. 2005;13:285–293. [PubMed]
15. Tan M, Jiang X. Norovirus gastroenteritis, carbohydrate receptors, and animal models. PLoS Pathog. 2010;6:e1000983. [PMC free article] [PubMed]
16. Le Pendu J. Histo-blood group antigen and human milk oligosaccharides: genetic polymorphism and risk of infectious diseases. Adv Exp Med Biol. 2004;554:135–143. [PubMed]
17. Ravn V, Dabelsteen E. Tissue distribution of histo-blood group antigens. Apmis. 2000;108:1–28. [PubMed]
18. Bu W, Mamedova A, Tan M, Xia M, Jiang X, et al. Structural basis for the receptor binding specificity of Norwalk virus. J Virol. 2008;82:5340–5347. [PMC free article] [PubMed]
19. Cao S, Lou Z, Tan M, Chen Y, Liu Y, et al. Structural basis for the recognition of blood group trisaccharides by norovirus. J Virol. 2007;81:5949–5957. [PMC free article] [PubMed]
20. Choi JM, Hutson AM, Estes MK, Prasad BV. Atomic resolution structural characterization of recognition of histo-blood group antigens by Norwalk virus. Proc Natl Acad Sci U S A. 2008;105:9175–9180. [PubMed]
21. Tan M, Huang P, Meller J, Zhong W, Farkas T, et al. Mutations within the P2 Domain of Norovirus Capsid Affect Binding to Human Histo-Blood Group Antigens: Evidence for a Binding Pocket. J Virol. 2003;77:12562–12571. [PMC free article] [PubMed]
22. Huang P, Farkas T, Marionneau S, Zhong W, Ruvoen-Clouet N, et al. Noroviruses Bind to Human ABO, Lewis, and Secretor Histo-Blood Group Antigens: Identification of 4 Distinct Strain-Specific Patterns. J Infect Dis. 2003;188:19–31. [PubMed]
23. Huang P, Farkas T, Zhong W, Tan M, Thornton S, et al. Norovirus and histo-blood group antigens: demonstration of a wide spectrum of strain specificities and classification of two major binding groups among multiple binding patterns. J Virol. 2005;79:6714–6722. [PMC free article] [PubMed]
24. Otwinowski Z, Minor W. Processing of X-ray diffraction data collected in oscillation mode. Method Enzymol. 1997;276:307–326.
25. McCoy AJ, Grosse-Kunstleve RW, Storoni LC, Read RJ. Likelihood-enhanced fast translation functions. Acta Crystallogr D. 2005;61:458–464. [PubMed]
26. Emsley P, Cowtan K. Coot: model-building tools for molecular graphics. Acta Crystallogr D. 2004;60:2126–2132. [PubMed]
27. Brunger AT, Adams PD, Clore GM, DeLano WL, Gros P, et al. Crystallography & NMR system: a new software suite for macromolecular structure determination. Acta Crystallogr D. 1998;54:905–921. [PubMed]
28. Murshudov GN, Vagin AA, Dodson EJ. Refinement of macromolecular structures by the maximum-likelihood method. Acta Crystallogr D. 1997;53:240–255. [PubMed]
29. Adams PD, Grosse-Kunstleve RW, Hung LW, Ioerger TR, McCoy AJ, et al. PHENIX: building new software for automated crystallographic structure determination. Acta Crystallogr D. 2002;58:1948–1954. [PubMed]
30. Laskowski RA, MacArthur MW, Moss DS, Thornton JM. PROCHECK: a program to check the stereochemical quality of protein structures. J Appl Crystallogr. 1993;26:283–291.
31. Zhang XJ, Matthews BW. EDPDB: a multifunctional tool for protein structure analysis. J Appl Crystallogr. 1995;28:624–630.
32. de Rougemont A, Ruvoen-Clouet N, Simon B, Estienney M, Elie-Caille C, et al. Qualitative and Quantitative Analysis of the Binding of GII.4 Norovirus Variants onto Human Blood Group Antigens. J Virol. 2011;85:4057–4070. [PMC free article] [PubMed]
33. Rydell GE, Nilsson J, Rodriguez-Diaz J, Ruvoen-Clouet N, Svensson L, et al. Human noroviruses recognize sialyl Lewis x neoglycoprotein. Glycobiology. 2009;19:309–320. [PubMed]

Articles from PLoS Pathogens are provided here courtesy of Public Library of Science