Human noroviruses are the major viral pathogens of epidemic acute gastroenteritis. These genetically diverse viruses comprise two major genogroups (GI and GII) and approximately 30 genotypes. Noroviruses recognize human histo-blood group antigens (HBGAs) in a diverse, strain-specific manner. Recently the crystal structures of the HBGA-binding interfaces of the GI Norwalk virus and the GII VA387 have been determined, which allows us to examine the genetic and structural relationships of the HBGA-binding interfaces of noroviruses with variable HBGA-binding patterns. Our hypothesis is that, if HBGAs are the viral receptors necessary for norovirus infection and spread, their binding interfaces should be under a selection pressure in the evolution of noroviruses.
Methods and Findings
Structural comparison of the HBGA-binding interfaces of the two noroviruses has revealed shared features but significant differences in the location, sequence composition, and HBGA-binding modes. On the other hand, the primary sequences of the HBGA-binding interfaces are highly conserved among strains within each genogroup. The roles of critical residues within the binding sites have been verified by site-directed mutagenesis followed by functional analysis of strains with variable HBGA-binding patterns.
Conclusions and Significance
Our data indicate that the human HBGAs are an important factor in norovirus evolution. Each of the two major genogroups represents an evolutionary lineage characterized by distinct genetic traits. Functional convergence of strains with the same HBGA targets subsequently resulted in acquisition of analogous HBGA binding interfaces in the two genogroups that share an overall structural similarity, despite their distinct locations and amino acid compositions. On the other hand, divergent evolution may have contributed to the observed overall differences between and within the two lineages. Thus, both divergent and convergent evolution, as well as the polymorphic human HBGAs, likely contribute to the diversity of noroviruses. The finding of genogroup-specific conservation of HBGA binding interfaces will facilitate the development of rational strategies to control and prevent norovirus-associated gastroenteritis.
Citation: Tan M, Xia M, Chen Y, Bu W, Hegde RS, Meller J, et al. (2009) Conservation of Carbohydrate Binding Interfaces — Evidence of Human HBGA Selection in Norovirus Evolution. PLoS ONE 4(4): e5058. doi:10.1371/journal.pone.0005058
Editor: Robert J. Geraghty, University of Minnesota, United States of America
Received: December 17, 2008; Accepted: February 4, 2009; Published: April 1, 2009
Copyright: © 2009 Tan et al. This is an open-access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited.
Funding: The research described in this article was supported by the National Institute of Health, the National Institute of Allergy and Infectious diseases (R01 AI37093 and R01 AI55649) and National Institute of Child Health (PO1 HD13021), and the Department of Defense (PR033018) to X.J. This work was also supported by a grant from the Translational Research Initiative of Cincinnati Children's Hospital Medical Center (SPR102032) to M.T. The funders had no role in study design, data collection and analysis, decision to publish, or preparation of the manuscript.
Competing interests: The authors have declared that no competing interests exist.
Noroviruses, a group of single-stranded, positive sense RNA viruses, constitute one of the six genera of the family Caliciviridae , , . Noroviruses are genetically highly diverse containing 5 genogroups (G) and ~30 genotypes . GI, GII and GIV infect humans and cause acute gastroenteritis , , , GIII infect cattle causing similar diseases , while GV infect mice and cause disease only in immune compromised mice . GI and GII are studied extensively owing to their importance in human disease. Although some genogroup-specific features related to the epidemiology and environmental transmission have been analyzed , , , the biological significance and evolutionary relationships regarding the HBGA interaction of the two major genogroups of human noroviruses requires further elucidation.
Noroviruses contain a major structural protein (VP1) that forms the viral capsid . The VP1 has two principle domains, the shell (S) and the protruding (P) domains, linked by a short hinge. The S domain forms the interior icosahedral shell, while the P domain constitutes the arch-like P dimer protruding from the shell. The P domain can be further divided into two subdomains, P1 and P2, with the P2 subdomain at the outermost surface of the viral capsid , , , . The S and P domains appear to be structurally and functionally independent, as suggested by the facts that the P domain alone forms a dimer and P particle (12 P dimers), and both P dimer and P particle retain receptor-binding function , , , while the S domain alone forms S particle without receptor-binding function . In addition, a large amount of soluble P protein has been found in the stools of Norwalk virus-infected patients , , , although its biological significance remains unknown.
Human noroviruses recognize human histo-blood group antigens (HBGAs), most likely as receptors or co-receptors (, , , , , , , , , reviewed in , ). HBGAs are complex carbohydrates linked to proteins or lipids on the surface of red blood cells and mucosal epithelia of the respiratory, genitourinary, and digestive tracts, or as free oligosaccharide in biological fluids such as milk and saliva. A number of distinct binding patterns of noroviruses to HBGAs have been described according to the ABO, Lewis and secretor types of the human HBGAs , , . The prototype Norwalk virus (GI-1) represents one of these patterns and binds to saliva of A and O secretors. Other binding patterns include the A, B, O secretor binder of VA387 (GII-4), A, B binder of MOH (GII-5), and A, O secretor and nonsecretor binder of VA207 (GII-9) and Boxer (GI-8) . The variable binding patterns have been further sorted into two major binding groups, the A/B binding group and Lewis binding group based on shared HBGA targets within binding groups. The A/B binding group recognizes mainly the A/B/H but not the Lewis epitopes, while the Lewis binding group binds the Lewis epitopes but not the A/B epitopes (, reviewed in , ). Strains of both A/B and Lewis binding groups can be found in the two major genogroups of human noroviruses. In addition, virus-HBGA interaction has also been found in two other genera of caliciviruses. For example, the rabbit hemorrhagic disease virus (RHDV) binds to H antigen , while the Tulane virus binds to B antigen , suggesting that the involvement of HBGAs in calicivirus infection may be a common phenomenon.
High resolution 3D structures of HBGA-binding interfaces of Norwalk virus (GI-1) and VA387 (GII-4) have recently been determined , , . As shown by these studies, the binding interfaces of both strains are located in the same region of the outermost P2 domain, but the positions and amino acid composition of individual binding sites and the HBGA binding modes are different. The HBGA-binding interface of Norwalk virus is located within a single monomer of the P dimer (Fig. 1, left panel), while the binding interface of VA387 involves both monomers of the P dimer (Fig. 1, right panel), although in both cases dimerization of the P domains appears to be required for their binding function , , . While both Norwalk virus and VA387 belong to the A/B binding group and share the same A and H antigens, the primary sequences of their binding sites and the binding modes to HBGAs are different. Norwalk virus interacts with the α-GalNAc and α-Fus of the A trisaccharide or α-Fuc and β-Gal of the H tetrasaccharide , , while VA387 interacts with all three terminal sugars of A (α-GalNAc-β-Gal-α-Fuc) and B (α-Gal-β-Gal-α-Fuc) trisaccharides .
The surface models of the P dimers (top views) with indications of the HBGA-binding interfaces (colored regions) are shown in (A) and (B) with one monomer being shown in darker gray than another. Enlargements of the HBGA-binding interfaces are shown in (C) and (D) correspondingly with labels of individual amino acids, in which the prime symbol indicates a residue of another protomer. The three major components of the binding interfaces are colored in green (site I), red (site II), and orange (site III), respectively, while the trisaccharides binding to the interface are in yellow in (A) and (B) or in variable colors (C-cyans, O-red, and N-blue) in (C) and (D). The amino acids around the interface that affect the binding specificity are in light blue. (E) and (F) are schematic diagrams of hydrogen bonding network (dash lines) between the amino acids of the P dimers of Norwalk virus (E), or VA387 (F) and the A- or B- type trisaccharides. The water-bridged hydrogen bonds are indicated by W. (A) to (D) were prepared by software PyMOL version 1.0 (Delano Scientific), while (E) and (F) by software ChemDraw Pro version 11.0 (Adept Scientific). (E) is adapted from  with permission. The original data were published in , , .
In this study, we characterized the genetic relatedness of human noroviruses in the context of their host carbohydrate-binding specificity by sequence alignment and structural analysis, followed by site-directed mutagenesis and functional assay for strains representing both GI and GII and both A/B and Lewis binding groups. We showed that the HBGA-binding interfaces are highly conserved among strains within, but not between, the two genogroups, and that the conserved binding interfaces are able to interact with variable HBGAs of the ABH and Lewis families. Our results suggested that human HBGA may be a selection factor in norovirus evolution. The polymorphic human HBGAs and the highly adaptative nature of noroviruses may underlie the observed diversity of noroviruses. The high conservation of HBGA-binding interfaces within genogroups may also help in the development of new strategies to control and prevent norovirus infection.
Materials and Methods
Construction of mutant P particles
Bacterial expression constructs for wild type P particles were made by cloning of the P domain-encoding sequences into the plasmid pGEX-4T-1 (Amersham Bioscience, Piscataway, NJ). To enhance the efficiency of P particle formation a cysteine-containing short peptide was linked to either the N- (Norwalk virus, CNGRC) or C- (Boxer, MOH, and VA207, CDCRGDCFC) termini of the P domains, as reported previously , , , , . Mutant P particles with single amino acid substitutions were designed and constructed by site-directed mutagenesis using the corresponding wild type constructs as templates. Site-directed mutagenesis was performed using the QuickChange Site-Directed Mutagenesis Kit (Stratagene, La Jolla, CA) and the corresponding primer pairs with mutations (Table 1) as described previously , , , , . The wild type and mutant P particles were expressed and purified as described previously , , , , . Briefly, after sequence confirmation through DNA sequencing, the mutant constructs were expressed in E. coli strain BL21 with an induction of IPTG (0.25 mM) at room temperature (~25°C) overnight. The P protein-GST fusion was purified by the Glutathione Sepharose 4 flow (GE Healthcare Life Sciences, Piscataway, NJ) according to the manufacturer's protocol. P proteins were released from GST tag by thrombin (GE Healthcare Life Sciences, Piscataway, NJ) digestion. The formation of P particle and P dimer was determined by gel filtration using a size-exclusion column Superdex 200 (GE Healthcare Life Sciences, Piscataway, NJ) powered by an AKTA-FPLC system (model 920, GE Healthcare Life Sciences, Piscataway, NJ) followed by SDS-PAGE electrophoresis, in which the P particles form a peak at ~830 kDa and the P dimer at ~69 kDa, respectively previously , , , , . The efficiency of P particle formation for Norwalk virus, Boxer, MOH, and VA207 were ~70%, ~80%, ~80%, and ~90%, respectively. None of the designed single residue mutations in this study affected P particle formation.
Saliva-based HBGA binding assay
These were performed as described elsewhere , , , . The affinity-column purified P particles were first diluted to 1 mg/ml as a starting solution. They were then diluted further in a 3-fold-series to indicated concentration directly on the testing Elisa plates that had been coated with different saliva. The different P particles were incubated with coated saliva samples for 60 min at 37°C. Five well-characterized saliva samples representing typical blood types of “O”, “A”, “B”, “AB” secretor and “O” nonsecretor ,  were used for the binding assays.
Crystal structure visualization and analysis
The crystal structures of the P dimers of Norwalk virus and VA387 complexed with type A-, B-trisaccharides, and/or H-pentasaccharide were analyzed using the PyMOL software (DeLano Scientific LLC, Palo Alto, CA) and the Polyview-3D server (http://polyview.cchmc.org). The PDB files of the Norwalk virus P protein in complex with A-trisaccharide (3D26 and 2ZL7) or H-pentasaccharide (2ZL6) ,  and VA387 P protein in complex with A-trisaccharide (2OBS) or with B-trisaccharide (2OBT)  were downloaded from the Protein Data Bank at Rutgers University, New Brunswick, NJ (http://www.rcsb.org).
Characterization of the HBGA-binding interface of Norwalk virus
The prototype Norwalk virus (GI-1) and strain VA387 (GII-4), each representing genogroups (G) I and II of human noroviruses, exhibit distinct HBGA binding patterns but share the ability to bind to the A and H antigens . Crystallographic studies indicated that these two strains use distinct binding interfaces and modes of interaction with HBGA receptors (Fig. 1, , , ). Our recent site-directed mutagenesis analysis of VA387 has identified a number of additional amino acids around the carbohydrate binding interface that are also involved in HBGA binding . To further elucidate the differences and similarities between the two HBGA-binding interfaces and modes, we extended such mutagenesis analysis to the Norwalk virus.
Among 17 mutant P particles with single amino acid changes into alanines/serines in and around the HBGA-binding interface of the Norwalk virus (Fig. 2 and Table 2), six mutants (D327A, H329A, D344A, W375A, S377A, and S380A) lost their binding to HBGAs completely or nearly completely (H329A) (Fig. 1 and Fig. 2, ), indicating that these amino acids are critical for the structural integrity of the binding interface. Residue Q342 that was shown to interact with the type H but not the type A oligosaccharide ,  affected binding mainly to the H when it was replaced by an alanine (Fig. 2). Similar effects were also seen in mutants P378A, A430S, and Y431A, respectively, although P378 was predicted to interact with both types H and A oligosaccharides, while A430 and Y431 do not appear to interact with either of the two oligosaccharides , . Furthermore, a replacement of S338 by an alanine (S338A) did not affect binding to either A or H antigens (Fig. 2 and ), but a change to an asparagine (S338N) wiped out binding to H without affecting binding to A antigen (Fig. 2), although S338 from a heterologous P monomer interacted with the α-N-acetylgalacosamine (α-GalNAc) of the A antigen via a water-bridged hydrogen bond . The involvement of residues S338, P378, A430, and Y431 in HBGA-binding specificity may be supported by their common location in a region on one side of the binding interface (Fig. 1C), although the mechanism remains to be elucidated. In contrast, although residues H337, S339, Q340, N362, and G363 are also located around the binding interface, their mutations into alanines did not have significant impact on binding to HBGAs (Fig. 2 and Table 2).
X-axes show protein concentrations of the P particles and Y axes indicate the optical densities at 450 nm (OD450) that were the average values of triplicate experiments. “O”, “A” and “N” represent the saliva samples of type O secretor (containing H antigen), type A secretor, and type O nonsecretor, respectively. Data of mutations D327A, H329A, S338'A, and S377A are adapted with permission from . Mutants with prime symbol (') indicate the mutated residues of another P monomer.
Distinct HBGA-binding interfaces and modes between Norwalk virus and VA387
The crystal structures of the binding interfaces , ,  and the site-directed mutagenesis (and data in the previous section) suggest that the HBGA-binding interfaces of both Norwalk virus and VA387 can be divided into three major analogous regions, representing the bottom (site I) and the walls (site II and III) of the interface (Fig. 1). These sites are composed of either a single or a cluster of sterically closed amino acids, including D327 and H329 (site I), Q342 and D344 (site II), and W375, S377, and S380 (site III) for Norwalk virus and S343 to H347 (site I), D374 (site II), and S441 and G442 (site III) for VA387, but none of these sites are shared by the two strains. It should be noted that all three sites of Norwalk virus are formed by residues of a single P2 subdomain without direct interactions with the P1 subdomain or the dimer-related P2 subdomain, while site III (S441 and G442) of VA387 is formed by the top of an exposed loop of P1 subdomain, and it involves the other chain of the VA387 P dimer. In both strains these three sites interact with at least two sugars of the A, B or H antigens. However, in Norwalk virus the major contacts are on the α-N-acetyl galactosamine (α-GalNAc) of the A trisaccharide or the β-galactose (β-Gal) of H pentasaccharide , , while in VA387 the major contacts are on the α-fucose (α-Fuc) of the A and B trisaccharides .
The HBGA-binding interfaces are conserved within but not between genogroups
Sequence alignments of the P domains of noroviruses representing 8 GI and 17 GII genotypes (Fig. 3) show that the three sites of the HBGA-binding interfaces are highly conserved within each genogroup. Sites I and III are more conserved than site II for the GI viruses, while all three sites are highly conserved among GII viruses except for strains of GII-13, including site III that is in the P1 subdomain of the capsid (Fig. 1 and 3). The overall sequence identities of the P2 subdomains are only 31–56% for strains within each of the two genogroups, further indicating the selective pressures of the HBGAs on the receptor binding interfaces.
Sequence of the three major components (red letters) of the HBGA-binding interfaces of 10 genogroup I (GI) (A) and 17 genogroup II (GII) (B) noroviruses, representing each of the 8 GI and 17 GII genetic types, respectively, are aligned based on the two known binding interfaces of Norwalk virus (GI) and VA387 (GII). Star symbols label the residues that have been experimentally shown to be required for binding to HBGAs. The two strains that have no detectable binding to examined HBGAs are underlined. The accession numbers of the sequence are: M87661 (Norwalk virus), L23828 (KY 89), L07418 (SOV), AF414403 (HLL), U04469 (DSV), AY038598 (VA115), AB042808 (Chiba), AJ277614 (Musgrove), AY502008 (Wiscon), AJ277609 (Winchester), AF538679 (Boxer), AY038600 (VA387), U07611 (Hawaii), AY134748 (SMV), U22498 (Mexico), X86557 (Lordsdale), AF397156 (MOH), AF414407 (Florida269), AJ277608 (Leeds), AF195848 (Amsterdam), AAK84676 (VA207), AF427118 (Erfurt), AB074893 (SW918), AJ277618 (Wortley), AY113106 (Fayettevil), AY130761 (M7), AY130762 (J23), AY502010 (Tiffin), AY502009 (CS-E1).
All three conserved sites are required for Boxer (GII-8) binding to HBGAs
The high conservation of the HBGA-binding interfaces raises an important question on the role of human HBGAs in norovirus evolution. For strains with similar binding patterns within the same genogroups, such as Norwalk virus (GI-1) and C59 (GI-2) in the A/B binding group that both bind to types A and H antigens of secretors but not to the Lewis antigens of the non-secretors , such conservation is understandable. However, the conservation of HBGA binding interfaces among strains with distinct binding patterns, such as Boxer (GI-8) of the Lewis binding group , seems to challenge the hypothesis that HBGAs confer selective pressure on norovirus evolution.
To address this apparent inconsistency and further elucidate the relationship between structure and HBGA binding patterns, we performed mutagenesis analysis on the role of the three conserved sites in HBGA binding of the Boxer virus. Three sets of mutant P particles were constructed. The first set contained 5 mutants with single residue mutations in each of the three GI conserved sites (Fig. 3A), including H334A (site I), G348A, D349A, P350A (site II), and W392A (site III); the second set were 7 mutants with mutations in regions corresponding to each of the three GII conserved sites (Fig. 3B), including T347A (site I), E377A, L378A, D379A, Q380A, F381A (site II), and N444A (site III); while the third set contained 3 mutants with mutations (I340A, N341A and P342A) away from the predicted binding interface as control (Fig. 4 and Table 3). The saliva-based binding results showed that all 5 mutants of the first set but none of the 7 mutants in the second set lost their binding to HBGAs completely or nearly completely (Fig. 4 and Table 3). As expected, the three mutants in the third set did not affect the binding. Therefore, all three conserved sites deduced from the GI Norwalk virus for the A/B binding group are also involved in HBGA-binding of the GI Boxer virus of the Lewis binding group. These data provided functional evidence for the conservation of the HBGA binding interface of GI noroviruses.
The X-axes show the protein concentrations of the P particles and the Y axes indicate the optical densities at 450 nm (OD450) that were the average value of triplicate experiments. “O” and “A” represent the saliva samples of type O (containing H antigen) and A secretor, respectively, while “N” one of nonsecretor.
Similar genetic relatedness also exists among GII noroviruses
Similar variations of HBGA binding have also been found among GII strains. For example, strains MOH (GII-5), Buds (GII-2), Parris Island (GII-13), MxV (GII-3), members of the A/B binding group , target the common A, B and/or H antigens as VA387 (GII-4) does. Using a similar approach we constructed three mutant P particles (R347A, D376A and G441A) of MOH, each with a single residue change in the three GII conserved sites deduced from VA387 (Fig. 3B) and the binding to HBGAs of all three mutants were completely (R347A and D376A) or nearly completely (G441A) lost (Fig. 5 and Table 4). These results demonstrated that MOH shares the three conserved binding sites with VA387 which is consistent with their recognition of the common A and B antigens.
The X-axes show the protein concentrations of the P particles and the Y axes indicate the optical densities at 450 nm (OD450) that were the average value of triplicate experiments. “O”, “A”, “B” and “AB” represent the saliva samples of type O (containing H antigen), A, B, and AB secretor, respectively, while “N” one of nonsecretor.
We then examined the role of the three GII conserved sites in HBGA-binding of VA207 (GII-9), a strain of the Lewis binding group that recognizes the Lex and Ley but not the A and B antigens. Construction of three mutants (R346A, D374A and G440A) with a single residue mutation at each of the three binding sites (Fig. 3B) resulted in complete (R346A) or nearly complete (D374A and G440A) loss of binding to HBGAs (Fig. 5 and Table 4). These data showed that VA207 shares the common HBGA-binding interface with those A/B binding strains within GII noroviruses and this has been recently confirmed by the crystal structure of VA207 P dimer in complex with Lewis antigen (Y. Chen, X. Jiang and X. Li to be published data, also see discussion).
In this study, we used sequence alignment, structural analysis and site-directed mutagenesis to examine the evolutionary relatedness of human noroviruses in terms of their interaction with the HBGA receptors. We showed that strains with distinct HBGA binding patterns within genogroups share common receptor binding interfaces in their interactions with variable HBGAs, likely tuned up by subtle structural differences within the binding interfaces. At the same time, strains in different genogroups that use different binding interfaces, as defined by their locations and sequence motifs, can recognize the same HBGA-targets, pointing to the overall functional and structural similarity of these distinct binding sites. These results provide evidence that the human HBGAs exert an important selection pressure in norovirus evolution. The two major genogroups (G I and GII) of human noroviruses that cause acute gastroenteritis represent two major evolutionary lineages, while strains in the A/B and Lewis binding groups within the two genogroups, such as those represented by the Norwalk virus and Boxer in GI and those by VA387 and VA207 in GII, may further divide into evolutionary sub-lineages as a result of divergent evolution within each branch (Fig. 6).
Six calicivirus genera may be evolved from a common calicivirus ancestor and at least one strain from four genera (orange) has been shown to bind to carbohydrates. Similarly, five norovirus genogroups (G) may be evolved from a common norovirus ancestor and two of the three human norovirus genogroups have been demonstrated to recognize HBGAs (purple). GI and GII noroviruses share conserved genogroup-specific HBGA-binding interfaces and both genogroups contain strains binding to either A/B/H antigens (A/B binding groups, yellow) or Lewis antigens (Lewis binding group, green). GII-13 (Blue) is a unique genotype that does not share conserved binding sites with other GII genotypes and thus may represent a sublineage parallel to other GII genotypes, in which a strain (OIF) has been shown to bind to Lewis antigens (green). Arrows indicate the direction of evolution. Solid line shows the evolutionary lineages with defined binding to HBGAs, while the dashed line shows the lineages with unknown interaction with carbohydrates. RHDV, rabbit hemorrhagic disease virus; TV, Tulane virus; FCV, feline calicivirus; OIF, norovirus strain that was isolated from troops deployed to the Operation of Iraqi Freedom.
The HBGA-binding interfaces of the two major genogroups of human noroviruses share some similarity in the overall structure and location; both are located in the outermost P2 regions of the capsids , ,  and both are composed of three major structural components, corresponding to the bottom and the walls of the binding pocket (Fig. 1). However, the two binding interfaces differ in their primary sequences, detailed locations, and modes of interaction with the HBGA-receptors , , . The binding interface of the GI strains (Norwalk virus) is constituted by three groups of amino acids from the P2 subdomain and positioned mainly in one P monomer, although it is near the interface of two P monomers of the Norwalk virus P dimer. On the other hand, the binding interface of the GII viruses (VA387) is composed of residues from both P1 and P2 subdomains and is located right at the interface of two monomers in the VA387 P dimer , , , . The conservation of the binding interfaces within GII has been confirmed by the crystal structures of the VA207 P dimers in complex with the Lewis x and Lewis y tetrasaccharides, respectively, in which the binding interface of VA207, a Lewis binding strain, is constituted by the conserved amino acids and interacts with the α-1,3/4 fucose of the Lewis y antigen in a similar way like that of VA387 (Y. Chen, X. Jiang and X. Li, to be published data).
The two types of binding interfaces differ also in their binding modes to HBGAs. Based on crystal structures of the P dimers complexed with oligosaccharides, Norwalk virus has a smaller or narrower binding interface, while VA387 has a larger or broader one (Fig. 1, , , ). As a result only two sugars of the A trisaccharide and the H pentasaccharide are involved in interaction with Norwalk virus , , as opposed to all three sugars of the A and B trisaccharides in case of VA387 , . In addition, more amino acid residues of VA387 appear to be involved in binding to HBGAs, compared to Norwalk virus. Specifically, crystal structures revealed 11 residues of VA387 P domain interacting with the B trisaccharides, as opposed to only 7 in case of Norwalk virus P domain binding to the A or H oligosaccharides , , . Furthermore, mutagenesis studies mapped another 8 amino acids around the binding interface of VA387 affecting the binding function , while only 2 such residues of Norwalk virus were found (this report). Nevertheless, the aforementioned binding modes are based solely on the P dimers interacting with oligosaccharides under the condition of co-crystallization. The native interactions between norovirus and HBGAs in vivo remain to be elucidated.
Another observation emerging from this study is the possibility of interplay between convergent and divergent evolution of noroviruses. The two major genogroups (GI and GII) of human noroviruses are characterized by distinct genetic traits with significant differences in the primary sequence within their P domains. These two distinct lineages may have evolved in the course of divergent evolution from a common ancestor. On the other hand, the acquisition of the common function of binding to HBGAs by distinct binding interfaces and modes is consistent with functional convergence as a result of adaptation to and selection by the same niche of human HBGAs. The two strains described in this study, VA387 and Norwalk virus, provide strong support for this hypothesis. Convergent evolution of protein function and/or structure in conjunction with acquired ligand binding specificity has been observed previously , , . One such example includes sugar binding families of LacI/GalR repressors and their PBP analogues, in which evolutionarily divergent lineages acquired independently similar ligand binding patterns through convergent evolution .
The fact that almost all known HBGAs have their noroviral counterparts suggests that noroviruses are highly adaptive human pathogens. In addition, it has been noted that some strains with conserved binding interfaces appear not to recognize HBGAs, such as the Desert Shield virus (DSV, GI-3)  and Hunter virus (GII-4) , while other strains lacking the conserved binding interfaces retain the HBGA-binding ability, such as OIF of the GII-13 noroviruses , . These variations further highlight the adaptive nature of noroviruses that may recognize other carbohydrates or even non-carbohydrates as receptors. As long as noroviruses remain a human pathogen, the diversity of HBGA-binding patterns seen today will probably extend into the future.
Limited studies have shown that the GI and GII noroviruses are biologically different. For example, the GI noroviruses are more involved in environmental contamination and cause outbreaks year around without apparent seasonal peaks, while GII strains are easier to spread via person-to-person contact , ,  and commonly cause outbreaks with clear fall/winter peaks. While future studies are required to identify factors and genetic markers responsible for these differences, this work can help to elucidate the evolutionary relatedness of the GI and GII noroviruses and improve the classification of caliciviruses (Figure 6). Each of the four major genera and the two newly discovered “Becovirus”  and “Recovirus”  genera should represent an evolutionary lineage in this virus family. While each of them has adapted well into individual host species, the binding to carbohydrates has apparently been maintained or acquired in at least some strains of most genera of caliciviruses, other than human noroviruses. For example, the rabbit hemorrhagic disease virus (RHDV) of the Lagovirus and the Tulane virus (TV) of the Recovirus recognize HBGAs , , while feline calicivirus (FCV) bind to sialic acid . Since the common ancestor of these genetically distinct species might not possess the HBGA binding trait, one might speculate that these common characteristics were acquired independently as a result of adapting to similar biological niches, suggesting a possible convergent evolution of caliciviruses.
Our mutagenesis study further demonstrated that, in addition to the conserved binding sites, a number of nearby amino acids also play an important role in the binding specificity to HBGAs, possibly by contributing to the conformational flexibility of the carbohydrate binding interfaces, and these residues are less conserved. For example, residues Q331, K348, I389, and G392 of VA387 are likely involved in the binding to the A but not the B antigens , while S338, A430 and Y431 of Norwalk virus affect the binding strongly to H but weakly to A antigen (this study). Similar role of D393 of another GII-4 strain was also observed . The recent studies on the globally dominant GII-4 noroviruses suggests that the host herd immunity may play a role in the epochal evolution of GII-4 viruses , . Future studies focusing on these non-conserved residues for their potential roles in the antigenicity and immunogenicity of the viruses may be necessary.
In this study 39 mutant P particles of four strains (Norwalk, Boxer, MOH, and VA207) have been generated to address the conservation issue of the HBGA binding interfaces of noroviruses. This task would be very difficult to complete by using the VLPs as the model, because VLP production are very time-consuming compared to P particles. In our previous studies we have demonstrated that the P particle is a good model for studying norovirus-HBGAs interaction by the observations that P particle uses the same HBGA binding interface and shares very similar HBGA binding profile as that of its VLP counterpart , , , . In addition, we used the saliva binding assay for its simplicity, convenience and sensitivity. All saliva samples used in this report have been well characterized for their phenotypes and binding patterns to noroviruses in our previous studies , , , , , , . We do not expect significant differences with respect to synthetic oligosaccharide-based assays in evaluation of the importance of HBGA binding sites.
The findings of the conservation of HBGA-binding interfaces within genogroups can greatly facilitate the design and development of therapeutics against noroviruses. For example, a single compound that inhibits the function of the conserved HBGA-binding interface may be capable of blocking infection of all strains with the same type of HBGA binding interface. Thus, only two compounds might be sufficient to block most noroviruses in the two genogroups studied here, each group sharing a similar binding interface that could be blocked by one common inhibitor.
We thank Weiming Zhong for technical support in performing the protein expression and Dr. Xuejun C. Zhang for helpful comments to this manuscript.
Conceived and designed the experiments: MT XJ. Performed the experiments: MT MX. Analyzed the data: MT MX YC WB RH JM XL. Contributed reagents/materials/analysis tools: MT RH JM. Wrote the paper: MT MX XJ.
- 1. Green K, Chanock R, Kapikian A (2001) Human Calicivirus. In: Knipe DM, Howley PM, Griffin DE, Lamb RA, Martin MA, et al., editors. Fields Virology. 4th ed. Philadelphia: Lippincott Williams & Wilkins. pp. 841–874.
- 2. Farkas T, Sestak K, Wei C, Jiang X (2008) Characterization of a rhesus monkey calicivirus representing a new genus of Caliciviridae. J Virol 82: 5408–5416.
- 3. Oliver SL, Asobayire E, Dastjerdi AM, Bridger JC (2006) Genomic characterization of the unclassified bovine enteric virus Newbury agent-1 (Newbury1) endorses a new genus in the family Caliciviridae. Virology 350: 240–250.
- 4. Zheng DP, Ando T, Fankhauser RL, Beard RS, Glass RI, et al. (2006) Norovirus classification and proposed strain nomenclature. Virology 346: 312–323.
- 5. Estes MK, Prasad BV, Atmar RL (2006) Noroviruses everywhere: has something changed? Curr Opin Infect Dis 19: 467–474.
- 6. Tan M, Jiang X (2008) Norovirus gastroenteritis, increased understanding and future antiviral options. Curr Opin Investig Drugs 9: 146–151.
- 7. Scipioni A, Mauroy A, Vinje J, Thiry E (2008) Animal noroviruses. Vet J 178: 32–45.
- 8. Wobus CE, Thackray LB, Virgin HWt (2006) Murine norovirus: a model system to study norovirus biology and pathogenesis. J Virol 80: 5104–5112.
- 9. da Silva AK, Le Saux JC, Parnaudeau S, Pommepuy M, Elimelech M, et al. (2007) Evaluation of removal of noroviruses during wastewater treatment, using real-time reverse transcription-PCR: different behaviors of genogroups I and II. Appl Environ Microbiol 73: 7891–7897.
- 10. Moe C, Honorat E, Leon J, Eisenberg J (2007) Epidemiologic patterns of published norovirus outbreak reports, 1981–2006. Program and Abstracts of Third International Calicivirus Conference 39.
- 11. Hedlund K, Anderson Y, Hjertqvist M, Lysen M (2007) Norovirus genogroup I dominates in Swedish Waterborne Outbreaks. Program and Abstracts of Third International Calicivirus Conference 38.
- 12. Prasad BV, Hardy ME, Dokland T, Bella J, Rossmann MG, et al. (1999) X-ray crystallographic structure of the Norwalk virus capsid. Science 286: 287–290.
- 13. Bu W, Mamedova A, Tan M, Xia M, Jiang X, et al. (2008) Structural basis for the receptor binding specificity of Norwalk virus. J Virol 82: 5340–5347.
- 14. Cao S, Lou Z, Tan M, Chen Y, Liu Y, et al. (2007) Structural basis for the recognition of blood group trisaccharides by norovirus. J Virol 81: 5949–5957.
- 15. Choi JM, Hutson AM, Estes MK, Prasad BV (2008) Atomic resolution structural characterization of recognition of histo-blood group antigens by Norwalk virus. Proc Natl Acad Sci U S A 105: 9175–9180.
- 16. Tan M, Hegde RS, Jiang X (2004) The P domain of norovirus capsid protein forms dimer and binds to histo-blood group antigen receptors. J Virol 78: 6233–6242.
- 17. Tan M, Jiang X (2005) The p domain of norovirus capsid protein forms a subviral particle that binds to histo-blood group antigen receptors. J Virol 79: 14017–14030.
- 18. Tan M, Meller J, Jiang X (2006) C-terminal arginine cluster is essential for receptor binding of norovirus capsid protein. J Virol 80: 7322–7331.
- 19. Greenberg HB, Valdesuso JR, Kalica AR, Wyatt RG, McAuliffe VJ, et al. (1981) Proteins of Norwalk virus. J Virol 37: 994–999.
- 20. Hardy ME, White LJ, Ball JM, Estes MK (1995) Specific proteolytic cleavage of recombinant Norwalk virus capsid protein. J Virol 69: 1693–1698.
- 21. Huang P, Farkas T, Marionneau S, Zhong W, Ruvoen-Clouet N, et al. (2003) Noroviruses Bind to Human ABO, Lewis, and Secretor Histo-Blood Group Antigens: Identification of 4 Distinct Strain-Specific Patterns. J Infect Dis 188: 19–31.
- 22. Huang P, Farkas T, Zhong W, Tan M, Thornton S, et al. (2005) Norovirus and histo-blood group antigens: demonstration of a wide spectrum of strain specificities and classification of two major binding groups among multiple binding patterns. J Virol 79: 6714–6722.
- 23. Hutson AM, Airaud F, LePendu J, Estes MK, Atmar RL (2005) Norwalk virus infection associates with secretor status genotyped from sera. J Med Virol 77: 116–120.
- 24. Hutson AM, Atmar RL, Graham DY, Estes MK (2002) Norwalk virus infection and disease is associated with ABO histo-blood group type. J Infect Dis 185: 1335–1337.
- 25. Hutson AM, Atmar RL, Marcus DM, Estes MK (2003) Norwalk virus-like particle hemagglutination by binding to h histo-blood group antigens. J Virol 77: 405–415.
- 26. Harrington PR, Lindesmith L, Yount B, Moe CL, Baric RS (2002) Binding of Norwalk virus-like particles to ABH histo-blood group antigens is blocked by antisera from infected human volunteers or experimentally vaccinated mice. J Virol 76: 12335–12343.
- 27. Harrington PR, Vinje J, Moe CL, Baric RS (2004) Norovirus Capture with Histo-Blood Group Antigens Reveals Novel Virus-Ligand Interactions. J Virol 78: 3035–3045.
- 28. Lindesmith L, Moe C, Marionneau S, Ruvoen N, Jiang X, et al. (2003) Human susceptibility and resistance to Norwalk virus infection. Nat Med 9: 548–553.
- 29. Marionneau S, Ruvoen N, Le Moullac-Vaidye B, Clement M, Cailleau-Thomas A, et al. (2002) Norwalk virus binds to histo-blood group antigens present on gastroduodenal epithelial cells of secretor individuals. Gastroenterology 122: 1967–1977.
- 30. Tan M, Jiang X (2005) Norovirus and its histo-blood group antigen receptors: an answer to a historical puzzle. Trends Microbiol 13: 285–293.
- 31. Tan M, Jiang X (2007) Norovirus-host interaction: implications for disease control and prevention. Expert Rev Mol Med 9: 1–22.
- 32. Shirato H, Ogawa S, Ito H, Sato T, Kameyama A, et al. (2008) Noroviruses distinguish between type 1 and type 2 histo-blood group antigens for binding. J Virol 82: 10756–10767.
- 33. Ruvoen-Clouet N, Ganiere JP, Andre-Fontaine G, Blanchard D, Le Pendu J (2000) Binding of rabbit hemorrhagic disease virus to antigens of the ABH histo-blood group family. J Virol 74: 11950–11954.
- 34. Farkas T, Sestak K, Jiang X (2008) Tulane virus specifically binds to type B histo-blood group antigen. Scientific Program and Abstracts of 27th Annual Meeting for American Society for Virology 271.
- 35. Tan M, Xia M, Cao S, Huang P, Farkas T, et al. (2008) Elucidation of strain-specific interaction of a GII-4 norovirus with HBGA receptors by site-directed mutagenesis study. Virology 379: 324–334.
- 36. Tan M, Fang P, Chachiyo T, Xia M, Huang P, et al. (2008) Noroviral P particle: structure, function and applications in virus-host interaction. Virology 382: 115–123.
- 37. Briscoe AD (2000) Six opsins from the butterfly Papilio glaucus: molecular phylogenetic evidence for paralogous origins of red-sensitive visual pigments in insects. J Mol Evol 51: 110–121.
- 38. Stewart CB, Schilling JW, Wilson AC (1987) Adaptive evolution in the stomach lysozymes of foregut fermenters. Nature 330: 401–404.
- 39. Sumiyama K, Kitano T, Noda R, Ferrell RE, Saitou N (2000) Gene diversity of chimpanzee ABO blood group genes elucidated from exon 7 sequences. Gene 259: 75–79.
- 40. Fukami-Kobayashi K, Tateno Y, Nishikawa K (2003) Parallel evolution of ligand specificity between LacI/GalR family repressors and periplasmic sugar-binding proteins. Mol Biol Evol 20: 267–277.
- 41. Lindesmith LC, Donaldson EF, Lobue AD, Cannon JL, Zheng DP, et al. (2008) Mechanisms of GII.4 norovirus persistence in human populations. PLoS Med 5: e31.
- 42. Tan M, Zhong W, Song D, Thornton S, Jiang X (2004) E. coli-expressed recombinant norovirus capsid proteins maintain authentic antigenicity and receptor binding capability. J Med Virol 74: 641–649.
- 43. Stuart AD, Brown TD (2007) Alpha2,6-linked sialic acid acts as a receptor for Feline calicivirus. J Gen Virol 88: 177–186.
- 44. Siebenga JJ, Vennema H, Renckens B, de Bruin E, van der Veer B, et al. (2007) Epochal evolution of GGII.4 norovirus capsid proteins from 1995 to 2006. J Virol 81: 9932–9941.
- 45. Tan M, Huang P, Meller J, Zhong W, Farkas T, et al. (2003) Mutations within the P2 Domain of Norovirus Capsid Affect Binding to Human Histo-Blood Group Antigens: Evidence for a Binding Pocket. J Virol 77: 12562–12571.