O-linked N-acetylglucosaminylation (O-GlcNAcylation) is a reversible post-translational modification that plays essential roles in many cellular pathways. Research in this field, however, is hampered by the lack of suitable probes to identify, accumulate, and purify the O-GlcNAcylated proteins. We have previously reported the identification of a lectin from the mushroom Agrocybe aegerita, i.e., Agrocybe aegerita lectin 2, or AAL2, that could bind terminal N-acetylglucosamine with higher affinities and specificity than other currently used probes. In this paper, we report the crystal structures of AAL2 and its complexes with GlcNAc and GlcNAcβ1-3Galβ1-4GlcNAc and reveal the structural basis of GlcNAc recognition by AAL2 and residues essential for the binding of terminal N-acetylglucosamine. Study on AAL2 may enable us to design a protein probe that can be used to identify and purify O-GlcNAcylated proteins more efficiently.
Citation: Ren X-M, Li D-F, Jiang S, Lan X-Q, Hu Y, Sun H, et al. (2015) Structural Basis of Specific Recognition of Non-Reducing Terminal N-Acetylglucosamine by an Agrocybe aegerita Lectin. PLoS ONE 10(6): e0129608. https://doi.org/10.1371/journal.pone.0129608
Editor: Avadhesha Surolia, Indian Institute of Science, INDIA
Received: January 6, 2015; Accepted: May 11, 2015; Published: June 26, 2015
Copyright: © 2015 Ren et al. This is an open access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited
Data Availability: Coordinates of the AAL2 and its complexes with GlcNAc and GlcNAcβ1-3Galβ1-4GlcNAc have been deposited in the Protein Data Bank with the accession codes 4TQJ, 4TQK, and 4TQM.
Funding: This work was supported by the funding from Chinese Ministry of Science and Technology 973 Programs (2011CB911100, 2011CB910304) to DCW, National Natural Science Foundation of China (31370849), the Chinese 111 Project (B06018), and from Wuhan University (2042014kf0226) to HS.
Competing interests: The authors have declared that no competing interests exist.
Lectins are a large group of non-enzyme proteins that recognize and specifically bind to carbohydrate moieties that are either free or conjugated on the surfaces of proteins or lipids . They are distinct from antibodies and mono- or oligosaccharide sensors/transporters. Lectins are ubiquitous in nature. They have been found in all kingdoms of organisms ranging from virus, bacteria, to eukaryotes including fungi, plants and animals . Lectins play crucial roles in diverse biological processes at both molecular and cellular levels , such as cell-cell interactions, host-pathogen recognitions, and innate immune responses and metastasis . Because of their capabilities to specifically recognize a highly diverse array of sugar moieties, lectins have been widely used in glycan studies: immobilized lectins, for example, have been invaluable tools in the enrichments and purifications of glycoproteins.
O-linked addition of N-acetylglucosamine (O-GlcNAcylation) on the serine and threonine residues of cytosolic and nuclear proteins is a reversible post-translational modification that plays essential cellular roles. Recent studies have shown that O-GlcNAcylation has extensive cross talk with phosphorylation, another essential protein modification, and serves as a nutrient sensor for the modulation of numerous signaling and transcription pathways [4, 5]. It was observed that major oncogenic factors such as p53, c-Myc, NFκB, and β-catenin are O-GlcNAcylated and that chromatin is modulated by O-GlcNAcylation, indicating that this modification may also serve as a cancer hallmark . O-GlcNAcylation is distinct from other types of glycosylations in that it only adds one GlcNAc, a small, non-charged moiety, onto the serine/threonine residues, resulting in a modification that is very difficult to detect using techniques such as 2-dimensional electrophoresis; the modification is also too labile to be readily detected by conventional mass spectroscopies [4, 7]. The lack of efficient probes led to the discovery of O-GlcNAcylation only in early 1980s , decades later than that of other protein glycosylations, despite its ubiquitous presence in eukaryotic cells. This lack of efficient tools still hampers the research in this important field [4, 7].
In this paper, we report the crystal structures of a lectin from Agrocybe aegerita, designated as AAL2, in its apo- and complex forms with GlcNAc and GlcNAcβ1-3Galβ1-4GlcNAc, as part of our endeavors to develop a system to facilitate the identification and purification of O-GlcNAcylated proteins.
Lectins from fungi (mainly from mushrooms) have attracted wide attention in recent years owing to their diverse bioactivities , including antitumor, antiproliferative, and immunomodulatory activities [10–12]. AAL2 was recently discovered from the fruit body of the edible mushroom A. aegerita. It consists of 407 amino acid residues, with a molecular weight of 43.17 kDa. It was the second lectin purified from A. aegerita to be characterized by x-ray crystallography by our groups, after A. aegerita AAL, an antitumor lectin that recognizes and binds Thomsen-Friedenreich antigen . AAL2 was found to specifically bind nonreducing terminal N-acetylglucosamine (GlcNAc) moieties of either alpha- or beta-linkage and was reported to have the activity of inducing apoptosis in hepatoma cells in vitro as well as inhibiting hepatoma growth in tumour-bearing mice in vivo . Glycan array analysis demonstrated that out of 465 glycans tested, AAL2 has a unique preference for GlcNAc-(Gal-GlcNAc)1–3 over (Gal-GlcNAc)1–3, which lacks the terminal GlcNAc . ITC assays also revealed that AAL2 has higher affinity towards GlcNAc compared to WGA (wheat germ agglutinin) and GSL-II (Griffonia simplicifolia lectin-II), which have been widely used in glycan studies for their GlcNAc-binding activities [15, 16]. AAL2, therefore, has the potential to be a better tool for research involving O-GlcNAcylated proteins.
To illustrate the mechanisms of how AAL2 specifically recognizes non-reducing terminal GlcNAc, we performed structural and surface plasmon resonance (SPR) affinity characterizations on AAL2 and its binding with GlcNAc and GlcNAcβ1-3Galβ1-4GlcNAc, as well as its divalent metal cation binding properties. These results showed that AAL2 has homologous sequence and similar structure with previously characterized fungal lectins PVL (Psathyrella velutina Lectin) , and it exclusively recognizes glycans with non-reducing terminal GlcNAc and the binding affinity between AAL2 and GlcNAc is relatively high. Our work, therefore, lays a solid foundation for the development of an efficient macromolecule probe that can be used to identify, enrich, and purify O-GlcNAcylated proteins and to locate the sites of modification.
The crystal structure of PVL, a fungal lectin from Psathyrella velutina, was reported in 2006 , which was the first member of the seven-blade β-propeller family of fungal lectins to be structurally characterized. But the specific recognition mechanism of glycans by the lectin protein needed further understanding. AAL2 is homologous with PVL and belongs to the same lectin family. To illustrate the mechanisms of how AAL2 specifically recognizes non-reducing terminal GlcNAc, we performed structural and SPR affinity characterizations on AAL2 and its binding with GlcNAc and GlcNAcβ1-3Galβ1-4GlcNAc, as well as its divalent metal cation binding properties. The results revealed the main structural basis for the exclusive recognition between AAL2 and glycans with non-reducing terminal GlcNAcs. Structure comparison showed a rather high similarity between AAL2 and PVL, indicating that these two lectins adopt a unique structure to specifically recognize glycans with non-reducing terminal GlcNAc. Our present work, in association with previous information, therefore, lays a solid foundation for the development of an efficient macromolecule probe that can be used to identify, enrich, and purify O-GlcNAcylated proteins and to locate the sites of modification.
Materials and Methods
Purification and crystallization
The gene of AAL2 was amplified from A. aegerita and cloned into pET30a expression vector between NdeI and HindIII restriction sites, without any tag. The recombined plasmid was transformed into Escherichia coli host strain BL21 (DE3) competent cells for expression. The AAL2 protein expression and first-step purification with a GlcNAc-coupled Sepharose 6B affinity column has been described in detail previously [14, 18]. The protein samples were then applied to a HiTrap SP XL column (GE Healthcare Life Science) to remove the glycan ligands that may bind AAL2. A Superdex 200 10/300 GL gel-filtration column (Amersham Bio-sciences) was subsequently used to further purify and to identify the oligomeric state. The purified protein was concentrated to 30 mg/ml in a buffer containing 50mM Tris-HCl pH 7.5 and 150mM NaCl.
AAL2 was mixed with GlcNAc (from Sigma) and GlcNAcβ1-3Galβ1-4GlcNAc trisaccharide (a gift from Consortium for Functional Glycomics), respectively, at AAL2:ligand molar ratios of 1:10, and the mixtures as well as pure AAL2 sample were used in crystallization condition searches. Crystallization was performed using the hanging-drop vapor diffusion method, by mixing protein and reservoir solutions with equal volume of 1μl and equilibrated against 400μl reservoir solutions at 293K. Crystals of AAL2 were obtained under the condition of 30% PEG 8000 (w/v), 0.2M sodium acetate and 0.1M sodium cacodylate pH 6.4. Crystals of AAL2-GlcNAc complex were obtained with the solution containing 25% PEG 4000 (w/v), 6% Tacsimate pH 6.0 and 0.1M MES pH 6.0. The crystals of AAL2-trisaccharide complex were obtained in solution containing 2.3M ammonium sulfate and 0.1M sodium acetate pH 4.4.
Data collection, structural solution and refinement
All the three types of crystals were mounted on cryoloops (Hampton Research) directly from the crystallization drops and flash-cooled in a nitrogen-gas stream at 95 K. X-ray diffraction data for AAL2 was collected on an in-house FR-E Bluemax/R-AXIS IV++/Varimax HF (Rigaku) diffractometer at wavelength of λ = 1.5418Å; the data for AAL2 complexed with GlcNAc was collected on beamline 1W2B at Beijing Synchrotron Radiation Facility (BSRF; Beijing, China) at λ = 1 Å; and the data for AAL2 complexed with GlcNAcβ1-3Galβ1-4GlcNAc was collected on an in-house MicroMax 007 Saturn944+ VarimaxHF (Rigaku) diffractometer at λ = 1.5418 Å. Each dataset was collected with a total of 180 images using an oscillation of 1° per image.
The data were indexed and integrated using the program MOSFLM v.7.0.4  and then scaled by SCALA v.6.0 from the CCP4 v.6.0.2 program suite . A sequence-homology BLAST search at NCBI website (http://www.ncbi.nlm.nih.gov/BLAST) identified a structurally characterized protein PVL that has 60% sequence identity with AAL2 . The structure of AAL2 was solved using molecular replacement method as implemented in the AutoMR program of the Phenix suite v.1.8.2  with the structure of PVL (PDB 2BWR) as a search model. The determination of the structures of the two complexes was similarly performed using AutoMR program in Phenix, with the structure of solved AAL2 as search models. The carbohydrate ligands were added and adjusted manually according to the electronic density map displayed in Coot . The structural refinement was performed using the Refine program in Phenix alternated with manual rebuilding using the graphical program Coot.
Surface plasmon resonance
The SPR experiments were carried out on a BIAcore T100 (GE Healthcare, Sweden) using CM5 sensor chips at 298 K. AAL2 protein solution was diluted to a concentration of 20μg/ml with fixing buffer (pH 5.0) and subsequently immobilized by amine coupling onto the chips. The immobilized level of AAL2 reached 3000 response units. Flow path 1 was used as an activated blank control for subtraction. Interaction experiments were performed in a running buffer of 50mM Tris-HCl pH 7.5, 150mM NaCl, 0.005% Tween 20 at a flow rate of 30μl/min. The ligands in sample buffer were employed as the flowing phase respectively. Each interaction experiment was carried out with gradient concentrations from low to high programmed to inject at a 30μl/min flow rate under a sensorgram. Binding curves were corrected, aligned and fitted kinetically using the BIAcore T100 evaluation program (GE Healthcare).
Determination of metal binding properties
The purified AAL2 was loaded on a HiTrap Desalting Column (GE Healthcare) with buffer containing 50mM Tris-HCl pH 7.5, 150mM NaCl, and 5mM EDTA to remove metal ions from the protein. The flow-through fraction of protein was incubated in 277K for 1 hr. and then loaded on a HiTrap Desalting Column once again with the buffer of 50mM Tris-HCl pH 7.5 and 150mM NaCl to remove EDTA and EDTA-coupled metal ions. The flow-through fraction of protein was then evenly divided into five parts, and was added solutions of MgCl2, CaCl2, MnCl2, CoCl2, and NiSO4, respectively, to concentrations of 10mM. All the samples were incubated at 277K overnight and then applied to a HiTrap Desalting Column again to remove free metal ions. The metal content of the five samples as well as the protein solution not processed with EDTA were determined on a Varian Inductively Coupled Plasma—Atomic Emission Spectrometry (ICP-AES) (Agilent Technologies, USA). SPR experiments were also performed on EDTA-treated sample in buffers containing 5mM CaCl2 to find out if the metal binding state of AAL2 would affect its affinity to ligands.
GlcNAc is the most effective ligand for AAL2
To measure the affinities between AAL2 and glycan ligands, we performed Surface Plasmon Resonance (SPR) experiments. The ligands we used were GlcNAc monosaccharide and GlcNAcβ1-3Galβ1-4GlcNAc, a trisaccharide showing very high affinity toward AAL2 in glycan array analysis. In SPR experiments, the experimental curves obtained automatically from different sensorgrams showed that both GlcNAc and GlcNAcβ1-3Galβ1-4GlcNAc could be captured efficiently by the sensor chip surface through binding to AAL2 and quickly dissociated to the baseline level (Fig 1). The data obtained from the analysis of stable-state affinity model showed that AAL2 binds to GlcNAc with a dissociation constant (Kd) of 28.31 μM (Fig 1a), while previous assays using ITC techniques obtained a Kd value of 195μM , which was comparable to other GlcNAc-binding fungal lectins such as PVL (Psathyrella velutina Lectin) (Kd = 214 μM)  and BLL (Boletopsis leucomelas lectin) (Kd = 435 μM) . AAL2 was also shown via SPR to bind to the trisaccharide with a Kd value of 41.65 μM, which is close to that of GlcNAc (Fig 1b), demonstrating that Gal-GlcNAc moiety beyond GlcNAc may not affect the binding between AAL2 and trisaccharide.
(a) The binding curves measured at different concentrations of GlcNAc and the steady state affinity indicated by the value of Kd. (b) The binding curves measured at different concentrations of GlcNAcβ1-3Galβ1-4GlcNAc and the steady state analysis shows that the affinity of AAL2 to the trisaccharide is close to but a little lower than that to GlcNAc.
Overall structures of AAL2 and AAL2-glycan complexes
The structures of AAL2, AAL2-GlcNAc and AAL2-GlcNAcβ1-3Galβ1-4GlcNAc were determined at resolutions of 2.0Å, 2.1Å, and 2.0Å, respectively. The data collection and refinement statistics are summarized in Table 1.
There are two virtually identical AAL2 monomers arranged around a two-fold pseudoaxis of symmetry in an asymmetric unit in both the structures of AAL2 and AAL2-GlcNAc, and one AAL2 monomer in AAL2-GlcNAcβ1-3Galβ1-4GlcNAc, while the protein exists as monomers in solution as determined from its elution volume on a Superdex 200 10/300 GL column. Discussions thereafter will be focused on monomer A unless specified otherwise.
As expected from the sequence homology, the structures of AAL2 and AAL2-GlcNAc complex are very similar to those of the corresponding structures of PVL, with root-mean-square deviations (RMSDs) of 0.7Å for Cα atoms.
AAL2 adopts a beta-propeller fold.
The overall structure of AAL2 monomer is a single domain with the shape of cylindrical torus with a diameter of about 51Å and a height of about 27Å, showing a very regular, seven-bladed β-propeller fold formed by the blades packing in a circular array (Fig 2a). Each blade is a β-sheet consisting of four anti-parallel β-strands arranged in a W-like shape and of about 56 residues in length, and is named W-motif according to its shape. Seven tandem W-motifs of the AAL2 monomer, numbered WI to WVII from N- to C-terminus (Fig 2b), are organized around a seven-fold axis of pseudo symmetry.
(a) Two orthogonal views of the crystal structure of native AAL2. (b) Topology diagram of AAL2 shows that the seven blades have a shifting of one β-strand compared with the seven sequence repeats. GlcNAc binding sites and calcium binding sites are represented by green hexagons and solid yellow circles. (c) Alignment of the seven W-motifs of AAL2. Solid red triangles, black triangles, and red pentagons indicate the residues forming polar interactions with GlcNAc, the residues coordinate calcium ions, and the differences between WI and other W-motifs making it incapable of binding GlcNAc, respectively.
The four β-strands of each W-motif are numbered 1–4 according to their spatial arrangement from AAL2 center to its surface (Fig 2b). The first β-strands of the seven W-motifs are located innermost of the AAL2 monomer and form a central tunnel that is filled with water molecules and almost blocked by a long N-terminus loop (except for a small opening). These 7 innermost strands point to the same direction, from the ‘entrance’ to the ‘exit’ of the central tunnel, and run almost parallel to each other along the direction of the seven-fold propeller axis (Fig 2a). Strands 1 and 2 are connected by a long loop of about eight residues, while strands 2 and 3 are connected by a short loop of only four residues (except for a β-turn in the first W-motif). Strands 3 and 4 are connected by a long loop of about eleven residues. After strand 4, the peptide chain goes towards the direction of the central tunnel through the large ‘connecting segment’ to start the strand 1 of next β-sheet. When the AAL2 molecule is viewed along the central 7-fold symmetry axis, all the seven ‘blades’ are twisted (Fig 2a right panel), with each β-strand twists about 30° clockwise compared to the prior one, resulting in the arrangements of the outmost β-strands as almost perpendicular to the propeller axis and form a outer circular edge of the propeller between themselves. This arrangement may help stabilize the protein structure. Sequence alignment (Fig 2c) and superposition of the seven W-motifs show that they have high homologies to each other.
What is notable is that the seven sequence repeats observed in AAL2 do not match exactly with the seven blades, but instead have a shifting of one β-strand (Fig 2b). WI consists of three consecutive strands, i.e., strands 2, 3 and 4 at the N-terminus, and one strand, i.e., strand 1, from the C-terminus. This property of ‘end to end’, which has been discovered in many propeller structures, has a great evolutionary advantage of stabilizing protein structures.
The carbohydrate binding sites
The structure of AAL2-GlcNAc complex revealed six binding sites out of the seven W-motifs, with the exception of motif WI. All the bound GlcNAc molecules have virtually identical orientations in the structure.
Glycan ligands are bound in shallow pockets located on the entrance side, between the connecting segment and the 2–3 loop of each W-motifs (except for WI) (Fig 3a). Electron density maps show that the two ligands in the complex are well ordered and all the carbohydrate rings in the ligands adopt chair conformation (Fig 3b). Both hydrogen bonds and hydrophobic interactions contribute to the binding of AAL2 with the ligands (Fig 3c).
(a) Horizontal view of AAL2-GlcNAc complex (left) and the surface electrostatic potential diagram of the vertical view of AAL2- GlcNAcβ1-3Galβ1-4GlcNAc complex (right). (b) Sigma-A weighted Fo-Fc omit electron density map around GlcNAc (left) and GlcNAcβ1-3Galβ1-4GlcNAc (right). (c) The hydrogen bonds and hydrophobic interactions between AAL2 and GlcNAc (left) and GlcNAcβ1-3Galβ1-4GlcNAc (right), respectively, shown in Ligplot.
AAL2-GlcNAc recognition pockets.
Because the six binding pockets of AAL2 for glycan ligands are almost identical, we chose motif WIII for detailed discussion and the descriptions also applied to other W-motifs unless specified otherwise. The carbohydrate binding pockets of AAL2 are very similar to those of PVL, as implied by the sequence homology.
The binding pocket for GlcNAc consists of two parts. The major part is formed by the segment connecting WII and WIII, including the side chains of Asn104, Trp112 and His117 and main chain atoms of Gly110 and Gly111. The other part is formed by the main chain atoms of Gly136, Glu137, and Lys138 from the 2–3 loop of WIII. Among the residues mentioned above, Asn104 is located at the beginning of the connecting segment and it is either Asn or Asp in the corresponding position of each blade, and offers its OD1 and ND2/OD2 to form hydrogen bonds with O3 and O4 hydroxyl groups of GlcNAc, respectively. The O3 hydroxyl of GlcNAc moiety also forms a hydrogen bond with NE1 of Trp112. The N-acetyl group of GlcNAc forms two hydrogen bonds with the main chain of the protein: N2 donates one to O of Gly110, and O7 carbonyl accepts one from N of Glu137 (Fig 4a). The hydrogen bonding networks between AAL2 and GlcNAc of all six binding sites are listed in Table 2. Furthermore, the inner part of the pocket comprised of Trp112, His117, Gly136 and Glu137 is hydrophobic, helping stabilize the complex structures (Fig 4b). With the presence of both the polar and hydrophobic interactions, GlcNAc is bound tightly in the shallow pocket formed on the protein surface, with the head of N-acetyl group buried inside the pocket and the reduced O1 as well as O6 hydroxyl pointing outside into the solvent region.
(a) Stereo view of GlcNAc bound to the binding pocket in WIII. (b) (c) Surface electrostatic potential diagrams of the binding pockets in WIII for GlcNAc and GlcNAcβ1-3Galβ1-4GlcNAc, respectively.
GlcNAcβ1-3Galβ1-4GlcNAc ligands are bound to the same pockets of AAL2, with the non-reducing terminal GlcNAc (1’-GlcNAc) adopted the same position as GlcNAc. The polar interactions between AAL2 and the trisaccharide are summarized in Table 3. The interactions between AAL2 and 1’-GlcNAc of the trisaccharide are identical to those with GlcNAc monosaccharide. Although there are several hydrogen bond interactions between AAL2, such as Lys138, and the LacNAc moiety after 1’-GlcNAc, AAL2 doesn’t demonstrate specific preference to the 2’-Gal or 3’-GlcNAc in all the six blades (Fig 4c), since the interactions seem to be random. This result was rather unexpected based on the glycan array. However, our results have further proved that AAL2 recognizes the terminal non-reducing GlcNAc moiety of a polysaccharide.
Essential residues and structural elements for substrate recognition.
Sequence alignment of all seven W-motifs (Fig 2c) shows that the residues involved in the polar interactions with GlcNAc are highly conserved in all the seven blades. We have mutated five Asn104-counterpart residues in other W-motifs to alanine and found that the mutant proteins could not bind the GlcNAc-coupled Sepharose 6B affinity column. This showed that Asn104 and its counterparts in other W-motifs are essential for hydrogen bond interactions with GlcNAc. The Tachylectin-2  from Tachypleus tridentatus binds GlcNAc in a manner almost identical to that of AAL2, although the binding sites of these two proteins do not share the same architecture (Fig 5). While residues Lys37 and Trp40 are responsible for the GlcNAc specificity of Tachylectin-2, their equivalent residues are Asn104 and Trp112, respectively. Several other residues contribute to hydrogen bond formation with the substrates with their main-chain atoms. Another important common element involved in GlcNAc-binding is a hydrophobic pocket for the accommendation of the acetyl group of GlcNAc, which is formed by two aromatic residues, i.e., residues Trp40 and Trp75 in Tachylectin and Trp112 and His117 in AAL2. We therefore propose that AAL2 represents a common mode of GlcNAc recognition, although more GlcNAc-binding lectin structures are needed to firmly establish the structural patterns.
(a) The comparison of hydrogen bonds network between AAL2 (left) and Tachylectin-2 (right) and GlcNAc. (b) Both polar and hydrophobia interactions between AAL2 (left) and Tachylectin-2 (right) and GlcNAc. GlcNAc is shown in purple and the residues that are marked bold show the same components of the glycan binding pockets in the two lectins.
Unique stereochemistry for AAL2 specific recognition.
The putative binding site of WI seems intact and should be able to bind one more GlcNAc. However, two differences rule out this possibility. Firstly, instead of glycine, it is a lysine at residue 389, with a relatively large side chain stretching towards the binding pocket and it would block the binding of GlcNAc to the protein (Fig 6a). Another difference is observed at residue 395, where a threonine residue replaces the conserved histidine residue, which, as mentioned above, contributes to the hydrophobic interaction in all other six W-motifs.
(a) Surface electrostatic potential diagram of WI that shows Lys395 blocked the potential glycan binding pocket. (b) Comparison between GlcNAc and Gal in conformation in the binding pocket of WIII from AAL2. (c) Superposition of the binding pockets in WIII from AAL2 (yellow) and PVL (magenta) shows that Neu5Ac (cyan) is not a specific ligand for AAL2.
Mutation assays also explained why AAL2 has the specificity towards GlcNAc instead of GalNAc. Since galactose is a C-4 epimer of glucose, the O4 hydroxyl group in GlcNAc is equatorial (in the "plane" of the ring) while in GalNAc it is axial (up from the "plane" of the ring). If a Gal moiety were to take the position and orientation of GlcNAc as in the complex structures, the key hydrogen bonds between Asn104 and O4 hydroxyl of GlcNAc would be disrupted (Fig 6b). AAL2 did not show any binding affinities toward Neu5Ac as assayed by SPR, even though it shares 60% sequence identity and has very similar structure to those of PVL, which, in contrast, binds both GlcNAc and Neu5Ac. This lack of affinity towards Neu5Ac might be resulted from the fact that Neu5Ac does not have the O4 hydroxyl to form the key hydrogen bond with AAL2 as mentioned above. Moreover, the side chain of residue Tyr 140 of PVL provides the Neu5Ac specificity by forming a hydrogen bond with sialic acid moiety of Neu5Ac (Fig 6c), but its counterpart in AAL2, i.e., residue Leu141, is too small to form a similar hydrogen bond. The structural evidences prove that AAL2 has an exclusive specificity to non-reducing terminal GlcNAc which perfectly fits with the glycan array and the hydrogen bond donated by O4 hydroxyl group of GlcNAc is crucial for its binding to AAL2. In other words, the corresponding residues of Asn104 play a most important role in determining the glycan specificity of AAL2.
Metal binding property of AAL2
In addition to glycans, AAL2 also binds two cations per molecule as observed in the structure of AAL2-GlcNAc complex (Fig 7a). Similar metal ion binding properties were also observed for PVL . These cations are located in the 1–2 loops of WV and WVII of AAL2, where highly negatively-charged cages are formed (Fig 7b).
(a) Overview of the locations of two Ca2+, which are represented by yellow spheres, in the whole structure. (b) Larger and detailed view of the image section in red frame of (a).
To identify the cation-binding preferences of AAL2, we tested Mg2+, Ca2+, Mn2+, Co2+and Ni2+for their binding affinities. The binding assay results are shown in Table 4. Because of heavy precipitation upon adding zinc salt into protein solution, the binding assay of Zn2+ was not shown. But there was indeed zinc in the purified native protein samples. Calcium is the main element AAL2 binds, which is the reason why we modeled cations in the AAL2-GlcNAc complex as Ca2+. The preference of the divalent cation type may lie in the diameter fitting the size of the negative charged pocket. SPR assay were also carried out and illustrated that the adding of Ca2+ did not affect the glycan affinity of AAL2. Although it has been reported that Ca2+ binding property is a common phenomenon observed in propeller structures and it was believed that the binding of Ca2+ play a role of stabilizing the overall structure , similar conclusions could not be made for AAL2 based on our studies. So the effect of metal binding of AAL2 remains to be investigated.
In this study, we have demonstrated that AAL2 has relatively high affinities and exclusive specificity towards terminal nonreducing GlcNAc and illustrated the structural basis of its ligand specificity.
Although further studies are required to reveal the antitumor mechanisms of AAL2, the high affinities and exclusive specificity of this lectin toward nonreducing GlcNAc moieties strongly point to the possibilities that AAL2 may exert its functions through interactions with O-GlcNAcylated proteins. Many tumor suppressors and oncogenes that involved in transcription regulations, such as c-Myc, p53, and retinoblastoma, are modified by O-GlcNAcylation. The extensive cross talk between O-GlcNAcylation and phosphorylation involving these proteins underlines the roles played by O-GlcNAcylation in tumor pathogenicity [4, 6], but the exact mechanisms and ligand proteins of AAL2 antitumor functions remain to be revealed.
The studies reported here, on the other hand, have demonstrated that AAL2 is a very promising candidate to replace WGA in glycan research for the enrichment and purification of O-GlcNAcylated proteins. WGA lectin weak affinity chromatography has been the most frequently used method in the enrichment and purification of O-GlcNAcylated proteins , but this method suffers from, as implied by its name, the weak affinity of WGA toward its ligands and low specificities, because WGA binds sialic acids in addition to O-GlcNAcylated proteins. AAL2, in contrast, has higher affinities and exclusive specificities towards terminal GlcNAc moieties. Further research is in progress in our labs to employ AAL2 in affinity chromatography for the efficient enrichment and purification of O-GlcNAcylated peptides and proteins.
The results reported here illustrated the structural basis of how AAL2 recognizes and binds terminal GlcNAc moiety and laid a solid foundation for us to reveal the antitumor mechanisms of this lectin and to develop a better reagent for glycan studies involving O-GlcNAcylated proteins.
We are grateful to Y. Han and S. Liu for technique support for in-house X-ray diffraction data collections, and Y. Chen for technique support for SPR assays. Coordinates of the AAL2 and its complexes with GlcNAc and GlcNAcβ1-3Galβ1-4GlcNAc have been deposited in the Protein Data Bank with the accession codes 4TQJ, 4TQK, and 4TQM.
Conceived and designed the experiments: HS DCW. Performed the experiments: XMR DFL SJ XQL. Analyzed the data: XMR DFL YH. Wrote the paper: XMR YH DCW.
- 1. Lis H, Sharon N. Lectins: carbohydrate-specific proteins that mediate cellular recognition. Chem Rev. 1998;92(2):28.
- 2. Sharon N, Lis H. History of lectins: from hemagglutinins to biological recognition molecules. Glycobiology. 2004;14(11):53R–62R. pmid:15229195.
- 3. Vijayan M, Chandra N. Lectins. Curr Opin Struct Biol. 1999;9(6):707–14. Epub 1999/12/23. pmid:10607664.
- 4. Hart GW, Slawson C, Ramirez-Correa G, Lagerlof O. Cross talk between O-GlcNAcylation and phosphorylation: roles in signaling, transcription, and chronic disease. Annu Rev Biochem. 2011;80:825–58. pmid:21391816; PubMed Central PMCID: PMC3294376.
- 5. Harwood KR, Hanover JA. Nutrient-driven O-GlcNAc cycling—think globally but act locally. J Cell Sci. 2014;127(Pt 9):1857–67. Epub 2014/04/26. pmid:24762810
- 6. Fardini Y, Dehennaut V, Lefebvre T, Issad T. O-GlcNAcylation: a new cancer hallmark? Front Endocrinol (Lausanne). 2013;4:99. Epub 2013/08/22. pmid:23964270; PubMed Central PMCID: PMC3740238.
- 7. Hart GW, Housley MP, Slawson C. Cycling of O-linked beta-N-acetylglucosamine on nucleocytoplasmic proteins. Nature. 2007;446(7139):1017–22. pmid:17460662.
- 8. Torres C-R, Hart GW. Topography and polypeptide distribution of terminal N-acetylglucosamine residues on the surfaces of intact lymphocytes. Evidence for O-linked GlcNAc. J Biol Chem. 1984;259(5):3308–17. pmid:6421821
- 9. Khan F, Khan MI. Fungal lectins: current molecular and biochemical perspectives. Int J Biol Chem. 2011;5(1):20.
- 10. She QB, Ng TB, Liu WK. A novel lectin with potent immunomodulatory activity isolated from both fruiting bodies and cultured mycelia of the edible mushroom Volvariella volvacea. Biochem Biophys Res Commun. 1998;247(1):106–11. Epub 1998/06/24. pmid:9636663.
- 11. Wang H, Gao J, Ng TB. A new lectin with highly potent antihepatoma and antisarcoma activities from the oyster mushroom Pleurotus ostreatus. Biochem Biophys Res Commun. 2000;275(3):810–6. Epub 2000/09/07. pmid:10973803.
- 12. Wasser SP, Weis AL. Therapeutic effects of substances occurring in higher Basidiomycetes mushrooms: a modern perspective. Crit Rev Immunol. 1999;19(1):65–96. Epub 1999/02/13. pmid:9987601.
- 13. Feng L, Sun H, Zhang Y, Li DF, Wang DC. Structural insights into the recognition mechanism between an antitumor galectin AAL and the Thomsen-Friedenreich antigen. Faseb J. 2010;24(10):3861–8. ISI:000285005900024. pmid:20530247
- 14. Jiang S, Chen Y, Wang M, Yin Y, Pan Y, Gu B, et al. A novel lectin from Agrocybe aegerita shows high binding selectivity for terminal N-acetylglucosamine. Biochem J. 2012;443(2):369–78. pmid:22268569; PubMed Central PMCID: PMC3316157.
- 15. Vosseller K, Trinidad JC, Chalkley RJ, Specht CG, Thalhammer A, Lynn AJ, et al. O-linked N-acetylglucosamine proteomics of postsynaptic density preparations using lectin weak affinity chromatography and mass spectrometry. Mol Cell Proteomics. 2006;5(5):923–34. ISI:000237415200014. pmid:16452088
- 16. Trinidad JC, Barkan DT, Gulledge BF, Thalhammer A, Sali A, Schoepfer R, et al. Global identification and characterization of both O-GlcNAcylation and phosphorylation at the murine synapse. Mol Cell Proteomics. 2012;11(8):215–29. ISI:000308024200001. pmid:22645316
- 17. Cioci G, Mitchell EP, Chazalet V, Debray H, Oscarson S, Lahmann M, et al. Beta-propeller crystal structure of Psathyrella velutina lectin: an integrin-like fungal protein interacting with monosaccharides and calcium. J Mol Biol. 2006;357(5):1575–91. pmid:16497330.
- 18. Ren X, Jiang S, Li D, Sun H, Wang D. Crystallization and preliminary crystallographic studies of AAL-2, a novel lectin from Agrocybe aegerita that binds nonreducing terminal N-acetylglucosamine. Acta Crystallogr F. 2013;69(6):650–2.
- 19. Leslie AG. The integration of macromolecular diffraction data. Acta Crystallogr D. 2006;62(Pt 1):48–57. Epub 2005/12/22. pmid:16369093.
- 20. 4 CCPN. The CCP4 suite: programs for protein crystallography. Acta Crystallogr D. 1994;50(Pt 5):760–3. Epub 1994/09/01. pmid:15299374.
- 21. Adams PD, Afonine PV, Bunkoczi G, Chen VB, Davis IW, Echols N, et al. PHENIX: a comprehensive Python-based system for macromolecular structure solution. Acta Crystallogr D. 2010;66(Pt 2):213–21. Epub 2010/02/04. pmid:20124702; PubMed Central PMCID: PMCPMC2815670.
- 22. Emsley P, Cowtan K. Coot: model-building tools for molecular graphics. Acta Crystallogr D. 2004;60(Pt 12 Pt 1):2126–32. Epub 2004/12/02. pmid:15572765.
- 23. Nakamura-Tsuruta S, Kominami J, Kamei M, Koyama Y, Suzuki T, Isemura M, et al. Comparative analysis by frontal affinity chromatography of oligosaccharide specificity of GlcNAc-binding lectins, Griffonia simplicifolia lectin-II (GSL-II) and Boletopsis leucomelas lectin (BLL). J Biochem. 2006;140(2):285–91. pmid:16835257.
- 24. Beisel HG, Kawabata S, Iwanaga S, Huber R, Bode W. Tachylectin-2: crystal structure of a specific GlcNAc/GalNAc-binding lectin involved in the innate immunity host defense of the Japanese horseshoe crab Tachypleus tridentatus. EMBO J. 1999;18(9):2313–22. pmid:10228146