Structural and Biochemical Insights into the Peptidoglycan Hydrolase Domain of FlgJ from Salmonella typhimurium

FlgJ is a glycoside hydrolase (GH) enzyme belonging to the Carbohydrate Active enZyme (CAZy) family GH73. It facilitates passage of the bacterial flagellum through the peptidoglycan (PG) layer by cleaving the β-1,4 glycosidic bond between N-acetylglucosamine and N-acetylmuramic acid sugars that comprise the glycan strands of PG. Here we describe the crystal structure of the GH domain of FlgJ from bacterial pathogen Salmonella typhimurium (StFlgJ). Interestingly, the active site of StFlgJ was blocked by the C-terminal α-helix of a neighbouring symmetry mate and a β-hairpin containing the putative catalytic glutamic acid residue Glu223 was poorly resolved and could not be completely modeled into the electron density, suggesting it is flexible. Previous reports have shown that the GH73 enzyme Auto from Listeria monocytogenes is inhibited by an N-terminal α-helix that may occlude the active site in similar fashion. To investigate if the C-terminus of StFlgJ inhibits GH activity, the glycolytic activity of StFlgJ was assessed with and without the C-terminal α-helix. The GH activity of StFlgJ was unaffected by the presence or absence of the α-helix, suggesting it is not involved in regulating activity. Removal of the C-terminal α-helix did, however, allow a crystal structure of the domain to be obtained where the flexible β-hairpin containing residue Glu223 was entirely resolved. The β-hairpin was positioned such that the active site groove was fully solvent-exposed, placing Glu223 nearly 21.6 Å away from the putative general acid/base residue Glu184, which is too far apart for these two residues to coordinate glycosidic bond hydrolysis. The mobile nature of the StFlgJ β-hairpin is consistent with structural studies of related GH73 enzymes, suggesting that a dynamic active site may be common to many GH73 enzymes, in which the active site opens to capture substrate and then closes to correctly orient active site residues for catalysis.

Most GH enzymes use two enzymatic carboxyl groups in a catalytic reaction that proceeds through one of two mechanisms. The first mechanism involves two-steps where, initially, one carboxyl group acts as a general acid to protonate the glycosidic oxygen while the other acts as a nucleophile that attacks the anomeric carbon of the glycosidic bond. This combined action cleaves the glycosidic bond and results in the formation of a glycosyl-enzyme intermediate. Subsequently, the carboxyl group that initially protonated the glycosidic oxygen acts as a general base to activate a water that attacks the anomeric center of the intermediate and yields a product with retained anomeric stereochemistry [15,16]. In the second, single-step mechanism, one carboxyl group acts as a general acid to protonate the glycosidic oxygen while the other acts as a general base to activate an intervening water molecule that attacks the anomeric carbon of the glycosidic bond, resulting in bond cleavage and a product with inverted anomeric stereochemistry [15,16]. Some GH enzymes use a third mechanism, that employ a two-step configuration-retaining mechanism that exploits the presence of a neighbouring 2-acetamido group on the substrate, as first predicted by Koshland [17]. In this case, the 2-acetamido group acts as a nucleophile in place of an enzyme carboxyl group; this process is known as substrateassisted catalysis (SAC; for a comprehensive review of glycosidase mechanisms see [18]).
Previous studies have identified an invariant glutamate in GH73 enzymes that likely acts as the general acid catalyst protonating the glycosidic bond in the above mechanisms [2,4,5,[19][20][21][22]. However, identification of a second conserved active site carboxylate remains elusive and it appears some GH73 enzymes may use substrate-assisted catalysis as opposed to the classical mechanisms that require two enzymatic carboxyl groups [5,19,20]. Additionally, it is unclear whether GH73 enzymes with two catalytic carboxylate-carrying residues proceed through a configuration retaining or inverting mechanism. Characterized examples of GH73 enzymes with two identified carboxylate residues that are currently predicted to use an inverting mechanism include FlgJ, Auto and TM06330 [2,4,5,21], while those presumably lacking a second catalytic carboxylate residue, and predicted to use a SAC mechanism, include the autolysins Atl WM , AcmA, and LytB [5,19,20].
To explore the structural basis of FlgJ activity, we determined the crystal structure of the FlgJ GH domain from S. typhimurium (StFlgJ) to 1.8 Å resolution. Interestingly, the active site of the domain was occupied by a C-terminal α-helix of a neighbouring symmetry mate. Previous reports have described similarly occluded active sites in the crystal structure of the GH73 enzyme Auto, a virulence factor of L. monocytogenes, as well as in the crystal structure of the Sphingomonas sp. FlgJ enzyme (SpFlgJ) [2,23]. The active sites of Auto and SpFlgJ were occupied in trans by an N-terminal α-helix and C-terminal strand, respectively, from neighbouring monomers within the crystal lattice. Furthermore, the N-terminal α-helix of Auto was previously found to inhibit glycolytic activity, and necessitated proteolytic removal in order to activate the domain [2]. The discovery of three GH73 enzymes (Auto, SpFlgJ and StFlgJ) with occluded active sites led us to hypothesize that autoinhibition by N-or C-terminal motifs may control the activity of many GH73 enzymes.
To investigate whether the C-terminus of StFlgJ regulates activity in the same way as the Nterminus of Auto, a related GH73 enzyme, the glycolytic activity of StFlgJ was measured in the presence and absence of the C-terminal α-helix using zymogram assays. Contrary to our hypothesis, we found the catalytic activity of StFlgJ to be unaffected by presence or absence of its C-terminal α-helix, suggesting the helix does not regulate function. Removing the C-terminal α-helix, however, allowed us to determine a 2.1 Å crystal structure of the domain in which the extended β-hairpin, known to accommodate a second putative catalytic carboxyl group, was well ordered and revealed that the active site of StFlgJ seems capable of opening widely, possibly aiding in substrate capture and turnover.

Expression, purification and crystallization of GH73 enzyme StFlgJ
The open reading frame for the C-terminal glycoside hydrolase domain of StFlgJ (residues 151-316; NCBI Gene ID: 1252700) was amplified by PCR from genomic DNA isolated from Salmonella enterica subsp. enterica serovar Typhimurium str. LT2 (B) using primers 5'-GATATACATATGGACAGTAAAGACTTTCTGGCC-3' and 5'-GATATAGGATCCAAAGAGAT TGTCGAGATTCGC-3'. The primers introduced NdeI and BamHI restriction sites at the 5' and 3' ends of the PCR amplicon, respectively, to facilitate ligation of the DNA into a modified pET28b(+) vector that fuses a His 6 purification tag in-frame to the 3'-end of the GH domain open reading frame. The resulting expression vector, pET28-StFlgJ GH(151-316) , was used to transform Escherichia coli BL21 Gold (DE3) cells. The cells were grown at 37°C to an optical density (OD 600nm ) of 0.4-0.6 in 500 mL LB medium supplemented with 35 μg/mL kanamycin (Fisher). The culture temperature was subsequently lowered to 28°C, and recombinant protein expression induced with the addition of 1 mM isopropyl-D-thiogalactoside (IPTG) (Invitrogen) for 4 hours with shaking. Cells were pelleted by centrifugation (3450 x g, 20 min at 4°C) and either stored at -80°C or immediately re-suspended in 20 mL ice cold lysis buffer (1M NaCl, 50 mM Tris-HCl pH 8.0, 0.1 mM phenylmethylsulfonyl fluoride) for cell disruption. Cells were lysed by French press (Aminco) and the soluble fraction recovered by centrifugation (17200 x g, 30 min at 4°C). StFlgJ GH(151-316) was isolated by affinity chromatography using Ni-NTA resin (Qiagen) pre-equilibrated with the lysis buffer. Recombinant protein was bound to the resin over a 20 min incubation period with agitation on ice and transferred to a column for thorough washing with lysis buffer (1M NaCl, 50 mM Tris-HCl pH 8.0), which was also used as a binding buffer. StFlgJ GH(151-316) was eluted using 750 mM NaCl, 37.5 mM Tris-HCl pH 8.0, 250 mM imidazole. The eluted protein was dialyzed overnight against 800 mM NaCl, 10 mM Tris-HCl pH 8.0; and once more the following day against 500 mM NaCl, 10 mM Tris-HCl pH 8.0. The dialyzed protein was concentrated and purified using an FPLC (ÄKTA) and HiLoad 16/60 Superdex 75 size exclusion column (GE Healthcare) equilibrated with 500 mM NaCl 10 mM Tris-HCl pH 8.0. Fractions containing StFlgJ GH(151-316) were pooled, concentrated and subjected to crystallization trials.
To remove the 15 amino acids from the C-terminus in the StFlgJ GH(151-316) GH domain found to be blocking the domain active site in trans, the plasmid pET28-StFlgJ GH(151-316) was redesigned using a new primer set (5'GATATACATATGGACAGTAAAGACTTTCTGGCC-3' and 5'-GATATAGGATCCACTCATCGCTTTCAACTGCTG-3', sense and antisense, respectively) which amplified a sequence that excluded the last 15 codons of the full-length domain and placed a BamHI site at the 3'end. The modified StFlgJ amplicon, encoding residues 151-301 of FlgJ, was ligated into the pET28b(+) vector described above to generate pET28-StFlgJ GH(151-301) , from which protein was expressed and purified as describe above for StFlgJ GH(151-316) .
Crystallization of StFlgJ GH(151-316) catalytic domain was performed at 20°C using the hanging drop vapor diffusion method. Crystal were grown within 72 hours using 2 μL of protein solution at 35 mg/mL in 500 mM NaCl 10 mM Tris-HCl pH 8.0 mixed with an equal volume of mother liquor consisting of 18-22% (w/v) polyethylene glycol 3350 and 250-350 mM NaI. The drops were equilibrated by vapour diffusion against 150 μL of mother liquor. Crystals grew to maximum dimensions of~0.1 x 0.1 x 1 mm. StFlgJ GH(151-301) and StFlgJ GH(151-316) crystalized under identical conditions, albeit in different space groups.
X-ray data collection, processing and structure determination Crystals were harvested by sweeping through a solution containing 22% (w/v) PEG 3350, 350 mM NaI, 15% (v/v) glycerol, before being flash-cooled in liquid nitrogen. X-ray diffraction data were collected in-house at 93K from individual crystals using a MicroMax-007 HF X-ray source and R-axis 4++ detector (Rigaku). X-ray data were integrated using iMosflm and scaled and merged using Aimless within the CCP4 software suit [24]. Both StFlgJ GH(151-316) and StFlgJ GH(151-301) crystals were grown in the presence of iodide and initial phase estimates could be determined using a single wavelength anomalous dispersion (SAD) phasing experiment, using an in-house Cu K-α X-ray source. Iodide substructures used for phasing diffraction data were identified using phenix.autosol [25]. All models were built using Coot [26], and refinement was carried out using phenix.refine [25].
The zymogram gel was prepared as previously described (15% acrylamide gel containing 0.2% M. lysodeikticus lyophilized cells (Sigma), but no sodium dodecyl sulfate (SDS) [2]. Protein samples were mixed with sample loading buffer lacking SDS and β-mercaptoethanol and not boiled prior to gel loading to prevent denaturation. Following electrophoresis, the zymogram gel was rinsed with Milli-Q H 2 O, prestained with 100 mM NaAcetate, pH 4.8 and stained with 0.1%Methylene Blue, 0.01% KOH as previously described and destained in Milli-Q H 2 O overnight [2]. Standard SDS-PAGE analysis on a 12% acrylamide gel was carried out to assess protein purity.

Results and Discussion
The crystal structure StFlgJ GH(151-316 had a single molecule comprising the asymmetric unit and was refined to 1.8 Å resolution (Table 1). The domain was found to adopt a lysozyme-like fold common to other GH73 enzymes, which consists of an α-helical subdomain packed against a β-lobe that forms an extended β-hairpin motif [2,5,23] (Fig 1A). The α-helical domain and β-hairpin of StFlgJ GH(151-316) together form a groove-shaped active site, with the α-helical subdomain containing an α-central helix that houses a conserved glutamate general acid/base (E184), and the β-hairpin, in turn, containing a second glutamate residue (E223) previously determined to be required for GH activity [4]. Residues 223-229-which form a loop connecting the two strands of the β-hairpin motif-were disordered and could not be modeled. Similar results have been reported for the crystal structures of other GH73 enzymes, indicating that the β-hairpin is likely mobile in this family of enzymes [5,[21][22][23]. Interestingly, the active site of StFlgJ GH(151-316) was found to be occupied by the C-terminal α-helix of a neighbouring symmetry mate (Fig 1B). The C-terminal α-helix StFlgJ GH(151-316) packs into the active site of the symmetry mate via hydrophobic and hydrogen-bonding interactions. Hydrophobic interactions were found to occur between L312 of the helix and F204 and Y249 of the active site of the symmetry mate, as well as between Y308 of the helix and L183 and Y286 of the symmetry mate active site. Hydrogen bonds occur between Y308 and N311 of the helix and K289 and backbone amide of Y280 of the symmetry mate active site. A similar in trans blockage of the GH domain active site of SpFlgJ has also been observed, where the C-terminus extends into the active site of its neighbour [21,23] (Fig 1C). Unlike StFlgJ, the C-terminal extension of SpFlgJ forms extensive hydrogen-bonding interactions with the active site of the symmetry mate that includes the putative general acid/base residue E185, which forms a salt bridge and hydrogen bond with R312 of the C-terminal extension. The observed active site occlusion in these FlgJ proteins is reminiscent of the N-terminal α-helix of the L. monocytogenes GH73 enzyme Auto, which also blocks the active site in trans in the crystalline state [2]. As for SpFlgJ (Fig 1C), an arginine (R64) on the N-terminal α-helix of Auto forms a salt bridge and hydrogen bond with the active site general acid base E122 of the enzyme ( Fig  1D). It should be noted that Bublitz et al. found it necessary to truncate the N-terminus of Auto to achieve crystallization of the protein, and the in trans obstruction of the Auto active site in the crystal structure was suggested to be non-physiological, since the protein is monomeric in solution [2]. However, they also proposed that in solution, the natural, full-length Nterminus blocks the active site of Auto in cis and they demonstrated that it must be removed proteolytically for the enzyme to become activated [2]. A superposition of the StFlgJ and SpFlgJ GH domain structures with Auto reveals that the C-and N-termini of these enzymes are spatially conserved (Fig 2), which suggested that FlgJ might be auto-inhibited by the C-terminal αhelix, in similar fashion to the auto-inhibition of Auto by an N-terminal α-helix.
To determine whether the C-terminus of StFlgJ GH(151-316) autoinhibits the enzyme in solution, the full-length StFlgJ GH domain, StFlgJ GH(151-316) and a C-terminal truncated form of the domain (StFlgJ GH(151-301) ) were tested for activity using a zymogram assay. Contrary to our hypothesis, the assays revealed that the StFlgJ GH domain is fully active both with and without the C-terminal α-helix, indicating that the helix does not auto-inhibit StFlgJ GH activity and is likely not functionally equivalent to the N-terminal auto-inhibitory helix of Auto (Fig 3). In contrast, mutating the putative catalytic residue E223 to glutamine in the beta-hairpin of StFlgJ GH(151-316) reduced hydrolase activity to residual levels, which is consistent with previous work identifying E223 as a catalytic residue [4]. Given the C-terminus of StFlgJ did not block activity of the enzyme as we had anticipated, it may instead participate in coiled-coil interactions with other flagellar component proteins as previously proposed [30].
Next, we determined the crystal structure of the truncated GH domain (StFlgJ GH(151-301) ) to 2.2 Å resolution (Table 1) to study the active site of StFlgJ in the absence of the C-terminal αhelix. The asymmetric unit contained three copies of the truncated domain (Fig 4A). Though the C-terminal α-helix no longer occupied the active site, the catalytic β-hairpin of each domain was found to partly occlude the active site of neighbouring domains in the crystal  StFlgJ GH(151-316) . A) Cartoon diagram of the FlgJ GH domain from Salmonella typhimurium (StFlgJ GH(151-316) ) (blue) is shown with the catalytic glutamate (E184; orange), within the active site groove. Circled is the incomplete β-hairpin, with missing residues 223-229, that forms half of the active site groove. B) All molecules within the crystal structure were found to have their active sites blocked in trans by the C-terminal α-helix (red) of a neighbouring symmetry mate (grey) (conserved catalytic residue E184 is orange). C) In trans blockage of the GH domain active site of SpFlgJ (grey carbons) by the C-terminus of a symmetry mate (green carbons) [21,23]. D) In trans blockage of the GH domain active site of Auto (grey carbons) by the Nterminal inhibitory helix of a symmetry mate (magenta carbons) [2]. Structural models were generated using PyMol [28].   (Fig 4B). While this crystal packing did not provide an unobstructed active site, it did impart significant structural order to the β-hairpin. The β-hairpin was found to be located further from the helical domain of the enzyme than any previously reported structure of a GH73 enzyme, including Auto and SpFlgJ (Fig 5). A superposition of StFlgJ with the structures of other GH73 enzymes reveals substantial differences in the position of the β-hairpin harboring the putative catalytic E223 residue (Fig 5).
FlgJ [4,9,21], Auto [2], and the recently characterized GH73 enzyme from Thermotoga maritima TM0633 [5], all have a conserved glutamate on the β-hairpin that is required for catalysis, which suggests one of three possible catalytic mechanisms: 1) With substrate bound, the β- Each ASU is comprised of three copies of StFlgJ GH(151-301) . One ASU is coloured green with three neighbouring ASUs coloured grey for clarity. B) The β-hairpin of each monomer of each ASU packs identically against the active site of a monomer within a neighbouring symmetry related ASU (grey) within the crystal structure. This packing imparts order to β-hairpin of each monomer while it partly occluding the active site of each copy of the protein in the crystal (conserved active site residue E184 shown in orange). hairpin moves deep enough into the active site to allow the glutamate to act as a nucleophile, directly attacking the anomeric carbon of GlcNAc to promote glycosidic bond cleavage that proceeds via a glycosyl-enzyme intermediate where anomeric stereochemistry is retained; 2) The hairpin does not move deep enough into the active site to allow the glutamate to directly attack GlcNAc but instead positions the glutamate to activate an intervening water molecule that, in turn, attacks the anomeric carbon of GlcNAc to promote glycoside bond cleavage with inversion of anomeric configuration; or finally 3) The β-hairpin moves to position the glutamate so that it can help facilitate a substrate assisted catalytic mechanism whereby the 2-acetamido group of GlcNAc acts as the nucleophile to drive a reaction that proceeds via an oxazolinium ion intermediate and generates a product with retained stereochemistry. Current structural and functional studies point towards the inverting mechanism (#2 above) as the likely mechanism given the large distances (~10-20 Å) that have been observed between the glutamate general acid on the helical domain and putative general base on the β-hairpin of these enzymes. GH enzymes using an inverting mechanism typically have the general acid and general base~10.5 Å apart, whereas, those using a retaining mechanism that proceeds through a glycosyl-enzyme intermediate place these residues~5.5 Å apart [16]. Though a distance approaching~5.5 Å has yet to be observed between the catalytic residues of a GH73 enzyme, given the apparent flexibility of the β-hairpin either mechanism remains possible and stereochemical outcome studies will be needed to rule out the latter.
Alternatively, GH18, GH20, GH56, and GH84 enzymes employ a second enzyme carboxyl group (independent of the general acid/base) that appears to activate the 2-acetamido group of the substrate in order to participate in a substrate-assisted reaction and help to stabilize the  [2] and SpFlgJ (PDB: 3VWO; yellow). β-hairpin residues (E223 for StFlgJ GH(151-301) , E156 for Auto, and E224 for SpFlgJ) are shown on each β-hairpin. Also superposed is the structure of StFlgJ GH(151-316) (PDB: 5DN4; blue). Superposition using the DaliLight server [29] showed, for Auto: C α rmsd of 3.3 Å over 140 equivalent positions to StFlgJ GH(151-301) ; SpFlgJ: C α rmsd of 2.3 Å over 140 equivalent positions to StFlgJ GH(151-301) . doi:10.1371/journal.pone.0149204.g005 Structure of Salmonella typhimurium FlgJ Hydrolase Domain resulting oxazolinium ion intermediate (#3 above) [31][32][33][34][35]. The SAC mechanism of GH85 enzymes uses an asparagine in place of an enzyme carboxylate to facilitate 2-acetamido group participation [36]. While it is possible that the glutamate on the β-hairpin could serve as the second carboxyl group that engaging the 2-acetamido group in a SAC mechanism, enzymes that use a SAC mechanism typically have ridged active site architectures and a number of additional residues to help orient the carbonyl oxygen of the 2-acetamido within striking distance of the anomeric center of GlcNAc [37]. Evidence for these additional residues is lacking in StFlgJ, SpFlgJ and Auto, which may make this mechanism less likely for these enzymes. Interestingly however, there exist additional groups of GH73 enzymes where the β-hairpin contains a carboxyl group that is not needed for catalysis, as in Staphylococcus warneri M Atl WM [20]; in the GH73 enzyme AcmA from Lactococcus lactis ssp. cremoris MG1363, the β-hairpin contains a carboxyl group (Glu) that can be exchanged for an amide group (Gln) without complete loss of activity [19]. Additionally, some GH73 sequences appear to completely lack a suitable acidic residue within the β-hairpin [5]. These enzymes may indeed use a SAC mechanism that does not depend on the use of a second enzymic carboxyl group, but may instead use an alternative residue, such as an asparagine as found in GH85 enzymes. Further structural and functional studies, including the development of straightforward kinetic assays using substrate analogues, will be needed to resolve which catalytic mechanism(s) are employed by GH73 enzymes.