P. aeruginosa SGNH Hydrolase-Like Proteins AlgJ and AlgX Have Similar Topology but Separate and Distinct Roles in Alginate Acetylation

The O-acetylation of polysaccharides is a common modification used by pathogenic organisms to protect against external forces. Pseudomonas aeruginosa secretes the anionic, O-acetylated exopolysaccharide alginate during chronic infection in the lungs of cystic fibrosis patients to form the major constituent of a protective biofilm matrix. Four proteins have been implicated in the O-acetylation of alginate, AlgIJF and AlgX. To probe the biological function of AlgJ, we determined its structure to 1.83 Å resolution. AlgJ is a SGNH hydrolase-like protein, which while structurally similar to the N-terminal domain of AlgX exhibits a distinctly different electrostatic surface potential. Consistent with other SGNH hydrolases, we identified a conserved catalytic triad composed of D190, H192 and S288 and demonstrated that AlgJ exhibits acetylesterase activity in vitro. Residues in the AlgJ signature motifs were found to form an extensive network of interactions that are critical for O-acetylation of alginate in vivo. Using two different electrospray ionization mass spectrometry (ESI-MS) assays we compared the abilities of AlgJ and AlgX to bind and acetylate alginate. Binding studies using defined length polymannuronic acid revealed that AlgJ exhibits either weak or no detectable polymer binding while AlgX binds polymannuronic acid specifically in a length-dependent manner. Additionally, AlgX was capable of utilizing the surrogate acetyl-donor 4-nitrophenyl acetate to catalyze the O-acetylation of polymannuronic acid. Our results, combined with previously published in vivo data, suggest that the annotated O-acetyltransferases AlgJ and AlgX have separate and distinct roles in O-acetylation. Our refined model for alginate acetylation places AlgX as the terminal acetlytransferase and provides a rationale for the variability in the number of proteins required for polysaccharide O-acetylation.


Introduction
Pseudomonas aeruginosa is an opportunistic, Gram-negative pathogen that can cause acute and chronic infections. The bacterium is the dominant bacterial species in the lungs of cystic fibrosis (CF) patients and if left untreated, is the leading cause of morbidity and mortality among these individuals [1]. P. aeruginosa is able to persist through the formation of a biofilm where communities of surface attached bacteria are encapsulated in a matrix composed primarily of secreted extracellular polysaccharides. Bacteria embedded within a biofilm are more resistant than their planktonic counterparts to environmental stresses such as antibiotics and disinfectants, and are able to evade the defense mechanism(s) of the host [2][3][4][5][6][7]. P. aeruginosa has the genetic capability to produce at least three different biofilm exopolysaccharides: Pel, Psl, and alginate [8,9]. P. aeruginosa clinical isolates obtained from CF patients with chronic pulmonary infections secrete large amounts of alginate [10,11]. This exopolysaccharide is synthesized in the cytoplasm and is translocated across the inner membrane as a linear homopolymer of D-mannuronic acid [12,13]. The polymer is subsequently modified in the periplasm through O-acetylation and epimerization to form a b1-4 linked non-repeating chain of D-mannuronic acid and its C5 epimer Lguluronic acid [14,15].
Modification of polysaccharides through the addition or removal of acetate is an important biological process for survival and virulence in many bacterial species. For example, biofilm formation by the human pathogens Escherichia coli, Staphylococcus aureus and Staphylococcus epidermidis, requires the partial de-N-acetylation of the exopolysaccharide poly-b-1,6-N-acetyl-Dglucosamine (PNAG) [16][17][18][19]. Similarly, deacetylation of Pel from P. aeruginosa is required for Pel-dependent biofilm formation [20] while deacetylation of the holdfast polysaccharide synthesized by Caulobacter crescentus is required for adhesion and cohesion [21]. In each case the deacetylation or removal of acetate from the acetylated polysaccharide requires a single enzyme, which has been shown to be a member of carbohydrate esterase family 4 (CE4). In comparison, O-acetylation of polysaccharides is a complex process requiring two enzyme functionalities. The first functionality is the transfer of an acetyl-donor into the periplasm, which is hypothesized to be catalyzed by a membrane bound Oacetyltransferase (MBOAT) [22]. The second functionality, catalyzed by a periplasmic O-acetyltransferase, transfers acetate onto the polysaccharide. There are currently three distinct systems that are differentiated by the number of proteins required for Oacetylation; a single protein system whereby both functionalities are encoded on one polypeptide, a two-protein system with one MBOAT and one periplasmic O-acetyltransferase and, in the case of alginate and cellulose, a four protein system comprised of an MBOAT protein, two periplasmic O-acetyltransferase and a protein of unknown function [22,23].
The O-acetylation of the C6 hydroxyl of muramoyl residues in peptidoglycan is critical in many human bacterial pathogens including; Methicillin-resistant S. aureus (MSRA), Bacillus anthracis, Neisseria meningitides and Neisseria gonorrhoeae [24,25] as it confers resistance to degradation by endogenous autolysins and the host immune system during infection [26]. A single integral membrane O-acetylpeptidoglycan transferase (Oat) is utilized in Gram-positive bacteria, while Gram-negative bacteria utilize a two-protein system composed of PatA, the MBOAT, and PatB, the periplasmic O-acetyltransferase [27][28][29]. The Oacetylation of alginate in P. aeruginosa, is an important modification as acetylated alginate is less susceptible to recognition and clearance by the host immune system than its non-Oacetylated counterpart [30]. Similar to the acetylation of cellulose by Pseudomonas fluorescens through the combined action of WssG, WssH, WssI and WssF, the O-acetylation of alginate requires four proteins; AlgF, AlgI, AlgJ and AlgX [12,13,[31][32][33]. Acetylation of alginate can occur at the C2 and C3 hydroxyl groups of mannuronic acid residues. AlgI is predicted to be a member of the MBOAT family, while AlgJ and AlgX are required in the O-acetylation of alginate as the polymer passages through the periplasm [13,31,[33][34][35]. The function of AlgF, which is not predicted to have a catalytic domain, is currently unknown. It is also unclear why alginate acetylation requires two active Oacetyltransferases.
To probe the role of AlgX in alginate acetylation, we recently determined its structure and found that it is a two-domain protein with an N-terminal SGNH hydrolase-like domain and a Cterminal carbohydrate-binding module (CBM) [34]. Our in vivo functional characterization of AlgX demonstrated that three catalytic residues, D174, H176 and S269, located in the active site are required for alginate O-acetylation. In the present study, we sought to delineate the role of AlgJ, and examine why both AlgJ and AlgX are required for alginate acetylation [13]. To this end, we have determined the structure of Pseudomonas putida AlgJ 75-370 (PpAlgJ 75-370 ) to 1.83 Å resolution and have functionally characterized the protein.

Results
AlgJ contains an SGNH hydrolase-like core The TMHMM server v2.0 indicates that AlgJ from P. aeruginosa PAO1 possesses a transmembrane helix from residues 12-29 that tethers the periplasmic domain to the cytoplasmic membrane [36]; a prediction that is supported by in vivo localization studies [13]. To probe the function of AlgJ in alginate biosynthesis, we attempted to crystallize a soluble domain, residues 79-379, of P. aeruginosa AlgJ (PaAlgJ  ). Although this protein was recalcitrant to crystallization, we were able to

Author Summary
Bacteria utilize many defense strategies to protect themselves against external forces. One mechanism used by the bacterium Pseudomonas aeruginosa is the production of the long sugar polymer alginate. The bacteria use this polymer to form a biofilm -a barrier to protect against antibiotics and the host immune response. During its biosynthesis alginate undergoes a chemical modification whereby acetate is added to the polymer. Acetylation of alginate is important as this modification makes the bacterial biofilm less susceptible to recognition and clearance by the host immune system. In this paper we present the atomic structure of AlgJ; one of four proteins required for O-acetylation of the polymer. AlgJ is structurally similar to AlgX, which we have shown previously is also required for alginate acetylation. To understand why both enzymes are required for O-acetylation we functionally characterized the proteins and found that although AlgJ exhibits acetylesterase activity -catalyzing the removal of acetyl groups from a surrogate substrate -it does not bind to short mannuornic acid polymers. In contrast, AlgX bound alginate in a length-dependent manner and was capable of transfering acetate from a surrogate substrate onto alginate. This has allowed us to not only understand how acetate is added to alginate, but increases our understanding of how acetate is added to other bacterial sugar polymers.
crystallize the orthologous domain from P. putida, PpAlgJ 75-370 , which shares 83% and 53% sequence similarity and identity to PaAlgJ 79-379 , respectively. Selenomethionine-incorporated (Se-Met) protein was expressed, purified, crystallized, and the structure determined to 1.83 Å using the single-wavelength anomalous dispersion (SAD) technique (Table 1). PpAlgJ  crystallized in space group C2 with two molecules in the asymmetric unit. After iterative rounds of model building and refinement, the structure yielded models with an R work and R free of 17.5% and 21.2%, respectively (Table 1). Analytical size exclusion chromatography suggests that AlgJ is a monomer in solution (data not shown). The dimer observed in the crystal structure is therefore not believed to be biologically relevant. The two molecules in the asymmetric unit superimpose well with a root mean square deviations (RMSD) of 0.19 Å over the 906 aligned backbone atoms. Due to the poor quality of the electron density we were unable to build residues N75 to G77 and A266 to K278 in both molecules, and residues R78 and L242 in molecule A, and P244 and L245 in molecule B. Given the absence of two residues in the loop between L242 and F246 in molecule B, all structural analyses were performed using molecule A. All structural features defined using molecule A were also present in molecule B with no significant deviations.
The SGNH hydrolase superfamily consists of enzymes with varying hydrolytic activities (e.g., proteases and lipases) [37]. SGNH hydrolases have an a/b/a fold with each of the four conserved active site residues responsible for catalysis residing in one of four conserved blocks. The structure of PpAlgJ 75-370 reveals that the protein has an a/b/a fold with a core of four parallel bstrands ( Figure 1: b3, b6-8) and an isolated b-bridge at b3, which are comparable to the five parallel b-strands found in canonical SGNH hydrolases [37,38]. The core b-strands are surrounded by nine a-helices (Figure 1: a1, a3-10) that complete the a/b/a fold. A surface representation of PpAlgJ 75-370 reveals a shallow groove that crosses the face of the protein (Figure 2A, left, centre). Examination of the electrostatic surface potential shows a distinct region of electronegativity on the face of the protein located within and below the shallow groove ( Figure 2B, centre).
Previous in vivo complementation experiments have suggested that AlgJ functions as an O-acetyltransferase and that this activity is dependent on conserved residues D193 and H195 (P. aeruginosa numbering) [33]. In PpAlgJ the corresponding residues D190 and H192 are located in the shallow electronegative groove ( Figure 2B, centre). Residues within the groove are well conserved in AlgJ homologs from six Pseudomonas sp. (P. putida, P. aeruginosa, P. syringae, P. protegens, P. entomophila, and P. alkylphenolia) and Azobacter vinelandii (Figure 2A, centre). A 90u rotation along the horizontal axis relative to the groove depicts a generally electroneutral surface ( Figure 2B, right) that also contains highly conserved residues (Figure 2A, right).
A search for structurally similar proteins using DALI reveals that the core of AlgJ is most similar to the N-terminal domain of P. aeruginosa AlgX, residues 44-344 (PaAlgX 44-344 ), with an RMSD of 2.06 Å over 165 aligned Ca-atoms ( Figure 3A). The active site of PpAlgJ 75-370 and the orientation of the putative catalytic triad D190, H192 and S288, is analogous to PaAlgX 27-474 and other serine esterases and proteases ( Figure 3B) [39,40]. Similar to PaAlgX 27-474 , the structure of PpAlgJ 75-370 reveals several differences relative to canonical SGNH hydrolases [32]. Firstly, block III that contains the conserved asparagine residue that forms part of the oxyanion hole is absent ( Figure 3C). Examination of the PpAlgJ 75-370 structure suggests that Y348 is in a position to act as a hydrogen bond donor in place of the conserved asparagine ( Figure 3B). In PaAlgX 27-474 a tyrosine residue, Y328, also occupies this position. PpAlgJ 75-370 also deviates from the canonical GDSL(S) and DxxH motifs in blocks I and V, respectively ( Figure 3C). AlgJ has a GTSYS motif in block I which contains the catalytic serine (S288), while a single spacer residue in block V separates the two remaining catalytic triad residues (D190 and H192) to form a DxH motif. AlgJ appears to be a circularly permuted member of SGNH hydrolases as the order of conserved residues in primary sequence (H-S-G-Y) is different than that of SGNH hydrolases (S-G-N-H). However, the overall three dimensional shape and fold are structurally similar despite the rearrangement of conserved residues. PpAlgJ 75-370 also contains secondary structure features not present across SGNH superfamily members. Two long anti-parallel b-strands are present on one side of the protein (b4 and b5, Figure 1), along with eight 3 10 helices (t1-8) and a small cap domain above the proposed active site which consists of two short anti-parallel b-strands (b1-2), five 3 10 helices (t1-3, t5-6) and one a-helix (a2) (Figure 1). Despite these differences, PpAlgJ 75

The AlgJ signature motifs
Two conserved sequence motifs, termed the AlgJ signature motifs, have been defined in P. aeruginosa AlgJ and its homologs [33]. These motifs are characterized by conserved regions of WWWPxK (W represents any hydrophobic residue), and (R/ K)TDTHW. Mutation of residues within these signature motifs leads to impairment or ablation of alginate O-acetylation [33]. Utilizing the structure of PpAlgJ 75-370 , we identified the location of the signature motif residues ( Figure 4A). Two distinct types of intramolecular interaction networks are observed within the motifs, which localize to the cap domain. The cap domain sits atop the SGNH hydrolase-like core that typically contains the active site and catalytic residues of canonical SGNH hydrolases [37,38]. The first network composed of residues; K134, T189, D190 and H192 form a hydrogen-bonding network with L187, D254 and S288 ( Figure 4B) in PpAlgJ  . It was previously observed that alginate O-acetylation was abrogated in vivo for the PaAlgJ variants K137A, D193A and H195A [33]. Superposition of PpAlgJ 75-370 with AlgX  suggests that D190 and H192 (D193 and H195 in PaAlgJ 79-379 ) form part of the catalytic triad ( Figure 3). Thus, two of the three catalytic triad residues (aspartic acid and histidine) in AlgJ reside in the separate cap domain distinct from the a/b/a fold, yet occupy an equivalent spatial position in the active site to their SGNH counterparts. The location of D193 and H195 in PaAlgJ (D190 and H192 in PpAlgJ) and the ablation of acetylation in vivo for the D193A and H195A variants provide further evidence of their crucial role in the catalytic mechanism of the protein. The second intramolecular interaction network is composed of a series of hydrophobic interactions centered around the conserved W193, with V131, Y289, W295 and F297 ( Figure 4C). W193 is completely buried and comprises part of the hydrophobic core of AlgJ.

The Ser-His-Asp triad is responsible for acetylesterase activity in vitro
To determine whether AlgJ is catalytically active we examined the ability of the enzyme to exhibit O-acetylesterase activity, a commonality among SGNH hydrolases and the first half of the acetyltransferase reaction. PaAlgJ 79-379 and PpAlgJ 75-370 were both assayed to demonstrate that the enzymes are functionally equivalent. Using the substrate 3-carboxyumbelliferyl acetate, the kinetic parameters for PaAlgJ  and PpAlgJ 75-370 were observed to be comparable (Table 2), with a 2-fold difference in K m . The k cat /K m obtained for AlgX, which has been previously demonstrated to catalyze this reaction, differed by only 3-fold compared to the AlgJ orthologs. To assess the function of putative Conservation is displayed from magenta (highly conserved) to cyan (variable). Surface residue conservation was analyzed using ConSurf [71]. The dashed circle indicates the region where the shallow groove was identified. (B) and (C) Electrostatic surface representation of PpAlgJ 75-370 and PaAlgX 44-344 , respectively. Electrostatic surface potentials were calculated using the APBS Tools 2.1 plugin within PyMOL [70]. The electrostatic surfaces are displayed in both panels from 27 kT/e (red) to +7 kT/e (blue). doi:10.1371/journal.ppat.1004334.g002 catalytic residues; D190, H192 and S288 in PpAlgJ and D193, H195 and S297 in PaAlgJ were substituted with alanine. Interestingly, while catalytic alanine variants in AlgX result in the abrogation of 3-carboxyumbellifyl acetate hydrolysis [32], mutation of the catalytic triad in both AlgJ orthologs only reduced the catalytic activity by ,80% ( Figure 5). Circular dichroism spectroscopy of the AlgJ orthologs and their respective variants exhibited no significant difference in spectra (data not shown). This indicates that the protein variants are properly folded and that the differences in catalytic activity are not due to large structural perturbations.

AlgX but not AlgJ interacts with mannuronic acid oligomers
Comparison of PpAlgJ 75-370 and AlgX 44-344 indicates a significant difference in electrostatic surface potential between the active site regions of the enzymes ( Figure 2B and C). PpAlgJ 75-370 contains a shallow electronegative groove architecture around the active site whereas AlgX 44-344 contains a deep electropositive groove compatible with binding the anionic alginate orpolymannuronic acid polymer. In addition, AlgX has a C-terminal carbohydratebinding module, which presumably aids in binding and guiding alginate either to or from the active site. To examine whether PaAlgJ 79-379 and AlgX 27-474 interact with mannuronic acid oligomers, a direct electrospray ionization mass spectrometry (ESI-MS) binding assay was carried out using nine mannuronic acid oligosaccharides ranging from 4-12 sugar units in length (ManA 4 -ManA 12 ). Representative ESI mass spectra acquired for aqueous ammonium acetate solutions of AlgX and the protein reference scFv (P ref ) with ManA 6 or ManA 12 are shown in Figures 6A and 6B, respectively. In the mass spectrum shown in Figure 6A, ion signals corresponding to protonated AlgX monomer and protonated 1:1 (AlgX+ManA 6 ) complex, the +13 to +16 charge states, are observed. Signals corresponding to protonated P ref and (P ref + ManA 6 or ManA 12 ) are also present, indicating that nonspecific carbohydrate-protein binding took place during the ESI process. The mass spectrum shown in Figure 6B is qualitatively similar to that shown in Figure 6A, although a visibly larger fraction of the protein is in the bound form. Similar results were obtained for the other alginate ligands tested (data not shown). ESI-MS measurements were also performed on solutions of AlgX and undeca-and pentadeca-hyaluronic acid (HA 11 and HA 15 ). Importantly, these negative controls revealed no evidence of specific binding between AlgX and these acidic oligosaccharides, thereby confirming the specificity of AlgX for the alginate oligomers rather than any acidic oligosaccharide. Listed in Table 3 are the association constants (K a ) determined by ESI-MS, following correction for nonspecific carbohydrate-protein binding. Notably, the K a values for AlgX are seen to clearly increase with the length of the mannuronic acid oligosaccharides. It must be noted that the location of ligand binding site cannot be explicitly identified using this methodology and the observed interaction for AlgX may include both ligandactive site and ligand-CBM interactions.
The same assay was utilized to quantify binding of PaAlgJ 79-379 to the mannuronic acid oligomers. Representative ESI mass spectra are shown in Figures 6C and D for ManA 6 and ManA 12 , respectively. Analysis of the ESI mass spectra reveals two distinct charge state distributions (+10 to +13 and +14 to + 26) for the protonated ions of AlgJ. Ions corresponding to 1:1 complexes of AlgJ with alginate ligand were only observed at the lower charge states. Taken together, these results suggest that a fraction of AlgJ is unfolded in solution (corresponds to the +14 to +26 charge state distribution) and does not bind to the oligosaccharides [41][42][43][44][45]. Only the lower charge state AlgJ ions were considered for the K a determinations (Table 3). Because the relative protein ion abundances measured by ESI-MS do not

AlgX is able to O-acetylate alginate in vitro
Since AlgX interacts with mannuronic acid oligomers under the conditions tested, we examined the ability of AlgX to transfer an acetyl group from the pseudo-acetyl donors, 4-nitrophenyl acetate and 3-carboxyumbelliferyl acetate, to a decamer of polymannuronic acid (ManA 10 ). Reactions analyzed by ESI-MS revealed the production of both mono-and di-acetylated mannuronic acid oligomers in the AlgX containing reactions as observed by an m/z shift of 42.01 for the sodiated singly acetylated and 42.01 for the protonated doubly acetylated. The mass increase of the protonated singly acetylated species was 42.00 (Figure 7). Control reactions containing all of the reaction components except AlgX, did not result in acetylated alginate even though 4-nitrophenyl acetate exhibited low spontaneous hydrolysis at pH 7.0 resulting in the production of 4-nitrophenyl and acetate. Neutral, commercially available sugars, cellohexose, xylohexose and maltotriose were not O-acetylated in the presence of AlgX. While both AlgJ and AlgX are necessary for alginate O-acetylation in vivo [13,33] our binding and acetyltransferase data suggest AlgJ and AlgX do not have overlapping functions. Our current in vitro data provide additional evidence to support previous in vivo studies that there is no redundancy in the alginate acetylation machinery [31][32][33].

Discussion
The O-acetylation of alginate requires the concerted action of four proteins: the putative MBOAT protein, AlgI; a protein of unknown function, AlgF; and two annotated O-acetyltransferases AlgJ and AlgX. To shed light on the role of each O-acetyltransferase, we determined the structure of PpAlgJ 75-370 and compared it with the recently solved structure of PaAlgX 27-474 . Additionally, we kinetically characterized the acetylesterase activity of AlgJ and tested the ability of AlgJ and AlgX to bind and Oacetylate short polymannuronic acid oligomers. The structure of PpAlgJ 75-370 reveals a fold that is structurally comparable to AlgX 44-344 and other SGNH hydrolases. PpAlgJ, like AlgX, is best described as an SGNH hydrolase-like protein as they both exhibit several key differences to canonical SGNH hydrolases. Not only is the order of the catalytic residues circularly permuted, but both proteins contain a cap domain and two long anti-parallel b-strands on one side of the protein that are not observed in other SGNH members. Although the function of the two long anti-parallel b-strands is currently unknown, given the involvement of other proteins in the O-acetylation system it is tempting to speculate that these regions may be involved in protein-protein interactions. A prediction for potential interaction surfaces on AlgJ was made using the consensus Protein-Protein Interaction Site Predictor (cons-PPISP) [46,47]. Two predicted clusters of residues had positive scores for a possible interface. One cluster is comprised of residues 353-363 and is located on a loop and the beginning of a10 on the C-terminal end of the protein. A second cluster comprised of residues P79, G80, V81, D235 and F239 in PpAlgJ is localized to the cap domain, located above the SGNH core. This cluster contains the conserved AlgJ signature motif residues that are imperative for alginate O-acetylation. The structure of PpAlgJ has allowed us to confidently define the location of these residues, and propose roles for their function in O-acetylation. Signature residue variants in PaAlgJ; P135A, K137A, D193A, H195A and W196F (PpAlgJ equivalents; P132, K134, D190, H192 and W193) ablate O-acetylation [33]. The structure of PpAlgJ, in addition to our kinetic data, indicates that D190 and H192 are part of the catalytic triad. Therefore, alanine  variants D190A and H192A would disrupt the proposed catalytic serine by increasing the pK a of the nucleophile and altering its orientation in the active site. The relatively rare internal lysine, K134 uses hydrogen bonding to stabilize the backbone oxygens of L187, R188 and D190 that form a loop between helices a 3 and a 4 . It is expected that the loss of hydrogen bonding in the K134A variant disrupts the proper positioning of D190 in the DxH motif required for catalysis. The P132A variant located proximal to K134 would alter the secondary structure of, and impede proper function of K134. Lastly, the W193A variant is anticipated to disrupt the hydrophobic interactions with its neighbouring residues. Given that W193 is located proximal to catalytic H192, structural perturbations caused by disruption to the hydrophobic interactions are expected to have a negative impact on the catalytic triad. Our in vitro enzymatic analysis probed the ability of PaAlgJ and PpAlgJ to perform the initial acetylesterase step of the overall acetyltransferase reaction. The results indicate that both AlgJ enzymes exhibit comparable catalytic parameters to AlgX for the hydrolysis of acetate (acetylesterase) from 3-carboxyumbelliferyl acetate [32]. Catalytic variants reduced acetylesterase activity by .80%. Although, the complete loss of activity was observed when similar catalytic residues were replaced in AlgX, in vitro residual activity in catalytic variants has been reported in at least one SGNH esterase [40]. The precise reason for residual activity is not clear since there are limited in vitro studies characterizing catalytic triad variants in SGNH hydrolase superfamily members. However, one possibility is the surrogate acetyl-donor tested does not optimally mimic the native acetyl donor of AlgJ. Residual hydrolysis of esterase substrates may also be attributed to surface amino acid residues that form microenvironments that promote spontaneous, non-enzymatic hydrolysis. For example, non-enzymatic hydrolysis has been reported for the SGNH superfamily member glutathione-S-transferase [48] and in the non-enzymatic protein albumin [49,50]. The observation that the catalytic triad variants ablate O-acetylation in vivo further supports the notion that non-specific hydrolysis occurs in vitro.
The electrostatic surfaces of PpAlgJ 75-370 and the SGNH hydrolase-like domain of PaAlgX 27-474 are distinctly different with respect to the active site region. PaAlgX contains an electropositive groove that stretches from the active site to the C-terminal CBM [32]. We previously proposed that this highly conserved electropositive region would be compatible for binding the anionic alginate polymer. A direct ESI-MS binding assay confirmed that AlgX is able to bind mannuronic acid oligomers, with longer, more physiologically relevant polymers of mannuronic acid exhibiting higher affinity for the protein. The addition of a surrogate acetyl-donor in the presence of mannuronic acid oligomers led to the first demonstration in vitro that AlgX is an O-acetyltransferase capable of transferring acetyl groups to mannuronic acid oligomers. While ESI-MS cannot explicitly identify the location of carbohydrate-protein interaction, the ability for AlgX to O-acetylate mannuronic acid oligomers demonstrates that these oligosaccharides must, in part, bind to the active site. In comparison, PpAlgJ demonstrated very weak or no affinity toward mannuronic acid oligosaccharides under the conditions tested. Taken together, these data suggest that AlgX may be the only enzyme that O-acetylates alginate and that it functions in a non-redundant, successive mechanism with the other proteins in the acetylation complex machinery. Our data allow us to propose an updated model for alginate O-acetylation ( Figure 8). Briefly, AlgI interacts with an unknown acetyl donor molecule in the cytoplasm and transfers acetate or acetyl donor across the inner membrane to either AlgJ and/or AlgF in the periplasm. The transmembrane domain of AlgJ (residues [12][13][14][15][16][17][18][19][20][21][22][23][24][25][26][27][28][29] tethers the enzyme to the membrane and potentially closer to AlgI than either AlgF or AlgX which lack this spatial constraint. Given the current state of knowledge, the function and mechanism of the intermediate step(s) involving AlgJ or AlgF can only be speculated. The order of transfer of the acetyl donor is uncertain, but the fact that AlgJ can perform the initial esterase stage of transferase reaction suggests that, given the right donor, recipient, and environment, this enzyme is directly involved in the O-acetylation process. This possibility cannot be ruled out for AlgF either, but the lack of identifiable catalytic residues in this protein suggests that it may serve a more accessory role. However, it is clear that AlgX can catalyze the direct O-acetylation of alginate. Thus, it is conceivable that AlgX receives the acetate or acetyl donor molecule from either AlgJ or AlgF, and then subsequently catalyzes the transfer of this substrate to O-acetylate alginate. This model is supported by the observation that active site variants of either AlgJ or AlgX are sufficient to abolish acetylation in vivo [32,33]. In addition, the data presented herein exclude the possibility that both enzymes can O-acetylate alginate.
Although the O-acetylation of polysaccharides requires a minimum of two functionalities, O-acetylation of alginate and cellulose requires four distinct proteins including a membrane associated and non-membrane associated periplasmic O-acetyltransferase. In contrast, the O-acetylation of peptidoglycan utilizes a single O-acetyltransferase that may not be constrained to the inner membrane depending on the presence of an N-terminal transmembrane domain. Regardless of the number of proteins involved in the system, the catalytic mechanism of acetate transfers across the inner membrane and the acetylation of the polymer is undoubtedly highly conserved. Additionally, the role of O-acetylation of these polysaccharides in both Gram-negative and Gram-positive bacteria is functionally similar, allowing for protection against external agents [16][17][18][19][20][21]26,27]. Intuitively, the requirement of four proteins involved in relaying acetate from the cytosol to the polysaccharide appears inefficient when compared to the O-acetylation of peptidoglycan. While additional proteins could facilitate increased regulation, the extent of O-acetylation of peptidoglycan and alginate between species, strain, and culture conditions, varies between 20-70% (relative to muramic acid content) and 4-57%, respectively [51,52]. This indicates that the amount of O-acetylation cannot be simply categorized based on the number of proteins involved or their localization.
A defining characteristic of cellulose and alginate biosynthesis compared to peptidoglycan is that these polysaccharides must traverse two membranes for export before reaching their final location, which in turn requires the involvement of several additional proteins in polymer modification and export. Therefore, four O-acetylation proteins may be an inherent requirement to adapt the O-acetylation machinery to the biosynthetic export machinery. In support of this, AlgX has been demonstrated to interact with both outer membrane alginate export proteins and periplasmic proteins required in alginate modification [53][54][55][56][57]. Such interactions have not been observed in the O-acetylation of peptidoglycan. The proteins WssFGHI are compulsory for cellulose acetylation [58] and WssGHI are homologous to AlgFIJ, with amino acid sequence identities of 24, 46, and 33%, respectively. We have previously suggested that WssF, which is predicted to belong to the SGNH hydrolase superfamily may be analogous to AlgX although it lacks the CBM present in AlgX [32]. Since proteins involved in cellulose acetylation have not been studied at either the structural or functional level, our present studies on AlgJ and AlgX are significant as they provide further data to support the roles of these proteins beyond sequence homology and phenotypic analysis.
We have successfully determined the structure of AlgJ, a protein involved in the alginate O-acetylation pathway. AlgJ exhibits acetylesterase activity that is mediated through the Ser-His-Asp catalytic triad similar to that of the SGNH hydrolase-like enzyme AlgX. ESI-MS confirmed that AlgX but not AlgJ binds polymannuronic acid and we have conclusively demonstrated that AlgX is an O-acetyltransferase, and that it is the only annotated O-acetyltransferase in the pathway that can both interact with, and O-acetylate alginate. Refining the model for alginate O-acetylation provides new avenues for further studies into the mechanism of O-acetylation of polysaccharides in the microbial kingdom.

Materials and Methods
Chemicals, bacterial strains, plasmids, and growth media Superflow Ni 2+ NTA-agarose resin was obtained from Qiagen (Mississauga, ON). Graphitized carbon solid phase extraction columns (Carbograph SPE) are products of Grace Canada, Inc (Ajax, ON). All other chemicals and reagents, unless otherwise stated, were supplied by Sigma-Aldrich Canada Ltd. (Oakville, ON). All growth media was obtained from Bio Basic (Markham, ON). DNA manipulations were performed in E. coli DH5a and protein expression of the SeMet protein was carried out using E. coli B834 (DE3) Met-auxotroph cells and grown in media supplemented with kanamycin at 50 mg mL 21 . Protein expression for binding and enzyme assays was carried out in E. coli BL21-CodonPlus cells.

DNA manipulations
The nucleotide sequences of algJ from P. putida KT2440 and P. aeruginosa PAO1 were acquired from the Pseudomonas Genome Database [59]. The boundaries of the putative Oacetyltransferase domain of P. aeruginosa AlgJ were predicted to range from amino acids 79 to 379 (PaAlgJ 79-379 ) based on Phyre 2 [60] structural alignments using the N-terminal domain of P. aeruginosa AlgX (PaAlgX 44-344 ) [32]. Sequence alignment revealed that these boundaries correspond to amino acids 75 to 370 in the P. putida AlgJ homolog (PpAlgJ 75-370 ). PCR amplification was carried out using the high fidelity DNA polymerase, PfuTurbo (Stratagene). FastDigest restriction enzymes were obtained from Fermentas. Plasmid DNA was extracted from E. coli using the PureLink Quick plasmid miniprep kit from Invitrogen (Burlington, ON). Primer sequences used in the generation of wild type (WT) PaAlgJ 79-379 and PaAlgX 44-344 , as well as proposed active site mutants are summarized in Supplementary Table S1. The PCR reaction conditions were as follows: Pfu buffer with 2 mM MgSO 4 (Thermo Scientific), 10 ng template DNA, 10 ng forward and reverse primer each, 25 mM dNTPs, 2 mM MgSO 4 , 2.5 U Pfu polymerase (Thermo Scientific) in a total reaction volume of 25 mL. The PCR product was digested with NdeI and XhoI and ligated into a pET28a vector backbone to generate (i) a thrombin cleavable N-terminal hexahistidine tag (His 6 ) construct; and (ii) a second construct used for crystallography that contained both an N and C-terminal His 6 -tag. Site directed mutagenesis was performed using the QuikChange Lightning kit according to the prescribed protocol (Agilent Technologies). Constructs generated were verified by sequencing performed by ACGT DNA Technologies Corporation (Toronto, ON).

Expression and purification of AlgJ
The expression and purification of native wild type and mutant PaAlgJ 79-379 and PpAlgJ 75-370 constructs were identical. SeMet PpAlgJ 75-370 was used for structure determination and was expressed using the protocol described by Lee et al [61]. The expression and purification protocol was as follows: Starter cultures were grown overnight in 50 mL Luria-Bertani (LB) broth containing 50 mg mL 21 kanamycin at 310 K in a shaking incubator, with E. coli BL21-CodonPlus cells transformed with the appropriate plasmid. The cells were subsequently inoculated into 1 L LB broth containing 50 mg mL 21 kanamycin at 310 K in a shaking incubator. Upon reaching an OD 600 of 0.7, the cells were induced with isopropyl b-D-1-thiogalactopyranoside (IPTG) to a final concentration of 1 mM. The induced cells were allowed to grow for an additional 18 h at 291 K. The cells were harvested via centrifugation at 50006 g for 20 min at 277 K. The cell pellet was stored at 253 K until needed. Frozen cell pellet was thawed over ice and re-suspended in 50 mL cold lysis buffer (500 mM NaCl, 20 mM Tris-HCl pH 8.0) containing one SIGMAFAST EDTA-free protease-inhibitor cocktail tablet (Sigma). The cells were homogenized at 10,000 psi through an Emulsiflex C3 (Avestin Inc.) with at least 3 passes until uniform in consistency. The resultant cell lysate was centrifuged at 250006 g for 25 minutes at 278 K to pellet cell debris and insoluble material. The soluble cell lysate was loaded onto a 5 mL Ni 2+ -NTA gravity column equilibrated with Ni-NTA buffer (500 mM NaCl, 20 mM Tris-HCl pH 8.0, 5 mM imidazole). The column was washed with 10 column volumes of Ni-NTA buffer containing 30 mM imidazole. Protein bound to the column was eluted with 4 column volumes of Ni-NTA buffer containing 150 mM imidazole. The eluent was dialyzed against 4 L of S200 buffer (150 mM NaCl, 20 mM Tris-HCl pH 8.0). Protein concentration was measured using the Pierce BCA Protein Assay Kit from Thermo Scientific (Rockford, IL). The His 6 -tag was cleaved from the protein via incubation with thrombin (Novagen) at 0.5 U per mg of protein at 298 K for at least 2 h. The thrombin treated protein was loaded onto a 1 mL Ni 2+ -NTA column equilibrated with S200 buffer containing 5 mM imidazole. The column was washed with 10 column volumes of S200 buffer containing 30 mM imidazole. The initial flow through and wash were pooled together and contained the un-tagged protein. The untagged protein was concentrated to a 1-2 mL volume using an Amicon Ultra centrifugation filter device (Milipore) with a 30 kDa cutoff. Approximately 20 mg of purified protein could be obtained per litre of bacterial culture. The concentrated protein was further purified via size exclusion chromatography on a HiLoad 16/60 Superdex 200 gel filtration column (GE Healthcare). Fractions containing protein were pooled and analyzed via SDS-PAGE to be .95% pure. PaAlgJ  and PpAlgJ 75-370 protein could be stored at 277 K for up to 2 or 4 weeks, respectively, before significant degradation was observed by SDS-PAGE.
Crystallization and structure determination of PpAlgJ  SeMet PpAlgJ 75-370 was concentrated to ,6-8 mg mL 21 by an Amicon Ultrafiltration device (30 kDa MWCO, Milipore) for crystallization trials. Sparse-matrix screens were setup by hand using MCSG suites 1-4 (Microlytic) in 48-well VDX plates (Hampton Research). The drops consisted of a 1:1 ratio of protein to well solution at a final volume of 4 mL equilibrated over 250 mL of well solution, and stored at 293 K. Numerous hits were obtained after 1 week and were primarily found in conditions containing a divalent cation (Mg 2+ or Ca 2+ ) and polyethylene glycol (PEG) solutions between 3350-6000 Da. In most cases the crystals nucleated from a single point and radiated outwards forming a cluster with individual crystals estimated to be a maximum of 500 mm, and were of diffraction quality directly from the sparse matrix screens.
Crystals used for data collection were found in MCSG-1, condition 7 (0.2 M MgCl 2 , 0.1 M Bis-Tris:HCl pH 5.5, 25% (w/ v) PEG3350). Prior to data collection, crystals were cryo-protected by exchanging the drop solution with cryo-protectant solution (0.2 M MgCl 2 , 0.1 M Bis-Tris:HCl pH 5.5, 25% (w/v) PEG3350, 20% (v/v) ethylene glycol). The exchange was performed through the addition of cryo-protectant solution directly to the drop and the removal of the added volume until complete exchange had occurred. The crystal clusters were disrupted via physical contact with a fine needle until an isolated single crystal could be looped. Crystals were vitrified in liquid nitrogen and stored. Selenium single-wavelength anomalous dispersion (Se-SAD) X-ray diffraction data were collected on beamline X29A at the National Synchrotron Light Source (NSLS) at Brookhaven National Laboratory. 90 images at 94% beam attenuation with 2u DQ oscillation and 360 images without beam attenuation with 1u DQ oscillation were collected on an ADSC Q315 CCD detector at a 260 mm crystal-to-detector distance and 0.4 s exposure time per image. The Se-SAD data was indexed, integrated, scaled and merged using HKL-2000 (Table 1) [62], and used in conjunction with HKL2MAP to locate 12 (of 14) selenium sites, with density modified phases calculated using SOLVE/RESOLVE [63]. The electron density maps were of sufficient quality for automatic model building using PHENIX AutoBuild [64] and subsequent manual model building using COOT [65,66]. Model refinement was performed using PHENIX.REFINE [67] and progress was monitored as a function of the reduction and convergence of R work and R free (Table 1) [66]. TLS groups were added to the refinement in PHENIX through the use of the TLSMD server [68].

Structural and bioinformatics analysis of AlgJ
All figures that display the structure of PpAlgJ 75-370 and/or PaAlgX 27-474 were generated using PyMol (The PyMol Molecular Graphics System, version 1.6.0.0, Schrödinger, LLC). The secondary structure was determined using the STRIDE web server [69]. Surface representations demonstrating either electrostatics or surface residue conservation were depicted with all side chains present even if they could not be accurately modeled in the structure. Electrostatic surfaces were generated using the ABPS Tools 2.1 plugin that is integrated into PyMol [70]. Surface residue conservation was determined using the ConSurf web server [71] and visualized using the provided color scheme. Solvent accessible surface area was calculated using the PDBePISA server [72]. Surface residue conservation was determined using the ConSurf web server [71] and visualized using the provided color scheme. Solvent accessible surface area was calculated using the PDBePISA server [72].

Enzyme assay
All enzyme assays were performed at least in triplicate, in a 96well microtiter plate, using a SpectraMax M2 from Molecular Devices (Sunnyvale, CA). Standard reactions contained 3.0 mM 3carboxyumbelliferyl acetate (ACC), dissolved in DMSO for specific activity assays and variable concentrations ranging between 0.1 K m and $2 K m and 30 mg each wild-type protein and 100 mg for each variant in a total volume of 100 mL in 50 mM sodium HEPES buffer (pH 7.0 for AlgJ, pH 8.0 for AlgX) at 298 K. The final DMSO concentration did not exceed 10% (v/v). Due to low substrate solubility, reactions with higher substrate concentrations could not be obtained. Reactions were initiated by the addition of substrate and reactions were monitored in real time for a duration of 10 min using an excitation of 386 nm and an emission of 447 nm as previously described [32]. The hydrolysis and release of acetate results in an increase in the fluorescence signal. Background hydrolysis rates, in the absence of enzyme, were monitored and subtracted from enzyme-catalyzed reactions. A calibration curve for 7-hydroxycourmarin-3-carboxylic acid, the fluorescent hydrolysis product of 3-carboxyumbelliferyl acetate, was obtained under the reaction conditions and used to calculate reaction rate. The protein concentration of each enzyme variant was determined using the Pierce BCA Protein Assay Kit from Thermo Scientific (Rockford, IL). Data were fit by nonlinear regression to the Michealis-Menten equation using GraphPad Prism 6.0c for Mac, (GraphPad Software, La Jolla California USA, www.graphpad.com).

Alginate binding assay
Immediately prior to ESI-MS analysis, PaAlgX 27-474 and PaAlgJ 79-379 were each dialyzed against aqueous 100 mM ammo-nium acetate (pH 7.0) using microconcentrators (Millipore Corp., Bedford, MA) with a MW cut-off of 30 kDa (for AlgX) and 10 kDa (for AlgJ). Two different reference proteins (P ref ) were used to correct ESI mass spectra for the occurrence of nonspecific carbohydrateprotein binding (during the ESI process): a single chain fragment (scFv, MW 26 539 Da) of the monoclonal antibody (mAb) Se155-4, which was produced and purified as described previously [76], and lysozyme (Lyz, MW 14 300 Da), which was obtained from Sigma-Aldrich Canada (Oakville, Canada) and used without further purification. Each protein was concentrated and dialyzed against 50 mM ammonium acetate using microconcentrators (Millipore Corp., Bedford, MA) with a MW cut-off of 10 kDa and stored at 2 20uC until needed. Stock solutions of each of the polymannurnic acid oligomers, tetramer through dodecamer (ManA 4 -ManA 12 ), and undeca-and pentadeca-hyaluronic acid saccharides (HA 11 and HA 15 ), synthesized as previously described [77,78] were prepared by dissolving known amounts of solid compound in ultrafiltered water (Milli-Q, Millipore, Bedford, MA) to give a final concentration of ,1 mM. The ligand solutions were stored at 253 K until needed.
Affinity measurements were carried out on a Synapt G2 quadrupole-ion mobility separation-time of flight (Q-IMS-TOF) mass spectrometer (Waters, UK) equipped with a modified nanoflow ESI (nanoESI) source. Complete details of the instrumental and experimental conditions used for the ESI-MS binding measurements along with descriptions of how the data were analyzed to establish association constants (K a ) for protein-ligand interactions have been described previously [79][80][81].

O-Acetyltransferase assay
O-Acetyltransferase assays were completed in a similar fashion as those previously described for the O-acetylation of peptidoglycan with modifications [51]. Briefly, 3 mM 4-nitrophenyl acetate dissolved in ethanol (3% final concentration), and 1 mM of ManA 10 [78], was incubated with 60 mg of AlgX in 50 mM sodium phosphate buffer, pH 7.0 for a duration of 1 h at 298 K. Control reactions containing 1 mM of cellohexose, xylohexose and maltotriose in place of alginate were also completed. All reactions were repeated in the absence of AlgX as a negative control. The total reaction volume was 150 mL and the reaction was initiated by the addition of protein. The reactions were quenched by applying the entire reaction to a 4 mL graphitized carbon solid phase extraction column previously washed with three column volumes of 100% acetonitrile containing 0.1% (v/v) trifluoroacetic acid (TFA) and then equilibrated with three column volumes of water. Following application of the reaction, the column was washed successively with three column volumes of 100% acetonitrile, 100% acetonitrile containing 0.1% (v/v) TFA, 3:1 isopropyl alcohol: acetonitrile (v/v). Polymannuronic acids were eluted with 6 mL of 50% tetrohydrofuran containing 0.1% TFA (v/v) and dried under vacuum in a Speed Vac at 298 K. The dried samples were resuspended in 100 mL of water and stored at 253 K prior to analysis.
Liquid chromatography-mass spectrometry analyses were performed on an Agilent 1200 HPLC liquid chromatograph interfaced with an Agilent UHD 6530 Q-Tof mass spectrometer at the Mass Spectrometry Facility of the Advanced Analysis Centre, University of Guelph. A C18 column (Agilent Poroshell 120, 150 mm64.6 mm 2.7 mm) was used for chromatographic separation with solution A (0.1% formic acid) and solution B (100% acetonitrile with 0.1% formic acid). The mobile phase gradient was as follows: initial conditions, 2% solution B increasing to 98% solution B in 30 min followed by a column wash at 98% solution B and 10 minute re-equilibration. The first 2 and last 5 minutes of gradient were sent to waste. The flow rate was maintained at 0.1 mL/min. The mass spectrometer electrospray capillary voltage was maintained at 4.0 kV and the drying gas temperature at 523 K with a flow rate of 8 L/min. Nebulizer pressure was 30 psi and the fragmentor was set to 160. Nitrogen was used as both the nebulizing and drying gas, and the collision-induced gas. The mass-to-charge ratio was scanned across the m/z range of 100-3000 m/z in 4 GHz (extended dynamic range positive-ion auto MS/MS mode). Three precursor ions per cycle were selected for fragmentation. The instrument was externally calibrated with the ESI TuneMix (Agilent). The sample injection volume was 10 ml. The resultant spectra were analyzed using mMass software (http://www.mmass.org/).

Accession numbers
The coordinates and structure factors for PpAlgJ  have been deposited in the PDB, ID code 4O8V.

Supporting Information
Table S1 Bacterial strains and plasmids used in this study. (DOCX)