MamO Is a Repurposed Serine Protease that Promotes Magnetite Biomineralization through Direct Transition Metal Binding in Magnetotactic Bacteria

Many living organisms transform inorganic atoms into highly ordered crystalline materials. An elegant example of such biomineralization processes is the production of nano-scale magnetic crystals in magnetotactic bacteria. Previous studies implicated the involvement of two putative serine proteases, MamE and MamO, during the early stages of magnetite formation in Magnetospirillum magneticum AMB-1. Here, using genetic analysis and X-ray crystallography, we show that MamO has a degenerate active site, rendering it incapable of protease activity. Instead, MamO promotes magnetosome formation through two genetically distinct, noncatalytic activities: activation of MamE-dependent proteolysis of biomineralization factors and direct binding to transition metal ions. By solving the structure of the protease domain bound to a metal ion, we identify a surface-exposed di-histidine motif in MamO that contributes to metal binding and show that it is required to initiate biomineralization in vivo. Finally, we find that pseudoproteases are widespread in magnetotactic bacteria and that they have evolved independently in three separate taxa. Our results highlight the versatility of protein scaffolds in accommodating new biochemical activities and provide unprecedented insight into the earliest stages of biomineralization.


Introduction
Biomineralization is the widespread phenomenon by which living organisms transform inorganic atoms into highly ordered, crystalline structures. Controlling the size and shape of such materials requires specialized protein machinery that can define the nano-scale trajectory of crystal growth [1]. Incorporating biochemical principles uncovered from studying biomineralization has the potential to revolutionize the design and synthesis of nanomaterials in vitro [2]. In addition to the well-known examples of tooth, bone, and shell production by multicellular eukaryotes, a number of bacteria have the ability to biomineralize small magnetic crystals within subcellular compartments called magnetosomes [3,4]. These particles allow the cells to passively align in the earth's magnetic field, facilitating the search for their preferred oxygen environments [5]. Although these magnetotactic bacteria have drawn longstanding interest due to their ability to manipulate transition metals, the biochemical details of how they transform iron into magnetite (Fe 3 O 4 ) remain poorly understood.
Magnetotactic organisms are phylogentically diverse. Nearly all isolates come from the α-, δ-, or γclasses of Proteobacteria, but representatives from the Nitrospirae and Omnitrophica phyla have recently been identified [6]. The genes responsible for making magnetosomes are often contained in a genomic region called the magnetosome island (MAI) [7][8][9][10][11]. Comparative genomic and phylogenetic studies have identified a set of core genes that appears to have been assembled a single time and inherited vertically, indicating that magnetosome formation likely predates the divergence of the Proteobacteria [12,13]. The MAI seems to have formed by incorporating elements from other cellular processes, as the majority of the core factors have homology to ancient and widespread protein domains [14,15]. Uncovering the biochemical functions encoded in the MAI in relation to its evolutionary history provides a unique opportunity to understand how new cellular processes evolve.
Due to the availability of genetic systems, α-Proteobacteria such as Magnetospirillum magneticum AMB-1 are used as models for studying the molecular biology of magnetosome formation [16]. AMB-1 contains 15-20 magnetite crystals, each formed within a cytoplasmic membrane invagination and organized in a chain spanning the length of the cell [17,18]. By making deletions within the MAI and characterizing the ultrastructure of the mutant cells, specific genes have been assigned roles in various stages of magnetosome formation [10,11,19,20]. Genes whose deletions produced empty magnetosome compartments or compartments with abnormally small magnetite crystals were termed biomineralization factors. It is the proteins encoded by these genes that hold the secrets of how magnetotactic organisms interface with solid magnetite.
Two genes in the MAI, mamE and mamO, are homologous to the HtrA proteases, a ubiquitous family of trypsin-like enzymes that functions using His-Asp-Ser catalytic triads [21]. An additional pair of genes, called limE and limO, with homology to the protease domains of mamE and mamO, exists in a secondary genomic region termed R9 [10]. Disrupting mamE or mamO causes cells to produce empty magnetosome membranes, but removing R9 has no effect, showing that mamE and mamO are required for biomineralization and that limE and limO are not (Fig 1) [10,22]. Adding variants of mamE or mamO with all three predicted active site residues mutated to alanine could not restore normal biomineralization in the ΔmamOΔR9 or ΔmamEΔR9 strains but complemented single ΔmamO or ΔmamE deletions [23]. These genetic analyses show that mamE and mamO are required for the initiation of magnetite biomineralization. Furthermore, limE and limO are partially redundant in that they can crosscomplement the active site-dependent crystal maturation defects of their respective orthologs.
Here, we use a combination of in vivo and in vitro approaches to reveal an unexpected dual role for MamO. It promotes MamE-dependent proteolysis of three biomineralization factors through the use of its C-terminal transporter domain. Separately, the protease domain has lost the ability to carry out catalysis and has instead been repurposed to bind transition metal ions. Two surface-exposed histidine residues that contribute to this metal-binding function are required for initiating magnetite biosynthesis in vivo. Bioinformatic analysis shows that similar pseudoproteases evolved independently in the three major taxa of magnetotactic organisms, highlighting a unique evolutionary mechanism behind microbial nanoparticle synthesis.

The Catalytic Triad in MamO Is Not Required for Magnetite Nucleation
Trypsin-like proteases utilize a histidine-aspartate pair to deprotonate the hydroxyl group of a serine residue, providing a nucleophile for catalysis. MamO is unusual in that it contains a threonine instead of serine as the predicted nucleophile. To further clarify the role of MamO's unusual active site, we focused our initial efforts on assessing the contribution of each putative catalytic residue to biomineralization. We performed these initial studies using the ΔmamOΔR9 strain (referred to as ΔOΔR9) to avoid cross-complementation from limO. Consistent with our previous findings, individual H116A and D149A mutations in MamO severely reduced the cells' ability to turn in a magnetic field (Fig 2A). Surprisingly, a T225A mutation had no defects in magnetic response (Fig 2A). Using transmission electron microscopy (TEM), we confirmed that mamO H116A and mamO D149A cells have small magnetite crystals while   mamO T225A crystals are indistinguishable from wild-type, mirroring the bulk magnetic response measurements (Fig 2B-2D).
Given that the predicted nucleophile is dispensable for magnetosome formation, we found it curious that the other two catalytic triad mutations disrupted crystal maturation. Upon further examination, we found that the phenotypes associated with the mamO H116A and mamO D149A alleles were actually temperature-dependent. Growing wild-type AMB-1 at room temperature instead of the standard 30°C did not dramatically alter the magnetic response. However, both the mamO H116A and mamO D149A alleles displayed improved complementation at the lower temperature. In particular, the mamO D149A mutant restored a nearly wild-type magnetic response to the ΔOΔR9 strain under these conditions (S1 Fig). Our results suggest that protease activity from MamO is not required for biomineralization. None of the three predicted catalytic residues is required for magnetite nucleation, and, although two of the three contribute to crystal maturation, these effects are conditional, suggesting that they are not central to the biomineralization process.

Proteolytic Processing of Biomineralization Factors
During the course of our experiments, we examined the MamO variants by western blotting. Although no changes in overall protein abundance were present, we noticed that each MamO variant was proteolytically processed, having both a full-length and shorter form (S2 Fig). This finding led us to examine whether other magnetosome proteins are similarly proteolyzed. We found that MamE and another biomineralization factor, MamP, are also proteolytic targets in AMB-1 cells by using antibodies targeted to each protein ( Fig 3A). MamP is a c-type cytochrome, and its iron oxidase activity is required for the proper maturation of magnetite crystals  [24,25]. Since these three biomineralization factors exist in both full-length and shorter forms, it is likely that proteolytic maturation plays a role in their function.
To examine the potential involvement of mamE and mamO in promoting these proteolytic events, we analyzed processing of MamO, MamE, and MamP in various genetic backgrounds. In both the ΔOΔR9 and ΔEΔlimE strains, we observed only the full-length form of MamP. Similar analyses showed that MamO is required for the proteolysis of MamE and that MamE is required for the proteolysis of MamO. Thus, processing of each target requires both putative proteases (Figs 3A and S2A). Addition of the mamE WT allele restored processing of each target in the ΔEΔlimE strain, but the catalytically inactive form (mamE PD ) did not, suggesting that MamE participates in proteolysis directly. Surprisingly, both the mamO WT and mamO H116A alleles restored processing in the ΔOΔR9 strain (Figs 3A and S2). Therefore, the presence of MamO, but not its catalytic triad, appears to be required to promote the activity of MamE. While these results strongly suggest that MamE directly cleaves all three biomineralization factors, we cannot rule out the possibility that its activity contributes to a more complex targeting process.
Given that HtrA proteases are often regulated by the formation of higher order oligomers, an attractive model could be that MamO activates MamE through an interaction involving both protease domains. To test this idea, we exploited the partial redundancy from genes in the R9 region. In addition to its N-terminal protease domain, MamO has a predicted TauE-like transporter domain on its C-terminus. limO, the partial duplication of mamO in R9, is 98% identical to the protease domain of mamO but does not contain a C-terminal TauE domain (S3

Structural Analysis of the MamO Protease Domain
In the trypsin family, loop L1 contains both the nucleophilic serine and oxyanion hole, required for creating the acyl-enzyme intermediate and stabilizing the oxyanion, respectively (S4 Fig) [26]. In addition to the threonine substitution, the entire L1 loop in MamO differs significantly from the trypsin family consensus, once again suggesting that MamO might not be capable of protease activity (S4 Fig). To explore this possibility further, we determined the crystal structure of the protease domain (S1 Table). It crystallizes as a monomer with the chymotrypsin fold and the catalytic residues properly placed (S4 Fig).
Loop L1 of MamO adopts the inactive conformation seen in many HtrA proteases in which the main chain carbonyl of residue 192 (chymotrypsin numbering; W222 in MamO) prevents access to the oxyanion hole ( Fig 4A). In other HtrAs, the inactive form is thought to be in equilibrium with an active conformation in which the main chain flips approximately 180°, opening the oxyanion hole ( Fig 4A) [27,28]. Switching to the active state forces residue 193 into a configuration that is strongly disfavored for nonglycine residues. Although glycine is highly conserved at position 193 in the trypsin family and is critical for catalysis, MamO contains a glutamate (E223) at this position [29]. Therefore, the active configuration of MamO would contain a strong steric clash between the E223 side chain and the main chain carbonyl of W222.
To illustrate this steric constraint, we examined a set of trypsin-like protease structures and analyzed the configuration of residue 193. We plotted ϕ and C values for this position on a Ramachandran plot along with the favored and allowed geometries for glycine and nonglycine residues (S2 Table) [30]. As expected, the main-chain torsion angles at position 193 form two groups, distinguished by an approximately180°shift in ϕ angle. The groups correspond to active and inactive forms of loop L1. The active configuration is strongly disfavored unless glycine occupies position 193, indicating that the E223 side-chain in MamO likely prevents the formation of an oxyanion hole ( Fig 4B).
While refining the MamO structure, we observed a peptide bound near the predicted peptide-binding groove ( Fig 5A). We suspect its source is the flexible N-terminal region from a neighboring MamO in the crystal that is not built into the model. However, due to the modest An Inactive Serine Protease Regulates Biomineralization resolution, we cannot confirm the sequence. Despite this uncertainty, the peptide has an interesting mode of binding. Similar to other trypsin-like proteases, the peptide enters the binding cleft parallel to loop L2, splitting the two β-barrels of the chymotrypsin fold. However, the bulky side-chain of W222 in MamO seems to block the exit path between loops LA and LD, forcing the peptide away from the catalytic center (Fig 5B-5D). Overall, the structural features of the L1 loop appear incompatible with protease activity: E223 provides an energetic barrier to catalysis while W222 serves as a physical barrier to productive substrate binding.

Direct Transition Metal Binding by MamO
Given the results of the genetic studies and structural analysis, we conclude that MamO does not act as a protease during biomineralization. This forced us to consider the possibility that the protease domain promotes magnetite formation using a function not predicted from its primary sequence. While purifying MamO, we serendipitously observed that it consistently bound to immobilized metal affinity columns. Knowing that mutations in MamO cause defects in AMB-1's ability to transform iron, a metal, into magnetite, we speculated that direct metal binding played a role in biomineralization. Although the MamO crystals grow in acidic conditions that disfavor metal binding, they could be soaked at pH 8.0 without affecting diffraction. We solved another structure of the protease domain using crystals soaked in NiCl 2 at pH 8.0 and identified a metal binding site. Overall, the conformation of the protease domain is highly similar to the original structure (rootmean-square deviation of 0.17 Å over 184 residues), and it contains the unidentified peptide. Additionally, a single Ni 2+ ion binds between loop LC and helix 2, with H148 and H263 directly coordinating the metal (Fig 6A-6C). We confirmed the placement of the ligand at this site using single wavelength anomalous dispersion (SAD) data collected at the Ni absorption edge (Fig 6A and 6B). MamO's di-histidine motif is highly reminiscent of the Zn 2+ binding site in another trypsin-like protease, kallikrein-3, in which Zn 2+ attenuates protease activity by altering the position of catalytic triad residues H57 and D102 (S5 Fig) [31,32].
To confirm the putative metal binding site from the MamO structure, we used transition metal ion Förster Resonance Energy Transfer (tmFRET) to assay binding in solution [33]. This technique measures fluorescence quenching of a cysteine-conjugated fluorophore by a metal bound at a nearby site. Guided by the structure, we targeted Q258 because of its optimal geometry relative to the metal. Adding a number of transition metals, including iron, to a purified Q258C mutant protease domain that was labeled with fluorescein-5-maleimide caused strong fluorescence quenching (S6 and S7 Figs). Although we expect Fe 2+ to be the physiological ligand, its propensity to oxidize in ambient atmosphere added significant error to the measurements. Instead, the quenching properties and resistance to oxidation of Ni 2+ were most suitable for detailed analysis. MamO bound to Ni 2+ with 2.5 μM affinity, compared to 1.1 μM in an H148A/H263A mutant. Additionally, the FRET efficiency was significantly lower in the mutant protein, demonstrating that disrupting these residues changes the metal binding properties of MamO (Figs 6D and S7).
Although the H148A/H263A mutant displays altered behavior in the tmFRET assay, it retains the ability to bind metals (S7 Fig). We could not identify any other metal ions in our Ni 2+ soaked crystals, but the two histidines we identified did not appear to be the only metal binding residues in MamO. Consistent with this, both the wild-type and H148A/H263A forms of MamO bound Ni-NTA resin, confirming that, despite its altered binding geometry, the mutant still binds to metal (S7 Fig). Taken together, our biochemical and structural investigations show that MamO binds to transition metal ions using H148 and H263, but that metal binding is not restricted to this motif.
Because disrupting H148 and H263 altered the metal binding behavior in vitro, we predicted that these residues were important for MamO-dependent biomineralization. Indeed, ΔOΔR9 strains with mamO H148A or mamO H263A alleles had no magnetic response and failed to produce electron-dense particles, implying that proper metal binding is required to initiate biomineralization (Fig 6E and 6F). Both mutants displayed normal stability and proteolytic processing, and the biomineralization defects were not conditional, as lowering the growth temperature did not allow for a magnetic response (S1 and S2 Figs). Additionally, both alleles restored a magnetic response in the single ΔmamO background, showing that limO can provide the required metal binding activity independent of cotranslation with the TauE domain (S3 Fig). These are the most disruptive point mutations we have observed in MamO, and they completely recapitulate the phenotype of a mamO deletion. We conclude that H148 and H263 contribute to a metal binding function that is required for magnetite nucleation in vivo.

Convergent Evolution of Pseudoproteases in Magnetotactic Bacteria
While the finding that MamO has lost its protease activity to become a metal binding protein in AMB-1 was quite intriguing, we wanted to know whether this mechanism was conserved in other organisms. Due to the fact that magnetotactic Nitrospirae and Omnitrophica have not been isolated in culture, we focused our analysis on the Proteobacteria for which numerous representatives are available in pure-culture. Examining available whole-genome sequences indicated that all magnetotactic strains from the α-, δand γ-Proteobacteria contain two predicted trypsin-like proteases in their MAIs. We attempted to understand the evolutionary history of these proteins by including them in a large phylogenetic tree of the bacterial trypsin-2 superfamily (Methods). Within this tree, the MamO sequences from each magnetotactic α-Proteobacterium form a distinct, monophyletic clade (Fig 7A). Each protein has a degenerate catalytic triad, a nonglycine residue at position 193, and a bulky tryptophan in the L1 loop. The metal binding positions in LC and helix 2 also appear conserved. Although one strain has an aspartate at position 263, aspartates are also common metal coordinating residues, and we predict that this H-D motif can also participate in metal binding. We conclude that the MamO family evolved specifically in α-Proteobacterial magnetotactic organisms to be metal-binding pseudoproteases (S8 Fig). In addition to the MamO proteins, we identified a second magnetosome-specific clade in the trypsin tree (Fig 7A). We named this group the MamE clade because it contains the MamE sequence from the MAI of each representative of the α-Proteobacteria. This group also features two predicted trypsin-like proteases from the MAI of each species of the δand γ-Proteobacteria, renamed MamE1 and MamE2 here (S3 Table). The sequence phylogeny indicates that the δand γ-Proteobacteria both experienced recent duplications of their MamEs An Inactive Serine Protease Regulates Biomineralization independently. In fact, there appears to have been two rounds of duplication in the δfamily. Strikingly, each duplication event has led to a degenerate catalytic triad in one of the sequences, showing that duplication and loss of function occurred three separate times in the MamE family (Figs 7B and S9). While these inactive MamEs do not have the dihistidine motif identified in MamO, we cannot rule out the possibility that they bind metals by another mechanism. Regardless, our results imply the existence of selective pressure for pairing active and inactive proteases as each major clade of magnetotactic bacteria has evolved this feature independently.

Discussion
Magnetotactic bacteria control the growth of their associated magnetite crystals with a level of precision that cannot be replicated in vitro. The molecular details of how they perform this task can reveal novel bioinorganic interfaces and be exploited for improved synthesis of nanomaterials. Genetic analysis has shown that magnetite biomineralization is surprisingly complex. It requires over 15 factors in AMB-1, nearly all of which are predicted integral membrane proteins [2,19,34]. A subset of these is required for the initial crystallization of iron within the magnetosome compartment [10]. While the key players for this step are known, their biochemical functions have only been inferred from sequence homology, leaving the mechanism of how magnetite biosynthesis begins a mystery.
Here, we examined two magnetite nucleation factors: the putative HtrA proteases MamE and MamO. We find that the presence of both MamE and MamO is required for the proteolysis of three biomineralization factors, MamE, MamO, and MamP. These events depend on an intact catalytic triad from MamE but not MamO, indicating that MamO activates MamE in a noncatalytic manner. Thus far, we have not been able to detect a physical interaction between MamE and MamO, but we have found that the C-terminal TauE-like transporter domain of MamO is required for activation. The putative ion transport activity of this domain could be the feature that promotes proteolysis. This model is attractive because it does not require a direct interaction between the two proteins. Indirect evidence suggests that TauE family proteins transport sulfite or sulfur containing organic ions, leading us to speculate that the concentrations of specific solutes in the magnetosome might control MamE's activity [35,36].
Separately, we discovered that a metal-binding function in the protease domain of MamO is required for the initiation of magnetite biomineralization. In our structure, H148 and H263 directly coordinate a single metal ion. Disrupting these residues alters the binding behavior in a modified tmFRET assay, but the effect is unusual in that it lowers the FRET efficiency while slightly increasing the overall affinity for metal. Using the Förster equation and reported radius for Ni 2+ , we calculated that the fluorophore to metal distance changes from 12.8 Å in wild-type to 15.2 Å in the H148A/H263A mutant, suggesting that the binding geometry changes in a way that allows metal binding in the same vicinity [37]. We favor the explanation that the dihistidine motif identified here is part of a more complex metal coordinating network. Our soaking strategy may have missed additional sites that are inaccessible in our crystal form, and separate attempts at characterizing a fully metal-bound state using co-crystallization have been unsuccessful. Nevertheless, we identified dihistidine motif using the structure that contributes to an unexpected transition metal-binding activity in MamO.
Despite the presence of other binding features, metal binding through H148 and H263 is absolutely required in vivo as disruption of either residue completely abolishes magnetite formation. Our structural analysis shows that MamO has lost its ability to perform proteolysis altogether, supporting the idea that metal binding is now the central function of the protease domain. Consistent with this, T225, the predicted catalytic nucleophile, is completely dispensable for biomineralization. Though disrupting H116 and D149 in the predicted catalytic triad causes conditional crystal maturation defects, magnetite nucleation is not affected. Interestingly, H116 and D149 participate in a hydrogen bond on the opposite face of loop LC from the H148/ H263 metal binding motif, suggesting that the conditional phenotypes could be due to temperature-dependent flexibility near the metal binding site (S5 Fig). A potential link between the two motifs is consistent with the reported inhibition of protease activity through a highly analogous metal binding site in the kallikrein family that rearranges the H-D catalytic pair (S5 Fig). While templating of magnetite growth via an interaction between biomineralization factors and the mineral surface has been proposed, our findings with MamO emphasize that direct interactions with individual solute ions also play a role [34,38,39]. One of the most fascinating aspects of MamO's metal ion interaction is that the H148A and H263A forms of MamO maintain the ability to bind metals but cannot support any magnetite biosynthesis in vivo. It appears that binding is insufficient and that the precise coordination geometry must be maintained, leading us to speculate that MamO directly promotes nucleation by guiding individual iron atoms into the magnetite lattice. This model is consistent with the phenotypes observed in vivo, the modest binding affinity and the surface exposed nature of the simple dihistidine motif. Additionally, it agrees with topological predictions for MamO placing the protease domain in periplasm, which is continuous with the magnetosome lumen in AMB-1 [18]. More broadly, our results define an unexpected mechanism for MamO in biomineralization. It appears to have lost the ability to perform serine protease activity and instead performs two noncatalytic functions: direct metal binding to promote magnetite nucleation and activation of MamE's proteolytic activity (Fig 8A).
In addition to the surprising mechanism for MamO in AMB-1, we uncovered a fascinating evolutionary expansion of the trypsin family within magnetotactic bacteria. Our analysis suggests that the ancestral MAI contained a single trypsin-like protease homologous to MamE. The δand γ-Proteobacteria experienced independent duplications of this ancestral enzyme, while the α-Proteobacteria appear to have acquired a second, distantly related trypsin-like protease. Despite these different origins, having two redundant proteases seems to have allowed one copy to lose its catalytic ability in all three clades (Fig 8B). In α-Proteobacteria, MamO specialized to promote biomineralization through the two noncatalytic activities identified here. While the pseudoproteases in the other clades remain uninvestigated, the fact that inactive copies are retained strongly suggests that they also play important noncatalytic roles. The pathway that led to convergent evolution of pseudoproteases in magnetotactic organisms highlights the critical role duplication and redundancy play in facilitating diversification of protein function [40].
Perhaps more intriguing is the fact that MamO's metal binding motif is placed at the same site on the chymotrypsin fold as the highly analogous zinc-binding site seen in the distantly related kallikreins [31,32]. This hints toward the possibility that the ability to bind metals may be a latent biochemical function carried within the fold. Such activities are absent in specific evolutionary states of a protein but can quickly surface under selective pressure [41,42]. Consistent with this, trypsin-like proteases utilize catalytic residues on loops that are well separated from the core, a property termed fold polarity that correlates with the capacity for functional diversification [43,44]. Perhaps neofunctionalization of the trypsin scaffold within magnetotactic organisms is due to an inherent stability and adaptability in the fold that makes it a useful building block for biochemical innovation.

Strains, Plasmids, and Culture Conditions
The strains and plasmids used in this study are listed in S4 and S5 Tables, respectively. For general maintenance and genetic manipulation, M. magneticum AMB-1 was grown in MG medium supplemented with ferric malate (30 μM). 0.7% agar was used in plates, and kanamycin was used for antibiotic selection at a concentration of 7 μg/mL (solid) or 10 μg/mL (liquid). For sucrose counterselection, MG plates contained 2% sucrose. Cultures for magnetic response measurements, western blotting, or TEM were grown in 10 mL MG medium containing 25 mM HEPES buffer (pH 7.2) and ferric malate under a 10% oxygen atmosphere. For comparing the temperature dependence of magnetic response, the strains were treated as above, except that they were grown under anaerobic atmosphere. The magnetic response of each culture was assessed using the Coefficient of Magnetism (C mag ), which was measured as described [18].

Genetic Manipulation
For complementation of deletion mutants, we used a modified form of pAK253 [10]. This plasmid contains a neutral region of the AMB-1 genome and integrates as a single copy at this site. Each allele is inserted under the control of the mamAB promoter, allowing the constitutive expression of each protein. To create the ΔmamEΔmamOΔR9 (ΔEΔO) strain, plasmid pAK243 (sacB-based counterselection system for deleting mamE) was transformed into strain AK94 (ΔmamOΔR9), and initial integrations were selected using kanamycin. The resulting strains were grown to stationary phase in MG medium without antibiotic selection and plated on sucrose for counterselection. Deletion of mamE was confirmed by antibiotic sensitivity, PCR analysis, and complementation of the magnetic phenotype upon reintroduction of both mamE and mamO.

Protein Analysis
Cultures of AMB-1 were grown to late-log phase and harvested by centrifugation at 6k x g. The resulting pellets were resuspended in PBS. The cell suspensions were mixed with an equal volume of 4x SDS Loading buffer and heated for 10 min at 75°C. The lysates were separated on SDS-PAGE and transferred to PVDF. The membranes were blotted and visualized using standard western blotting techniques. Polyclonal antibodies to MamE and MamP were raised in rabbits against recombinant forms of the soluble portion of each protein [25]. The monoclonal anti-FLAG antibody was purchased from Sigma.

Transmission Electron Microscopy (TEM)
For TEM, cultures were grown to late-log phase. 1 mL from each culture was pelleted at 16k x g, and the pellet was resuspended in the residual medium. Cell suspensions were spotted on formvar-coated copper grids, rinsed, dabbed dry, and stored at room temperature until imaging. Imaging was performed with a FEI Tecnai 12 TEM at an accelerating voltage of 120 kV. For each strain, 15-20 cells totaling >200 crystals were analyzed.

Protein Purification
For crystallography, the frozen cell suspension was thawed on ice, lysed by sonication, and clarified at 13,000 x g for 30 min. The supernatant was loaded on a 3 mL Ni-NTA column, which was then washed with five column volumes of Buffer B (25 mM Tris-HCl pH 7.4, 500 mM NaCl, 10 mM Imidazole, 10% glycerol). The protein binds to Ni-NTA with its native metal binding site. After elution with one column volume Buffer C (25 mM Tris-HCl pH 7.4, 250 mM NaCl, 250 mM Imidazole, 10% glycerol), the sample was dialyzed overnight against Buffer D (25 mM Tris-HCl pH 8.0, 50 mM NaCl, 10% glycerol, 0.5 mM EDTA) and loaded onto a 1 mL HiTrap CaptoQImpRes column. The column was developed to Buffer E (25 mM Tris-HCl pH 8.0, 1 M NaCl, 10% glycerol), and the peak fractions containing the MamO protease domain were pooled, exchanged into Storage Buffer (25 mM Tris-HCl pH 7.4, 300 mM NaCl, 10% glycerol), concentrated to approximately 20 mg/mL and frozen in liquid nitrogen for storage at -80°C.
For tmFRET, the expression was performed as above except that the various mutants of the MmMamO protease domains were expressed as fusions to a C-terminal strepII tag. The lysate was prepared as above and loaded onto a 1 mL StrepTrap HP column, which was washed with five column volumes of Buffer F (25 mM Tris-HCl pH 7.4, 250 mM NaCl, 10% glycerol) and eluted with three column volumes of Buffer F supplemented with 2.5 mM desthiobiotin. The eluate was concentrated and loaded onto a 16/60 Superdex 200 column and developed in Storage Buffer. The peak fractions were pooled, concentrated, and used immediately for fluorescent labeling. The purity of all proteins was confirmed by SDS-PAGE.

Crystallization and Structure Determination
Frozen aliquots of untagged MamO protease domain were thawed on ice and exchanged to Buffer G (20 mM Tris-HCl pH 7.4, 150 mM NaCl, 5% glycerol, 0.5 mM DTT) while adjusting the protein concentration to 5 mg/mL in a 10 kDa cutoff Amicon ultrafilter. Crystals grew in the hanging drop vapor diffusion format after mixing the protein with an equal volume of well solution (50 mM Na-Acetate pH 4.6, 3.6 M NH 4 Cl, 5% glycerol) and equilibrating against 1 mL well solution at 18°C. Cubic crystals of MamO appeared in 1-2 d and grew to their full size after 4-6 wk. Each crystal was cryoprotected with a solution of 50 mM Na-Acetate pH 4.6, 3.6 M NH 4 Cl, and 22% glycerol before plunge-freezing in liquid N 2 .
For nickel soaking, NiCl 2 was added directly to drops containing fully-grown crystals for a 10 mM final concentration. The wells were resealed and the crystals soaked overnight. At harvest, each crystal was passed through three 1 min soaks in a buffer containing 50 mM Tris-HCl pH 8.0, 3.6 M NH 4 Cl, 22% glycerol, and 10 mM NiCl 2 before plunge-freezing.
Diffraction data was collected on beamline 8.3.1 at the Advance Light Source (Lawrence Berkeley National Laboratory, Berkeley, California). Indexing and scaling was performed using HKL2000 [45]. The apo-structure was solved by molecular replacement in Phaser with the protease domain of EcDegP (1KY9) as a search model [46]. The Ni-bound structure was solved using the apo-MamO structure as a molecular replacement search model. Model building and refinement were carried out with alternating cycles of COOT [47] and phenix.refine [48]. Placement of the Ni ion was confirmed with an anomalous map from SAD data collected at the Ni absorption edge.

Fluorescein-5-Maleimide (F-5-M) Labeling
For fluorescent labeling, variants of the MamO protease domain were diluted to 50 μM in 1 mL of Buffer H (50 mM NaPhosphate pH 7.2, 150 mM NaCl, 10% glycerol). Fluorescein-5-maleimide (dissolved at 50 mM in DMSO) was added to a final concentration of 1 mM, and the reaction was incubated overnight at 4°C. The reaction was quenched with DTT, exchanged into Buffer D using a PD-10 column, and loaded on a 1 mL HiTrap Q FF. The column was developed to Buffer E, and each protein eluted as a single fluorescent peak. The peak fractions were pooled, treated with 1 mM EDTA to remove any trace metal, exchanged extensively into Chelex-treated Storage Buffer, and frozen in small aliquots. Labeling efficiency was measured based on A 492 /A 280 and was between 50% and 55% for all preparations.
tmFRET For metal binding experiments, all buffers were prepared in acid-washed glassware and treated with chelex resin. Each protein was diluted to 60-80 nM in fluorescence buffer (10 mM Tris-HCl pH 8.0, 150 mM NaCl). Metal solutions were prepared at 10X concentration in chelex-treated H 2 O. Protein was dispensed into 96-well plates, and the plates were scanned for fluorescence emission in a Tecan Infinite 200 plate reader in top-read mode (Ex: 492 nm; Em: 505-570 nm). Metal solutions were then diluted into each well at the appropriate concentration and the plate was rescanned. Due to the spontaneous oxidation of ferrous iron in ambient atmosphere, all iron binding experiments were performed in the absence of oxygen. For iron binding, all solutions were prepared using anoxic liquids. The samples were prepared under anoxic atmosphere in a clear-bottom 96-well plate. To prevent introduction of oxygen, the wells were sealed by covering the plate with a black adhesive cover. The plate was then removed from the anaerobic chamber and scanned using bottom-read mode.
Each measurement was performed on independently prepared solutions in quadruplicate. The metal-quenched fluorescence spectrum from each well was normalized to the fluorescence before metal addition. F metal /F represents the normalized fluorescence averaged from an 11 nm window around the peak. After plotting F metal /F as a function of metal concentration, each curve was fit to a two-site model (below), in which K d2 and E represent the dissociation constant and FRET efficiency for metal binding by MamO, respectively. K d1 represents a nonspecific, solution-based quenching component [37].

Ni-NTA Binding Assays
StrepII-tagged forms of the MamO protease domain were purified as described for the tmFRET experiments with the omission of the F-5-M labeling and ion exchange steps. Proteins were diluted to approximately 15 μM in Column Buffer (25 mM Tris-HCl pH 7.4, 300 mM NaCl, 5 mM Imidazole, 10% glycerol) and loaded onto a 0.5 mL Ni-NTA column equilibrated in column buffer. The column was washed with ten column volumes of Column Buffer and eluted with two column volumes Buffer C. Binding was assessed by separating the fractions on 12% SDS-PAGE and staining with Coomassie Blue.

Phylogenetic Analysis
To understand the phylogeny of Mam proteases, we took a broad approach, characterizing their location within the trypsin-like protease family. The Trypsin_2 Pfam (PF13365) was used to generate a hidden Markov model (HMM) that was used to search a database composed of protein sequences from approximately 2,100 bacterial and archaeal genomes along with the Mam proteases [49]. This search was performed using hmmscan (HMMER3.1b1, hmmer. janelia.org), retaining all hits with an e-value for the entire sequence less than 10 −5 , identifying 6,104 proteins. The hits were clustered using CD-HIT with a sequence similarity cutoff of 0.8, yielding 3,431 sequences [50]. These were aligned using MUSCLE (v3.8.31) with the maxiters parameter set to 2 [51]. The resulting alignment was trimmed using Gblocks, using parameters appropriate for divergent datasets as described by Sassera et al. [52,53]. This alignment was then used to generate a phylogeny using FastTree 2 (v2.1.7) with the default settings [54]. However, this alignment contained only six informative positions. To improve the quality of the alignment, we iteratively removed long branches from the tree and regenerated the alignment. After removing 90 taxa over four iterations, we settled on an alignment with 18 conserved positions.
Two clear branches emerged from this analysis. One branch ('the Deg branch') contained DegP, DegQ, and DegS sequences from the γ-Proteobacteria, in addition to most of the MamE sequences. The bottom branch contained the YdgD sequence from Escherichia coli. We extracted the sequences from the Deg branch (843 sequences) and generated an alignment (46 positions) and tree using the methods described above (Fig 7A). In this tree, the MamE sequences appeared to be closely related, but their phylogenetic relationships were ambiguous. Additionally, we realized that a subset of δ-proteobacterial trypsin-like sequences were falsely excluded from the MamE branch due to substitutions in the catalytic triad that obscured the phylogenetic signal in the short alignment. Additional phylogenetic testing confirmed the relationship of these four sequences to the canonical MamE sequences, so they were merged into the MamE branch.
We extracted the MamE sequences and aligned them using five DegS sequences from the γ-Proteobacteria, which appeared to be closely related based on the phylogeny in Fig 7B. This alignment was much larger (172 positions), and the resulting tree from FastTree 2 gave wellsupported interior nodes for the MamE branch. We used two other phylogenetic methods to test this phylogeny. First, we used ProtTest 3.0 to select the best substitution matrix (in this case, the WAG model) and performed 100 independent inferences with 300 bootstraps in RAxML [55,56]. Secondly, we used the nonparametric Monte Carlo Markov Chain algorithm PhyloBayes 3 to generate a tree not based on prior assumptions about the site-specific evolution of the MamE sequences [57]. A summary tree integrating the results from PhyloBayes and RAxML is depicted in Fig 7B. Additionally, we rooted the MamE branch to five closely related sequences from the Clostridiales, and this, too, strongly supported the phylogeny in Fig Table. Suggested renaming of the MamE clade. The naming in a previous comparative genomic study is unconventional due to confusion over analogy to the MAI in α-Proteobacteria [12]. Our phylogenetic analysis clarifies the ancestry and allows us to use accepted nomenclature. (DOCX) S4