Crystal Structure of the Escherichia coli Fic Toxin-Like Protein in Complex with Its Cognate Antitoxin

FIC domain proteins mediate post-translational modifications of target proteins, which typically results in their inactivation. Depending on the conservation of crucial active site residues, the FIC fold serves as structural scaffold for various enzymatic activities, mostly target adenylylation. The founding member of the vast Fic protein family, EcFicT, was identified in Escherichia coli some time ago. The G55R point mutant of EcFicT displays the “filamentation induced by cAMP” (Fic) phenotype at high 3',5'-cyclic adenosine monophosphate (cAMP) concentrations and elevated temperature, but the underlying molecular mechanism and any putative biochemical activity of EcFicT have remained unknown. EcFicT belongs to class I Fic toxin proteins that are encoded together with a small inhibitory protein (antitoxin), named EcFicA in E. coli. Here, we report the crystal structures of two mutant EcFicT/EcFicA complexes (EcFicTG55RA and EcFicTAE28G) both showing close resemblance with the structure of the AMP-transferase VbhT from Bartonella schoenbuchensis in complex with its cognate antitoxin VbhA. However, crucial differences in the active site of EcFicT compared to VbhT and other AMP-transferases rationalize the lack of evidence for adenylylation activity. Comprehensive bioinformatic analysis suggests that EcFicT has evolved from canonical AMP-transferases and has acquired a conserved binding site for a yet to be discovered novel substrate. The G55R mutation has no effect on structure or thermal stability of EcFicT, such that the molecular basis for its associated Fic phenotype remains elusive. We anticipate that this structure will inspire further bioinformatic and experimental analyses in order to characterize the enzymatic activity of EcFicT and help revealing its physiological role.


Introduction
Filamentation induced by cAMP (Fic) was discovered in the early 1980s by Utsumi et al. [1] as a phenotype caused by the temperature sensitive allele fic-1 of the fic gene of Escherichia coli. The G55R mutation encoded by the fic-1 allele caused cell filamentation at non-permissive temperature (43°C) in the presence of high concentration of cAMP (1.5 mM) in the culture medium [2][3][4][5]. In their reports, they showed that knock-out strains of the fic gene were viable as wild-type. In contrast, bacteria expressing the fic-1 allele (G55R) displayed a filamentation phenotype, suggesting a gain of function mutation [3,4]. A complementation of the Δfic knockout mutant with the fic-1 allele also resulted in bacterial filamentation. Yet, the biochemical function of the E. coli Fic protein remained unclear.
A few years ago, a biochemical activity likely shared by the majority of Fic proteins has been identified by in vitro and in vivo analyses combined with mass spectrometry [6][7][8][9]. Representative members of FIC domain proteins containing the canonical HxFx[D/E]GNGR 1 xxR 2 motif were found to catalyze adenylylation (also known as AMPylation), the transfer of an AMP moiety from the ATP substrate onto a hydroxyl side-chain of a target protein. Crystal structures showed that the ATP substrate interacts directly with arginines R 1 and R 2 and with the central asparagine [9]. In addition, the substrate is indirectly bound to the aspartate or glutamate residue of the Fic active site motif via a magnesium ion. The histidine is thought to constitute the general base for deprotonation of the target hydroxyl side chain to allow in-line nucleophilic attack onto the α-phosphorous of ATP. However, Fic proteins are versatile enzymes in that they can also catalyze other post-translational modifications depending on their active site signature motif and the nature of the substrate binding pocket. Recent studies reported phosphocholination and phosphorylation (kinase) activity for AnkX [10] and Doc [11,12], respectively.
Catalytic activity is usually inhibited by a conserved inhibitory α-helix (α inh ) that partly obstructs the ATP binding site. The α inh can either be part of the same polypeptide chain (located relative to the Fic domain N-terminal in class II Fic proteins and C-terminal in class III Fic proteins, respectively) or provided by a separate small protein known as the antitoxin (class I Fic proteins) [9]. The α inh helix contains a [S/T]xxxE[G/N] sequence motif with the invariant glutamate forming a salt-bridge with R 2 of the FIC motif [9].
Structure/function studies on class I Fic proteins (designated as FicT toxins) and their cognate antitoxins (designated as FicA) have focused so far on the toxin-antitoxin (TA) module VbhTA of Bartonella schoenbuchensis [9,[13][14][15]. VbhT was shown to interfere with bacterial growth by adenylylation and concomitant inactivation of DNA gyrase and topoisomerase IV [15]. The VbhA antitoxin inhibits the catalytic activity of VbhT by direct interaction, and vhbA-vbhT is encoded in an operon like other ficA-ficT gene pairs. VbhTA and other FicTA systems thus satisfy the key definition of a type II TA [15].
The fic gene of E. coli (renamed as ecficT) is preceded by the small upstream gene yhfG (renamed as ecficA) that encodes a FicA antitoxin homolog and thus shares typical characteristics of FicTA modules. The bona fide antitoxin EcFicA contains a canonical [S/T]xxxE[G/N] inhibition motif and has been shown to interact with EcFicT [15,16]. Even upon mutation of the inhibition motif, we were unable to demonstrate any auto-or target-adenylylation activity catalyzed by EcFicT [15]. Similarly, any cellular target(s) and the mechanisms underlying "filamentation induced by cAMP" have remained unknown.
Here, we describe the crystal structure of EcFicT in complex with its cognate antitoxin EcFicA. Phylogenetic analysis reveals that enterobacterial FicT toxins constitute a distinct subgroup that originated by vertical transmission and may have evolved a new molecular and biological function. The crystal structure of the EcFicT/EcFicA complex reveals an extended ligand-binding pocket that is strictly conserved within this EcFicT-like subfamily. The precise definition of this newly observed substrate-binding pocket will serve as a basis for in silico docking experiments that will guide potential ligand identification and further bioinformatical and evolutionary analyses of the Fic toxin-antitoxin family.

Phylogenetic analysis
The E. coli K12 MG1655 FicT protein (EcFicT; Uniprot: P20605) was used as reference sequence for comparison with similar protein sequences (>50% identity) from the UniProt Reference Clusters (UniRef) (702 sequences). For the phylogenies shown in the Supporting Information, the protein list was shortened to 107 sequences by removing redundant entries. For the phylogenies shown in the main figures, the protein list was further reduced to 47 sequences by applying a cut-off of 90% identity. Sequence alignments were created using Clus-talW (implemented in Geneious v7.1.7; Biomatters) and manually curated. Subsequently, gapped positions were removed and a maximum likelihood phylogeny was constructed using PhyML implemented in Geneious (LG substitution model as suggested by ProtTest3 [17], 100 bootstraps), including two distant class I Fic proteins, YeFicT of Yersinia enterocolitica str. 8081 and VbhT of B. schoenbuchensis. The phylogeny was rooted using VbhT as outgroup. Phylogeny illustrations were processed using FigTree v1.4.2 (http://tree.bio.ed.ac.uk/software/ figtree). The sequence dataset described above was submitted to the SPEER server using the SCI-PHY algorithm for automated subgrouping and a relative entropy value of 1.0 [18].
WebLogos for subgroups 1 and 3 were generated using the WEBLOGO server [19]. Additionally, the overall conservation was monitored.
Analysis of the genetic loci of EcFicT homologs was performed using the www. MicrobesOnline.org web tool [20] with a limit at 50 homologs and a clustering at 95% identity. Only one representative locus is shown per species.

Differential scanning fluorimetry analysis
Temperature-induced unfolding was monitored by differential scanning fluorimetry experiments [21] performed on a Qiagen Rotor-gene Q real-time thermocycler as described previously [22].

Data collection and processing
For data collection, crystals were cryo-protected by transfer into a reservoir solution supplemented with 20% glycerol and subsequently vitrified in liquid nitrogen. X-ray data were collected at the Swiss Light Source (Villigen, Switzerland) on beamline X06SA (PXI) and X06DA (PXIII) at 100 K and a wavelength of 1.000 Å. Diffraction data were indexed and integrated using XDS [23] and subsequently merged and scaled using XSCALE [23] or AIMLESS [24]. EcFicT G55R A and EcFicTA E28G diffracted to 2.0 Å and 2.4 Å, respectively. Data collection statistics are summarized in Table 1.

Structure determination
The structures were solved by molecular replacement using PHASER [25]. The initial model of the antitoxin was built using AUTOBUILD of the PHENIX package [26]. The complex models were improved by iterative rounds of interactive model building using COOT [27] and refinement using REFMAC5 [28] or PHENIX.REFINE [29]. Final refinement statistics are given in Table 2.

Accession numbers
Coordinates and structure factors have been deposited in the Protein Data Bank with accession numbers 5JFF and 5JFZ for EcFicT G55R A and EcFicTA E28G , respectively.
Purified EcFicTA complexes were subjected to differential scanning fluorimetry (DSF) to determine their melting temperature (T m ). Wild-type EcFicTA appears rather stable with a T m of 63°C (green curve, Fig 1B). The G55R mutation that has been shown originally to cause in vivo the Fic phenotype [1] does not affect the melting temperature of the complex (EcFicT-G55R A variant, blue curve, Fig 1B) despite additional intra-molecular interactions (see below).
The melting curve appears monophasic. Thus, it is likely that the two proteins melt cooperatively, but we cannot exclude that both proteins have similar individual melting temperature. Strikingly, mutation of the putative inhibitory glutamate residue E28 (EcFicA E28G ) decreases the melting temperature of the complex by 10°C compared to the wild-type complex ( Fig 1B). The peaks of the first derivative of the melting curve of EcFicTA E28G and EcFicT G55R A E28G appear to broaden towards lower temperature, indicating that the E28G mutation may weaken the complex stability, resulting in a partial dissociation of EcFicA E28G from EcFicT.

Structure determination of EcFicT/EcFicA complexes
Crystallization trials of the wild-type EcFicTA complex did not yield any detectable crystal. However, the EcFicT G55R mutant in complex with EcFicA (EcFicT G55R A) yielded crystals that diffracted to 2.0 Å. Crystals belonged to the Laue group 6/m with unit cell parameters a = 104.69 Å, b = 104.69 Å and c = 110.52 Å. Systematic absences along the c Ã -axis (0,0,l) indicated space groups P6 1 or its enantiomorph P6 5 . Unit cell parameters and space group suggested two 1:1 complexes per asymmetric unit (2.60 Å 3 /Da). The structure of EcFicT G55R A was determined at 2.0 Å resolution (Table 1) by molecular replacement using the structure of VbhT (33% sequence identity to EcFicT) from the VbhTA complex (PDB: 3ZC7) as search model (Fig 2A). All possible space groups were tested for molecular replacement using PHA-SER [25]. The best solution (TFZ = 11.6 and LLG = 206) was obtained in space group P6 5 yielding two EcFicT molecules in the asymmetric unit. Initial refinement (R work = 43.6%, R free = 47.0%) showed additional α-helical structure (positive Fo-Fc map) in the groove of the Fic protein next to the active site, where usually the α inh helix is found [9,13]. After automatic model building of this part followed by several cycles of iterative model building and refinement, an almost complete full model of the EcFicT G55R A complex was obtained with R work and R free values of 17.7% and 20.4%, respectively. Final refinement statistics and completeness of the model are given in Table 2.
For EcFicT in complex with the antitoxin mutant EcFicA E28G , crystals of the monoclinic space group C2 were obtained that diffracted to 2.4 Å, containing three toxin-antitoxin complexes per asymmetric unit. The structure of EcFicTA E28G was solved by molecular Numbers in parentheses refer to the outer shell. * R work = ∑ hkl || F obs |-|F calc || / ∑ hkl |F obs | ** R free is the R value calculated for 5% of the data set that was not included in the refinement.   Tables 1 and 2, respectively. For both structures determined in this study, the geometry of the final model was assessed using MolProbity [30] showing >99% of the residues in the core and allowed regions of the Ramachandran plot.

Structure analysis of the EcFicT/EcFicA complexes
Despite distinct crystal packings, the structures of EcFicT G55R A and EcFicTA E28G are virtually identical (root mean square deviation (rmsd) of 0.59 Å for 232 Cα-atoms). In particular, the interactions formed between the Fic protein and the antitoxin are the same, giving confidence that they are physiological and also occur in solution. Biological relevance of this interface was also corroborated by EPPIC (Evolutionary Protein-Protein Interface Classifier [31]) analysis.
No other interfaces of potential biological significance were identified EcFicT is highly similar to other Fic domain structures. In particular, EcFicT G55R superimposes very well with the structure of VbhT from B. schoenbuchensis and BepA from Bartonella henselae, with an rmsd of 1.36 Å and 1.65 Å for 187 and 189 Cα-atoms, respectively. Noteworthy, the pairwise sequence identity of EcFicT with both VbhT and BepA is rather low (33% and 32%, respectively). EcFicT forms a seven α-helices bundle with the Fic signature motif located between α4 and α5. The flap is located between helices α2 and α3 and forms a beta-hairpin (S1A Fig) that has been previously shown [32] or predicted to be involved in target docking [13,33,34]. The Fic core, as defined by pfam (pfam 02661 [35]), includes residues 54-157 and comprises helices α2 to α5 (S1A Fig).
The EcFicA antitoxin tightly embraces EcFicT forming mainly hydrophobic interactions, as observed also in the VbhA/VbhT complex (Fig 2). Yet, EcFicA adopts a unique conformation, forming two parallel α-helices linked by a 10-residues loop (Fig 2A) compared to VbhA that forms three anti-parallel α-helices (Fig 2C). The second helix of VbhA is degenerated to a loop in EcFicA. Sequence alignment reveals an eight amino acids deletion in EcFicA compared to VbhA (S2 Fig). Furthermore, the first helix of EcFicA (α1) that is homologous to the α inh helix is kinked and two amino acids longer than the corresponding helix of VbhA. The kink occurs at the same position as in PhD, the antitoxin of the Doc toxin from Bacteriophage P1, although the angle and direction are different (S1B Fig).
The SRRLEG segment at the C-terminal end of the first helix of EcFicA ( Fig 2B) conforms to the canonical [S/T]xxxE[G/N] inhibition motif and protrudes into the EcFicT active site as in the VbhTA complex. S24 and E28 of EcFicA form, respectively, H-bond and salt-bridge interactions with R146 (R 2 ) of the canonical Fic signature motif (compare Fig 2B with Fig 2D). Additionally, R25 of EcFicA interacts with T40 of EcFicT ( Fig 2B). An interface of 1641 Å 2 is formed between EcFicT and EcFicA, corresponding to a gain in free energy (Δ i G) of -19.6 kcal/ mol upon formation of the interface as calculated by the PISA web server [36]. This interface is mainly hydrophobic: 13 residues of EcFicA (out of 33) are hydrophobic and contribute to a buried surface area of 727 Å 2 and 17 residues of EcFicT (and out 44) are hydrophobic and contribute to a buried surface area of 710 A 2 . This large and hydrophobic interface is consistent with the stable complex that has been observed in solution (Fig 1).

Phylogenetic analysis of EcFicT homologs
In order to obtain insight into the functional evolution of EcFicT, we generated a multiple sequence alignment comprising 702 sequences of EcFicT homologs down to 50% sequence identity, which was reduced to 102 sequences by manually removing redundant sequences (S3 Fig). Interestingly, these close EcFicT homologs all belong to Enterobacteriaceae, indicating that vertical transmission and not horizontal gene transfer played a major role in their evolutionary history. This hypothesis is strongly supported by the finding of largely syntenic loci for all close enterobacterial homologs of EcFicT, but no conserved genetic associations for other, distantly related, FicTA modules (S4 Fig). The inferred vertical transmission of the genes coding for EcFicT-like proteins contrasts with the situation found for type II toxin-antitoxin modules that, during evolution, are typically transmitted via horizontal gene transfer [37]. The same has been recently suggested for FIC domain proteins in general [38].
The phylogenetic relation between the sequences of the multiple sequence alignment as derived from a maximum-likelihood analysis is shown in S3 Fig, and a reduced version of the tree (after filtering out redundant sequences with > 90% identity) is given in Fig 3A. The phylogeny was rooted using VbhT, a very distantly related FicT toxin of B. schoenbuchensis, and additionally contains the distinct, yet enterobacterial YeFicT toxin for comparison [15]. These two proteins and ancestral branches of the tree (subgroup 1, yellow background in Fig 3A and  S3 Fig) form a group of rather distant EcFicT homologs (sequence identity between 50% and 60%). Most group members harbor the asparagine and both arginines of the canonical HxFx [D/E]GNGR 1 xxR 2 adenylylation-competent motif, and the AMPylation activity of VbhT has been demonstrated [15]. Subgroup 2 (shown in the middle of the tree) is characterized by a specific pattern at the N-terminus. While all the analyzed sequences are of very similar length, a five amino-acids deletion appears at the N-terminus of subgroup 2 sequences. Conspicuously, this group and the EcFicT-like homologs (subgroup 3; displayed at the bottom of the tree with purple background in Fig 3A and S3 Fig) have signature motifs that differ from the ancestral state in subgroup 1. Compared to the canonical sequence, a small to medium-sized hydrophobic residue invariably substitutes arginine R 1 that in AMP transferases is heavily involved in ATP binding [9,13] suggesting that these proteins are not adenylylation-competent and may have evolved a new biochemical function.

Specificity determining positions that distinguish EcFicT from canonical FIC AMP transferases
To validate the subgrouping derived from the phylogenetic analysis and to systematically identify subgroup defining positions (SDPs), we submitted the multiple sequence alignment to the SPEER server (specificity prediction using amino acid's properties, entropies and evolution rate [18]). Four subgroups (I-IV) were identified by SPEER as indicated in S3 Fig that matched almost perfectly the grouping derived from the phylogenetic analysis, though with subgroup 1 being divided in two parts (SPEER groups II and IV).
To reveal the SDPs, we computed the sequence logos for the subgroups of canonical AMP transferases (subgroup 1) and of EcFicT-like proteins (subgroup 3), which are displayed in Fig  3B in the top and bottom row, respectively. While about 40% of the residues are almost strictly conserved amongst the two subgroups (marked in grey on Fig 3B), there are several residues that are conserved only in one or the other subgroup (marked with magenta or red background, respectively). These SDPs are found throughout the entire sequence and their number is considerably larger in the EcFicT-like than in the canonical subgroup (12 magenta vs. 43 red positions). Possibly, this reflects additional evolutionary constraints on the structure and function of the EcFicT-like proteins. While many of the SDPs correspond to buried residues (see also Fig 3C) that, thus, would have maintained their specificity for structural reasons, there are also several positions that are exposed and probably related to function. Latter positions map to (i) the canonical ligand-binding pocket, (ii) the flap that is expected to interact with a putative target, and (iii) the N-terminus (Fig 3C). Some additional positions were manually identified (shown in green). These positions are slightly variable in each subgroup and, therefore, not picked up automatically using a sequence identity threshold.
Consistent with our preliminary assignment as canonical AMP transferases, subgroup 1 exhibits an almost canonical HPFR 0 (D/E)GNGR 1 xxR 2 signature motif. The R 1 position is clearly a SDP for that group and contrasts strongly with the small to medium-sized hydrophobic residue in EcFicT-like proteins (subgroup 3). The conserved central asparagine residue is found mutated to a serine in about half of the EcFicT-like sequences. The full phylogeny of subgroup 1 (S3 Fig) reveals that this asparagine to serine mutation has occurred several times independently. An asparagine or a serine can most likely adopt this position without altering the H-bond interaction with the main-chain amide of the strictly conserved structural phenylalanine of the signature motif (F137 of EcFicT). Unexpectedly, the D/E residue of the canonical motif (E140 in VbhT) that is involved in Mg 2+ coordination in known structures (Fig 3C, left) is not well conserved in subgroup 1. It would be interesting to further functionally characterize this group with respect to magnesium requirement. In subgroup 3, the equivalent position is almost invariably a leucine or isoleucine.
There are several additional SDPs close to the site that binds ATP in VbhT (subgroup 1, Fig  3C). In VbhT, the aforementioned R 1 (R144) is close to F68; in EcFicT, the corresponding residues are L143 and Y67 (Fig 3C). In VbhT, a tyrosine at position 68 would clash with arginine R 1 . It is noteworthy, that Y67 of EcFicT is interacting indirectly with R25 of the antitoxin via a water molecule. Removal of the antitoxin may open up a binding site for a yet unidentified substrate potentially involving this tyrosine.
Strikingly, a highly conserved glutamate of subgroup 1 (E35 of VbhT) is mutated to alanine (A36 of EcFicT) in most subgroup 3 sequences. The glutamate interacts with R 1 and thus, the glutamate to alanine change may represent co-variation with the R 1 to hydrophobic amino acid mutation (Figs 3C and 4). Finally, a strict tyrosine to phenylalanine change is apparent in position 72 of VbhT (71 in EcFicT), bordering the signature loop. In summary, next to the signature loop, EcFicT-like proteins exhibit an extended groove lined by hydrophobic residues (A36, F71, V139) and bordered by the polar residues Y67 and T40 with exposed hydroxyl groups (Fig 4). All 5 positions are SDPs, i.e. specifically conserved is subgroup 3. It is well conceivable that EcFicT-like proteins have evolved this binding pocket to accommodate a substrate distinct from ATP used by canonical AMP transferases (S5 Fig). Also at the flap and close to it, several residues are well-conserved in the EcFicT-like subgroup. In particular, E79 and Y83 are interesting, since they are exposed and may provide specificity for the interaction with a yet unidentified target protein. The strictly conserved alanine and methionine at positions 178 and 179, respectively, are located just below the flap and may thus be important for proper positioning of the flap for target recognition and registration of the modifiable segment. The overall analysis of the conservation of amino acids at the surface of EcFicT-like Fic proteins reveals two well-conserved patches: the first one located below the flap (residues 123, 126, 130, 133, 175 and 178-180) and a second one on the "back" of the Fic protein (residues 92, 96, 100, 103, 107), see Weblogo on Fig 3B and ConSurf representation on Fig 5. These two patches may be involved in target recognition of a subgroup specific target that remains to be discovered.

Putative regulation of FicT proteins by tyrosine auto-modification
The sequence logos of Fig 3B reveal a DPY motif that is strictly conserved in subgroups 1 and 3. In the crystal structure of EcFicTA, the first eight amino acids of EcFicT are unresolved, but the structure of the DPY motif (residue numbers [10][11][12] is resolved. Fig 6A shows that the aspartate (D10) and tyrosine (Y12) side-chains interact with the side-chain of arginine R138 (4 th residue of the Fic signature motif, referred to as R 0 ) and the main-chain amide of residue 79 (preceding the flap), respectively. The same organization can be seen in the apo VbhTA crystal structure [9]. This interaction may be crucial for the observed fixation of the VbhT N-terminus to the flap (Fig 6B), which is the docking site for the modifiable segment of the target as e.g. observed in IbpA FIC2 /Cdc42 (Fig 6C) [32] and inferred from other Fic structures that show a chain-terminus from an adjacent Fic molecules in the crystal bound to this site [13].
The interaction of the N-terminus with the target dock, which had been over-looked so far, may constitute a physiological auto-inhibition mechanism (N-terminal lock), since, obviously, it would impede productive target binding. In analogy to the relief of auto-inhibition exerted by the auto-modification of the α inh in NmFic and probably most of class III Fic proteins [22], it is tempting to speculate, that also the conserved tyrosine of the DPY motif can get auto-modified to relieve auto-inhibition. It is striking that R 0 of the Fic motif is highly variable in adenylylating Fic enzymes in general, but strictly conserved in subgroups 1 and 3 of Enterobacterial class I Fic proteins (see [15] and also S3 Fig). Noteworthy, the DPY motif is absent in subgroup 2 sequences (S3 Fig), which correlates perfectly with the residue homologous to R 0 being a non-arginine. Thus, the conservation of R 0 in subgroups 1 and 3 may indeed indicate its functional role in the N-terminal lock stabilization.

Effect of the EcFicT G55R mutation
Our crystal structures show that the G55R mutation encoded by the fic-1 allele, which results in the cell filamentation phenotype [3], does not affect the structure of EcFicT (see above and Fig 7A). The Ramachandran angles of R55 (F, C = -120.7, -160.4) are virtually the same as those of the G55 wild-type residue (F, C = -133.6, -155.8 as measured in the EcFicTA E28G structure). The arginine 55 side-chain forms a salt-bridge with glutamate 107 of helix α3 ( Fig  7B), which is conserved in EcFicT-like homologs (Fig 3B). One could speculate that the glycine to arginine mutation may reduce the main-chain flexibility. However, despite the additional salt-bridge, the G55R variant is not more stable than the wild-type protein, as has been shown above by DSF (Fig 1B).  Close-up view of the salt-bridge formed between R55 and E107 in the EcFicT G55R A complex. Note that the two crystal structures were obtained from two different crystal forms (space groups P6 5 and C2 for EcFicT G55R A and EcFicTA E28G , respectively; see Table 1 for details). Effect of the EcFicA E28G mutation EcFicTA E28G crystallized in the space group C2 with three complexes per asymmetric unit. The structure of this mutant is virtually identical to that of EcFicT G55R A, except for disorder of residues 83-86 at the tip of the flap. In EcFicT G55R A, the flap is stabilized by crystal contacts and is fully ordered. The lack of electron density of the flap of the E28G mutant has already been observed in several other FIC domain protein structures (e.g. Bep1 from Bartonella clarridgeiae, PBD: 4NPS or NmFic E102R from Neisseria meningitidis, PDB: 5CGL). We speculate that, in solution, this region gets stabilized only upon substrate and/or target binding and that it is likely to be disordered for EcFicT G55R A too.
In canonical AMP-transferases, the conserved glutamate of α inh is engaged in a salt-bridge with arginine R 2 of the Fic motif, thereby preventing competent ATP binding. Also in EcFicT- G55R A, such an intermolecular salt-bridge is observed (E28-R146). Though the side-chain truncation in EcFicA E28G increases the size of the putative substrate binding site, it may not be sufficient to allow productive substrate binding and relieve the inhibition of EcFicT. Because of the hydrophobic pocket on the left-hand side of the active site that may accommodate a different substrate, de-binding of EcFicA or more drastic mutations may be required to allow ligand binding.

Discussion
The Fic protein from E. coli (now named EcFicT) was identified more than 30 years ago [1] and was shown to be responsible for the filamentation induced by cAMP (fic) phenotype when carrying a G55R amino acid exchange (encoded by the fic-1 allele), but its molecular and biological function remained elusive.
Since then, a number of Fic proteins from other organisms have been shown to catalyze adenylylation of target proteins, resulting in modulation of their functions, e.g. by inhibiting Rho-GTPase pathways or bacterial DNA topoisomerases [6,7,15,39]. The vast majority of adenylylation-competent Fic proteins are expressed in an inhibited form [9]. The location of the inhibition motif lead to the classification of Fic proteins in class I, II and III depending on this motif being contributed by an antitoxin, or found at the N-terminal or C-terminal end of the Fic protein, respectively. According to this classification, EcFicT belongs to class I with the inhibitory helix provided by EcFicA, a small protein coded immediately upstream of the ecficT gene. The synteny of the corresponding genetic loci reveals that the genes were primarily transmitted vertically within the Enterobacteriaceae, suggesting a specific physiological function. This is in contrast to typical type II TA modules that show considerable mobility via horizontal gene transfer, likely related to their biological functions in the context of mobile genetic elements [40].
The crystal structure of the tight EcFicTA complex reported in this study shows that the EcFicT protein adopts the canonical fold with a somewhat degenerated signature motif and that it is embraced by the cognate EcFicA antitoxin very similarly as in the AMP-transferase VbhTA complex [9]. The inhibitory glutamate of EcFicA forms a salt-bridge with the last arginine (R 2 ) of the Fic motif of EcFicT, as in VbhTA (Fig 2B and 2D). Despite the close similarity, the structure suggests that EcFicT most likely does not bind ATP, consistent with the lack of any evidence for auto-adenylylation [15]. In this respect, the presence of a leucine residue instead of the canonical arginine R 1 appears most relevant. R 1 interacts with the β-phosphate of the ATP substrate in the inhibition relieved form of VbhT [13] and with the α-phosphate of CDP-choline in AnkX [41] (S5 and S6 Figs). Still, in EcFicT, binding of a diphosphate moiety to the anion-binding nest at the N-terminus of the helix appears still sterically feasible due to the conservation of the second glycine of the signature motif (Fig 3B). Another salient difference seen in EcFicT is the presence of a hydrophobic residue at the position following R 0 (Fig 3B), which in canonical FIC AMP-transferases is taken by a charged residue to form a direct (lysine, PDB: 4WGJ, SSGCID) or indirect (glutamate via a Mg 2+ ion) interaction with substrate phosphate(s). The adenine-binding pocket as observed in VbhT appears present in EcFicT and the asparagine preceding the Fic signature motif is strictly conserved in EcFicT homologs (subgroup 3, Fig 3B). In the EcFicT G55R A structure, the base-binding pocket is nevertheless partly restricted by the conformation that adopts the glutamine 84 of EcFicT located at the tip of the flap. The flap is a rather flexible part of the protein and likely only stabilizes upon target binding or artificially by crystal contacts in x-ray structures. An alternative rotamer and/or a slight different conformation of the flap would result in an unrestricted base-binding pocket.
The comparative analysis of EcFicT-like and Fic AMP-transferase sequences not only showed that the two aforementioned charged residues of AMP-transferases are replaced by hydrophobic residues in EcFicT-like proteins, but that there are about a dozen subgroup defining positions (SDPs) (Fig 3). Since many SDPs of the EcFicT subgroup are localized close to the signature motif, it appears likely that they are involved in binding of a yet unidentified substrate. If in addition R 2 were also involved in substrate binding, its interaction with the inhibitory antitoxin glutamate may again, as in e.g. VbhTA toxin-antitoxin complex, compromise substrate binding and, thus, inhibit the putative catalytic activity. Additional SDPs of EcFicTlike proteins cluster at the exposed face of the flap and in the N-terminal segment (Fig 3C,  right). These positions may well be conserved for subgroup-specific recognition of a putative target protein.
Phylogenetic analysis showed that EcFicT has evolved out of canonical AMP transferases in the Fic protein family (Fig 3A and S3 Fig). Considering the well-defined and conserved structural features of EcFicT-like proteins, we suggest that during evolution the Fic structure has been reshaped to accommodate a yet unidentified substrate, and to recognize a new range of targets. Interestingly, the histidine, crucial for deprotonation of the incoming target side-chain hydroxyl group, remains conserved in EcFicT-like proteins.
Class I Fic proteins in general show a conserved DPY segment that seems to tether the Nterminal segment to the body of the FIC domain as observed in EcFicT, but also in the VbhT structure (Fig 6A and 6B). Interestingly and underappreciated in our previous work [9], the very N-terminus of VbhT (which is unresolved in the EcFicT structure) is laterally bound to the flap, thereby augmenting the small β-sheet, in a similar manner as observed for target peptide binding (Fig 6C) [32]. This finding suggests that class I Fic protein may have, apart from the inhibitory glutamate preventing competent substrate binding, another auto-inhibitory element competing with target binding. In the light of recent reports that auto-adenylylation can relieve auto-inhibition in Fic proteins [22,42], we note that the conserved tyrosine of the N-terminal DPY motif (Y12 of EcFicT) is buried ( Fig 6A) and that, therefore, its modification with a bulky group may interfere with binding of the N-terminal segment to the target dock and therefore relieve inhibition.
The "filamentation induced by cAMP" phenotype is hallmarked by the inhibition of cell division in an environment of elevated temperature and cAMP concentrations (43°C) and caused by the fic-1 allele of ecficT [1]. The EcFicT G55R A crystal structure revealed that the site of mutation is neither close to the presumed active site nor the presumed target recognition site. The comparison of the thermal stability of EcFicTA and EcFicT G55R A revealed the same melting temperature for both complexes (Fig 2B). But, once dissociated from its cognate partner EcFicA, the melting temperature of EcFicT may be more severely decreased than the one of EcFicT G55R . Furthermore, it has been suggested in a recent review [43] that the G55R mutation may result in "long-range conformational changes that facilitate the dissociation of the corresponding antitoxin". However, we did not observe such effects as the EcFicTA E28G and EcFicT G55R A structures are very similar. Also, the oligomeric state of the EcFicTA protein complex is not affected by the mutation (Fig 2A). Nonetheless, we cannot exclude that upon unbinding of the antitoxin, long-range structural changes may occur. Noteworthy, the Doc toxin (mutant H66Y) forms a sub-domain swapped dimer in absence of its cognate PhD antitoxin [44]. Our structural work does not provide an explanation or potential mechanism for the results obtained for the originally described FIC domain protein EcFicT [1]. Further experiments, such as ligand docking combined with biochemical analyses, may help revealing the role of enterobacterial FIC-domain proteins.

Conclusion
The E.coli Fic protein EcFicT forms a tight complex with its cognate antitoxin EcFicA. EcFicT adopts the conserved Fic fold, but with a distinct active site signature motif and an additional hydrophobic binding pocket in the vicinity of the active site loop. EcFicA forms two parallel αhelices that tightly embrace EcFicT. The structural information provided in this manuscript may serve as a basis to identify new potential substrates of Fic proteins with EcFicT-like active site motif, if combined with the appropriate bioinformatical ligand docking experiments. Still, further in silico, in vitro and/or in vivo analyses are required to understand the activity of the first described FIC domain protein.
The combination of cognate FicA antitoxins and the N-terminal lock may form a doublelock mechanism of regulation for class I Fic proteins. This may be analogous to the doublelock mechanism of class III Fic proteins via oligomerization and auto-adenylylation that we have recently described [22]. Given that all structural features required for the two mechanisms of catalysis regulation are fully conserved in the EcFicT-like subfamily, we find it likely that these proteins are indeed enzymes whose catalytic activity merely remains to be uncovered.
Taken together, we propose that the EcFicT-like subfamily of class I Fic proteins has adapted the conserved FIC domain fold for a new (possibly catalytic) molecular activity in the context of a biological function outside the framework of classical FicTA toxin-antitoxin modules that disrupt DNA topology [15]. (a) Cartoon representation of the EcFicT G55R A complex. EcFicT G55R is colored in beige with the active site loop highlighted in yellow and EcFicA is colored in light blue with the α inh helix highlighted in dark blue as in Fig 2A. (b) Cartoon representation of the Doc/PhD complex in the same orientation as the EcFicT G55R A complex shown in panel a. Doc is colored in light brown with the active site loop highlighted in yellow and PhD is colored in sea green. Doc is a homolog of EcFicT and both proteins belong to the Fido protein family [45]. Note that the kink in the helix of the antitoxin occurs at the same position in both rather distant complexes. shown in an extended view (left) for comparison with a full-length alignment of the respective protein sequences (right). Though some inner branchings of the phylogeny are not well supported, the tree topology clearly shows that the mutation of asparagine to serine at the center of the active site loop occurred several times independently (red arrow). The four subgroups (I-IV) identified by the analysis of the multiple sequence alignment using the SPEER server are indicated on the left-hand side. Furthermore, the remarkable conservation of a tyrosine at the N-terminus of FicT proteins is clearly apparent (black arrow). The alignment also shows the distinctive N-terminal region of a FicT subgroup in Enterobacter (no background coloring) that does not contain the aforementioned tyrosine.