Structural and Functional Characterization of a Novel Family of Cyclophilins, the AquaCyps

Cyclophilins are ubiquitous cis-trans-prolyl isomerases (PPIases) found in all kingdoms of life. Here, we identify a novel family of cyclophilins, termed AquaCyps, which specifically occurs in marine Alphaproteobacteria, but not in related terrestric species. In addition to a canonical PPIase domain, AquaCyps contain large extensions and insertions. The crystal structures of two representatives from Hirschia baltica, AquaCyp293 and AquaCyp300, reveal the formation of a compact domain, the NIC domain, by the N- and C-terminal extensions together with a central insertion. The NIC domain adopts a novel mixed alpha-helical, beta-sheet fold that is linked to the cyclophilin domain via a conserved disulfide bond. In its overall fold, AquaCyp293 resembles AquaCyp300, but the two proteins utilize distinct sets of active site residues, consistent with differences in their PPIase catalytic properties. While AquaCyp293 is a highly active general PPIase, AquaCyp300 is specific for hydrophobic substrate peptides and exhibits lower overall activity.


Introduction
Prolyl cis-trans isomerizations are intrinsically slow reactions, which often determine proteinfolding reactions [1]. Prolyl isomerases (PPIases) catalyze this interconversion in both peptides and proteins [2]. PPIases are ubiquitous enzymes and three families are known: the cyclophilins [3], the FK506 binding proteins (FKBPs) [4], and the parvulins [5]. Cyclophilin-type PPIases are involved in a multitude of physiological functions. They act, among others, as chaperones or foldases, for instance in the maturation of steroid hormone receptor complexes [6], in the interplay between NinaA and rhodopsin [7,8], or are required for the maturation of the HIV capsid protein Gag [9]. Cyclophilins are also critical for cell responses under stress conditions [10,11] and are involved in the adaptation to environmental stress, in cell cycle control, signal transduction, and transcriptional regulation [11,12]. In addition, they have been reported to contribute to the virulence of fungal and parasitic pathogens [13][14][15][16][17], and to stress tolerance and pathogenicity of bacteria, such as Legionella pneumophila [18], Enterococcus faecalis [19], and Streptococcus pneumoniae [20].
The prototypic member of the cyclophilin family is human hCyp18 [21][22][23], a singledomain PPIase of 165 amino acids, with an eight-stranded antiparallel β-barrel core [24,25]. Most prokaryotic cyclophilins are also monomeric single-domain proteins with variations only in loop regions [26] and are structurally and functionally well characterized [27]. However, some cyclophilins are extended by additional structural elements, such as helical repeat domains, and for most of those extended cyclophilins, structure and function are not known [28,29].
In the bacterium Hirschia baltica, a marine Hyphomonadaceae within the class of Alphaproteobacteria, we identified two genes that code for novel unusually large cyclophilins. Hirschia baltica is suggested to have an important role in marine biofilm formation [30] and has a dimorphic life-cycle; newborn swarmer cells are motile and differentiate into stalked sessile cells, which reproduce by budding motile daughter cells [31]. Hirschia baltica contains two cytosolic and five periplasmic canonical single-domain PPIases, but also two large, homologous cyclophilins with 293 and 300 residues, which we term AquaCyp293 and AquaCyp300, respectively. The core PPIase domain of the two AquaCyps shows N-and C-terminal extensions, as well as an insertion, which together double the size relative to single-domain cyclophilins. The structure and function of AquaCyps and their insertion elements are unknown. Single gene knockouts of their respective homologues in Caulobacter crescentus had revealed that neither AquaCyp homologue alone is essential in this organism [32].
Here, we determined the crystal structures of both AquaCyp293 and AquaCyp300 and characterized their functional properties. Our data reveal a unique two-domain architecture of AquaCyp proteins, in which the catalytically active cyclophilin domain maintains extensive contacts with a composite domain formed by terminal extensions and internal insertion elements. Both AquaCyps are functional in prolyl isomerase assays but differ with respect to oligomerization, catalytic efficiency and substrate specificity.

Expression and Purification of AquaCyp293 and AquaCyp300
For the expression of the AquaCyp293 (C6XJ17, res. 25-293) and AquaCyp300 (C6XII3, res. 21-300) the gene fragments were PCR-amplified from Hirschia baltica strain ATCC 49814 and cloned into the expression plasmid pNIC28-Bsa4 [33], where they are N-terminally linked to an hexa-histidine tag followed by a TEV-cleavage site. The proteins were overproduced in E. coli BL21(DE3) ΔslyD (gift from B. Eckert). After lysis of the cells in 50 mM Hepes/NaOH, 500 mM NaCl, pH 7.4, 40 mM imidazol with a sonicator and centrifugation, all proteins were found in soluble form. The proteins were purified by immobilized metal-affinity chromatography on a Ni-NTA column (elution with 250 mM imidazole), overnight digested by tobacco etch virus protease [34], followed by a second Ni-NTA chromatography step. The high salt concentrations used during Ni-NTA chromatography significantly reduced the amounts of protein impurities. The flow-through was concentrated in Amicon Ultra units (Millipore) and then subjected to size-exclusion chromatography in 20 mM Hepes/NaOH pH 7.4 on a Superdex S75 column (GE Healthcare). The protein-containing fractions were pooled and concentrated. Yields were about 15 mg L -1 culture. RCM-T1 was expressed and purified as described [35].

Thermal Shift Assay
Solutions of 2 μl protein (100 μM in 20 mM Hepes pH 7.5, 250 mM NaCl, 5% Glycerol) with 8 μl 5× SYPRO Orange (Invitrogen) were added to 10 μl of different buffer solutions to the wells of a 96-well PCR-plate (Biorad). The plates were sealed with highly transparent polypropylene film and heated in an Biorad CFX96 detection system from 4°C to 70°C in increments of 0.5°C. Fluorescence changes in the wells of the plate were measured three times per minute at three different gain settings. The wavelengths for excitation and emission were 492 and 568 nm respectively. The midpoint of unfolding T m , was derived from the first derivative. The unfolding curves were identical between 0.5 and 5 μM protein. All data were processed using Biorad software.

Prolyl Isomerase Activity Assay
The prolyl isomerase activities were measured by a protease-free fluorescence assay [36] [37]. For the assay, the peptide substrate Abz-Ala-Xaa-Pro-Phe was dissolved in trifluoroethanol containing 0.5 M LiCl. Under these conditions, about 50% of the peptide molecules are in the cis conformation. The kinetics of the decrease in cis-Pro content was measured by the change in fluorescence at 416 nm (5 nm bandwidth) after excitation at 316 nm (3 nm bandwidth) in 100 mM K-phosphate 7.0 at 15°C. Under these conditions, the cis-to-trans isomerization of the prolyl bond was a mono-exponential process, and its rate constant was determined by using GraFit 3.0 (Erithacus Software, Staines, UK). The folding experiments of RCM-T1 were performed as described [38] (Table 1).

Size Exclusion Chromatography Coupled with Multi-Angle Light Scattering
For size exclusion chromatography coupled with multi-angle light scattering (SEC-MALS) measurements, 20-or 100-μl samples of 1 mg/ml of AquaCyp293 or AquaCyp300 were applied to a GE Healthcare Superdex 200 5/150 GL SEC column equilibrated overnight in 20 mM Hepes, pH 7.4, at 5°C, using an Agilent 1100 series HPLC system. Light scattering and differential refractive index measurements were made using a Wyatt miniDawn TriStar detector and a Wyatt Optilab rRex detector, respectively. The inter-detector delay volumes, band broadening, and the light scattering detector normalization, were calibrated according to the manufacturer's protocol using a 2 mg/ml of BSA solution (Thermo Pierce) in the same buffer. The absolute refractive index of the buffer was measured using the refractive index detector. The data were collected and processed using Wyatt Astra 5 software. The molar mass was calculated from a global fit of the light scattering signals from three detectors at different angles, and the differential refractive index signal, using algorithms in the Astra 5 software.

Protein Crystallization and Structure Determination
AquaCyp293 and AquaCyp300 were crystallized by vapor diffusion using the sitting-drop method at 20°C. The reservoir solution (100 μl) contained 10% PEG20000, 20% PEGMME550 in 0.03 M CaCl 2 , 0.03M MgCl 2 , 0.1 M Mops/Hepes-Na pH 7.5. Diffraction data were collected at the Swiss Light Source PXI beamline. The data sets were processed and scaled using XDS [39,40]. The AquaCyp293 crystals belong to space group P 1 2 1 1 with cell dimensions a = 47.9 Å, b = 72.7 Å, and c = 73.9 Å, β = 93°and contain two molecules per asymmetric unit. Aqua-Cyp300 also crystallized in space group P 1 2 1 1 with cell dimensions a = 50.6 Å, b = 103.2 Å, and c = 173.5 Å, β = 91.5°containing six molecules in the asymmetric unit. Structure determination of AquaCyp293 was performed by molecular replacement with the human cyclophilin A structure as the search model (Protein Data Bank accession code 2CPL) using Phaser [41]. Model building and structure refinement were performed with Coot [42] and PHENIX [43], respectively ( Table 2). The atomic coordinates for AquaCyp293 and AquaCyp300 have been deposited in the RCSB Protein Data Bank and are available under the accession code 5EX2 and 5EX1, respectively.

AquaCyps, a Novel Class of Cyclophilins Characteristic of Marine Alphaproteobacteria
To identify cyclophilins that possibly assist in the maturation of periplasmic and outer membrane proteins [44], we searched the Expasy database [45] for homologs of EcCypB (PPIB), the periplasmic cyclophilin of E. coli. We found that EcCypB is not generally conserved in Alphaproteobacteria, but several of them, such as Rhodobacterales, Rhizobiales and Caulobacterales, contain larger EcCypB homologues of 280 to 330 residues. We found about 50-60 orthologues of these cyclophilins, and because they are almost exclusively present in organisms living in marine environments, we named this cyclophilin class AquaCyps. Interestingly, Hirschia baltica contains two such cyclophilins, AquaCyp293 and AquaCyp300, which exhibit 39% sequence identity. All members of this family contain large N-terminal (20-30 residues) and C-terminal extensions (50-60 residues) relative to hCyp18 as well as a 40-50 residue insertion in the long loop that connects the β strands 4 and 5 of the cyclophilin domain (S1 Fig). Members of this family also show two conserved cysteine residues (Cys150 and Cys252 in AquaCyp293) that might form a disulfide bond between the Cyp domain and the C-terminal extension (Fig 1).
The cyclophilin domain of AquaCyp contains several invariant residues that are conserved in all cyclophilins. These include Leu179 and Asp180 (Leu113 and Asp114 in hCyp18), which are part of the characteristic 3 10 helix of the cyclophilin fold and the two glycine residues 186 and 191 (Gly126 and Gly131 in hCyp18), which are located in sterically demanding positions (Fig 1). Active site residues in AquaCyp are only partially conserved to other cyclophilins. The residues Arg65 and Gln73 of AquaCyp293 correspond to Arg55 and Gln63 in hCyp18 and are essential for catalysis [46][47][48]. At other positions in the active site, the sequence conservation is lower.

AquaCyps Exhibit a Low Thermal Stability
We produced recombinant versions of full length AquaCyp293 and AquaCyp300 and applied differential scanning fluorometry screening to optimize buffer conditions for protein stability. Based on these fluorescence-based thermal stability assays [49], both proteins were found to be folded at room temperature and between pH 6 and pH 8. AquaCyp293 and AquaCyp300 unfolded in several steps resulting in broad melting curves with midpoints at about 38-40°C (Fig 2). These melting points are about 5-10°C lower than those of other prokaryotic and eukaryotic cyclophilins [50,51]. The lower stability might be correlated to the temperature growth optimum of Hirschia baltica in seawater at 22-28°C [52]. Optimal stability was observed in 20 mM Hepes pH 7.4, containing 250 mM NaCl and 10% glycerol. This buffer was subsequently used for protein storage. Its pH is significantly higher than the pI values of 4.  AquaCyps Do Not Display Molecular Chaperone Activity Cyclophilin proteins from several organisms show molecular chaperone activity [53][54][55]. To examine whether AquaCyp293 and AquaCyp300 interact with unfolded proteins and prevent aggregation, we employed the widely used citrate synthase (CS) aggregation and inactivation assay [56]. It exploits the strong aggregation tendency of folding intermediates of citrate synthase. Unfolded citrate synthase aggregates spontaneously after dilution with refolding buffer, which is accompanied by a strong increase in light scattering. AquaCyp293 and Aqua-Cyp300 were both unable to suppress the temperature-induced aggregation of CS even when added in large excess (S3 Fig). In a control experiment, the chaperone SlyD from E. coli abolished aggregation under the same conditions [57].

Only AquaCyp293 Is an Efficient General PPIase
Most prolyl isomerases of the cyclophilin family catalyze prolyl isomerization in peptides and proteins with very high efficiency but show low sequence specificity for the amino acid preceding proline [58,59]. To examine the catalytic activity of AquaCyp293 and AquaCyp300, we used a fluorimetric protease-free assay and proline-containing tetrapeptides that carry an aminobenzoyl (Abz) group at the aminoterminus and a para-nitroanilide (pNA) group at the carboxyterminus. The short peptides have the general formula Abz-Ala-Xaa-Pro-Phe-pNA. In these peptides, the Xaa position was occupied by a charged (Glu, Lys), an aliphatic (Ala, Leu), or an aromatic (Phe) residue. AquaCyp293 is a highly active PPIase (Fig 3A-3C). 12 nM AquaCyp293 accelerated the isomerization of Abz-Ala-Ala-Pro-Phe-pNA 5-fold ( Fig 3A). From measurements of the isomerization rate as a function of the AquaCyp293 concentration (as in Fig 3C), the catalytic efficiencies (k cat /K M ) for five Xaa-Pro sequences were derived (Table 1). Similar to the highly active human Cyp18, AquaCyp293 shows very high k cat /K M values, which range between 10 6 and 10 7 M -1 s -1 , and the maximal differences between peptides are only 6-fold (between Leu-Pro and Glu-Pro). AquaCyp293 thus resembles other cyclophilins in both high PPIase activity and low substrate specificity [37,58,60].
AquaCyp300 is active as a PPIase as well (Table 1), but it differs significantly from Aqua-Cyp293 and human Cyp18. For four out of the five peptides tested, the activities are 12-120-fold lowered relative to AquaCyp293. A moderate PPIase activity was observed only for the Leu-Pro containing peptide, pointing to pronounced substrate specificity for hydrophobic residues preceding proline. To our knowledge, such high substrate specificity has hitherto been observed only for prolyl isomerases of the FKBP family [36], not for cyclophilins [37]. The differences in activity and substrate specificity suggest that AquaCyp293 and AquaCyp300 might have distinct functions in the periplasm of Hirschia baltica.
Next, we investigated the efficiencies of the two AquaCyp enzymes in the catalysis of a proline-limited protein folding reaction. Reduced and carboxymethylated RNase T1 (RCM-T1) was used as the model substrate (Fig 3B). Its refolding reaction is limited in rate by the trans-!cis isomerization of a Tyr-Pro bond. Refolding can be induced by increasing the NaCl concentration [61,62] and is monitored by the strong increase in tryptophan fluorescence upon refolding. AquaCyp293 and AquaCyp300 contain four tryptophan residues and thus contribute strongly to the measured fluorescence, which decreases the signal/noise ratio in the folding assays (Fig 3B). With a k cat /K M value of 1.0 × 10 4 M -1 s -1 , AquaCyp293 catalyzes prolyl isomerization in a folding protein about 100-fold less efficiently than in the tetrapeptide substrates. With these catalytic properties it resembles human Cyp18, as well as CypA from Escherichia coli and Bacillus subtilis [58,60]. AquaCyp300 also catalyzes RCM-T1 refolding, but is about fivefold less efficient than AquaCyp293 (Fig 3D).  Table 1.

AquaCyps Share Characteristic Extensions of the Cyclophilin Fold
AquaCyp293 and AquaCyp300 were crystallized under similar conditions. Plate like crystals appeared in mixtures of short and long polyethylene glycols at neutral pH in the presence of divalent cations. The X-ray crystal structures of AquaCyp293 and AquaCyp300 were solved by molecular replacement with the structure of hCyp18 (pdb: 2CPL) as a search model and refined at resolutions of 1.3 Å and 2.05 Å to R work /R free values of 17.0/19.5% and 17.6/21.3%, respectively. Data collection and refinement statistics are shown in Table 2. The crystallographic asymmetric units for AquaCyp293 and for AquaCyp300 contain two and six molecules, respectively. The individual AquaCyp293 and AquaCyp300 molecules are virtually identical at root mean square deviation of mainchain atom positions (rmsd) of 0.1 and 0.2 Å, respectively (S4A and S4B Fig). Minor differences in surface exposed loops probably originate from crystal packing. In both crystal structures Mg 2+ ions mediate crystal contacts.
The cyclophilin domains of both AquaCyp293 and AquaCyp300 consist of an eightstranded antiparallel β-barrel, and two α-helices covering the top and the bottom of the barrel, and an additional small 3 10 -helix (Fig 4A and 4B). They superimpose well with hCyp18 with rmsd values of 0.9 Å and 1.0 Å, respectively (S4C and S4D Fig). The closest homologue for both, AquaCyp293 and AquaCyp300, is EcCypB from E. coli [63], which also is a periplasmic PPIase. All secondary structure elements of the AquaCyp293 and AquaCyp300 cyclophilin domain superpose well with EcCypB, differences are restricted to loop regions (S4E and S4F Fig). 29 of the 30 cyclophilin structures in the protein data bank are single domain proteins (155-186 residues). Bovine Cyp40 is the only structurally resolved cyclophilin with an extra domain, which, in this case is a C-terminal tetratricopeptide repeat (TPR) domain [64]. All single-domain cyclophilin structures superpose well with an rmsd of better than 1.

Variations in Active Site Structure between AquaCyp293 and AquaCyp300
Residues forming the active site of cylcophilins are mainly located on β-strands 3, 4 and 6 [65]. Most of the residues known to be important for activity in hCyp18 and EcCypB are also conserved in AquaCyp293 (Fig 5), in agreement with their catalytic competence (Table 1). In the high-resolution structure of AquaCyp293, alternative side chain conformations are observed for Arg55 and Phe113 (Fig 5A), suggesting that flexibility is important for substrate binding and catalysis. Although many high-resolution cyclophilin structures are known, such alternate conformations for active site residues have rarely been reported for cyclophilins. However, the relevance of alternate conformations of active site residues in cyclophilins for catalysis was demonstrated by NMR and ambient-temperature X-ray crystallographic data collection for human hCyp18 [66].
In contrast to AquaCyp293, AquaCyp300 (Fig 5B) differs from hCyp18 ( Fig 5C) and EcCypB (Fig 5D) at six out of nine active site positions (Fig 5E). Only Arg55, Phe60 and Gln63, which are most important for catalysis [65], are conserved. The residues His54, Gln111, His126, which contribute to the PPIase activity of hCyp18 as well, are substituted by Tyr, Asp, Met, respectively (Fig 5E) [67], providing a plausible explanation for the low PPIase activity and high substrate specificity of AquaCyp300.

Core-Fold Extensions in AquaCyps Form a Contiguous Structural Entity, the NIC Domain
The three extra chain regions of AquaCyp293 and AquaCyp300 outside the PPIase domain are well ordered. Although the terminal extensions and the insert are separated in sequence space, they form a contiguous structural entity in both proteins, which we term the NIC domain (Fig 4).
In AquaCyp293, the N-terminal extension starts with a short α-helix (Lys5-Asp11), which is followed by a short β-strand (Asn14-Val18) (blue in Fig 4A) that pairs with a β-strand  Characterization of a Novel Cyclophilin Protein Family (Pro226-Met231) provided by the C-terminal extension (red in Fig 4A). The insert in the cyclophilin domain (residues 95-146, green in Fig 4A) folds into a four-stranded (parallel-antiparallel) β-sheet (Val95-Asp99; Ser116-Tyr122; Phe125-Arg130; Arg142-Met147) and an α-helix (Glu131-Thr137). The C-terminal extension forms an elongated structure that contacts the Nterminal extension, the inserted domain, and the cyclophilin (PPIase) domain. It starts with two short 3 10 helices and continues into β-strands, which provide a structural link with the βstrand of the N-terminal extension and a β-strand (Ser259-Phe262) of the cyclophilin domain.
In AquaCyp300, the first eight residues are not resolved in the crystal structure, presumably because they are mobile. Ile9 is ordered and packs against Asn160 in the loop between βstrands 5 and 6 of the cyclophilin domain (Fig 4B). The subsequent residues of the N-terminal extension (blue in Fig 4B) wrap around the cyclophilin domain forming a loop that packs against the C-terminal residues, then going back and in a short β-strand (Gly26-Ile32) leading into the cyclophilin domain. As for AquaCyp293, the large insertion in the cyclophilin domain is located in the long loop between β-strand 4 and 5 and consists of loops and three short βstrands (Lys126-Leu130; Phe133-Asp138; Glu140-Leu145) (green in Fig 4B). It is placed between the N-terminal and C-terminal extensions and overlaps well with the insertion domain of AquaCyp293. The C-terminal extension is the largest additional part of Aqua-Cyp300 (red in Fig 4B). After the cyclophilin domain, it starts with two short 3 10 helices and then it forms a β-strand (Asn233-Met237) that pairs on one site with the β-sheet from the Nterminal extension and on the other with the β-strand (Pro275-Val279). This β-strand is followed by a α-helix (Ala242-Arg252), which leads into a long hairpin. The NIC domains of AquaCyp293 and AquaCyp300 superimpose well with an rmsd of 1.5 Å and apparently represent novel domain folds. A structural similarity search with DALI did not reveal structural homologs with a Z-Score above 2 [68].

A Disulfide Bond Tethers the NIC to the Cyclophilin Domain in AquaCyps
Both AquaCyp293 and AquaCyp300 possess a buried disulfide bond at structurally equivalent positions (Fig 4A and 4B). It is formed by a cysteine residue in the cyclophilin domain (Cys150 in AquaCyp293; Cys147 in AquaCyp300) and a cysteine in the C-terminal extension (Cys252 in AquaCyp293; Cys272 in AquaCyp300) and firmly links the C-terminal extension to the cyclophilin domain. The disulfide bond is remote from the active site, and we suggest that it has a structural rather than a functional role. In two other reported cases, disulfide bonds in cyclophilins are linked to enzymatic activity: In CypA from Schistoma masoni, SmCypA, the disulfide bond is solvent exposed and located close to the active site. A regulation mechanism via oxidation has been suggested, as the oxidized form is inactive, whereas the reduced form of SmCypA shows high PPIase activity [69]. CYP20-3 from Arabidopsis thaliana also contains a disulfide bond close to the active site, and its enzymatic activity is decreased in the oxidized form [70].

The NIC Domain Mediates Dimerization of AquaCyp300
The SEC-MALS analysis demonstrated that AquaCyp300 in solution is in monomer/dimer equilibrium. In the AquaCyp300 crystal structure the six monomers are indeed arranged into three dimers. 50 residues form the AquaCyp300 dimer with an interface area of 1713.4 Å 2 (12.9% of the molecule surface) [71], placing the active sites of AquaCyp300 on opposing faces of the dimer (Fig 6A). Residues from both the cyclophilin domain and the NIC domain contribute to dimerization. They show low conformational flexibility (S6 Fig) and high sequence conservation (Fig 6B), suggesting that other AquaCyp300 homologues are also dimeric. In particular, an extended hairpin structure enriched in hydrophobic residues (S7 Fig) (including three phenylalanines Phe258-Phe260) at the C-terminal extension interlocks the two monomers ( Fig 6C). The tip of this dimerization hairpin is in close proximity to the respective active site of the related dimeric AquaCyp molecule and might be involved in substrate interaction. To our knowledge AquaCyp300 is the first reported dimeric cyclophilin. For FKBP or parvulin containing PPIases several dimeric PPIases (e.g. FkpA, Fkbp22, Mip, PrsA and Peb4) are known, where extra alpha-helical domains mediate dimerization and in addition function as chaperones. In these dimers, the PPIase domains are not on opposite sides but face each other, and this combination of chaperone and prolyl isomerase activity is thought to be important for an optimal function as protein folding enzyme [44,[72][73][74][75][76].

Conclusion
Protein maturation in the periplasm is a complex process that involves a network of foldases and chaperones. In a bioinformatics analysis, we identified a distinct class of cyclophilin PPIases with complex architecture in organisms with a dimorphic, marine lifestyle, such as Hyphomonas, Caulobacter and Hirschia. To our knowledge, it represents the first class of periplasmic cyclophilins with additional non-catalytic domains. The crystal structures of AquaCyp293 and AquaCyp300 demonstrate that AquaCyps fold into two domains, a classical cyclophilin-type PPIase domain and the NIC domain with a novel mixed alpha-helical/betastrand structure. Both AquaCyp proteins are stabilized by a conserved inter-domain disulfide bond. Despite pronounced structural similarities, they also show distinct features: AquaCyp293 is monomeric and resembles canonical cyclophilins in its enzymatic properties, whereas Aqua-Cyp300 dimerizes via the NIC domain at higher protein concentration and shows low enzymatic activity and high substrate specificity, suggesting distinct functions for the two PPIases. Indeed, the vast majority of marine Alphaproteobacteria contain both AquaCyp homologous. Marine Organisms with dimorphic life style encode a large number of characteristic proteins that are components of the outer membrane or involved in cell envelope biogenesis [30,52,77]. Presumably, AquaCyps are involved in the maturation of one or more of these proteins. Their distinct catalytic properties suggest distinct sets of target proteins, but don't exclude partially overlapping functions. Further analysis will be required to identify the substrates of the two AquaCyp PPIases and to reveal their interplay with other folding helpers in the periplasm of marine Alphaproteobacteria.
Supporting Information S1 Fig. Domain structure and sequence conservation of AquaCyp. Multiple sequence alignment to analyze the sequence and secondary structure conservation of PrsA. Highly conserved residues are red (>70% conservation) or white in red boxes (100% conservation). On top, the cyclophilin domain are shown in grey, the additonal N-terminal-, insertion, and C-terminal structural elements are coloroured in blue, green and red, respectively. The secondary structure of Homo sapiens hCyp18 is shown on top of the protein sequence. Sequences of representative AquaCyp proteins were retrieved from the UniProt database [78] and aligned using MULTA-LIN [79]. The final figure was generated using the ESPript server [80].  Fig 4A and 4B, respectively. (C,D) Surface representation of AquaCyp293 (C) and AquaCyp300 (D) color coded ranging from hydrophobic (green) to hydrophilic (grey) according to the normalized consensus hydrophobicity scale of the exposed residues [82], in the same orientation as in S6A and S6B Fig, repsectively. AquaCyp300 residues involved in dimerization (e.g. the C-terminal dimerization hairpin) are enriched in hydrophobic residues (E,F) Surface representation of of AquaCyp293 (E) and AquaCyp300 (F) color-coded ranging from negative charged (red) to positive charged (blue). The PPIase active site is indicated as ellipse. (TIF)