Chlamydia trachomatis is the most prevalent cause of bacterial sexually transmitted diseases and the leading cause of preventable blindness worldwide. Global control of Chlamydia will best be achieved with a vaccine, a primary target for which is the major outer membrane protein, MOMP, which comprises ∼60% of the outer membrane protein mass of this bacterium. In the absence of experimental structural information on MOMP, three previously published topology models presumed a16-stranded barrel architecture. Here, we use the latest β-barrel prediction algorithms, previous 2D topology modeling results, and comparative modeling methodology to build a 3D model based on the 16-stranded, trimeric assumption. We find that while a 3D MOMP model captures many structural hallmarks of a trimeric 16-stranded β-barrel porin, and is consistent with most of the experimental evidence for MOMP, MOMP residues 320–334 cannot be modeled as β-strands that span the entire membrane, as is consistently observed in published 16-stranded β-barrel crystal structures. Given the ambiguous results for β-strand delineation found in this study, recent publications of membrane β-barrel structures breaking with the canonical rule for an even number of β-strands, findings of β-barrels with strand-exchanged oligomeric conformations, and alternate folds dependent upon the lifecycle of the bacterium, we suggest that although the MOMP porin structure incorporates canonical 16-stranded conformations, it may have novel oligomeric or dynamic structural changes accounting for the discrepancies observed.
Citation: Feher VA, Randall A, Baldi P, Bush RM, de la Maza LM, Amaro RE (2013) A 3-Dimensional Trimeric β-Barrel Model for Chlamydia MOMP Contains Conserved and Novel Elements of Gram-Negative Bacterial Porins. PLoS ONE 8(7): e68934. https://doi.org/10.1371/journal.pone.0068934
Editor: Andrea Motta, National Research Council of Italy, Italy
Received: February 20, 2013; Accepted: June 4, 2013; Published: July 25, 2013
Copyright: © 2013 Feher et al. This is an open-access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited.
Funding: This work was funded in part by the National Institutes of Health through the NIH Director's New Innovator Award Program DP2-OD007237 and through the NSF TeraGrid Supercomputer resources grant LRAC CHE060073N to REA; National Institute of General Medical Sciences MIDAS grant U01-GM076499 to RMB;NIH/NLM Pathway to Independence Award (K99LM010821) to AR; NIH (5T15LM007743 and LM010235-01A1) and NSF (MRI EIA-0321390 and 0513376) to PB, and Public Health Service grant AI-67888 from the National Institute of Allergy and Infectious Diseases to LdlM. The funders had no role in study design, data collection and analysis, decision to publish, or preparation of the manuscript.
Competing interests: The authors have declared that no competing interests exist.
C. trachomatis, a gram-negative bacterium, is estimated to infect 90 million people worldwide each year . In some individuals these infections cause long-term consequences including pelvic inflammatory disease, ectopic pregnancy, sterility or blindness –. Although Chlamydia can be treated with anti-bacterials, many infections are asymptomatic and therefore go untreated , . A prophylactic vaccine has long been sought against C. trachomatis but has yet to be obtained –.
The Chlamydia major outer membrane protein, MOMP, is a primary target of vaccine development because it is highly antigenic and comprises ∼60% of the outer membrane protein mass , . MOMP, coded by the ompA gene, is considered a member of the general porin class of proteins (http://scop.mrc-lmb.cam.ac.uk/scop) , a group important for the passive transport of ions, sugars and nucleotides across the outer membrane of gram-negative bacteria –. Porins have a structural topology comprised of antiparallel β-strands spanning the outer membrane, a water-filled inner channel, tight β-turns extending into the periplasmic region and flexible loops reaching beyond the extracellular surface. Several studies have confirmed that the C. trachomatis MOMP shares the characteristics of other gram-negative bacterial porins; it has a monomeric molecular weight of 39.5 kDa, a hydrophobic residue content of ∼40%, and migration patterns on SDS-page gels consistent with trimeric oligomerization . Native MOMP also transports sugars at rates similar to Pseudomonas aeruginosa OprF –.
Structural characterization of MOMP remains elusive for a number of reasons. Attempts to generate crystals for X-ray crystallographic studies with native MOMP proteins have been unsuccessful. Recombinant Chlamydia MOMP protein has been expressed in other systems but has been intractable due to inclusion body formation and very low refolding yields, most likely due to its highly hydrophobic nature and presence of a high number of cysteine residues. Notably, Chlamydia does not have an available genetic system like other bacteria, such as E. coli, and only recently a transformation system limited to the plasmid was described . Yet, there is an urgent need to elucidate MOMP structural details and understand the conformation of its protective epitopes in order to formulate a vaccine.
Computational homology modeling is a useful alternative method for generating protein structural models in the absence of experimental structural data. However, direct homology modeling methods cannot be performed for the MOMP because it has little detectable sequence identity to β-barrels with solved structures. In general, protein structure is much more conserved than sequence as species diverge, often to the point where no detectable sequence similarity exists between pairs of sequences sharing the same structural fold. The β-barrel porins are an excellent example of this phenomenon with many sequence pairs sharing very low sequence identity (sometimes only ∼11%) based on structural alignment. These diverged sequences all share strongly conserved structural features that can be used to inform computational models. These features include: (1) even number of anti-parallel trans-membrane strands; (2) a relatively high abundance of small hydrophobic residues at the center of the trans-membrane strands facing the membrane; and (3) girdles of aromatic residues at the membrane interfaces , , .
Algorithms have been developed over the past decade to predict β-barrel secondary structure features using “rules” based on these conserved features with a varying degree of detail –. Some algorithms predict whether the query sequence more likely belongs to a membrane β-barrel versus a globular protein –, while others predict the number (i.e. 8, 10 12, 16, 22-stranded) and position of membrane spanning β-strands , , .
To date, several groups have assigned secondary structure topology for MOMP of Chlamydia muridarum (previously called Chlamydia trachomatis mouse pneumonitis, MoPn) and C. trachomatis serovars D and F based on these algorithms, biochemical and immunological data –. These studies provide fairly similar assignments of the transmembrane strands; all suggest an overall 16-stranded topology, the models vary primarily in the length and placement of periplasmic turns and external loops with a few variations in positions of β-strands. However, these topologies cannot be directly translated into a 3D molecular model based on published 16-stranded trimeric β-barrels coordinates because the topology maps do not incorporate one or more of the structural elements of these 3D coordinates. For example, all known 16-stranded β-barrel porin crystal structures have an eyelet loop that folds into the pore center and is formed by the third loop ; the number of residues demarked as loop 3 in two of the three published topology maps are too short to form this conserved structural feature.
In this work, a recently developed suite of β-barrel topology prediction algorithms, TMBpro , combined with comparative homology building methods, and model refinement were used to develop a three-dimensional 16-stranded β-barrel homo-trimer model for the C. trachomatis MOMP protein. These methods and the resulting structures illustrate regions where the MOMP protein sequence is consistent with structural and functional features of 16-stranded templates, as well as regions that are structurally inconsistent with known templates. This enigma motivates further experimental structural studies of this important vaccine target.
Prediction of β-barrel topology
A consensus sequence for human C. trachomatis serovar C MOMP, excluding the N-terminal leader sequence, was submitted to the TMBpro server  for initial predictions of overall β-barrel topology (Figure 1). The results assign fairly equal probability to the MOMP monomer subunit being comprised of 10-, 12-, 14-, 16- or 18- β–strands. Additionally, the TMBpro server predicts the probabilities of individual residues in transmembrane β-strands (SI Table 1). Figure 2 compares the predicted β-strand limits to those predicted using Pred-TMBB  and TMBeta-NET ,  webservers and the previously published 2D topology maps for the MOMP porin. Several trends can be observed from this comparison: (1) the variable domain regions are predicted to be outside the trans-membrane β-strands; (2) all the algorithms predict similar residues to be in the first 2 and final β-strands; and (3) there is not a consensus for the limits of the remaining β-strands or, in some regions, whether a β-strand is present or not. Specifically, it is clear that all the algorithms suggest the presence of a long cysteine-containing loop spanning residues G20 to D34, but the Y89 to T100, F113– F139, V203 - I217, A280 – P288, and I341 – V355 sequences have little prediction consensus.
Normalized predicted probabilities for various strand models of (A) full length C. trachomatis serovar C sequence; (B) example output for C. trachomatis serovar C excluding a subset of residues that contains the leader sequence and all or parts of the four VDs: M1 – A23, A64 – P85, K152 – A158, T216 – I247, T282 – V310; (C) Neissera meningitidis PorB (3A2T.pdb protein sequence) ; and (D) Klebsiella pneumonia OmpK36 (1OSM.pdb protein sequence) .
The output from β-strand prediction algorithms TMBpro, TMPRED and TMBBETA-NET (labeled pred1, pred2, and pred3, respectively) are delineated as blue boxes on the MOMP sequences from C. trachomatis serovar C. Previously reported β-strand predictions from 2D topology mapping for MOMP are shown for comparison on the sequences used in each publication, namely MOMP C. trachomatis serovar F  and serovar D  and C. muridarum . Red and black boxes delineate the N-terminal leader sequence and 4 variable domains, respectively. Sequence alignments and illustration made with UCSF-Chimera visualization tools (http://www.cgl.ucsf.edu/chimera) .
The TMBpro algorithm and other β-strand assignment algorithms predict several long strand sequences of 18 residues or more that are consistent with the long β-strands protruding from the extracellular membrane, as observed in 12- and 14-stranded barrels. However, comparisons of results across multiple algorithms illustrate that some of these long strands can also be predicted as two distinct strands with an intervening turn. Such a result is more consistent with shorter strand – turn – strand assignments observed in 16 -, 18- or 22-stranded barrels. Thus, depending upon where a given algorithm predicts the strand termini, there can be a significant difference in the overall topology predicted.
The topology predictions for two sequences from known crystal structures not included in the original TMBpro training sets are shown to compare the relative prediction accuracy, especially for β-barrels with twilight zone sequence identity. The Neisseria meningitidis porin, PorB , has less than 11% sequence identity to any of the crystal structures used in the original TMBpro training set. Similar to the topology probabilities observed for MOMP, several β-strand topologies had approximately equal probabilities (Figure 1C), however, the predictions are more distinguishable than those for MOMP. Another sequence, Klebsiella pneumoniae Omp36K , a porin that has 60% identity to one of the 16-stranded barrels used in the TMBpro training set has a much more distinct prediction for a 16-stranded topology (Figure 1D).
As transmembrane β-barrel prediction methods are also known to be sensitive to insertions/deletions of long extracellular sequences , MOMP sequences were submitted to these servers in several different forms, each varying by the exclusion of one or more VD. This procedure and the comparison of results from three algorithms, each generated using different prediction protocols, allowed us to evaluate the robustness of the predictions. Omission of various subsets of residues containing all or parts of the four VD shift the predicted probability for a particular β-barrel topology, but all such experiments produced nearly identical prediction of the residues involved in transmembrane β-strands (see Figure 1B for one such example).
MOMP monomer models
Monomer modeling proceeded by assuming a 16-stranded β-barrel topology. Figure 3 panels A and B illustrate the 10 lowest energy 16-stranded model for MOMP. These models were generated through the iterative superposition of MOMP residues predicted as β-strand residues to the composite porin crystal structure template and refinement as described in Methods. These models represent ∼70% of the MOMP amino acid sequence.
(A, B) Backbone superpositions of the 10 lowest energy monomer structures. The ribbons illustrate the relative RMS deviations for the backbone heavy atoms to the average structure by color from low  to high (red). (C) The average backbone representation for the MOMP monomer  superposed onto backbone coordinates for crystal structures OmpF, OmpC , Omp36K , PhoE , PorB  and OprP  (gray ribbon). Superpositions were made using MOE, optimizing for structural and sequence alignments. (D, E) The MOMP monomer 16-stranded β-barrel with backbone in ribbon representation, aromatic girdle residues  and position of basic residue girdle (blue spheres), trimer interface and membrane interfaces, respectively. (F) The MOMP model, looking down the pore channel from the membrane exterior side (cyan ribbon) with putative anion pathway residues shown (cyan stick representation for R39, H92, R112) superposed on E. coli OmpF structure illustrating anion pathway residues (gray, 1HXX ).
The final monomer models have >95% of the residues in the allowed Ramachandran values and where most of the outliers are found in loop junctions with the β-strands (outliers in 5 or more of the 10 lowest energy structures are Y104, I109, R112, S145, L182, A320, D321 and S335; see predicted topology map Figure 4). The limits of the 16 β-strands are P9 - G23, S37 - F46, L52 -F60, G90 - T100, Y104 - W110, E161 - E165, F172 - R179, T189 - A197, P201- A212, Y249 - Y260, Y268 - W273, T322 - S327, L330 - K334, C339 - V348, Y353 - I362, and A366 - R374 for the average structure. Given the uncertainty in the β-strand termini based on TMBpro predictions and the variation in template termini coordinates, these termini may differ up to +/− 3 residues; in fact, Kabsch & Sanders secondary structure calculations  assign alternate strand termini across the 10 lowest energy monomer structures for all β-strands except those of β -strands 6 and 7 (data not shown). Overall, the strand limits assigned superpose well with strand lengths defined by the composite template, as judged by the low RMSD in these regions (green, Figure 3A, 3B). Regions of higher RMSD (yellow – red) are limited to outer loops, the periplasmic turn between β- strands 2 and 3 and β-strands 12 and 13. Surprisingly external loops 5–7 converge to a single conformation for these structures.
Residues T322 to R336 have a high probability to be in the TM region of the pore by TMBpro (SI Table 1), with a clear decrease in probability for residues S338 – G340 and a strong probability for another TM β-strand starting at I341. According to the topology requirements of a 16-stranded β-barrel and the experimentally derived requirement to have VD4 (residues L290 – A320) externally exposed, residues T322 to R336 must span the membrane twice. Further limiting the assignment of this region to TM strands are the highly polar nature of residues K332, K334, S335, R336, K337 and S338, expected to be in a loop or turn outside of the membrane. Figure 3C shows the superposition of the average MOMP monomer model structure to six other 16-stranded β-barrel crystal structures. It illustrates that substituting another template structure for the hybrid template used will not solve this dilemma as the number of residues required to span β-strands 12 and 13 would be too short for any of these templates.
Aromatic girdles, a hallmark of membrane β-barrels, are most prominent in the MOMP β-strands comprising the trimer interface (Figure 3D) and around the periplasmic layer interface (Figure 3E). A clustering of charged lysine and arginine residues at the β-strand termini of β-strands 11, 13–16 may compensate for the lack of aromatic residues at the external membrane interface. The MOMP girdles are similarly located to polar girdles observed in other porins ,  where these charged residues can provide anchors at the membrane interface because of their highly unfavorable partition energies into the membrane . Yet, unique to MOMP is a sequence region assigned to β-strands 12–14 that is devoid of aromatic residues.
When the MOMP monomer model is aligned to a number of other 16-stranded porin structures, the MOMP sequences corresponding to β-strands 1–2 and 16 demonstrate the highest sequence identity (SI Figure 1). Specifically residues G15, G41 and G44 are on strands 1 and 2, and aromatic residues F373 and F375 are on strand 16. F375 participates in “aromatic rescue”, a structural feature in which the aromatic ring covers and stabilizes a glycine residue  on β-strand 1. This alignment also suggests that the MOMP pore contains a cluster of basic residues in the channel wall at positions R39, H92 and R112 (Figure 3F) that typically comprise the porin anion transport pathway , , . The H92 residue in MOMP is typically an arginine in other porins. H92 may be doubly protonated and therefore, also carry a formal positive charge, alternatively, this position may be occupied by MOMP's K91 via a slightly different alignment or a β-bulge. Porin structures typically also have a conserved glutamate in the anion transport pathway in β-strand 3, and the MOMP model has N57 at this position (SI Figure 1). Although several attempts were made to align E59 to this position by readjustment of the alignment by two residues, these models placed a K58 (which is typically a hydrophobic residue in other porins) at the trimer interface.
MOMP Trimer Model
The MOMP model trimer interface is comprised of β-strands 1–4 and 16. The average monomer-monomer surface area is 1316 Å2 and has exclusively hydrophobic inter-strand interactions. Typical barrel interfaces are slightly larger, ∼1400 Å2 and contain hydrogen bonding and/or salt bridge interactions in non-strand regions of the interface, in addition to the exclusively hydrophobic interactions between β-strands (data not shown). These differences between the MOMP model and other trimeric porins and the PISA algorithm low probability for protein-protein interface formation may be due to the omission of the loop region sequences from MOMP at β-strand termini where these types of interactions are typically found.
Recently, an algorithm has been developed by Naveed and colleagues that identifies weakly stable transmembrane β-strand regions in β-barrel porin structures . These porin regions are correlated with protein-protein oligomerization interfaces and the per β-strand empirical energy values have been experimentally validated , . Submission of our β-strand residue regions and the C. trachomatis serovar C amino acid sequence to the webserver for this algorithm identifies β-strands 1–5, 12–13 and 16 as relatively unstable and therefore candidate regions for oligomerization  (data not shown). Overall, the empirical energy profile calculated from our β-strands 1–5 and 16 closely match those of oligomeric β-barrel membrane protein profiles (H. Naveed, personal communication) and are consistent with oligomeric trimer formation at this interface.
Figure 5A illustrates a view of the trimer model from the perspective of the external membrane surface. Although the VDs are not modeled, the sequence termini leading to VDs 2, 3, and 4 appear in close proximity to one another, and the loops containing these residues may form a surface within each monomer. VD1 is located on Loop 2, a loop that is observed to form interactions between monomers in other trimeric porins. Figure 5A,B also shows that the VD1 sequences (Loop 2) within each monomer are in close proximity to each other at the trimer interface. Although E. coli's OmpF has a salt bridge in this “latching Loop 2” , there is no evidence that this is conserved in other trimeric porins or MOMP. Given the length of the omitted residues from Loop 2 of this model, the mapping of B-cell epitopes to this loop and the inter-monomer proximity of the three Loop 2 regions at the trimer interface, it is likely the Loop 2 region of each monomer combine to form a structural epitope that protrudes from the membrane surface. Conserved features also observed in the MOMP trimer are the Y89 residues from each monomer forming a ring at the top of the central core. In 16- and 18-stranded β-barrels these tyrosine hydroxyl groups coordinate bound water.
(A) MOMP model surface, mapping of VD. Positions attaching the VDs to the barrel mapped (orange) onto the molecular surface of the MOMP trimer model. (B) MOMP loops 1 and 4 potential inter-monomer stabilizing contacts. Two of the three monomer β-barrels (green and magenta ribbon representation) illustrate the proximity of Loops 1 and 4. Their tryptophan and cysteine residues (space-filling atom representation) on neighboring trimer subunits are shown (C29 not modeled). Residues W252 and W273 are at the exterior membrane interface in the putative aromatic girdle.
The development of a 3D structural model with no detectable sequence similarity to a protein with experimentally determined structure is non-trivial and can only be accomplished with certainty in protein families with highly conserved structural features (e.g. porins). Highly conserved structural features observed among porin β-barrels have led to the development of algorithms to detect and predict the presence of β-strands and the overall topology of membrane porins. However, these algorithms are not without pitfalls. As shown with the MOMP example, there can be ambiguity in delimiting the number of β-strands. In addition, as more membrane bound β-barrel crystal structures are determined, exceptions to the list of 10 explicit rules governing porin structure, as outlined by Schulz , are found. For example, in recent structures for mitochondrial protein VDAC, the β-barrel is comprised of 19 β-strands (2JK4) , , where it was previously considered that β-barrels were comprised solely of an even number of strands. In another recent structure, the barrel is made up of 3 subunits, each making up 4 strands of the 12-stranded barrel , illustrating another unique barrel fold.
Despite these challenges and the ambiguity in the TMBpro overall topology prediction for the MOMP sequence in the C-terminal region, we proceeded to construct and assess a 16-stranded β-barrel model based on previously published experimental evidence. The most convincing experimental evidence is the detection of oligomeric structures for MOMP , which would exclude all currently solved 14-stranded barrels as templates and be more consistent with a 16- or 18-stranded barrel. Furthermore, the size dependence for measured transport rates of various sugars in MOMP containing micelles  suggest a MOMP pore size similar to that of 16-stranded OprF β-barrel. Electron microscopy for C. trachomatis elementary bodies results in a measured outer membrane width of 20 Å , consistent with other known porin widths.
The 3D MOMP model that was constructed contains several features common to other 16-stranded trimer structures. These features include: (1) basic residues at key transport positions in the pore channel (Figure 3F); (2) aromatic rings at the membrane boundaries (Figure 3D, 3E); and (3) highly conserved residues in β-strands comprising the trimer interface. Furthermore, relative secondary structure distribution for our final MOMP model is similar to that measured for native MOMP by circular dichroism: ∼38% β-strand and 3% αlpha-helix versus 38–44% β-strand and ∼5% alpha-helix, respectively .
The Chlamydia MOMP sequence has 8–10 cysteine residues, compared to other porin proteins, which typically have none. Eight of these cysteines, C26, C29, C33, C117, C185, C187, C210 and C339, are essentially completely conserved across serovars  and six are conserved across various MOMP orthologs; thus there appears to be evolutionary pressure to maintain cysteines at these particular positions. Additionally, MOMP requires some disulfide bonds to function as a pore . The function of these cysteine residues has been postulated to provide stability to the outer membrane – by forming either intra-monomer and/or inter-monomer disulfide bonds to compensate for the loss of a peptidoglycan stabilized membrane –. Others suggest that at least some cysteine residues face the periplasm where they can form disulfide bridges with other proteins as part of a larger, stabilizing complex . More recently, for other porins, such exposed cysteines have been postulated to be involved in the folding process with accessory proteins in the cytoplasm . The MOMP model presented here supports the possibility of the first hypothesis, as the cysteine residues are found at the external membrane surface and primarily in the external loops, Figure 4. Three of the eight cysteine residues, C26, C29 and C33, are found in Loop 1. These may form an intra-monomer disulfide bond, similar to that seen in Loop 1 of 18-stranded barrels for E. coli and S. enterica maltoporin (1AF6 and 2MPR, respectively) , ; or, alternately, the proximity of MOMP's Loop 1 to Loop 4 of the neighboring monomer, containing C185 and C187, may allow inter-monomer disulfide bond formation. Cysteine residues that protrude into the lipid bilayer have been exhibited by other β-barrel proteins (e.g. VDAC1 ), and this cannot yet be ruled out as a possibility for MOMP. Mapping of disulfide bonds in MOMP has yielded conflicting results , 
The tryptophan residues found in the aromatic girdle and at the trimer interface may provide additional stability to the MOMP trimer. While 16-stranded porins typically have 3–4 tryptophan residues, P. aeruginosa OprP has 10 (2O4V)  and C. trachomatis MOMP has 7. Tryptophan has the highest stabilization energy contribution of any residue for membrane proteins and is known to play a key role in the proper folding of membrane proteins –. Most of the MOMP tryptophan residues are observed in the aromatic girdle regions of the protein and thus may provide critical stabilizing interactions at the interface of membrane surface. An additional tryptophan residue on Loop 1 (W32) in our model provides key packing interactions with putative cysteine disulfide bonds and a tryptophan in Loop 4 (W183) (Figure 5B). A notable outcome of the three-dimensional model is the finding that Loop 1 and Loop 4 may be involved in stabilizing the trimer interface, as these loops have not been previously identified as being involved in immunogenic epitopes . Loops 1 and 4 from neighboring monomers are often proximal in experimentally derived crystal structures for porin trimers , , , . The residues forming inter-monomer contacts in this region are predominately hydrophilic, providing hydrogen bonds or electrostatic interactions. Disulfide bond and aromatic pi stacking interactions between these loops present novel structural features for trimer porin stabilization.
The MOMP model presented here exhibits several key structural features consistent with experimental evidence for this porin. However, the inability to resolve the structure features of residues 320–334 with what is known about other porins and the necessity of this region to span the trans-membrane suggest MOMP has some as of yet unknown unique structural characteristics for a bacterial porin. This unique nature of MOMP is not likely to be merely a shortcoming of trying to model a twilight zone protein, but rather reflects constraints that cannot be resolved based on the necessity for VD4 to be external to the pore and the unlikelihood that a sequence of charged residues K334 – S339 form a trans-membrane strand. Baehr et al.  proposed that the TTLNPTIAGK region in VD4, that is conserved among all the human C. trachomatis serovars, might be folded to form a practically inaccessible loop that can served as a site for attachment to the eukaryotic cell. This constraint may explain some of the difficulties we have encountered modeling this region of the protein. The possibility that MOMP exists in multiple conformations, similar to the major porin OprF of P. aeruginosa, cannot be ruled out . Unlike P. aeruginosa, Chlamydia do not have a peptidoglycan layer, although other proteins (in particular the cysteine-rich proteins 15 and 60-kDa) may play a compensatory biological role and MOMP may interact with them. It seems likely that MOMP adopts multiple conformations during the developmental cycle of Chlamydia and, in addition, that MOMP cysteine residues interact with other periplasmic proteins (again the 15 and 60 kDa cysteine-rich proteins) as indicated by Sugawara et al . Indeed, the inability of current algorithms to assign significant probabilities to the C-terminus strand assignments for MOMP (Figures 1 and 2) could arise from the multiple conformations, on which the prediction algorithms have not been trained.
While methods other than template based homology modeling exist for developing structural models, namely ab initio folding through constrained simulated annealing , perhaps including constraints based on residue covariance across orthologs , , MOMP is at the upper limit of protein sequence length tried with covariance methods. Additionally, the number of MOMP orthologs sequenced is comparatively low for developing these constraints and recent attempts for covariance constraints on β-barrel proteins have given results with mixed success . Overall, this 3D MOMP model represents the best case for incorporation of current experimentally derived knowledge for MOMP and structural knowledge for bacterial β-barrels. Although the model is speculative in relation to a comparative model built with higher sequence identity to 16-stranded β-barrels, it does illustrate intriguing unique features that motivate further structural and functional investigations for this important vaccine target.
Materials and Methods
Figure 6 provides a Flowchart for the data input and the iterative model building procedure described below.
TMBpro Prediction for Barrel Topology and β-strand probabilities
A C. trachomatis serovar C majority consensus sequence was constructed from all serovar C omp1 gene sequences available in GenBank on December 2, 2011. This consensus sequence is identical at the amino acid level to that of isolate CS-362-07 (GenBank accession DQ116399). This sequence, excluding the first 22 residues, was submitted to TMBpro server (http://www.igb.uci.edu/~baldig/tmb.html) . Additional submissions were made excluding residues (i) A226–A238 and L290 - D316, (ii) A64–A 83 and L290 – D316 or (iii) A64 – P85, K152 – A158, T216 – I247, T282 – V310. Additional 16-stranded β-barrel crystal structure coordinates were added to the TMBpro template database manually for comparison of automated 3D MOMP output by the server; pdb codes 1OSM , 1PHO  and 2J1N  and a composite template based on E. coli OmpF and C. acidovorans Omp32; residues A1 – R100 and F267 – F340 from 2OMF  and residues L94 – G250 from 1E54 . The algorithm's refinement steps toward reducing β-carbon clashes between neighboring residues on the barrel strands  were considered and these β-strand to template sequence alignments were used as starting points for further refinement using MOE (see below).
The C. trachomatis serovar C majority consensus sequence described above was also submitted to servers for PRED-TMBB (http://biophysics.biol.uoa.gr/PRED-TMBB/)  and TMBbeta-NET (http://psfs.crc.jp/tmbeta-net/) . A sequence omitting the majority of variable domain residues (Q61 – P85, T141 – A162, A226 – A238, L290 – A320)  was also submitted to each server.
Porin template selection
Structural template selection was based on the (i) the lowest overall 3D model search score for the four models built on 16-stranded templates used by TMBpro and (ii) accommodation of the putative MOMP Loop 3 and VD2. Table 1 lists initial search scores for 3D MOMP models built by TMBpro protocols on four different 16-stranded β-barrel templates prior to any manual modifications in β-strand assignment. MOMP had the lowest search scores with E. coli OmpF and Comamonas acidovorans Omp32 porin  template structures.
Special consideration was needed for modeling the putative Loop 3 region, which forms a pore filling eyelet in all solved 16-stranded porin structures , . MOMP residues from ∼W110 to F139 were predicted to fold into the pore based on their C-terminal proximity to β-strand 5 and the variable and weak probabilities to form β-strands (SI Table 1, Figure 2). VD2 is immediately C-terminal to F139 and known to contain epitopes, thus is unlikely that this domain would also be buried within the channel. Evaluation of structures for nine known unique 16-stranded β-barrel porins revealed two examples where the C-terminal region of the eyelet loop formed an additional loop protruding toward the external membrane surface, e.g., C. acidovorans Omp32 (1E54)  and Rhodopseudomonas blastica porin (3PRN) , and thus this may also be the case for MOMP. Based on the relatively low TMBpro search scores for MOMP built on the Omp porin templates, where lower scores are considered to be favorable, a composite MOMP template was built based on OmpF that incorporated the eyelet loop in the Omp32 structure and its C-terminal protrusion. In addition to Loop 3, β-strands 6 through 12 of Omp32 were included in the hybrid template because the Loop 3 protrusion in Omp32 would overlap with OmpF loop coordinates protruding from its strands 6–12. This template reduced potential packing clashes between the channel wall and internal Loop 3. Basing C-terminal strands 13–16 on the OmpF template resulted in a consistent trimer interface, packing interactions and closure of the barrel with the N-terminal strands. Ultimately, building the MOMP sequence on this composite template and guiding manual strand position constraints for β-strands 1–6, 15 and 16 resulted in a significantly lower search score compared to prior templates (Table 1).
Monomer Model Construction
Monomer models were built with the homology module in MOE (Molecular Operating Environment, 2011.10, Chemical Computing Group Inc., 1010 Sherbooke St. West, Suite #910, Montreal, QC, Canada H3A 2R7, 2011). Default settings, including automated disulfide bond detection were set to give 25 “medium” refined structures per alignment and scored based on contact energy, packing score and MOE's GB/VI energy. Manually derived sequence alignments were maintained between each template and MOMP sequence during model building by constraining the putative β-strand regions (17–20 constraints per run). This limited number of constraints allows some modifications in β-strand alignment thus different model outcomes. The models were analyzed for relative energies and protein geometry to minimize the number of outlier residues for Ramachandran values, bond lengths, angles, and rotameric states.
The TMBpro server output includes per residue predictions for the query sequence into three bins; membrane-facing, channel-facing or non-transmembrane. Adjustment to the automated TMBpro β-strand to template alignments were made after considering the following criteria: (i) consensus of β-strand residues by various algorithms, Figure 2, (ii) comparison with β-strand positions in published 2D topology maps, Figure 2, (iii) the probabilities for channel facing (C) and membrane-facing (M) (SI Table 1), and (iv) aromatic residue positions consistent with the putative membrane interface.
The probabilities for channel facing vs. membrane-facing residues provided a crucial criterion for assessing specific residue alignments to the templates, especially where residues would not be considered entirely hydrophobic. For example, lysine or arginine residues are generally considered unfavorable as membrane-facing residues due to their charged functional groups, yet there is precedence for these residues to “snorkel”  at β-strand termini. Snorkeling involves the positioning of long hydrophobic side chains inside the membrane and charged groups at the polar membrane interface. Additionally, although threonine residues are not hydrophobic, they are found to be membrane facing more often than expected. Deviations from the concept of merely alternating hydrophobic and hydrophilic residue patterns in trans-membrane β-strands has been noted by Wimley, et al. , who found that analyzing abundance patterns for residues enriched at β-strand centers or membrane limits may provide a more reliable prediction scheme for residue orientation.
Homo-Trimer Construction and Analysis
β-strand limits for the MOMP monomer were used for align input files to the Prime homology module (Prime, version 3.0, Schrödinger, Inc., New York, NY, 2011) the module used to build the MOMP homo-trimer model. A remodeled OmpF-Omp32 hybrid template based on the high resolution E. coli OmpF coordinates 1HXX , and superimposed three copies for the monomer template to each subunit of the E. coli OmpF trimer to served as the input template for trimeric MOMP model. The higher resolution OmpF coordinates have fewer contact errors at the trimer interface and hence propagates fewer errors to the MOMP model. Each MOMP model monomer has residues L1 – C26, T31 – G63, P87 – S147, P155 – F215, D248 – R275, A320 – F375.
Final models were submitted to the PISA server for trimer interface surface area calculation and analysis (http://www.ebi.ac.uk/msd-srv/prot_int/pistart.html) . Note that the surface area is only counted once for each interface.
MOMP sequence alignment based on structural alignment to other 16-stranded porin structures. β-strand positions (blue arrows), porin residues described as conserved by Tanabe et al.  (yellow) and the hybrid template splicing positions (orange line) are shown. Variable domain positions (truncated in some cases) are indicated in red, position of eyelet loop shown in green.
Support from the National Biomedical Computation Resource, the Center for Theoretical Biological Physics, and the UCSD Drug Discovery Institute is gratefully acknowledged.
Conceived and designed the experiments: LMdlM RMB REA VAF AR. Performed the experiments: VAF AR. Analyzed the data: VAF REA RMB LMdlM. Contributed reagents/materials/analysis tools: AR PB REA RMB. Wrote the paper: VAF AR RMB LMdlM REA. Contributed equally to research direction: LMdlM RMB REA. Reviewed results: VAF AR PB RMB LMdlM REA.
- 1. WHO (2013) Initiative for Vaccine Research. Available: http://www.who.int/vaccine_research/diseases/soa_std/en/index1.html.
- 2. Schachter J, Dawson CR (1978) Human chlamydial infections. Littleton, Mass.: PSG Publishing Co.
- 3. Stamm WE (2008) Chlamydia trachomatis infections of the adult. In: Holmes KK, Stamm WE, Piot P, Wasserheit JW, Corey L, et al.., editors. Sexually transmtted diseases. New York: McGrawHill Book Co. pp. 575–593.
- 4. Taylor HR (2008) Trachoma: a blinding scourge from the Bronze Age to the twenty-first century. East Melbourne, Australia: Haddington Press.
- 5. Westrom L, Joesoef R, Reynolds G, Hagdu A, Thompson SE (1992) Pelvic inflammatory disease and fertility. A cohort study of 1,844 women with laparoscopically verified disease and 657 control women with normal laparoscopic results. Sexually transmitted diseases 19: 185–192.
- 6. Miller WC, Ford CA, Morris M, Handcock MS, Schmitz JL, et al. (2004) Prevalence of chlamydial and gonococcal infections among young adults in the United States. JAMA : the journal of the American Medical Association 291: 2229–2236.
- 7. Brunham RC, Pourbohloul B, Mak S, White R, Rekart ML (2005) The unexpected impact of a Chlamydia trachomatis infection control program on susceptibility to reinfection. The Journal of infectious diseases 192: 1836–1844.
- 8. de la Maza LM, Peterson EM (2002) Vaccines for Chlamydia trachomatis infections. Current opinion in investigational drugs 3: 980–986.
- 9. Morrison RP, Caldwell HD (2002) Immunity to murine chlamydial genital infection. Infection and immunity 70: 2741–2751.
- 10. Caldwell HD, Kromhout J, Schachter J (1981) Purification and partial characterization of the major outer membrane protein of Chlamydia trachomatis. Infection and immunity 31: 1161–1176.
- 11. Murzin AG, Brenner SE, Hubbard T, Chothia C (1995) SCOP: a structural classification of proteins database for the investigation of sequences and structures. J Mol Biol 536–540.
- 12. Schulz GE (2000) beta-Barrel membrane proteins. Curr Opin Struct Biol 10: 443–447.
- 13. Schulz GE (2002) The structure of bacterial outer membrane proteins. Biochimica et biophysica acta 1565: 308–317.
- 14. Nikaido H (2003) Molecular basis of bacterial outer membrane permeability revisited. Microbiology and molecular biology reviews : MMBR 67: 593–656.
- 15. Sun G, Pal S, Sarcon AK, Kim S, Sugawara E, et al. (2007) Structural and functional analyses of the major outer membrane protein of Chlamydia trachomatis. J Bacteriol 189: 6222–6235.
- 16. Bavoil P, Ohlin A, Schachter J (1984) Role of disulfide bonding in outer membrane structure and permeability in Chlamydia trachomatis. Infect Immun 44: 479–485.
- 17. Jones HM, Kubo A, Stephens RS (2000) Design, expression and functional characterization of a synthetic gene encoding the Chlamydia trachomatis major outer membrane protein. Gene 258: 173–181.
- 18. Wang Y, Kahane S, Cutcliffe LT, Skilton RJ, Lambden PR, et al. (2011) Development of a transformation system for Chlamydia trachomatis: restoration of glycogen biosynthesis by acquisition of a plasmid shuttle vector. PLoS pathogens 7: e1002258.
- 19. Wimley WC (2002) Toward genomic identification of beta-barrel membrane proteins: composition and architecture of known structures. Protein Sci 11: 301–312.
- 20. Jackups R Jr, Liang J (2005) Interstrand pairing patterns in beta-barrel membrane proteins: the positive-outside rule, aromatic rescue, and strand registration prediction. J Mol Biol 354: 979–993.
- 21. Diederichs K, Freigang J, Umhau S, Zeth K, Breed J (1998) Prediction by a neural network of outer membrane beta-strand protein topology. Protein Sci 7: 2413–2420.
- 22. Bagos PG, Liakopoulos TD, Spyropoulos IC, Hamodrakas SJ (2004) PRED-TMBB: a web server for predicting the topology of beta-barrel outer membrane proteins. Nucleic Acids Res 32: W400–404.
- 23. Bagos PG, Liakopoulos TD, Hamodrakas SJ (2005) Evaluation of methods for predicting the topology of beta-barrel outer membrane proteins and a consensus prediction method. BMC Bioinformatics 6: 7.
- 24. Zhai Y, Saier MH Jr (2002) The beta-barrel finder (BBF) program, allowing identification of outer membrane beta-barrel proteins encoded within prokaryotic genomes. Protein Sci 11: 2196–2207.
- 25. Berven FS, Flikka K, Jensen HB, Eidhammer I (2004) BOMP: a program to predict integral beta-barrel outer membrane proteins encoded within genomes of Gram-negative bacteria. Nucleic Acids Res 32: W394–399.
- 26. Gromiha MM, Suwa M (2005) A simple statistical method for discriminating outer membrane proteins with better accuracy. Bioinformatics 21: 961–968.
- 27. Garrow AG, Agnew A, Westhead DR (2005) TMB-Hunt: an amino acid composition based method to screen proteomes for beta-barrel transmembrane proteins. BMC Bioinformatics 6: 56.
- 28. Waldispuhl J, Berger B, Clote P, Steyaert JM (2006) transFold: a web server for predicting the structure and residue contacts of transmembrane beta-barrels. Nucleic Acids Res 34: W189–193.
- 29. Randall A, Cheng J, Sweredoski M, Baldi P (2008) TMBpro: secondary structure, beta-contact and tertiary structure prediction of transmembrane beta-barrel proteins. Bioinformatics 24: 513–520.
- 30. Rodriguez-Maranon MJ, Bush RM, Peterson EM, Schirmer T, de la Maza LM (2002) Prediction of the membrane-spanning beta-strands of the major outer membrane protein of Chlamydia. Protein Sci 11: 1854–1861.
- 31. Findlay HE, McClafferty H, Ashley RH (2005) Surface expression, single-channel analysis and membrane topology of recombinant Chlamydia trachomatis Major Outer Membrane Protein. BMC Microbiol 5: 5.
- 32. Wang Y, Berg EA, Feng X, Shen L, Smith T, et al. (2006) Identification of surface-exposed components of MOMP of Chlamydia trachomatis serovar F. Protein Sci 15: 122–134.
- 33. Zeth K, Thein M (2010) Porins in prokaryotes and eukaryotes: common themes and variations. Biochem J 431: 13–22.
- 34. Gromiha MM, Ahmad S, Suwa M (2004) Neural network-based prediction of transmembrane beta-strand segments in outer membrane proteins. Journal of computational chemistry 25: 762–767.
- 35. Dutzler R, Rummel G, Alberti S, Hernandez-Alles S, Phale P, et al. (1999) Crystal structure and functional characterization of OmpK36, the osmoporin of Klebsiella pneumoniae. Structure 7: 425–434.
- 36. Kabsch W, Sander C (1983) Dictionary of protein secondary structure: pattern recognition of hydrogen-bonded and geometrical features. Biopolymers 22: 2577–2637.
- 37. White SH, Wimley WC (1998) Hydrophobic interactions of peptides with membrane interfaces. Biochimica et biophysica acta 1376: 339–352.
- 38. Zeth K, Diederichs K, Welte W, Engelhardt H (2000) Crystal structure of Omp32, the anion-selective porin from Comamonas acidovorans, in complex with a periplasmic peptide at 2.1 A resolution. Structure 8: 981–992.
- 39. Tanabe M, Nimigean CM, Iverson TM (2010) Structural basis for solute transport, nucleotide regulation, and immunological recognition of Neisseria meningitidis PorB. Proc Natl Acad Sci U S A 107: 6811–6816.
- 40. Naveed H, Jackups R Jr, Liang J (2009) Predicting weakly stable regions, oligomerization state, and protein-protein interfaces in transmembrane domains of outer membrane proteins. Proceedings of the National Academy of Sciences of the United States of America 106: 12735–12740.
- 41. Naveed H, Jimenez-Morales D, Tian J, Pasupuleti V, Kenney LJ, et al. (2012) Engineered oligomerization state of OmpF protein through computational design decouples oligomer dissociation from unfolding. Journal of molecular biology 419: 89–101.
- 42. Geula S, Naveed H, Liang J, Shoshan-Barmatz V (2012) Structure-based analysis of VDAC1 protein: defining oligomer contact sites. The Journal of biological chemistry 287: 2179–2190.
- 43. Naveed H, Liang J (2012) TMBB-Explorer: A Webserver to Predict the Structure, Oligomerization State, Ppi Interface, and Thermodynamic Properties of the Transmembrane Domains of Outer Membrane Proteins. Biophys J 102: 469a.
- 44. Phale PS, Philippsen A, Kiefhaber T, Koebnik R, Phale VP, et al. (1998) Stability of trimeric OmpF porin: the contributions of the latching loop L2. Biochemistry 37: 15663–15670.
- 45. Zeth K, Meins T, Vonrhein C (2008) Approaching the structure of human VDAC1, a key molecule in mitochondrial cross-talk. Journal of bioenergetics and biomembranes 40: 127–132.
- 46. Ujwal R, Cascio D, Colletier JP, Faham S, Zhang J, et al. (2008) The crystal structure of mouse VDAC1 at 2.3 A resolution reveals mechanistic insights into metabolite gating. Proc Natl Acad Sci U S A 105: 17742–17747.
- 47. Meng G, Surana NK, St Geme JW 3rd, Waksman G (2006) Structure of the outer membrane translocator domain of the Haemophilus influenzae Hia trimeric autotransporter. The EMBO journal 25: 2297–2304.
- 48. Huang Z, Chen M, Li K, Dong X, Han J, Zhang Q (2010) Cryo-eectron tomography of Chlamydia trachomatis gives a clue to the mechanism of outer membrane changes. J Electron Microsc (Tokyo) 59: 237–241.
- 49. Nunes A, Nogueira PJ, Borrego MJ, Gomes JP (2010) Adaptive evolution of the Chlamydia trachomatis dominant antigen reveals distinct evolutionary scenarios for B- and T-cell epitopes: worldwide survey. PLoS One 5.
- 50. Barbour AG, Amano K, Hackstadt T, Perry L, Caldwell HD (1982) Chlamydia trachomatis has penicillin-binding proteins but not detectable muramic acid. J Bacteriol 151: 420–428.
- 51. Ghuysen JM, Goffin C (1999) Lack of cell wall peptidoglycan versus penicillin sensitivity: new insights into the chlamydial anomaly. Antimicrob Agents Chemother 43: 2339–2344.
- 52. Fox A, Rogers JC, Gilbart J, Morgan S, Davis CH, et al. (1990) Muramic acid is not detectable in Chlamydia psittaci or Chlamydia trachomatis by gas chromatography-mass spectrometry. Infect Immun 58: 835–837.
- 53. Yen TY, Pal S, de la Maza LM (2005) Characterization of the disulfide bonds and free cysteine residues of the Chlamydia trachomatis mouse pneumonitis major outer membrane protein. Biochemistry 44: 6250–6256.
- 54. Hatch TP, Vance DW Jr, Al-Hossainy E (1981) Identification of a major envelope protein in Chlamydia spp. Journal of Bacteriology 146: 426–429.
- 55. Newhall WJ, Jones RB (1983) Disulfide-linked oligomers of the major outer membrane protein of chlamydiae. Journal of Bacteriology 154: 998–1001.
- 56. Sugawara E, Nagano K, Nikaido H (2012) Alternative folding pathways of the major porin OprF of Pseudomonas aeruginosa. The FEBS journal
- 57. Wang YF, Dutzler R, Rizkallah PJ, Rosenbusch JP, Schirmer T (1997) Channel specificity: structural basis for sugar discrimination and differential flux rates in maltoporin. Journal of molecular biology 272: 56–63.
- 58. Meyer JE, Hofnung M, Schulz GE (1997) Structure of maltoporin from Salmonella typhimurium ligated with a nitrophenyl-maltotrioside. Journal of molecular biology 266: 761–775.
- 59. Moraes TF, Bains M, Hancock RE, Strynadka NC (2007) An arginine ladder in OprP mediates phosphate-specific transfer across the outer membrane. Nat Struct Mol Biol 14: 85–87.
- 60. Yau WM, Wimley WC, Gawrisch K, White SH (1998) The preference of tryptophan for membrane interfaces. Biochemistry 37: 14713–14718.
- 61. Hong H, Park S, Jimenez RH, Rinehart D, Tamm LK (2007) Role of aromatic side chains in the folding and thermodynamic stability of integral membrane proteins. J Am Chem Soc 129: 8320–8327.
- 62. Sanchez KM, Kang G, Wu B, Kim JE (2011) Tryptophan-lipid interactions in membrane protein folding probed by ultraviolet resonance Raman and fluorescence spectroscopy. Biophys J 100: 2121–2130.
- 63. Kreusch A, Schulz GE (1994) Refined structure of the porin from Rhodopseudomonas blastica. Comparison with the porin from Rhodobacter capsulatus. J Mol Biol 243: 891–905.
- 64. Basle A, Rummel G, Storici P, Rosenbusch JP, Schirmer T (2006) Crystal structure of osmoporin OmpC from E. coli at 2.0 A. J Mol Biol 362: 933–942.
- 65. Baehr W, Zhang YX, Joseph T, Su H, Nano FE, et al. (1988) Mapping antigenic domains expressed by Chlamydia trachomatis major outer membrane protein genes. Proceedings of the National Academy of Sciences of the United States of America 85: 4000–4004.
- 66. Brunger AT, Adams PD, Clore GM, DeLano WL, Gros P, et al. (1998) Crystallography & NMR system: A new software suite for macromolecular structure determination. Acta crystallographica Section D, Biological crystallography 54: 905–921.
- 67. Crowder S, Holton J, Alber T (2001) Covariance analysis of RNA recognition motifs identifies functionally linked amino acids. Journal of molecular biology 310: 793–800.
- 68. Marks DS, Colwell LJ, Sheridan R, Hopf TA, Pagnani A, et al. (2011) Protein 3D structure computed from evolutionary sequence variation. PLoS One 6: e28766.
- 69. Cowan SW, Schirmer T, Rummel G, Steiert M, Ghosh R, et al. (1992) Crystal structures explain functional properties of two E. coli porins. Nature 358: 727–733.
- 70. Yuan Y, Zhang YX, Watkins NG, Caldwell HD (1989) Nucleotide and deduced amino acid sequences for the four variable domains of the major outer membrane proteins of the 15 Chlamydia trachomatis serovars. Infection and immunity 57: 1040–1049.
- 71. Schulz GE (1996) Porins: general to specific, native to engineered passive pores. Current opinion in structural biology 6: 485–490.
- 72. Schmid B, Maveyraud L, Kromer M, Schulz GE (1998) Porin mutants with new channel properties. Protein Sci 7: 1603–1611.
- 73. Phale PS, Philippsen A, Widmer C, Phale VP, Rosenbusch JP, et al. (2001) Role of charged residues at the OmpF porin channel constriction probed by mutagenesis and simulation. Biochemistry 40: 6319–6325.
- 74. Xu Q, Canutescu AA, Wang G, Shapovalov M, Obradovic Z, et al. (2008) Statistical analysis of interface similarity in crystals of homologous proteins. J Mol Biol 381: 487–507.
- 75. Pettersen EF, Goddard TD, Huang CC, Couch GS, Greenblatt DM, et al. (2004) UCSF Chimera–a visualization system for exploratory research and analysis. Journal of computational chemistry 25: 1605–1612.
- 76. Karshikoff A, Spassov V, Cowan SW, Ladenstein R, Schirmer T (1994) Electrostatic properties of two porin channels from Escherichia coli. J Mol Biol 240: 372–384.