Vibrio cholerae, the enteropathogenic gram negative bacteria is one of the main causative agents of waterborne diseases like cholera. About 1/3rd of the organism's genome is uncharacterised with many protein coding genes lacking structure and functional information. These proteins form significant fraction of the genome and are crucial in understanding the organism's complete functional makeup. In this study we report the general structure and function of a family of hypothetical proteins, Domain of Unknown Function 3233 (DUF3233), which are conserved across gram negative gammaproteobacteria (especially in Vibrio sp. and similar bacteria). Profile and HMM based sequence search methods were used to screen homologues of DUF3233. The I-TASSER fold recognition method was used to build a three dimensional structural model of the domain. The structure resembles the transmembrane beta-barrel with an axial N-terminal helix and twelve antiparallel beta-strands. Using a combination of amphipathy and discrimination analysis we analysed the potential transmembrane beta-barrel forming properties of DUF3233. Sequence, structure and phylogenetic analysis of DUF3233 indicates that this gram negative bacterial hypothetical protein resembles the beta-barrel translocation unit of autotransporter Va secretory mechanism with a gene organisation that differs from the conventional Va system.
Citation: Prakash A, Yogeeshwari S, Sircar S, Agrawal S (2011) Protein Domain of Unknown Function 3233 is a Translocation Domain of Autotransporter Secretory Mechanism in Gamma proteobacteria. PLoS ONE 6(11): e25570. https://doi.org/10.1371/journal.pone.0025570
Editor: Eugene A. Permyakov, Russian Academy of Sciences, Institute for Biological Instrumentation, Russian Federation
Received: August 15, 2011; Accepted: September 7, 2011; Published: November 1, 2011
Copyright: © 2011 Prakash et al. This is an open-access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited.
Funding: We thank the Department of Information Technology (DIT), Government of India, New Delhi, India for supporting stipends to AP, YS and SS. We also thank BioCOS Life Sciences Private Limited, Bangalore, India (www.biocosls.com) for financial support and technical assistance to Dr. Agrawal. The funders had no role in study design, data collection and analysis, decision to publish, or preparation of the manuscript.
Competing interests: The authors have read the journal's policy and have the following conflicts: Dr. Agrawal is currently affiliated with BioCOS Life Sciences Pvt. Limited. There are no patents, products in development or marketed products to declare. This does not alter the authors' adherence to all the PLoS ONE policies on sharing data and materials, as detailed online in the guide for authors.
Domain of Unknown Function (DUF) 3233 (PFAM: PF11557) is a family of uncharacterised hypothetical proteins conserved among gram negative gammaproteobacteria. Representative members of this domain include marine bacteria from genus Vibrio, Shewanella, Colwellia and Alcanivorax of which Vibrio cholerae, Vibrio parahaemolyticus, Vibrio splendidus and Vibrio vulnificus are pathogenic to human and aquatic life. Vibrio cholerae causes seasonal outbreaks of cholera of epidemic proportions in developing countries with high mortality rates . The enterotoxins produced by the bacteria after colonising the host small intestine disrupts the ion transport by the intestinal epithelial cells causing outflow of large volumes of fluids into the intestine leading to watery diarrhoea, dehydration and in severe cases, death ,.
Significant fraction of genomes of Vibrio species lack structure function annotation and most of these gene products are classified as hypothetical proteins or domains of unknown function . The PFAM  database in its 24th release lists about 3000 DUF families. Many of these DUF families are kingdom specific (DUF2883, DUF3328, DUF3329), limited/shared between kingdoms (DUF1497, DUF3609) or restricted/specific to certain organisms (DUF1196, DUF2667). The specific and ubiquitous nature of these domains suggests their functional importance in organism specific niches or a common biological role.
Identifying homologous protein families through sequence based search marks the first step in the annotation of DUFs, providing an initial broad picture of the protein's probable family and function. Sequence homology search becomes increasingly powerful when we advance from normal sequence-sequence based searches to methods that uses profile or HMM information like HHsenser , which increases the efficiency of finding remote homologues. In silico structure prediction methods together with sequence similarity detection methods assist the annotation of fold-function space. Fold recognition methods like I-TASSER  help predict the 3 dimensional (3D) structure and functions of proteins that share low sequence identity with other known structures.
In this study we analyse the sequence and structural characteristics of DUF3233 using computational approaches and try to infer various properties of this domain. Sequence search by HHsenser identifies similarity with the beta-barrel translocation unit of autotransporter Va secretory proteins. Sequence homology combined with secondary structure prediction indicates a beta-barrel domain of 12 beta-strands. The predicted 3D model from I-TASSER shows the structure with an overall beta-barrel topology with an N terminal helix running along the central barrel axis perpendicular to the 12 antiparallel strands that form the barrel. Amphipathicity and membrane barrel discrimination analysis suggest the domain is a potential outer membrane gram negative beta-barrel protein.
Autotransporter translocation units belong to the transmembrane beta-barrel fold in SCOP database , defined by a beta-barrel of 12 to 14 antiparallel strands with an N terminal helix perpendicular to the barrel. Finally with the analysis of genomic context of DUF3233 we could infer that this outer membrane beta-barrel translocation domain has a gene organisation that is not typical of the autotransporter Va secretory mechanism.
Sequence based characterization of DUF3233 as autotransporter β-domain protein
Sequence search for homologues with PSI-BLAST using a representative query, Vibrio cholerae DUF3233 (RefSeq: NP_232949) against the NCBI nr database with a threshold 0.005, reached convergence at the 4th iteration. The resulting sequences identified were hypothetical proteins conserved among gram negative proteobacteria. For improved search and better coverage of homologous sequence space, information from aligned regions of DUF3233 sequences in the form of a multiple sequence alignment profile was queried with HHsenser. From the resulting sequences in the permissive alignment list we were able to infer homology between DUF3233 and the outer membrane beta-barrel translocation domain of autotransporter proteins. DUF3233 shares sequence similarity with outer membrane beta-barrel domain of Ochrobactrum intermedium autotransporter (e-value 1E-34, 95% coverage, 22% identity), Rhizobium leguminosarum adhesin autotransporter (e-value 2E-29, 94% coverage, 18% identity) and Yersinia aldovae AidA adhesin autotransporter (e-value 2E-26, 89% coverage, 18% identity). Interestingly, a number of gram negative hypothetical proteins were picked up as potential homologues, which showed fair amount of similarity to the autotransporter beta-domain (Table S1).
Position specific scoring matrix (PSSM) profile based discrimination analysis using TMBETADISC-RBF , predicts the outer membrane beta-barrel nature of DUF3233. To confirm homology with autotransporter protein family we queried DUF3233 sequences against putative outer membrane proteins (OMPs). A pairwise hidden Markov model (HMM) search by HHomp  identifies DUF3233 as an OMP. V. cholerae DUF3233 shares homology with HHomp cluster 12.1.6 (96% probability). This cluster comprises profile HMMs of autotransporter sequences whose 12 stranded beta-barrel transmembrane domains conform to the translocation unit of autotransporter NalP . DUF3233 sequences from Colwellia (94% probability), Shewanella (96% probability) and Ferrimonas (95% probability) were all found to share homology with autotransporter protein family.
DUF3233 is a solitary outer membrane autotransporter β-barrel domain
Proteins targeted for transport across membranes posses leader sequence or signal peptide at their N-terminus, which directs translocation. We analysed DUF3233 sequences using a combination of artificial neural networks and HMMs implemented in SignalP  to predict the presence and location of signal peptide cleavage sites. SignalP identified the presence of N-terminal signal peptide of an average length of 23 amino acid residues having a positively charged amino terminal followed by a hydrophobic region and hydrophilic carboxy terminal. Signal peptides are cleaved from the exported protein by specific proteases called signal peptidases (SPases) . Prediction of cleavage mechanism of these signal sequences by LipoP  identifies SPase 1 target site, indicating DUF3233 might be a non-lipoprotein.
We browsed DUF3233 genomic region of all representative organisms with STRING  to look for possible gene fusion events with other domains and found no such occurrence. DUF3233 is a single domain protein found on the small chromosome 2 in Vibrio species with an upstream gamma-glutamyltranspeptidase (GGT) or response regulatory protein transcribed in one potential operon (Table 1). These upstream proteins lack the N-terminal signal sequence for inner membrane transport and analysis through SecretomeP  indicates that these proteins are not exported through non-classical secretory system. Gene organisation of DUF3233 therefore suggests a solitary translocation unit with an absent upstream secretory protein.
Structure based validation of DUF3233 as transmembrane β-barrel domain of autotransporter proteins
DUF3233 sequence based PSI-BLAST search for proteins with known structures (72,386 structures in PDB as of April 2011) fetched results with a maximum alignment length covering 51 residues. Secondary structure assignment by PSIPRED  predicts an N-terminal α-helix (αN) followed by 12 consecutive β-strands (β1–β12) interspersed by two short turns of α-helices α1 and α2 predicted to occur between β1–β2 and between β5–β6 respectively. With no suitable template with significant sequence homology available, we used the fold recognition algorithm implemented in I-TASSER to predict a 3D model of DUF3233. The translocation unit of NalP from N. meningitidis (PDB: 1UYN_X, 15% identity, 85% coverage, normalised Z-score above 1) was identified by I-TASSER in the top four threading templates to model V. cholerae DUF3233. The predicted structure of V. cholerae DUF3233 (Figure 1) resembles the beta-barrel translocation unit of autotransporter proteins, aligning over 75% structurally equivalent positions with the template and an RMSD of 2.3. The domain has an N-terminal α-helix running along the central axis surrounded by beta-barrel formed by twelve anti-parallel beta-strands. Predicted strand assembly within the outer membrane shows the carboxy and amino terminal of the beta-barrel point towards the periplasmic space, the central helix is oriented such that its N-terminal is pointed towards the external environment. Secondary structure based sequence alignment of DUF3233 with the translocation unit of autotransporters shows a similar domain organisation (Figure 2). Using alignment of DUF3233 sequences the average hydropathy, amphipathicity and similarity plots was generated with AveHAS . Figure 3, shows 12 hydrophobicity and amphipathicity peaks with an average stretch of 10 to 15 residues per peak that may form transmembrane beta-strands.
A) The predicted structure resembles the beta-barrel translocation unit of autotransporter proteins. The 12 beta-strands (blue) orient in anti-parallel fashion enclosing the N-terminal helix (red), the two additional helices α1 and α2 are shown in green. B) Superimposed structures of DUF3233 (magenta) and NalP translocation unit of N. meningitidis 1uyn_x (blue). Figures were generated with Rasmol.
Secondary structure diagram for NalP is shown at the top. The two extra helices (α1 and α2) present among DUF3233 proteins are shown in green. DUF3233 and autotransporter sequences representing each cluster are used in the alignment. Nme_Neisseria meningitidis, Pae_Pseudomonas aeruginosa, Eco_Escherichia coli, Sen_Salmonella enterica, Bpe_Bordetella pertussis, Ctr_Chlamydia trachomatis, Hin_Haemophilus influenzae, Rri_Rickettsia rickettsii, Mca_Moraxella catarrhalis, Vch_Vibrio cholerae, Cps_Colwellia psychrerythraea, Slo_Shewanella loihica.
DUF3233 is evolutionarily linked to the autotransporters
To infer evolutionary relation with type V secretory proteins, we analysed DUF3233 representatives with members of both Va and Vb family (Table S2). The third type of proteins found in the type V secretory system, type Vc or AT-2 proteins, which are characterised by trimeric C-terminal beta-barrel  were not considered for phylogenetic analysis. The inferred phylogenetic tree (Figure 4) classifies members of the two families into two separate clans. Proteins are grouped into clusters with similar function, architecture and organism type as analysed in  and . DUF3233 family sequences though related to autotransporters form a distinct group from the main autotransporter clan indicating that these domains represent new cluster of autotransporters.
51 Va family (red), 25 Vb family (blue) translocation unit and 3 DUF3233 (green) representative sequences were used to generate a 1000 replicate bootstrap phylogenetic tree. Eukaryotic Vb sequences representing clusters 1 and 2 were not considered in the analysis. Sequences of each family are grouped into previously reported clusters (C 1 to 11 and C 1 to 4); DUF3233 sequences form a new cluster of the autotransporter family. Numbers at the node indicate neighbour-joining bootstrap percentages. Phylogenetic analyses were conducted in MEGA4.
A vital part in the survival and adaptation mechanism of bacteria lies in the constant interaction with their extra-cellular environment. Bacteria secrete a wide range of molecules into the extra-cellular milieu that includes enzymes, which break down carbohydrates, proteins and lipids, and virulence factors such as adhesins and toxins by those involved in pathogenesis. Transport of these molecules is mediated by protein complexes through conserved secretory pathways. Of the 6 types of secretory mechanisms known in gram-negative bacteria (type I to type VI), type V represents the simplest transport system. Proteins of the type V secretory system fall under the autotransporter (Va), two partner secretion (Vb) and the AT-2 (Vc) families, and share a similar domain organisation: an N-terminal signal peptide for inner membrane translocation followed by a passenger protein which is normally a virulence determinant and a C-terminal translocation unit for transporting the upstream passenger protein .
Proteins of the autotransporter (Va) family were the first to be described  and form the largest representation of this system . Autotransporters export a wide range of toxins and enzymes  to the cell surface or secrete them into the external environment. The passenger domain and translocation unit of autotransporters are both expressed as a single polypeptide  making the translocation unit highly specific and committed for transporting only the upstream passenger. Solved experimental structures of the autotransporter translocation unit – show that they all possess a similar structure, a beta-barrel of 12 antiparallel strands with a central N-terminal helix running along the barrel axis. Proteins of the two partner secretion (Vb) are widely distributed and follow a similar mode of function, transporting cytolysins, adhesions and metalloproteases . The secreted exoprotein and the transporter unlike the Va proteins are not linked but, are expressed as two separate proteins transcribed in a single operon . Vb transporters are predicted to have a multidomain architecture  and a relatively wider barrel made of 16  to 20  beta-strands. The newly discovered AT-2 family (Vc)  represents proteins secreted via a homotrimeric mechanism . Proteins secreted through this system are mainly implicated in virulence . With a domain organisation similar to that of autotransporters, the system functions with the coming together of three individual proteins each complete with an N-terminal signal peptide, a passenger unit and four beta-strand domain at the C-terminal which makes a complete closed 12 stranded beta-barrel translocation unit upon trimerisation ,.
The present work describes sequence and structure based characterization of proteobacteria DUF3233 as a beta-barrel transmembrane domain of autotransporter proteins. DUF3233 packs an average 312 amino acid residues (including N-terminal signal peptide) and is currently classified as a domain of unknown function.
DUF3233 is encoded as a single domain protein, homologous to the translocation unit of autotransporters. One aspect of DUF3233 that distinguishes it from other main class autotransporters is that it lacks a covalently linked N-terminal passenger domain, to which C-terminal translocation units of all autotransporters are committed to transport. Few autotransporter representatives of two-polypeptide architecture  might suggest the secretion of co-transcribed upstream proteins similar to the TPS system, but considering the cytosolic nature of upstream proteins, extracellular translocation seems unlikely.
Few representative members from the Vibrio genus express DUF3233 and upstream putative GGT or response regulatory proteins in a single operon (Table 1). Over expression of GGT  and GGDEF domain proteins , are implicated in pathogenesis. The prokaryotic GGT is shown to be a major factor in the colonisation of gut and gastric mucosa ,. The upstream response regulators are two-domain proteins with an N-terminal CheY-like regulatory and a conserved C-terminal GGDEF effector domain, which is responsible for eliciting pathogenic response through cyclic di-GMP mediated exopolysaccharide synthesis and biofilm formation . Genes encoding virulence products in V. cholerae are organised in clusters or operons , and since gene encoding DUF3233 is located among virulent genes, the possibility of the involvement of DUF3233 in pathogenesis cannot be overlooked.
The translocation units of autotransporters exhibit conserved amino acid consensus motif at their carboxy terminus, the barrel closing beta-strand displays alternate arrangement of polar and hydrophobic residues terminating with a conserved aromatic amino acid at the barrel terminus which is usually a phenylalanine or a tryptophan . Hendrixson et al.,  demonstrated the importance of C-terminal consensus motif on the viability of H. influenzae Hap translocation unit. Deletion of terminal 12 residues proved detrimental to the outer membrane localisation; while the stability and/or outer membrane localisation of the translocation unit was affected with the deletion of all three terminal residues, point mutations of these residues showed no effect on the outer membrane localisation or secretion of the mature protein . DUF3233 displays consensus pattern at its C-terminal that resembles the conserved motif found among autotransporters discussed above (Figure S1). A stretch of alternating polar and hydrophobic residues precedes the terminal beta-strand having a hydrophobic segment and a conserved ‘terminal’ phenylalanine or a tyrosine residue. Interestingly the extreme carboxy terminus harbours three conserved polar residues [N/D][Q/E][D/E] after the ‘terminal’ F/Y. Secondary structure and predicted models of DUF3233 show the hydrophilic residues of the “polar tail” are not part of the terminal beta-sheet, but instead form a short overhang pointed towards the periplasm. As yet, we do not know the significance and possible role of these tail polar residues on the outer membrane localisation and stability.
DUF3233 exhibits certain features that are in common with the translocation units of type Va secretory proteins and yet possesses characteristics that are not typical to the proteins of the above system. DUF3233 represents a translocation unit that is devoid of a secretable passenger unit. Considering its location in the representative genomes alongside other virulence genes, we hypothesize that this domain is involved in pathogenesis. However, the mechanism apparently looks new and different than a typical type Va secretion system.
Our study on the proteobacterial protein DUF3233 with a combination of methods like sequence similarity searches, outer membrane beta-barrel discrimination, phylogenetic analysis, and fold recognition has led us to a consensus at annotating fold and function to this domain.
Sequence similarity search suggests that DUF3233 has remote homology with the translocation unit of autotransporter proteins of the type V secretory system. The domain's outer membrane beta-barrel nature was further emphasised by signal peptide, outer membrane beta-barrel discrimination and amphipathicity analysis. Secondary structure prediction and alignment with translocation units, and inputs from the predicted model suggests that DUF3233 and the translocation unit of autotransporter proteins share a similar domain organisation.
Drawing a consensus from various in silico prediction methods it appears that DUF3233 is a cluster of remote homologues of autotransporters. This is the first report of an autotransporter like protein family in Vibrio species, and though within the realm of bioinformatics we were able to infer its probable family and fold, the pathogenic mechanism still remains to be explored and seeks further experimental studies.
Materials and Methods
Sequence similarity search
20 DUF3233 genes from NCBI comprising one sequence each from Aliivibrio, Colwellia and Ferrimonas, five sequences from Shewanella and twelve from Vibrio species were fetched using their corresponding RefSeq ids. YP_001366070 and YP_001555810 were excluded because of their short domain size and a total of 18 DUF3233 sequences were included in this study. The signal peptides of DUF3233 sequences have not been included in various analyses of this study unless mentioned. PSI-BLAST  profile-sequence search was used to probe homologous protein families against NCBI nonredundant (nr) database with default parameters. ClustalW multiple sequence alignment profile of DUF3233 protein sequences was taken in as input for HHsenser to search the nr database with a threshold e-value of 0.001 and default parameters for improved homolog coverage. TMBETADISC-RBF , PSSM profile based discrimination of beta-barrel OMPs from other folding types like globular and membrane proteins was used to assess the outer membrane nature of DUF3233. DUF3233 sequences were queried against HHomp database  to detect homology with other known OMPs.
Signal peptide and Genomic context analysis
The presence of N-terminal signal peptide and the putative cleavage sites were predicted with SignalP 3.0 . Using LipoP 1.0  the signal peptide sequences were checked for lipoprotein signal peptide signatures that differentiate them from other signal peptides and subsequently cleavage by signal peptidase II from signal peptidase I. SignalP 3.0 and SecretomeP 2.0  were used to determine inner membrane transport of proteins upstream of DUF3233 via Sec dependent or other non classical secretory pathways. STRING (version 8.3)  and DOOR (version 2.0)  were used for gene neighbourhood and operon analysis.
Structure prediction and transmembrane β-barrel analysis
Secondary structure assignments were made using PSIPRED . 3D structure of V. cholerae DUF3233 (RefSeq: NP_232949) was predicted using I-TASSER fold recognition method . The predicted structure was superimposed with the template using TopMatch  and visualised with Rasmol . The WHAT  program was used to predict hydropathy and amphipathicity using sliding windows of 13, 15, 17 residues and an angle of 100° for α-helix and 180° for β-strand. Multiple sequence alignment profile was used to plot the average hydropathy, amphipathicity and similarity plot using the AveHAS  program. Orientation of the domain within the outer membrane was predicted using the Viterbi method implemented in PRED-TMBB . Secondary structure based sequence alignment of DUF3233 family and with the representative autotransporter translocation units was done with inputs from ClustalX , ProbCons , Ali2D  and the alignment was adjusted manually.
Amino acid sequences of translocation unit of autotransporters, and pore forming beta-domain of two-partner secretion (TPS) family members, reported in  and  were used for phylogeny analysis. Three representative DUF3233 sequences from V. cholerae (RefSeq: NP_232949), C. psychrerythraea (RefSeq: YP_269983) and S. loihica (RefSeq: YP_001095898) along with beta-barrel domain sequences of the Va, Vb secretory systems were analysed with MEGA 4 . Pairwise distances were calculated and the phylogenetic tree of the aligned sequences was generated using minimum evolution method. A bootstrap test of phylogeny was performed with p-distance model on the inferred evolutionary tree and a consensus bootstrap tree was generated from 1000 replicates.
Sequence alignment of DUF3233 representative sequences. DUF3233 has an average 23 amino acid N-terminal signal sequence (gray) which guides inner membrane translocation. The signal peptidase I cleavage site is marked by a red arrow. Cps_Colwellia psychrerythraea (YP_269983.1), Vfi_Vibrio fischeri (YP_206466.1), Vpa_Vibrio parahaemolyticus (NP_800436.1), Vvu_Vibrio vulnificus (NP_762291.1), Vch_Vibrio cholerae (NP_232949.1), Vsp_Vibrio splendidus (YP_002395311.1), Sba_Shewanella baltica (YP_001052604.1), Slo_Shewanella loihica (YP_001095898.1), Spi_Shewanella piezotolerans (YP_002312093.1), Fba_Ferrimonas balearica (YP_003912385.1).
List of DUF3233 homologous sequences. Sequence search by HHsenser (permissive) picked gram-negative and cyanobacterial (Cyn) hypothetical proteins that share significant homology with DUF3233. These hypothetical proteins posses sequence characteristics that apparently resemble the autotransporter beta-domain. # Insignificant Pfam A hits, $ Significant Pfam A hits.
Conceived and designed the experiments: SA AP. Analyzed the data: AP YS SS SA. Contributed reagents/materials/analysis tools: AP YS. Wrote the paper: SA AP.
- 1. Faruque SM, Albert MJ, Mekalanos JJ (1998) Epidemiology, genetics, and ecology of toxigenic Vibrio cholerae. Microbiol Mol Biol Rev 62: 1301–14.
- 2. Sack DA, Sack RB, Nair GB, Siddique AK (2004) Cholera. Lancet 363: 223–33.
- 3. Markowitz VM, Korzeniewski F, Palaniappan K, Szeto E, Werner G, et al. (2006) The integrated microbial genomes (IMG) system. Nucleic Acids Res 34: D344–8.
- 4. Bateman A, Birney E, Cerruti L, Durbin R, Etwiller L, et al. (2002) The Pfam protein families database. Nucleic Acids Res 30: 276–80.
- 5. Söding J, Remmert M, Biegert A, Lupas AN (2006) HHsenser: exhaustive transitive profile search using HMM-HMM comparison. Nucleic Acids Res 34: W374–8.
- 6. Zhang Y (2008) I-TASSER server for protein 3D structure prediction. BMC Bioinformatics 9: 40.
- 7. Murzin AG, Brenner SE, Hubbard T, Chothia C (1995) SCOP: a structural classification of proteins database for the investigation of sequences and structures. J Mol Biol 247: 536–40.
- 8. Ou YY, Gromiha MM, Chen SA, Suwa M (2008) TMBETADISC-RBF: Discrimination of beta-barrel membrane proteins using RBF networks and PSSM profiles. Comput Biol Chem 32: 227–31.
- 9. Remmert M, Linke D, Lupas AN, Söding J (2009) HHomp–prediction and classification of outer membrane proteins. Nucleic Acids Res 37: W446–51.
- 10. Bendtsen JD, Nielsen H, von Heijne G, Brunak S (2004) Improved prediction of signal peptides: SignalP 3.0. J Mol Biol 340: 783–95.
- 11. Paetzel M, Karla A, Strynadka NC, Dalbey RE (2002) Signal peptidases. Chem Rev 102: 4549–80.
- 12. Juncker AS, Willenbrock H, Von Heijne G, Brunak S, Nielsen H, et al. (2003) Prediction of lipoprotein signal peptides in Gram-negative bacteria. Protein Sci 12: 1652–62.
- 13. Szklarczyk D, Franceschini A, Kuhn M, Simonovic M, Roth A, et al. (2011) The STRING database in 2011: functional interaction networks of proteins, globally integrated and scored. Nucleic Acids Res 39: D561–8.
- 14. Bendtsen JD, Kiemer L, Fausbøll A, Brunak S (2005) Non-classical protein secretion in bacteria. BMC Microbiol 5: 58.
- 15. McGuffin LJ, Bryson K, Jones DT (2000) The PSIPRED protein structure prediction server. Bioinformatics 16: 404–5.
- 16. Zhai Y, Saier MH Jr (2001) A web-based program for the prediction of average hydropathy, average amphipathicity and average similarity of multiply aligned homologous proteins. J Mol Microbiol Biotechnol 3: 285–6.
- 17. Roggenkamp A, Ackermann N, Jacobi CA, Truelzsch K, Hoffmann H, et al. (2003) Molecular analysis of transport and oligomerization of the Yersinia enterocolitica adhesin YadA. J Bacteriol 185: 3735–44.
- 18. Henderson IR, Navarro-Garcia F, Desvaux M, Fernandez RC, Ala'Aldeen D (2004) Type V protein secretion pathway: the autotransporter story. Microbiol Mol Biol Rev 68: 692–744.
- 19. Yen MR, Peabody CR, Partovi SM, Zhai Y, Tseng YH, et al. (2002) Protein-translocating outer membrane porins of Gram-negative bacteria. Biochim Biophys Acta 1562: 6–31.
- 20. Pohlner J, Halter R, Beyreuther K, Meyer TF (1987) Gene structure and extracellular secretion of Neisseria gonorrhoeae IgA protease. Nature 325: 458–62.
- 21. Henderson IR, Nataro JP (2001) Virulence functions of autotransporter proteins. Infect Immun 69: 1231–43.
- 22. Oomen CJ, van Ulsen P, van Gelder P, Feijen M, Tommassen J, et al. (2004) Structure of the translocator domain of a bacterial autotransporter. EMBO J 23: 1257–66.
- 23. Van den Berg B (2010) Crystal structure of a full-length autotransporter. J Mol Biol 396: 627–33.
- 24. Barnard TJ, Dautin N, Lukacik P, Bernstein HD, Buchanan SK (2007) Autotransporter structure reveals intra-barrel cleavage followed by conformational changes. Nat Struct Mol Biol 14: 1214–20.
- 25. Jacob-Dubuisson F, Locht C, Antoine R (2001) Two-partner secretion in Gram-negative bacteria: a thrifty, specific pathway for large virulence proteins. Mol Microbiol 40: 306–13.
- 26. Surana NK, Buscher AZ, Hardy GG, Grass S, Kehl-Fie T, et al. (2006) Translocator proteins in the two-partner secretion family have multiple domains. J Biol Chem 281: 18051–8.
- 27. Clantin B, Delattre AS, Rucktooa P, Saint N, Méli AC, et al. (2007) Structure of the membrane protein FhaC: a member of the Omp85-TpsB transporter superfamily. Science 317: 957–61.
- 28. Könninger UW, Hobbie S, Benz R, Braun V (1999) The haemolysin-secreting ShlB protein of the outer membrane of Serratia marcescens: determination of surface-exposed residues and formation of ion-permeable pores by ShlB mutants in artificial lipid bilayer membranes. Mol Microbiol 32: 1212–25.
- 29. Hoiczyk E, Roggenkamp A, Reichenbecher M, Lupas A, Heesemann J (2000) Structure and sequence analysis of Yersinia YadA and Moraxella UspAs reveal a novel class of adhesins. EMBO J 19: 5989–99.
- 30. Kim DS, Chao Y, Saier MH Jr (2006) Protein-translocating trimeric autotransporters of gram-negative bacteria. J Bacteriol 188: 5655–67.
- 31. Cotter SE, Surana NK, St Geme JW 3rd (2005) Trimeric autotransporters: a distinct subfamily of autotransporter proteins. Trends Microbiol 13: 199–205.
- 32. Meng G, Surana NK, St Geme JW 3rd, Waksman G (2006) Structure of the outer membrane translocator domain of the Haemophilus influenzae Hia trimeric autotransporter. EMBO J 25: 2297–304.
- 33. Larocque RC, Harris JB, Dziejman M, Li X, Khan AI, et al. (2005) Transcriptional profiling of Vibrio cholerae recovered directly from patient specimens during early and late stages of human infection. Infect Immun 73: 4488–93.
- 34. Nakhamchik A, Wilde C, Rowe-Magnus DA (2008) Cyclic-di-GMP regulates extracellular polysaccharide production, biofilm formation, and rugose colony development by Vibrio vulnificus. Appl Environ Microbiol 74: 4199–209.
- 35. Beyhan S, Tischler AD, Camilli A, Yildiz FH (2006) Transcriptome and phenotypic responses of Vibrio cholerae to increased cyclic di-GMP level. J Bacteriol 188: 3600–13.
- 36. Barnes IH, Bagnall MC, Browning DD, Thompson SA, Manning G, et al. (2007) Gamma-glutamyl transpeptidase has a role in the persistent colonization of the avian gut by Campylobacter jejuni. Microb Pathog 43: 198–207.
- 37. Chevalier C, Thiberge JM, Ferrero RL, Labigne A (1999) Essential role of Helicobacter pylori gamma-glutamyltranspeptidase for the colonization of the gastric mucosa of mice. Mol Microbiol 31: 1359–72.
- 38. Ryjenkov DA, Tarutina M, Moskvin OV, Gomelsky M (2005) Cyclic diguanylate is a ubiquitous signalling molecule in bacteria: insights into biochemistry of the GGDEF protein domain. J Bacteriol 187: 1792–8.
- 39. Heidelberg JF, Eisen JA, Nelson WC, Clayton RA, Gwinn ML, et al. (2000) DNA sequence of both chromosomes of the cholera pathogen Vibrio cholerae. Nature 406: 477–83.
- 40. Henderson IR, Navarro-Garcia F, Nataro JP (1998) The great escape: structure and function of the autotransporter proteins. Trends Microbiol 6: 370–8.
- 41. Hendrixson DR, de la Morena ML, Stathopoulos C, St Geme JW 3rd (1997) Structural determinants of processing and secretion of the Haemophilus influenzae hap protein. Mol Microbiol 26: 505–18.
- 42. Altschul SF, Madden TL, Schäffer AA, Zhang J, Zhang Z, et al. (1997) Gapped BLAST and PSI-BLAST: a new generation of protein database search programs. Nucleic Acids Res 25: 3389–402.
- 43. Mao F, Dam P, Chou J, Olman V, Xu Y (2009) DOOR: a database for prokaryotic operons. Nucleic Acids Res 37: D459–63.
- 44. Zhang Y (2007) Template-based modeling and free modeling by I-TASSER in CASP7. Proteins 69: 108–17.
- 45. Sippl MJ, Wiederstein M (2008) A note on difficult structure alignment problems. Bioinformatics 24: 426–7.
- 46. Sayle RA, Milner-White EJ (1995) RASMOL: biomolecular graphics for all. Trends Biochem Sci 20: 374.
- 47. Zhai Y, Saier MH Jr (2001) A web-based program (WHAT) for the simultaneous prediction of hydropathy, amphipathicity, secondary structure and transmembrane topology for a single protein sequence. J Mol Microbiol Biotechnol 3: 501–2.
- 48. Bagos PG, Liakopoulos TD, Spyropoulos IC, Hamodrakas SJ (2004) PRED-TMBB: a web server for predicting the topology of beta-barrel outer membrane proteins. Nucleic Acids Res 32: W400–4.
- 49. Jeanmougin F, Thompson JD, Gouy M, Higgins DG, Gibson TJ (1998) Multiple sequence alignment with Clustal X. Trends Biochem Sci 23: 403–5.
- 50. Do CB, Mahabhashyam MS, Brudno M, Batzoglou S (2005) ProbCons: Probabilistic consistency-based multiple sequence alignment. Genome Res 15: 330–40.
- 51. Biegert A, Mayer C, Remmert M, Söding J, Lupas AN (2006) The MPI Bioinformatics Toolkit for protein sequence analysis. Nucleic Acids Res 34: W335–9.
- 52. Tamura K, Dudley J, Nei M, Kumar S (2007) MEGA4: Molecular Evolutionary Genetics Analysis (MEGA) software version 4.0. Mol Biol Evol 24: 1596–9.