Advertisement
Browse Subject Areas
?

Click through the PLOS taxonomy to find articles in your field.

For more information about PLOS Subject Areas, click here.

  • Loading metrics

Protein Domain of Unknown Function 3233 is a Translocation Domain of Autotransporter Secretory Mechanism in Gamma proteobacteria

Protein Domain of Unknown Function 3233 is a Translocation Domain of Autotransporter Secretory Mechanism in Gamma proteobacteria

  • Ananth Prakash, 
  • S. Yogeeshwari, 
  • Sanchari Sircar, 
  • Shipra Agrawal
PLOS
x

Abstract

Vibrio cholerae, the enteropathogenic gram negative bacteria is one of the main causative agents of waterborne diseases like cholera. About 1/3rd of the organism's genome is uncharacterised with many protein coding genes lacking structure and functional information. These proteins form significant fraction of the genome and are crucial in understanding the organism's complete functional makeup. In this study we report the general structure and function of a family of hypothetical proteins, Domain of Unknown Function 3233 (DUF3233), which are conserved across gram negative gammaproteobacteria (especially in Vibrio sp. and similar bacteria). Profile and HMM based sequence search methods were used to screen homologues of DUF3233. The I-TASSER fold recognition method was used to build a three dimensional structural model of the domain. The structure resembles the transmembrane beta-barrel with an axial N-terminal helix and twelve antiparallel beta-strands. Using a combination of amphipathy and discrimination analysis we analysed the potential transmembrane beta-barrel forming properties of DUF3233. Sequence, structure and phylogenetic analysis of DUF3233 indicates that this gram negative bacterial hypothetical protein resembles the beta-barrel translocation unit of autotransporter Va secretory mechanism with a gene organisation that differs from the conventional Va system.

Introduction

Domain of Unknown Function (DUF) 3233 (PFAM: PF11557) is a family of uncharacterised hypothetical proteins conserved among gram negative gammaproteobacteria. Representative members of this domain include marine bacteria from genus Vibrio, Shewanella, Colwellia and Alcanivorax of which Vibrio cholerae, Vibrio parahaemolyticus, Vibrio splendidus and Vibrio vulnificus are pathogenic to human and aquatic life. Vibrio cholerae causes seasonal outbreaks of cholera of epidemic proportions in developing countries with high mortality rates [1]. The enterotoxins produced by the bacteria after colonising the host small intestine disrupts the ion transport by the intestinal epithelial cells causing outflow of large volumes of fluids into the intestine leading to watery diarrhoea, dehydration and in severe cases, death [1],[2].

Significant fraction of genomes of Vibrio species lack structure function annotation and most of these gene products are classified as hypothetical proteins or domains of unknown function [3]. The PFAM [4] database in its 24th release lists about 3000 DUF families. Many of these DUF families are kingdom specific (DUF2883, DUF3328, DUF3329), limited/shared between kingdoms (DUF1497, DUF3609) or restricted/specific to certain organisms (DUF1196, DUF2667). The specific and ubiquitous nature of these domains suggests their functional importance in organism specific niches or a common biological role.

Identifying homologous protein families through sequence based search marks the first step in the annotation of DUFs, providing an initial broad picture of the protein's probable family and function. Sequence homology search becomes increasingly powerful when we advance from normal sequence-sequence based searches to methods that uses profile or HMM information like HHsenser [5], which increases the efficiency of finding remote homologues. In silico structure prediction methods together with sequence similarity detection methods assist the annotation of fold-function space. Fold recognition methods like I-TASSER [6] help predict the 3 dimensional (3D) structure and functions of proteins that share low sequence identity with other known structures.

In this study we analyse the sequence and structural characteristics of DUF3233 using computational approaches and try to infer various properties of this domain. Sequence search by HHsenser identifies similarity with the beta-barrel translocation unit of autotransporter Va secretory proteins. Sequence homology combined with secondary structure prediction indicates a beta-barrel domain of 12 beta-strands. The predicted 3D model from I-TASSER shows the structure with an overall beta-barrel topology with an N terminal helix running along the central barrel axis perpendicular to the 12 antiparallel strands that form the barrel. Amphipathicity and membrane barrel discrimination analysis suggest the domain is a potential outer membrane gram negative beta-barrel protein.

Autotransporter translocation units belong to the transmembrane beta-barrel fold in SCOP database [7], defined by a beta-barrel of 12 to 14 antiparallel strands with an N terminal helix perpendicular to the barrel. Finally with the analysis of genomic context of DUF3233 we could infer that this outer membrane beta-barrel translocation domain has a gene organisation that is not typical of the autotransporter Va secretory mechanism.

Results

Sequence based characterization of DUF3233 as autotransporter β-domain protein

Sequence search for homologues with PSI-BLAST using a representative query, Vibrio cholerae DUF3233 (RefSeq: NP_232949) against the NCBI nr database with a threshold 0.005, reached convergence at the 4th iteration. The resulting sequences identified were hypothetical proteins conserved among gram negative proteobacteria. For improved search and better coverage of homologous sequence space, information from aligned regions of DUF3233 sequences in the form of a multiple sequence alignment profile was queried with HHsenser. From the resulting sequences in the permissive alignment list we were able to infer homology between DUF3233 and the outer membrane beta-barrel translocation domain of autotransporter proteins. DUF3233 shares sequence similarity with outer membrane beta-barrel domain of Ochrobactrum intermedium autotransporter (e-value 1E-34, 95% coverage, 22% identity), Rhizobium leguminosarum adhesin autotransporter (e-value 2E-29, 94% coverage, 18% identity) and Yersinia aldovae AidA adhesin autotransporter (e-value 2E-26, 89% coverage, 18% identity). Interestingly, a number of gram negative hypothetical proteins were picked up as potential homologues, which showed fair amount of similarity to the autotransporter beta-domain (Table S1).

Position specific scoring matrix (PSSM) profile based discrimination analysis using TMBETADISC-RBF [8], predicts the outer membrane beta-barrel nature of DUF3233. To confirm homology with autotransporter protein family we queried DUF3233 sequences against putative outer membrane proteins (OMPs). A pairwise hidden Markov model (HMM) search by HHomp [9] identifies DUF3233 as an OMP. V. cholerae DUF3233 shares homology with HHomp cluster 12.1.6 (96% probability). This cluster comprises profile HMMs of autotransporter sequences whose 12 stranded beta-barrel transmembrane domains conform to the translocation unit of autotransporter NalP [9]. DUF3233 sequences from Colwellia (94% probability), Shewanella (96% probability) and Ferrimonas (95% probability) were all found to share homology with autotransporter protein family.

DUF3233 is a solitary outer membrane autotransporter β-barrel domain

Proteins targeted for transport across membranes posses leader sequence or signal peptide at their N-terminus, which directs translocation. We analysed DUF3233 sequences using a combination of artificial neural networks and HMMs implemented in SignalP [10] to predict the presence and location of signal peptide cleavage sites. SignalP identified the presence of N-terminal signal peptide of an average length of 23 amino acid residues having a positively charged amino terminal followed by a hydrophobic region and hydrophilic carboxy terminal. Signal peptides are cleaved from the exported protein by specific proteases called signal peptidases (SPases) [11]. Prediction of cleavage mechanism of these signal sequences by LipoP [12] identifies SPase 1 target site, indicating DUF3233 might be a non-lipoprotein.

We browsed DUF3233 genomic region of all representative organisms with STRING [13] to look for possible gene fusion events with other domains and found no such occurrence. DUF3233 is a single domain protein found on the small chromosome 2 in Vibrio species with an upstream gamma-glutamyltranspeptidase (GGT) or response regulatory protein transcribed in one potential operon (Table 1). These upstream proteins lack the N-terminal signal sequence for inner membrane transport and analysis through SecretomeP [14] indicates that these proteins are not exported through non-classical secretory system. Gene organisation of DUF3233 therefore suggests a solitary translocation unit with an absent upstream secretory protein.

thumbnail
Table 1. Genomic context of DUF3233: Proteins upstream of DUF3233 in various representative organisms are organised in operons and have a probable pathogenic role.

https://doi.org/10.1371/journal.pone.0025570.t001

Structure based validation of DUF3233 as transmembrane β-barrel domain of autotransporter proteins

DUF3233 sequence based PSI-BLAST search for proteins with known structures (72,386 structures in PDB as of April 2011) fetched results with a maximum alignment length covering 51 residues. Secondary structure assignment by PSIPRED [15] predicts an N-terminal α-helix (αN) followed by 12 consecutive β-strands (β1–β12) interspersed by two short turns of α-helices α1 and α2 predicted to occur between β1–β2 and between β5–β6 respectively. With no suitable template with significant sequence homology available, we used the fold recognition algorithm implemented in I-TASSER to predict a 3D model of DUF3233. The translocation unit of NalP from N. meningitidis (PDB: 1UYN_X, 15% identity, 85% coverage, normalised Z-score above 1) was identified by I-TASSER in the top four threading templates to model V. cholerae DUF3233. The predicted structure of V. cholerae DUF3233 (Figure 1) resembles the beta-barrel translocation unit of autotransporter proteins, aligning over 75% structurally equivalent positions with the template and an RMSD of 2.3. The domain has an N-terminal α-helix running along the central axis surrounded by beta-barrel formed by twelve anti-parallel beta-strands. Predicted strand assembly within the outer membrane shows the carboxy and amino terminal of the beta-barrel point towards the periplasmic space, the central helix is oriented such that its N-terminal is pointed towards the external environment. Secondary structure based sequence alignment of DUF3233 with the translocation unit of autotransporters shows a similar domain organisation (Figure 2). Using alignment of DUF3233 sequences the average hydropathy, amphipathicity and similarity plots was generated with AveHAS [16]. Figure 3, shows 12 hydrophobicity and amphipathicity peaks with an average stretch of 10 to 15 residues per peak that may form transmembrane beta-strands.

thumbnail
Figure 1. Predicted 3D model of DUF3233.

A) The predicted structure resembles the beta-barrel translocation unit of autotransporter proteins. The 12 beta-strands (blue) orient in anti-parallel fashion enclosing the N-terminal helix (red), the two additional helices α1 and α2 are shown in green. B) Superimposed structures of DUF3233 (magenta) and NalP translocation unit of N. meningitidis 1uyn_x (blue). Figures were generated with Rasmol.

https://doi.org/10.1371/journal.pone.0025570.g001

thumbnail
Figure 2. Sequence alignment of DUF3233 and translocation units of select autotransporters.

Secondary structure diagram for NalP is shown at the top. The two extra helices (α1 and α2) present among DUF3233 proteins are shown in green. DUF3233 and autotransporter sequences representing each cluster are used in the alignment. Nme_Neisseria meningitidis, Pae_Pseudomonas aeruginosa, Eco_Escherichia coli, Sen_Salmonella enterica, Bpe_Bordetella pertussis, Ctr_Chlamydia trachomatis, Hin_Haemophilus influenzae, Rri_Rickettsia rickettsii, Mca_Moraxella catarrhalis, Vch_Vibrio cholerae, Cps_Colwellia psychrerythraea, Slo_Shewanella loihica.

https://doi.org/10.1371/journal.pone.0025570.g002

thumbnail
Figure 3. Average hydropathy, amphipathicity and similarity plot of 18 DUF3233 representative sequences.

The plot shows 12 hydrophobic (green) and amphipathic (red) peaks corresponding to possible transmembrane beta-strand forming segments The plot was generated using AveHAS program.

https://doi.org/10.1371/journal.pone.0025570.g003

DUF3233 is evolutionarily linked to the autotransporters

To infer evolutionary relation with type V secretory proteins, we analysed DUF3233 representatives with members of both Va and Vb family (Table S2). The third type of proteins found in the type V secretory system, type Vc or AT-2 proteins, which are characterised by trimeric C-terminal beta-barrel [17] were not considered for phylogenetic analysis. The inferred phylogenetic tree (Figure 4) classifies members of the two families into two separate clans. Proteins are grouped into clusters with similar function, architecture and organism type as analysed in [18] and [19]. DUF3233 family sequences though related to autotransporters form a distinct group from the main autotransporter clan indicating that these domains represent new cluster of autotransporters.

thumbnail
Figure 4. Bootstrap phylogenetic tree of DUF3233 with Va and Vb family proteins.

51 Va family (red), 25 Vb family (blue) translocation unit and 3 DUF3233 (green) representative sequences were used to generate a 1000 replicate bootstrap phylogenetic tree. Eukaryotic Vb sequences representing clusters 1 and 2 were not considered in the analysis. Sequences of each family are grouped into previously reported clusters (C 1 to 11 and C 1 to 4); DUF3233 sequences form a new cluster of the autotransporter family. Numbers at the node indicate neighbour-joining bootstrap percentages. Phylogenetic analyses were conducted in MEGA4.

https://doi.org/10.1371/journal.pone.0025570.g004

Discussion

A vital part in the survival and adaptation mechanism of bacteria lies in the constant interaction with their extra-cellular environment. Bacteria secrete a wide range of molecules into the extra-cellular milieu that includes enzymes, which break down carbohydrates, proteins and lipids, and virulence factors such as adhesins and toxins by those involved in pathogenesis. Transport of these molecules is mediated by protein complexes through conserved secretory pathways. Of the 6 types of secretory mechanisms known in gram-negative bacteria (type I to type VI), type V represents the simplest transport system. Proteins of the type V secretory system fall under the autotransporter (Va), two partner secretion (Vb) and the AT-2 (Vc) families, and share a similar domain organisation: an N-terminal signal peptide for inner membrane translocation followed by a passenger protein which is normally a virulence determinant and a C-terminal translocation unit for transporting the upstream passenger protein [18].

Proteins of the autotransporter (Va) family were the first to be described [20] and form the largest representation of this system [19]. Autotransporters export a wide range of toxins and enzymes [21] to the cell surface or secrete them into the external environment. The passenger domain and translocation unit of autotransporters are both expressed as a single polypeptide [20] making the translocation unit highly specific and committed for transporting only the upstream passenger. Solved experimental structures of the autotransporter translocation unit [22][24] show that they all possess a similar structure, a beta-barrel of 12 antiparallel strands with a central N-terminal helix running along the barrel axis. Proteins of the two partner secretion (Vb) are widely distributed and follow a similar mode of function, transporting cytolysins, adhesions and metalloproteases [19]. The secreted exoprotein and the transporter unlike the Va proteins are not linked but, are expressed as two separate proteins transcribed in a single operon [25]. Vb transporters are predicted to have a multidomain architecture [26] and a relatively wider barrel made of 16 [27] to 20 [28] beta-strands. The newly discovered AT-2 family (Vc) [29] represents proteins secreted via a homotrimeric mechanism [30]. Proteins secreted through this system are mainly implicated in virulence [31]. With a domain organisation similar to that of autotransporters, the system functions with the coming together of three individual proteins each complete with an N-terminal signal peptide, a passenger unit and four beta-strand domain at the C-terminal which makes a complete closed 12 stranded beta-barrel translocation unit upon trimerisation [31],[32].

The present work describes sequence and structure based characterization of proteobacteria DUF3233 as a beta-barrel transmembrane domain of autotransporter proteins. DUF3233 packs an average 312 amino acid residues (including N-terminal signal peptide) and is currently classified as a domain of unknown function.

DUF3233 is encoded as a single domain protein, homologous to the translocation unit of autotransporters. One aspect of DUF3233 that distinguishes it from other main class autotransporters is that it lacks a covalently linked N-terminal passenger domain, to which C-terminal translocation units of all autotransporters are committed to transport. Few autotransporter representatives of two-polypeptide architecture [19] might suggest the secretion of co-transcribed upstream proteins similar to the TPS system, but considering the cytosolic nature of upstream proteins, extracellular translocation seems unlikely.

Few representative members from the Vibrio genus express DUF3233 and upstream putative GGT or response regulatory proteins in a single operon (Table 1). Over expression of GGT [33] and GGDEF domain proteins [34],[35] are implicated in pathogenesis. The prokaryotic GGT is shown to be a major factor in the colonisation of gut and gastric mucosa [36],[37]. The upstream response regulators are two-domain proteins with an N-terminal CheY-like regulatory and a conserved C-terminal GGDEF effector domain, which is responsible for eliciting pathogenic response through cyclic di-GMP mediated exopolysaccharide synthesis and biofilm formation [38]. Genes encoding virulence products in V. cholerae are organised in clusters or operons [39], and since gene encoding DUF3233 is located among virulent genes, the possibility of the involvement of DUF3233 in pathogenesis cannot be overlooked.

The translocation units of autotransporters exhibit conserved amino acid consensus motif at their carboxy terminus, the barrel closing beta-strand displays alternate arrangement of polar and hydrophobic residues terminating with a conserved aromatic amino acid at the barrel terminus which is usually a phenylalanine or a tryptophan [40]. Hendrixson et al., [41] demonstrated the importance of C-terminal consensus motif on the viability of H. influenzae Hap translocation unit. Deletion of terminal 12 residues proved detrimental to the outer membrane localisation; while the stability and/or outer membrane localisation of the translocation unit was affected with the deletion of all three terminal residues, point mutations of these residues showed no effect on the outer membrane localisation or secretion of the mature protein [41]. DUF3233 displays consensus pattern at its C-terminal that resembles the conserved motif found among autotransporters discussed above (Figure S1). A stretch of alternating polar and hydrophobic residues precedes the terminal beta-strand having a hydrophobic segment and a conserved ‘terminal’ phenylalanine or a tyrosine residue. Interestingly the extreme carboxy terminus harbours three conserved polar residues [N/D][Q/E][D/E] after the ‘terminal’ F/Y. Secondary structure and predicted models of DUF3233 show the hydrophilic residues of the “polar tail” are not part of the terminal beta-sheet, but instead form a short overhang pointed towards the periplasm. As yet, we do not know the significance and possible role of these tail polar residues on the outer membrane localisation and stability.

DUF3233 exhibits certain features that are in common with the translocation units of type Va secretory proteins and yet possesses characteristics that are not typical to the proteins of the above system. DUF3233 represents a translocation unit that is devoid of a secretable passenger unit. Considering its location in the representative genomes alongside other virulence genes, we hypothesize that this domain is involved in pathogenesis. However, the mechanism apparently looks new and different than a typical type Va secretion system.

Our study on the proteobacterial protein DUF3233 with a combination of methods like sequence similarity searches, outer membrane beta-barrel discrimination, phylogenetic analysis, and fold recognition has led us to a consensus at annotating fold and function to this domain.

Sequence similarity search suggests that DUF3233 has remote homology with the translocation unit of autotransporter proteins of the type V secretory system. The domain's outer membrane beta-barrel nature was further emphasised by signal peptide, outer membrane beta-barrel discrimination and amphipathicity analysis. Secondary structure prediction and alignment with translocation units, and inputs from the predicted model suggests that DUF3233 and the translocation unit of autotransporter proteins share a similar domain organisation.

Drawing a consensus from various in silico prediction methods it appears that DUF3233 is a cluster of remote homologues of autotransporters. This is the first report of an autotransporter like protein family in Vibrio species, and though within the realm of bioinformatics we were able to infer its probable family and fold, the pathogenic mechanism still remains to be explored and seeks further experimental studies.

Materials and Methods

Sequence similarity search

20 DUF3233 genes from NCBI comprising one sequence each from Aliivibrio, Colwellia and Ferrimonas, five sequences from Shewanella and twelve from Vibrio species were fetched using their corresponding RefSeq ids. YP_001366070 and YP_001555810 were excluded because of their short domain size and a total of 18 DUF3233 sequences were included in this study. The signal peptides of DUF3233 sequences have not been included in various analyses of this study unless mentioned. PSI-BLAST [42] profile-sequence search was used to probe homologous protein families against NCBI nonredundant (nr) database with default parameters. ClustalW multiple sequence alignment profile of DUF3233 protein sequences was taken in as input for HHsenser to search the nr database with a threshold e-value of 0.001 and default parameters for improved homolog coverage. TMBETADISC-RBF [8], PSSM profile based discrimination of beta-barrel OMPs from other folding types like globular and membrane proteins was used to assess the outer membrane nature of DUF3233. DUF3233 sequences were queried against HHomp database [9] to detect homology with other known OMPs.

Signal peptide and Genomic context analysis

The presence of N-terminal signal peptide and the putative cleavage sites were predicted with SignalP 3.0 [10]. Using LipoP 1.0 [12] the signal peptide sequences were checked for lipoprotein signal peptide signatures that differentiate them from other signal peptides and subsequently cleavage by signal peptidase II from signal peptidase I. SignalP 3.0 and SecretomeP 2.0 [14] were used to determine inner membrane transport of proteins upstream of DUF3233 via Sec dependent or other non classical secretory pathways. STRING (version 8.3) [13] and DOOR (version 2.0) [43] were used for gene neighbourhood and operon analysis.

Structure prediction and transmembrane β-barrel analysis

Secondary structure assignments were made using PSIPRED [15]. 3D structure of V. cholerae DUF3233 (RefSeq: NP_232949) was predicted using I-TASSER fold recognition method [44]. The predicted structure was superimposed with the template using TopMatch [45] and visualised with Rasmol [46]. The WHAT [47] program was used to predict hydropathy and amphipathicity using sliding windows of 13, 15, 17 residues and an angle of 100° for α-helix and 180° for β-strand. Multiple sequence alignment profile was used to plot the average hydropathy, amphipathicity and similarity plot using the AveHAS [16] program. Orientation of the domain within the outer membrane was predicted using the Viterbi method implemented in PRED-TMBB [48]. Secondary structure based sequence alignment of DUF3233 family and with the representative autotransporter translocation units was done with inputs from ClustalX [49], ProbCons [50], Ali2D [51] and the alignment was adjusted manually.

Phylogenetic analysis

Amino acid sequences of translocation unit of autotransporters, and pore forming beta-domain of two-partner secretion (TPS) family members, reported in [18] and [19] were used for phylogeny analysis. Three representative DUF3233 sequences from V. cholerae (RefSeq: NP_232949), C. psychrerythraea (RefSeq: YP_269983) and S. loihica (RefSeq: YP_001095898) along with beta-barrel domain sequences of the Va, Vb secretory systems were analysed with MEGA 4 [52]. Pairwise distances were calculated and the phylogenetic tree of the aligned sequences was generated using minimum evolution method. A bootstrap test of phylogeny was performed with p-distance model on the inferred evolutionary tree and a consensus bootstrap tree was generated from 1000 replicates.

Supporting Information

Figure S1.

Sequence alignment of DUF3233 representative sequences. DUF3233 has an average 23 amino acid N-terminal signal sequence (gray) which guides inner membrane translocation. The signal peptidase I cleavage site is marked by a red arrow. Cps_Colwellia psychrerythraea (YP_269983.1), Vfi_Vibrio fischeri (YP_206466.1), Vpa_Vibrio parahaemolyticus (NP_800436.1), Vvu_Vibrio vulnificus (NP_762291.1), Vch_Vibrio cholerae (NP_232949.1), Vsp_Vibrio splendidus (YP_002395311.1), Sba_Shewanella baltica (YP_001052604.1), Slo_Shewanella loihica (YP_001095898.1), Spi_Shewanella piezotolerans (YP_002312093.1), Fba_Ferrimonas balearica (YP_003912385.1).

https://doi.org/10.1371/journal.pone.0025570.s001

(TIF)

Table S1.

List of DUF3233 homologous sequences. Sequence search by HHsenser (permissive) picked gram-negative and cyanobacterial (Cyn) hypothetical proteins that share significant homology with DUF3233. These hypothetical proteins posses sequence characteristics that apparently resemble the autotransporter beta-domain. # Insignificant Pfam A hits, $ Significant Pfam A hits.

https://doi.org/10.1371/journal.pone.0025570.s002

(DOC)

Table S2.

List of representative sequences used for phylogeny analysis. C-terminal translocation unit sequences of Autotransporter (Va) [18], Two partner secretion (Vb) [19] proteins and DUF3233 representatives used for phylogenetic analysis.

https://doi.org/10.1371/journal.pone.0025570.s003

(DOC)

Acknowledgments

We thank Mr. G. Goldsmith, IBAB for his engaging and fruitful discussions.

Author Contributions

Conceived and designed the experiments: SA AP. Analyzed the data: AP YS SS SA. Contributed reagents/materials/analysis tools: AP YS. Wrote the paper: SA AP.

References

  1. 1. Faruque SM, Albert MJ, Mekalanos JJ (1998) Epidemiology, genetics, and ecology of toxigenic Vibrio cholerae. Microbiol Mol Biol Rev 62: 1301–14.SM FaruqueMJ AlbertJJ Mekalanos1998Epidemiology, genetics, and ecology of toxigenic Vibrio cholerae.Microbiol Mol Biol Rev62130114
  2. 2. Sack DA, Sack RB, Nair GB, Siddique AK (2004) Cholera. Lancet 363: 223–33.DA SackRB SackGB NairAK Siddique2004Cholera.Lancet36322333
  3. 3. Markowitz VM, Korzeniewski F, Palaniappan K, Szeto E, Werner G, et al. (2006) The integrated microbial genomes (IMG) system. Nucleic Acids Res 34: D344–8.VM MarkowitzF. KorzeniewskiK. PalaniappanE. SzetoG. Werner2006The integrated microbial genomes (IMG) system.Nucleic Acids Res34D3448
  4. 4. Bateman A, Birney E, Cerruti L, Durbin R, Etwiller L, et al. (2002) The Pfam protein families database. Nucleic Acids Res 30: 276–80.A. BatemanE. BirneyL. CerrutiR. DurbinL. Etwiller2002The Pfam protein families database.Nucleic Acids Res3027680
  5. 5. Söding J, Remmert M, Biegert A, Lupas AN (2006) HHsenser: exhaustive transitive profile search using HMM-HMM comparison. Nucleic Acids Res 34: W374–8.J. SödingM. RemmertA. BiegertAN Lupas2006HHsenser: exhaustive transitive profile search using HMM-HMM comparison.Nucleic Acids Res34W3748
  6. 6. Zhang Y (2008) I-TASSER server for protein 3D structure prediction. BMC Bioinformatics 9: 40.Y. Zhang2008I-TASSER server for protein 3D structure prediction.BMC Bioinformatics940
  7. 7. Murzin AG, Brenner SE, Hubbard T, Chothia C (1995) SCOP: a structural classification of proteins database for the investigation of sequences and structures. J Mol Biol 247: 536–40.AG MurzinSE BrennerT. HubbardC. Chothia1995SCOP: a structural classification of proteins database for the investigation of sequences and structures.J Mol Biol24753640
  8. 8. Ou YY, Gromiha MM, Chen SA, Suwa M (2008) TMBETADISC-RBF: Discrimination of beta-barrel membrane proteins using RBF networks and PSSM profiles. Comput Biol Chem 32: 227–31.YY OuMM GromihaSA ChenM. Suwa2008TMBETADISC-RBF: Discrimination of beta-barrel membrane proteins using RBF networks and PSSM profiles.Comput Biol Chem3222731
  9. 9. Remmert M, Linke D, Lupas AN, Söding J (2009) HHomp–prediction and classification of outer membrane proteins. Nucleic Acids Res 37: W446–51.M. RemmertD. LinkeAN LupasJ. Söding2009HHomp–prediction and classification of outer membrane proteins.Nucleic Acids Res37W44651
  10. 10. Bendtsen JD, Nielsen H, von Heijne G, Brunak S (2004) Improved prediction of signal peptides: SignalP 3.0. J Mol Biol 340: 783–95.JD BendtsenH. NielsenG. von HeijneS. Brunak2004Improved prediction of signal peptides: SignalP 3.0.J Mol Biol34078395
  11. 11. Paetzel M, Karla A, Strynadka NC, Dalbey RE (2002) Signal peptidases. Chem Rev 102: 4549–80.M. PaetzelA. KarlaNC StrynadkaRE Dalbey2002Signal peptidases.Chem Rev102454980
  12. 12. Juncker AS, Willenbrock H, Von Heijne G, Brunak S, Nielsen H, et al. (2003) Prediction of lipoprotein signal peptides in Gram-negative bacteria. Protein Sci 12: 1652–62.AS JunckerH. WillenbrockG. Von HeijneS. BrunakH. Nielsen2003Prediction of lipoprotein signal peptides in Gram-negative bacteria.Protein Sci12165262
  13. 13. Szklarczyk D, Franceschini A, Kuhn M, Simonovic M, Roth A, et al. (2011) The STRING database in 2011: functional interaction networks of proteins, globally integrated and scored. Nucleic Acids Res 39: D561–8.D. SzklarczykA. FranceschiniM. KuhnM. SimonovicA. Roth2011The STRING database in 2011: functional interaction networks of proteins, globally integrated and scored.Nucleic Acids Res39D5618
  14. 14. Bendtsen JD, Kiemer L, Fausbøll A, Brunak S (2005) Non-classical protein secretion in bacteria. BMC Microbiol 5: 58.JD BendtsenL. KiemerA. FausbøllS. Brunak2005Non-classical protein secretion in bacteria.BMC Microbiol558
  15. 15. McGuffin LJ, Bryson K, Jones DT (2000) The PSIPRED protein structure prediction server. Bioinformatics 16: 404–5.LJ McGuffinK. BrysonDT Jones2000The PSIPRED protein structure prediction server.Bioinformatics164045
  16. 16. Zhai Y, Saier MH Jr (2001) A web-based program for the prediction of average hydropathy, average amphipathicity and average similarity of multiply aligned homologous proteins. J Mol Microbiol Biotechnol 3: 285–6.Y. ZhaiMH Saier Jr2001A web-based program for the prediction of average hydropathy, average amphipathicity and average similarity of multiply aligned homologous proteins.J Mol Microbiol Biotechnol32856
  17. 17. Roggenkamp A, Ackermann N, Jacobi CA, Truelzsch K, Hoffmann H, et al. (2003) Molecular analysis of transport and oligomerization of the Yersinia enterocolitica adhesin YadA. J Bacteriol 185: 3735–44.A. RoggenkampN. AckermannCA JacobiK. TruelzschH. Hoffmann2003Molecular analysis of transport and oligomerization of the Yersinia enterocolitica adhesin YadA.J Bacteriol185373544
  18. 18. Henderson IR, Navarro-Garcia F, Desvaux M, Fernandez RC, Ala'Aldeen D (2004) Type V protein secretion pathway: the autotransporter story. Microbiol Mol Biol Rev 68: 692–744.IR HendersonF. Navarro-GarciaM. DesvauxRC FernandezD. Ala'Aldeen2004Type V protein secretion pathway: the autotransporter story.Microbiol Mol Biol Rev68692744
  19. 19. Yen MR, Peabody CR, Partovi SM, Zhai Y, Tseng YH, et al. (2002) Protein-translocating outer membrane porins of Gram-negative bacteria. Biochim Biophys Acta 1562: 6–31.MR YenCR PeabodySM PartoviY. ZhaiYH Tseng2002Protein-translocating outer membrane porins of Gram-negative bacteria.Biochim Biophys Acta1562631
  20. 20. Pohlner J, Halter R, Beyreuther K, Meyer TF (1987) Gene structure and extracellular secretion of Neisseria gonorrhoeae IgA protease. Nature 325: 458–62.J. PohlnerR. HalterK. BeyreutherTF Meyer1987Gene structure and extracellular secretion of Neisseria gonorrhoeae IgA protease.Nature32545862
  21. 21. Henderson IR, Nataro JP (2001) Virulence functions of autotransporter proteins. Infect Immun 69: 1231–43.IR HendersonJP Nataro2001Virulence functions of autotransporter proteins.Infect Immun69123143
  22. 22. Oomen CJ, van Ulsen P, van Gelder P, Feijen M, Tommassen J, et al. (2004) Structure of the translocator domain of a bacterial autotransporter. EMBO J 23: 1257–66.CJ OomenP. van UlsenP. van GelderM. FeijenJ. Tommassen2004Structure of the translocator domain of a bacterial autotransporter.EMBO J23125766
  23. 23. Van den Berg B (2010) Crystal structure of a full-length autotransporter. J Mol Biol 396: 627–33.B. Van den Berg2010Crystal structure of a full-length autotransporter.J Mol Biol39662733
  24. 24. Barnard TJ, Dautin N, Lukacik P, Bernstein HD, Buchanan SK (2007) Autotransporter structure reveals intra-barrel cleavage followed by conformational changes. Nat Struct Mol Biol 14: 1214–20.TJ BarnardN. DautinP. LukacikHD BernsteinSK Buchanan2007Autotransporter structure reveals intra-barrel cleavage followed by conformational changes.Nat Struct Mol Biol14121420
  25. 25. Jacob-Dubuisson F, Locht C, Antoine R (2001) Two-partner secretion in Gram-negative bacteria: a thrifty, specific pathway for large virulence proteins. Mol Microbiol 40: 306–13.F. Jacob-DubuissonC. LochtR. Antoine2001Two-partner secretion in Gram-negative bacteria: a thrifty, specific pathway for large virulence proteins.Mol Microbiol4030613
  26. 26. Surana NK, Buscher AZ, Hardy GG, Grass S, Kehl-Fie T, et al. (2006) Translocator proteins in the two-partner secretion family have multiple domains. J Biol Chem 281: 18051–8.NK SuranaAZ BuscherGG HardyS. GrassT. Kehl-Fie2006Translocator proteins in the two-partner secretion family have multiple domains.J Biol Chem281180518
  27. 27. Clantin B, Delattre AS, Rucktooa P, Saint N, Méli AC, et al. (2007) Structure of the membrane protein FhaC: a member of the Omp85-TpsB transporter superfamily. Science 317: 957–61.B. ClantinAS DelattreP. RucktooaN. SaintAC Méli2007Structure of the membrane protein FhaC: a member of the Omp85-TpsB transporter superfamily.Science31795761
  28. 28. Könninger UW, Hobbie S, Benz R, Braun V (1999) The haemolysin-secreting ShlB protein of the outer membrane of Serratia marcescens: determination of surface-exposed residues and formation of ion-permeable pores by ShlB mutants in artificial lipid bilayer membranes. Mol Microbiol 32: 1212–25.UW KönningerS. HobbieR. BenzV. Braun1999The haemolysin-secreting ShlB protein of the outer membrane of Serratia marcescens: determination of surface-exposed residues and formation of ion-permeable pores by ShlB mutants in artificial lipid bilayer membranes.Mol Microbiol32121225
  29. 29. Hoiczyk E, Roggenkamp A, Reichenbecher M, Lupas A, Heesemann J (2000) Structure and sequence analysis of Yersinia YadA and Moraxella UspAs reveal a novel class of adhesins. EMBO J 19: 5989–99.E. HoiczykA. RoggenkampM. ReichenbecherA. LupasJ. Heesemann2000Structure and sequence analysis of Yersinia YadA and Moraxella UspAs reveal a novel class of adhesins.EMBO J19598999
  30. 30. Kim DS, Chao Y, Saier MH Jr (2006) Protein-translocating trimeric autotransporters of gram-negative bacteria. J Bacteriol 188: 5655–67.DS KimY. ChaoMH Saier Jr2006Protein-translocating trimeric autotransporters of gram-negative bacteria.J Bacteriol188565567
  31. 31. Cotter SE, Surana NK, St Geme JW 3rd (2005) Trimeric autotransporters: a distinct subfamily of autotransporter proteins. Trends Microbiol 13: 199–205.SE CotterNK SuranaJW St Geme 3rd2005Trimeric autotransporters: a distinct subfamily of autotransporter proteins.Trends Microbiol13199205
  32. 32. Meng G, Surana NK, St Geme JW 3rd, Waksman G (2006) Structure of the outer membrane translocator domain of the Haemophilus influenzae Hia trimeric autotransporter. EMBO J 25: 2297–304.G. MengNK SuranaJW St Geme 3rdG. Waksman2006Structure of the outer membrane translocator domain of the Haemophilus influenzae Hia trimeric autotransporter.EMBO J252297304
  33. 33. Larocque RC, Harris JB, Dziejman M, Li X, Khan AI, et al. (2005) Transcriptional profiling of Vibrio cholerae recovered directly from patient specimens during early and late stages of human infection. Infect Immun 73: 4488–93.RC LarocqueJB HarrisM. DziejmanX. LiAI Khan2005Transcriptional profiling of Vibrio cholerae recovered directly from patient specimens during early and late stages of human infection.Infect Immun73448893
  34. 34. Nakhamchik A, Wilde C, Rowe-Magnus DA (2008) Cyclic-di-GMP regulates extracellular polysaccharide production, biofilm formation, and rugose colony development by Vibrio vulnificus. Appl Environ Microbiol 74: 4199–209.A. NakhamchikC. WildeDA Rowe-Magnus2008Cyclic-di-GMP regulates extracellular polysaccharide production, biofilm formation, and rugose colony development by Vibrio vulnificus.Appl Environ Microbiol744199209
  35. 35. Beyhan S, Tischler AD, Camilli A, Yildiz FH (2006) Transcriptome and phenotypic responses of Vibrio cholerae to increased cyclic di-GMP level. J Bacteriol 188: 3600–13.S. BeyhanAD TischlerA. CamilliFH Yildiz2006Transcriptome and phenotypic responses of Vibrio cholerae to increased cyclic di-GMP level.J Bacteriol188360013
  36. 36. Barnes IH, Bagnall MC, Browning DD, Thompson SA, Manning G, et al. (2007) Gamma-glutamyl transpeptidase has a role in the persistent colonization of the avian gut by Campylobacter jejuni. Microb Pathog 43: 198–207.IH BarnesMC BagnallDD BrowningSA ThompsonG. Manning2007Gamma-glutamyl transpeptidase has a role in the persistent colonization of the avian gut by Campylobacter jejuni.Microb Pathog43198207
  37. 37. Chevalier C, Thiberge JM, Ferrero RL, Labigne A (1999) Essential role of Helicobacter pylori gamma-glutamyltranspeptidase for the colonization of the gastric mucosa of mice. Mol Microbiol 31: 1359–72.C. ChevalierJM ThibergeRL FerreroA. Labigne1999Essential role of Helicobacter pylori gamma-glutamyltranspeptidase for the colonization of the gastric mucosa of mice.Mol Microbiol31135972
  38. 38. Ryjenkov DA, Tarutina M, Moskvin OV, Gomelsky M (2005) Cyclic diguanylate is a ubiquitous signalling molecule in bacteria: insights into biochemistry of the GGDEF protein domain. J Bacteriol 187: 1792–8.DA RyjenkovM. TarutinaOV MoskvinM. Gomelsky2005Cyclic diguanylate is a ubiquitous signalling molecule in bacteria: insights into biochemistry of the GGDEF protein domain.J Bacteriol18717928
  39. 39. Heidelberg JF, Eisen JA, Nelson WC, Clayton RA, Gwinn ML, et al. (2000) DNA sequence of both chromosomes of the cholera pathogen Vibrio cholerae. Nature 406: 477–83.JF HeidelbergJA EisenWC NelsonRA ClaytonML Gwinn2000DNA sequence of both chromosomes of the cholera pathogen Vibrio cholerae.Nature40647783
  40. 40. Henderson IR, Navarro-Garcia F, Nataro JP (1998) The great escape: structure and function of the autotransporter proteins. Trends Microbiol 6: 370–8.IR HendersonF. Navarro-GarciaJP Nataro1998The great escape: structure and function of the autotransporter proteins.Trends Microbiol63708
  41. 41. Hendrixson DR, de la Morena ML, Stathopoulos C, St Geme JW 3rd (1997) Structural determinants of processing and secretion of the Haemophilus influenzae hap protein. Mol Microbiol 26: 505–18.DR HendrixsonML de la MorenaC. StathopoulosJW St Geme 3rd1997Structural determinants of processing and secretion of the Haemophilus influenzae hap protein.Mol Microbiol2650518
  42. 42. Altschul SF, Madden TL, Schäffer AA, Zhang J, Zhang Z, et al. (1997) Gapped BLAST and PSI-BLAST: a new generation of protein database search programs. Nucleic Acids Res 25: 3389–402.SF AltschulTL MaddenAA SchäfferJ. ZhangZ. Zhang1997Gapped BLAST and PSI-BLAST: a new generation of protein database search programs.Nucleic Acids Res253389402
  43. 43. Mao F, Dam P, Chou J, Olman V, Xu Y (2009) DOOR: a database for prokaryotic operons. Nucleic Acids Res 37: D459–63.F. MaoP. DamJ. ChouV. OlmanY. Xu2009DOOR: a database for prokaryotic operons.Nucleic Acids Res37D45963
  44. 44. Zhang Y (2007) Template-based modeling and free modeling by I-TASSER in CASP7. Proteins 69: 108–17.Y. Zhang2007Template-based modeling and free modeling by I-TASSER in CASP7.Proteins6910817
  45. 45. Sippl MJ, Wiederstein M (2008) A note on difficult structure alignment problems. Bioinformatics 24: 426–7.MJ SipplM. Wiederstein2008A note on difficult structure alignment problems.Bioinformatics244267
  46. 46. Sayle RA, Milner-White EJ (1995) RASMOL: biomolecular graphics for all. Trends Biochem Sci 20: 374.RA SayleEJ Milner-White1995RASMOL: biomolecular graphics for all.Trends Biochem Sci20374
  47. 47. Zhai Y, Saier MH Jr (2001) A web-based program (WHAT) for the simultaneous prediction of hydropathy, amphipathicity, secondary structure and transmembrane topology for a single protein sequence. J Mol Microbiol Biotechnol 3: 501–2.Y. ZhaiMH Saier Jr2001A web-based program (WHAT) for the simultaneous prediction of hydropathy, amphipathicity, secondary structure and transmembrane topology for a single protein sequence.J Mol Microbiol Biotechnol35012
  48. 48. Bagos PG, Liakopoulos TD, Spyropoulos IC, Hamodrakas SJ (2004) PRED-TMBB: a web server for predicting the topology of beta-barrel outer membrane proteins. Nucleic Acids Res 32: W400–4.PG BagosTD LiakopoulosIC SpyropoulosSJ Hamodrakas2004PRED-TMBB: a web server for predicting the topology of beta-barrel outer membrane proteins.Nucleic Acids Res32W4004
  49. 49. Jeanmougin F, Thompson JD, Gouy M, Higgins DG, Gibson TJ (1998) Multiple sequence alignment with Clustal X. Trends Biochem Sci 23: 403–5.F. JeanmouginJD ThompsonM. GouyDG HigginsTJ Gibson1998Multiple sequence alignment with Clustal X.Trends Biochem Sci234035
  50. 50. Do CB, Mahabhashyam MS, Brudno M, Batzoglou S (2005) ProbCons: Probabilistic consistency-based multiple sequence alignment. Genome Res 15: 330–40.CB DoMS MahabhashyamM. BrudnoS. Batzoglou2005ProbCons: Probabilistic consistency-based multiple sequence alignment.Genome Res1533040
  51. 51. Biegert A, Mayer C, Remmert M, Söding J, Lupas AN (2006) The MPI Bioinformatics Toolkit for protein sequence analysis. Nucleic Acids Res 34: W335–9.A. BiegertC. MayerM. RemmertJ. SödingAN Lupas2006The MPI Bioinformatics Toolkit for protein sequence analysis.Nucleic Acids Res34W3359
  52. 52. Tamura K, Dudley J, Nei M, Kumar S (2007) MEGA4: Molecular Evolutionary Genetics Analysis (MEGA) software version 4.0. Mol Biol Evol 24: 1596–9.K. TamuraJ. DudleyM. NeiS. Kumar2007MEGA4: Molecular Evolutionary Genetics Analysis (MEGA) software version 4.0.Mol Biol Evol2415969