Figures
Abstract
The species from the order Xanthomonadales, which harbors many important plant pathogens and some human pathogens, are currently distinguished primarily on the basis of their branching in the 16S rRNA tree. No molecular or biochemical characteristic is known that is specific for these bacteria. Phylogenetic and comparative analyses were conducted on 26 sequenced Xanthomonadales genomes to delineate their branching order and to identify molecular signatures consisting of conserved signature indels (CSIs) in protein sequences that are specific for these bacteria. In a phylogenetic tree based upon sequences for 28 proteins, Xanthomonadales species formed a strongly supported clade with Rhodanobacter sp. 2APBS1 as its deepest branch. Comparative analyses of protein sequences have identified 13 CSIs in widely distributed proteins such as GlnRS, TypA, MscL, LysRS, LipA, Tgt, LpxA, TolQ, ParE, PolA and TyrB that are unique to all species/strains from this order, but not found in any other bacteria. Fifteen additional CSIs in proteins (viz. CoxD, DnaE, PolA, SucA, AsnB, RecA, PyrG, LigA, MutS and TrmD) are uniquely shared by different Xanthomonadales except Rhodanobacter and in a few cases by Pseudoxanthomonas species, providing further support for the deep branching of these two genera. Five other CSIs are commonly shared by Xanthomonadales and 1–3 species from the orders Chromatiales, Methylococcales and Cardiobacteriales suggesting that these deep branching orders of Gammaproteobacteria might be specifically related. Lastly, 7 CSIs in ValRS, CarB, PyrE, GlyS, RnhB, MinD and X001065 are commonly shared by Xanthomonadales and a limited number of Beta- or Gamma-proteobacteria. Our analysis indicates that these CSIs have likely originated independently and they are not due to lateral gene transfers. The Xanthomonadales-specific CSIs reported here provide novel molecular markers for the identification of these important plant and human pathogens and also as potential targets for development of drugs/agents that specifically target these bacteria.
Citation: Naushad HS, Gupta RS (2013) Phylogenomics and Molecular Signatures for Species from the Plant Pathogen-Containing Order Xanthomonadales. PLoS ONE 8(2): e55216. https://doi.org/10.1371/journal.pone.0055216
Editor: Celine Brochier-Armanet, Université Claude Bernard - Lyon 1, France
Received: May 2, 2012; Accepted: December 19, 2012; Published: February 8, 2013
Copyright: © 2013 Naushad, Gupta. This is an open-access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited.
Funding: NSERC (Natural Science and Engineering Research Council of Canada) 249924. The funders had no role in study design, data collection and analysis, decision to publish, or preparation of the manuscript.
Competing interests: The authors have declared that no competing interests exist.
Introduction
The Xanthomonadales are gram-negative, non-spore forming, catalase-positive, aerobic, rod shape bacteria [1], which are part of the class Gammaproteobacteria [2]. This order is comprised of two families Xanthomonadaceae and Sinobacteraceae that contain 22 and 6 genera, respectively (http://www.bacterio.cict.fr/classifphyla.html#Proteobacteria). The Xylella and Xanthomonas species, which are part of the order Xanthomonadales, cause a wide variety of serious diseases in more than 400 agriculturally important plants. Some of the economically important crops that are affected by species from these two genera include tomato, cabbage, pepper, banana, citrus, rice, grapes, peach, plum, almond, coffee and maple [3]–[9] Additionally, Xylella fastidiosa is responsible for causing leaf scorch disease in many landscape and ornamental plants including oak, elm, mulberry, sycamore, maple and oleander [7], [9]–[11]. The diseases caused by these bacteria lead to major crop losses globally and thus they constitute serious agricultural and economic threat. In addition to these important phytopathogens, the Xanthomonadales also harbors the genus Stenotrophomonas, whose members (viz. S. maltophila) are multidrug resistant opportunistic pathogens, responsible for many hospital-acquired infections in immuno-compromised patients. These latter bacteria are also implicated in respiratory infections in cystic fibrosis patients [12]–[14].
The species from the order Xanthomonadales and its different families/genera are currently distinguished from other bacteria primarily on the basis of their branching in the 16S rRNA trees [1], [4], [15]. There is no biochemical, morphological or physiological characteristics known that are uniquely shared by various species from this order. Although Xanthomonadales are an order within the class Gammaproteobacteria, in phylogenetic trees based upon some genes/proteins sequences, these species are observed to branch with other classes of proteobacteria, particularly the Betaproteobacteria [16]–[20]. However, detailed phylogenetic studies based upon two independent, large datasets of concatenated protein sequences have now established that the species from the order Xanthomonadales are a deep branching clade within the class Gammaproteobacteria [21], [22]. Several recently identified molecular signatures that are uniquely shared by Xanthomonadales and all other Gammaproteobacteria also support the placement of this group within the Gammaproteobacteria [21], [23]. The anomalous branching of Xanthomonadales in some phylogenetic trees possibly results from the deep branching of Xanthomonadales within the Gammaproteobacteria and also in some cases by lateral gene transfers (LGTs). In particular, extensive work by Menck and coworkers indicate that about 25% of the genes in Xanthomonas, which include many genomic islands as well as some genes involved in the biosynthesis of NAD, arginine and cysteine, are acquired by LGTs [16], [24]–[28].
Because Xanthomonadales harbor many major phytopathogens and also some important human pathogens, it is important to understand the evolutionary relationships among these bacteria and identify molecular markers that are specific for either all Xanthomonadales or its different genera. Due to the importance of these bacteria for agriculture and human health, the complete genome sequences for 26 Xanthomonadales species/strains are now available in the NCBI database (see Table 1). In addition, genomes for many other species/strains from this order are currently being sequenced and partial sequence information for them is also available in the databases. These genomes provide valuable resource for discovering molecular and biochemical characteristics that are uniquely shared by these bacteria and which should provide novel means for their identification and also as potential new targets for development of drugs targeting these bacteria. Earlier comparative genomic studies on Xanthomonadales have focused on identifying characteristics that are responsible for the virulence and host specificity of different strains and pathovars of Xanthomonas and Xylella and on understanding the role of LGTs in their genome evolution [3], [4], [4], [7], [8], [11], [29]–[34], [34]–[36]. A recent study on DNA repair proteins also identified four conserved indels that were specific for the available Xanthomonadales species [28]. However, thus far no detailed study has been carried out which is aimed at identifying genetic or molecular characteristics that are uniquely shared by either all Xanthomonadales or its different genera.
Using genome sequence data, our recent work has focused on identifying Conserved Signature Indels (inserts or deletions) (CSIs) of defined lengths that are present at specific locations in widely distributed proteins and which are uniquely found in particular groups of organisms [37]–[40]. The most parsimonious explanation of these CSIs is that they resulted from highly specific genetic changes that first occurred in a common ancestor of the particular groups of species and were then passed on to various descendants [37], [40], [41]. Further, depending upon the presence or absence of these CSIs in outgroup species, it is possible to infer whether a given CSI is an insert or a deletion and this information can be used to develop rooted phylogenetic relationships independently of phylogenetic trees [21], [37], [42]–[45]. Additionally, the shared presence of some CSIs in unrelated groups of bacteria can also identify possible cases of LGTs [46]. In this work, we report detailed phylogenetic and comparative analyses of protein sequences from Xanthomonadales genomes to identify CSIs that are specific for these organisms. These studies have identified 13 CSIs that are specific for all sequenced Xanthomonadales species and many others CSIs that provide information regarding evolutionary relationships among these bacteria. These molecular signatures provide novel and highly specific means for identification of Xanthomonadales species and for different types of studies on these bacteria. We also report here several CSIs that are commonly shared by Xanthomonadales and either Beta- and/or Alpha-proteobacteria. However, our analysis indicates that the shared presence of these CSIs in Xanthomonadales and these other bacterial groups is due to independent occurrence of similar genetic changes and not due to LGTs.
Methods
Phylogenetic Analyses
Phylogenetic analyses were conducted on a concatenated sequence alignment for 28 conserved and widely distributed proteins that have been widely used for phylogenetic studies [21], [47], [48] and are present in all the Xanthomonadales. These proteins included, alanyl-tRNA synthetase, arginyl-tRNA synthetase, cell division protein FtsY, chaperonin GroEL, dimethyladenosine transferase, DNA gyrase subunit A, DNA gyrase subunit B, DNA polymerase I, DNA-dependent helicase II, elongation factor Tu, histidyl-tRNA synthetase, isoleucyl-tRNA synthetase, methionyl-tRNA synthetase, molecular chaperone DnaK, O-sialoglycoprotein endopeptidase, phenylalanyl-tRNA synthetase subunit alpha, phosphatidate cytidylyltransferase, prolyl-tRNA synthetase, RpoB, RpoC, SecA, SecY, serine hydroxymethyltransferase, seryl-tRNA synthetase, signal recognition particle protein, thioredoxin reductase, tryptophanyl-tRNA synthetase and valyl-tRNA synthetase. For each of these proteins, sequences for all sequenced Xanthomonadales and a number of other Gamma-, Beta- and Alpha-proteobacteria were retrieved by Blastp searches and multiple sequence alignments were created by using the CLUSTAL_X 2.0 [49]. These sequence alignments were concatenated into a single large file and the poorly aligned regions from the alignment were removed using Gblocks 0.91 b program [50]. After removal of poor aligned regions, a total of 14621 aligned positions were present in the final dataset. A neighbor-joining (NJ) tree based on 100 bootstrap replicates was constructed using the JTT matrix-based method [51] in MEGA 5 [52]. A maximum-likelihood tree based upon the same sequence data set was also constructed using the Whelan and Goldman+Freq. model [53] using MEGA5. All positions containing gaps were not considered during these tree constructions.
Identification of Xanthomonadales Specific Conserved Signature Indels (CSIs)
To search for signature sequences in different proteins that are specific for Xanthomonadales or for its subclades, Blastp searches were carried out on each proteins (open reading frame) from the genome of Xylella fastidiosa 9a5c against the NCBI nr database [35]. The results of blast searches were examined for high scoring homologs. For those proteins for whom high scoring homologs (E value <1e−20) were present in Xanthomonadales and several other bacteria, about 10–15 sequences representing different groups were retrieved and multiple sequence alignments were constructed using the CLUSTAL_X 2.0 program [49]. The sequence alignments were visually inspected to identify any conserved indels that were restricted to Xanthomonadales and which were flanked on both sides by at least 5–6 identical/conserved residues in the neighboring 30–50 amino acids [21], [40], [54]. The conserved indels, which in addition to Xanthomonadales were also present in few other species, were also retained. The indels that were not flanked on both sides by conserved regions were not further evaluated as they do not provide useful molecular markers [23], [37], [40]. The species distribution of all indels thus identified (∼150) was further examined by detailed Blastp searches against the nr database (500 top hits) on short sequence segments containing the indels and their flanking conserved regions. Based upon detailed Blast searches, many original indels queries were found to be uninformative for this study due to a variety of reasons including their presence in only a single species/strain, lack of sequence conservation, presence of other confounding indels in the same area in other species, lack of specificity of the indels for any particular group and large variation in their lengths, etc. Hence, such indels were not further studied. However, for different indels those were specific for Xanthomonadales or present in a limited number of other bacteria, sequence information for them were compiled into signature files that are shown here. Due to space considerations, the signature files shown here contain information for only a limited number of species from other bacteria such as Alpha, Beta and Gammaproteobacteria and different strains of the same species are also not shown. However, unless otherwise noted, all of these CSIs are specific for the indicated groups and they are also present in different strains of the Xanthomonadaceae species for which sequence information is available (Table 1).
Results
Phylogenetic Analysis of Genome Sequenced Xanthomonadales
The genome sequences are now available for 26 Xanthomonadales including 15 for Xanthomonas species/strains [5], [6], [29]–[33], [55], [56], 5 for different strains/pathovars of Xylella fastidiosa [10], [34], [35], [57], 3 for Stenotrophomonas species/strains [12], 2 for Pseudoxanthomonas species [58] and for the Rhodanobacter sp. 2APBS1. Some characteristics of these genomes are listed in Table 1. Their genome sizes varied from 2.5 Mb to 5.3 Mb and the xylem-inhabiting bacterium Xylella fastidiosa had the smallest or most reduced genome. Further, in contrast to other Xanthomonadales species/strains whose mol G+C % was in the range of 61–67%, the Xylella strains/pathovars have much lower G+C content. The reduced genome size and the lower G+C mole content of Xylella strains/pathovars have likely resulted from their adaptation to the more stable xylem environment [7].
The sequence information from these genomes was also used to examine the evolutionary relationships among the sequenced Xanthomonadales species. Detailed phylogenetic studies on Gammaproteobacteria and other proteobacteria based upon concatenated sequences for different large datasets of protein sequences have been reported previously [19], [21], [22]. In these trees [4], [28], [59], species from the order Xanthomonadales formed a monophyletic clade and one of the deepest branching lineages within the Gammaproteobacteria [21], [22]. Hence, in the present work, phylogenetic trees based upon concatenated sequences were mainly constructed to clarify the branching order of species within the order Xanthomonadales. The dataset employed in this study included sequence information for only a limited number of other proteobacteria. Figure 1 shows a NJ distance tree based upon concatenated protein sequences, which was rooted using sequences from Alphaproteobacteria. The branching order of various Xanthomonadales species in the ML tree (Figure S1) is very similar to that seen in the NJ tree. In both ML and NJ tree, the Xanthomonadales species formed a strongly supported clade branching within the other Gammaproteobacteria. This clade was separated from all other Gammaproteobacteria by a long branch. Similar monophyletic grouping and branching of the Xanthomonadales species within the Gammaproteobacteria have been observed in earlier studies [19], [21], [22]. Among the sequenced Xanthomonadales species, Rhodanobacter was found to be the deepest branching species and it was separated from all other Xanthomonadales by a long branch. Interestingly, the sequenced Xanthomonas species showed polyphyletic branching in the tree, with X. albilineans branching deeply and separately from the other Xanthomonas species (Figure 1 and Figure S1). The tree shown in Figure 1 provides a phylogenetic framework for understanding and interpreting the significance of various CSIs observed in this work.
The tree shown is a NJ distance tree, however, similar branching was observed in the ML tree (Figure S1). The observed bootstrap scores for various nodes are shown on the branch points. The tree was rooted using sequences from Alphaproteobacteria.
Identification of Conserved Signature Indels that are Specific for Xanthomonadales
Our work has identified 13 CSIs that are uniquely present in all sequenced Xanthomonadales including the deepest branching Rhodanobacter. Two examples of these CSIs are shown in Figure 2A & B. In the first case (Figure 2A), an 18 aa insert in highly conserved region of the protein glutaminyl-tRNA synthetase, which plays an essential role in protein synthesis by linking glutamine to its cognate tRNA [60]. The large insert in GlnRS is uniquely shared by all available sequences from Xanthomonadales species but not found in any other bacteria (at least the top 500 blast hits). In the other example shown here (Figure 2B), a 4 aa insert in a GTP-binding elongation factor protein (typA) is commonly shared by all sequenced Xanthomonadales, but again it is not found in any other bacteria. Both these CSIs are present in highly conserved regions of the proteins and their sequences are also highly conserved. Because these CSIs are lacking in other bacteria, they constitute inserts in the Xanthomonadales rather than deletions in other bacteria [38]. The sequence information for other CSIs that are uniquely present in all sequenced species/strains of Xanthomonadales is presented in Figures S2–S12 and a summary of their characteristics is provided in Table 2 (first 13 entries). These CSIs include a 7 aa insert in amino acid/peptide transported protein; 5 aa insert in conserved region of the large-conductance mechanosensitive channel protein; a 3 aa insert in LysRS; 2 aa insert in highly conserved region of the protein lipoyl synthase (LipA); 1 aa inserts in the proteins Tgt, LpxA and TolQ; a 13 aa deletion in alpha-2-macroglobulin domain-containing protein and 1 aa deletions in the ParE, PolA and TyrB proteins. Because these CSIs are present in all sequenced Xanthomonadales but not found in any other bacteria, the most likely explanation is that genetic changes responsible for them first occurred in a common ancestor of the Xanthomonadales and then passed on to various descendants by vertical descent.
Excerpts are shown from the sequence alignments of (A) Glutaminyl t-RNA synthetase and (B) GTP-binding elongation factor proteins showing two CSIs that are uniquely found in various sequenced Xanthomonadales species, but not found in any other bacteria. Information for other CSIs that are specific for the Xanthomonadales is provided in Figures S2–S12 and Table 2. The dashes in these as well as all other alignments show identity with the amino acid on the top line. The Gene bank identification numbers of various sequences are shown in the second column and the numbers on the top indicate the position of this sequence in the species shown on the top line. The sequence information is shown here for only representative species. However, unless otherwise indicated, these CSIs are highly specific for the indicated group of species.
In addition to these CSIs that are uniquely found in all Xanthomonadales, we have also come across 6 other CSIs, where in addition to the Xanthomonadales, the identified CSIs are also present in 1–3 other Gammaproteobacteria. These species are generally from some of the other deep branching orders of Gammaproteobacteria such as Chromatiales, Methylococcales and Cardiobacteriales, which branch in the proximity of Xanthomonadales [21], [22], [28]. One example of such a CSI consisting of a 1 aa deletion in a conserved region of the protein glutaminyl-tRNA synthetase that is commonly shared by various Xanthomonadales and also by a few Methylococcales and Cardiobacteriales species is presented in Figure 3. Sequence information for others CSIs of this kind is presented in Figures S13–S17 and in Table 2 (last six records). Cutino-Jimenez et al. [28]also reported a CSI in Topoisomerase I that was commonly shared by various Xanthomonadales, Methylococcales, Cardiobacteriales, Chromatiales, Legionellales and Thiotrichales. The information provided by these CSIs could prove useful in establishing a specific relationship of the Xanthomonadales to these other deep branching orders of Gammaproteobacteria.
This CSI as well as a few other CSIs identified in this work (see Table 2 and Figures S13–S17) suggest a possible relationship of Xanthomonadales to these deep branching orders of Gammaproteobacteria.
CSIs Supporting the Deeper Branching of Rhodanobacter within the Xanthomonadales
In the phylogenetic tree shown in Figure 1 and Figure S1, Rhodanobacter sp. 2APBS1 exhibited the deepest branching amongst the sequenced Xanthomonadales. During our analyses, we have found 15 CSIs that are uniquely shared by all other Xanthomonadales except Rhodanobacter, supporting the deeper branching of this species in comparison to other Xanthomonadales. Two examples of such CSIs that are uniquely found in different Xanthomonadales, but not in Rhodanobacter are shown in Figure 4A & B. In both these cases 5 aa inserts in highly conserved regions of the proteins uroporphyrinogen decarboxylase (Figure 4A) and in the protein tRNA delta(2)-isopentenylpyrophosphate transferase (Figure 4B) are uniquely shared by different sequenced Xanthomonadales except Rhodanobacter. These CSIs are not present in any other bacteria. A summary of the characteristics of different CSIs showing this type of species distribution pattern is presented in Table 3 and the sequence alignments of the corresponding proteins are provided as Figures S18–S30. The proteins in which these CSIs are found include protoheme IX farnesyltransferase (CoxD), DNA polymerase III alpha subunit (DnaE), DEAD box helicase domain-containing protein, ribose-5-phosphate isomerase A (RpiA), DNA polymerase I (PolA), glucose-6-phosphate 1-dehydrogenase (Zwf1), AspRS, 2-oxoglutarate-dehydrogenase E1 component (SucA), coproporphyrinogen III oxidase (CpoX), and TrmD. In a few of these cases, the CSIs under consideration was also not found in one or both of the Pseudoxanthomonas species, supporting their deeper branching in comparison to other Xanthomonadales genera (viz. Xylella, Xanthomonas and Stenotrophomonas) (Figures S32–S34). In a recent study, Cutino-Jimenez et al. [28] had reported four CSIs in DNA repair proteins that were indicated to be specific for Xanthomonadales. Our analyses of these CSIs, which were also identified in our work, indicate that they are lacking in either Rhodanobacter (4 aa insert in DnaE and 1 aa insert in RecA) or both Rhodanobacter and in P. suwonensis (5 aa insert in MutS and >50 aa insert in LigA) (Figures S29–S31 and S35). The information for these CSIs is also summarized in Table 3. Based upon the species distribution of these CSIs and the branching positions of Rhodanobacter (and Pseudoxanthomonas) in phylogenetic trees, the genetic changes responsible for these CSIs likely occurred in common ancestors of other Xanthomonadales species after the divergence of Rhodanobacter sp. 2APBS1 and also in some cases that of Pseudoxanthomonas species.
Excerpts are shown from the sequence alignments of (A) uroporphyrinogen decarboxylase (HemE) and (B) tRNA delta(2)-isopentenylpyrophosphate transferase (MiaA) proteins showing two conserved signature indels (boxed) that are specifically found in various sequenced Xanthomonadales species, except Rhodanobacter sp. 2APBS1. These CSIs were likely introduced in these genes in a common ancestor of the Xanthomonadales after branching of Rhodanobacter. Information for CSIs in other proteins showing similar species specificities is provided in Figures S18–S30 and Table 3.
In addition to the CSIs discussed above 4 other proteins contains CSIs of different lengths at the same position, which are uniquely shared by all sequenced species/strains of Xanthomonadales except Rhodanobacter sp. 2APBS1 and in some cases Pseudoxanthomonas. However, these CSIs due to differences in their lengths are also able to distinguish between different genera of Xanthomonadaceae. Two examples of such CSIs are presented in Figure 5. In the first case in the protein 5′-nucleotidase (Figure 5A), which catalyzes the hydrolysis of nucleotides to nucleosides, a 13 aa insert is uniquely shared by all Xanthomonas and Xylella species, whereas the two Stenotrophomonas species have an 11 aa insert in the same position. Because both these CSIs are present at the same position and they are related in sequences, the most likely explanation about their occurrence is that a 13 aa insert was initially introduced in a common ancestor of the Xanthomonas, Xylella and Stenotrophomonas genera and it was followed by a 2 aa deletion in the genus Stenotrophomonas. Alternatively, an 11 aa insert was initially introduced in a common ancestor of these three genera followed by another 2 aa insert in a common ancestor of the Xanthomonas and Xylella genera. Likewise, in a conserved region of the asparagine synthase b protein (AsnB), a 5 aa insert is present in various Xylella, Xanthomonas and Pseudoxanthomonas, whereas the two Stenotrophomonas species have a smaller insert (4 aa) in this position (Figure S27). The AsnB protein also contains another CSI in a different position (see Figure S28), where a 1 aa insert is present in Xylella, Xanthomonas and Pseudoxanthomonas, species, whereas the two Stenotrophomonas species have a 2 aa insert in the same position. In another example of this kind, in the protein CTP synthetase, a 2 aa insert in a conserved region is uniquely shared by various Xylella, Xanthomonas and Stenotrophomonas species/strains, whereas the two Pseudoxanthomonas species contain a 1 aa insert in this position (Figure 5B). These CSIs, in addition to supporting the deeper branching of Rhodanobacter in comparison to other Xanthomonadales, also serve to differentiate Stenotrophomonas and Pseudoxanthomonas species from other genera of Xanthomonadaceae.
Partial sequence alignments are shown of the proteins (A) 5′-nucleotidase and (B) CTP synthetase showing two CSI, which due to their different lengths are able to distinguish between two different clades of Xanthomonadales. In (A), a 13 aa insert is present in all of the Xanthomonas and Xylella species, whereas the two Stenotrophomonas spp. contain an 11 aa insert in this position. Similarly, in (B), all of the Xanthomonas, Xylella and Stenotrophomonas species have a 2 aa insert, whereas the two Pseudoxanthomonas spp. contain a 1 aa insert in this position. Different possibilities to account for these CSIs are discussed in the text.
CSIs that are Commonly Shared by Xanthomonadales and Some Alpha- and Beta-proteobacteria
In addition to the above proteins that contained CSIs, which were highly specific for Xanthomonadales species (or 1–2 closely related species), our analyses have also identified 7 other CSIs, which in addition to various Xanthomonadales are also shared by some Betaproteobacteria and/or Alphaproteobacteria. Two examples of these CSIs are shown in Figures 6 and 7. In the protein valyl-tRNA synthetase, which plays an essential role in protein synthesis, a 13 aa insert in a highly conserved region is present in all sequenced Xanthomonadales, except Rhodanobacter (Figure 6). Interestingly, a very similar CSI is also present in several species belonging to the class Alphaproteobacteria (e.g. Ahrensia sp. R2A130, Labrenzia alexandrii, Rhodobacter capsulatus, Sagittula stellata etc.) whereas other Alphaproteobacteria do not contain this insert. In the other example shown here (Figure 7), in the protein carbamoyl phosphate synthase large subunit (CarB), a 1 aa insert in a conserved region is commonly shared by various Xanthomonadales and a subgroup of Betaproteobacteria (mainly Burkholderiales), but not by any other bacterial groups. The shared presence of similar CSIs by different Xanthomonadales and species from these other classes of proteobacteria could result from a variety of possibilities including lateral transfers of genes for these proteins between these two groups of bacteria or alternatively by independent occurrence of similar genetic changes in these lineages.
Other Alpha- and Gamma-proteobacteria do not contain this insert.
The distinct branching of these two groups in a phylogenetic tree based upon CarB sequence (Figure S36) provides evidence that this shared CSIs is not a result of LGT.
To distinguish between these possibilities, phylogenetic trees for the ValRS and CarB sequences for the same species as shown in Figures 6 and Figure 7 were constructed. In the tree based upon ValRS sequences, which is shown Figure 8, all of the Alphaproteobacteria species (both containing and lacking the insert) formed a strongly supported clade that branched distinctly from the Xanthomonadales. The Xanthomonadales species in this tree branched in between the clades consisting of Betaproteobacteria and the other Gammaproteobacteria, but that is not surprising in view of phylogenetic position within the Gammaproteobacteria. If the shared presence of the CSI in the Xanthomonadales and the CSI-containing Alphaproteobacteria was due to LGTs, then the Alphaproteobacteria containing this CSI should have branched with the Xanthomonadales, which is not observed here. Similarly, in the tree based upon CarB sequences (Figure S36), all of the Betaproteobacteria branched together and no association was observed between the insert containing Betaproteobacteria and the Xanthomonadales. These results do not support the possibility that LGT was responsible for the shared presence of CSIs in these two groups. Instead in the phylogenetic trees shown in Figure 8 and Figure S36, the clades comprising of the inserts containing Alphaproteobacteria or Betaproteobacteria formed distinct subclades within the rest of the Alpha- or Beta-proteobacteria. Thus, it is likely that the genetic changes responsible for these CSI occurred independently in the common ancestors of these subclades of species.
The distinct branching of Xanthomonadales and the Alphaproteobacteria containing this insert suggests that the shared presence of this CSIs in these two groups is not due to a LGT.
Besides these two proteins that contained CSIs, which were commonly shared by Xanthomonadales and either some Alpha- or Beta-proteobacteria, five other proteins were identified that contained CSIs showing similar species distributions. These included: two CSIs consisting of 1 aa conserved deletions in a hypothetical protein XOO1065 and the protein orotate phosphoribosyltransferase (PyrE) that are commonly shared by various Xanthomonadales and some Betaproteobacteria (Figures S37 and S38); two CSIs consisting of 1 aa and 2 aa inserts in the proteins putative ribonuclease HII (RnhB) and glycyl-tRNA synthetase subunit beta (GlyS) that are also commonly shared by various Xanthomonadales and some Betaproteobacteria (Figures S39 and S40); a 1 aa deletion in a conserved region in the septum site-determining protein MinD that is commonly shared by Xanthomonadales and some Alpha- and Beta-proteobacteria (Figure S41). The phylogenetic trees based upon the sequences of these proteins are shown in Figures S42 to S46. In all of these trees, the proteobacterial groups which contained similar CSIs as found in the Xanthomonadales did not branch with the Xanthomonadales. These results provide evidence that the CSIs in these other bacterial groups have originated independently and their shared presence is not due to LGTs from Xanthomonadales.
Discussion
The Xanthomonadales species harbor many major plant pathogens [3], [4], [9] as well as some important human pathogens. However, these bacteria are presently distinguished from other bacteria solely on the basis of their branching in phylogenetic trees (primarily 16S rRNA) and no molecular or biochemical characteristic that is uniquely shared by various species from this group of bacteria is currently known [1]. This paper reports detailed phylogenetic and comparative genomic analyses of sequenced Xanthomonadales species to identify molecular markers that are specific for these bacteria and which are also helpful in understanding their evolutionary relationships. We report here for the first time 13 molecular signatures consisting of conserved indels in widely distributed proteins that are distinctive characteristics of all sequenced Xanthomonadales species, but they are not found in any other bacteria. In view of their Xanthomonadales-specificity, the most parsimonious explanation to account for these CSIs is that the rare genetic changes responsible for them occurred only once in a common ancestor of the Xanthomonadales and were then passed on to various descendent species vertically as shown in Figure 9 [37], [54], [61]. Further, the absence of these CSIs in all other bacteria strongly indicates that the genes for these proteins have not been laterally transferred from Xanthomonadales to other bacterial groups or vice versa. Thus, these molecular signatures (or synapomorphies) provide novel means for the identification and circumscription of species from the order Xanthomonadales in clear molecular terms.
We also report in this work detailed phylogenetic analyses of (sequenced) Xanthomonadales species based upon concatenated sequences for 28 widely distributed proteins. Earlier phylogenetic studies on Xanthomonadales are mainly based upon 16S rRNA or single genes such as Gyrase B and most of them cover only the genus Xanthomonas [4], [15], [59], [62], [63]. Among a number of novel relationships seen in this tree, these trees showed that Rhodanobacter sp. 2APBS1 formed the deepest branch within the Xanthomonadales and it was separated from all other species by a long branch. The branching of Pseudoxanthomonas and then other Xanthomonadaceae genera followed it. Importantly, our analyses have also identified 15 CSIs that are uniquely present in all other Xanthomonadales, except Rhodanobacter and in a few cases also by the Pseudoxanthomonas species. The genetic changes responsible for theses CSIs were likely introduced in a common ancestor of the other Xanthomonadales after the branching of Rhodanobacter and also in some cases Pseudoxanthomonas (Figure 9) and they provide independent evidence for the deep branching of these lineages with respect to other genera within this order.
Xanthomonadales species are indicated to have undergone extensive LGTs with other prokaryotic taxa particularly Alpha, Beta and some orders of Gamma- proteobacteria and in some cases with Archaea as well [16], [24]–[27]. In the present work, we have also identified several examples where a given CSI, in addition to being shared by all or most Xanthomonadales, was also present in some species from other groups of bacteria, most commonly from Alpha-, Beta- and Gamma- proteobacteria. Of these CSIs, five were present only in 1–3 species from other deep branching orders of Gammaproteobacteria and their possible significance is discussed below. Seven other CSIs were commonly shared by various Xanthomonadales and also several Betaproteobacteria and/or both Alpha- and Beta- proteobacteria. The shared presence of these CSIs between Xanthomonadales and these other proteobacteria could result from a number of possibilities including later transfer of the corresponding genes between these groups of bacteria or independent occurrence of similar genetic changes in these groups. However, phylogenetic trees based upon these protein sequences showed that Xanthomonadales species and the Alpha- and/or Beta- proteobacteria containing similar CSIs branched separately from each other, indicating that the presence of similar CSIs in these groups of bacteria was not due to LGTs. Therefore, genetic changes leading to similar CSIs in these groups likely occurred independently due to similar functional requirements for these CSIs. Although in our work we have not come across many examples of LGTs between Xanthomonadales and other groups of bacteria, our analyses is based only on proteins that contain conserved indels. Such genes/proteins represent only a small fraction of the total genes that are found in various genomes. Because most of these proteins are involved in essential functions, they are less prone to LGTs. In contrast, extensive work that Menck and coworkers have carried out on identification of cases of LGTs is primarily on species from the genus Xanthomonas [16], [24]–[27], which have thus far not studied in detail.
Xanthomonadales is one of the deepest branching orders within the Class Gammaproteobacteria. Some of the other orders that branch in its proximity include Chromatiales, Methylococcales, Cardiobacteriales, Legionelalles and Thiotrichales. However, the relationship of Xanthomonadales to these other orders is presently not understood. In the present work, we also identified six other CSIs (Table 2, last six entries), which in addition to various Xanthomonadales were also uniquely shared by 1 or 2 species from these orders of Gammaproteobacteria. The shared presence of these CSIs by Xanthomonadales and some of these other orders of Gammaproteobacteria suggests that either these orders are closely related or that similar genetic changes have occurred in them independently. However, further information from additional species from these orders will be necessary to establish whether the Xanthomonadales and some of these other orders of Gammaproteobacteria are specifically related and form a higher taxonomic clade within the Gammaproteobacteria.
The focus of the present study was on identifying molecular signatures that are specific for either the entire Xanthomonadales order or some of its deep branching lineages. Thus far, we have not carried out careful analyses of various signature sequences that are specific for specific genera viz. Xylella, Xanthomonas, Stenotrophomonas and Pseudoxanthomonas and such studies will be part of our future work. Nonetheless, based upon the identified molecular signatures it is now possible to identify and circumscribe species from the order Xanthomonadales from all other bacteria in clear molecular terms based upon large numbers of discrete molecular characteristics. Based upon our earlier work on CSIs for other groups/phyla of bacteria, most of these CSIs have degree of predictive ability [21], [64]–[66] and thus they are useful in identifying both known as well as unknown species belonging to these clades (viz. Xanthomonadales) in different environments. Xanthomonadales harbor many important plant pathogens that cause a variety of diseases in economically important crops and plants [3]–[9]. In addition, they also contain Stenotrophomonas, which are opportunistic human pathogens [12]–[14]. Thus, novel methods for sensitive and specific identification of species from this order in different settings are of much importance. Most of the Xanthomonadales-specific CSIs discovered in this work are present in highly conserved regions of the genes/proteins. Hence, based upon these gene sequences degenerate PCR primers (based upon either flanking conserved regions or the indel region and a flanking conserved region) could be readily designed to examine the presence or absence of gene sequences containing these CSIs in any given sample [64], [67]. Thus, molecular probes based upon these CSIs and/or their flanking regions should provide novel and specific means for the detection of new as well as existing Xanthomonadales species in different environments. The Xanthomonadales-specific CSIs, in addition to their usefulness for evolutionary and diagnostic studies, also provide novel and useful tools for genetic and biochemical investigations and possible means for identification of agents that specifically target these plant pathogenic bacteria.
Supporting Information
Figure S1.
A maximum-likelihood tree based upon concatenated sequences for 28 conserved proteins. The tree shows the branching of Xanthomonadales group with Gammaproteobacteria. The tree was rooted using Alphaproteobacteria. The numbers on the nodes indicate statistical support for the nodes.
https://doi.org/10.1371/journal.pone.0055216.s001
(PDF)
Figure S2.
Partial sequence alignment of a conserved region in the amino acid/peptide transporter showing a 7 aa insert that is specific for all Xanthomonadales.
https://doi.org/10.1371/journal.pone.0055216.s002
(PDF)
Figure S3.
Partial sequence alignment of large-conductance mechanosensitive channel protein showing the presence of a 5 aa insert that is commonly shared by Xanthomonadales.
https://doi.org/10.1371/journal.pone.0055216.s003
(PDF)
Figure S4.
Partial sequence alignment of lysyl-tRNA synthetase showing a 3 aa insert that is uniquely shared by all members of Xanthomonadales.
https://doi.org/10.1371/journal.pone.0055216.s004
(PDF)
Figure S5.
Partial sequence alignment of lipoyl synthase showing a 2 aa insert that is commonly shared by Xanthomonadales.
https://doi.org/10.1371/journal.pone.0055216.s005
(PDF)
Figure S6.
Partial sequence alignment of a conserved region in the queuine tRNA-ribosyltransferase showing a 1 aa insert that is specific for Xanthomonadales.
https://doi.org/10.1371/journal.pone.0055216.s006
(PDF)
Figure S7.
Partial sequence alignment of a conserved region in the acyl-(acyl-carrier-protein)–UDP-N-acetylglucosamine O-acyltransferase showing a 1 aa insert that is commonly shared by Xanthomonadales.
https://doi.org/10.1371/journal.pone.0055216.s007
(PDF)
Figure S8.
Partial sequence alignment of a conserved region in the TolQ protein showing a 1 aa insert that is commonly shared by Xanthomonadales.
https://doi.org/10.1371/journal.pone.0055216.s008
(PDF)
Figure S9.
Partial sequence alignment of a conserved region in alpha-2-macroglobulin domain-containing protein showing a 13 aa deletion that is uniquely present in Xanthomonadales.
https://doi.org/10.1371/journal.pone.0055216.s009
(PDF)
Figure S10.
Partial sequence alignment of a conserved region of DNA topoisomerase IV subunit B showing a 1 aa deletion that is commonly specifically found in Xanthomonadales.
https://doi.org/10.1371/journal.pone.0055216.s010
(PDF)
Figure S11.
Partial sequence alignment of DNA polymerase I showing a 1 aa deletion that is uniquely shared by all members of Xanthomonadales.
https://doi.org/10.1371/journal.pone.0055216.s011
(PDF)
Figure S12.
Partial sequence alignment of a conserved region of aromatic amino acid aminotransferase showing a 1 aa deletion that is uniquely shared by Xanthomonadales.
https://doi.org/10.1371/journal.pone.0055216.s012
(PDF)
Figure S13.
Partial sequence alignment of a conserved region of DNA polymerase III subunit beta showing a 1 aa deletion that is present in Xanthomonadales. The CSI has also been found to be shared by Marinomonas sp. MWYL1 and Thioalkalivibrio sp. HL-EbGR7.
https://doi.org/10.1371/journal.pone.0055216.s013
(PDF)
Figure S14.
Partial sequence alignment of a conserved region of lipid-A-disaccharide synthase a 2 aa insert that is present in Xanthomonadales. The CSI has also been found to be shared by Cardiobacterium hominis, Allochromatium vinosum and Alteromonadales bacterium.
https://doi.org/10.1371/journal.pone.0055216.s014
(PDF)
Figure S15.
Partial sequence alignment of carbamoyl phosphate synthase large subunit a 1 aa insert that is present in Xanthomonadales. The CSI has also been found to be shared by Marinobacter sp. ELB17.
https://doi.org/10.1371/journal.pone.0055216.s015
(PDF)
Figure S16.
Partial sequence alignment of putative secreted protein showing a 1 aa insert that is present in Xanthomonadales. The CSI has also been found to be shared by Teredinibacter turnerae.
https://doi.org/10.1371/journal.pone.0055216.s016
(PDF)
Figure S17.
Partial sequence alignment of a conserved region of aminopeptidase P, showing a 1 aa deletion that is commonly shared by Xanthomonadales. The CSI has also been found to be shared by Thioalkalivibrio sp. HL-EbGR7 and Alkalilimnicola ehrlichii.
https://doi.org/10.1371/journal.pone.0055216.s017
(PDF)
Figure S18.
Partial sequence alignment of protoheme IX farnesyltransferase showing a 4 aa insert that is uniquely shared by subclade of Xanthomonadales after the divergence of Rhodanobacter sp. 2APBS1.
https://doi.org/10.1371/journal.pone.0055216.s018
(PDF)
Figure S19.
Partial sequence alignment of DNA polymerase III subunit alpha, showing a 1 aa insert that is uniquely present in Xanthomonadales except Rhodanobacter sp. 2APBS1.
https://doi.org/10.1371/journal.pone.0055216.s019
(PDF)
Figure S20.
Partial sequence alignment of a conserved region in the DEAD box helicase domain-containing protein showing a 1 aa insert that is specific for Xanthomonadales except Rhodanobacter sp. 2APBS1.
https://doi.org/10.1371/journal.pone.0055216.s020
(PDF)
Figure S21.
Partial sequence alignment of a conserved region in ribose-5-phosphate isomerase A, showing a 1 aa insert that is commonly shared by Xanthomonadales except Rhodanobacter sp. 2APBS1.
https://doi.org/10.1371/journal.pone.0055216.s021
(PDF)
Figure S22.
Partial sequence alignment of a conserved region in DNA polymerase I, showing a 1 aa insert that is uniquely shared by a subclade of Xanthomonadales except Rhodanobacter sp. 2APBS1.
https://doi.org/10.1371/journal.pone.0055216.s022
(PDF)
Figure S23.
Partial sequence alignment of aspartyl-tRNA synthetase, showing a 4 aa deletion that is commonly shared by all Xanthomonadales except Rhodanobacter sp. 2APBS1.
https://doi.org/10.1371/journal.pone.0055216.s023
(PDF)
Figure S24.
Partial sequence alignment of a conserved region of 2-oxoglutarate-dehydrogenase E1 component, showing a 1 aa deletion that is unique to Xanthomonadales except Rhodanobacter sp. 2APBS1.
https://doi.org/10.1371/journal.pone.0055216.s024
(PDF)
Figure S25.
Partial sequence alignment of a conserved region of coproporphyrinogen III oxidase, showing a 1 aa deletion that is unique to Xanthomonadales except Rhodanobacter sp. 2APBS1.
https://doi.org/10.1371/journal.pone.0055216.s025
(PDF)
Figure S26.
Partial sequence alignment of a conserved region in 2-oxoglutarate-dehydrogenase E1 component, showing a 1 aa deletion that is uniquely shared by Xanthomonadales except Rhodanobacter sp. 2APBS1.
https://doi.org/10.1371/journal.pone.0055216.s026
(PDF)
Figure S27.
Partial sequence alignment of a conserved region of asparagine synthase b protein that is showing a 4–5 aa insert, unique to Xanthomonadales except Rhodanobacter sp. 2APBS1.
https://doi.org/10.1371/journal.pone.0055216.s027
(PDF)
Figure S28.
Partial sequence alignment of a conserved region of Asparagine synthase b protein, showing a 1–2 aa insert that is uniquely shared by Xanthomonadales except Rhodanobacter sp. 2APBS1. While genus Stenotrophomonas can be differentiated from other Xanthomonadales because of having 1 aa insert instead of 2 aa.
https://doi.org/10.1371/journal.pone.0055216.s028
(PDF)
Figure S29.
Partial sequence alignment of a conserved region DNA polymerase III subunit alpha showing a 4 aa insert that is commonly shared by Xanthomonadales except Rhodanobacter sp. 2APBS1. This CSI was previously identified by [28] as all Xanthomonadales specific signature.
https://doi.org/10.1371/journal.pone.0055216.s029
(PDF)
Figure S30.
Partial sequence alignment of a conserved region of RecA showing a 2 aa insert that is commonly shared by Xanthomonadales except Rhodanobacter sp. 2APBS1. This CSI was previously identified by [28] as all Xanthomonadales specific signature.
https://doi.org/10.1371/journal.pone.0055216.s030
(PDF)
Figure S31.
Partial sequence alignment of a conserved region of MutS showing a 5 aa insert that is commonly shared by Xanthomonadales except Pseudoxanthomonas suwonensis and Rhodanobacter sp. 2APBS1. This CSI was previously identified by [28] as all Xanthomonadales specific signature.
https://doi.org/10.1371/journal.pone.0055216.s031
(PDF)
Figure S32.
Partial sequence alignment of a conserved region of DNA polymerase III subunit alpha, showing a 2 aa deletion that is commonly shared by all Xanthomonadales except Pseudoxanthomonas suwonensis and Rhodanobacter sp. 2APBS1. These two have only 1 aa deletion at the same position.
https://doi.org/10.1371/journal.pone.0055216.s032
(PDF)
Figure S33.
Partial sequence alignment of tRNA (guanine-N(1)-)-methyltransferase, showing a 2 aa insert that is commonly present in all members of Xanthomonadales except Pseudoxanthomonas suwonensis and Rhodanobacter sp. 2APBS1.
https://doi.org/10.1371/journal.pone.0055216.s033
(PDF)
Figure S34.
Partial sequence alignment of a conserved region in glucose-6-phosphate 1-dehydrogenase, showing a 4 aa deletion that is uniquely present in all Xanthomonadales except Pseudoxanthomonas spadix BD-a59 and Rhodanobacter sp. 2APBS1 which has 3 aa insert.
https://doi.org/10.1371/journal.pone.0055216.s034
(PDF)
Figure S35.
Partial sequence alignment of a conserved region of DNA ligase NAD dependent, showing a 57–65 aa insert that is commonly shared by all Xanthomonadales except Pseudoxanthomonas suwonensis and Rhodanobacter sp. 2APBS1. Both these species do not contain the insert of same length. This CSI was previously identified by [28] as all Xanthomonadales specific signature.
https://doi.org/10.1371/journal.pone.0055216.s035
(PDF)
Figure S36.
A Neighbor-joining tree based upon carbamoyl phosphate synthase large subunit sequence. The Tree is showing the distinct branching of Xanthomonadales from various β-Proteobacteria with and without insert.
https://doi.org/10.1371/journal.pone.0055216.s036
(PDF)
Figure S37.
Partial sequence alignment of a conserved region of Hypothetical protein XOO1065, showing a 1 aa deletion that is present in Xanthomonadales. The deletion has also been found to be shared by few species from β-Proteobacteria but not in all of them.
https://doi.org/10.1371/journal.pone.0055216.s037
(PDF)
Figure S38.
Partial sequence alignment of a conserved region of orotate phosphoribosyltransferase, showing a 1 aa deletion that is present in all Xanthomonadales. The deletion has also been found to be shared by species from β-Proteobacteria.
https://doi.org/10.1371/journal.pone.0055216.s038
(PDF)
Figure S39.
Partial sequence alignment of a conserved region of Putative ribonuclease HII, showing a 1 aa insert that is present in Xanthomonadales. The CSI has also been found to be shared by few species from β-Proteobacteria but is not present in all.
https://doi.org/10.1371/journal.pone.0055216.s039
(PDF)
Figure S40.
Partial sequence alignment of a conserved region of glycyl-tRNA synthetase subunit beta, showing a 2 aa insert that is present in Xanthomonadales. The insert has also been found to be shared by some β-Proteobacteria.
https://doi.org/10.1371/journal.pone.0055216.s040
(PDF)
Figure S41.
Partial sequence alignment of a conserved region of the septum-site determining protein MinD, showing a 1 aa deletion that is present in Xanthomonadales. This CSI is also present in some species from β-Proteobacteria.
https://doi.org/10.1371/journal.pone.0055216.s041
(PDF)
Figure S42.
A Neighbor-joining tree based upon sequences from hypothetical protein X001065. The Tree is showing the Xanthomonadales and various β-Proteobacteria that share the 1 aa deletion. Species representing some other Gammaproteobacteria are also shown in tree.
https://doi.org/10.1371/journal.pone.0055216.s042
(PDF)
Figure S43.
A Neighbor-joining tree for Proteobacterial species based upon orotate phosphoribosyltransferase sequences. The Xanthomonadales and different β-Proteobacteria that contain the 1 aa deletion in this protein do not branch together in this tree suggesting that the deletion in these two groups have likely occurred independently. The tree shows only representative species from other Gammaproteobacteria and Alphaproteobacteria and it was rooted using sequences from Epsilonproteobacteria.
https://doi.org/10.1371/journal.pone.0055216.s043
(PDF)
Figure S44.
A Neighbor-joining tree based upon sequences from putative ribonuclease HII. The Tree is showing the Xanthomonadales and various β-Proteobacteria with insert. The tree also shows representative species from other Gammaproteobacteria and Alphaproteobacteria.
https://doi.org/10.1371/journal.pone.0055216.s044
(PDF)
Figure S45.
A maximum-likelihood tree based upon sequences from glycyl-tRNA synthetase subunit beta. The Tree shows the branching of Xanthomonadales separately from the other insert containing Betaproteobacteria. The species distribution of this insert could be explained by either the independent occurrence of a similar genetic event in the Betaproteobacteria and the Xanthomonadales, or that this insert was introduced in a common ancestor of the Beta- and Gamma-proteobacteria, followed by its loss from other Gammaproteobacteria after the divergence of deep-branching Xanthomonadales.
https://doi.org/10.1371/journal.pone.0055216.s045
(PDF)
Figure S46.
A Neighbor-joining tree based upon sequences from septum site-determining protein MinD protein. The Tree is showing the branching of Xanthomonadales distinctly from the other insert containing Betaproteobacteria.
https://doi.org/10.1371/journal.pone.0055216.s046
(PDF)
Author Contributions
Conceived and designed the experiments: RSG. Performed the experiments: RSG HSN. Analyzed the data: HSN RSG. Contributed reagents/materials/analysis tools: RSG. Wrote the paper: RSG HSN.
References
- 1.
Saddler GS, Bradbury J.S (2005) “Order Xanthomonadales” Brenner DJ, Krieg NR, Staley JT editors Bergey’s Manual of Systematic Bacteriology, Vol. 2, 2nd edition, New York: Springer.
- 2.
Brenner DJ, Krieg NR, Staley JT (2005) Bergey’s Manual of Systematic Bacteriology (2005) 2nd edition, Vol. 2, The Proteobacteria, New York: Springer.
- 3. Chatterjee S, Almeida RP, Lindow S (2008) Living in two worlds: the plant and insect lifestyles of Xylella fastidiosa. Annu Rev Phytopathol 46: 243–271.
- 4. Ryan RP, Vorholter FJ, Potnis N, Jones JB, Van Sluys MA, et al. (2011) Pathogenomics of Xanthomonas: understanding bacterium-plant interactions. Nat Rev Microbiol 9: 344–355.
- 5. Lee BM, Park YJ, Park DS, Kang HW, Kim JG, et al. (2005) The genome sequence of Xanthomonas oryzae pathovar oryzae KACC10331, the bacterial blight pathogen of rice. Nucleic Acids Res 33: 577–586.
- 6. Salzberg SL, Sommer DD, Schatz MC, Phillippy AM, Rabinowicz PD, et al. (2008) Genome sequence and rapid evolution of the rice pathogen Xanthomonas oryzae pv. oryzae PXO99A. BMC Genomics 9: 204.
- 7. Van Sluys MA, Monteiro-Vitorello CB, Camargo LE, Menck CF, da Silva AC, et al. (2002) Comparative genomic analysis of plant-associated bacteria. Annu Rev Phytopathol 40: 169–189.
- 8. Bhattacharyya A, Stilwagen S, Ivanova N, D’Souza M, Bernal A, et al. (2002) Whole-genome comparative analysis of three phytopathogenic Xylella fastidiosa strains. Proc Natl Acad Sci U S A 99: 12403–12408.
- 9. Purcell AH, Hopkins DL (1996) Fastidious xylem-limited bacterial plant pathogens. Annu Rev Phytopathol 34: 131–151.
- 10. Chen J, Xie G, Han S, Chertkov O, Sims D, et al. (2010) Whole genome sequences of two Xylella fastidiosa strains (M12 and M23) causing almond leaf scorch disease in California. J Bacteriol 192: 4534.
- 11. Monteiro-Vitorello CB, de Oliveira MC, Zerillo MM, Varani AM, Civerolo E, et al. (2005) Xylella and Xanthomonas Mobil’omics. OMICS 9: 146–159.
- 12. Crossman LC, Gould VC, Dow JM, Vernikos GS, Okazaki A, et al. (2008) The complete genome, comparative and functional analysis of Stenotrophomonas maltophilia reveals an organism heavily shielded by drug resistance determinants. Genome Biol 9: R74.
- 13. Looney WJ, Narita M, Muhlemann K (2009) Stenotrophomonas maltophilia: an emerging opportunist human pathogen. Lancet Infect Dis 9: 312–323.
- 14. Waters V, Yau Y, Prasad S, Lu A, Atenafu E, et al. (2011) Stenotrophomonas maltophilia in Cystic Fibrosis: Serologic Response and Effect on Lung Disease. Am J Respir Crit Care Med 183: 635–640.
- 15. Yarza P, Ludwig W, Euzeby J, Amann R, Schleifer KH, et al. (2010) Update of the All-Species Living Tree Project based on 16S and 23S rRNA sequence analyses. Syst Appl Microbiol 33: 291–299.
- 16. Martins-Pinheiro M, Galhardo RS, Lage C, Lima-Bessa KM, Aires KA, et al. (2004) Different patterns of evolution for duplicated DNA repair genes in bacteria of the Xanthomonadales group. BMC Evol Biol 4: 29.
- 17. Comas I, Moya A, Azad RK, Lawrence JG, Gonzalez-Candelas F (2006) The evolutionary origin of Xanthomonadales genomes and the nature of the horizontal gene transfer process. Mol Biol Evol 23: 2049–2057.
- 18. Dutilh BE, Huynen MA, Bruno WJ, Snel B (2004) The consistent phylogenetic signal in genome trees revealed by reducing the impact of noise. J Mol Evol 58: 527–539.
- 19. Gupta RS, Sneath PHA (2007) Application of the character compatibility approach to generalized molecular sequence data: Branching order of the proteobacterial subdivisions. J Mol Evol 64: 90–100.
- 20. Schneider A, Dessimoz C, Gonnet GH (2007) OMA Browser–exploring orthologous relations across 352 complete genomes. Bioinformatics 23: 2180–2182.
- 21. Gao B, Mohan R, Gupta RS (2009) Phylogenomics and protein signatures elucidating the evolutionary relationships among the Gammaproteobacteria. Int J Syst Evol Microbiol 59: 234–247.
- 22. Williams KP, Gillespie JJ, Sobral BW, Nordberg EK, Snyder EE, et al. (2010) Phylogeny of gammaproteobacteria. J Bacteriol 192: 2305–2314.
- 23. Gupta RS (2000) The phylogeny of Proteobacteria: relationships to other eubacterial phyla and eukaryotes. FEMS Microbiol Rev 24: 367–402.
- 24. Lima WC, Van Sluys MA, Menck CF (2005) Non-gamma-proteobacteria gene islands contribute to the Xanthomonas genome. OMICS 9: 160–172.
- 25. Lima WC, Paquola AC, Varani AM, Van Sluys MA, Menck CF (2008) Laterally transferred genomic islands in Xanthomonadales related to pathogenicity and primary metabolism. FEMS Microbiol Lett 281: 87–97.
- 26. Lima WC, Menck CF (2008) Replacement of the arginine biosynthesis operon in Xanthomonadales by lateral gene transfer. J Mol Evol 66: 266–275.
- 27. Lima WC, Varani AM, Menck CF (2009) NAD biosynthesis evolution in bacteria: lateral gene transfer of kynurenine pathway in Xanthomonadales and Flavobacteriales. Mol Biol Evol 26: 399–406.
- 28. Cutino-Jimenez AM, Martins-Pinheiro M, Lima WC, Martin-Tornet A, Morales OG, et al. (2010) Evolutionary placement of Xanthomonadales based on conserved protein signature sequences. Mol Phylogenet Evol 54: 524–534.
- 29. Thieme F, Koebnik R, Bekel T, Berger C, Boch J, et al. (2005) Insights into genome plasticity and pathogenicity of the plant pathogenic bacterium Xanthomonas campestris pv. vesicatoria revealed by the complete genome sequence. J Bacteriol 187: 7254–7266.
- 30. Qian W, Jia Y, Ren SX, He YQ, Feng JX, et al. (2005) Comparative and functional genomic analyses of the pathogenicity of phytopathogen Xanthomonas campestris pv. campestris. Genome Res 15: 757–767.
- 31. Potnis N, Krasileva K, Chow V, Almeida NF, Patil PB, et al. (2011) Comparative genomics reveals diversity among xanthomonads infecting tomato and pepper. BMC Genomics 12: 146.
- 32. Pieretti I, Royer M, Barbe V, Carrere S, Koebnik R, et al. (2009) The complete genome sequence of Xanthomonas albilineans provides new insights into the reductive genome evolution of the xylem-limited Xanthomonadaceae. BMC Genomics 10: 616.
- 33. Bogdanove AJ, Koebnik R, Lu H, Furutani A, Angiuoli SV, et al. (2011) Two new complete genome sequences offer insight into host and tissue specificity of plant pathogenic Xanthomonas spp. J Bacteriol 193: 5450–5464.
- 34. Van Sluys MA, de Oliveira MC, Monteiro-Vitorello CB, Miyaki CY, Furlan LR, et al. (2003) Comparative analyses of the complete genome sequences of Pierce’s disease and citrus variegated chlorosis strains of Xylella fastidiosa. J Bacteriol 185: 1018–1026.
- 35. Simpson AJ, Reinach FC, Arruda P, Abreu FA, Acencio M, et al. (2000) The genome sequence of the plant pathogen Xylella fastidiosa. The Xylella fastidiosa Consortium of the Organization for Nucleotide Sequencing and Analysis. Nature 406: 151–159.
- 36. Doddapaneni H, Yao J, Lin H, Walker MA, Civerolo EL (2006) Analysis of the genome-wide variations among multiple strains of the plant pathogenic bacterium Xylella fastidiosa. BMC Genomics 7: 225.
- 37. Gupta RS (1998) Protein phylogenies and signature sequences: A reappraisal of evolutionary relationships among archaebacteria, eubacteria, and eukaryotes. Microbiol Mol Biol Rev 62: 1435–1491.
- 38. Gupta RS, Griffiths E (2002) Critical issues in bacterial phylogeny. Theor Popul Biol 61: 423–434.
- 39. Griffiths E, Gupta RS (2006) Molecular signatures in protein sequences that are characteristics of the Phylum Aquificales. Int J Syst Evol Microbiol 56: 99–107.
- 40. Gupta RS (2009) Protein signatures (molecular synapomorphies) that are distinctive characteristics of the major cyanobacterial clades. Int J Syst Evol Microbiol 59: 2510–2526.
- 41. Rokas A, Holland PW (2000) Rare genomic changes as a tool for phylogenetics. Trends Ecol Evol 15: 454–459.
- 42. Baldauf SL, Palmer JD (1993) Animals and fungi are each other’s closest relatives: congruent evidence from multiple proteins. Proc Natl Acad Sci USA 90: 11558–11562.
- 43. Rivera MC, Lake JA (1992) Evidence that eukaryotes and eocyte prokaryotes are immediate relatives. Science 257: 74–76.
- 44. Gupta RS (2010) Molecular signatures for the main phyla of photosynthetic bacteria and their subgroups. Photosynth Res 104: 357–372.
- 45. Gupta RS (2001) The branching order and phylogenetic placement of species from completed bacterial genomes, based on conserved indels found in various proteins. Int Microbiol 4: 187–202.
- 46. Griffiths E, Gupta RS (2006) Lateral transfers of serine hydroxymethyl transferase (glyA) and UDP-N-acetylglucosamine enolpyruvyl transferase (murA) genes from free-living Actinobacteria to the parasitic chlamydiae. J Mol Evol 63: 283–296.
- 47. Harris JK, Kelley ST, Spiegelman GB, Pace NR (2003) The genetic core of the universal ancestor. Genome Res 13: 407–412.
- 48. Ciccarelli FD, Doerks T, von Mering C, Creevey CJ, Snel B, et al. (2006) Toward automatic reconstruction of a highly resolved tree of life. Science 311: 1283–1287.
- 49. Jeanmougin F, Thompson JD, Gouy M, Higgins DG, Gibson TJ (1998) Multiple sequence alignment with Clustal x. Trends Biochem Sci 23: 403–405.
- 50. Castresana J (2000) Selection of conserved blocks from multiple alignments for their use in phylogenetic analysis. Mol Biol Evol 17: 540–552.
- 51. Jones DT, Taylor WR, Thornton JM (1992) The rapid generation of mutation data matrices from protein sequences. Computer applications in the biosciences : CABIOS 8: 275–282.
- 52. Tamura K, Dudley J, Nei M, Kumar S (2007) MEGA4: Molecular Evolutionary Genetics Analysis (MEGA) software version 4.0. Mol Biol Evol 24: 1596–1599.
- 53. Whelan S, Goldman N (2001) A general empirical model of protein evolution derived from multiple protein families using a maximum-likelihood approach. Mol Biol Evol 18: 691–699.
- 54. Naushad HS, Gupta RS (2012) Molecular signatures (conserved indels) in protein sequences that are specific for the order Pasteurellales and distinguish two of its main clades. Antonie van Leeuwenhoek 101: 105–124.
- 55. Jalan N, Aritua V, Kumar D, Yu F, Jones JB, et al. (2011) Comparative genomic analysis of Xanthomonas axonopodis pv. citrumelo F1, which causes citrus bacterial spot disease, and related strains provides insights into virulence and host specificity. J Bacteriol 193: 6342–6357.
- 56. da Silva AC, Ferro JA, Reinach FC, Farah CS, Furlan LR, et al. (2002) Comparison of the genomes of two Xanthomonas pathogens with differing host specificities. Nature 417: 459–463.
- 57. Schreiber HL, Koirala M, Lara A, Ojeda M, Dowd SE, et al. (2010) Unraveling the First Xylella fastidiosa Subsp. Fastidiosa Genome from Texas. Southwestern Entomologist 35: 479–483.
- 58. Lee SH, Jin HM, Lee HJ, Kim JM, Jeon CO (2012) Complete Genome Sequence of the BTEX-Degrading Bacterium Pseudoxanthomonas spadix BD-a59. J Bacteriol 194: 544.
- 59. Parkinson N, Cowie C, Heeney J, Stead D (2009) Phylogenetic structure of Xanthomonas determined by comparison of gyrB sequences. Int J Syst Evol Microbiol 59: 264–274.
- 60. Handy J, Doolittle RF (1999) An attempt to pinpoint the phylogenetic introduction of glutaminyl-tRNA synthetase among bacteria. Journal of Molecular Evolution 49: 709–715.
- 61. Gupta RS, Mathews DW (2010) Signature proteins for the major clades of Cyanobacteria. BMC Evol Biol 10: 24.
- 62. Parkinson N, Aritua V, Heeney J, Cowie C, Bew J, et al. (2007) Phylogenetic analysis of Xanthomonas species by comparison of partial gyrase B gene sequences. Int J Syst Evol Microbiol 57: 2881–2887.
- 63. Young JM, Park DC, Shearman HM, Fargier E (2008) A multilocus sequence analysis of the genus Xanthomonas. Syst Appl Microbiol 31: 366–377.
- 64. Gao B, Gupta RS (2005) Conserved indels in protein sequences that are characteristic of the phylum Actinobacteria. Int J Syst Evol Microbiol 55: 2401–2412.
- 65. Gupta RS (2011) Origin of diderm (Gram-negative) bacteria: antibiotic selection pressure rather than endosymbiosis likely led to the evolution of bacterial cells with two membranes. Antonie van Leeuwenhoek 100: 171–182.
- 66. Gao B, Gupta RS (2012) Microbial systematics in the post-genomics era. Antonie van Leeuwenhoek 101: 45–54.
- 67. Griffiths E, Gupta RS (2002) Protein signatures distinctive of chlamydial species: Horizontal transfer of cell wall biosynthesis genes glmU from Archaebacteria to Chlamydiae, and murA between Chlamydiae and Streptomyces. Microbiology 148: 2541–2549.