The SAR11 Group of Alpha-Proteobacteria Is Not Related to the Origin of Mitochondria

Naiara Rodríguez-Ezpeleta; T. Martin Embley

doi:10.1371/journal.pone.0030520

Abstract

Although free living, members of the successful SAR11 group of marine alpha-proteobacteria contain a very small and A+T rich genome, two features that are typical of mitochondria and related obligate intracellular parasites such as the Rickettsiales. Previous phylogenetic analyses have suggested that Candidatus Pelagibacter ubique, the first cultured member of this group, is related to the Rickettsiales+mitochondria clade whereas others disagree with this conclusion. In order to determine the evolutionary position of the SAR11 group and its relationship to the origin of mitochondria, we have performed phylogenetic analyses on the concatenation of 24 proteins from 5 mitochondria and 71 proteobacteria. Our results support that SAR11 group is not the sistergroup of the Rickettsiales+mitochondria clade and confirm that the position of this group in the alpha-proteobacterial tree is strongly affected by tree reconstruction artefacts due to compositional bias. As a consequence, genome reduction and bias toward a high A+T content may have evolved independently in the SAR11 species, which points to a different direction in the quest for the closest relatives to mitochondria and Rickettsiales. In addition, our analyses raise doubts about the monophyly of the newly proposed Pelagibacteraceae family.

Citation: Rodríguez-Ezpeleta N, Embley TM (2012) The SAR11 Group of Alpha-Proteobacteria Is Not Related to the Origin of Mitochondria. PLoS ONE 7(1): e30520. https://doi.org/10.1371/journal.pone.0030520

Editor: Purificación López-García, Université Paris Sud, France

Received: July 4, 2011; Accepted: December 18, 2011; Published: January 23, 2012

Copyright: © 2012 Rodríguez-Ezpeleta, Embley. This is an open-access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited.

Funding: NR-E was supported by the Programa de Formación de Investigadores del Departamento de Educación, Universidades e Investigación (Government of Basque Country). TME was supported by a Royal Society Wolfson Research Merit Award. The funders had no role in study design, data collection and analysis, decision to publish, or preparation of the manuscript.

Competing interests: The authors have declared that no competing interests exist.

Introduction

According to the endosymbiotic theory, mitochondria and related organelles such as hydrogenosomes and mitosomes arose from an endosymbiotic bacterium [1], [2], [3]. Phylogenetic analyses based on mitochondrial-encoded proteins have well established that this bacterium was a member of the alpha-proteobacterial division [4], [5], and although the exact position of the mitochondrion within the alpha-proteobacteria is still debated [6], most of the trees published place the mitochondrial ancestor within or as sister group to the intracellular obligate parasitic Rickettsiales [7], [8], [9], [10].

Within the alpha-proteobacteria, the SAR11 lineage of free-living marine organisms has received great attention since its discovery in the early 1990s for being among the most successful organisms on the planet [11], [12]. Phylogenetic trees based on four bacterial encoded proteins place Candidatus Pelagibacter ubique, the first cultured organism of this group, within a cluster of alpha-proteobacteria that excludes Rhodospirillales and Rickettsiales [12], whereas another study, based on 16 bacterial/mitochondrial proteins places this species as sister group of the mitochondria/Rickettsiales clade [9]. Recently, the controversy has been revigorated up by four other studies, published almost simultaneously, that support alternative placements for the SAR11 group: Thrash et al. [13] and Georgiades et al. [14], who conclude that the SAR11 group belongs to the mitochondria/Rickettsiales clade, and Brindefalk et al. [15] and Viklund et al. [16], who argue that these two groups are not related.

From the four studies, only Thrash et al [13] include more representatives of the SAR11 group other than Ca. Pelagibacter ubique in analyses based on several genes, the others including only this species to represent the whole group. Moreover, Brindefalk et al [15] and Georgiades et al [14] use a single species (Reclinomonas americana) to represent mitochondria in their analyses based on several genes, and Viklund et al [16] does not include this organelle in their inferences. On the other hand, Thrash et al [13] and Georgiades et al [14] did not use sophisticated models of sequence evolution that take compositional bias into account, as did Viklund et al [16], and Brindefalk et al [15]. Finally, the conclusions drawn from some of these studies are derived from combining modestly supported results [13] or from deciding on one of the contradictory outcomes obtained (see Figures 1 and S2A of Viklund et al. 2011).

Download:

Figure 1. Nucleotide and amino acid composition of the dataset used in this study.

A) Bar graph displaying the percentage of A+T in the concatenation of the nucleotide sequences. B) Reduced dimensionality plot showing the main principal components of the global amino acid compositions. Dots are coloured as in Figure 1A. The group of overlapping dots at the bottom right corner contains the species with the lowest A+T %. The variances that explain the two first axes are respectively 92.3% and 6.3%.

https://doi.org/10.1371/journal.pone.0030520.g001

Therefore, in order to settle the controversy on the position of the SAR11 group within the alpha-proteobacteria and its relation to the origin of mitochondria, phylogenetic analyses based on a broad taxon sampling, including more available members of the SAR11 group, and using sophisticated models of sequence evolution are necessary. Here, we have assembled a dataset of 24 evolutionarily conserved orthologous proteins from 5 mitochondria, 62 diverse alpha-proteobacteria, including 6 members of the SAR11 group, and 9 other bacteria as outgroup. Our results confirm that the SAR11 group does not belong to, nor is sister group of, the mitochondria/Rickettsiales clade.

Results and Discussion

Phylogenies based upon nucleotide data

Shared high percentage of A+T in rickettsial and mitochondrial genomes has been blamed for the difficulties in confidently identifying the closest relatives to michochondria [8], [10]. This shared nucleotide composition is illustrated in Figure 1A, where the high proportions of A+T of mitochondrial and rickettsial genomes are noticeable; the same pattern is also observed in members of the SAR11 group. Remarkably, in a phylogenetic analysis based on nucleotide data, the effect of the compositional bias is so great that the assemblages obtained seem to reflect nucleotide composition more than evolutionary relationships; for example, the monophyly of the outgroup not recovered (Supporting Information S1 and S2). Nucleotide compositional bias is a well documented source of phylogenetic inference artefacts that, if not taken into account in the evolutionary model used, may negatively influence tree reconstruction [17]. The RY coding [18], which consists on grouping purines (A&G) and pyrimidines (C&T) in two character states (R and Y) prior to phylogenetic inference, can sometimes reduce the effect of compositional bias [19], [20]. In our case, this approach has a positive effect on the obtained tree (Supporting Information S3), which looks more consistent with current ideas for alpha-proteobacterial phylogeny [21] than the tree built using the four character states; additionally, it strongly supports the grouping of mitochondria, rickettsiales and the SAR11 group. This result is in accord with the conclusions of Williams, Sobral, and Dickerman [9], Thrash et al. [13] and Geogiades et al. [14].

Phylogenies based upon amino acid data

Biases in nucleotide proportions affect amino acid composition [22], an effect that is also observed in our dataset (Figure 1B). Yet the phylogeny obtained with amino acid data is not as biased as the one obtained with nucleotide data and recovers all major alpha-proteobacterial groups and the relationships within them using both Bayesian (Figure 2) and ML inferences (Supporting Information S4). This is expected given that amino acids generally have more character states and are therefore less prone to homoplasy than nucleotides. In the Bayesian tree constructed based on the WAG matrix for amino acid substitution, mitochondria and Rickettsiales are not monophyletic nor are any of them are related to the SAR11 group. Interestingly, applied to our dataset, the site heterogeneous mixture model CAT [23] drastically changes the position of the SAR11 group, which is now placed within a cluster of alpha-proteobacteria that excludes Rickettsiales and Rhodospirillales. Additionally, the sister-group relationship of mitochondria and Rickettsiales is strongly supported in this case (Figure 2).

Download:

Figure 2. Phylogeny based on 24 mitochondrial/bacterial proteins (6,542 amino acid positions) inferred by Bayesian Inference with the WAG + F (left) or the CAT mixture (right) model.

Numbers indicate posterior probability values. Branches without values are supported by posterior probabilities of 1.0. The scale bar denotes the estimated number of amino acid substitution per site. See supplementary material for complete species names and proteins used.

https://doi.org/10.1371/journal.pone.0030520.g002

Analogous to the RY coding for nucleotides, the Dayhoff coding in functional categories has been proposed to reduce amino acid compositional biases [24]. Applied to our dataset, this method does not have a remarkable effect in the tree based on the WAG matrix (Figure 3 and Supporting Information S5). However, when the CAT model is employed on the Dayhoff recoded dataset, an even more drastic change in the position of the SAR11 group is observed: all members of the group but one are placed as a monophyletic assembly in a cluster that excludes Rickettsiales, Rhodospirillales and Sphingomonadales (Figure 3). Surprisingly, alpha proteobacterium HIMB59 is not placed within the SAR11 in this case and appears related to the SAR116 group member Candidatus Puniceispirillum IMCC1322.

Download:

Figure 3. Phylogeny based on 24 mitochondrial/bacterial proteins (6,542 amino acid positions) inferred by Bayesian Inference with the GTR + F (left) or the CAT mixture (right) model on the Dayhoff recoded dataset.

(see Methods for details). Branches without values are supported by posterior probabilities of 1.0. The scale bar denotes the estimated number of amino acid substitution per site. See supplementary material for complete species names and proteins used.

https://doi.org/10.1371/journal.pone.0030520.g003

Alpha proteobacterium HIMB59 may not be a member of the SAR11 group

In order to further understand the evolutionary position of alpha proteobacterim HIMB59 and the effect of the inclusion of this species on the position of the SAR11 group, we performed phylogenetic analyses excluding this taxon or having it as the sole representative of the SAR11 group. As shown in Figure 4, excluding HIMB59 makes the result from the non recoded and recoded datasets more similar to each other, specially in the case of the CAT mixture model, where both datasets place the SAR11 in a cluster that excludes Rickettsiales, Rhodospirillales and Sphingomonadales. In turn, when only HIMB59 is included to represent the SAR11 group, this species maintains the evolutionary position observed in Figures 2 and 3 under both models and data recodings used (Supporting Information S6, S7, S8 and S9). This results suggests that HIMB59 may not be a member of the SAR11 group and that the definition of the family Pelagibacteraceae, fam. nov. recently proposed by Thrash et al. [13], needs to be revisited. Indeed, in their analyses, Thrash et al. [13] also observed some instability of HIMB59, which in some cases branched with mitochondria.

Download:

Figure 4. Phylogeny based on 24 mitochondrial/bacterial proteins (6,542 amino acid positions) inferred by Bayesian Inference with the WAG/GTR + F (left) or the CAT mixture (right) model on the non recoded and Dayhoff recoded dataset.

(see Methods for details). Branches without values are supported by posterior probabilities of 1.0. When at least one dataset gives posterior probability <1, both values are shown, standard coding on the left and Dayhoff coding on the right. The scale bar denotes the estimated number of amino acid substitution per site. See supplementary material for complete species names and proteins used.

https://doi.org/10.1371/journal.pone.0030520.g004

Taxon sampling and compositional bias affect the positioning of the SAR11 group and of HIMB59 in phylogenetic trees

Among the competing alternative positions for the SAR11 group observed in our analyses based on amino acid data, none supports the conclusions of Williams, Sobral, and Dickerman [9], Thrash et al. [13] and Geogiades et al. [14], who suggested a common ancestor of mitochondria and the SAR11 group. This is also true when using %AT rich or poor outgroups (Supporting Information S10 and S11). Only the tree based on nucleotide data points to this direction, implying that the sister group of mitochondria/Rickettsiales with the SAR11 group may be the result of a tree reconstruction artefact caused by compositional bias. Indeed, as we apply methods to correct for compositional bias such as Dayhoff recoding or use sophisticated models of evolution that take site-specific compositional heterogeneity into account such as the CAT model, the SAR11 group gets further and further away from mitochondria and Rickettsiales and branch deeper in the alpha-mitochondrial tree. This may indicate that the true evolutionary position of the SAR11 group is within a group of alpha-protebacteria that excludes Rickettsiales, Rhodospirillales and Sphingomonadales, but that the shared compositional bias of mitochondrial, Rickettsiales and the SAR11 sequences causes the SAR11 group be attracted to the other two groups. This idea is supported by the analyses of Viklund et al. [16], who pointed to a compositional bias as responsible for the traditional relationship of Pelagibacter with Rickettsiales. Similarly, the positioning of HIMB59 within the SAR11 group may also be caused by an artefact due to shared amino acid composition between this taxa and the SAR11 group (see Figure 1B).

Outlook

The genome of Candidatus Pelagibacter ubique is among the smallest to replicate independently and has an extremely high percentage of A+T content [12], two features that are often observed in obligate intracellular parasites such as Rickettsiales [25], but which are more surprising in a free-living marine organism [26]. These two features together with the phylogenetic analyses of Williams, Sobral, and Dickerman [9] have lead some authors to label Pelagibacter as the closest free-living outgroup to Rickettsiales [13], [27], [28], implying that mitochondria diverged from the alpha-proteobacteria at some time between the divergence of a marine clade and a strictly intracellular lineage adapted to various eukaryotes.

The analyses presented here suggest however that the SAR11 group is not specifically related to the Rickettsiales, and that genome reduction and bias toward a high A+T content have evolved independently in both lineages. Indeed, it has already been suggested that a reduction in genome size, which is often accompanied by an increase in A+T content, may be a selective advantage in the open ocean where nutrients are scarce [26]. This may explain the presence of these exceptional parasite-type features in Pelagibacter and why this species belongs to one of the most successful groups of organisms in the planet. Additionally, some features common to all Rickettsiales that are not present in Pelagibacter and vice versa support the view that those two lineages are not related. For example, Rickettsiales contain an unusual histidyl t-RNA synthetase that is not present in Pelagibacter [29], and Pelagibacter contains a particular type IV secretion system [30], a 2/2 Hb1 globin [31], a unique glycine activated and a SAM-V riboswitch [32], [33], and a signal recognition particle protein [27] that are not present in Rickettsiales.

In conclusion, our analyses based on a broad taxon sampling including several members of the SAR11 group are consistent with the current view that Rickettsiales are the closest relatives to mitochondria, but they do not support a close relationship of Pelagibacter and the SAR11 group to the origin of this clade. Therefore, if the potential relationship of SAR11 to mitochondria has been used as an argument to support the search for mitochondrial related organisms in marine environments [9], [13], the alternative placement of SAR11shown here should also encourage research focused on how genome reduction evolves in free-living organisms.

Methods

Dataset construction

Starting from the 67 protein coding genes of the Reclinomonas americana mitochondrial genome, BLASTP searches were performed on the 539 complete alpha-, beta- and gamma-proteobacterial genomes available as of September 2011 in GenBank, on 5 slowly evolving mitochondrial genomes and on five additionaly SAR11 group genomes retrieved form GenBank (Candidatus Pelagibacter ubique HTCC 1002, Candidatus Pelagibacter sp HTCC7211 and alpha proteobacterium HIMB114) and from the JCVI (HIMB5 and HIMB59). All protein sequences with a Blast e-value lower than 10⁻⁴ were retrieved. Each set of sequences was aligned at the amino acid level with Muscle [34], manually refined with ED [35], and trimmed of unambiguous aligned blocks of positions with Gblocks [36] with the following parameters: a minimum of 50% and 75% of sequences identical for a conserved or flanking position respectively and a maximum of 5 contiguous non conserved positions and a minimum of 5 positions for a block. Selected blocks were manually verified and introduced in the dataset if missing data was responsible for the automatic removal. Nine beta- and gamma- proteobacteria representing a broad range of %GC content were selected as outgroup, and 62 proteobacteria were selected by selecting one species per genera and by removing the ones that had more than 50% of missing data (Supporting Information S12). The 24 genes that contained at least three eukaryotes and that lacked at most 14 of the 76 selected species were retained for further analyses (Supporting Information S13). Once orthologous sequences selected with SCaFoS [37], new alignments were performed with Muscle and trimmed with Gblocks (same parameters as above) at the protein level. The corresponding nucleotide sequences were extracted using in house software, and alignments, including trimmed sites, were matched to the protein datasets with Revtrans [38] and in house software. The concatenation of the 24 protein coding genes comprises a total of 6,542 amino acids and 19,626 nucleotide positions. 7% of the data is missing (see Supporting Information S12 and S13). The amino acid composition bias of the taxa in the dataset was visualized by assembling a 76×20 matrix containing the percentage of each amino acid per species using the NET program [35]. This matrix is displayed as a two dimensional plot in a Principal Component Analysis (PCA).

Phylogenetic analyses

Author Contributions

Conceived and designed the experiments: NR-E TME. Performed the experiments: NR-E. Analyzed the data: NR-E. Wrote the paper: NR-E TME.

References

1. Woese CR (1987) Bacterial evolution. Microbiol Rev 51: 221–271.
- View Article
- Google Scholar
2. Embley TM (2006) Multiple secondary origins of the anaerobic lifestyle in eukaryotes. Philos Trans R Soc Lond B Biol Sci 361: 1055–1067.
- View Article
- Google Scholar
3. Stechmann A, Hamblin K, Perez-Brocal V, Gaston D, Richmond GS, et al. (2008) Organelles in Blastocystis that blur the distinction between mitochondria and hydrogenosomes. Curr Biol 18: 580–585.
- View Article
- Google Scholar
4. Andersson SG, Zomorodipour A, Andersson JO, Sicheritz-Ponten T, Alsmark UC, et al. (1998) The genome sequence of Rickettsia prowazekii and the origin of mitochondria. Nature 396: 133–140.
- View Article
- Google Scholar
5. Lang BF, Gray MW, Burger G (1999) Mitochondrial genome evolution and the origin of eukaryotes. Annu Rev Genet 33: 351–397.
- View Article
- Google Scholar
6. Esser C, Ahmadinejad N, Wiegand C, Rotte C, Sebastiani F, et al. (2004) A genome phylogeny for mitochondria among alpha-proteobacteria and a predominantly eubacterial ancestry of yeast nuclear genes. Mol Biol Evol 21: 1643–1660.
- View Article
- Google Scholar
7. Davidov Y, Huchon D, Koval SF, Jurkevitch E (2006) A new alpha-proteobacterial clade of Bdellovibrio-like predators: implications for the mitochondrial endosymbiotic theory. Environ Microbiol 8: 2179–2188.
- View Article
- Google Scholar
8. Fitzpatrick DA, Creevey CJ, McInerney JO (2006) Genome phylogenies indicate a meaningful alpha-proteobacterial phylogeny and support a grouping of the mitochondria with the Rickettsiales. Mol Biol Evol 23: 74–85.
- View Article
- Google Scholar
9. Williams KP, Sobral BW, Dickerman AW (2007) A robust species tree for the alphaproteobacteria. J Bacteriol 189: 4578–4586.
- View Article
- Google Scholar
10. Wu M, Sun LV, Vamathevan J, Riegler M, Deboy R, et al. (2004) Phylogenomics of the reproductive parasite Wolbachia pipientis wMel: a streamlined genome overrun by mobile genetic elements. PLoS Biol 2: E69.
- View Article
- Google Scholar
11. Giovannoni SJ, Britschgi TB, Moyer CL, Field KG (1990) Genetic diversity in Sargasso Sea bacterioplankton. Nature 345: 60–63.
- View Article
- Google Scholar
12. Giovannoni SJ, Tripp HJ, Givan S, Podar M, Vergin KL, et al. (2005) Genome streamlining in a cosmopolitan oceanic bacterium. Science 309: 1242–1245.
- View Article
- Google Scholar
13. Thrash JC, Boyd A, Huggett MJ, Grote J, Carini P, et al. (2011) Phylogenomic evidence for a common ancestor of mitochondria and the SAR11 clade. Scientific Reports 1:
- View Article
- Google Scholar
14. Georgiades K, Madoui MA, Le P, Robert C, Raoult D (2011) Phylogenomic Analysis of Odyssella thessalonicensis Fortifies the Common Origin of Rickettsiales, Pelagibacter ubique and Reclimonas americana Mitochondrion. PLoS One 6: e24857.
- View Article
- Google Scholar
15. Brindefalk B, Ettema TJ, Viklund J, Thollesson M, Andersson SG (2011) A Phylometagenomic Exploration of Oceanic Alphaproteobacteria Reveals Mitochondrial Relatives Unrelated to the SAR11 Clade. PLoS One 6: e24457.
- View Article
- Google Scholar
16. Viklund J, Ettema TJ, Andersson SG (2011) Independent Genome Reduction and Phylogenetic Reclassification of the Oceanic SAR11 Clade. Mol Biol Evol. In press.
- View Article
- Google Scholar
17. Foster PG (2004) Modeling compositional heterogeneity. Syst Biol 53: 485–495.
- View Article
- Google Scholar
18. Woese CR, Achenbach L, Rouviere P, Mandelco L (1991) Archaeal phylogeny: reexamination of the phylogenetic position of Archaeoglobus fulgidus in light of certain composition-induced artifacts. Syst Appl Microbiol 14: 364–371.
- View Article
- Google Scholar
19. Phillips MJ, Delsuc F, Penny D (2004) Genome-scale phylogeny and the detection of systematic biases. Mol Biol Evol 21: 1455–1458.
- View Article
- Google Scholar
20. Jeffroy O, Brinkmann H, Delsuc F, Philippe H (2006) Phylogenomics: the beginning of incongruence? Trends Genet 22: 225–231.
- View Article
- Google Scholar
21. Gupta RS, Mok A (2007) Phylogenomics and signature proteins for the alpha proteobacteria and its main groups. BMC Microbiol 7: 106.
- View Article
- Google Scholar
22. Foster PG, Jermiin LS, Hickey DA (1997) Nucleotide composition bias affects amino acid content in proteins coded by animal mitochondria. J Mol Evol 44: 282–288.
- View Article
- Google Scholar
23. Lartillot N, Philippe H (2004) A Bayesian mixture model for across-site heterogeneities in the amino-acid replacement process. Mol Biol Evol 21: 1095–1109.
- View Article
- Google Scholar
24. Hrdy I, Hirt RP, Dolezal P, Bardonova L, Foster PG, et al. (2004) Trichomonas hydrogenosomes contain the NADH dehydrogenase module of mitochondrial complex I. Nature 432: 618–622.
- View Article
- Google Scholar
25. Moran NA (2003) Tracing the evolution of gene loss in obligate bacterial symbionts. Curr Opin Microbiol 6: 512–518.
- View Article
- Google Scholar
26. Dufresne A, Garczarek L, Partensky F (2005) Accelerated evolution associated with genome reduction in a free-living prokaryote. Genome Biol 6: R14.
- View Article
- Google Scholar
27. Meyer MM, Ames TD, Smith DP, Weinberg Z, Schwalbach MS, et al. (2009) Identification of candidate structured RNAs in the marine organism ‘Candidatus Pelagibacter ubique’. BMC Genomics 10: 268.
- View Article
- Google Scholar
28. Gillespie JJ, Williams K, Shukla M, Snyder EE, Nordberg EK, et al. (2008) Rickettsia phylogenomics: unwinding the intricacies of obligate intracellular life. PLoS One 3: e2018.
- View Article
- Google Scholar
29. Wang C, Sobral BW, Williams KP (2007) Loss of a universal tRNA feature. J Bacteriol 189: 1954–1962.
- View Article
- Google Scholar
30. Gillespie JJ, Brayton KA, Williams KP, Diaz MA, Brown WC, et al. (2010) Phylogenomics reveals a diverse Rickettsiales type IV secretion system. Infect Immun 78: 1809–1823.
- View Article
- Google Scholar
31. Vinogradov SN, Hoogewijs D, Bailly X, Arredondo-Peter R, Gough J, et al. (2006) A phylogenomic profile of globins. BMC Evol Biol 6: 31.
- View Article
- Google Scholar
32. Tripp HJ, Schwalbach MS, Meyer MM, Kitner JB, Breaker RR, et al. (2009) Unique glycine-activated riboswitch linked to glycine-serine auxotrophy in SAR11. Environ Microbiol 11: 230–238.
- View Article
- Google Scholar
33. Poiata E, Meyer MM, Ames TD, Breaker RR (2009) A variant riboswitch aptamer class for S-adenosylmethionine common in marine bacteria. RNA 15: 2046–2056.
- View Article
- Google Scholar
34. Edgar RC (2004) MUSCLE: a multiple sequence alignment method with reduced time and space complexity. BMC Bioinformatics 5: 113.
- View Article
- Google Scholar
35. Philippe H (1993) MUST, a computer package of Management Utilities for Sequences and Trees. Nucleic Acids Res 21: 5264–5272.
- View Article
- Google Scholar
36. Talavera G, Castresana J (2007) Improvement of phylogenies after removing divergent and ambiguously aligned blocks from protein sequence alignments. Syst Biol 56: 564–577.
- View Article
- Google Scholar
37. Roure B, Rodriguez-Ezpeleta N, Philippe H (2007) SCaFoS: a tool for selection, concatenation and fusion of sequences for phylogenomics. BMC Evol Biol 7: Suppl 1S2.
- View Article
- Google Scholar
38. Wernersson R, Pedersen AG (2003) RevTrans: Multiple alignment of coding DNA from aligned amino acid sequences. Nucleic Acids Res 31: 3537–3539.
- View Article
- Google Scholar
39. Stamatakis A, Ludwig T, Meier H (2005) RAxML-III: a fast program for maximum likelihood-based inference of large phylogenetic trees. Bioinformatics 21: 456–463.
- View Article
- Google Scholar
40. Lartillot N, Lepage T, Blanquart S (2009) PhyloBayes 3: a Bayesian software package for phylogenetic reconstruction and molecular dating. Bioinformatics 25: 2286–2288.
- View Article
- Google Scholar
41. Rodriguez-Ezpeleta N, Brinkmann H, Roure B, Lartillot N, Lang BF, et al. (2007) Detecting and overcoming systematic errors in genome-scale phylogenies. Syst Biol 56: 389–399.
- View Article
- Google Scholar
42. Lartillot N, Brinkmann H, Philippe H (2006) Suppression of long-branch attraction artefacts in the animal phylogeny using a site-heterogeneous model. BMC Evol Biol. In press.
- View Article
- Google Scholar

[ref1] 1. Woese CR (1987) Bacterial evolution. Microbiol Rev 51: 221–271.
View Article
Google Scholar

[2] View Article

[3] Google Scholar

[ref2] 2. Embley TM (2006) Multiple secondary origins of the anaerobic lifestyle in eukaryotes. Philos Trans R Soc Lond B Biol Sci 361: 1055–1067.
View Article
Google Scholar

[5] View Article

[6] Google Scholar

[ref3] 3. Stechmann A, Hamblin K, Perez-Brocal V, Gaston D, Richmond GS, et al. (2008) Organelles in Blastocystis that blur the distinction between mitochondria and hydrogenosomes. Curr Biol 18: 580–585.
View Article
Google Scholar

[8] View Article

[9] Google Scholar

[ref4] 4. Andersson SG, Zomorodipour A, Andersson JO, Sicheritz-Ponten T, Alsmark UC, et al. (1998) The genome sequence of Rickettsia prowazekii and the origin of mitochondria. Nature 396: 133–140.
View Article
Google Scholar

[11] View Article

[12] Google Scholar

[ref5] 5. Lang BF, Gray MW, Burger G (1999) Mitochondrial genome evolution and the origin of eukaryotes. Annu Rev Genet 33: 351–397.
View Article
Google Scholar

[14] View Article

[15] Google Scholar

[ref6] 6. Esser C, Ahmadinejad N, Wiegand C, Rotte C, Sebastiani F, et al. (2004) A genome phylogeny for mitochondria among alpha-proteobacteria and a predominantly eubacterial ancestry of yeast nuclear genes. Mol Biol Evol 21: 1643–1660.
View Article
Google Scholar

[17] View Article

[18] Google Scholar

[ref7] 7. Davidov Y, Huchon D, Koval SF, Jurkevitch E (2006) A new alpha-proteobacterial clade of Bdellovibrio-like predators: implications for the mitochondrial endosymbiotic theory. Environ Microbiol 8: 2179–2188.
View Article
Google Scholar

[20] View Article

[21] Google Scholar

[ref8] 8. Fitzpatrick DA, Creevey CJ, McInerney JO (2006) Genome phylogenies indicate a meaningful alpha-proteobacterial phylogeny and support a grouping of the mitochondria with the Rickettsiales. Mol Biol Evol 23: 74–85.
View Article
Google Scholar

[23] View Article

[24] Google Scholar

[ref9] 9. Williams KP, Sobral BW, Dickerman AW (2007) A robust species tree for the alphaproteobacteria. J Bacteriol 189: 4578–4586.
View Article
Google Scholar

[26] View Article

[27] Google Scholar

[ref10] 10. Wu M, Sun LV, Vamathevan J, Riegler M, Deboy R, et al. (2004) Phylogenomics of the reproductive parasite Wolbachia pipientis wMel: a streamlined genome overrun by mobile genetic elements. PLoS Biol 2: E69.
View Article
Google Scholar

[29] View Article

[30] Google Scholar

[ref11] 11. Giovannoni SJ, Britschgi TB, Moyer CL, Field KG (1990) Genetic diversity in Sargasso Sea bacterioplankton. Nature 345: 60–63.
View Article
Google Scholar

[32] View Article

[33] Google Scholar

[ref12] 12. Giovannoni SJ, Tripp HJ, Givan S, Podar M, Vergin KL, et al. (2005) Genome streamlining in a cosmopolitan oceanic bacterium. Science 309: 1242–1245.
View Article
Google Scholar

[35] View Article

[36] Google Scholar

[ref13] 13. Thrash JC, Boyd A, Huggett MJ, Grote J, Carini P, et al. (2011) Phylogenomic evidence for a common ancestor of mitochondria and the SAR11 clade. Scientific Reports 1:
View Article
Google Scholar

[38] View Article

[39] Google Scholar

[ref14] 14. Georgiades K, Madoui MA, Le P, Robert C, Raoult D (2011) Phylogenomic Analysis of Odyssella thessalonicensis Fortifies the Common Origin of Rickettsiales, Pelagibacter ubique and Reclimonas americana Mitochondrion. PLoS One 6: e24857.
View Article
Google Scholar

[41] View Article

[42] Google Scholar

[ref15] 15. Brindefalk B, Ettema TJ, Viklund J, Thollesson M, Andersson SG (2011) A Phylometagenomic Exploration of Oceanic Alphaproteobacteria Reveals Mitochondrial Relatives Unrelated to the SAR11 Clade. PLoS One 6: e24457.
View Article
Google Scholar

[44] View Article

[45] Google Scholar

[ref16] 16. Viklund J, Ettema TJ, Andersson SG (2011) Independent Genome Reduction and Phylogenetic Reclassification of the Oceanic SAR11 Clade. Mol Biol Evol. In press.
View Article
Google Scholar

[47] View Article

[48] Google Scholar

[ref17] 17. Foster PG (2004) Modeling compositional heterogeneity. Syst Biol 53: 485–495.
View Article
Google Scholar

[50] View Article

[51] Google Scholar

[ref18] 18. Woese CR, Achenbach L, Rouviere P, Mandelco L (1991) Archaeal phylogeny: reexamination of the phylogenetic position of Archaeoglobus fulgidus in light of certain composition-induced artifacts. Syst Appl Microbiol 14: 364–371.
View Article
Google Scholar

[53] View Article

[54] Google Scholar

[ref19] 19. Phillips MJ, Delsuc F, Penny D (2004) Genome-scale phylogeny and the detection of systematic biases. Mol Biol Evol 21: 1455–1458.
View Article
Google Scholar

[56] View Article

[57] Google Scholar

[ref20] 20. Jeffroy O, Brinkmann H, Delsuc F, Philippe H (2006) Phylogenomics: the beginning of incongruence? Trends Genet 22: 225–231.
View Article
Google Scholar

[59] View Article

[60] Google Scholar

[ref21] 21. Gupta RS, Mok A (2007) Phylogenomics and signature proteins for the alpha proteobacteria and its main groups. BMC Microbiol 7: 106.
View Article
Google Scholar

[62] View Article

[63] Google Scholar

[ref22] 22. Foster PG, Jermiin LS, Hickey DA (1997) Nucleotide composition bias affects amino acid content in proteins coded by animal mitochondria. J Mol Evol 44: 282–288.
View Article
Google Scholar

[65] View Article

[66] Google Scholar

[ref23] 23. Lartillot N, Philippe H (2004) A Bayesian mixture model for across-site heterogeneities in the amino-acid replacement process. Mol Biol Evol 21: 1095–1109.
View Article
Google Scholar

[68] View Article

[69] Google Scholar

[ref24] 24. Hrdy I, Hirt RP, Dolezal P, Bardonova L, Foster PG, et al. (2004) Trichomonas hydrogenosomes contain the NADH dehydrogenase module of mitochondrial complex I. Nature 432: 618–622.
View Article
Google Scholar

[71] View Article

[72] Google Scholar

[ref25] 25. Moran NA (2003) Tracing the evolution of gene loss in obligate bacterial symbionts. Curr Opin Microbiol 6: 512–518.
View Article
Google Scholar

[74] View Article

[75] Google Scholar

[ref26] 26. Dufresne A, Garczarek L, Partensky F (2005) Accelerated evolution associated with genome reduction in a free-living prokaryote. Genome Biol 6: R14.
View Article
Google Scholar

[77] View Article

[78] Google Scholar

[ref27] 27. Meyer MM, Ames TD, Smith DP, Weinberg Z, Schwalbach MS, et al. (2009) Identification of candidate structured RNAs in the marine organism ‘Candidatus Pelagibacter ubique’. BMC Genomics 10: 268.
View Article
Google Scholar

[80] View Article

[81] Google Scholar

[ref28] 28. Gillespie JJ, Williams K, Shukla M, Snyder EE, Nordberg EK, et al. (2008) Rickettsia phylogenomics: unwinding the intricacies of obligate intracellular life. PLoS One 3: e2018.
View Article
Google Scholar

[83] View Article

[84] Google Scholar

[ref29] 29. Wang C, Sobral BW, Williams KP (2007) Loss of a universal tRNA feature. J Bacteriol 189: 1954–1962.
View Article
Google Scholar

[86] View Article

[87] Google Scholar

[ref30] 30. Gillespie JJ, Brayton KA, Williams KP, Diaz MA, Brown WC, et al. (2010) Phylogenomics reveals a diverse Rickettsiales type IV secretion system. Infect Immun 78: 1809–1823.
View Article
Google Scholar

[89] View Article

[90] Google Scholar

[ref31] 31. Vinogradov SN, Hoogewijs D, Bailly X, Arredondo-Peter R, Gough J, et al. (2006) A phylogenomic profile of globins. BMC Evol Biol 6: 31.
View Article
Google Scholar

[92] View Article

[93] Google Scholar

[ref32] 32. Tripp HJ, Schwalbach MS, Meyer MM, Kitner JB, Breaker RR, et al. (2009) Unique glycine-activated riboswitch linked to glycine-serine auxotrophy in SAR11. Environ Microbiol 11: 230–238.
View Article
Google Scholar

[95] View Article

[96] Google Scholar

[ref33] 33. Poiata E, Meyer MM, Ames TD, Breaker RR (2009) A variant riboswitch aptamer class for S-adenosylmethionine common in marine bacteria. RNA 15: 2046–2056.
View Article
Google Scholar

[98] View Article

[99] Google Scholar

[ref34] 34. Edgar RC (2004) MUSCLE: a multiple sequence alignment method with reduced time and space complexity. BMC Bioinformatics 5: 113.
View Article
Google Scholar

[101] View Article

[102] Google Scholar

[ref35] 35. Philippe H (1993) MUST, a computer package of Management Utilities for Sequences and Trees. Nucleic Acids Res 21: 5264–5272.
View Article
Google Scholar

[104] View Article

[105] Google Scholar

[ref36] 36. Talavera G, Castresana J (2007) Improvement of phylogenies after removing divergent and ambiguously aligned blocks from protein sequence alignments. Syst Biol 56: 564–577.
View Article
Google Scholar

[107] View Article

[108] Google Scholar

[ref37] 37. Roure B, Rodriguez-Ezpeleta N, Philippe H (2007) SCaFoS: a tool for selection, concatenation and fusion of sequences for phylogenomics. BMC Evol Biol 7: Suppl 1S2.
View Article
Google Scholar

[110] View Article

[111] Google Scholar

[ref38] 38. Wernersson R, Pedersen AG (2003) RevTrans: Multiple alignment of coding DNA from aligned amino acid sequences. Nucleic Acids Res 31: 3537–3539.
View Article
Google Scholar

[113] View Article

[114] Google Scholar

[ref39] 39. Stamatakis A, Ludwig T, Meier H (2005) RAxML-III: a fast program for maximum likelihood-based inference of large phylogenetic trees. Bioinformatics 21: 456–463.
View Article
Google Scholar

[116] View Article

[117] Google Scholar

[ref40] 40. Lartillot N, Lepage T, Blanquart S (2009) PhyloBayes 3: a Bayesian software package for phylogenetic reconstruction and molecular dating. Bioinformatics 25: 2286–2288.
View Article
Google Scholar

[119] View Article

[120] Google Scholar

[ref41] 41. Rodriguez-Ezpeleta N, Brinkmann H, Roure B, Lartillot N, Lang BF, et al. (2007) Detecting and overcoming systematic errors in genome-scale phylogenies. Syst Biol 56: 389–399.
View Article
Google Scholar

[122] View Article

[123] Google Scholar

[ref42] 42. Lartillot N, Brinkmann H, Philippe H (2006) Suppression of long-branch attraction artefacts in the animal phylogeny using a site-heterogeneous model. BMC Evol Biol. In press.
View Article
Google Scholar

[125] View Article

[126] Google Scholar