Did Genetic Drift Drive Increases in Genome Complexity?

Kenneth D. Whitney; Theodore Garland Jr

doi:10.1371/journal.pgen.1001080

Abstract

Mechanisms underlying the dramatic patterns of genome size variation across the tree of life remain mysterious. Effective population size (N_e) has been proposed as a major driver of genome size: selection is expected to efficiently weed out deleterious mutations increasing genome size in lineages with large (but not small) N_e. Strong support for this model was claimed from a comparative analysis of N_eu and genome size for ≈30 phylogenetically diverse species ranging from bacteria to vertebrates, but analyses at that scale have so far failed to account for phylogenetic nonindependence of species. In our reanalysis, accounting for phylogenetic history substantially altered the perceived strength of the relationship between N_eu and genomic attributes: there were no statistically significant associations between N_eu and gene number, intron size, intron number, the half-life of gene duplicates, transposon number, transposons as a fraction of the genome, or overall genome size. We conclude that current datasets do not support the hypothesis of a mechanistic connection between N_e and these genomic attributes, and we suggest that further progress requires larger datasets, phylogenetic comparative methods, more robust estimators of genetic drift, and a multivariate approach that accounts for correlations between putative explanatory variables.

Author Summary

Genome size (the amount of nuclear DNA) varies tremendously across organisms but is not necessarily correlated with organismal complexity. For example, genome sizes just within the grasses vary nearly 20-fold, but large-genomed grass species are not obviously more complex in terms of morphology or physiology than are the small-genomed species. Recent explanations for genome size variation have instead been dominated by the idea that population size determines genome size: mutations that increase genome size are expected to drift to fixation in species with small populations, but such mutations would be eliminated in species with large populations where natural selection operates at higher efficiency. However, inferences from previous analyses are limited because they fail to recognize that species share evolutionary histories and thus are not necessarily statistically independent. Our analysis takes a phylogenetic perspective and, contrary to previous studies, finds no evidence that genome size or any of its components (e.g., transposon number, intron number) are related to population size. We suggest that genome size evolution is unlikely to be neatly explained by a single factor such as population size.

Citation: Whitney KD, Garland T Jr (2010) Did Genetic Drift Drive Increases in Genome Complexity? PLoS Genet 6(8): e1001080. https://doi.org/10.1371/journal.pgen.1001080

Editor: Nancy A. Moran, Yale University, United States of America

Received: March 15, 2010; Accepted: July 22, 2010; Published: August 26, 2010

Copyright: © 2010 Whitney, Garland. This is an open-access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited.

Funding: KDW was supported by NSF DEB-0716868 and TG was supported in part by NSF DEB-0416085. The funder had no role in study design, data collection and analysis, decision to publish, or preparation of the manuscript.

Competing interests: The authors have declared that no competing interests exist.

Introduction

The vast array of genome sizes is a pattern that begs for explanation [1], [2]. Haploid (1C) genome size (measured either in base pairs or mass, where 10⁶ Kb ≈1 picogram) spans eight orders of magnitude: the known eukaryotic range is ≈2,249–978,000,000 Kb [3], while Archaea and Bacteria range from 491–5,751 Kb and 76–13,034 Kb, respectively [4].

Lynch and colleagues [5]–[7] have argued strongly for a central role for nonadaptive processes such as mutation and drift in the evolution of genome size and complexity. In contrast to proposed neutral and adaptive models of genome size evolution (see, e.g. [8], [9]), they outline a model positing that mutations increasing genome size are slightly deleterious. Under this model, lineages differ in effective population size (N_e) and, as a result, differ in the efficacy with which natural selection will counteract genome expansion. Thus, lineages with small N_e will experience drift towards larger genomes [7]. As support for their argument, they presented a comparative analysis of roughly 30 taxa, ranging from bacteria to angiosperms, fungi, and mammals. Among these taxa, they reported a statistically significant negative relationship between N_eu (a composite parameter including effective population size and nucleotide mutation rate) and genome size. Strikingly, the relationship was quite strong: 66% of the variation in genome size was explained by N_eu [7]. This is truly an astounding result, considering the widely divergent selective regimes, life histories, and modes of reproduction found across these diverse organisms.

The Lynch & Conery model has sparked intense interest and >330 citations. Some objections on theoretical and methodological grounds have been voiced. Charlesworth and Barton [10] point out that N_e is confounded with many different aspects of organismal biology (e.g., developmental rate, body size), and thus that both N_e and genome size may be correlated effects of one or more other causal factors. Daubin and Moran [11] outline several objections, including that taxon differences in mutation rates make N_eu a poor proxy for N_e that estimates of N_e from silent-site nucleotide diversity in bacteria (as in [7]) are skewed by population subdivision and cryptic species, and further that such N_e estimates are overly sensitive to recent evolutionary history. Nevertheless, the idea that N_e drives genome size and complexity seems to have gained acceptance [12]–[14], with some going so far as to characterize it as “the principal explanatory framework for understanding the evolution of genome organization” ([12], p. 303).

Here, we argue that such conclusions are premature without phylogenetic comparative analyses of genome size evolution. When species are used as data points, relationships between raw values of any two traits (e.g., N_e and genome size) are difficult to interpret, as shared phylogenetic history means that assumptions of statistical independence are likely to be violated [15]–[17]. Special methods are required to recover independence of observations and to test for evolutionary associations between traits. Frequently, conventional (nonphylogenetic) analyses overestimate the strength of the association between traits relative to phylogenetic methods [18]. In an extreme case, a strong correlation in the raw data can be driven by a single association at the base of the phylogenetic tree, e.g., it can reflect a single instance of correlated change in the traits, followed by uncorrelated changes and/or stasis in trait values during subsequent evolutionary history (Figure 1). In this study, we revisit the Lynch & Conery dataset with a phylogenetic perspective, taking advantage of new phylogenetic data and analysis tools.

Download:

Figure 1. Ignoring phylogenetic history can lead to incorrect conclusions about the nature of evolutionary associations between traits.

In this hypothetical example, eight species have been measured for two traits, x and y, as indicated by pairs of values at the tips of the phylogenetic tree (A). Ordinary least-squares linear regression (OLS) indicates a statistically significant positive relationship (B; r² = 0.62, P = 0.02), potentially leading to an inference of a positive evolutionary association between x and y. However, inspection of the scatterplot (B) in relation to the phylogenetic relationships of the species (A) indicates that the association between x and y is negative for the four species within each of the two major lineages. Regression through the origin with phylogenetically independent contrasts (computed using [34] and setting all branches to length 1.0), which is equivalent to phylogenetic generalized least squares (PGLS) analysis, accounts for the nonindependence of species and indicates no overall evolutionary relationship between the traits (C, standardized contrasts, r² = 0.01, P = 0.82; basal contrast indicated in red). The apparent pattern across species was driven by positively correlated trait change only at the basal split of the phylogeny; throughout the rest of the phylogeny, the traits mostly changed in opposite directions (A; basal contrast in red). Notes: In A, the estimated nodal values for both traits are shown in parentheses. These are intermediate steps in the independent contrasts algorithm and are not to be taken as optimal estimates of the states at internal nodes; rather, they are a type of “local parsimony” estimate (except the estimate at the basal node, which is equivalent to the estimate under squared-change parsimony). Contrasts are taken between sister nodes on a phylogeny, not along each branch segment [15], [16], [18].

https://doi.org/10.1371/journal.pgen.1001080.g001

Results

Model fitting

A phylogenetic topology and reconstruction of genome sizes is presented in Figure 2, illustrating that close relatives have similar genome sizes. Initial simple linear regressions of genome size on N_eu explored four branch length models and found that the phylogenetic generalized least squares (PGLS) model with all branches = 1.0 provided a better fit than the nonphylogenetic ordinary least squares (OLS) model (Table 1). Subsequent analyses therefore used branch lengths of 1.0. For all variables except intron number, phylogenetic models (PGLS) exhibited better fit than nonphylogenetic (OLS) models (Table 1). For genome size and gene number, estimation of the Ornstein-Uhlenbeck transformation parameter d indicated substantial phylogenetic signal (d = 1.31 and 1.16, respectively), and the resulting RegOU models fit significantly better than the OLS models (ln likelihood ratio tests (LRTs), χ² = 5.88, P = 0.015 and χ² = 7.90, P = 0.005, respectively). In comparing the two phylogenetic models, the RegOU model did not produce significantly better fit vs. PGLS (LRTs, χ² = 1.84, P = 0.175 and χ² = 0.46, P = 0.498 for genome size and gene number, respectively).

Download:

Figure 2. Phylogeny for the species in the Lynch & Conery dataset [7], with a reconstruction of genome sizes.

(See Materials and Methods).

https://doi.org/10.1371/journal.pgen.1001080.g002

Download:

Table 1. Relationships between N_eu and genomic attributes in nonphylogenetic (OLS) and phylogenetic (PGLS, RegOU) models.

https://doi.org/10.1371/journal.pgen.1001080.t001

Phylogenetic regressions do not detect relationships between N_eu and genomic attributes

Although there were strong negative relationships between N_eu and six of the seven genomic attributes in nonphylogenetic regressions, the patterns disappeared when phylogenetic models were applied (Table 1). For example, the strong negative relationship between N_eu and genome size (OLS, P<0.001, Figure 3A) was replaced with a nonsignificant relationship under better-fitting phylogenetic models (PGLS, P = 0.137, Figure 3B; RegOU, P = 0.328). Similar patterns were evident for gene number, the half-life of gene duplicates, intron size, intron number, transposon number, and transposon fraction (Table 1).

Download:

Figure 3. Relationship between N_eu and genome size across 22 eukaryotic and 7 prokaryotic species from the dataset of Lynch & Conery [7].

(A) Ordinary least squares regression (OLS); r² = 0.64, P<0.0001. (B) Standardized phylogenetically independent contrasts (equivalent to PGLS) using branch lengths of 1.0; r² = 0.08, P = 0.138. Values have been “positivized” on the x-axis [35].

https://doi.org/10.1371/journal.pgen.1001080.g003

Discussion

Accounting for phylogenetic history substantially altered the perceived strength of the relationship between N_eu and genomic attributes. In phylogenetic analyses, there were no consistent evolutionary associations between N_eu and gene number, intron size, intron number, the half-life of gene duplicates, transposon number, transposons as a fraction of the genome, or overall genome size. Thus, a phylogenetically controlled reanalysis of the Lynch & Conery dataset [7] does not support the conclusion that N_e drives genome size patterns across the tree of life.

The few existing comparative analyses of more phylogenetically restricted datasets either do not support or provide only equivocal support for the Lynch & Conery model. Whitney et al. [19] conducted a phylogenetically controlled analysis of 205 species of seed plants and found no association between N_e and genome size. Kuo et al. [20] analyzed 42 paired bacterial genomes, using the efficacy of purifying selection in coding regions to quantify genetic drift. Bacterial taxa experiencing greater levels of genetic drift – implying a smaller evolutionary N_e – had smaller genomes, a pattern opposite that predicted by the Lynch & Conery model as articulated in [7]. Finally, in putative support of the model, Yi & Streelman [21] reported a significant negative relationship between N_e and genome size in a phylogenetically corrected analysis of 33 species of ray-finned fish. However, this analysis has been challenged as artifactual. Gregory & Witt [22] argue that Pleistocene population bottlenecks and polyploidy shaped both N_e and genome size of fishes in such a way as to generate a non-causal correlation between N_e and genome size in this particular dataset.

Future investigations of the role of genetic drift in determining genome size across the tree of life would benefit from several approaches. First, utilizing phylogenetic comparative methods, for which we advocate here, is an important step towards drawing robust inferences from species-level comparative analyses. Second, larger datasets would certainly increase confidence in our interpretations. While statistically nonsignificant, we note the relationships between N_eu and genomic attributes (Table 1) are negative and thus are at least qualitatively consistent with the Lynch & Conery model, suggesting that power may be an issue. Furthermore, given that the N_eu estimates in the current analysis required sequence data, species with small genomes relative to averages within clades are likely overrepresented; thus it would be important to ensure that species with large genomes are included in future analyses. Third, future studies would benefit from more robust estimates of genetic drift, as N_eu estimated from silent-site diversity (as in [7] and the present reanalysis) has several undesirable properties. Because the mutation rate u differs among lineages [11], [23], [24], using N_eu as a proxy for N_e could obscure any relationship between N_e and genome size. Further, N_e estimated from silent-site diversity may signal the effects of recent evolutionary events more than the long-term history under which genome size evolved [11]. K_a/K_s ratios (ratios of nonsynonymous to synonymous substitutions per site) are a promising alternative to N_eu for estimating genetic drift [11], [20]. Finally, genome size is a complex trait that is unlikely to be explained by univariate analyses [10]. Phylogenetic comparative methods should be combined with multivariate models that are capable of distinguishing the contributions of highly correlated predictor variables. A recent analysis [19] is a step in the right direction: plant outcrossing rate and N_e were simultaneously examined in a multiple regression analysis of phylogenetically independent contrasts, allowing the partial contribution of each variable to be characterized. To make further progress on the population genetics of genome size and complexity, we clearly need phylogenetic comparative analyses of large datasets capable of distinguishing the contributions of N_e and its multiple correlates, including body size, developmental rate, and metabolic rate.

Materials and Methods

Data sources

Data on N_eu and genome sizes for 22 eukaryotic and 7 prokaryotic species were obtained from the Supporting Online Material of [7]. For a subset of these species, data on gene number, intron size, intron number, and the half-life of gene duplicates were also obtained from the same source. Data on total transposon number and fraction of the genome occupied by transposons were obtained directly from M. Lynch; these data combine counts of LTR, non-LTR, and DNA transposons and correspond to the fourth panel of Fig. 4 of [7]. All traits were log₁₀ transformed prior to analysis; for total transposon number and transposon fraction, constants of 1.0 and 0.01, respectively, were added prior to log-transformation.

Phylogeny construction

A composite tree for the species was constructed in Mesquite v. 2.71 [25] based on phylogenetic trees reported in [26]–[28]. As a visual heuristic, genome sizes were traced onto the phylogeny using the Parsimony Ancestral States method [29] with an assumption that all branch lengths equal 1.0.

Phylogenetic comparative analyses

All dependent variables were regressed on N_eu using REGRESSIONv2.m [30] running in MATLAB v. 7.9.0. Three types of models were examined: ordinary least squares (OLS), phylogenetic generalized least squares (PGLS), and phylogenetic regression under an Ornstein-Uhlenbeck process (RegOU) [30], [31]. OLS is traditional ‘nonphylogenetic’ regression, which in effect assumes a star phylogeny in which all species are equally unrelated, and corresponds to the N_eu vs. genome size analysis reported in [7]. PGLS assumes that residual variation among species is correlated, with the correlation given by a Brownian-motion like process along the specified phylogenetic tree (topology and branch lengths). PGLS is functionally equivalent to Felsenstein's [15] phylogenetically independent contrast method [31]. Finally, the RegOU model estimates (via restricted maximum likelihood) the strength of phylogenetic signal in the residual variation simultaneously with the regression coefficients; the former is given by d, the Ornstein-Uhlenbeck transformation parameter. An OU evolutionary model is typically used to model the effects of stabilizing selection around an optimum [30]. When d = 0, there is no phylogenetic signal in the residuals from the regression model; when d is significantly greater than 0, significant phylogenetic signal exists [30], [32].

Following [33], starter branch lengths corresponding to all branches = 1.0, Grafen's arbitrary lengths, Pagel's arbitrary lengths, and Nee's arbitrary lengths were compared in PGLS and RegOU regressions of genome size on N_eu. Based on their likelihoods, the models with all branches = 1.0 achieved the best fit, and thus these branch lengths were used in all subsequent phylogenetic analyses. Model selection for each variable then proceeded in two steps. First, we compared the likelihoods of the PGLS model and the OLS model, with a higher likelihood taken as evidence of a better-fitting model. Second, we used ln likelihood ratio tests (LRTs) to compare the RegOU model with the PGLS and OLS models with 1 d.f. [30]. Given the issue of small sample sizes (see [32]) for most dependent variables and the fact that RegOU models require estimation of an extra parameter, RegOU models were examined only for genome size and gene number.

Acknowledgments

Many thanks to Eric Baack, Mike Barker, Joe Felsenstein, Jon Gelfond, Owen Gilbert, Tony Ives, Michael Kohn, David Queller, and Jennifer Rudgers for discussion and to Mike Lynch for discussion and data sharing.

Author Contributions

Analyzed the data: KDW TG. Contributed reagents/materials/analysis tools: TG. Wrote the paper: KDW. Conceived and designed the study: KDW.

References

1. Baack EJ, Whitney KD, Rieseberg LH (2005) Hybridization and genome size evolution: timing and magnitude of nuclear DNA content increases in Helianthus homoploid hybrid species. New Phytol 167: 623–630.
- View Article
- Google Scholar
2. Gregory TR, editor. (2005) The evolution of the genome. Amsterdam: Elsevier.
3. Gregory TR, Nicol JA, Tamm H, Kullman B, Kullman K, et al. (2007) Eukaryotic genome size databases. Nucleic Acids Res 35: D332–D338.
- View Article
- Google Scholar
4. Center for Biological Sequence Analysis (2010) Genome Atlas Database. Lyngby, Denmark: Technical University of Denmark. http://www.cbs.dtu.dk/services/GenomeAtlas/.
5. Lynch M (2007) The origins of genome architecture. Sunderland, , Massachusetts, USA: Sinauer Associates.
6. Lynch M (2007) The frailty of adaptive hypotheses for the origins of organismal complexity. Proc Natl Acad Sci USA 104: 8597–8604.
- View Article
- Google Scholar
7. Lynch M, Conery JS (2003) The origins of genome complexity. Science 302: 1401–1404.
- View Article
- Google Scholar
8. Bennett MD, Leitch IJ (2005) Genome size evolution in plants. In: Gregory TR, editor. The evolution of the genome. Amsterdam: Elsevier. pp. 89–162.
9. Petrov DA (2002) Mutational equilibrium model of genome size evolution. Theor Pop Biol 61: 531–544.
- View Article
- Google Scholar
10. Charlesworth B, Barton N (2004) Genome size: Does bigger mean worse? Curr Biol 14: R233–R235.
- View Article
- Google Scholar
11. Daubin V, Moran NA (2004) Comment on “The origins of genome complexity”. Science 306: 978a.
- View Article
- Google Scholar
12. Koonin EV (2009) Evolution of genome architecture. Int J Biochem Cell Biol 41: 298–306.
- View Article
- Google Scholar
13. Pritham EJ (2009) Transposable elements and factors influencing their success in eukaryotes. J Hered 100: 648–655.
- View Article
- Google Scholar
14. Yi SV (2006) Non-adaptive evolution of genome complexity. Bioessays 28: 979–982.
- View Article
- Google Scholar
15. Felsenstein J (1985) Phylogenies and the comparative method. Am Nat 125: 1–15.
- View Article
- Google Scholar
16. Garland T Jr, Bennett AF, Rezende EL (2005) Phylogenetic approaches in comparative physiology. J Exp Biol 208: 3015–3035.
- View Article
- Google Scholar
17. Harvey PH, Pagel MD (1991) The comparative method in evolutionary biology. Oxford: Oxford University Press.
18. Garland T, Midford PE, Ives AR (1999) An introduction to phylogenetically based statistical methods, with a new method for confidence intervals on ancestral values. Am Zool 39: 374–388.
- View Article
- Google Scholar
19. Whitney KD, Baack EJ, Hamrick JL, Godt MJW, Barringer BC, et al. (2010) A role for nonadaptive processes in plant genome size evolution? Evolution 64: 2097–2109.
- View Article
- Google Scholar
20. Kuo CH, Moran NA, Ochman H (2009) The consequences of genetic drift for bacterial genome complexity. Genome Res 19: 1450–1454.
- View Article
- Google Scholar
21. Yi S, Streelman JT (2005) Genome size is negatively correlated with effective population size in ray-finned fish. Trends Genet 21: 643–646.
- View Article
- Google Scholar
22. Gregory TR, Witt JDS (2008) Population size and genome size in fishes: a closer look. Genome 51: 309–313.
- View Article
- Google Scholar
23. Drake JW, Charlesworth B, Charlesworth D, Crow JF (1998) Rates of spontaneous mutation. Genetics 148: 1667–1686.
- View Article
- Google Scholar
24. Lynch M (2010) Rate, molecular spectrum, and consequences of human mutation. Proc Natl Acad Sci USA 107: 961–968.
- View Article
- Google Scholar
25. Maddison WP, Maddison DR (2009) Mesquite: a modular system for evolutionary analysis. Version 2.71. http://mesquiteproject.org.
26. Gupta RS (2000) The phylogeny of proteobacteria: relationships to other eubacterial phyla and eukaryotes. FEMS Microbiol Rev 24: 367–402.
- View Article
- Google Scholar
27. Maddison DR, Schulz K-S (2007) The Tree of Life Web Project. http://tolweb.org.
28. Song J, Xu QK, Olsen R, Loomis WF, Shaulsky G, et al. (2005) Comparing the Dictyostelium and Entamoeba genomes reveals an ancient split in the Conosa lineage. PLoS Comp Biol 1: e71.
- View Article
- Google Scholar
29. Maddison WP (1991) Squared-change parsimony reconstructions of ancestral states for continuous-valued characters on a phylogenetic tree. Syst Zool 40: 304–314.
- View Article
- Google Scholar
30. Lavin SR, Karasov WH, Ives AR, Middleton KM, Garland T (2008) Morphometrics of the avian small intestine compared with that of nonflying mammals: A phylogenetic approach. Physiol Biochem Zool 81: 526–550.
- View Article
- Google Scholar
31. Garland T, Ives AR (2000) Using the past to predict the present: Confidence intervals for regression equations in phylogenetic comparative methods. Am Nat 155: 346–364.
- View Article
- Google Scholar
32. Blomberg SP, Garland T, Ives AR (2003) Testing for phylogenetic signal in comparative data: Behavioral traits are more labile. Evolution 57: 717–745.
- View Article
- Google Scholar
33. Hutcheon JM, Garland T (2004) Are megabats big? J Mamm Evol 11: 257–276.
- View Article
- Google Scholar
34. Midford PE, Garland T Jr, Maddison W (2002) PDAP:PDTREE package for Mesquite, version 1.00. http://mesquiteproject.org/pdap_mesquite/.
35. Garland T, Harvey PH, Ives AR (1992) Procedures for the analysis of comparative data using phylogenetically independent contrasts. Syst Biol 41: 18–32.
- View Article
- Google Scholar

[ref1] 1. Baack EJ, Whitney KD, Rieseberg LH (2005) Hybridization and genome size evolution: timing and magnitude of nuclear DNA content increases in Helianthus homoploid hybrid species. New Phytol 167: 623–630.
View Article
Google Scholar

[2] View Article

[3] Google Scholar

[ref2] 2. Gregory TR, editor. (2005) The evolution of the genome. Amsterdam: Elsevier.

[ref3] 3. Gregory TR, Nicol JA, Tamm H, Kullman B, Kullman K, et al. (2007) Eukaryotic genome size databases. Nucleic Acids Res 35: D332–D338.
View Article
Google Scholar

[6] View Article

[7] Google Scholar

[ref4] 4. Center for Biological Sequence Analysis (2010) Genome Atlas Database. Lyngby, Denmark: Technical University of Denmark. http://www.cbs.dtu.dk/services/GenomeAtlas/.

[ref5] 5. Lynch M (2007) The origins of genome architecture. Sunderland, , Massachusetts, USA: Sinauer Associates.

[ref6] 6. Lynch M (2007) The frailty of adaptive hypotheses for the origins of organismal complexity. Proc Natl Acad Sci USA 104: 8597–8604.
View Article
Google Scholar

[11] View Article

[12] Google Scholar

[ref7] 7. Lynch M, Conery JS (2003) The origins of genome complexity. Science 302: 1401–1404.
View Article
Google Scholar

[14] View Article

[15] Google Scholar

[ref8] 8. Bennett MD, Leitch IJ (2005) Genome size evolution in plants. In: Gregory TR, editor. The evolution of the genome. Amsterdam: Elsevier. pp. 89–162.

[ref9] 9. Petrov DA (2002) Mutational equilibrium model of genome size evolution. Theor Pop Biol 61: 531–544.
View Article
Google Scholar

[18] View Article

[19] Google Scholar

[ref10] 10. Charlesworth B, Barton N (2004) Genome size: Does bigger mean worse? Curr Biol 14: R233–R235.
View Article
Google Scholar

[21] View Article

[22] Google Scholar

[ref11] 11. Daubin V, Moran NA (2004) Comment on “The origins of genome complexity”. Science 306: 978a.
View Article
Google Scholar

[24] View Article

[25] Google Scholar

[ref12] 12. Koonin EV (2009) Evolution of genome architecture. Int J Biochem Cell Biol 41: 298–306.
View Article
Google Scholar

[27] View Article

[28] Google Scholar

[ref13] 13. Pritham EJ (2009) Transposable elements and factors influencing their success in eukaryotes. J Hered 100: 648–655.
View Article
Google Scholar

[30] View Article

[31] Google Scholar

[ref14] 14. Yi SV (2006) Non-adaptive evolution of genome complexity. Bioessays 28: 979–982.
View Article
Google Scholar

[33] View Article

[34] Google Scholar

[ref15] 15. Felsenstein J (1985) Phylogenies and the comparative method. Am Nat 125: 1–15.
View Article
Google Scholar

[36] View Article

[37] Google Scholar

[ref16] 16. Garland T Jr, Bennett AF, Rezende EL (2005) Phylogenetic approaches in comparative physiology. J Exp Biol 208: 3015–3035.
View Article
Google Scholar

[39] View Article

[40] Google Scholar

[ref17] 17. Harvey PH, Pagel MD (1991) The comparative method in evolutionary biology. Oxford: Oxford University Press.

[ref18] 18. Garland T, Midford PE, Ives AR (1999) An introduction to phylogenetically based statistical methods, with a new method for confidence intervals on ancestral values. Am Zool 39: 374–388.
View Article
Google Scholar

[43] View Article

[44] Google Scholar

[ref19] 19. Whitney KD, Baack EJ, Hamrick JL, Godt MJW, Barringer BC, et al. (2010) A role for nonadaptive processes in plant genome size evolution? Evolution 64: 2097–2109.
View Article
Google Scholar

[46] View Article

[47] Google Scholar

[ref20] 20. Kuo CH, Moran NA, Ochman H (2009) The consequences of genetic drift for bacterial genome complexity. Genome Res 19: 1450–1454.
View Article
Google Scholar

[49] View Article

[50] Google Scholar

[ref21] 21. Yi S, Streelman JT (2005) Genome size is negatively correlated with effective population size in ray-finned fish. Trends Genet 21: 643–646.
View Article
Google Scholar

[52] View Article

[53] Google Scholar

[ref22] 22. Gregory TR, Witt JDS (2008) Population size and genome size in fishes: a closer look. Genome 51: 309–313.
View Article
Google Scholar

[55] View Article

[56] Google Scholar

[ref23] 23. Drake JW, Charlesworth B, Charlesworth D, Crow JF (1998) Rates of spontaneous mutation. Genetics 148: 1667–1686.
View Article
Google Scholar

[58] View Article

[59] Google Scholar

[ref24] 24. Lynch M (2010) Rate, molecular spectrum, and consequences of human mutation. Proc Natl Acad Sci USA 107: 961–968.
View Article
Google Scholar

[61] View Article

[62] Google Scholar

[ref25] 25. Maddison WP, Maddison DR (2009) Mesquite: a modular system for evolutionary analysis. Version 2.71. http://mesquiteproject.org.

[ref26] 26. Gupta RS (2000) The phylogeny of proteobacteria: relationships to other eubacterial phyla and eukaryotes. FEMS Microbiol Rev 24: 367–402.
View Article
Google Scholar

[65] View Article

[66] Google Scholar

[ref27] 27. Maddison DR, Schulz K-S (2007) The Tree of Life Web Project. http://tolweb.org.

[ref28] 28. Song J, Xu QK, Olsen R, Loomis WF, Shaulsky G, et al. (2005) Comparing the Dictyostelium and Entamoeba genomes reveals an ancient split in the Conosa lineage. PLoS Comp Biol 1: e71.
View Article
Google Scholar

[69] View Article

[70] Google Scholar

[ref29] 29. Maddison WP (1991) Squared-change parsimony reconstructions of ancestral states for continuous-valued characters on a phylogenetic tree. Syst Zool 40: 304–314.
View Article
Google Scholar

[72] View Article

[73] Google Scholar

[ref30] 30. Lavin SR, Karasov WH, Ives AR, Middleton KM, Garland T (2008) Morphometrics of the avian small intestine compared with that of nonflying mammals: A phylogenetic approach. Physiol Biochem Zool 81: 526–550.
View Article
Google Scholar

[75] View Article

[76] Google Scholar

[ref31] 31. Garland T, Ives AR (2000) Using the past to predict the present: Confidence intervals for regression equations in phylogenetic comparative methods. Am Nat 155: 346–364.
View Article
Google Scholar

[78] View Article

[79] Google Scholar

[ref32] 32. Blomberg SP, Garland T, Ives AR (2003) Testing for phylogenetic signal in comparative data: Behavioral traits are more labile. Evolution 57: 717–745.
View Article
Google Scholar

[81] View Article

[82] Google Scholar

[ref33] 33. Hutcheon JM, Garland T (2004) Are megabats big? J Mamm Evol 11: 257–276.
View Article
Google Scholar

[84] View Article

[85] Google Scholar

[ref34] 34. Midford PE, Garland T Jr, Maddison W (2002) PDAP:PDTREE package for Mesquite, version 1.00. http://mesquiteproject.org/pdap_mesquite/.

[ref35] 35. Garland T, Harvey PH, Ives AR (1992) Procedures for the analysis of comparative data using phylogenetically independent contrasts. Syst Biol 41: 18–32.
View Article
Google Scholar

[88] View Article

[89] Google Scholar

Figures

Abstract

Author Summary

Introduction

Results

Model fitting

Phylogenetic regressions do not detect relationships between Neu and genomic attributes

Discussion

Materials and Methods

Data sources

Phylogeny construction

Phylogenetic comparative analyses

Acknowledgments

Author Contributions

References

Phylogenetic regressions do not detect relationships between N_eu and genomic attributes