Episodic Evolution and Adaptation of Chloroplast Genomes in Ancestral Grasses

Bojian Zhong; Takahiro Yonezawa; Yang Zhong; Masami Hasegawa

doi:10.1371/journal.pone.0005297

Abstract

Background

It has been suggested that the chloroplast genomes of the grass family, Poaceae, have undergone an elevated evolutionary rate compared to most other angiosperms, yet the details of this phenomenon have remained obscure. To understand how the rate change occurred during evolution, the time-scale must be estimated with reliable calibrations. The recent finding of 65 Ma grass phytoliths in Cretaceous dinosaur coprolites places the diversification of the grasses in the Cretaceous period, and provides a reliable calibration for studying the tempo and mode of grass chloroplast evolution.

Methodology/Principal Findings

By using chloroplast genome data from angiosperms and by taking account of new paleontological evidence, we now show that episodic rate acceleration both in terms of non-synonymous and synonymous substitutions occurred in the common ancestral branch of the core Poaceae (a group formed by rice, wheat, maize, and their allies) accompanied by adaptive evolution in several chloroplast proteins, while the rate reverted to the slow rate typical of most monocot species in the terminal branches.

Conclusions/Significance

Our finding of episodic rate acceleration in the ancestral grasses accompanied by adaptive molecular evolution has a profound bearing on the evolution of grasses, which form a highly successful group of plants. The widely used model for estimating divergence times is based on the assumption of correlated rates between ancestral and descendant lineages. However, this assumption proved inadequate for approximating the episodic rate acceleration in the ancestral grasses, and the assumption of independent rates is more appropriate. This finding has implications for studies of molecular evolutionary rates and time-scale of evolution in other groups of organisms.

Citation: Zhong B, Yonezawa T, Zhong Y, Hasegawa M (2009) Episodic Evolution and Adaptation of Chloroplast Genomes in Ancestral Grasses. PLoS ONE 4(4): e5297. https://doi.org/10.1371/journal.pone.0005297

Editor: Simon Joly, McGill University, Canada

Received: January 29, 2009; Accepted: March 19, 2009; Published: April 24, 2009

Copyright: © 2009 Zhong et al. This is an open-access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited.

Funding: The study was financially supported by Shanghai Leading Academic Discipline Project (B111) and National Infrastructure of Natural Resources for Science and Technology (2005DKA21403) to Yang Zhong. The funders had no role in study design, data collection and analysis, decision to publish, or preparation of the manuscript.

Competing interests: The authors have declared that no competing interests exist.

Introduction

The grass family, Poaceae, is one of the largest plant families, comprising about 10,000 species including the most important agricultural plants: rice, wheat and maize. Grass-dominated ecosystems comprise about one-third of Earth's vegetative cover and support a vast number of animals [1]. It has long been suggested that the chloroplast (chl) genomes of the grass family have undergone an elevated evolutionary rate compared to other angiosperms [2]–[5], yet little is known about when, why and how this rate change occurred.

To examine how the rate change occurred during evolution, we first need to know the time-scale of evolution. It has become increasingly feasible to estimate the phylogenetic tree and the time-scale of angiosperm evolution by using chl genome sequences [5]–[9]. A reliable calibration is necessary to obtain reliable time estimates, but the lack of good fossil evidence for the ancestral grasses has prevented us from addressing this issue. The recent finding of 65 Ma grass phytoliths in Cretaceous dinosaur coprolites [10], [11] places the diversification of the grasses in the Cretaceous period, and provides a reliable calibration for studying the tempo and mode of grass chl evolution. By using this calibration, we here find that episodic rate acceleration occurred in the common ancestral branch of the core Poaceae (a clade formed by rice Oryza, wheat Triticum, maize Zea, and their allies) accompanied by adaptive evolution in several chl proteins, while the rate reverted to the slow rate typical of most monocot species in the terminal branches. We also find that the widely used method for estimating divergence times, based on the assumption of correlated rates between ancestral and descendant lineages [12]–[14], is inadequate for approximating the process of grass chl evolution, and that the assumption of independent rates [15] is more appropriate for studies of rate change over time. These results have implications for studies of molecular evolutionary rates and time-scale of evolution in other groups of organisms.

Results

Estimation of time-scale and pattern of rate change

Fig. 1 shows the ML phylogenetic tree of angiosperm chl genomes with gymnosperms as an outgroup. The elongated branches of Poaceae are in accord with their widely accepted rate acceleration [2]–[5]. The global clock model in Poales (including Poaceae and Typha)+Musa was rejected when compared with the 2-local-clocks model (in which Poaceae lineages have a different rate from basal lineages such as Typha and Musa) by the likelihood ratio test (LRT) with the codon-substitution model. Moreover, the distances of the Poaceae species from Musa are longer than the Typha/Musa distance in terms of both non-synonymous and synonymous substitutions (Fig. 1), which indicates that both types of substitutions have undergone rate acceleration along the line leading to Poaceae.

Figure 1. The phylogenetic tree of chloroplast genomes for the 31 species.

The tree topology in Fig. 3 of ref. [6] was used, and the branch lengths were estimated by ML with the codon-substitution model [21], [22] (CODEML in PAML [19]). The root was arbitrarily placed between gymnosperms and angiosperms. Non-synonymous (d_N) and synonymous (d_S) distances of Poales from Musa were estimated by CODEML.

https://doi.org/10.1371/journal.pone.0005297.g001

To explore the pattern of rate change during the course of grass evolution in more detail, we estimated the time-scale of angiosperm phylogeny, focusing particularly on monocots. Although several powerful methods have been developed for molecular time estimation that allow the rate to change (a relaxed clock) [12]–[17], the poor quality of the fossil record for early grasses has prevented us from addressing this issue. Previously, the divergence among major groups of Poaceae was thought to have occurred in the early Cenozoic, and the 60 Ma [18] and 50–60 Ma [5] date calibrations for the maize/wheat divergence were used in estimating the monocots/eudicots divergence time with chl DNA sequences. However, recent findings of grass phytoliths in Cretaceous dinosaur coprolites [10], [11] provided evidence that the major groups of core Poaceae had already diversified before the Cretaceous/Tertiary (K/T) boundary at 65 Ma. Fig. 2A gives time estimates of monocot evolution (see Fig. 3 and Table 1 for the whole angiosperms) by a relaxed clock based on the Bayesian method implemented in MCMCTREE (in PAML [19]) with a constraint of >65 Ma for the Zea/Oryza divergence and with the independent-rates model for the rate change along lineages [15], [17]. To illustrate the rate change during evolution, a single instance of estimated rates along the lineage from the root to Oryza is also shown in Fig. 2A, where elevation of the rate occurred only on the common ancestral branch of Poaceae after it diverged from Typha.

Figure 2. Posterior estimates of divergence times and rate change during evolution.

Estimations were done by using MCMCTREE in PAML [19] with the IR model [15] for the rate change along lineages. Shape and scale parameters, α and β, in the gamma prior for parameter σ² were 1.0 and 10.0, respectively. Only the Poales+Musa part of the whole tree is shown, and the node numbering follows that of the whole tree in Fig. 3. The upper lines of the colored area trace the estimated rates along the lineage from the root to Oryza (the lineage indicated by colored lines in the tree), where the 95% highest posterior density (HPD) is shown by a vertical line segment with two short horizontal line segments at the boundaries. (A) >65 Ma constraint and (B) no constraint on the Zea/Oryza separation (for other calibrations, see Materials and Methods).

https://doi.org/10.1371/journal.pone.0005297.g002

Figure 3. Posterior estimates of divergence times of a whole Angiosperm tree.

Estimations were done by using MCMCTREE [19] with the IR model [15]. The >65 Ma constraint to the Zea/Oryza separation was applied.

https://doi.org/10.1371/journal.pone.0005297.g003

Table 1. Posterior estimates of divergence times with the >65 Ma constraint to the Zea/Oryza separation.

https://doi.org/10.1371/journal.pone.0005297.t001

Although the fossil evidence for the >65 Ma constraint of the Zea/Oryza divergence is important in demonstrating the rate acceleration in ancestral grasses with subsequent slow-down, it is not a prerequisite. Even when the constraint was removed, almost the same pattern of rate change as that with the >65 Ma constraint was obtained when the IR model was used (Fig. 2B and Table 2), although the time estimate of the Zea/Oryza separation became younger (55.0 Ma). This time estimate is consistent with a conservative date of >50 Ma presented in refs. [5], [20], and our conclusion of the reverted slow rate in contemporary Poaceae can be regarded as robust to the calibration points used.

Table 2. Posterior estimates of divergence times without constraint to the Zea/Oryza separation.

https://doi.org/10.1371/journal.pone.0005297.t002

In the first relaxed-clock model, implemented by Thorne and colleagues [12], [13], rates are auto-correlated between ancestral and descendant lineages on the tree; this model is called the correlated-rates (CR) model. Sanderson's method of nonparametric rate-smoothing [14] is based on the same idea. Later, an alternative model with no auto-correlation, named the independent-rates (IR) model, was developed [15], [17]. In Table 1, estimates of divergence times with the >65 Ma constraint for the Zea/Oryza separation are compared between the IR and CR models. The CR model tends to give older estimates than the IR model for the nodes preceding the Zea/Oryza separation. For example, the monocots/eudicots divergence time estimate was 239.1 Ma with the CR model, while the estimate with the IR model was 212.5 Ma, which is more in accord with the recently published estimate of 140–150 Ma [5] even though it is still older. Without the >65 Ma constraint, the CR model gave a time estimate for the Zea/Oryza separation (36.9 Ma) that is too young to be compatible with the >50 Ma suggested by previous works [5], [20], while the IR model gave a compatible estimate of 55.0 Ma, as mentioned above.
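
The difference between the two rate models can be illustrated with a small simulation. This is a simplified sketch, not the MCMCTREE implementation: it assumes lognormally distributed rates, ignores branch durations (in the CR model the variance would normally scale with elapsed time), and uses hypothetical values for the mean rate and σ². The function name simulate_rates is an illustrative assumption.

```python
import math
import random

def simulate_rates(n_branches, mean_rate, sigma2, correlated, rng):
    """Draw one rate per branch along a single root-to-tip lineage.

    correlated=False: IR-like, every rate is drawn around the overall mean.
    correlated=True:  CR-like, every rate is drawn around its parent's rate.
    """
    rates = []
    parent = mean_rate
    for _ in range(n_branches):
        centre = parent if correlated else mean_rate
        # Subtract sigma2/2 so that the expected rate equals `centre`.
        log_rate = rng.gauss(math.log(centre) - sigma2 / 2.0, math.sqrt(sigma2))
        rate = math.exp(log_rate)
        rates.append(rate)
        parent = rate
    return rates

rng = random.Random(1)  # fixed seed so the sketch is reproducible
# Hypothetical settings: 6 branches, mean rate 0.05, sigma^2 = 0.1.
ir_rates = simulate_rates(6, 0.05, 0.1, correlated=False, rng=rng)
cr_rates = simulate_rates(6, 0.05, 0.1, correlated=True, rng=rng)
print("IR:", [round(r, 4) for r in ir_rates])
print("CR:", [round(r, 4) for r in cr_rates])
```

Under the CR-like scheme a single fast branch pulls its descendants toward a fast rate, whereas under the IR-like scheme a fast ancestral branch can be followed immediately by slow descendants, which is the pattern reported here for the grasses.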

To examine the impact of including the rapidly evolving Poaceae in the analysis, the two models were also compared with Poaceae excluded (Table 3). The time estimates were similar between the two models, and were similar to those from the IR model including Poaceae. For the monocots/eudicots separation, the IR model gave nearly consistent estimates of 212.5, 207.7, and 216.5 Ma, respectively, with the >65 Ma constraint for the Zea/Oryza separation, without the constraint, and excluding Poaceae, while the CR model gave more divergent estimates of 239.1, 220.4, and 223.2 Ma, respectively. Interestingly, the estimates for this separation were very close between the two models when Poaceae species were excluded. This suggests that the episodic rate acceleration in ancestral Poaceae, which the CR model cannot accommodate, biases its estimates.
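
The consistency claim can be checked with simple arithmetic on the six monocots/eudicots estimates quoted above. The sketch below only restates those numbers; the variable and function names are illustrative.

```python
# Monocots/eudicots divergence estimates (Ma) quoted in the text.
conditions = ["with >65 Ma constraint", "without constraint", "excluding Poaceae"]
ir = [212.5, 207.7, 216.5]  # independent-rates model
cr = [239.1, 220.4, 223.2]  # correlated-rates model

def spread(estimates):
    """Range (max - min) of a set of estimates, rounded to 0.1 Ma."""
    return round(max(estimates) - min(estimates), 1)

ir_spread = spread(ir)
cr_spread = spread(cr)

# Gap between the two models under each condition (CR minus IR).
gaps = {c: round(b - a, 1) for c, a, b in zip(conditions, ir, cr)}

print("IR spread:", ir_spread, "Ma")
print("CR spread:", cr_spread, "Ma")
print("CR - IR gaps:", gaps)
```

The IR estimates span 8.8 Ma across the three conditions against 18.7 Ma for the CR estimates, and the gap between the models is smallest (6.7 Ma) when Poaceae are excluded.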

Table 3. Posterior estimates of divergence times excluding Poaceae.

https://doi.org/10.1371/journal.pone.0005297.t003

In the analyses above, before fixing the shape and scale parameters (α and β) of the gamma prior for σ², which specifies how variable the rates are across branches, we examined in detail the impact of these prior parameters on the posterior time and rate estimates (Tables S1, S2, S3, S4). The posterior time estimate for the Zea/Oryza separation depended less on the choice of the gamma prior for σ² with the IR model (Tables S1 and S3) than with the CR model (Tables S2 and S4); α and β were therefore set, somewhat arbitrarily, to 1.0 and 10.0, respectively, in the analyses of Tables 1–3 and Figs. 2 and 3.

In order to further check the robustness of the time estimation on the choice of the substitution model, additional analyses based on a more realistic model of codon-substitution [21], [22] were carried out (Tables S5 and S6 with and without the >65 Ma constraint to the Zea/Oryza separation, and Table S7 excluding Poaceae). Comparisons of Tables S5 vs 1, Tables S6 vs 2, and Tables S7 vs 3 indicate that the estimated times are very similar between the two models, and that the estimated times are robust to the choice of a substitution model.

Adaptive evolution

The non-synonymous/synonymous rate ratio (ω = d_N/d_S) is widely used as an indicator of adaptive evolution or positive selection [23]. Table 4 compares ω ratios along the branches estimated by different models. The minimum AIC [24] model shows that a pronounced increase of the ω ratio occurred in the common ancestral lineage of Poaceae after it diverged from Typha, followed by reversion in the terminal branches to the lower level typical of basal lineages. The elevation of the ω ratio can be due either to adaptive evolution or to relaxation of selective constraints. An ω value higher than 1 is usually regarded as evidence of adaptive evolution, but since the analysis shown in the table averages over entire genomes, we would not obtain such a high value even if positive selection operated in some parts of some proteins. Therefore, the branch-site model [25], [26] was applied.
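
Model choice by minimum AIC can be sketched as follows, using the standard definition AIC = −2 lnL + 2k, where k is the number of free parameters. The log-likelihoods and parameter counts below are hypothetical placeholders, not the values in Table 4, and the model labels are only illustrative.

```python
def aic(log_likelihood, n_params):
    """Akaike information criterion: lower values indicate a better model."""
    return -2.0 * log_likelihood + 2.0 * n_params

# (lnL, number of free parameters) for each candidate model.
# HYPOTHETICAL values for illustration only.
models = {
    "1w": (-1000.0, 10),
    "simple 2w": (-995.0, 11),
    "reverted 2w": (-990.0, 11),
}

scores = {name: aic(lnl, k) for name, (lnl, k) in models.items()}
best = min(scores, key=scores.get)

for name, score in scores.items():
    print(f"{name}: AIC = {score:.1f}")
print("minimum-AIC model:", best)
```

An extra parameter is worth keeping only if it raises the log-likelihood by more than one unit, which is how AIC penalizes the additional ω of the two-ratio models.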

Table 4. Estimation of non-synonymous/synonymous rate ratio (ω) under different models by using CODEML in PAML [19].

https://doi.org/10.1371/journal.pone.0005297.t004

To identify positively selected sites, we first selected, among the 61 protein-encoding “genes”, the 16 “genes” for which the reverted 2ω-model (with the rate ratio ω₁ of the common ancestral branch of Poaceae larger than the rate ratio ω₀ of other branches) is significantly better than the 1ω-model (P<0.05) (Table 5). Using the branch-site model [25], [26], we then identified 5 genes (atpE, cemA, clpP, rpoB, and rps11) that have a branch-site LRT P value below 0.05 and contain positively selected sites (Table 6).

Table 5. LRT of 1ω-model vs reverted 2ω-model for individual genes.

https://doi.org/10.1371/journal.pone.0005297.t005

Table 6. Branch-site test of positive selection.

https://doi.org/10.1371/journal.pone.0005297.t006

Among the 16 genes with significantly higher ω₁ than ω₀ in Table 5 and among the 5 genes with positively selected sites in Table 6, only atpE is among the 14 genes with significant heterogeneity of nucleotide substitution rates for maize vs. rice, maize vs. wheat, or rice vs. wheat comparisons listed in Table 5 of ref. [27]. Four “genes”, psaC, rbcL, rpl6, and PS13, have significantly lower ω₁ than ω₀ (stronger purifying selection in the ancestral branch of Poaceae than in other branches). In Table 5, we carried out multiple tests for 61 “genes”. The Bonferroni correction is a safeguard against multiple tests falsely giving the appearance of significance, since 1 out of every 20 hypothesis tests is expected to be significant at the 5% level purely by chance. After the Bonferroni correction, the 7 genes marked with * in Table 5 remained significant, which means that all the genes listed in Table 6 remained significant even under the conservative Bonferroni test. On the other hand, the 4 “genes” with lower ω₁ than ω₀ in Table 5 were not significant after the Bonferroni correction.
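
The correction amounts to dividing the significance level by the number of tests. A minimal sketch for the 61 tests mentioned above is given below; the gene names and P values in the example are hypothetical and are not taken from Table 5.

```python
def bonferroni_threshold(alpha, n_tests):
    """Per-test significance level that keeps the family-wise error rate at alpha."""
    return alpha / n_tests

def passes_bonferroni(p_values, n_tests, alpha=0.05):
    """Map each gene name to True if its P value survives the correction."""
    threshold = bonferroni_threshold(alpha, n_tests)
    return {gene: p <= threshold for gene, p in p_values.items()}

N_TESTS = 61  # number of "genes" tested, as stated in the text
threshold = bonferroni_threshold(0.05, N_TESTS)
# Number of tests expected to reach P < 0.05 by chance if no gene had any effect.
expected_by_chance = 0.05 * N_TESTS

# HYPOTHETICAL P values for illustration only.
example = {"geneA": 0.0001, "geneB": 0.01, "geneC": 0.2}
result = passes_bonferroni(example, N_TESTS)

print(f"per-test threshold: {threshold:.5f}")
print(f"expected chance hits at 5%: {expected_by_chance:.2f}")
print(result)
```

With 61 tests, about three genes would be expected to pass the uncorrected 5% level by chance alone, and a gene must reach roughly P < 0.0008 to survive the correction.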

Discussion

In our study, the IR model gives more consistent results than the CR model, which has been widely used in estimating divergence times [9], [12]–[14], [28]–[34]. A basic assumption of the CR model is that rates change gradually over the tree. Our results suggest that the magnitude of the rate acceleration is underestimated by the CR model and that the IR model is more appropriate in approximating the rate change in the grass chl evolution. Although there exists a case in which the CR model outperforms the IR model [34], a number of authors have recently begun to notice that the IR model is superior to the CR model in approximating the evolution of evolutionary rates in several cases [17], [35]–[37].

Our analysis has revealed an episodic acceleration of the evolutionary rate of chl genomes during the emergence of core Poaceae, accompanied by adaptive evolution in several protein-encoding genes. Because the elevation of the rate occurred not only in non-synonymous substitutions but also in synonymous substitutions and because the elevated substitution rates were accompanied also by an elevated rate of insertions/deletions of nucleotides [9], the elevation of the mutation rate of chl genomes might have acted as a trigger of the adaptive evolution in the ancestral grasses, which might have facilitated the successful radiation and diversification of their descendants.

The suggested positive selection of clpP in Oenothera and Sileneae, accompanied by an elevated synonymous rate [38], might be related to our finding of rate acceleration in ancestral grasses in terms of both synonymous and non-synonymous substitutions. A more extensive study of chl genomes showed highly accelerated non-synonymous rates of ribosomal protein and RNA polymerase genes in Geraniaceae, accompanied by elevation of the ω ratio [39]. Interestingly, 4 of the genes (atpE, cemA, rpoB, and rps11) detected to have positively selected sites in our analysis (Table 6) are included in the gene group with significantly high ω ratio in Geraniaceae relative to other angiosperms (clpP was not analyzed in ref. [39]).

Recently, Smith and Donoghue [40] tested evolutionary rates across five groups of angiosperms, and found that the rates are generally low in trees/shrubs compared to related herbs. This is an interesting finding which links the life history of plants to their rates of molecular evolution, and their conclusion generally holds in all five groups. What we have shown in this work, however, is that the pattern of rate change during evolution is more complicated than previously anticipated. Our finding highlights the need to pay attention to the rates of internal branches, rather than averaging along a lineage, when addressing the rate heterogeneity problem.

Materials and Methods

Since our main interest was in grass evolution, we used all the monocot genera (13 species) and selected 18 species from outside monocots (31 species in total) among the 64 species in ref. [6]. We used 75 chl genes among 77 protein-encoding genes in ref. [6], excluding infA and ycf2 because of missing data.

Estimation of divergence times

The concatenated 75 gene sequences of chl from 31 species (from ref. [6]) and the tree topology in ref. [6] were used. To estimate divergence times and molecular evolutionary rates, a Bayesian method implemented in MCMCTREE (in PAML [19]) was applied either with the CR model [12], [13] or with the IR model [15] (using the GTR model with a discrete gamma distribution with five rate categories (Γ₅) for nucleotide substitutions), and multiple calibrations were incorporated through the time prior. The Gymnosperm/Angiosperm divergence time was set at 280–310 Ma [5], [7]. Three nodes were constrained with minimum ages as follows: (1) the minimum age of the Zea/Oryza divergence was either set to 65 Ma [10], [11] or left unconstrained, (2) a >115 Ma constraint was applied to the divergence of Poales from other monocots based on the earliest fossils of Poales [41], [42], and (3) a >125 Ma constraint was applied to the most basal divergence in eudicots [7]. To check the robustness of the time estimation to the choice of the substitution model, the codon-substitution [21], [22]+Γ₅ model was also used. The program adopts soft bounds, so that the probability that the true divergence time is outside the bounds is small but not zero [43]. In the Bayesian framework, priors are assigned not only on times, but also on the overall substitution rate parameter μ and on the rate-drift parameter σ². We therefore roughly estimated the prior mean of the overall rate μ using the strict molecular clock with a 295 Ma constraint on the Gymnosperm/Angiosperm divergence time, and assigned the gamma priors G(4, 80) and G(4, 22) to this parameter when applying the nucleotide and codon substitution models, respectively. We next examined the impact of the rate-drift parameter σ² by giving various priors for σ² when applying the nucleotide substitution model. Posterior distributions of parameters were approximated using two independent MCMC analyses of 10⁷ steps each, following a discarded burn-in of 10⁶ steps. All the analyses were repeated with different inseed values to check for convergence of the MCMC chain.
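
To make the gamma priors more concrete, the sketch below computes their means and variances. It assumes that G(α, β) has mean α/β and variance α/β², which appears to be the PAML convention but is not stated in the text, and that one time unit is 100 Ma, as in the supporting tables. The function name is illustrative.

```python
def gamma_mean_var(alpha, beta):
    """Mean and variance of a gamma distribution with shape alpha and rate beta.

    ASSUMPTION: beta acts as a rate (mean = alpha / beta), which is not
    spelled out in the text.
    """
    return alpha / beta, alpha / beta ** 2

priors = {
    "mu, nucleotide model G(4, 80)": (4.0, 80.0),
    "mu, codon model G(4, 22)": (4.0, 22.0),
    "sigma^2 G(1, 10)": (1.0, 10.0),
}

moments = {name: gamma_mean_var(a, b) for name, (a, b) in priors.items()}

for name, (mean, var) in moments.items():
    print(f"{name}: mean = {mean:.4f}, variance = {var:.6f}")

# With a time unit of 100 Ma (1e8 years), a rate of 0.05 per time unit
# corresponds to 0.05e-8 = 5e-10 substitutions per site per year.
mu_per_year = moments["mu, nucleotide model G(4, 80)"][0] * 1e-8
print(f"prior mean of mu (nucleotide model): {mu_per_year:.1e} per site per year")
```

Under these assumptions the nucleotide-model prior is centred on 0.05 substitutions per site per 100 Ma, and the σ² prior used for Tables 1–3 has mean 0.1.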

Non-synonymous/synonymous rate ratio

We applied codon-based likelihood models that allow for variable ω ratios among lineages [44] to the concatenated sequences of 75 protein-encoding genes of chl from 6 Poaceae species (Oryza, Triticum, Hordeum, Zea, Saccharum, and Sorghum), Typha, and Musa. We used the likelihood ratio test (LRT) to compare the likelihood of the one-ω ratio model, which assumes the same ω for all branches in the tree, with that of the two-ω ratio models, which assume two different ω ratios. One of the two-ratio models (named “Simple 2ω-model”) assumes that Poaceae (including the common ancestral branch) has a different ω from other parts of the tree, while the other (named “Reverted 2ω-model”) assumes that only the ancestral branch of Poaceae has a different ω ratio from all the other branches in the tree. All the analyses were carried out with the CODEML program in PAML [19] using the codon-substitution model with the F61 codon frequency.
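
The one-ratio versus two-ratio comparison can be sketched as follows. This is a minimal illustration, not the CODEML implementation: it assumes that the two nested models differ by a single free parameter (the extra ω), so that twice the log-likelihood difference is compared with a χ² distribution with one degree of freedom. The function name and the log-likelihood values are hypothetical.

```python
import math

def lrt_one_df(lnl_null, lnl_alt):
    """Likelihood ratio test for nested models differing by one parameter.

    Returns the test statistic 2 * (lnL_alt - lnL_null) and its P value
    under a chi-square distribution with 1 degree of freedom.
    """
    stat = max(2.0 * (lnl_alt - lnl_null), 0.0)
    # For 1 d.f., the chi-square upper tail equals erfc(sqrt(stat / 2)).
    p_value = math.erfc(math.sqrt(stat / 2.0))
    return stat, p_value

# HYPOTHETICAL log-likelihoods: one-ratio model (null) vs two-ratio model.
stat, p_value = lrt_one_df(lnl_null=-1000.0, lnl_alt=-998.0)
print(f"2*delta(lnL) = {stat:.2f}, P = {p_value:.4f}")
```

A log-likelihood gain of about 1.92 units or more from the extra ω parameter corresponds to P < 0.05 in this one-degree-of-freedom test.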

Branch-site test of positive selection

The branch-site test was applied to the 11 monocot species in our dataset, excluding the two most basal monocots, Dioscorea and Acorus. The branch preceding the common ancestor of the core Poaceae was specified as the foreground branch, and all the others as background branches. The LRT compares an alternative model that allows some codons to be under positive selection on the foreground branch with a null model that does not. The null model restricts codons on the foreground lineage to neutral evolution (ω = 1). The specific codons that evolved under positive selection on the foreground branch were identified using a Bayes empirical Bayes procedure [25], [26].

Supporting Information

Table S1.

Impact of the shape and scale parameters (α and β) in the gamma prior for parameter σ² using the IR model with the >65 Ma constraint to the Zea/Oryza separation. 95% HPD is shown in parentheses. Times and rates are given in units of 100 Ma (10⁸ years ago) and 10⁻⁸ substitutions/site/year, respectively.

https://doi.org/10.1371/journal.pone.0005297.s001

(0.04 MB DOC)

Table S2.

Impact of the shape and scale parameters (α and β) in the gamma prior for parameter σ² using the CR model with the >65 Ma constraint to the Zea/Oryza separation. 95% HPD is shown in parentheses. Times and rates are given in units of 100 Ma (10⁸ years ago) and 10⁻⁸ substitutions/site/year, respectively.

https://doi.org/10.1371/journal.pone.0005297.s002

(0.04 MB DOC)

Table S3.

Impact of the shape and scale parameters (α and β) in the gamma prior for parameter σ² using the IR model without the constraint to the Zea/Oryza separation. 95% HPD is shown in parentheses. Times and rates are given in units of 100 Ma (10⁸ years ago) and 10⁻⁸ substitutions/site/year, respectively.

https://doi.org/10.1371/journal.pone.0005297.s003

(0.04 MB DOC)

Table S4.

Impact of the shape and scale parameters (α and β) in the gamma prior for parameter σ² using the CR model without the constraint to the Zea/Oryza separation. 95% HPD is shown in parentheses. Times and rates are given in units of 100 Ma (10⁸ years ago) and 10⁻⁸ substitutions/site/year, respectively.

https://doi.org/10.1371/journal.pone.0005297.s004

(0.04 MB DOC)

Table S5.

Posterior estimates of divergence times by MCMCTREE in PAML [19] using the codon-substitution+Γ₅ model with the >65 Ma constraint to the Zea/Oryza separation. Shape and scale parameters, α and β, in the gamma prior for parameter σ² were 1.0 and 10.0, respectively. 95% HPD is shown in parentheses. Rate (10⁻⁸ substitutions/codon/year) refers to the rate of the branch preceding the node. Node numbers refer to those in Fig. 3, and taxa in parentheses refer to those branched off from the lineage leading to Oryza.

https://doi.org/10.1371/journal.pone.0005297.s005

(0.04 MB DOC)

Table S6.

Posterior estimates of divergence times by MCMCTREE in PAML [19] using the codon-substitution+Γ₅ model without constraint to the Zea/Oryza separation. Shape and scale parameters, α and β, in the gamma prior for parameter σ² were 1.0 and 10.0, respectively. 95% HPD is shown in parentheses. Rate (10⁻⁸ substitutions/codon/year) refers to the rate of the branch preceding the node. Node numbers refer to those in Fig. 3, and taxa in parentheses refer to those branched off from the lineage leading to Oryza.

https://doi.org/10.1371/journal.pone.0005297.s006

(0.04 MB DOC)

Table S7.

Posterior estimates of divergence times by MCMCTREE in PAML [19] using the codon-substitution+Γ₅ model excluding Poaceae. Shape and scale parameters, α and β, in the gamma prior for parameter σ² were 1.0 and 10.0, respectively. 95% HPD is shown in parentheses. Rate (10⁻⁸ substitutions/codon/year) refers to the rate of the branch preceding the node. Node numbers refer to those in Fig. 3, and taxa in parentheses refer to those branched off from the lineage leading to Oryza.

https://doi.org/10.1371/journal.pone.0005297.s007

(0.04 MB DOC)

Acknowledgments

We thank James Crabbe for improving the manuscript. Simon Joly provided thoughtful comments that helped improve the manuscript.

Author Contributions

Conceived and designed the experiments: BZ MH. Analyzed the data: BZ TY. Wrote the paper: BZ TY YZ MH.

References

1. Jacobs BF, Kingston JD, Jacobs LL (1999) The origin of grass-dominated ecosystems. Ann Mo Bot Gard 86: 590–643.
2. Gaut BS, Muse SV, Clark WD, Clegg MT (1992) Relative rates of nucleotide substitution at the rbcL locus of monocotyledonous plants. J Mol Evol 35: 292–303.
3. Bousquet J, Strauss SH, Doerksen AH, Price RA (1992) Extensive variation in evolutionary rate of rbcL gene sequences among seed plants. Proc Natl Acad Sci USA 89: 7844–7848.
4. Muse SV, Gaut BS (1997) Interlocus comparisons of the nucleotide substitution process in the chloroplast genome. Genetics 146: 393–399.
5. Chaw S-M, Chang C-C, Chen H-L, Li W-H (2004) Dating the monocot–dicot divergence and the origin of core eudicots using whole chloroplast genomes. J Mol Evol 58: 424–441.
6. Jansen RK, Cai Z, Raubeson LA, Daniell H, dePamphilis CW, et al. (2007) Analysis of 81 genes from 64 plastid genomes resolves relationships in angiosperms and identifies genome-scale evolutionary patterns. Proc Natl Acad Sci USA 104: 19369–19374.
7. Moore MJ, Bell CD, Soltis PS, Soltis DE (2007) Using plastid genome-scale data to resolve enigmatic relationships among basal angiosperms. Proc Natl Acad Sci USA 104: 19363–19368.
8. Martin W, Deusch O, Stawski N, Grunheit N, Goremykin V (2005) Chloroplast genome phylogenetics: why we need independent approaches to plant molecular evolution. Trends Plant Sci 10: 203–209.
9. Leebens-Mack J, Raubeson LA, Cui L, Kuehl JV, Fourcade MH, et al. (2005) Identifying the basal angiosperm node in chloroplast genome phylogenies: sampling one's way out of the Felsenstein zone. Mol Biol Evol 22: 1948–1963.
10. Prasad V, Stroemberg CAE, Alimohammadian H, Sahni A (2005) Dinosaur coprolites and the early evolution of grasses and grazers. Science 310: 1177–1180.
11. Piperno DR, Sues HD (2005) Dinosaurs dined on grass. Science 310: 1126–1128.
12. Thorne JL, Kishino H, Painter IS (1998) Estimating the rate of evolution of the rate of molecular evolution. Mol Biol Evol 15: 1647–1657.
13. Kishino H, Thorne JL, Bruno WJ (2001) Performance of a divergence time estimation method under a probabilistic model of rate evolution. Mol Biol Evol 18: 352–361.
14. Sanderson MJ (1997) A nonparametric approach to estimating divergence times in the absence of rate constancy. Mol Biol Evol 14: 1218–1232.
15. Rannala B, Yang Z (2007) Inferring speciation times under an episodic molecular clock. Syst Biol 56: 453–466.
16. Huelsenbeck JP, Larget B, Swofford D (2000) A compound Poisson process for relaxing the molecular clock. Genetics 154: 1879–1892.
17. Drummond AJ, Ho SYW, Phillips MJ, Rambaut A (2006) Relaxed phylogenetics and dating with confidence. PLoS Biol 4: e88.
18. Wolfe KH, Gouy MY, Yang W, Sharp PM, Li W-H (1989) Date of the monocot-dicot divergence estimated from chloroplast DNA sequence data. Proc Natl Acad Sci USA 86: 6201–6205.
19. Yang Z (2007) PAML 4: Phylogenetic Analysis by Maximum Likelihood. Mol Biol Evol 24: 1586–1591.
20. Vicentini A, Barber JC, Aliscioni SS, Giussani LM, Kellogg EA (2008) The age of the grasses and clusters of origins of C₄ photosynthesis. Global Change Biol 14: 2963–2977.
21. Goldman N, Yang Z (1994) A codon-based model of nucleotide substitution for protein-coding DNA sequences. Mol Biol Evol 11: 725–736.
22. Muse SV, Gaut BS (1994) A likelihood approach for comparing synonymous and nonsynonymous nucleotide substitution rates, with application to the chloroplast genome. Mol Biol Evol 11: 715–724.
23. Yang Z (2006) Computational Molecular Evolution. Oxford: Oxford Univ Press.
24. Akaike H (1973) Information theory and an extension of the maximum likelihood principle. In: Petrov BN, Csaki F, editors. Second International Symposium on Information Theory. Budapest: Akademiai Kiado. pp. 267–281.
25. Zhang J, Nielsen R, Yang Z (2005) Evaluation of an improved branch-site likelihood method for detecting positive selection at the molecular level. Mol Biol Evol 22: 1–8.
26. Yang Z, Wong WSW, Nielsen R (2005) Bayes empirical Bayes inference of amino acids sites under positive selection. Mol Biol Evol 22: 1107–1118.
27. Matsuoka Y, Yamazaki Y, Ogihara Y, Tsunewaki K (2002) Whole chloroplast genome comparison of rice, maize, and wheat: implications for chloroplast gene diversification and phylogeny of cereals. Mol Biol Evol 19: 2084–2091.
28. Bremer K (2002) Gondwanan evolution of the grass alliance of families (Poales). Evolution 56: 1374–1387.
29. Sanderson MJ, Thorne JL, Wikström N, Bremer K (2004) Molecular evidence on plant divergence times. Am J Bot 91: 1656–1665.
30. Aris-Brosou S, Yang Z (2003) Bayesian models of episodic evolution support a late Precambrian explosive diversification of the Metazoa. Mol Biol Evol 20: 1947–1954.
31. Nikaido M, Matsuno F, Hamilton H, Brownell RL Jr, Cao Y, et al. (2001) Retroposon analysis of major cetacean lineages: The monophyly of toothed whales and the paraphyly of river dolphins. Proc Natl Acad Sci USA 98: 7384–7389.
32. Hasegawa M, Thorne J, Kishino H (2003) Time scale of eutherian evolution estimated without assuming a constant rate of molecular evolution. Genes Genet Syst 78: 267–283.
33. Yoder AD, Yang Z (2004) Divergence dates for Malagasy lemurs estimated from multiple gene loci: geological and evolutionary context. Mol Ecol 13: 757–773.
34. Lepage T, Bryant D, Philippe H, Lartillot N (2007) A general comparison of relaxed molecular clock models. Mol Biol Evol 24: 2669–2680.
35. Kitazoe Y, Kishino H, Waddell PJ, Nakajima N, Okabayashi T, et al. (2007) Robust time estimation reconciles views of the antiquity of placental mammals. PLoS ONE 2: e384.
36. Brown JW, Rest JS, Garcia-Moreno J, Sorenson MD, Mindell DP (2008) Strong mitochondrial DNA support for a Cretaceous origin of modern avian lineages. BMC Biol 6: 6.
37. Renner SS, Grimm GW, Schneeweiss GM, Stuessy TF, Ricklefs RE (2008) Rooting and dating maples (Acer) with an uncorrelated-rates molecular clock: Implications for North American/Asian disjunctions. Syst Biol 57: 795–808.
38. Erixon P, Oxelman B (2008) Whole-gene positive selection, elevated synonymous substitution rates, duplication, and indel evolution of the chloroplast clpP1 gene. PLoS ONE 3: e1386.
39. Guisinger MM, Kuehl JV, Boore JL, Jansen RK (2008) Genome-wide analyses of Geraniaceae plastid DNA reveal unprecedented patterns of increased nucleotide substitutions. Proc Natl Acad Sci USA 105: 18424–18429.
40. Smith SA, Donoghue MJ (2008) Rates of molecular evolution are linked to life history in flowering plants. Science 322: 86–89.
41. Herendeen PS, Crane PR (1995) The fossil history of the monocotyledons. In: Rudall PJ, Cribb PJ, Cutler DF, Humphries CJ, editors. Monocotyledons: Systematics and Evolution. London: Royal Botanic Gardens, Kew. pp. 1–21.
42. Linder HP, Rudall PJ (2005) Evolutionary history of Poales. Annu Rev Ecol Evol Syst 36: 107–124.
43. Yang Z, Rannala B (2006) Bayesian estimation of species divergence times under a molecular clock using multiple fossil calibrations with soft bounds. Mol Biol Evol 23: 212–226.
44. Yang Z, Bielawski JP (2000) Statistical methods for detecting molecular adaptation. Trends Ecol Evol 15: 496–503.
45. Maier RM, Neckermann K, Igloi GL, Kossel H (1995) Complete sequence of the maize chloroplast genome: Gene content, hotspots of divergence and fine tuning of genetic information by transcript editing. J Mol Biol 251: 614–628.

