Species in the ivesioid clade of Potentilla (Rosaceae) are endemic to western North America, an area that underwent widespread aridification during the global temperature decrease following the Mid-Miocene Climatic Optimum. Several morphological features interpreted as adaptations to drought are found in the clade, and many species occupy extremely dry habitats. Recent phylogenetic analyses have shown that the sister group of this clade is Potentilla section Rivales, a group with distinct moist habitat preferences. This has led to the hypothesis that the ivesioids (genera Ivesia, Horkelia and Horkeliella) diversified in response to the late Tertiary aridification of western North America. We used phyloclimatic modeling and a fossil-calibrated dated phylogeny of the family Rosaceae to investigate the evolution of the ivesioid clade. We have combined occurrence- and climate data from extant species, and used ancestral state reconstruction to model past climate preferences. These models have been projected into paleo-climatic scenarios in order to identify areas where the ivesioids may have occurred. Our analysis suggests a split between the ivesioids and Potentilla sect. Rivales around Late Oligocene/Early Miocene (∼23 million years ago, Ma), and that the ivesioids then diversified at a time when summer drought started to appear in the region. The clade is inferred to have originated on the western slopes of the Rocky Mountains from where a westward range expansion to the Sierra Nevada and the coast of California took place between ∼12-2 Ma. Our results support the idea that climatic changes in southwestern North America have played an important role in the evolution of the local flora, by means of in situ adaptation followed by diversification.
Citation: Töpel M, Antonelli A, Yesson C, Eriksen B (2012) Past Climate Change and Plant Evolution in Western North America: A Case Study in Rosaceae. PLoS ONE 7(12): e50358. doi:10.1371/journal.pone.0050358
Editor: Carles Lalueza-Fox, Institut de Biologia Evolutiva - Universitat Pompeu Fabra, Spain
Received: April 25, 2012; Accepted: October 24, 2012; Published: December 7, 2012
Copyright: © 2012 Töpel et al. This is an open-access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited.
Funding: This work was supported by Helge Ax:son Johnsons stiftelse (http://www.haxsonj.se/), Nilsson-Ehle donationerna (http://www.fysiografen.se/) (nr: 24966) (MT); The Swedish Research council (http://vr.se/), Carl Tryggers Stiftelse (http://www.carltryggersstiftelse.se/) and Helge Ax:son Johnsons Stiftelse (AA); Wilhelm och Martina Lundgrens Vetenskapsfond (http://www.wmlundgren.se/) (vet1-348/2007), Adlerbertska forskningsstiftelsen (B 432 835/07) and The Swedish Research council (2004-1698) (BE). The funders had no role in study design, data collection and analysis, decision to publish, or preparation of the manuscript.
Competing interests: The authors have declared that no competing interests exist.
Understanding the influence of climate change on the evolution and distribution of the world's biota constitutes a major task in biology. An accurate estimation of how species have responded to changes in the past may enable us to better predict future responses to global warming, with far-reaching implications influencing the work of policy-makers and conservational biologists .
A suitable area for assessing the effect of climate change on plant evolution is western North America. This is a botanically diverse region, rich in both total species numbers and proportion of endemic species, and has undergone major climatic and geologic changes during the Cenozoic (the last 65 Ma). At the beginning of the Eocene (∼55.8-33.9 Ma) a warm and humid tropical climate prevailed in the region, but global cooling has since then gradually changed the conditions . Onset of glaciation in Antarctica by the end of the Eocene was accompanied by rapid decline of global deep-sea temperatures . Increased upwelling of cool Pacific ocean water off the Californian coast eventually led to summer drought by mid-Miocene (∼15 Ma) . Global cooling also strengthened the westerlies , which increased winter precipitation after mid-Miocene (∼11.6 Ma). A Mediterranean type of climate, with summer droughts and winter precipitation, was in place in Late Miocene (∼10 Ma) . Climate change in the area has been suggested to trigger the evolution of evening primroses (genus Oenothera, family Onagraceae) –, but several questions remain concerning how general niche conservatism/lability has been in the area, and from which areas and habitat zones the local flora originated.
The ‘ivesioids’ are a well-supported plant clade – confined to western North America . It is nested within Potentilla L. (cinquefoil) in the Rosaceae – a cosmopolitan family of large ecological and economic importance, which includes many edible fruits (apples, plums, cherries, pears, strawberries, almonds) as well as ornamentals (roses, firethorns, hawthorns). As currently circumscribed (Figures 1 and S1; –, –), the ivesioid clade includes more than 50 species classified in three genera: Ivesia, Horkelia and Horkeliella , –. Common to many of them is that they grow under extremely dry conditions and have developed means to avoid drought (petrophily on protected rock faces, tolerance of alkalinity) or minimize water loss (increased pubescence, numerous minute leaflet segments in a tightly overlapping arrangement).
Maximum clade credibility tree obtained from 25000 post burn-in Bayesian chronograms generated in BEAST, with median branch lengths. Grey bars at nodes represent 95% Highest Posterior Densities of node ages. The red dots indicates age constraints used for the analysis; (1) The split between Rosales and Fabales was constrained to an age of 104–115 Ma based on a previous analysis , and (2) a Crataegites borealis fossil was used to set a conservative minimum age of 85.8 Ma on Rosaceae , . Subclades of Rosaceae were calibrated using fossil data from (3) Neviusia, 48.7 Ma , (4) Chamaebatiaria, 26.85 Ma , (5) Holodiscus, 34.1 Ma , (6) Spiraea, 48 Ma , (7) Rosa, 34.1 Ma , (8) Fragaria, 2.5 Ma , (9) Potentilla 11.6 Ma . A uniform prior with a maximum age of 115 Ma was used for all calibration points. Also indicated are the tribes of Rosaceae (species highlighted in blue and yellow) as well as the ivesioid clade highlighted in red. Time scale from .
Potentilla sect. Rivales is the sister group of the ivesioids –. Species in this group preferably occupy seasonally inundated flats or lake and stream shores, and have a widespread distribution in the Northern hemisphere. In contrast, the ivesioid species usually reside in extremely arid regions, alpine habitats and sites with a Mediterranean type of climate in the Great Basin (Figure 2) and adjacent arid parts of western North America, and comprise many narrowly endemic species , –.
Shaded area represents the exten of the Great Basin. Locations where climate data were sampled for the bioclimatic models for extant species are marked with dots. State names are abbreviated: AZ – Arizona, CA – California, CO – Colorado, ID – Idaho, MT – Montana, NE – Nevada, NM – New Mexico, OR – Oregon, UT – Utah, WA – Washington and WY – Wyoming. Background image by www.earthobservatory.nasa.gov.
Phyloclimatic modeling , – combines phylogenetic estimation of species relationships with bioclimatic models . These models use climate data from known species locations to predict areas of suitable climate for that species, by projecting the models into a present-day climatic scenario. They can thus estimate the total potential distribution of species even when not all localities and populations have been sampled. Furthermore, different methods for ancestral state reconstruction can be used to reconstruct the climatic preferences for ancestral nodes in a dated phylogeny. Historical distributions regulated by climatic conditions can then be estimated by projecting the optimized models into past climate scenarios, leading to an estimate of ancestral distributions. These models can thus be used to evaluate the evolutionary importance of niche conservatism for producing the distribution of plant diversity seen today (e.g., ), and help predict how this diversity may be affected in the future by global warming.
The primary objective of this study is to test the hypothesis that species in the ivesioid clade evolved in response to late Tertiary development of dry conditions in western North America. Under such circumstances, we would expect its stem node to have originated in western North America, and that the crown age of this clade - reflecting the onset of diversification of dry-adapted species - is not older than the proposed time of the aridification in the region. To address this, we have performed a molecular dating analysis of a plastid phylogeny of Rosaceae to establish the age of the ivesioid clade and produced niche models for both extant species and well-supported nodes of the phylogeny. Projections of these models into palaeoclimatic scenarios were used to estimate the geographic origin of the group and to infer changes in geographical distributions over time.
Materials and Methods
A taxonomically representative set of sequences (selected to represent all subfamilies of Rosaceae; see Table 1) from the plastid matK and trnL-trnF intergenic spacer was downloaded from the National Center for Biotechnology Information (www.ncbi.nlm.nih.gov). Pisum sativum (Fabaceae) was chosen as outgroup for the analysis, and Rhamnus cathartica (Rhamanaceae) was also included as representative for another family in the order Rosales.
In addition, new sequences from species in genus Potentilla, including the ivesioid clade, were generated. The matK region was amplified and sequenced with the trnk-3914 FM primer  and the matK2R primer . The trnL intron and the trnL-trnF intergenic spacer were amplified with the trnLc and trnLf primers . Two additional primers, trnLe and trnLd, together with the PCR amplification primers were used for sequencing. The PCR amplification of the two regions was performed using 12.5 µl MasterAmp 2× PCR PreMix G (Epicentre Biotechnologies, Madison, Wisconsin, USA), 0.6 µM of the forward and reverse primers, 1 unit Thermoprime Plus DNA Polymerase (ABgene House, Epsom, UK), 1 µl template DNA and purified water to a final volume of 25 µl.
The PCR mix was heated to 95°C for 5 minutes followed by 35–45 cycles of a denaturation step at 95°C for 30 seconds, annealing at 55°C for 30 seconds and extension for 1 minute (trnL/F) or 2 minutes (matK) at 72°C. The program ended with an additional 10 minutes (trnL/F) or 7 minutes (matK) extension step at 72°C. The resulting PCR products were sequenced by Macrogen Inc. (Seoul, Korea). The matK and trnL/F sequences (Table 1) were then aligned separately with mafft-linsi v.6.717b  and subsequently concatenated into a common matrix.
Phylogenetic inference and Molecular dating
We investigated whether the sequences evolved in a clocklike way by generating a neighbor joining tree in PAUP  and comparing Maximum Likelihood scores calculated from the data with and without enforcing a molecular clock. A likelihood ratio (LR) test was then performed with LR = 2 (Lmol. clock enforced−Lno mol. clock enforced) and assumed to be distributed as a χ2 with S-2 degrees of freedom, S being the number of taxa in the dataset. Since the LR test rejected a molecular clock (p<0.001), we chose to estimate divergence times with the relaxed clock algorithm implemented in the software BEAST v.1.6.1  using the beagle library for likelihood calculations . Fourteen runs of 10 million generations were performed, assuming an uncorrelated lognormal clock model and a pure birth (Yule) process under the GTR+Γ model, sampling every 2500th generation. The nucleotide substitution model was selected using the program MrAic  and the Aikaike information criterion. Performance of the analysis (convergence of the independent runs and effective sample sizes for all sampled parameters) was evaluated using Tracer v.1.5 , after which 2500 trees were removed from each of the fourteen tree sets as the initial burn-in. Median and 95% Highest Posterior Density (HPD) intervals of node ages were then calculated from the remaining 21000 trees using the software TreeAnnotator v.1.6.1 .
The crown age of the tree, corresponding to the split between Fabales (here represented by Pisum) and Rosales (all other species), was set as a uniform prior between 104 and 115 Ma. This interval corresponds to the lower age estimate for the Rosales stem lineage and the upper age estimate for the Rosid crown group, respectively, as inferred in BEAST in a large fossil-based dating analysis of the Rosids . Although this maximum age may be incorrect, in the absence of further evidence we consider this a conservative assumption since the Rosales clade has a well-supported position within the Rosids (Figure 1 in ). In addition, seven carefully chosen fossils were used to impose minimal age constraints on the prior distributions.
The oldest fossil assigned to the crown group of Rosaceae is Crataegites borealis  from the Kolyma area in Siberia. It belongs to the Bour-kemuss Formation of the Zyrianka Coal Basin and has been dated to Early Albian (99.6–112 Ma) in the Cretaceous, by stratigraphic methods ( and references therein). The 40Ar/39Ar dates for the geographically adjacent but stratigraphically younger Chauna group tephra was determined to fall within the Coniacian stage (85.8–89.3 Ma) in late Cretaceous . Crataegites borealis is based on a number of very well preserved leaf imprints . The similarity to modern-day leaves of Crataegus is striking and there is no obvious reason to dispute the taxonomic position of the fossils in the crown group of Rosaceae.
Fossils of Spiraea (Amygdaloideae) and Neviusia (Rosoideae) were found at Republic, Washington, USA  and dated to 48–49 Ma . Representatives of the genera Holodiscus (Amygdaloideae; ) and Rosa (Rosoideae; ) are known from Florissant, Colorado and are dated to 34.1 Ma in Late Eocene . Chamaebatiaria (Rosoideae) fossils belong to the Creede Flora, Colorado , and the formation in which they were found has been dated to early Late Oligocene (26.85 Ma ). The oldest fossils of the genus Potentilla (Rosoideae; ) are from brown coal strata in Lausitz, Germany, formed in Early–Middle Miocene (11.6–23.0 Ma; ). Reference to an older Potentilla fossil is given by Wolfe and Schorn  from the Creede Flora in North America (27.3 Ma). The fossil, a leaf imprint, was originally described as a member of Ranunculaceae by Axelrod  but was reclassified to Potentilla/Rosaceae by Wolfe and Schorn . We have examined the photography of this fossil and dispute its reclassification, choosing instead the younger European fossil for calibration of the genus. Macrofossils of Fragaria (Rosoideae) were found in the Beaufort formation, Prince Patrick Island in the Canadian Arctic . The Beaufort formation is considered to be of the same age as the Lost Chicken tephra in Alaska dated to 2.9±0.4 Ma .
Species distribution data
Locality data for ivesioid species were downloaded from the Global Biodiversity Information Facility portal (www.gbif.org), Jepson Online interchange (ucjeps.berkeley.edu/interchange.html) and the Consortium of Pacific Northwest Herbaria websites (www.pnwherbaria.org). Duplicated data points were removed manually. If, in total, less than ten locations were found in the online databases, more locality data were collected from herbarium labels. The occurrence data was then plotted in a GIS using QGIS (http://qgis.org/), to verify that it agreed with current known distributions. Data points were this way cleaned by visual inspection (e.g. samples from coastal species ending up in the ocean were excluded).
Climate datasets for present day conditions (experiment set named xakxu) and paleoclimatic scenarios for 10 Ma (xakfl), 8 Ma (xakxu) and 3 Ma (xaiud) were provided by the BRIDGE project (www.bridge.bris.ac.uk/resources/simulations). Each dataset contained the sixteen climate variables listed in Table 2.
Selection of climate variables
Various strategies have been proposed for selecting climate variables to use in bioclimatic modeling. Methods for selecting or rejecting variables have included the quantification of variable contribution to the model, or specifically for phyloclimatic modeling , an assessment of the phylogenetic conservatism of individual variables, but most have investigated the correlation of prospective variables , –. Correlated climate variables will emphasize certain climate components (e.g. temperature or precipitation) if included in the analysis, and potentially result in incorrect inference of climate models. Thuiller  used principal component analysis to select uncorrelated variables for the models and Beaumont et al. , evaluated several different methods, including random sampling of variables, to assess the extent to which parameter choice influenced the predicted areas. The latter investigation showed that the size of the predicted area of distribution decreased when more climate variables were included in the analysis. Hence, selecting climate variables is an important step in inference of ancestral distribution areas.
We have used a novel method to exclude correlated variables while taking their prediction power in the form of Area under the Receiver Operating Characteristic (AUC) values into account. AUC is a measure of how well a model discriminates between sites where a species is present, compared to where it is absent . The values range from 0 to 1, where a score of 1 indicates a perfect prediction of distribution and a score of 0.5 equals a random prediction of sites . We produced bioclimatic models for each of the 38 species in the ivesioid clade, using one of the sixteen variables at a time. We recorded the AUC value for each model, and hence, each climate variable given a particular species, and the number of environmentally unique occurrence points for each analysis. An environmentally unique occurrence point is a species location with a value not previously sampled by the projected model. We then calculated the mean AUC value from each analysis with ten or more environmentally unique locations.
Correlation between the sixteen climate variables was then assessed using the function cor in the statistical package R . Variable pairs with correlation coefficients greater than 0,8 were identified and the climate variable with the lowest AUC value was excluded (Table 2). The four variables remaining after this exclusion process were used to build the bioclimatic models for the extant species and the ancestral nodes.
Bioclimatic models for extant species
Locality data together with the four selected climate variables (Table 2) were used to define the climate preferences for each of the 38 ivesioid species, using the Envelope Score algorithm implemented in OpenModeller v.1.1.0 (openmodeller.sourceforge.net). The Envelope score is a modified version of the Bioclimatic Envelope Algorithm (Bioclim) that uses the observed maximum and minimum values in each environmental variable to determine the climate preferences for a taxon . These preferences, called the bioclimatic envelope, can then be projected into a climate scenario to identify areas with a suitable climate for the taxon. The probability of a suitable environment in the projected model is determined by the number of layers with a value within the min-max threshold, divided by the total number of layers in the model .
The Bioclim methodology treats the environmental parameters independently of each other. This is a prerequisite for the ancestral state reconstruction where each variable in the bioclimatic envelop has to be optimized independently with currently available methods. Also, the simplicity of the algorithm makes it possible to combine these optimized variables to an ancestral bioclimatic envelope. More complex algorithms do not permit this independent treatment of variables as they attempt to account for the correlation between variables, and have therefore not been used for phylogenetic niche modeling .
Ancestral state reconstruction
Ancestral climate preferences were reconstructed for each node in the ivesioid phylogeny (Figures 3 and S2) using the function ace in the package ape  of the statistical program R . Independent optimizations were done for the maximum and the minimum values of each variable by fitting a Brownian motion model using Maximum Likelihood optimization . Optimized models for each node are presented in table 3.
Posterior probability (pp.) greater than 0.5 is shown. Branches with pp. greater than 0.95 are shown with thicker lines. Nodes for which ancestral models were projected into climate scenarios are indicated with numbers in boldface.
Ancestral bioclimatic model
The optimized maximum and minimum values for the four climate variables were used to build bioclimatic models for all nodes in the ivesioid clade. Models for nodes with a posterior probability higher than 0.95 were then projected in to the climate scenario that corresponded best in time with the age of the nodes as follows: node 40, 41, 43 and 71 were projected in to the climate scenario for 10 Ma; node 57 in a 8 Ma scenario; node 49, 54, 55, 58, 66 and 73 in the climate scenario for 3 Ma.
The same models were also projected into present-day climate data to evaluate whether the variation in predicted geographic area between nodes in the tree depends on variation in the climate scenarios or in the inferred models. By keeping one of these variables constant (in this case the climate scenario), any variation in the inferred area with a suitable climate will depend on the inferred model. Differences between optimised models can that way be visualised. This analysis was performed to identify shifts in climate preferences during the evolution of the ivesioids. Additionally, the comparison of models for extant taxa projected into the present-day climate scenario, with ancestral niches projected into present-day climate scenarios permits a visual comparison of the differences between the extant and ancestral niches.
Test of models
AUC values for all niche models of extant taxa were calculated. A test of the correlation of the projected surfaces for all extant taxa was performed using the niche.overlap tool in the phyloclim package  of R . Additionally, the age.range.correlation tool (also in phyloclim) was used to test for correlation between the niche overlap of two taxa and age to their most recent common ancestor (MRCA).
Phylogenetic inference and Molecular dating
The dated phylogeny of the Rosaceae family (Figures 1 and S1) identified the three subfamilies Rosoideae, Amygdaloideae and Dryadoideae as monophyletic. Except for the position of the species Lyonothamnus floribundus, the tribes presented by Potter et al.  were also in congruence with our phylogeny. Potentilla, and the ivesioids were inferred to be monophyletic.
The estimated 95% Highest Posterior Density (HPD) of the crown age of Rosaceae was 108.3-92.9 Ma (median 101.3 Ma). The tribe Potentilleae split off from Sanguisorbeae and Rosa 86.2-61.2 Ma (median 73.8 Ma) and the split between Potentilla and Fragariinae happened 78.7-52.8 Ma (median 65.4 Ma). The Potentilla crown group diversified between 68.2-43.1 Ma (median 55.2 Ma). Furthermore, the ivesioids formed a well-supported clade (pp. 1) and had a stem age of 31.6-15.9 Ma (median 23.4 Ma). Support for the internal topology of the clade was low, with a few exceptions. Ten clades with a posterior probability greater than 0.95 were identified and selected for further investigations (figure 3). Our results show that the ivesioids diversified between 24.3-12.1 Ma (median 17.7 Ma) into a clade with a mainly eastern (present day) Great Basin distribution (clade A; Figure 3), and a clade more or less confined to the Sierra Nevada and California in the west (clade B; Figure 3).
Species distribution data.
The number of occurrence points used varied between 10 and 256 for 36 of the 38 species (Table 4). Locality data for two species, Ivesia longibracteata (five points) and I. cryptocaulis (two points), were still less than the desired ten data points after the dataset had been complemented with data from herbarium collections.
The four climate variables selected to build the models by analyzing correlation and AUC values were Standard deviation of mean temperature, Mean temperature in coolest month, Mean daily precipitation in coolest month and Mean daily precipitation in warmest month. Table 4 shows the maximum and minimum values for the four climate variables for all included species. A visualisation of one character mapped onto the final phylogeny is shown in supplementary figure S2.
Bioclimatic models, extant species.
The projected areas range from the restricted species such as I. utahensis, which is an endemic to northern Utah, up to the wide-ranging species I. kingii, which finds climatically suitable areas in part of the Great Basin. The niche correlation analysis produced D and I correlation coefficients  for each pairwise comparison of species. Coefficients range from 0–1 signifying low to high correlation between age to most recent common ancestor and niche overlap. Mean D was 0.34, whilst mean I was 0.50. These values were consistent within clades (clade A: D = 0.31, I = 0.48; clade B: D = 0.35, I = 0.51) and between sister species pairs (D = 0.27, I = 0.46), indicating a weak relationship between niche divergence and phylogenetic divergence that does not vary between the major clades. This is confirmed by the correlation of niche overlap and age to the MRCA, which gives an insignificant correlation not different from zero (Adjusted r-squared −0.01, p = 0.43).
Bioclimatic models, ancestral nodes.
Figure 3 shows the fully resolved maximum clade credibility sub-tree of the ivesioids from the BEAST analysis. Ten of the branches have a posterior probability greater than 0.95 and are subjects for further investigation.
Projections into palaeoclimatic scenarios.
The reconstructed ancestral climate models, projected into their respective climate scenarios, are shown in figure 4. Node 40 is the MRCA of the ivesioid species and its sister clade Potentilla sect. Rivales, and hence represents the age of the ivesioid stem lineage. This lineage emerged at 23.4 Ma and is shown to diverge at 17.7 Ma (crown age; node 41 in figure 3). The bioclimatic model for node 40 projected into a climate scenario from 10 Ma indicates an area of suitable climate from where the clade could have evolved (areas marked in red in Figure 5a). Most of Utah, parts of Nevada, Arizona, Colorado and New Mexico are inferred to have had a suitable climate by all four variables.
Red areas are inferred to have had a suitable climate for the ancestral population by four climate variables and areas in yellow by three. Numbers corresponds to the nodes in the ivesioid clade in figure 3.
Areas in red are inferred by all four climate variables and areas in yellow by three of them.
In node 41, the suitable area inferred by all four climate variables has decreased, but still includes parts of Utah. Due to low support for the topology of the tree, the two well-supported clades (A and B in Figure 3) as well as four taxa with uncertain position (I. lycopodioides, I. longibracteata, I. jaegeri and I. bailey) are treated as being derived from this node. Three variables inferred the radiation in clade A (node 71) to Northeastern Nevada, Northern Arizona and New Mexico. The other node with support in clade A (node 73) inferred the Northern parts of the Sierra Nevada, Northwest Nevada and Southwestern Idaho as having had a suitable climate.
The inferred suitable area for node 43, MRCA of clade B, resembles that of node 41, but is weaker (only yellow areas in figure 4, map 43) and slightly more southern. A westward movement of suitable climate is seen in nodes 54, 55, 58 and 66, which have models predicting large parts of the Sierra Nevada and the coast of Northern California. The projected models for two nodes do not corroborate this westward movement of a suitable climate. They are Node 57, with a large part of the Great Basin, Western Montana and parts of Arizona and Canada inferred, and node 49 with only a small part of Southeastern Oregon inferred by three climate variables. Most models also show a weak support for a suitable climate on the East coast of North America and Europe (data not shown).
Projections of ancestral models into present-day climate.
Projections of the ancestral model for the MRCA with P. biennis (node 40) into present-day climate shows that the ivesioids originate from a climate corresponding to what is now found in the Sierra Nevada, Nevada, Southwestern Oregon and Northeast Arizona (Figure 5b). The preferences for present-day central Sierra Nevada climate prevails for all nodes in clade B (Figure 6; Maps 49, 54, 55, 57, 58 and 66) and are only slightly weakened for nodes 55, 58 and 66. The three latter nodes have an affinity for a climate found around the San Bernardino mountains in the south. As in the projections into palaeoclimate scenarios, there is a shift in climate preferences that includes the type of climate now found along the coast of California, sometime after 12.2 Ma (node 43).
The maps illustrate the inferred climate preferences of ancestral populations, by showing where this climate type is found today. Areas in red have been inferred by four climate variables and areas in yellow by three. Numbers corresponds to the nodes in the ivesioid clade in figure 3.
Phylogenetic inference and Molecular dating
The dated phylogeny of Rosaceae is congruent with previous analysis of the relationships in the family (Figures 1 and S1). The topology of the Potentilla clade was also congruent with that reported by Dobes and Pauli  and Töpel et al.  with few exceptions.
Locality data can be highly influential on the model predictions for extant species . It is important to use locality data from all climate regions occupied by the species to be able to create a model that predicts the true climate preferences for the species. Still, less than ten locality points were used in the analyses for I. longibracteata and I. cryptocaulis. Instead of following the procedure of Evans et al.  and manually add extra points from these areas we used only the observed locations, and thereby violated the rule of thumb of only including taxa with more than 10 or 20 data points in the analysis (10 points , 10–20 points ). The two species are both narrow endemics, with the former known only from Castle Crags (41.17°N, 122.33°W) in the Trinity Mountains, California, and the latter from the summit of Mt. Charleston (36.3°N, 115.6°W) in Spring Mountains, Nevada (Barbara Ertter, personal communication). We manually analyzed the climate data from these areas, and found that adding more occurrence points from the known area of distribution would sample the same values as were already in the model. In effect, each of the two species occupies only one climatic niche (at the scale of our data) in its known area of distribution. The Envelope Score algorithm, used to build the bioclimatic models, only uses the observed minimum and maximum value for each environmental variable to define the bioclimatic envelope of a species. Adding more points from these areas would therefore not change the models. The bioclimatic models for I. longibracteata and I. cryptocaulis might therefore predict a too narrow area of suitable climate if the distribution of these two species is not limited by the climate. Our primary goal is not to model extant species, but rather reconstruct ancestral models. We therefore believe that it is better to include these minimal models than to exclude these species from the analysis.
Origin of the ivesioid clade
The crown age of the ivesioid clade (24.3-12.1 Ma; median 17.7 Ma) corresponds to the time when summer drought started to appear in western North America . This supports the hypothesis that the group evolved in response to the Miocene aridification of western North America. Furthermore, the area where the ivesioids are inferred to have originated includes the eastern parts of the Great Basin and the western side of the Rocky Mountains (Figure 5a). This region represents the eastern extension of the present day distribution of the group. Potentilla biennis, sister species of the ivesioids, has a distribution from the Sierra Nevada in the west to North Dakota in the east, and from southern British Columbia and Oregon in the north to Arizona in the south. Hence, both species in the ivesioid clade and in the sister group Potentilla sect. Rivales can still be found in their optimized ancestral area. In addition, an area outside of the present area of distribution, corresponding to the southeastern parts of North America, is inferred by three variables to have had a suitable climate (yellow area in figure 5a). The hypothesis that the ancestor of the ivesioid clade evolved on the east side of the Rocky Mountains and migrated to the Great Basin, the Sierra Nevada and the coast of California is less parsimonious than an origin and diversification in the Great Basin. The stronger prediction of the Great Basin (red areas in figure 5a) also supports this notion.
The ivesioid clade is inferred to have originated in a climate resembling that of present-day western Nevada, the Sierra Nevada and southeast Oregon (Figure 5b), which indicates that the ancestor had fairly wide climate preferences. However, this result may be due to limitations in the method used for the ancestral state reconstruction and may be unduly influenced by the outgroup, Potentilla biennis, which has a relatively wide niche. This is a generic problem with ancestral state reconstruction, but any artificial widening of the ancestral niche preferences would still encompass the ‘true’ niche.
Diversification in the ivesioid clade
The ancestral niche models for nodes older than 10 million years (node 71 in clade A and node 43 in clade B, as well as the nodes 40 and 41, figure 4), more or less uniformly infer the central Great Basin as the ancestral area. These models are all projected into the same climate scenario, permitting a direct comparison without additional uncertainty caused by potentially conflicting palaeo-climate layers. Furthermore, the geographic areas identified when these models are projected into the present-day climate scenario are also very similar (Figure 6). Hence, the result from our analyses suggests that the diversification of the group and the emergence of the two clades A and B at approximately 17.7 Ma was not driven by climate change or a shift in climate preferences. This is supported by the low correlation between age to MRCA and niche overlap, and uniformly low niche correlation within and between clades. The split may instead have been associated with a shift in pollination syndrome.
Clade A consists of species with flowers that have shallow hypanthia and narrow filaments. Their morphology points towards a pollination syndrome involving small flies and beetles .
In contrast, clade B mostly consists of species with wide and flattened filaments, forming a cone on top of a deep hypanthium and are pollinated by bees or bumblebees . Most species in genus Potentilla have shallow hypanthia and narrow filaments, and an adaptation to a bee pollination syndrome in clade B could have been an important force for this split.
Only one clade with a posterior probability greater than 0.95 was found in clade A. It includes the two species Ivesia santolinoides and I. unguiculata, which do not occur in the Great Basin. Instead, these species have the most westerly distribution of species in clade A, and are only found in the Sierra Nevada and adjacent mountain ranges. The rest of the species in clade A are mainly confined to the interior of the Great Basin. From the MRCA of clade A and B (node 41), and further into clade A there is a narrowing of climate preferences, and a more westerly area of suitable climate inferred between 10.7 Ma (node 71) and 2.0 Ma (node 73). Projecting these models into present-day climate shows that the optimized climate models only change slightly (Figure 6), and the detected westward shift of suitable area is probably due to differences in the underlying paleoclimate scenarios used for the different nodes, thus not representing a change in climate preferences.
A similar pattern is seen in clade B. The ancestral area with a suitable climate is inferred to be the interior of the Great Basin until 7.6 Ma (Figure 4; maps 41, 43 and 57) for at least part of the clade. Furthermore, projections into present-day climate demonstrate that climate preferences of the ancestral nodes remained relatively stable until that time (Figure 6; Map 41, 43 and 57). At 4.5 Ma we find the earliest indication of preference for the climate of coastal California (Figure 4; Map 66). This type of climate preferences appears in several places in the tree after 4.5 Ma (Figure 4; Maps 54, 55 and 58). Hence, a westward migration, as seen in clade A, is also inferred to have happened in clade B, but continued past the Sierra Nevada to the coastal areas of California. The Mediterranean type of climate of this area emerged approximately 10 Ma . It is therefore reasonable to believe that species in clade B have found suitable habitats in the coastal regions of California on at least two occasions between 12.3 Ma (node 43) and 4.5 Ma (node 66), and between 7.6 Ma (node 57) and 1.7 Ma (node 58).
We observe a general pattern of niche conservatism amongst earlier lineages, until around 7,5-5 Ma (e.g node 57 in figure 3). There follows a greater amount of niche partitioning amongst related lineages, including a transition towards the coastal Mediterranean type climate in parts of clade B. This partitioning is evident for extant taxa as there are low levels of niche overlap between sister species. If sister species shared more similar niches we would expect to see a pattern of correlation between niche overlap and age to MRCA (i.e. that more closely related species have more similar niches), but this is not the case. The mean niche similarity within clades A and B is similar to the overall niche similarity for all species, so there is no major niche differentiation between clades. Many species pairs in clade B, such as I. shockleyi+I. sericoleuca and I. kingii+I. cryptocaulis, follow a schizo-endemic distribution pattern, i.e. one wider ranging species sister to a narrow endemic, but these sister groupings receive low support. This pattern has been reported for a number of plant groups in the Mediterranean region , and has been interpreted as the wider-ranging species being progenitor to the local endemic. Our results corroborate the generality of this pattern, that should be especially important in areas containing distinct micro-habitats (e.g., moist rock crevices in the middle of a wide arid zone, as observed for the ivesioids). We suggest that this may be an underestimated process in plant evolution, which could potentially explain at least some of the plant species richness observed today as well as the uneven distribution of certain species as compared to others that are closely related.
The phyloclimatic evolution of the ivesioids, inferred here, provides temporal and spatial support for the hypothesis that this group evolved in response to the late Tertiary development of dry conditions in western North America. The age of the MRCA of the clade (24.3-12.1 Ma; median 17.7 Ma) at Early-Middle Miocene coincides with the time when summer drought began in western North America. The hypothesis is further supported by the fact that the eastern parts of the Great Basin and the western slopes of the Rocky Mountains are inferred to have been the ancestral area of the clade. No other part of North America is strongly inferred to have had a suitable climate for the ancestor of this node; thus, migration into the Great Basin from areas not presently occupied by ivesioid species is unlikely.
A shift in pollination syndrome possibly led to diversification of the ivesioids at approximately 17.7 Ma. The resulting two clades experienced a westward range expansion from the foothills of the Rocky Mountains and the central Great Basin to the Sierra Nevada between 10.7-2.0 Ma, in clade A, and on at least two occasions between 12.3-4.5 Ma and 7.6-1.7 Ma in clade B. After a Mediterranean type of climate became established on the coast of California ∼10 Ma, several lineages crossed the Sierra Nevada and found new suitable habitats to exploit. Our results thus suggest that the evolution and current distribution of this morphologically aberrant and diverse group to a large extent has been influenced by past climate change.
Same molecular chronogram of Rosaceae as shown in figure 1, but also including species names. Maximum clade credibility tree obtained from 25000 post burn-in Bayesian chronograms generated in BEAST, with median branch lengths. Grey bars at nodes represent 95% Highest Posterior Densities of node ages. The red dots indicates age constraints used for the analysis; (1) The split between Rosales and Fabales was constrained to an age of 104–115 Ma based on a previous analysis , and (2) a Crataegites borealis fossil was used to set a conservative minimum age of 85.8 Ma on Rosaceae , . Subclades of Rosaceae were calibrated using fossil data from (3) Neviusia, 48.7 Ma , (4) Chamaebatiaria, 26.85 Ma , (5) Holodiscus, 34.1 Ma , (6) Spiraea, 48 Ma , (7) Rosa, 34.1 Ma , (8) Fragaria, 2.5 Ma , (9) Potentilla 11.6 Ma . A uniform prior with a maximum age of 115 Ma was used for all calibration points. Also indicated are the tribes of Rosaceae (species highlighted in blue and yellow) as well as the ivesioid clade highlighted in red. Time scale from .
Phylogenetic tree of the ivesioid clade with all node numbers indicated. Nodes and tips are coloured according to the character state of precipitation in the coldest month, with lightest shades indicting the lowest precipitation values.
We are very grateful to Barbara Ertter for help with locating and identifying ivesioids in the field, gathering locality data from herbarium material, and for discussions during the initial stages of the project. We thank the UC Botanical garden at Berkeley and Gothenburg Botanical Garden for providing and caring for plant material used in this investigation. We also like to acknowledge Paul Valdes, University of Bristol, for his advice on use of the paleoclimate layers, Paula Töpel for preparing all graphics, and two anonymous reviewers for their suggestions for improving the manuscript. Support from the Gothenburg Bioinformatics Network (GOTBIN) is also gratefully acknowledged.
Conceived and designed the experiments: MT BE. Analyzed the data: MT AA CY. Wrote the paper: MT BE AA CY. Preformed the laboratory work: MT. Preformed the fieldwork: MT BE.
- 1. Andrew P, Hendry AP, Lohmann LG, Conti E, Cracraft J, et al. (2010) Evolutionary biology in biodiversity science, conservation, and policy: A call to action. Evolution 64-5: 1517–1528.
- 2. Minnich RA (2007) Climate, Paleoclimate, and Paleovegetation. In Terestrial vegetation of California. eds. AvMichael G. Barbour, Todd Keeler-Wolf, Allan A. Schoenherr. 3rd edition. University of California Press. London. England.
- 3. Zachos JC, Dickens GR, Zeebe RE (2008) An early Cenozoic perspective on greenhouse warming and carbon-cycle dynamics. Nature 451: 17. 279–283.
- 4. Jacobs DK, Haney TA, Louie KD (2004) Genes, diversity, and geologic process on the Pacific coast. Annual Review of Earth and Planetary Sciences 32: 601–52.
- 5. Pierrehumbert RT (2002) The hydrologic cycle in deep-time climate problems. Nature 419: 191–198.
- 6. Evans MEK, Hearn DJ, Hahn WJ, Spangle JM, Venable DJ (2005) Climate and life-history evolution in evening primroses (Oenothera, Onagraceae): a phylogenetic comparative analysis. Evolution 59: 1914–1927.
- 7. Evans MEK, Smith SA, Flynn RS, Donoghue MJ (2009) Climate, Niche Evolution, and Diversification of the “Bird-Cage” Evening Primerose (Oenothera, Sections Anogra and Kleiniana). The American Naturalist 173: 225–240.
- 8. Töpel M, Lundberg M, Eriksson T, Eriksen B (2011) Molecular data and ploidal levels indicate several putative allopolyploidization events in the genus Potentilla (Rosaceae). PLOS Currents: Tree of life 16: 3 RRN1237.
- 9. Dobeš C, Paule J (2010) A comprehensive chloroplast DNA-based phylogeny of the genus Potentilla (Rosaceae): Implications for its geographic origin, phylogeography and generic circumscription. Molecular Phylogenetics and Evolution 56: 1, 156–175.
- 10. Ertter B (1993) The Jepson Manual: Higher Plants of California. ed. Hickman J. C. University of California Press. Berkeley.
- 11. Eriksson T, Donoghue MJ, Hibbs MS (1998) Phylogenetic analysis of Potentilla using DNA sequences of nuclear ribosomal internal transcribed spacers (ITS), and implications for the classification of Rosoideae (Rosaceae). Plant Systematematics and Evolution 211: 155–179.
- 12. Eriksson T, Hibbs MS, Yoder AD, Delwiche CF, Donoghue MJ (2003) The Phylogeny of Rosoideae (Rosaceae) Based on Sequences of the Internal Transcribed Spacers (ITS) of Nuclear Ribosomal DNA and the trnL/F Region of Chloroplast DNA. International Journal of Plant Sciences 164: 197–211.
- 13. Ertter B (1989) Revisionary studies in Ivesia (Rosaceae: Potentilleae). Systematic Botany 14(2): 231–244.
- 14. Ertter B (1993) A re-evaluation of the Horkelia bolanderi (Roseacea) complex, with the new species Horkelia yadonii. Systematic botany 18(1): 137–144.
- 15. Graham CH, Ron SR, Santos JC, Schneider CJ, Moritz C (2004) Integrating phylogenetics and environmental niche models to explore speciation mechanisms in dendrobatid frogs. Evolution 58: 1781–1793.
- 16. Hilbert DW, Bradford M, Parker T, Westcott DA (2004) Golden bowerbird (Prionodura newtonia) habitat in past, present and futur climates: Predicted extinction of a vertebrate in tropical highlands due to global warming. Biological Conservation 116: 367–377.
- 17. Hugall A, Moritz C, Moussalli A, Stanisic J (2002) Reconciling paleodistribution models and comparative phylogeography in the wet tropics rainforest land snail Gnarosophia bellendenkere (Brazier 1875). PNAS 99: 6112–6117.
- 18. Peterson AT, Martinez-Meyer E, Gonzalez-Salazar C (2004) Reconstructing the Pleistocene geography of the Aphelocoma jay (Corvidae). Diversity and Distributions 10: 237–246.
- 19. Yesson C, Culham A (2006) Phyloclimatic Modeling: Combining Phylogenetics and Bioclimatic Modeling. Systematic Biology 55: 785–802.
- 20. Nix HA (1986) A biogeographic analysis of Australian Elapid snakes. Pages 4–15 in Australian flora and fauna Series Number 7: Atlas of Elapid snakes of Australia (R. Longmore, ed.). Australian Government Publishing Service, Canberra.
- 21. Crisp MD, Arroyo MTK, Cook LG, Gandolfo MA, Jordan GJ, et al. (2009) Phylogenetic biome conservatism on a global scale. Nature 458: 754–756.
- 22. Hayashi K, Yoshida S, Kato H, Utech FH, Whigham DF, et al. (1998) Molecular Systematics of the Genus Uvularia and Selected Liliales Based upon mat K and rbc L Gene Sequence Data. Plant Species Biology 13: 129–146.
- 23. Li J, Huang H, Sang T (2002) Molecular Phylogeny and Infrageneric Classification of Actinidia (Actinidiaceae). Systematic Botany 27: 408–415.
- 24. Taberlet P, Gielly L, Pautou G, Bouvet J (1991) Universal primers for amplification of three non-coding regions of chloroplast DNA. Plant Molecular Biology 17: 1105–1109.
- 25. Katoh K, Kuma K, Toh H, Miyata T (2005) MAFFT version 5: improvement in accuracy of multiple sequence alignment. Nucleic Acids Research 33: 511–518.
- 26. Swofford DL (2003) PAUP*. Phylogenetic Analysis Using Parsimony (*and Other Methods), Version 4. Sunderland, MA: Sinauer Associates.
- 27. Drummond AJ, Rambaut A (2007) BEAST: Bayesian evolutionary analysis by sampling trees. BMC Evolutionary Biology 7: 214.
- 28. Suchard MA, Rambaut A (2009) Many-core algorithms for statistical phylogenetics. Bioinformatics 25: 11, 1370–1376.
- 29. Nylander JAA (2004) MrAIC.pl. Program distributed by the author. Evolutionary Biology Centre, Uppsala University.
- 30. Rambaut A, Drummond AJ (2007) Tracer v1.4, Available from http://beast.bio.ed.ac.uk/Tracer
- 31. Wang H, Moore MJ, Soltis PS, Bell CD, Brockington SF, Alexandre R, Davis CC, Latvis M, Manchester SR, Soltis DE (2009) Rosid radiation and the rapid rise of angiosperm-dominated forests. PNAS 106: 3853–3858.
- 32. Samylina VA (1960) Angiosperm plants from the Lower Cretaceous deposits of the Kolyma River. Botanicheskii Zhurnal 45: 335–352 [in Russian].
- 33. Herman AB (2002) Late early–Late Cretaceous floras of the North Pacific Region: florogenesis and early angiosperm invation. Review of Palaeobotany and Palynology 122: 1–11 [indirect reference to the stratigraphy of the area and age determination of marine deposits. Original paper published in Russian].
- 34. Kelley SP, Spicer RA, Herman AB (1999) New 40Ar/39Ar dates for Cretaceous Chauna Group tephra, north-eastern Russia, and their implications for the geologic history and floral evolution of the North Pacific region. Cretaceous Research 20: 97–106.
- 35. Wehr WC, Hopkins DQ (1994) The Eocene orchards and gardens of Republic, Washington. Washington Geology 22: 27–34.
- 36. Mathews WH (1964) Potassium-Argon age determination of Cenozoic volcanic rocks from British Columbia. Geological Society of America Bulletin 75: 465–468.
- 37. Schorn HE (1998) Holodiscus lisii (Rosaceae): a new species of ocean spray from the late Eocene Florissant Formation, Colorado, USA. PaleoBios 18: 21–24.
- 38. Meyer HW (2003) The fossils of Florissant. Smithsonian Books, Washington DC.
- 39. MacIntosh WC, Chapin CE (2004) Geochronology of the central Colorado volcanic field. New Mexico Bureau of Geology & Mineral Resources Bulletin 160: 205–237.
- 40. Axelrod DI (1987) The late Oligocene Creede flora, Colorado. University of California Publications in Geological Sciences 130: 1–235.
- 41. Lanphere MA (2000) Duration of sedimentation of the Creede formation from 40Ar/39Ar ages. Geological Society of America, special paper 346: 71–76.
- 42. Mai DH (2001) Die mittelmiozaenen und obermiozaenen Floren aus der Meuroer und Raunoer Folge in der Lausitz. III. Fundstellen und Palaeobiologie. Palaeontographica Abteilung B 258: 1–85.
- 43. Standke G, Rasher J, Strauss C (1993) Relative-sea level fluctuations and brown coal formation around the Early–Middle Miocene boundary in the Lusatian brown coal district. Geologische Rundschau 82: 295–305.
- 44. Wolfe JA, Schorn HR (1990) Taxonomic revision of the Spermatopsida of the Oligocene Creede flora, southern Colorado. United States Geological Survey Bulletin 1923: 1–40.
- 45. Matthews JV, Ovenden LE (1990) Late Tertiary plant microfossils from localities in arctic/subarctic North America: a review of the data. Arctic 43: 364–392.
- 46. Matthews JV, Westgate JA, Ovenden LE, Carter LD, Fouch T (2003) Stratigraphy, fossils, and age of sediments at the upper pit of the Lost Chicken gold mine: new information on the late Pliocene environment of east central Alaska. Quaternary Research 60: 9–18.
- 47. Beaumont LJ, Hughes L, Poulsen M (2005) Predicting species distributions: use of climatic parameters in BIOCLIM and its impact on predictions of species' current and future distributions. Ecological Modelling 186: 250–269.
- 48. Thuiller W, Broennimann O, Hughes G, Alkemade JRM, Midgley GF, et al. (2006) Vulnerability of African mammals to anthropogenic climate change under conservative land transformation assumptions. Global Change Biology 12: 424–440.
- 49. Hanley JA, McNeil BJ (1982) The meaning and use of the area under a receiver operating characteristic (ROC) curve. Radiology 143: 29–36.
- 50. Fielding AH, Bell JF (1997) A review of methods for the assessment of prediction errors in conservation presence/absence models. Environmental Conservation 24: 38–49.
- 51. R-Development-Core-Team (2009) R: A language and environment for statistical computing (R Foundation for Statistical Computing, Vienna, Austria).
- 52. Yesson C, Culham A (2011) Biogeography of Cyclamen: an application of phyloclimatic modeling. In Hodkinson T, Jones M, Waldren S, Parnell J (eds.) Climate change, ecology and systematics. Cambridge University Press. pp 265–279.
- 53. Paradis E, Claude J, Strimmer K (2004) APE: analyses of phylogenetics and evolution in R language. Bioinformatics 20: 289–290.
- 54. Schluter D, Price T, Mooers AO, Ludwig D (1997) Likelihood of ancestor states in adaptive radiation. Evolution 51: 1699–1711.
- 55. Heibl C (2009) phyloclim: Integrating phylogenetics and climatic niche modelling. R package version 0.0.1. (http://CRAN.R-project.org/package=phyloclim)
- 56. Potter D, Eriksson T, Evans RC, Oh S, Smedmark JEE, et al. (2007) Phylogeny and classification of Rosaceae. Plant Systematics and Evolution 266: 5–43.
- 57. Warren DL, Glor RE, Turelli M (2008) Environmental niche equivalency versus conservatism: quantitative approaches to niche evolution. Evolution 62: 2868–2883.
- 58. Hernandez PA, Graham CH, Master LL, Albert DL (2006) The effect of sample size and species characteristics on performance of different species distribution modeling methods. Ecography 29: 773–785.
- 59. Faegri K, van der Pijl L (1979) The Principles of Pollination Ecology, 3rd ed. William Clowes & Sons Limited. London.
- 60. Thompson JD, Lavergne S, Affre L, Gaudeul M, Debussche M (2005) Ecological differentiation of Mediterranean endemic plants. Taxon 54: 967–976.
- 61. Gradstein FM, Ogg JG, Smith AG, Agterberg FP, Bleeker W, et al.. (2004) A Geologic Time Scale 2004. Cambridge University Press, ∼500 pp.