Demographic History of European Populations of Arabidopsis thaliana

doi:10.1371/journal.pgen.1000075

Figure 1.

Bayesian clustering.

(A) Membership coefficients in K_max = 5 putative populations, computed using the average values over the 10 TESS runs with the smallest values of the deviance information criterion from a total of 100 runs. Similar results were obtained with other values of K_max from 4 to 10. (B) Interpolated membership coefficients in the three apparent subpopulations: western cluster, eastern cluster, and northern cluster.

More »

Expand

Figure 2.

Diversity regressed on geographic distance.

Correlation (R) map for the linear regression of expected heterozygosity on great circle distance. We used 300×180 points on a two-dimensional lattice covering Europe, and we computed distances from each lattice point considered as a potential source. The dots represent the centers of the 7 population samples used in the regression analysis.

More »

Expand

Figure 3.

Bayes factors.

The 4 demographic scenarios (Models A–D) and their associated Bayes factors. Model A is the model with constant population size, N₀. Model B is a model with an exponentially growing population size (present size, N₀, ancestral size, N₁, time since the onset of expansion, t₀). In Model C, the growth is exponential between two periods with constant size (present size, N₀, ancestral size, N₁, time since the onset of expansion, t₀, time since the end of expansion, t₁). Model D is similar to Model B, but it includes an ancient bottleneck before expansion. Variants of these 4 models, including variable mutation rates across loci, are considered here. The Bayes factors (top boxes) correspond to the ratio of the weight of evidence of each model to the weight of evidence of Model B. Two window sizes, δ_0.01 and δ_0.05, were used when computing the Bayes factors. These window sizes correspond to the 1% and 5% quantiles of the distance between the values of the summary statistics obtained under Model B and the observed values of the summary statistics. The Bayes factors were identical for the 2 window sizes and for values rounded for one decimal place, except for Model C, for which a minor difference was observed (1.8 for δ_0.05 instead of 1.9).

More »

Expand

Figure 4.

Onset and duration of the demographic expansion.

Plot of the joint posterior distribution for the time of onset of the expansion, t₀, and the length of the expansion, t₀−t₁. Computations were performed under demographic Model C, in which the population was initially constant, then grew exponentially until t₁, and then remained constant until the present. Percentages represent the cumulative probabilities under the density curve. The straight line indicates that the duration of expansion cannot be longer than the time elapsed since the onset of expansion.

More »

Expand

Frequency spectrum in actual and simulated data.

Minor allele frequency spectra of empirical data and data simulated under the best-fitting model of spatial range expansion. Population growth followed the logistic model within each deme (see text for the other parameter settings). The solid line (grey) corresponds to the neutral folded frequency spectrum. (A) The empirical folded spectrum was computed from the 648 inter-genic and non-coding sequences. (B) The simulated spectrum was computed using the same number of neutral nucleotides as in the data. In simulations, expansion started 9,000 years ago from a potential origin north of the Black Sea (48°N, 35°E). Other locations from a large region around this potential origin yielded very similar simulated spectra.

More »

Expand