Divergent and stabilizing selection shape the phenotypic space of Arabidopsis thaliana

Maria Stefania Przybylska; Cyrille Violle; Denis Vile; J. F. Scheepens; Denis Cornet; Gregory Beurier; Lauriane Rouan; Aurélien Estarague; Elena Kazakou; Lucie Mahaut; François Munoz; Detlef Weigel; Moises Exposito-Alonso; Oliver Bossdorf; Luis-Miguel Chevin; François Vasseur

doi:10.1371/journal.pbio.3003536

Abstract

Why do we observe some plant phenotypes but not others? The multivariate phenotypic space occupied by individuals or species often reveals both limits and phenotypes strikingly deviating from main syndromes. These observations are usually thought to indicate, respectively, inviable trait combinations and unique phenotypes adapted to specific environments. However, the evolutionary drivers underlying trait covariations often remain unclear. Here, we characterized the phenotypic space of Arabidopsis thaliana by comparing 713 wild accessions collected across the globe with 2,544 artificially-created recombinant individuals. This, combined with the detection of adaptive processes operating within species, allowed us to elucidate the roles of natural selection as a driver of phenotypic (co)variations within A. thaliana. We found that the phenotypic space of this species is constrained and driven by varying levels of divergent and stabilizing selection across different traits. Moreover, at the margins of the European geographic range, strong directional selection favored outlier phenotypes characterized by very late flowering and variation in a WRKY transcription factor gene. Genome analyses revealed that these extreme phenotypes may be explained by hybridization between ancestral and modern lineages of A. thaliana. Our findings demonstrate how interplays between population history and natural selection shape phenotypic diversity in a plant species.

Citation: Przybylska MS, Violle C, Vile D, Scheepens JF, Cornet D, Beurier G, et al. (2025) Divergent and stabilizing selection shape the phenotypic space of Arabidopsis thaliana. PLoS Biol 23(12): e3003536. https://doi.org/10.1371/journal.pbio.3003536

Academic Editor: Leonie C. Moyle, Indiana University, UNITED STATES OF AMERICA

Received: September 18, 2025; Accepted: November 14, 2025; Published: December 1, 2025

Copyright: © 2025 Przybylska et al. This is an open access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited.

Data Availability: The data and codes supporting the conclusions of this article can be found at https://doi.org/10.48579/PRO/3LQH1M. Genomic data are available at http://1001genomes.org/. Environmental data are available at https://www.chelsa-climate.org/datasets/chelsa_climatologies/ and https://files.isric.org/soilgrids/latest/data_aggregated/1000m/. Codes can also be found on GitHub: https://github.com/msprzybylska/Analysis_PhenSpace.

Funding: This work was supported by Agence Nationale de la Recherche (ANR; https://www.anr.fr) and Deutsche Forschungsgemeinschaft (DFG; https://www.dfg.de) funding as part of the AraBreed project (Grants ANR-17-CE02-0018-01, to CV, and SCHE 1899/2-1, to JFS). It was also supported by the European Research Council (ERC; https://erc.europa.eu) Starting Grant Project ‘PHENOVIGOUR’ (Grant ERC-StG-2020-949843, to FV). CV was supported by the European Research Council (ERC; https://erc.europa.eu) Starting Grant Project “Ecophysiological and biophysical constraints on domestication in crop plants” (Grant ERC-StG-2014-639706-CONSTRAINTS). The funders had no role in study design, data collection and analysis, decision to publish, or preparation of the manuscript.

Competing interests: The authors have declared that no competing interests exist.

Abbreviations: BSLMM, Bayesian Sparse Linear Mixed Model; CEFE, Centre d’Ecologie Fonctionnelle et Evolutive; GWA, Genome-Wide Association; LA, leaf area; LDMC, leaf dry matter content; LES, leaf economics spectrum; LNC, leaf nitrogen content; MAF, minor allele frequency; PCA, principal component analysis; SLA, specific leaf area; VPD, vapor pressure deficit

Introduction

Explaining the exceptional diversity of organismal forms and functions is a fundamental aim of biology. This diversity, though large, is bounded in different ways across taxa, such that some combinations of trait values within and between species are rarely observed [1–5]. For instance, recent evidence suggests that the remarkable phenotypic diversity of vascular plant species covers a multivariate phenotypic space that is significantly smaller than expected from null distributions [5]. This result suggests that trait (co)variation in vascular plants is not completely random, but rather shaped by various types of constraints (e.g., biophysical, genetic) and evolutionary processes. A key to understanding plant diversity is to investigate what shapes phenotypic (co)variation, which determines how phenotypes translate into functions and are targeted by natural selection [6].

Within species, directional selection drives phenotypic divergence between populations and can even promote the emergence of extreme phenotypes. In contrast, natural selection can also act by eliminating unfit genotypes and their associated trait combinations, thereby creating empty zones at the margins of the phenotypic space. Stabilizing selection is a pervasive force contributing to the absence of extreme phenotypes [7]. Most species are expected to undergo a combination of these two forms of selection: differences in phenotypic optima lead to divergent directional selection among populations, while stabilizing selection acts within populations once they reach their local optimum. The relevant question, therefore, concerns the relative magnitudes of these two components of selection in shaping phenotypic diversity [8]. By studying natural populations, we can only observe a limited fraction of a species’ potential diversity: the allelic combinations that have not been eliminated by chance or natural selection. A complementary approach is then to harness the standing genetic variation within a species, which can be substantial and harbor the potential to recreate phenotypes previously purged by selection. Producing recombinant individuals through controlled genetic crosses (F₂s) offers a powerful approach to experimentally generate this potential diversity and compare it to the realized diversity observed in wild populations.

Meiotic recombination leads to the reshuffling of genetically based trait variation within species. Consequently, recombinant individuals have randomly segregating parental alleles that, in contrast to wild lines, do not result from past evolutionary processes [9–11]. They can then be used to estimate a null distribution of phenotypes that can be applied to test the role of natural selection as a driver of trait evolution and diversity [11]. One can notably assess how, and to what extent, the phenotypic space formed by distinct genotypes departs from that of the offspring generated from multiple crosses between them (through a multivariate approach, e.g., [12]). If parental populations harbor a phenotype distribution that does not differ from the one expected by chance (i.e., similar to the distribution observed in F₂s), they have likely evolved without strong selective pressures [11]. If the opposite is true, the type(s) of selection that drove the phenotypes of the parental lines can be further investigated.

If F₂ phenotypes fall outside the parental phenotypic range, a phenomenon called transgressive segregation, parental lines must have diverged from their common ancestor relying on distinct combinations of alleles across different loci (i.e., not all “+” alleles in one parent and all “–” alleles in the other [13]). This condition, which is especially likely for polygenic traits [14,15], can be achieved either when parental phenotypes have evolved towards the same optimum or towards different optima [13]. However, transgressive segregation is thought to be more likely when the parents are not too phenotypically divergent [13,16], as it is otherwise less probable that the progeny will exceed the parental phenotypic range. Consequently, higher phenotypic variance in the progeny is thought to be more common under stabilizing selection combined with weak divergent selection. Conversely, strong directional selection that is divergent among populations should result in less transgressive segregation and a higher likelihood that the parental phenotypic range exceeds that of the F₂s. However, these predictions remain poorly tested empirically [15] (but see [17,18] on the role of genetic distance and phenotypic differentiation).

When directional selection is strong enough, it can lead to phenotypes significantly deviating from mean trait combinations and thus at the margins of the phenotypic space (hereafter phenotypic outliers). However, phenotypic outliers can also emerge because of nonadaptive processes due to, e.g., a lag in the response of certain populations to environmental changes [19]. For example, Arabidopsis thaliana has an evolutionary history marked by lineages that survived the Ice Age with minimal hybridization over time (relicts hereafter) and constitute apparent “outlier haplotypes” [20,21]. During the last glaciation, relict lineages were restricted to regions in southern Europe, from where the European continent was re-colonized after the Ice Age [21]. Most extant European populations originated from a genetic lineage that re-colonized all of Europe, likely from the Balkans, about 10,000 years ago [21]. Despite extensive hybridization that led to the loss of much of the relict genetic variation, a higher amount of relict introgression segments can still be found at both high and low latitudes in Europe (called refugia) [21]. One hypothesis for the maintenance of this genetic variation in extant populations is that it confers high adaptive value under specific environmental conditions [21–23]. However, it remains to be tested whether outlier haplotypes translate into outliers in the phenotypic space and whether natural selection can explain this pattern.

Here, we investigated to what extent selection has shaped vegetative and phenological trait (co)variations in 713 wild accessions of A. thaliana from a large geographic range. These accessions were grown in a common garden, together with 2,544 recombinant individuals produced by crossing 358 accessions that encompassed a broad range of genetic, geographic, and phenotypic variation [24]. We compared the realized phenotypic diversity within wild accessions to the potential phenotypic diversity arising from standing genetic variation through recombination. We specifically investigated (i) whether the phenotypic space of a widely distributed species is smaller than the overall distribution of recombinant individuals and whether this pattern varies across axes of phenotypic variation; (ii) the genetic bases of the phenotypic space and the relationship between phenotypic divergence in wild accessions and phenotypic segregation in the F₂s; and (iii) the evolutionary drivers of phenotypic outliers among wild accessions. Our results demonstrate how population history and different types of selection can jointly shape the phenotypic diversity of a widely distributed plant species.

Results

Wild accessions occupy a smaller phenotypic space than recombinant F₂s, but present phenotypic outliers

The phenotypic space of A. thaliana was assessed using the hypervolume approach [12,25] (Fig 1) on six traits measured on both F₂ individuals (F₂s) and wild accessions: flowering time; plant biomass; leaf area (LA), specific leaf area (SLA), leaf dry matter content (LDMC), and leaf nitrogen content (LNC). The selected traits encompass distinct plant strategies [5], including life-history traits (i.e., flowering time and plant biomass), a size-related trait (i.e., LA), and traits linked with the leaf economics spectrum (LES; i.e., SLA, LDMC, and LNC), which depicts a trade-off between resource acquisition and conservation [26]. We used scores of five significant principal component analysis (PCA) dimensions to quantify the hypervolumes of F₂s (i.e., potential phenotypic space) and wild accessions (i.e., realized phenotypic space). While controlling for differences in the sample sizes of F₂s and wild accessions, we found that the hypervolume of F₂s was over twice as large as that of wild accessions (Fig 2a), with about 42% overlap according to Jaccard’s index (S1 Fig). To describe trait variation within the phenotypic space of A. thaliana, we used the first three PCA dimensions, which together explained 92.9% of the variation (Fig 2b). PC1 explained 63.6% of the variation and largely reflected LES traits (i.e., SLA, LDMC, and LNC) and plant biomass (Fig 2c), whereas PC2 and PC3 explained 18.2% and 11.1% of the variation and were mostly associated with LA and flowering time, respectively (Fig 2c). The variances of both PC1 and PC2 scores were significantly larger for the F₂s compared with the wild accessions, but did not differ for PC3 (Breusch–Pagan test of homogeneity of variance: P < 0.05, P < 0.001, P = 0.24, respectively).

Download:

Fig 1. Analytical framework using the hypervolume approach.

We quantified the realized and potential phenotypic spaces of Arabidopsis thaliana (here exemplified by groups A, in red, and B, in blue, respectively) using hypervolumes [12,25]. Hypervolumes are described through size (volume) and overlap metrics. Size informs about the extent of phenotypic variation, which we compared between groups using volume distributions obtained by resampling both groups to an equal number of observations. Overlap parameters are used to assess dissimilarity of phenotypic space position and unique components between groups. Dissimilarity was quantified through Jaccard’s similarity index, which is the proportion of overlap defined as the intersection of volumes A and B divided by their union. Unique components are quantified through the subtraction of the union of volumes A and B by their intersection. Within the unique components of one of the analyzed groups, here exemplified by A, we investigated the identity of the genotypes they were composed of (genotypes “x”).

https://doi.org/10.1371/journal.pbio.3003536.g001

Download:

Fig 2. The phenotypic spaces of Arabidopsis thaliana F₂s vs. wild accessions.

The results shown correspond to one of the 10 hypervolume calculations. a) Hypervolume sizes (± standard errors), in units of standard deviation to power five (the number of trait dimensions used for hypervolume computation), for F₂s (blue) vs. wild accessions (red). b) Trait hypervolumes based on the first three principal component analysis (PCA) dimensions (with opaque dots representing observed data points, and semitransparent small dots representing uniformly distributed random points generated by the algorithm used for hypervolume computation [27]). Hypervolumes with size equivalent to the mean of the 100 hypervolumes generated by resampling per group (F₂s vs. accessions) are shown. c) Trait variation within the phenotypic spaces of A. thaliana F₂s (blue) and wild accessions (red) considering the first three PCA dimensions of trait hypervolumes. Hypervolume centroids, observed data points, and uniformly distributed random points generated by the algorithm used for hypervolume computation [27] are depicted by large circles, opaque dots, and semitransparent small dots, respectively. Hypervolumes with size equivalent to the mean of the 100 hypervolumes generated by resampling per group (F₂s vs. accessions) are shown. LA, leaf area; LDMC, leaf dry matter content; LNC, leaf nitrogen content; SLA, specific leaf area. The data underlying this figure can be found at: https://doi.org/10.48579/PRO/3LQH1M.

https://doi.org/10.1371/journal.pbio.3003536.g002

Not only did the phenotypic spaces of F₂s and wild accessions differ in volume, but they also differed in their overlap metrics. Their degree of overlap depended on the PCA dimensions: overlap was high on PC1 but lower on PC2 and PC3 (Figs 2b, 2c, and S2). This pattern was mainly driven by the presence of 36 genotypes with combinations of trait values beyond those observed in F₂s (Fig 2b and 2c, right and S1 Table). These were accessions forming the unique components of the accessions’ hypervolumes and hereafter referred to as “phenotypically unique accessions”. They were mainly characterized by very late flowering, the significance of which was confirmed by a sensitivity analysis. This analysis showed that removing flowering time led to a mean reduction of 66.6% of the volume of the phenotypic space of wild accessions compared to 57.3% of F₂s (S3 and S4 Figs). Moreover, this procedure led to the detection of only four phenotypically unique accessions, confirming the importance of phenology in generating extreme variation.

Because the 713 accessions used to compute the phenotypic space of wild accessions included only 254 of the 358 parental (G₀) accessions of the F₂s, along with 459 additional accessions, we had to rule out any bias in the differences between the hypervolumes of F₂s and accessions due to their genetic composition. We first found that the whole-genome genetic diversity (π) in the 254 G₀ accessions and the other 459 analyzed accessions was highly correlated (S5 Fig). Next, we directly compared the hypervolume of the 254 G₀ parental accessions with that of the F₂s and still found that the latter was larger than the former (S6 and S7 Figs). Finally, we imputed the phenotypes of the 104 G₀ accessions for which phenotypes had not been measured directly (358 minus 254 accessions) and found that F₂s still presented a larger hypervolume when including these extra phenotypes (S8 and S9 Figs). Despite a large variation in the accuracy of imputation for some analyzed traits, flowering time was the trait that had the most accurate imputation (S2 Table). 12 and 35 phenotypically unique accessions out of the 36 initially recorded were still detected in the analysis with 254 accessions and in the analysis with imputed phenotypes, respectively (S7 and S9 Figs). In the former case, the reduction in the number of phenotypically unique accessions (66.7%, i.e., 36–12) was comparable to the reduction in the number of analyzed accessions (64.4%, i.e., 713–254).

Divergent selection on traits influences the extent of the phenotypic space of wild accessions

To understand the evolutionary mechanisms underlying lower variation in LES traits (i.e., SLA, LDMC, and LNC) and higher variation in flowering time within wild accessions compared to F₂s, we explored the genetic bases of the phenotypic space of the parental accessions of the F₂s (G₀). First, we performed Genome-Wide Association (GWA) for each trait and found no significant peak of SNP association (S10 Fig). For plant biomass, only one SNP in a noncoding genomic region was significantly associated (S10 Fig). We then performed polygenic GWA (Bayesian Sparse Linear Mixed Model [BSLMM, 28]) and found that, consistent with moderate to high heritability (“PVE” parameter, S3 Table; from 30.0% for LA to 84.4% and 91.1% for flowering time and LNC, respectively), all analyzed traits were highly polygenic, i.e., explained by many SNPs of weak effect (“pi” parameter; S3 Table). We then tested the prediction that transgressive segregation for polygenic traits is more likely when the parents are not too phenotypically divergent (and have, thus, likely evolved under stabilizing selection combined with weak divergent selection). To this end, we first compared the degree of genetic variance of traits (Q_ST) to neutral genetic variance (F_ST) among G₀ accessions to test for stabilizing versus divergent selection shaping each of the six analyzed traits. We found that LA had very low Q_ST (0.003), falling below the range of significant F_ST values (Fig 3a). In contrast, SLA, LDMC, LNC, plant biomass, and flowering time exhibited progressively higher Q_ST values relative to the median of the F_ST distribution (0.25, 0.30, 0.34, 0.37, and 0.44, respectively; Fig 3a). However, the difference was only significant (Q_ST > F_ST) for life-history traits, i.e., plant biomass and flowering time (Fig 3a). For the traits with Q_ST within the range of significant F_ST (all traits except LA), we examined the relationship between the ratio of Q_ST to F_ST and the degree of transgressive segregation (the ratio of the coefficient of variation (CV) of each trait in the F₂s to the CV of the corresponding trait in their parents). We found higher transgressive segregation for traits with lower Q_ST–F_ST ratio, i.e., SLA, LDMC, and LNC (those associated with the LES; Fig 3b, left), and lower transgressive segregation for traits with higher Q_ST–F_ST ratio, i.e., flowering time and plant biomass (life-history traits; Fig 3b, left). This relationship was strong and significant (R² = 0.95, P < 0.01; Fig 3b, left). Finally, we investigated the relationship between polygenicity (“pi” parameter) and the degree of transgressive segregation for all traits and found a moderate positive correlation between the variables (R² = 0.69, P < 0.05; Fig 3b, right).

Download:

Fig 3. Phenotypic divergence in wild accessions of Arabidopsis thaliana and its relationship with the degree of transgressive segregation in F₂s.

a) Genetic variance of traits (Q_ST values, colored lines) and their credible interval (colored area) are compared with the neutral F_ST value (black solid line), which corresponds to the median of the distribution of significant F_ST values. The significance threshold was set at the 95th percentile of a null F_ST distribution (black dashed line), above which F_ST values were considered significant. b) Relationship between the genetic variance of traits (Q_ST–F_ST ratio; all traits except LA, which did not show Q_ST within the range of significant F_ST) and the degree of transgressive segregation (left), and between polygenicity (“pi” parameter) and the degree of transgressive segregation (right). Transgressiveness was measured as the ratio between the coefficient of variation (CV) of the traits in the F₂s (CV F₂s) and the CV of the traits in their parents (G₀, CV Acc). Lines were fitted with Standardized Major Axis (SMA) regressions, and R² denotes the coefficient of determination. LA, leaf area; LDMC, leaf dry matter content; LNC, leaf nitrogen content; SLA, specific leaf area. The data underlying this figure can be found at: https://doi.org/10.48579/PRO/3LQH1M and http://1001genomes.org/.

https://doi.org/10.1371/journal.pbio.3003536.g003

Strong directional selection explains the presence of phenotypic outliers that are characterized by relict introgression segments

Since directional selection was strong for flowering time, we tested its role as a potential evolutionary driver of A. thaliana phenotypic outliers through further genomic and environmental analyses. We first found that phenotypically unique accessions were non-relicts from either Sweden (27 accessions mainly from South Sweden) or Spain (9 highland accessions mainly from North Spain, Fig 4a and S1 Table). Next, we identified SNPs that significantly differentiated these unique accessions from accessions with phenotypic syndromes that were common in the studied groups (Fig 4b). The most prominent peak of diagnostic SNPs for the phenotypically unique accessions was in chromosome 3 and included variants in a WRKY transcription factor gene that has been linked to senescence and stress response (ATWRKY58 [29], Fig 4b). We also found that ATWRKY58 coincided with an increase in Tajima’s D values compared to the surrounding genomic sequences (Fig 4c). Finally, based on the environmental analyses, the variables vapor pressure deficit (VPD), mean daily temperature, and sand content were shown to play major roles in explaining the differences between phenotypically unique and common accessions (Fig 4d). They were the variables that mostly decreased accuracy and Gini index in the random forest models (mean OOB = 0.32) and that were selected in the feature selection (Boruta) analysis (Fig 4d). VPD and mean daily temperature were significantly lower for phenotypically unique relative to common accessions (P < 0.001 and P < 0.01, respectively; S11 Fig). Soil sand content, in turn, was significantly higher for phenotypically unique relative to common accessions (P < 0.01, S11 Fig).

Download:

Fig 4. Characterization of phenotypically unique accessions.

a) Map of the geographic origin of phenotypically unique accessions, showing those from Sweden (orange) and from Spain (purple). The map was created with data from Natural Earth via the R package rnaturalearth [30] (basemap shapefile: https://www.naturalearthdata.com/downloads/50m-cultural-vectors/). b) Manhattan plot showing SNPs that were significantly associated with the phenotypically unique vs. common accession classification. A SNP corresponding to the ATWRKY58 gene, which was found to underlie the most prominent peak of diagnostic SNPs, is highlighted. The red line represents the significance threshold at P ≤ 0.05 with Bonferroni correction. c) Tajima’s D values were calculated every 100-bp window around the SNP that corresponded to the ATWRKY58 gene (±10,000 bp). d) Relevance of different environmental variables in explaining the phenotypically unique vs. common accessions classification. Mean Decrease Accuracy (top) and Mean Decrease Gini (bottom) of the random forest models are presented. Asterisks represent selected variables through feature selection (Boruta) analysis. Bdod, bulk density; cec, cation exchange capacity; clay, proportion of clay particles; cmi, climate moisture indices; hurs, near-surface relative humidity; maxmin, the difference between mean daily maximum and minimum air temperature; N, total nitrogen content; pet, potential evapotranspiration; pHH₂O, soil pH; pr, precipitation amount; rsds, surface downwelling shortwave flux in air; sand, proportion of sand particles; sfcWind, near-surface wind speed; silt, proportion of silt particles; soc, soil organic carbon content; tas, mean daily air temperature; vpd, vapor pressure deficit; wv10, volumetric water content at 10 kPa; wv33, volumetric water content at 33 kPa; wv1500, volumetric water content at 1,500 kPa. The data underlying this figure can be found at: https://doi.org/10.48579/PRO/3LQH1M, http://1001genomes.org/, https://www.chelsa-climate.org/datasets/chelsa_climatologies/, and https://files.isric.org/soilgrids/latest/data_aggregated/1000m/.

https://doi.org/10.1371/journal.pbio.3003536.g004

Using published data [21], we found relict introgression segments in phenotypically unique accessions. Spanish phenotypically unique accessions, from locations close to the oldest relict refugia [20], had more introgressions than Swedish phenotypically unique accessions (mean ± SE: 163.00 ± 17.93 and 74.52 ± 3.97, respectively) and presented an intermediate number of relict haplotypes between Spanish relicts (geographically closer to Spanish phenotypically unique accessions and possessing high frequency of relict haplotypes, mean ± SE: 737.23 ± 34.43) and French accessions (phenotypically common accessions used before as a non-relict reference due to their very low level of relict haplotypes [21], 52.84 ± 2.37, S12 Fig). We confirmed the importance of relict introgressions in phenotypically unique accessions by comparing vegetative, phenological, and reproductive traits between phenotypically unique, Spanish relict, and French accessions. We found that phenotypically unique accessions were more similar to Spanish relicts than to French accessions, mostly regarding plant biomass, but also LA and LNC (S13a Fig). For flowering time, although phenotypically unique accessions had longer flowering time than both Spanish relicts and French accessions, this difference was smaller relative to the former than to the latter (S13b Fig). Phenotypically unique accessions also generally had lower reproductive performance, either with smaller fruit length or reduced fertility compared to French accessions, but not to relicts (S13c Fig).

Discussion

Here, we combined phenotypic, genomic, environmental, and biogeographic information about A. thaliana to cast light on the evolutionary determinants that shaped its phenotypic diversity. Our results revealed that wild accessions of A. thaliana generally occupied a reduced phenotypic space compared to F₂ individuals (F₂s), suggesting that the phenotypic space of A. thaliana is constrained. However, at the margins of the European geographic range, strong directional selection acted by favoring phenotypic outliers with relict genetic variation.

The genetic mechanisms that may explain F₂ phenotypes falling outside the phenotypic range of parental populations rely on the phenomenon of transgressive segregation [13]. In accordance with previous predictions [13,16], we found that, for highly polygenic traits such as the six traits analyzed here, the degree of transgressive segregation decreases with the strength of divergent selection (estimated through Q_ST–F_ST ratio). Accordingly, lower differentiation in parental accessions was shown to be associated with higher transgressive segregation. Furthermore, polygenicity was positively associated with the degree of transgressiveness, corroborating this type of genetic architecture as key for the emergence of transgressive segregation [15]. Taken together, these observations explain why relatively more polygenic traits for which accessions had genetic variance not significantly higher than the neutral expectation (such as traits that underlay the main axis of variation of the phenotypic space of A. thaliana: LES, i.e., SLA, LDMC, and LNC) would more likely fall within the phenotypic range of the F₂s. Conversely, relatively less polygenic and highly differentiated life-history traits (among accessions), such as flowering time, would more likely fall outside the phenotypic range of the F₂s, as observed here.

In line with our findings, flowering time has been shown to be under pervasive directional selection in A. thaliana [31–34], while LES traits have been found to be differentiated among wild accessions but without a significant signal of directional selection [35]. Beyond these contrasting patterns, plant biomass showed a more nuanced trend: whereas it was associated with the LES dimension (PC1) of the phenotypic space of A. thaliana, the Q_ST–F_ST analysis suggested that it is under directional selection and intermediary in the gradient of phenotypic differentiation of parental lines versus transgressive segregation of the progeny. This is consistent with previous findings showing that plant biomass in A. thaliana can be correlated with both flowering time and LES traits [35,36] and suggests that variation in this trait is shaped by a combination of different forces: diverging and stabilizing because of its link with phenology and LES traits, respectively.

Stabilizing selection acting on phenotypic variation is inherently difficult to detect [8]. This is likely due to the fact that, in nature, fluctuating environments keep populations away from the fitness optimum [37]. To circumvent this difficulty, comparing the phenotypic distribution of F₂s versus their parents is a powerful approach, as higher phenotypic variance in the F₂s indicates forces limiting the phenotypic range of the parental lines [11]. However, among the traits explaining the major axes of variation in the phenotypic space of A. thaliana, we could confirm through Q_ST–F_ST analyses the potential action of stabilizing selection on only one trait: LA (though the F_ST values for this trait fell below the chosen significance threshold). Another mechanism that can also explain a part of the phenotypic variation between wild accessions and F₂s is overdominance (wherein recessive deleterious mutations are masked at loci that are divergent between parental populations due to heterozygosity [38]), as approximately half of the genome of F₂s is heterozygous. However, its effect is primarily on mean values (overlap metrics), as overdominance is not expected to contribute to variance [38] (volume and PC variance). Moreover, it is unlikely that there is much selection on heterozygous allelic combinations in A. thaliana due to its selfing nature [39]. Finally, it is important to highlight that genetic constraints (i.e., linkage disequilibrium and pleiotropy) may interact with stabilizing selection and divergent selection to explain the observed limits to phenotypic variation, since trait correlations were generally the same for parental accessions and F₂s (S14 Fig). Pleiotropic alleles have been previously found to underlie leaf economics traits in wild accessions of A. thaliana [23]. However, further research is required to clarify the interplay between natural selection and genetic constraints acting on multiple traits within this plant species.

Flowering time was found to be under strong directional selection, promoting phenotypic outliers in the phenotypic space of A. thaliana characterized by very late flowering. Most studies have revealed pervasive selection for earlier, rather than delayed flowering—likely as a response to climate change [31–34]. Studies demonstrating selection for late flowering have revealed that it may be driven by cold conditions [31,40–43] and late frost [44]. In accordance with these results, we identified some A. thaliana genotypes (phenotypically unique accessions) that possibly resulted from directional selection for very late flowering in cold and humid regions with high latitude (Sweden) and high altitude (Spain). We also found that balancing selection (positive Tajima’s D) seems to underlie the gene that was mostly linked with the extreme phenotype of phenotypically unique accessions, i.e., a WRKY transcription factor gene (ATWRKY58). This may indicate that variation in this gene is key for the adjustment of flowering time to changing conditions in A. thaliana. WRKY transcription factor genes have been previously associated with response to stress in A. thaliana [45], and their role in regulating senescence and the response to abiotic and biotic factors has been described across various plant species [46]. However, only more recently have the effects of the WRKY family on flowering time been highlighted [47–49]. For instance, distinct genes in this family influence flowering differently by either advancing or delaying it through hormone-induced pathways [50].

Beyond pinpointing specific genetic variation that underpins phenotypic outliers, we found that phenotypically unique accessions were not relicts stricto sensu, but that their genomes had introgressed chunks of relict genomes. This is in accordance with their very different geographic origins (Sweden or Spain), which are linked with the evolutionary history of A. thaliana and the presence of relict variation in refugia across both high and low latitudes in Europe [21,51]. Moreover, phenotypically unique accessions presented life-history (i.e., plant biomass and flowering time) and reproductive traits (i.e., fruit length and fertility) that were more similar to relicts than to French accessions. Previous studies have already shown that relictual genetic variation was associated with life-history transitions, such as flowering and senescence [23,52]. However, little is known about how these transitions relate to functional traits and fitness [43,53].

Also analyzing a Swedish genotype of A. thaliana, Postma and Ågren [43] showed that larger rosette size led to later flowering and was generally positively correlated with survival and the number of fruits produced per plant. Although the phenotypically unique accessions found here had high plant biomass, they generally presented low fertility relative to phenotypically common accessions. This is in accordance with previous studies that found late-flowering genotypes to have lower seed production than early-flowering genotypes [54,55]. However, it is possible that phenotypically unique accessions would exhibit higher fertility under colder growth conditions (such as those in Postma and Ågren’s [43] experimental setup), a hypothesis that can be tested in future studies. All these contrasting results underscore the fact that genotype-phenotype relationships and their fitness consequences are context-dependent [31,56]. We therefore hypothesize that humid and cold conditions, along with a sandy soil texture, could modulate the expression of regulatory genes that indirectly impact flowering transition to allow the maintenance of phenotypic outlier accessions.

Conclusions

We tested whether the phenotypic space of a widely distributed plant species is constrained by natural selection, and whether and how phenotypic outliers have emerged within this space. We revealed that the phenotypic space of thousands of genetic crosses (F₂s) was significantly larger than that of wild accessions of A. thaliana. This analysis, combined with the investigation of differentiation patterns within accessions, suggested that the phenotypic space of this species is constrained and that both divergent and stabilizing selection influence its extent. Despite these limits, phenotypic outliers emerged at the margins of the species’ distribution in Europe, in admixed populations with introgressed relict haplotypes and genetic variation that probably evolved by strong directional selection in response to particular environments. As the evolutionary history of A. thaliana has demonstrated, relict genomic variation may be crucial for its adaptation to climate change, and we advocate for further investigation of populations with relict introgressions. Understanding the drivers of trait (co)variation and the conditions for local adaptation is imperative if we want to predict the impacts of climate change on plant phenotypes and ecology.

Materials and methods

Plant material

We used two sets of A. thaliana genotypes, based on an initial selection of accessions covering a wide range of genetic, geographic, and phenotypic variation, thereby preventing ascertainment bias (a general caveat of trait-based tests of selection [11]). The final genotype sets included:

(i) 3,150 recombinant individuals (F₂s) from 350 crosses among 358 wild accessions (selected as explained above [24]), which we used to experimentally generate a null distribution of phenotypes (potential phenotypic space);
(ii) 713 wild accessions, including 254 parental (G₀) accessions of the F₂s and 459 additional accessions—selected from Exposito-Alonso and colleagues’ [51] list of accessions (filtered from the 1,001 Genomes Project [20] for maximizing geographic coverage and genetic diversity of A. thaliana)—, which we used to experimentally generate an observed distribution of phenotypes (realized phenotypic space).

Experimental setup

We conducted a completely randomized outdoor common garden experiment between February and July 2021 at the Centre d’Ecologie Fonctionnelle et Evolutive (CEFE), Montpellier, France. We sowed plants in pots of 0.08 L filled with a sterilized soil mixture of 50% river sand, 37.5% calcareous clay soil from the experimental field at CEFE, and 12.5% blond peat moss. A total of 3,150 genetically different F₂ individuals (F₂s) were sown at the beginning of the experiment by randomly selecting seeds from a pool of around 2,570 seeds per F₂ cross. These plants were randomly distributed across three spatial blocks, with no replicates per genotype. For wild accessions, we initially sowed one individual per accession per block (three replicates per genotype), for a total of 2,139 wild-accession individuals at the beginning of the experiment. Each block had an independent subirrigation system, and plants were irrigated three times per week until the beginning of flowering. Details of the setup have been described in Przybylska and colleagues [57].

Trait measurements

To assess both the potential (F₂s) and realized (wild accessions) phenotypic spaces of A. thaliana, we recorded six traits for each individual that reached the flowering stage (first flower at anthesis): one trait related to phenology (plant age at flowering, hereafter flowering time), four traits related to plant resource acquisition and conservation (LA, LDMC, LNC, and SLA), and one trait related to plant growth (rosette biomass, hereafter plant biomass). From the 2,139 individuals of wild accessions and the 3,150 F₂ individuals initially sown, 1,730 and 2,544, respectively, reached the flowering stage and could be measured.

Our trait measurements followed established protocols [57]. Briefly, after plants had been irrigated for at least 2 h to ensure leaf rehydration [58,59], we selected a fully developed leaf exposed to sunlight for leaf trait measurement [59]. We determined LA through image analysis, using ImageJ 1.53k software [60], and estimated LNC through near infrared spectroscopy, according to methods described before [57,61]. Leaves that were smaller than the area of the spectrometer probe could not be analyzed. As such leaves represented less than 10% of the data, we conducted “NA” imputation using the R package missMDA [62]. Leaf and rosette dry weights were recorded after subjecting the samples to 60 °C for 3 days. Flowering time was converted to growing-degree days as the daily cumulation of Celsius degrees (°C) from sowing until the flowering date (for details, see [57]).

Multivariate analyses

Before computing the phenotypic space of A. thaliana, we had to account for the significant block effects on trait variation. To do this similarly for both groups (F₂s, without genotype replication, and wild accessions, with genotype replication), we first calculated least-square estimates of the block effect on each trait of the accession group (the group for which genotype and block effects could be disentangled), and we took the estimate for the first block as a reference. Next, we calculated the estimate deviation of the two other blocks relative to the reference [63]. Finally, we corrected each trait value (for both F₂s and wild accessions) in the second and third blocks by their respective deviation to the first block. After this procedure, we calculated the average values of each trait for each wild accession. We log₁₀-transformed flowering time and SLA, and square root-transformed LNC and plant biomass to meet the assumptions of parametric analyses.

To compute the potential and realized phenotypic spaces of A. thaliana, we performed PCA on centered and standardized trait values of F₂s and wild accessions, respectively. Using scores of five significant PCA dimensions (according to a sequential Bonferroni procedure; R package ade4, testdim function [64]), we calculated separate hypervolumes (i.e., the parts of n-dimensional Euclidean spaces, composed of independent axes of variation and occupied by the observed phenotypic values) for the two types of plant material. Such hypervolumes can be characterized by geometrical metrics, such as size (volume) and overlap [12,25,27] (Fig 1), and have so far mainly been used in ecology to compare the phenotypic spaces of different species or groups of species [5,12,25,27]. Hypervolume sizes informed us about the extent of the potential and realized phenotypic spaces in A. thaliana. Overlap parameters, in turn, allowed us to assess dissimilarities (in terms of position) between the potential and realized phenotypic spaces, as well as to identify wild accessions with trait values not observed in F₂s (unique phenotypes at the margins of the realized phenotypic space, i.e., phenotypic outliers).

We calculated hypervolumes using the Support Vector Machine algorithm of the R package hypervolume, which is suitable for considering outliers during hypervolume estimation [27]. As differences in sample size can impact hypervolume comparisons [25], we resampled both F₂s and wild accessions to an equal number of 500 observations. This resampling was conducted 100 times using the hypervolume_resample function of the R package hypervolume [12,25,27]. This allowed us to obtain standard errors for the size (i.e., volume) of hypervolumes (based on 100 hypervolumes estimated for F₂s and another 100 estimated for wild accessions) and a distribution of overlap parameters (based on the 10,000 possible combinations of 100 hypervolumes of F₂s and wild accessions). The first overlap parameter obtained was Jaccard’s similarity index, which is defined as J (A, B) = |A∩B| / |A∪B|, with A and B equaling the hypervolumes in groups A and B, respectively, and || denoting a cardinal (size of a group, Fig 1). To put it simply, J (A, B) in our study was the proportion of the total phenotypic hypervolume that was shared between F₂s and wild accessions. The second overlap parameter was unique components, that is, the size of the nonoverlapping parts between hypervolumes (|A∪B|–|A∩B|, Fig 1). We used this parameter to detect unique phenotypes at the margins of the realized phenotypic space (i.e., phenotypic outliers). We used the function hypervolume_inclusion_test of the R package hypervolume [12,25,27] to test which accessions were present in each one of the 10,000 accessions’ unique components produced (Fig 1). We then selected the accessions that contributed to 95% of the entire set of accessions in unique components, here called phenotypically unique accessions. Because hypervolume delineation involves the generation of uniformly distributed random points in the surroundings of sets of data points [27], the results of the hypervolume calculations explained here may vary slightly between runs. Therefore, we repeated 10 times the calculation of the hypervolumes and overlap parameters. The final list of phenotypically unique accessions was defined based on the accessions that were present in at least six of the 10 generated lists of phenotypically unique accessions. Using this final set of accessions, we conducted further analyses to understand the molecular (genomic) and evolutionary mechanisms that could explain their unique functional traits (see the following three sections).

As mentioned in the “Plant material” section, 254 parental (G₀) accessions of the F₂s, along with 459 non-parental accessions, were phenotyped to generate the hypervolume of wild accessions, whereas 104 G₀ accessions were not phenotyped and were therefore missing in this analysis. This could generate systematic differences between the hypervolumes of accessions and F₂s. To ensure that it was not the case, we performed three additional analyses. First, we tested if the 254 analyzed G₀ accessions were systematically different from the other 459 accessions. For that, we compared the genetic diversity (π) between these two groups using VCFtools (0.1.16) [65]. For each group of genotypes, we assembled SNP data from the 1,001 Genomes Project [20] (http://1001genomes.org/), removing SNPs with lower than 5% minor allele frequency (MAF) and missing data over 5%. Then, we calculated π for every 10,000 bp window of the genome and verified the correlation of these values between groups of genotypes using the function sma of the R package smatr [66]. Second, we reran the hypervolume analyses described in the previous paragraph, this time including only the 254 G₀ accessions in the computation of the hypervolume of wild accessions. Third, using the genetic and phenotypic information of the 713 analyzed wild accessions, we imputed the phenotype of the 104 G₀ accessions for which phenotypes had not been measured directly. To this end, we first removed, using VCFtools (0.1.16) [65], SNPs with lower than 1% MAF and missing data over 5%, restricted to a list of the 817 (713 + 104) analyzed accessions. Then, using the output of a BSLMM performed with GEMMA 0.98.3 [28,67] and setting the default number of 1,000,000 iterations and 100,000 burn-in steps, we imputed the six missing traits of the 104 G₀ accessions. BSLMM computation and phenotypic imputation were conducted per trait and repeated 100 times. Imputed trait values were averaged per accession and incorporated into the phenotypic dataset of wild accessions. With this dataset now composed of six traits of 817 accessions versus 2,544 F₂s, we reran the hypervolume analyses described in the previous paragraph. To verify the accuracy of the performed phenotypic imputation, we randomly selected, 100 times, 91 accessions from the 713 for which we had trait measures (a proportion of imputed phenotypes that was equivalent to the 104 imputed phenotypes out of 817), and we imputed their values for the six analyzed traits using the same procedure described before, but restricted to the original list of 713 accessions. We averaged the trait imputations per accession and verified their correlation with the observed values using the function sma of the R package smatr [66].

Finally, because the PCA analysis indicated flowering time as the main trait underpinning phenotypically unique accessions, we performed a sensitivity analysis to test for the influence of this trait. We removed flowering time from the trait dataset and reran the PCA comparison and the hypervolume analyses of 713 accessions versus 2,544 F₂s, according to the same procedures described before.

Genomic analyses

To shed light on the possible types of selection acting on different traits in A. thaliana’s wild accessions, we explored the genetic bases of the phenotypic space of the 254 G₀ accessions. We first performed GWA for each of the six analyzed traits. To this end, we obtained A. thaliana’s SNPs from the 1,001 Genomes Project dataset [20] (http://1001genomes.org/) and used VCFtools (0.1.16) [65] to filter them according to a MAF of 5% and maximum missing data of 5%, restricted to a list of the 254 accessions here analyzed. This procedure returned a set of 546,179 SNPs, which was then used in PLINK 2.0 [68] and GEMMA 0.98.3 [67] to perform the GWA, while controlling for population structure. Next, we performed polygenic GWA. Applying the same parameters described above to filter SNPs, we used PLINK 2.0 [68] and GEMMA 0.98.3 [67] to associate them with each analyzed trait through a BSLMM [28]. We adopted the default number of iterations and burn-in steps (i.e., 1,000,000 and 100,000, respectively) and ran the BSLMM model 100 times for each trait separately, averaging the results across runs. While controlling for population structure [28], the BSLMM model estimates “chip heritability” (i.e., the proportion of the phenotypic variance explained by the analyzed set of SNPs, “PVE” hyperparameter) and the proportion of SNPs presenting a larger effect (which we used to infer polygenicity, “pi” hyperparameter).

Since all analyzed traits were polygenic, we tested if transgressive segregation is more likely when the parents are not too phenotypically divergent. For that, we used the classification of A. thaliana accessions in genetic groups from the 1,001 genomes dataset, excluding the “admixed” group [20] (http://1001genomes.org/), and calculated Q_ST for each analyzed trait as the among-group phenotypic variance divided by the total phenotypic variance. This analysis was performed for 210 G₀ accessions (as 44 belonged to the “admixed” group) by fitting multi-response generalized linear mixed models, which also generate 95% credible intervals around the Q_ST values (R package MCMCglmm [69]). We then performed Q_ST versus F_ST comparisons. To this end, we created a distribution of significant Weir and Cockerham’s F_ST calculated per intergenic SNP (125,413 significant SNPs out of 150,479 SNPs), using PLINK 1.9 [70], and defined its median as the neutral F_ST value against which Q_ST values were compared. We calculated F_ST values based on the group classification described above and assessed their significance by randomly permuting group labels 1,000 times to generate a null distribution. The 95th percentile of this distribution was defined as the threshold above which F_ST values were considered significant [35]. Finally, for each trait, we calculated the degree of transgressive segregation in the F₂ lines by dividing the CV of each trait in these lines by the CV of the corresponding trait in the G₀ accessions. We then regressed this ratio against the ratio of Q_ST to the neutral F_ST value and the polygenicity metric (“pi” hyperparameter). We performed regression analysis using the sma function from the smart R package [66].

To assess the genomic mechanisms that could explain the differences between phenotypically unique and common accessions, we used the whole pool of wild accessions to perform GWA for the phenotypically unique versus common accessions classification. For that, using VCFtools (0.1.16) [65], we first filtered SNPs according to a MAF of 1% and maximum missing data of 5%, restricted to a list of the 713 accessions here analyzed. This procedure returned a set of 1,330,670 SNPs, which was then used in GEMMA 0.98.3 [67] to perform the GWA, while controlling for population structure. To assess the underlying genes and their functions, we used The Arabidopsis Information Resource (TAIR, https://www.arabidopsis.org/). Finally, still applying the same parameters in VCFtools (0.1.16) [65], we calculated Tajima’s D metric every 100-bp window to test for signatures of selection.

Environmental analyses

To test if environmental variation explained the functional trait differences between phenotypically unique and common accessions, potentially leading support to natural selection as a driver of phenotypic outliers, we also analyzed the environments of origin of the wild accessions. For that, we extracted environmental variables from global climatic and pedologic layers using GPS coordinates, obtained from the 1,001 Genomes Project dataset [20] (http://1001genomes.org/), and the R package terra [71]. Climatic layers were obtained from CHELSA 2.1 climatologies across 1981–2010 [72,73], in 30 arc-second resolution. We extracted the following climatic variables, monthly predicted between October and June and averaged across these months (the period of the year in which A. thaliana generally grows and sets seeds): climate moisture indices, mean daily air temperature, near-surface relative humidity, near-surface wind speed, potential evapotranspiration, precipitation amount, surface downwelling shortwave flux in air, the difference between mean daily maximum and minimum air temperature, and VPD. Pedologic layers were obtained from ISRIC—World Soil Information [74], in 250 m aggregated to 1 km resolution, for 5 cm soil depth. We extracted the following soil variables: bulk density, cation exchange capacity, proportion of clay particles, proportion of sand particles, proportion of silt particles, total nitrogen content, soil organic carbon content, pH, volumetric water content at 10 kPa, volumetric water content at 33 kPa, and volumetric water content at 1,500 kPa. To evaluate the association of different environmental variables with phenotypically unique versus common accessions, we first centered and scaled all environmental variables. Since the classification “phenotypically unique vs. common” was highly unbalanced, we first resampled phenotypically common accessions 10,000 times to the sample size of phenotypically unique accessions and then conducted a random forest analysis on each set of data (using the R package randomForest [75], with ntree = 2,000). We computed mean variable importance and mean error rate of classification (out-of-bag estimate of error, OOB) across runs. To confirm the most relevant environmental variables for the phenotypically unique versus common classification, we used feature selection with each set of data (R package Boruta [76]), calculating the frequency of selection or rejection for each feature. We considered features as relevant when they were selected with a minimal frequency of 60%.

Extended comparisons for phenotypically unique versus common accessions

Phenotypically unique accessions did not belong to the relict genetic group (S1 Table), but this did not eliminate the possibility that their genomes have introgression segments from relicts. To test this possibility, we first examined published data of relict haplotypes among accessions of the 1,001 Genomes Project [20]. Then, we characterized trait similarity of phenotypically unique relative to Spanish relict (geographically closer to Spanish phenotypically unique accessions and possessing a high level of relict haplotypes [21]) and French common accessions (used before as a non-relict reference due to their very low level of relict haplotypes [21]). The country of origin and the genetic group (which defined the classification as relict or non-relict) of the accessions were obtained from the 1,001 Genomes Project dataset [20] (http://1001genomes.org/). To characterize trait similarity, we used not only the phenological and vegetative traits explained above, but also reproductive traits measured during the experiment: fruit (i.e., silique) length and number, inflorescence length, and seed mass. These traits were not measured in the same individuals analyzed for phenological and vegetative traits, because they needed to be assessed in a different phenological stage: fruit maturation (instead of flowering). Moreover, due to experimental constraints, we could not assess reproductive traits in every 713 accessions selected for the phenotypic space analyses. Because of that, we randomly selected 529 wild accessions from the initial pool, and we sowed them in three additional replicates (529 accessions × 3 blocks [57]). Because of the random selection of accessions and the fact that 495 out of the 529 initial accessions reached fruit maturation and could be measured, only 17 phenotypically unique accessions (14 from Sweden and 3 from Spain) were included in this sampling. Traits were measured as described in Przybylska and colleagues [57]. Inflorescences were cut at their rosette base and then photographed in their entirety. Except for seed mass, we estimated all reproductive traits through image analysis using ImageJ 1.53k [60]. Fruit length was measured as the mean length of three randomly selected mature fruits per plant. Fertility was estimated through the product of mean fruit length and fruit number [77,78]. To measure seed dry mass, we dried inflorescence stems at ambient temperature and weighed around 30 seeds per plant for an estimation of individual seed mass.

Using all the traits available for the 495 accessions mentioned above (i.e., phenological, vegetative, and reproductive traits), we calculated least-square means with block as a random factor. We then compared trait values of phenotypically unique accessions with those of Spanish relicts and French common accessions, using nonparametric tests (Kruskal–Wallis and Dunn test with Holm’s correction for post hoc comparisons).

All statistical analyses were carried out in R 4.4.0 [79].

Supporting information

S1 Table. List of the phenotypically unique accessions.

Information about their genetic group and country of origin, extracted from the 1,001 Genomes Project dataset (http://1001genomes.org/), is provided.

https://doi.org/10.1371/journal.pbio.3003536.s001

(DOCX)

S2 Table. Accuracy of trait imputation for the 104 G₀ accessions for which phenotypes had not been measured directly.

R² denotes the coefficient of determination. LA, leaf area; LDMC, leaf dry matter content; LNC, leaf nitrogen content; SLA, specific leaf area.

https://doi.org/10.1371/journal.pbio.3003536.s002

(DOCX)

S3 Table. “Chip heritability” (“PVE”) and the proportion of SNPs presenting a larger effect (“pi”) on traits of wild accessions.

LA, leaf area; LDMC, leaf dry matter content; LNC, leaf nitrogen content; SLA, specific leaf area.

https://doi.org/10.1371/journal.pbio.3003536.s003

(DOCX)

S1 Fig. Jaccard’s index distribution for the comparison between hypervolumes obtained for F₂s and wild accessions.

The distribution is based on 10,000 comparisons between resampled hypervolumes for each group. The result shown corresponds to one of the 10 hypervolume calculations. The data underlying this figure can be found at: https://doi.org/10.48579/PRO/3LQH1M.

https://doi.org/10.1371/journal.pbio.3003536.s004

(TIFF)

S2 Fig. Trait variation within the phenotypic spaces of F₂s (blue) and wild accessions (red) of Arabidopsis thaliana, based on nine hypervolume recalculations.

Variation for the first three PCA dimensions of trait hypervolumes is presented. Hypervolume centroids, observed data points, and uniformly distributed random points generated by the algorithm used for hypervolume computation [27] are depicted by large circles, opaque dots, and semitransparent small dots, respectively. Hypervolumes with size equivalent to the mean of the 100 hypervolumes generated by resampling per group (F₂s versus accessions) are shown. The data underlying this figure can be found at: https://doi.org/10.48579/PRO/3LQH1M.

https://doi.org/10.1371/journal.pbio.3003536.s005

(ZIP)

S3 Fig. Hypervolume sizes for F₂s (blue) versus wild accessions (red) when excluding flowering time from the analyses.

Size is represented in units of standard deviation to power five (the number of trait dimensions used for hypervolume computation). The result shown corresponds to one of the 10 hypervolume calculations. Bars denote SE. The data underlying this figure can be found at: https://doi.org/10.48579/PRO/3LQH1M.

https://doi.org/10.1371/journal.pbio.3003536.s006

(TIFF)

S4 Fig. Trait variation within the phenotypic spaces of F₂s (blue) and wild accessions (red) of Arabidopsis thaliana, based on 10 hypervolume calculations without flowering time.

Variation for the first three PCA dimensions of trait hypervolumes is presented. Hypervolume centroids, observed data points, and uniformly distributed random points generated by the algorithm used for hypervolume computation [27] are depicted by large circles, opaque dots, and semitransparent small dots, respectively. Hypervolumes with size equivalent to the mean of the 100 hypervolumes generated by resampling per group (F₂s versus accessions) are shown. LA, leaf area; LDMC, leaf dry matter content; LNC, leaf nitrogen content; SLA, specific leaf area. The data underlying this figure can be found at: https://doi.org/10.48579/PRO/3LQH1M.

https://doi.org/10.1371/journal.pbio.3003536.s007

(ZIP)

S5 Fig. Correlation between the whole-genome genetic diversity (π) of the 254 parental (G₀) accessions of the F₂s and the other 459 accessions analyzed in this study.

Each point represents one genomic region. The line was fitted with Standardized Major Axis (SMA) regressions, and R² denotes the coefficient of determination. The data underlying this figure can be found at: https://doi.org/10.48579/PRO/3LQH1M and http://1001genomes.org/.

https://doi.org/10.1371/journal.pbio.3003536.s008

(TIFF)

S6 Fig. Hypervolume sizes for F₂s (blue) versus wild accessions (red) when analyzing only the 254 G₀ accessions that were directly phenotyped in this study.

Size is represented in units of standard deviation to power five (the number of trait dimensions used for hypervolume computation). The result shown corresponds to one of the 10 hypervolume calculations. Bars denote SE. The data underlying this figure can be found at: https://doi.org/10.48579/PRO/3LQH1M.

https://doi.org/10.1371/journal.pbio.3003536.s009

(TIFF)

S7 Fig. Trait variation within the phenotypic spaces of F₂s (blue) and the 254 G₀ accessions (red) of Arabidopsis thaliana, based on 10 hypervolume calculations.

Variation for the first three PCA dimensions of trait hypervolumes is presented. Hypervolume centroids, observed data points, and uniformly distributed random points generated by the algorithm used for hypervolume computation [27] are depicted by large circles, opaque dots, and semitransparent small dots, respectively. Hypervolumes with size equivalent to the mean of the 100 hypervolumes generated by resampling per group (F₂s versus accessions) are shown. LA, leaf area; LDMC, leaf dry matter content; LNC, leaf nitrogen content; SLA, specific leaf area. The data underlying this figure can be found at: https://doi.org/10.48579/PRO/3LQH1M.

https://doi.org/10.1371/journal.pbio.3003536.s010

(ZIP)

S8 Fig. Hypervolume sizes for F₂s (blue) versus wild accessions (red) when including in the analyses imputed phenotypes from 104 G₀ accessions lacking direct measurements.

Size is represented in units of standard deviation to power five (the number of trait dimensions used for hypervolume computation). The result shown corresponds to one of the 10 hypervolume calculations. Bars denote SE. The data underlying this figure can be found at: https://doi.org/10.48579/PRO/3LQH1M and http://1001genomes.org/.

https://doi.org/10.1371/journal.pbio.3003536.s011

(TIFF)

S9 Fig. Trait variation within the phenotypic spaces of F₂s (blue) and wild accessions (red) of Arabidopsis thaliana, based on 10 hypervolume calculations including imputed phenotypes from 104 G₀ accessions lacking direct measurements.

Variation for the first three PCA dimensions of trait hypervolumes is presented. Hypervolume centroids, observed data points, and uniformly distributed random points generated by the algorithm used for hypervolume computation [27] are depicted by large circles, opaque dots, and semitransparent small dots, respectively. Hypervolumes with size equivalent to the mean of the 100 hypervolumes generated by resampling per group (F₂s versus accessions) are shown. LA, leaf area; LDMC, leaf dry matter content; LNC, leaf nitrogen content; SLA, specific leaf area. The data underlying this figure can be found at: https://doi.org/10.48579/PRO/3LQH1M and http://1001genomes.org/.

https://doi.org/10.1371/journal.pbio.3003536.s012

(ZIP)

S10 Fig. Manhattan plots showing SNPs associated with each analyzed trait in the 254 G₀ accessions.

The red line represents the significance threshold at P ≤ 0.05 with Bonferroni correction. LA, leaf area; LDMC, leaf dry matter content; LNC, leaf nitrogen content; SLA, specific leaf area. The data underlying this figure can be found at: https://doi.org/10.48579/PRO/3LQH1M and http://1001genomes.org/.

https://doi.org/10.1371/journal.pbio.3003536.s013

(ZIP)

S11 Fig. Boxplots comparing environmental variables associated with phenotypically unique and common accessions.

Phenotypically unique accessions are depicted in red, and phenotypically common accessions, in gray. Only environmental variables that were selected through the feature selection (Boruta) analysis are presented. Vpd, vapor pressure deficit; tas, mean daily air temperature; sand, soil sand content. Kruskal–Wallis test: **P < 0.01, ***P < 0.001. The data underlying this figure can be found at: https://doi.org/10.48579/PRO/3LQH1M, https://www.chelsa-climate.org/datasets/chelsa_climatologies/, and https://files.isric.org/soilgrids/latest/data_aggregated/1000m.

https://doi.org/10.1371/journal.pbio.3003536.s014

(TIFF)

S12 Fig. Boxplots comparing the number of relict haplotypes between groups of genotypes.

The number of relict haplotypes was evaluated across Spanish phenotypically unique (SP), Iberic phenotypically common and non-relict (IBER), Spanish relict (REL), and French (FR) accessions. The number of accessions (N) in each group is highlighted. Note log₁₀ scale. Letters indicate pairwise significant differences from nonparametric Dunn test. Data from Lee and colleagues [21].

https://doi.org/10.1371/journal.pbio.3003536.s015

(TIFF)

S13 Fig. Boxplots comparing vegetative, phenological, and reproductive traits between groups of genotypes.

Eleven traits were evaluated across Swedish phenotypically unique (SW), Spanish phenotypically unique (SP), Spanish relict (REL), and French (FR) accessions: five vegetative (a), one phenological (b), and five reproductive (c) traits. The number of accessions (N) in each group for each comparison is highlighted. FT, flowering time; LA, leaf area; LDMC, leaf dry matter content; LNC, leaf nitrogen content; SLA, specific leaf area. Note log₁₀ scale for FT and SLA, and square root scale for LNC and plant biomass. Nonparametric Dunn test: *P ≤ 0.05, **P < 0.01, ***P < 0.001. Dots denote marginal significance, and “ns” denotes non-significance in a Kruskal–Wallis test. The data underlying this figure can be found at: https://doi.org/10.48579/PRO/3LQH1M.

https://doi.org/10.1371/journal.pbio.3003536.s016

(ZIP)

S14 Fig. Correlation circle of a dual multiple factor analysis (DMFA) with a “F₂s (blue) vs. wild accessions (red)” contrast as grouping variable.

This analysis was performed using the R package FactorMineR [80] on six traits. FT, flowering time; LA, leaf area; LDMC, leaf dry matter content; LNC, leaf nitrogen content; SLA, specific leaf area. The data underlying this figure can be found at: https://doi.org/10.48579/PRO/3LQH1M.

https://doi.org/10.1371/journal.pbio.3003536.s017

(TIFF)

Acknowledgments

We are grateful to the “Terrain d’expériences” platform at CEFE, namely to Thierry Mathieu, David Degueldre, Pauline Durbin, Fabien Lopez, Pierrick Aury, and Jean-Marc Donnay, for supporting the conception and execution of the experiment. We thank Xavier Le Roux for advice during the conception and execution of the experiment, and Benoit Lacombe and Thibaut Perez for LNC measurement (AQUI platform, Montpellier). We are grateful to Lou Sales-Mabily, Pierre Moulin, Robin Latapie, Annick Lucas, Rahma Kazi-Tani, Mariette Laumond, Lisa Perrier, Maeva Tremblay, Ana Elkaim, Damien de La Faye, Marie-Charlotte Bopp, Alvaro Delgado Tenllado, Elodie Certenais, Thibault Martino, Benoit Berthet, Léo Delalandre, and Alexis Bediee for help during the experiment. We thank Benjamin Blonder and Daniel Chen for assistance with the R package hypervolume, and Cheng-Ruei Lee for providing data on relict introgressions in A. thaliana accessions. We also thank Ingo Ebersberger (Applied Bioinformatics at Goethe University Frankfurt) for providing the compute server infrastructure for this project, Erik Diaz for valuable discussion on related topics, and Damien de La Faye for help with figure conception and drawing design.

References

1. Endler JA. Multiple-trait coevolution and environmental gradients in guppies. Trends Ecol Evol. 1995;10(1):22–9. pmid:21236940
- View Article
- PubMed/NCBI
- Google Scholar
2. Pigliucci M. Finding the way in phenotypic space: the origin and maintenance of constraints on organismal form. Ann Bot. 2007;100(3):433–8. pmid:17495983
- View Article
- PubMed/NCBI
- Google Scholar
3. Donovan LA, Maherali H, Caruso CM, Huber H, de Kroon H. The evolution of the worldwide leaf economics spectrum. Trends Ecol Evol. 2011;26(2):88–95. pmid:21196061
- View Article
- PubMed/NCBI
- Google Scholar
4. Olson ME. The developmental renaissance in adaptationism. Trends Ecol Evol. 2012;27(5):278–87. pmid:22326724
- View Article
- PubMed/NCBI
- Google Scholar
5. Díaz S, Kattge J, Cornelissen JHC, Wright IJ, Lavorel S, Dray S, et al. The global spectrum of plant form and function. Nature. 2016;529(7585):167–71. pmid:26700811
- View Article
- PubMed/NCBI
- Google Scholar
6. Pigliucci M, Preston K. Phenotypic integration: studying the ecology and evolution of complex phenotypes. Oxford: Oxford University Press; 2004.
7. Schlötterer C. Unraveling the molecular basis of stabilizing selection by experimental evolution. Genome Biol Evol. 2023;15(12):evad220. pmid:38092037
- View Article
- PubMed/NCBI
- Google Scholar
8. Kingsolver JG, Diamond SE. Phenotypic selection in natural populations: what limits directional selection? Am Nat. 2011;177(3):346–57. pmid:21460543
- View Article
- PubMed/NCBI
- Google Scholar
9. Melamed-Bessudo C, Shilo S, Levy AA. Meiotic recombination and genome evolution in plants. Curr Opin Plant Biol. 2016;30:82–7. pmid:26939088
- View Article
- PubMed/NCBI
- Google Scholar
10. Fernandes JB, Séguéla-Arnaud M, Larchevêque C, Lloyd AH, Mercier R. Unleashing meiotic crossovers in hybrid plants. Proc Natl Acad Sci U S A. 2018;115(10):2431–6. pmid:29183972
- View Article
- PubMed/NCBI
- Google Scholar
11. Fraser HB. Detecting selection with a genetic cross. Proc Natl Acad Sci U S A. 2020;117(36):22323–30. pmid:32848059
- View Article
- PubMed/NCBI
- Google Scholar
12. Blonder B, Lamanna C, Violle C, Enquist BJ. The n‐dimensional hypervolume. Glob Ecol Biogeogr. 2014;23(5):595–609.
- View Article
- Google Scholar
13. Rieseberg LH, Archer MA, Wayne RK. Transgressive segregation, adaptation and speciation. Heredity (Edinb). 1999;83(Pt 4):363–72. pmid:10583537
- View Article
- PubMed/NCBI
- Google Scholar
14. Thompson KA, Osmond MM, Schluter D. Parallel genetic evolution and speciation from standing variation. Evol Lett. 2019;3(2):129–41. pmid:31289688
- View Article
- PubMed/NCBI
- Google Scholar
15. Thompson KA, Brandvain Y, Coughlan JM, Delmore KE, Justen H, Linnen CR, et al. The ecology of hybrid incompatibilities. Cold Spring Harb Perspect Biol. 2024;16(9):a041440. pmid:38151331
- View Article
- PubMed/NCBI
- Google Scholar
16. Chevin L-M, Decorzent G, Lenormand T. Niche dimensionality and the genetics of ecological speciation. Evolution. 2014;68(5):1244–56. pmid:24410181
- View Article
- PubMed/NCBI
- Google Scholar
17. Stelkens RB, Schmid C, Selz O, Seehausen O. Phenotypic novelty in experimental hybrids is predicted by the genetic distance between species of cichlid fish. BMC Evol Biol. 2009;9:283. pmid:19961584
- View Article
- PubMed/NCBI
- Google Scholar
18. Stelkens R, Seehausen O. Genetic distance between species predicts novel trait expression in their hybrids. Evolution. 2009;63(4):884–97. pmid:19220450
- View Article
- PubMed/NCBI
- Google Scholar
19. Munoz F, Klausmeier CA, Gaüzère P, Kandlikar G, Litchman E, Mouquet N, et al. The ecological causes of functional distinctiveness in communities. Ecol Lett. 2023;26(8):1452–65. pmid:37322850
- View Article
- PubMed/NCBI
- Google Scholar
20. The 1001 Genomes Consortium. 1,135 genomes reveal the global pattern of polymorphism in Arabidopsis thaliana. Cell. 2016;166(2):481–91. pmid:27293186
- View Article
- PubMed/NCBI
- Google Scholar
21. Lee C-R, Svardal H, Farlow A, Exposito-Alonso M, Ding W, Novikova P, et al. On the post-glacial spread of human commensal Arabidopsis thaliana. Nat Commun. 2017;8:14458. pmid:28181519
- View Article
- PubMed/NCBI
- Google Scholar
22. Suarez-Gonzalez A, Lexer C, Cronk QCB. Adaptive introgression: a plant perspective. Biol Lett. 2018;14(3):20170688. pmid:29540564
- View Article
- PubMed/NCBI
- Google Scholar
23. Hanemian M, Vasseur F, Marchadier E, Gilbault E, Bresson J, Gy I, et al. Natural variation at FLM splicing has pleiotropic effects modulating ecological strategies in Arabidopsis thaliana. Nat Commun. 2020;11(1):4140. pmid:32811829
- View Article
- PubMed/NCBI
- Google Scholar
24. Vasseur F, Fouqueau L, de Vienne D, Nidelet T, Violle C, Weigel D. Nonlinear phenotypic variation uncovers the emergence of heterosis in Arabidopsis thaliana. PLoS Biol. 2019;17(4):e3000214. pmid:31017902
- View Article
- PubMed/NCBI
- Google Scholar
25. Blonder B. Hypervolume concepts in niche‐ and trait‐based ecology. Ecography. 2017;41(9):1441–55.
- View Article
- Google Scholar
26. Wright IJ, Reich PB, Westoby M, Ackerly DD, Baruch Z, Bongers F, et al. The worldwide leaf economics spectrum. Nature. 2004;428(6985):821–7. pmid:15103368
- View Article
- PubMed/NCBI
- Google Scholar
27. Blonder B, Morrow CB, Maitner B, Harris DJ, Lamanna C, Violle C, et al. New approaches for delineating n‐dimensional hypervolumes. Methods Ecol Evol. 2017;9(2):305–19.
- View Article
- Google Scholar
28. Zhou X, Carbonetto P, Stephens M. Polygenic modeling with Bayesian sparse linear mixed models. PLoS Genet. 2013;9(2):e1003264. pmid:23408905
- View Article
- PubMed/NCBI
- Google Scholar
29. Miao Y, Laun T, Zimmermann P, Zentgraf U. Targets of the WRKY53 transcription factor and its role during leaf senescence in Arabidopsis. Plant Mol Biol. 2004;55(6):853–67. pmid:15604721
- View Article
- PubMed/NCBI
- Google Scholar
30. Massicotte P, South A. rnaturalearth: world map data from Natural Earth. 2023. Available from: https://cran.r-project.org/web/packages/rnaturalearth/index.html
31. Exposito-Alonso M, Brennan AC, Alonso-Blanco C, Picó FX. Spatio-temporal variation in fitness responses to contrasting environments in Arabidopsis thaliana. Evolution. 2018;:10.1111/evo.13508. pmid:29947421
- View Article
- PubMed/NCBI
- Google Scholar
32. Samis KE, Stinchcombe JR, Murren CJ. Population climatic history predicts phenotypic responses in novel environments for Arabidopsis thaliana in North America. Am J Bot. 2019;106(8):1068–80. pmid:31364776
- View Article
- PubMed/NCBI
- Google Scholar
33. Fournier-Level A, Taylor MA, Paril JF, Martínez-Berdeja A, Stitzer MC, Cooper MD, et al. Adaptive significance of flowering time variation across natural seasonal environments in Arabidopsis thaliana. New Phytol. 2022;234(2):719–34. pmid:35090191
- View Article
- PubMed/NCBI
- Google Scholar
34. Méndez‐Vigo B, Castilla AR, Gómez R, Marcer A, Alonso‐Blanco C, Picó FX. Spatiotemporal dynamics of genetic variation at the quantitative and molecular levels within a natural Arabidopsis thaliana population. J Ecol. 2022;110(11):2701–16.
- View Article
- Google Scholar
35. Sartori K, Vasseur F, Violle C, Baron E, Gerard M, Rowe N, et al. Leaf economics and slow-fast adaptation across the geographic range of Arabidopsis thaliana. Sci Rep. 2019;9(1):10758. pmid:31341185
- View Article
- PubMed/NCBI
- Google Scholar
36. Vasseur F, Violle C, Enquist BJ, Granier C, Vile D. A common genetic basis to the origin of the leaf economics spectrum and metabolic scaling allometry. Ecol Lett. 2012;15(10):1149–57. pmid:22856883
- View Article
- PubMed/NCBI
- Google Scholar
37. de Villemereuil P, Charmantier A, Arlt D, Bize P, Brekke P, Brouwer L, et al. Fluctuating optimum and temporally variable selection on breeding date in birds and mammals. Proc Natl Acad Sci U S A. 2020;117(50):31969–78. pmid:33257553
- View Article
- PubMed/NCBI
- Google Scholar
38. Charlesworth D, Willis JH. The genetics of inbreeding depression. Nat Rev Genet. 2009;10(11):783–96. pmid:19834483
- View Article
- PubMed/NCBI
- Google Scholar
39. Weigel D. Natural variation in Arabidopsis: from molecular genetics to ecological genomics. Plant Physiol. 2012;158(1):2–22. pmid:22147517
- View Article
- PubMed/NCBI
- Google Scholar
40. Korves TM, Schmid KJ, Caicedo AL, Mays C, Stinchcombe JR, Purugganan MD, et al. Fitness effects associated with the major flowering time gene FRIGIDA in Arabidopsis thaliana in the field. Am Nat. 2007;169(5):E141–57. pmid:17427127
- View Article
- PubMed/NCBI
- Google Scholar
41. Ågren J, Schemske DW. Reciprocal transplants demonstrate strong adaptive differentiation of the model organism Arabidopsis thaliana in its native range. New Phytol. 2012;194(4):1112–22. pmid:22432639
- View Article
- PubMed/NCBI
- Google Scholar
42. Vasseur F, Sartori K, Baron E, Fort F, Kazakou E, Segrestin J, et al. Climate as a driver of adaptive variations in ecological strategies in Arabidopsis thaliana. Ann Bot. 2018;122(6):935–45. pmid:30256896
- View Article
- PubMed/NCBI
- Google Scholar
43. Postma FM, Ågren J. Effects of primary seed dormancy on lifetime fitness of Arabidopsis thaliana in the field. Ann Bot. 2022;129(7):795–808. pmid:35092679
- View Article
- PubMed/NCBI
- Google Scholar
44. Ågren J, Oakley CG, Lundemo S, Schemske DW. Adaptive divergence in flowering time among natural populations of Arabidopsis thaliana: estimates of selection and QTL mapping. Evolution. 2017;71(3):550–64. pmid:27859214
- View Article
- PubMed/NCBI
- Google Scholar
45. Thoen MPM, Davila Olivas NH, Kloth KJ, Coolen S, Huang P-P, Aarts MGM, et al. Genetic architecture of plant stress resistance: multi-trait genome-wide association mapping. New Phytol. 2017;213(3):1346–62. pmid:27699793
- View Article
- PubMed/NCBI
- Google Scholar
46. Song H, Cao Y, Zhao L, Zhang J, Li S. Review: WRKY transcription factors: understanding the functional divergence. Plant Sci. 2023;334:111770. pmid:37321304
- View Article
- PubMed/NCBI
- Google Scholar
47. Bresson J, Doll J, Vasseur F, Stahl M, von Roepenack-Lahaye E, Kilian J, et al. The genetic interaction of REVOLUTA and WRKY53 links plant development, senescence, and immune responses. PLoS One. 2022;17(3):e0254741. pmid:35333873
- View Article
- PubMed/NCBI
- Google Scholar
48. Rehman S, Bahadur S, Xia W. Unlocking nature’s secrets: the pivotal role of WRKY transcription factors in plant flowering and fruit development. Plant Sci. 2024;346:112150. pmid:38857658
- View Article
- PubMed/NCBI
- Google Scholar
49. Song H, Duan Z, Zhang J. WRKY transcription factors modulate flowering time and response to environmental changes. Plant Physiol Biochem. 2024;210:108630. pmid:38657548
- View Article
- PubMed/NCBI
- Google Scholar
50. Li W, Wang H, Yu D. Arabidopsis WRKY transcription factors WRKY12 and WRKY13 oppositely regulate flowering under short-day conditions. Mol Plant. 2016;9(11):1492–503. pmid:27592586
- View Article
- PubMed/NCBI
- Google Scholar
51. Exposito-Alonso M, Vasseur F, Ding W, Wang G, Burbano HA, Weigel D. Genomic basis and evolutionary potential for extreme drought adaptation in Arabidopsis thaliana. Nat Ecol Evol. 2018;2(2):352–8. pmid:29255303
- View Article
- PubMed/NCBI
- Google Scholar
52. Martínez-Berdeja A, Stitzer MC, Taylor MA, Okada M, Ezcurra E, Runcie DE, et al. Functional variants of DOG1 control seed chilling responses and variation in seasonal life-history strategies in Arabidopsis thaliana. Proc Natl Acad Sci U S A. 2020;117(5):2526–34. pmid:31964817
- View Article
- PubMed/NCBI
- Google Scholar
53. Martínez-Berdeja A. Variation in dormancy and the timing of germination allows annuals to display different winter life histories in contrasting seasonal environments. A commentary on: “Effects of primary seed dormancy on lifetime fitness of Arabidopsis thaliana in the field”. Ann Bot. 2022;129(7):viii–x. pmid:35349632
- View Article
- PubMed/NCBI
- Google Scholar
54. Taylor MA, Cooper MD, Sellamuthu R, Braun P, Migneault A, Browning A, et al. Interacting effects of genetic variation for seed dormancy and flowering time on phenology, life history, and fitness of experimental Arabidopsis thaliana populations over multiple generations in the field. New Phytol. 2017;216(1):291–302. pmid:28752957
- View Article
- PubMed/NCBI
- Google Scholar
55. Vasseur F, Exposito-Alonso M, Ayala-Garay OJ, Wang G, Enquist BJ, Vile D, et al. Adaptive diversification of growth allometry in the plant Arabidopsis thaliana. Proc Natl Acad Sci U S A. 2018;115(13):3416–21. pmid:29540570
- View Article
- PubMed/NCBI
- Google Scholar
56. Auge GA, Penfield S, Donohue K. Pleiotropy in developmental regulation by flowering-pathway genes: is it an evolutionary constraint? New Phytol. 2019;224(1):55–70. pmid:31074008
- View Article
- PubMed/NCBI
- Google Scholar
57. Przybylska MS, Violle C, Vile D, Scheepens JF, Lacombe B, Le Roux X, et al. AraDiv: a dataset of functional traits and leaf hyperspectral reflectance of Arabidopsis thaliana. Sci Data. 2023;10(1):314. pmid:37225767
- View Article
- PubMed/NCBI
- Google Scholar
58. Garnier E, Shipley B, Roumet C, Laurent G. A standardized protocol for the determination of specific leaf area and leaf dry matter content. Funct Ecol. 2001;15(5):688–95.
- View Article
- Google Scholar
59. Pérez-Harguindeguy N, Díaz S, Garnier E, Lavorel S, Poorter H, Jaureguiberry P, et al. New handbook for standardised measurement of plant functional traits worldwide. Australian J Botany. 2013;61(3):167–234.
- View Article
- Google Scholar
60. Schneider CA, Rasband WS, Eliceiri KW. NIH Image to ImageJ: 25 years of image analysis. Nat Methods. 2012;9(7):671–5. pmid:22930834
- View Article
- PubMed/NCBI
- Google Scholar
61. Vasseur F, Cornet D, Beurier G, Messier J, Rouan L, Bresson J, et al. A perspective on plant phenomics: coupling deep learning and near-infrared spectroscopy. Front Plant Sci. 2022;13:836488. pmid:35668791
- View Article
- PubMed/NCBI
- Google Scholar
62. Josse J, Husson F. missMDA: a package for handling missing values in multivariate data analysis. J Stat Soft. 2016;70(1):1–31.
- View Article
- Google Scholar
63. Mason CM, Donovan LA. Evolution of the leaf economics spectrum in herbs: evidence from environmental divergences in leaf physiology across Helianthus (Asteraceae). Evolution. 2015;69(10):2705–20. pmid:26339995
- View Article
- PubMed/NCBI
- Google Scholar
64. Dray S, Dufour A-B. Theade4Package: implementing the duality diagram for ecologists. J Stat Soft. 2007;22(4):1–20.
- View Article
- Google Scholar
65. Danecek P, Auton A, Abecasis G, Albers CA, Banks E, DePristo MA, et al. The variant call format and VCFtools. Bioinformatics. 2011;27(15):2156–8. pmid:21653522
- View Article
- PubMed/NCBI
- Google Scholar
66. Warton DI, Duursma RA, Falster DS, Taskinen S. smatr 3– an R package for estimation and inference about allometric lines. Methods Ecol Evol. 2011;3(2):257–9.
- View Article
- Google Scholar
67. Zhou X, Stephens M. Genome-wide efficient mixed-model analysis for association studies. Nat Genet. 2012;44(7):821–4. pmid:22706312
- View Article
- PubMed/NCBI
- Google Scholar
68. Chang CC, Chow CC, Tellier LC, Vattikuti S, Purcell SM, Lee JJ. Second-generation PLINK: rising to the challenge of larger and richer datasets. Gigascience. 2015;4:7. pmid:25722852
- View Article
- PubMed/NCBI
- Google Scholar
69. Hadfield JD. MCMC methods for multi-response generalized linear mixed models: The MCMCglmm R package. J Stat Soft. 2010;33(2):1–22.
- View Article
- Google Scholar
70. Purcell S, Neale B, Todd-Brown K, Thomas L, Ferreira MAR, Bender D, et al. PLINK: a tool set for whole-genome association and population-based linkage analyses. Am J Hum Genet. 2007;81(3):559–75. pmid:17701901
- View Article
- PubMed/NCBI
- Google Scholar
71. Hijmans RJ. terra: spatial data analysis. 2024. Available from: https://cran.r-project.org/web/packages/terra/index.html
72. Karger DN, Conrad O, Böhner J, Kawohl T, Kreft H, Soria-Auza RW, et al. Climatologies at high resolution for the earth’s land surface areas. Sci Data. 2017;4:170122. pmid:28872642
- View Article
- PubMed/NCBI
- Google Scholar
73. Karger DN, Conrad O, Böhner J, Kawohl T, Kreft H, Soria-Auza RW, et al. Climatologies at high resolution for the earth’s land surface areas. 2021 [cited 2025 Nov 06]. Database: EnviDat [Internet]. https://doi.org/10.16904/envidat.228
74. Poggio L, de Sousa LM, Batjes NH, Heuvelink GBM, Kempen B, Ribeiro E, et al. SoilGrids 2.0: producing soil information for the globe with quantified spatial uncertainty. SOIL. 2021;7(1):217–40.
- View Article
- Google Scholar
75. Liaw A, Wiener M. Classification and regression by randomForest. R News. 2002;2:18–22.
- View Article
- Google Scholar
76. Kursa MB, Rudnicki WR. Feature selection with the Boruta Package. J Stat Soft. 2010;36(11):1–13.
- View Article
- Google Scholar
77. Roux F, Gasquez J, Reboud X. The dominance of the herbicide resistance cost in several Arabidopsis thaliana mutant lines. Genetics. 2004;166(1):449–60. pmid:15020435
- View Article
- PubMed/NCBI
- Google Scholar
78. Wilczek AM, Cooper MD, Korves TM, Schmitt J. Lagging adaptation to warming climate in Arabidopsis thaliana. Proc Natl Acad Sci U S A. 2014;111(22):7906–13. pmid:24843140
- View Article
- PubMed/NCBI
- Google Scholar
79. R Core Team. R: a language and environment for statistical computing. 2024. Available from: https://www.r-project.org/
80. Lê S, Josse J, Husson F. FactoMineR: an R package for multivariate analysis. J Stat Soft. 2008;25(1):1–18.
- View Article
- Google Scholar

[ref1] 1. Endler JA. Multiple-trait coevolution and environmental gradients in guppies. Trends Ecol Evol. 1995;10(1):22–9. pmid:21236940
View Article
PubMed/NCBI
Google Scholar

[2] View Article

[3] PubMed/NCBI

[4] Google Scholar

[ref2] 2. Pigliucci M. Finding the way in phenotypic space: the origin and maintenance of constraints on organismal form. Ann Bot. 2007;100(3):433–8. pmid:17495983
View Article
PubMed/NCBI
Google Scholar

[6] View Article

[7] PubMed/NCBI

[8] Google Scholar

[ref3] 3. Donovan LA, Maherali H, Caruso CM, Huber H, de Kroon H. The evolution of the worldwide leaf economics spectrum. Trends Ecol Evol. 2011;26(2):88–95. pmid:21196061
View Article
PubMed/NCBI
Google Scholar

[10] View Article

[11] PubMed/NCBI

[12] Google Scholar

[ref4] 4. Olson ME. The developmental renaissance in adaptationism. Trends Ecol Evol. 2012;27(5):278–87. pmid:22326724
View Article
PubMed/NCBI
Google Scholar

[14] View Article

[15] PubMed/NCBI

[16] Google Scholar

[ref5] 5. Díaz S, Kattge J, Cornelissen JHC, Wright IJ, Lavorel S, Dray S, et al. The global spectrum of plant form and function. Nature. 2016;529(7585):167–71. pmid:26700811
View Article
PubMed/NCBI
Google Scholar

[18] View Article

[19] PubMed/NCBI

[20] Google Scholar

[ref6] 6. Pigliucci M, Preston K. Phenotypic integration: studying the ecology and evolution of complex phenotypes. Oxford: Oxford University Press; 2004.

[ref7] 7. Schlötterer C. Unraveling the molecular basis of stabilizing selection by experimental evolution. Genome Biol Evol. 2023;15(12):evad220. pmid:38092037
View Article
PubMed/NCBI
Google Scholar

[23] View Article

[24] PubMed/NCBI

[25] Google Scholar

[ref8] 8. Kingsolver JG, Diamond SE. Phenotypic selection in natural populations: what limits directional selection? Am Nat. 2011;177(3):346–57. pmid:21460543
View Article
PubMed/NCBI
Google Scholar

[27] View Article

[28] PubMed/NCBI

[29] Google Scholar

[ref9] 9. Melamed-Bessudo C, Shilo S, Levy AA. Meiotic recombination and genome evolution in plants. Curr Opin Plant Biol. 2016;30:82–7. pmid:26939088
View Article
PubMed/NCBI
Google Scholar

[31] View Article

[32] PubMed/NCBI

[33] Google Scholar

[ref10] 10. Fernandes JB, Séguéla-Arnaud M, Larchevêque C, Lloyd AH, Mercier R. Unleashing meiotic crossovers in hybrid plants. Proc Natl Acad Sci U S A. 2018;115(10):2431–6. pmid:29183972
View Article
PubMed/NCBI
Google Scholar

[35] View Article

[36] PubMed/NCBI

[37] Google Scholar

[ref11] 11. Fraser HB. Detecting selection with a genetic cross. Proc Natl Acad Sci U S A. 2020;117(36):22323–30. pmid:32848059
View Article
PubMed/NCBI
Google Scholar

[39] View Article

[40] PubMed/NCBI

[41] Google Scholar

[ref12] 12. Blonder B, Lamanna C, Violle C, Enquist BJ. The n‐dimensional hypervolume. Glob Ecol Biogeogr. 2014;23(5):595–609.
View Article
Google Scholar

[43] View Article

[44] Google Scholar

[ref13] 13. Rieseberg LH, Archer MA, Wayne RK. Transgressive segregation, adaptation and speciation. Heredity (Edinb). 1999;83(Pt 4):363–72. pmid:10583537
View Article
PubMed/NCBI
Google Scholar

[46] View Article

[47] PubMed/NCBI

[48] Google Scholar

[ref14] 14. Thompson KA, Osmond MM, Schluter D. Parallel genetic evolution and speciation from standing variation. Evol Lett. 2019;3(2):129–41. pmid:31289688
View Article
PubMed/NCBI
Google Scholar

[50] View Article

[51] PubMed/NCBI

[52] Google Scholar

[ref15] 15. Thompson KA, Brandvain Y, Coughlan JM, Delmore KE, Justen H, Linnen CR, et al. The ecology of hybrid incompatibilities. Cold Spring Harb Perspect Biol. 2024;16(9):a041440. pmid:38151331
View Article
PubMed/NCBI
Google Scholar

[54] View Article

[55] PubMed/NCBI

[56] Google Scholar

[ref16] 16. Chevin L-M, Decorzent G, Lenormand T. Niche dimensionality and the genetics of ecological speciation. Evolution. 2014;68(5):1244–56. pmid:24410181
View Article
PubMed/NCBI
Google Scholar

[58] View Article

[59] PubMed/NCBI

[60] Google Scholar

[ref17] 17. Stelkens RB, Schmid C, Selz O, Seehausen O. Phenotypic novelty in experimental hybrids is predicted by the genetic distance between species of cichlid fish. BMC Evol Biol. 2009;9:283. pmid:19961584
View Article
PubMed/NCBI
Google Scholar

[62] View Article

[63] PubMed/NCBI

[64] Google Scholar

[ref18] 18. Stelkens R, Seehausen O. Genetic distance between species predicts novel trait expression in their hybrids. Evolution. 2009;63(4):884–97. pmid:19220450
View Article
PubMed/NCBI
Google Scholar

[66] View Article

[67] PubMed/NCBI

[68] Google Scholar

[ref19] 19. Munoz F, Klausmeier CA, Gaüzère P, Kandlikar G, Litchman E, Mouquet N, et al. The ecological causes of functional distinctiveness in communities. Ecol Lett. 2023;26(8):1452–65. pmid:37322850
View Article
PubMed/NCBI
Google Scholar

[70] View Article

[71] PubMed/NCBI

[72] Google Scholar

[ref20] 20. The 1001 Genomes Consortium. 1,135 genomes reveal the global pattern of polymorphism in Arabidopsis thaliana. Cell. 2016;166(2):481–91. pmid:27293186
View Article
PubMed/NCBI
Google Scholar

[74] View Article

[75] PubMed/NCBI

[76] Google Scholar

[ref21] 21. Lee C-R, Svardal H, Farlow A, Exposito-Alonso M, Ding W, Novikova P, et al. On the post-glacial spread of human commensal Arabidopsis thaliana. Nat Commun. 2017;8:14458. pmid:28181519
View Article
PubMed/NCBI
Google Scholar

[78] View Article

[79] PubMed/NCBI

[80] Google Scholar

[ref22] 22. Suarez-Gonzalez A, Lexer C, Cronk QCB. Adaptive introgression: a plant perspective. Biol Lett. 2018;14(3):20170688. pmid:29540564
View Article
PubMed/NCBI
Google Scholar

[82] View Article

[83] PubMed/NCBI

[84] Google Scholar

[ref23] 23. Hanemian M, Vasseur F, Marchadier E, Gilbault E, Bresson J, Gy I, et al. Natural variation at FLM splicing has pleiotropic effects modulating ecological strategies in Arabidopsis thaliana. Nat Commun. 2020;11(1):4140. pmid:32811829
View Article
PubMed/NCBI
Google Scholar

[86] View Article

[87] PubMed/NCBI

[88] Google Scholar

[ref24] 24. Vasseur F, Fouqueau L, de Vienne D, Nidelet T, Violle C, Weigel D. Nonlinear phenotypic variation uncovers the emergence of heterosis in Arabidopsis thaliana. PLoS Biol. 2019;17(4):e3000214. pmid:31017902
View Article
PubMed/NCBI
Google Scholar

[90] View Article

[91] PubMed/NCBI

[92] Google Scholar

[ref25] 25. Blonder B. Hypervolume concepts in niche‐ and trait‐based ecology. Ecography. 2017;41(9):1441–55.
View Article
Google Scholar

[94] View Article

[95] Google Scholar

[ref26] 26. Wright IJ, Reich PB, Westoby M, Ackerly DD, Baruch Z, Bongers F, et al. The worldwide leaf economics spectrum. Nature. 2004;428(6985):821–7. pmid:15103368
View Article
PubMed/NCBI
Google Scholar

[97] View Article

[98] PubMed/NCBI

[99] Google Scholar

[ref27] 27. Blonder B, Morrow CB, Maitner B, Harris DJ, Lamanna C, Violle C, et al. New approaches for delineating n‐dimensional hypervolumes. Methods Ecol Evol. 2017;9(2):305–19.
View Article
Google Scholar

[101] View Article

[102] Google Scholar

[ref28] 28. Zhou X, Carbonetto P, Stephens M. Polygenic modeling with Bayesian sparse linear mixed models. PLoS Genet. 2013;9(2):e1003264. pmid:23408905
View Article
PubMed/NCBI
Google Scholar

[104] View Article

[105] PubMed/NCBI

[106] Google Scholar

[ref29] 29. Miao Y, Laun T, Zimmermann P, Zentgraf U. Targets of the WRKY53 transcription factor and its role during leaf senescence in Arabidopsis. Plant Mol Biol. 2004;55(6):853–67. pmid:15604721
View Article
PubMed/NCBI
Google Scholar

[108] View Article

[109] PubMed/NCBI

[110] Google Scholar

[ref30] 30. Massicotte P, South A. rnaturalearth: world map data from Natural Earth. 2023. Available from: https://cran.r-project.org/web/packages/rnaturalearth/index.html

[ref31] 31. Exposito-Alonso M, Brennan AC, Alonso-Blanco C, Picó FX. Spatio-temporal variation in fitness responses to contrasting environments in Arabidopsis thaliana. Evolution. 2018;:10.1111/evo.13508. pmid:29947421
View Article
PubMed/NCBI
Google Scholar

[113] View Article

[114] PubMed/NCBI

[115] Google Scholar

[ref32] 32. Samis KE, Stinchcombe JR, Murren CJ. Population climatic history predicts phenotypic responses in novel environments for Arabidopsis thaliana in North America. Am J Bot. 2019;106(8):1068–80. pmid:31364776
View Article
PubMed/NCBI
Google Scholar

[117] View Article

[118] PubMed/NCBI

[119] Google Scholar

[ref33] 33. Fournier-Level A, Taylor MA, Paril JF, Martínez-Berdeja A, Stitzer MC, Cooper MD, et al. Adaptive significance of flowering time variation across natural seasonal environments in Arabidopsis thaliana. New Phytol. 2022;234(2):719–34. pmid:35090191
View Article
PubMed/NCBI
Google Scholar

[121] View Article

[122] PubMed/NCBI

[123] Google Scholar

[ref34] 34. Méndez‐Vigo B, Castilla AR, Gómez R, Marcer A, Alonso‐Blanco C, Picó FX. Spatiotemporal dynamics of genetic variation at the quantitative and molecular levels within a natural Arabidopsis thaliana population. J Ecol. 2022;110(11):2701–16.
View Article
Google Scholar

[125] View Article

[126] Google Scholar

[ref35] 35. Sartori K, Vasseur F, Violle C, Baron E, Gerard M, Rowe N, et al. Leaf economics and slow-fast adaptation across the geographic range of Arabidopsis thaliana. Sci Rep. 2019;9(1):10758. pmid:31341185
View Article
PubMed/NCBI
Google Scholar

[128] View Article

[129] PubMed/NCBI

[130] Google Scholar

[ref36] 36. Vasseur F, Violle C, Enquist BJ, Granier C, Vile D. A common genetic basis to the origin of the leaf economics spectrum and metabolic scaling allometry. Ecol Lett. 2012;15(10):1149–57. pmid:22856883
View Article
PubMed/NCBI
Google Scholar

[132] View Article

[133] PubMed/NCBI

[134] Google Scholar

[ref37] 37. de Villemereuil P, Charmantier A, Arlt D, Bize P, Brekke P, Brouwer L, et al. Fluctuating optimum and temporally variable selection on breeding date in birds and mammals. Proc Natl Acad Sci U S A. 2020;117(50):31969–78. pmid:33257553
View Article
PubMed/NCBI
Google Scholar

[136] View Article

[137] PubMed/NCBI

[138] Google Scholar

[ref38] 38. Charlesworth D, Willis JH. The genetics of inbreeding depression. Nat Rev Genet. 2009;10(11):783–96. pmid:19834483
View Article
PubMed/NCBI
Google Scholar

[140] View Article

[141] PubMed/NCBI

[142] Google Scholar

[ref39] 39. Weigel D. Natural variation in Arabidopsis: from molecular genetics to ecological genomics. Plant Physiol. 2012;158(1):2–22. pmid:22147517
View Article
PubMed/NCBI
Google Scholar

[144] View Article

[145] PubMed/NCBI

[146] Google Scholar

[ref40] 40. Korves TM, Schmid KJ, Caicedo AL, Mays C, Stinchcombe JR, Purugganan MD, et al. Fitness effects associated with the major flowering time gene FRIGIDA in Arabidopsis thaliana in the field. Am Nat. 2007;169(5):E141–57. pmid:17427127
View Article
PubMed/NCBI
Google Scholar

[148] View Article

[149] PubMed/NCBI

[150] Google Scholar

[ref41] 41. Ågren J, Schemske DW. Reciprocal transplants demonstrate strong adaptive differentiation of the model organism Arabidopsis thaliana in its native range. New Phytol. 2012;194(4):1112–22. pmid:22432639
View Article
PubMed/NCBI
Google Scholar

[152] View Article

[153] PubMed/NCBI

[154] Google Scholar

[ref42] 42. Vasseur F, Sartori K, Baron E, Fort F, Kazakou E, Segrestin J, et al. Climate as a driver of adaptive variations in ecological strategies in Arabidopsis thaliana. Ann Bot. 2018;122(6):935–45. pmid:30256896
View Article
PubMed/NCBI
Google Scholar

[156] View Article

[157] PubMed/NCBI

[158] Google Scholar

[ref43] 43. Postma FM, Ågren J. Effects of primary seed dormancy on lifetime fitness of Arabidopsis thaliana in the field. Ann Bot. 2022;129(7):795–808. pmid:35092679
View Article
PubMed/NCBI
Google Scholar

[160] View Article

[161] PubMed/NCBI

[162] Google Scholar

[ref44] 44. Ågren J, Oakley CG, Lundemo S, Schemske DW. Adaptive divergence in flowering time among natural populations of Arabidopsis thaliana: estimates of selection and QTL mapping. Evolution. 2017;71(3):550–64. pmid:27859214
View Article
PubMed/NCBI
Google Scholar

[164] View Article

[165] PubMed/NCBI

[166] Google Scholar

[ref45] 45. Thoen MPM, Davila Olivas NH, Kloth KJ, Coolen S, Huang P-P, Aarts MGM, et al. Genetic architecture of plant stress resistance: multi-trait genome-wide association mapping. New Phytol. 2017;213(3):1346–62. pmid:27699793
View Article
PubMed/NCBI
Google Scholar

[168] View Article

[169] PubMed/NCBI

[170] Google Scholar

[ref46] 46. Song H, Cao Y, Zhao L, Zhang J, Li S. Review: WRKY transcription factors: understanding the functional divergence. Plant Sci. 2023;334:111770. pmid:37321304
View Article
PubMed/NCBI
Google Scholar

[172] View Article

[173] PubMed/NCBI

[174] Google Scholar

[ref47] 47. Bresson J, Doll J, Vasseur F, Stahl M, von Roepenack-Lahaye E, Kilian J, et al. The genetic interaction of REVOLUTA and WRKY53 links plant development, senescence, and immune responses. PLoS One. 2022;17(3):e0254741. pmid:35333873
View Article
PubMed/NCBI
Google Scholar

[176] View Article

[177] PubMed/NCBI

[178] Google Scholar

[ref48] 48. Rehman S, Bahadur S, Xia W. Unlocking nature’s secrets: the pivotal role of WRKY transcription factors in plant flowering and fruit development. Plant Sci. 2024;346:112150. pmid:38857658
View Article
PubMed/NCBI
Google Scholar

[180] View Article

[181] PubMed/NCBI

[182] Google Scholar

[ref49] 49. Song H, Duan Z, Zhang J. WRKY transcription factors modulate flowering time and response to environmental changes. Plant Physiol Biochem. 2024;210:108630. pmid:38657548
View Article
PubMed/NCBI
Google Scholar

[184] View Article

[185] PubMed/NCBI

[186] Google Scholar

[ref50] 50. Li W, Wang H, Yu D. Arabidopsis WRKY transcription factors WRKY12 and WRKY13 oppositely regulate flowering under short-day conditions. Mol Plant. 2016;9(11):1492–503. pmid:27592586
View Article
PubMed/NCBI
Google Scholar

[188] View Article

[189] PubMed/NCBI

[190] Google Scholar

[ref51] 51. Exposito-Alonso M, Vasseur F, Ding W, Wang G, Burbano HA, Weigel D. Genomic basis and evolutionary potential for extreme drought adaptation in Arabidopsis thaliana. Nat Ecol Evol. 2018;2(2):352–8. pmid:29255303
View Article
PubMed/NCBI
Google Scholar

[192] View Article

[193] PubMed/NCBI

[194] Google Scholar

[ref52] 52. Martínez-Berdeja A, Stitzer MC, Taylor MA, Okada M, Ezcurra E, Runcie DE, et al. Functional variants of DOG1 control seed chilling responses and variation in seasonal life-history strategies in Arabidopsis thaliana. Proc Natl Acad Sci U S A. 2020;117(5):2526–34. pmid:31964817
View Article
PubMed/NCBI
Google Scholar

[196] View Article

[197] PubMed/NCBI

[198] Google Scholar

[ref53] 53. Martínez-Berdeja A. Variation in dormancy and the timing of germination allows annuals to display different winter life histories in contrasting seasonal environments. A commentary on: “Effects of primary seed dormancy on lifetime fitness of Arabidopsis thaliana in the field”. Ann Bot. 2022;129(7):viii–x. pmid:35349632
View Article
PubMed/NCBI
Google Scholar

[200] View Article

[201] PubMed/NCBI

[202] Google Scholar

[ref54] 54. Taylor MA, Cooper MD, Sellamuthu R, Braun P, Migneault A, Browning A, et al. Interacting effects of genetic variation for seed dormancy and flowering time on phenology, life history, and fitness of experimental Arabidopsis thaliana populations over multiple generations in the field. New Phytol. 2017;216(1):291–302. pmid:28752957
View Article
PubMed/NCBI
Google Scholar

[204] View Article

[205] PubMed/NCBI

[206] Google Scholar

[ref55] 55. Vasseur F, Exposito-Alonso M, Ayala-Garay OJ, Wang G, Enquist BJ, Vile D, et al. Adaptive diversification of growth allometry in the plant Arabidopsis thaliana. Proc Natl Acad Sci U S A. 2018;115(13):3416–21. pmid:29540570
View Article
PubMed/NCBI
Google Scholar

[208] View Article

[209] PubMed/NCBI

[210] Google Scholar

[ref56] 56. Auge GA, Penfield S, Donohue K. Pleiotropy in developmental regulation by flowering-pathway genes: is it an evolutionary constraint? New Phytol. 2019;224(1):55–70. pmid:31074008
View Article
PubMed/NCBI
Google Scholar

[212] View Article

[213] PubMed/NCBI

[214] Google Scholar

[ref57] 57. Przybylska MS, Violle C, Vile D, Scheepens JF, Lacombe B, Le Roux X, et al. AraDiv: a dataset of functional traits and leaf hyperspectral reflectance of Arabidopsis thaliana. Sci Data. 2023;10(1):314. pmid:37225767
View Article
PubMed/NCBI
Google Scholar

[216] View Article

[217] PubMed/NCBI

[218] Google Scholar

[ref58] 58. Garnier E, Shipley B, Roumet C, Laurent G. A standardized protocol for the determination of specific leaf area and leaf dry matter content. Funct Ecol. 2001;15(5):688–95.
View Article
Google Scholar

[220] View Article

[221] Google Scholar

[ref59] 59. Pérez-Harguindeguy N, Díaz S, Garnier E, Lavorel S, Poorter H, Jaureguiberry P, et al. New handbook for standardised measurement of plant functional traits worldwide. Australian J Botany. 2013;61(3):167–234.
View Article
Google Scholar

[223] View Article

[224] Google Scholar

[ref60] 60. Schneider CA, Rasband WS, Eliceiri KW. NIH Image to ImageJ: 25 years of image analysis. Nat Methods. 2012;9(7):671–5. pmid:22930834
View Article
PubMed/NCBI
Google Scholar

[226] View Article

[227] PubMed/NCBI

[228] Google Scholar

[ref61] 61. Vasseur F, Cornet D, Beurier G, Messier J, Rouan L, Bresson J, et al. A perspective on plant phenomics: coupling deep learning and near-infrared spectroscopy. Front Plant Sci. 2022;13:836488. pmid:35668791
View Article
PubMed/NCBI
Google Scholar

[230] View Article

[231] PubMed/NCBI

[232] Google Scholar

[ref62] 62. Josse J, Husson F. missMDA: a package for handling missing values in multivariate data analysis. J Stat Soft. 2016;70(1):1–31.
View Article
Google Scholar

[234] View Article

[235] Google Scholar

[ref63] 63. Mason CM, Donovan LA. Evolution of the leaf economics spectrum in herbs: evidence from environmental divergences in leaf physiology across Helianthus (Asteraceae). Evolution. 2015;69(10):2705–20. pmid:26339995
View Article
PubMed/NCBI
Google Scholar

[237] View Article

[238] PubMed/NCBI

[239] Google Scholar

[ref64] 64. Dray S, Dufour A-B. Theade4Package: implementing the duality diagram for ecologists. J Stat Soft. 2007;22(4):1–20.
View Article
Google Scholar

[241] View Article

[242] Google Scholar

[ref65] 65. Danecek P, Auton A, Abecasis G, Albers CA, Banks E, DePristo MA, et al. The variant call format and VCFtools. Bioinformatics. 2011;27(15):2156–8. pmid:21653522
View Article
PubMed/NCBI
Google Scholar

[244] View Article

[245] PubMed/NCBI

[246] Google Scholar

[ref66] 66. Warton DI, Duursma RA, Falster DS, Taskinen S. smatr 3– an R package for estimation and inference about allometric lines. Methods Ecol Evol. 2011;3(2):257–9.
View Article
Google Scholar

[248] View Article

[249] Google Scholar

[ref67] 67. Zhou X, Stephens M. Genome-wide efficient mixed-model analysis for association studies. Nat Genet. 2012;44(7):821–4. pmid:22706312
View Article
PubMed/NCBI
Google Scholar

[251] View Article

[252] PubMed/NCBI

[253] Google Scholar

[ref68] 68. Chang CC, Chow CC, Tellier LC, Vattikuti S, Purcell SM, Lee JJ. Second-generation PLINK: rising to the challenge of larger and richer datasets. Gigascience. 2015;4:7. pmid:25722852
View Article
PubMed/NCBI
Google Scholar

[255] View Article

[256] PubMed/NCBI

[257] Google Scholar

[ref69] 69. Hadfield JD. MCMC methods for multi-response generalized linear mixed models: The MCMCglmm R package. J Stat Soft. 2010;33(2):1–22.
View Article
Google Scholar

[259] View Article

[260] Google Scholar

[ref70] 70. Purcell S, Neale B, Todd-Brown K, Thomas L, Ferreira MAR, Bender D, et al. PLINK: a tool set for whole-genome association and population-based linkage analyses. Am J Hum Genet. 2007;81(3):559–75. pmid:17701901
View Article
PubMed/NCBI
Google Scholar

[262] View Article

[263] PubMed/NCBI

[264] Google Scholar

[ref71] 71. Hijmans RJ. terra: spatial data analysis. 2024. Available from: https://cran.r-project.org/web/packages/terra/index.html

[ref72] 72. Karger DN, Conrad O, Böhner J, Kawohl T, Kreft H, Soria-Auza RW, et al. Climatologies at high resolution for the earth’s land surface areas. Sci Data. 2017;4:170122. pmid:28872642
View Article
PubMed/NCBI
Google Scholar

[267] View Article

[268] PubMed/NCBI

[269] Google Scholar

[ref73] 73. Karger DN, Conrad O, Böhner J, Kawohl T, Kreft H, Soria-Auza RW, et al. Climatologies at high resolution for the earth’s land surface areas. 2021 [cited 2025 Nov 06]. Database: EnviDat [Internet]. https://doi.org/10.16904/envidat.228

[ref74] 74. Poggio L, de Sousa LM, Batjes NH, Heuvelink GBM, Kempen B, Ribeiro E, et al. SoilGrids 2.0: producing soil information for the globe with quantified spatial uncertainty. SOIL. 2021;7(1):217–40.
View Article
Google Scholar

[272] View Article

[273] Google Scholar

[ref75] 75. Liaw A, Wiener M. Classification and regression by randomForest. R News. 2002;2:18–22.
View Article
Google Scholar

[275] View Article

[276] Google Scholar

[ref76] 76. Kursa MB, Rudnicki WR. Feature selection with the Boruta Package. J Stat Soft. 2010;36(11):1–13.
View Article
Google Scholar

[278] View Article

[279] Google Scholar

[ref77] 77. Roux F, Gasquez J, Reboud X. The dominance of the herbicide resistance cost in several Arabidopsis thaliana mutant lines. Genetics. 2004;166(1):449–60. pmid:15020435
View Article
PubMed/NCBI
Google Scholar

[281] View Article

[282] PubMed/NCBI

[283] Google Scholar

[ref78] 78. Wilczek AM, Cooper MD, Korves TM, Schmitt J. Lagging adaptation to warming climate in Arabidopsis thaliana. Proc Natl Acad Sci U S A. 2014;111(22):7906–13. pmid:24843140
View Article
PubMed/NCBI
Google Scholar

[285] View Article

[286] PubMed/NCBI

[287] Google Scholar

[ref79] 79. R Core Team. R: a language and environment for statistical computing. 2024. Available from: https://www.r-project.org/

[ref80] 80. Lê S, Josse J, Husson F. FactoMineR: an R package for multivariate analysis. J Stat Soft. 2008;25(1):1–18.
View Article
Google Scholar

[290] View Article

[291] Google Scholar

Figures

Abstract

Introduction

Results

Wild accessions occupy a smaller phenotypic space than recombinant F2s, but present phenotypic outliers

Divergent selection on traits influences the extent of the phenotypic space of wild accessions

Strong directional selection explains the presence of phenotypic outliers that are characterized by relict introgression segments

Discussion

Conclusions

Materials and methods

Plant material

Experimental setup

Trait measurements

Multivariate analyses

Genomic analyses

Environmental analyses

Extended comparisons for phenotypically unique versus common accessions

Supporting information

S1 Table. List of the phenotypically unique accessions.

S2 Table. Accuracy of trait imputation for the 104 G0 accessions for which phenotypes had not been measured directly.

S3 Table. “Chip heritability” (“PVE”) and the proportion of SNPs presenting a larger effect (“pi”) on traits of wild accessions.

S1 Fig. Jaccard’s index distribution for the comparison between hypervolumes obtained for F2s and wild accessions.

S2 Fig. Trait variation within the phenotypic spaces of F2s (blue) and wild accessions (red) of Arabidopsis thaliana, based on nine hypervolume recalculations.

S3 Fig. Hypervolume sizes for F2s (blue) versus wild accessions (red) when excluding flowering time from the analyses.

S4 Fig. Trait variation within the phenotypic spaces of F2s (blue) and wild accessions (red) of Arabidopsis thaliana, based on 10 hypervolume calculations without flowering time.

S5 Fig. Correlation between the whole-genome genetic diversity (π) of the 254 parental (G0) accessions of the F2s and the other 459 accessions analyzed in this study.

S6 Fig. Hypervolume sizes for F2s (blue) versus wild accessions (red) when analyzing only the 254 G0 accessions that were directly phenotyped in this study.

S7 Fig. Trait variation within the phenotypic spaces of F2s (blue) and the 254 G0 accessions (red) of Arabidopsis thaliana, based on 10 hypervolume calculations.

S8 Fig. Hypervolume sizes for F2s (blue) versus wild accessions (red) when including in the analyses imputed phenotypes from 104 G0 accessions lacking direct measurements.

S9 Fig. Trait variation within the phenotypic spaces of F2s (blue) and wild accessions (red) of Arabidopsis thaliana, based on 10 hypervolume calculations including imputed phenotypes from 104 G0 accessions lacking direct measurements.

S10 Fig. Manhattan plots showing SNPs associated with each analyzed trait in the 254 G0 accessions.

S11 Fig. Boxplots comparing environmental variables associated with phenotypically unique and common accessions.

S12 Fig. Boxplots comparing the number of relict haplotypes between groups of genotypes.

S13 Fig. Boxplots comparing vegetative, phenological, and reproductive traits between groups of genotypes.

S14 Fig. Correlation circle of a dual multiple factor analysis (DMFA) with a “F2s (blue) vs. wild accessions (red)” contrast as grouping variable.

Acknowledgments

References

Wild accessions occupy a smaller phenotypic space than recombinant F₂s, but present phenotypic outliers

S2 Table. Accuracy of trait imputation for the 104 G₀ accessions for which phenotypes had not been measured directly.

S1 Fig. Jaccard’s index distribution for the comparison between hypervolumes obtained for F₂s and wild accessions.

S2 Fig. Trait variation within the phenotypic spaces of F₂s (blue) and wild accessions (red) of Arabidopsis thaliana, based on nine hypervolume recalculations.

S3 Fig. Hypervolume sizes for F₂s (blue) versus wild accessions (red) when excluding flowering time from the analyses.

S4 Fig. Trait variation within the phenotypic spaces of F₂s (blue) and wild accessions (red) of Arabidopsis thaliana, based on 10 hypervolume calculations without flowering time.

S5 Fig. Correlation between the whole-genome genetic diversity (π) of the 254 parental (G₀) accessions of the F₂s and the other 459 accessions analyzed in this study.

S6 Fig. Hypervolume sizes for F₂s (blue) versus wild accessions (red) when analyzing only the 254 G₀ accessions that were directly phenotyped in this study.

S7 Fig. Trait variation within the phenotypic spaces of F₂s (blue) and the 254 G₀ accessions (red) of Arabidopsis thaliana, based on 10 hypervolume calculations.

S8 Fig. Hypervolume sizes for F₂s (blue) versus wild accessions (red) when including in the analyses imputed phenotypes from 104 G₀ accessions lacking direct measurements.

S9 Fig. Trait variation within the phenotypic spaces of F₂s (blue) and wild accessions (red) of Arabidopsis thaliana, based on 10 hypervolume calculations including imputed phenotypes from 104 G₀ accessions lacking direct measurements.

S10 Fig. Manhattan plots showing SNPs associated with each analyzed trait in the 254 G₀ accessions.

S14 Fig. Correlation circle of a dual multiple factor analysis (DMFA) with a “F₂s (blue) vs. wild accessions (red)” contrast as grouping variable.