Registered Report Protocol
25 Aug 2022: Ohlert T, Kimmel K, Avolio M, Chang C, Forrestel E, et al. (2022) Exploring the impact of trait number and type on functional diversity metrics in real-world ecosystems. PLOS ONE 17(8): e0272791. https://doi.org/10.1371/journal.pone.0272791 View registered report protocol
Figures
Abstract
The use of trait-based approaches to understand ecological communities has increased in the past two decades because of their promise to preserve more information about community structure than taxonomic methods and their potential to connect community responses to subsequent effects of ecosystem functioning. Though trait-based approaches are a powerful tool for describing ecological communities, many important properties of commonly-used trait metrics remain unexamined. Previous work with simulated communities and trait distributions shows sensitivity of functional diversity measures to the number and correlation of traits used to calculate them, but these relationships have yet to be studied in actual plant communities with a realistic distribution of trait values, ecologically meaningful covariation of traits, and a realistic number of traits available for analysis. To address this gap, we used data from six grassland plant communities in Minnesota and New Mexico, USA to test how the number of traits and the correlation between traits used in the calculation of eight functional diversity indices impact the magnitude of functional diversity metrics in real plant communities. We found that most metrics were sensitive to the number of traits used to calculate them, but functional dispersion (FDis), kernel density estimation dispersion (KDE dispersion), and Rao’s quadratic entropy (Rao’s Q) maintained consistent rankings of communities across the range of trait numbers. Despite sensitivity of metrics to trait correlation, there was no consistent pattern between communities as to how metrics were affected by the correlation of traits used to calculate them. We recommend that future use of evenness metrics include sensitivity analyses to ensure results are robust to the number of traits used to calculate them. In addition, we recommend use of FDis, KDE dispersion, and Rao’s Q when ecologically applicable due to their ability to produce consistent rankings among communities across a range of the numbers of traits used to calculate them.
Citation: Ohlert T, Kimmel K, Avolio M, Chang C, Forrestel E, Gerstner BP, et al. (2024) The impact of trait number and correlation on functional diversity metrics in real-world ecosystems. PLoS ONE 19(9): e0306342. https://doi.org/10.1371/journal.pone.0306342
Editor: Francesco Boscutti, University of Udine: Universita degli Studi di Udine, ITALY
Received: January 26, 2024; Accepted: June 11, 2024; Published: September 23, 2024
Copyright: © 2024 Ohlert et al. This is an open access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited.
Data Availability: All data and scripts may be found at https://github.com/kaitkimmel/FDiv/tree/master/R.
Funding: NSF DEB-1257965 (Kenneth D. Whitney); NSF DBI- 1725683, NSF DEB-1753859, NSF DEB- 1831944 (Sarah E. Hobbie); NSF DBI- 1725683, NSF DEB-1753859, NSF-DBI-2021898 (Peter B. Reich), NSF DEB-0841917 (Elisabeth Forrestel). The funders had no role in study design, data collection and analysis, decision to publish, or preparation of the manuscript.
Competing interests: The authors have declared that no competing interests exist.
Introduction
Trait-based diversity measures have advanced the field of community ecology by increasing our understanding of both community assembly and diversity impacts on ecosystem functions [1, 2]. Functional diversity metrics allow researchers to quantify multiple facets of diversity, place an emphasis on mechanisms of community assembly, and provide a ‘common currency’ by which communities can be compared across sites and ecosystems [3, 4].Traditional measures for characterizing communities, such as species richness and species ordinations, use species’ taxonomic classifications as discrete units, but functional diversity metrics can preserve more information about community assembly and function by including traits of species organized on continuous axes [5, 6].
Several aspects of functional and taxonomic diversity have been extensively studied. Scientists have probed functional diversity’s correlation with species richness [7, 8] and ecosystem functioning [4], the importance of intraspecific trait variation for diversity [4, 9, 10], and the ecological hypotheses that functional diversity metrics can test, such as optimal strategies or functional turnover [6, 11]. Many taxonomic measures of community diversity have been extensively studied for their mathematical properties to allow these metrics to be comparable across sites and ecosystems, such as Shannon’s diversity and Simpson’s evenness that have mathematical characteristics linked to species number [12, 13]. Similarly, functional diversity metrics have mathematical characteristics that may cause the number or type of traits used to calculate the metric to impact the measure. For example, multidimensional metrics are calculated with additional dimensions for each additional trait included, and the correlation between traits affects the importance of each dimension to the metric [14]. Therefore, functional diversity could differ among replicate plots or sites simply because of the number or types of traits used to calculate the metric without any underlying ecological basis. Though single-trait indices are an effective tool for linking trait diversity to specific ecosystem processes [15, 16], indices based on multiple traits may better match ecological theories of community assembly around multidimensional niche space [17–19]. As use of multi-trait functional diversity increases, it is important to determine the conditions under which they reflect ecological processes as opposed to mathematical patterns.
Studies using simulated communities have tested whether the number and correlation of traits used in functional diversity metrics can impact the magnitude of the metric [7, 20]. Using simulated data, Legras et al. [20] showed that functional richness and functional divergence metrics decreased with increased trait number, but functional evenness metrics were not responsive to increasing trait numbers. Also using simulated data, Cornwell et al. [7] showed that convex hull volume (commonly referred to as “functional richness”) tended to decrease with increasing correlation among traits included in the metric calculation, and that the decrease was greater in more species-rich communities. The limitations of functional diversity metrics described in these studies with simulated community data could be exacerbated when applied in natural communities. Calculating functional diversity measures in natural communities poses additional challenges both ecological and practical. Real plant communities are non-random assemblages of species which are influenced by competitive interactions, coexistence, mutualisms, niche partitioning, and environmental filtering among many other processes of community assembly [21–26]. Functional diversity metrics are likely to exhibit patterns due to ecologically meaningful correlation of traits in real communities, in particular, among suites of traits typically used in community ecology such as the leaf economic spectrum and root economic spectrum [27, 28]. Moreover, real data collection introduces constraints on trait data, such as realistic numbers of traits collected given limited resources and missing trait data, particularly for rare species. Functional diversity metrics, therefore, are most often calculated with fewer traits and fewer species than those in studies based on simulated communities.
The field lacks clear guidelines for researchers to follow when choosing the number and types of traits to include when calculating functional diversity metrics. Decisions are often based on researcher intuition and the practices of similar studies, but such intuition and interpretation of trait selection can be improved by rigorous exploration of the impact of trait selection on diversity metrics [4, 29, 30]. These decisions can fall along a spectrum of options ranging from selecting the minimum number of traits needed to calculate a metric to using every trait available. For example, some studies suggest that researchers use a small number of traits related to certain ecosystem properties or other topics of interest (e.g., [8]), regardless of how correlated they may be. Other studies use all available traits in order to maximize the dimensions of diversity being studied in an effort to comprehensively assess the niche space that species and communities occupy (e.g., [31]). Choosing traits that are highly correlated can result in an underrepresentation of the diversity of functions present by overemphasizing groups of traits which describe similar processes, such as traits involved in the leaf economics spectrum [32]. Further, functional diversity metric calculation in high dimensional space can require dimensionality reduction–another decision that can impact the metrics. However, few studies scrutinize how these decisions can impact conclusions when using functional diversity metrics to characterize communities.
Here, we aimed to understand how the number of traits and correlation between traits impact functional diversity values. We focused on eight measures of functional diversity that express principal facets of community trait composition (see Table 1 for more details on each metric): functional richness (FRich), functional evenness (FEve), functional divergence (FDiv), functional dispersion (FDis), Rao’s quadratic entropy (Rao’s Q), kernel density estimation (KDE) richness, KDE evenness, and KDE dispersion [33–35]. We used trait data from real (natural/intact and experimental) plant communities, which allowed us to understand how these metrics respond to a realistic spread of traits and species richness. In this study, we used trait data collected from six U.S. grassland communities at two sites to test impacts of trait number and identity in functional diversity metric values. Our dataset included plant traits collected on location at these sites that include both naturally assembled and planted communities.
Specifically, we asked:
(1) Do functional diversity metrics exhibit specific patterns with respect to the number and correlation of traits used? Based on findings from [20], we expect functional richness, KDE richness, functional dispersion, and functional divergence to decrease with increasing numbers of traits, but for Rao’s Q to increase [36] and functional evenness to be unresponsive to the number of traits. We do not have a priori hypotheses for KDE evenness and KDE dispersion since properties of these metrics have yet to be explicitly studied. Based on [7], we expect that functional richness will be greater when traits are less correlated. However, we do not have directional hypotheses with respect to effects of trait correlation on the rest of the metrics.
(2) Is metric sensitivity to trait number/type consistent across sites and experiments? If metric sensitivity is consistent across sites, it will be easier to standardize functional diversity metrics across different studies. If sensitivity is not consistent across sites, further caution will be needed in interpreting cross-site comparisons of functional diversity.
Methods
We performed methods as described in the Registered Report Protocol [37]. Alterations to the protocol are explained in Table 2 and the following methods represent only those methods performed for this study.
Site descriptions
Here we used data from six communities in two United States grasslands that span a range of species diversity. Two communities were from a site with natural species assemblages and four communities were from a site with planted species assemblages in order to be representative of the state of grassland studies where some use naturally assembled communities while others use planted communities. Cedar Creek Ecosystem Science Reserve (CDR; East Bethel, Minnesota, USA; latitude = 45.4, longitude = -93.2) is in central Minnesota and classified as a tallgrass prairie. According to Koppen and Geiger classification, the climate is characterized as cold continental with hot summer, but without a dry season [38]. The mean growing season (May–August) precipitation is approximately 420 mm, mean minimum growing season temperature is 12°C, and mean maximum growing season temperature is 25°C (1982–2016 period; http://www.cedarcreek.umn.edu/research/data). Soils at Cedar Creek are characterized as nutrient-poor entisols derived from a glacial outwash sand plain [38]. The study from Cedar Creek consists of artificially planted communities. The Sevilleta National Wildlife Refuge (SEV) is in central New Mexico at the northern edge of the Chihuahuan Desert (latitude = 34.4, longitude = -106.7). The Sevilleta includes desert grasslands, and the climate is characterized as cold semi-arid according to the Koppen and Geiger classification [38]. The growing season is characterized by two rainy periods (March—May and July—September) split by a dry period. The mean monsoon growing season precipitation is approximately 150 mm and the mean monsoon growing season temperature is 22°C.
Community composition data
We used community composition data from two communities at the Sevilleta and four at Cedar Creek (n = 6 communities total) collected within a single year (2018 for Sevilleta, 2020 for Cedar Creek) to characterize the functional diversity of grassland plant communities.
At Cedar Creek, we used community composition data from all 16-species plots in a biodiversity, CO2, and nitrogen addition experiment (BioCON, n = 48; 12 plots for each CO2-N combination). All 16-species plots were originally planted with the same mixture of species (Achillea millefolium, Amorpha canescens, Andropogon gerardii, Anemone cylindrica, Asclepias tuberosa, Bouteloua gracilis, Bromus inermis, Elymus repens, Koeleria cristata, Lespedeza capitata, Lupinus perennis, Petalostemum villosum, Poa pratensis, Schizachyrium scoparium, Solidago rigida, and Sorghastrum nutans) such that all species were seeded at the same density in 1997. Plots were weeded every year to remove invading species. Through time, the plots can lose species (and regain those) but could never gain new species. Further, species abundances shifted from the equal proportion planted in the first year. Every August, species abundances were visually estimated in a 1 m2 permanent plot.
At Sevilleta, we used community composition data from two observational sites, one in a Great Plains grassland ecosystem and the other in a desert grassland ecosystem. The Great Plains grassland is dominated by Bouteloua gracilis (blue grama), a long-lived, caespitose, C4 perennial grass common throughout much of the United States and Canada. The desert grassland is dominated by B. eriopoda (black grama), a stoloniferous C4 perennial grass common in the southwestern United States and Mexico. These two dominant perennial grasses account for about 80% of vegetative cover in their respective ecosystems. Each site has 28 1 m2 quadrats which were sampled in September of 2018, at the peak of the post-monsoon growing season. In each quadrat, plants were identified to species and their percent ground cover was visually estimated.
Trait data
Trait data were collected on individuals found at each of the different sites. Thus, our trait data are representative of the traits actually found in the given community and not just an average independent of location. Traits include measurements from leaves (e.g. specific leaf area), stems (e.g. stem dry matter content), roots (e.g. root dry matter content), whole-plant (e.g. height), and ecological attributes (e.g. amount of nitrogen in monoculture). Including traits across these measurement categories provides a more-complete representation of community assemblages [39–42]. For detailed descriptions of trait collection protocols at each site, see the S1 and S2 Figs.
At Cedar Creek, we used trait data collected in the monoculture plots of the BioCON experiment that correspond to the CO2 and N treatments to match with 16-species community plots. Trait data were collected between 1998 and 2020. Some traits were collected over multiple years whereas others were only collected once. In total, there were 9 distinct traits: specific leaf area (SLA), I* (the amount of light at the soil surface in monoculture), R* (the amount of nitrogen in monoculture), root %C, root %N, total root biomass, shoot %N, shoot %C, and seed mass.
At Sevilleta, we used trait data collected from 2017–2021 on individuals growing under ambient conditions near permanent ambient plots used to monitor plant communities. The full suite of traits were often measured on the same individuals, up to 10 individuals per species. In total there were 10 distinct traits: maximum plant height, leaf dry matter content, specific leaf area, d15N, d13C, leaf %N, leaf %C, stem dry matter content, root dry matter content, and photosynthetic pathway.
For each trait at each site, we calculated an average trait value based on all the measurements for the given species and trait. We acknowledge that this obscures variation within a given trait (intraspecific variation) for a species; such variation can be quite important for some questions [43–46]. The impacts of intraspecific variation in this study are minimized by only using trait values collected at each site, but sufficient data were not collected for each trait of each species to include intraspecific variation into our analysis. Before analysis, we removed species that had less than 100% trait coverage. We made sure that the communities were still represented by at least 80% of species abundance–this approach de-emphasizes the importance of rare species, but is a logistical constraint faced by many researchers doing trait analyses. This ensured that we represented the community to the best of our ability with the given trait data.
Brief background on functional diversity metrics
We focused our analyses on eight common functional diversity metrics: functional richness (FRich) [8], functional evenness (FEve) [9], functional dispersion (FDis), functional divergence (FDiv), Rao’s quadratic entropy (Rao’s Q), kernel density estimation (KDE) richness, KDE evenness, and KDE dispersion [34]. FRich is the multidimensional equivalent of a range [8]. It is calculated as the convex hull volume that is made from all trait values for up to n traits in the community. The number of dimensions used to calculate the final volume can be reduced from the total trait number [45]. FEve is the minimum spanning tree to quantify the regularity of branch lengths and the evenness in trait relative abundances. For each branch, l, of the minimum spanning tree, the weighted evenness (EW) is calculated as where i and j are species, and wi is the relative abundance of species i. Then, the partial weighted evenness (PEW) is calculated for each branch as
, where S is the total number of species in the community. FEve is then defined as
[8]. FDis is the weighted mean distance between species and a weighted-centroid. It is calculated as
where aj is the relative abundance of species j and zj is the distance species j is from the weighted centroid [34]. FDiv is a relative abundance-weighted spread of traits along a trait axis independent of functional richness and is calculated as
where
is the mean distance of species to the weighted-centroid and Δd is the sum of relative abundance-weighted deviances from the weighted-centroid [9]. Rao’s Q measures the pairwise differences in traits between species in a community and is calculated as
dijpi where S is the number of species in the community, dij is the functional difference between the i-th and j-th species, and p is a vector of relative abundance values [46]. These five functional diversity metrics commonly incorporate distance measures by reducing dimensionality using principal coordinates analysis (PCoA) to return PCoA axes which are used to calculate the functional diversity metrics. However, we will avoid this dimensionality reduction for all metrics except FRich, see discussion in Functional Diversity Calculations section. n-dimensional hypervolumes use Gaussian kernel density estimation (KDE) to create a relative abundance-weighted probability distribution of traits in multidimensional space [35]. KDE richness is the total volume of the n-dimensional hypervolume created from unweighted trait values present in the community. KDE evenness is the overlap between the abundance-weighted n-dimensional hypervolume and a similar hypervolume in which all traits and abundances are distributed evenly. KDE dispersion is the average distance between random points within the n-dimensional hypervolume and the hypervolume centroid.
Functional diversity calculations
For each site, we followed the same protocol for calculating functional diversity metrics. We calculated FRich, FEve, and FDis, FDiv, and Rao’s Q using the ‘FD’ package in R [46] using both Gower and Euclidean dissimilarity as the distance measure, along with using the hypervolume package in R to calculate KDE n-dimensional hypervolumes which are passed to the ‘bat’ package to create KDE richness, KDE evenness, and KDE dispersion [35, 48]. Functional diversity metrics from the ‘FD’ package and kernel density estimation are among the most-used metrics for quantifying trait-based diversity within communities due to both ease of use and ecological relevance [35, 45]. To understand the impact of trait number on functional diversity, each functional diversity metric was calculated using all possible combinations of two traits up to all possible combinations of the maximum number of traits at each site. For example, at Sevilleta there are 10 different traits so there are 45 2-trait calculations, 120 3-trait calculations, 210 4-trait calculations, and so forth up to 10 9-trait calculations and 1 10-trait calculation. This allows us to focus on the impact of trait number independent of the constituent set of traits used to calculate the metric.
To calculate the five metrics using the ‘FD’ package, we first calculated a species-trait distance matrix using both Gower (categorical and continuous traits) and Euclidean (continuous traits only) distances. These distance matrices were calculated with both scaled and centered and non-scaled trait data for each community. Centering was done by subtracting the trait mean from each observation and scaling was done by dividing the centered traits by their standard deviations (as in the ‘FD’ package). These distance matrices along with a species-abundance matrix are the input for the ‘FD’ package. The ‘FD’ package performs a principal components analysis on the full species-trait distance matrix. Dimensionality reduction only occurs for FRic and FDiv metric calculation. For all FRich and FDiv analyses, we hold the number of dimensions equal to 2, similar to Legras et al. [20]. Because some communities only had two species, we did not perform a sensitivity analysis to look at how increased dimensionality impacted our results since these species depauperate communities would be excluded. Further, when running the analyses for FRic, FEve, and FDiv, one plot from each Sevilleta community with only one species present was removed because FRic, FEve, and FDiv are undefined in monoculture communities. For calculation of KDE metrics, a species-abundance matrix and a species-trait matrix were loaded for each of the six communities while the distance matrix was set to either Gower or Euclidean depending on the calculation being performed. To measure the effects of trait correlation on functional diversity, we focused on metrics calculated with four traits only to standardize between sites. We calculated the minimum, maximum, and mean correlation between the traits at each community. Only combinations of four traits were used as a balance between reduction of noise in the evaluation of minimum and maximum correlation (calculated as the pairwise correlation of just two traits) and the lower end of the number of traits likely to be used to calculate these metrics in the literature.
Statistical analyses
For each community separately, we ran mixed effects models to test the dependence of the eight functional trait metrics on trait number and on trait correlation using the lme function from the ‘nlme’ package in R [48]. To examine how trait number impacts the values of a given functional trait metric, we ran the model Metric ~ trait number for 2–10 unique traits. To examine how trait-trait correlation impacts the values of a given functional diversity metric, calculated three metrics of trait correlation for each unique combination of four traits: 1) min trait correlation is the minimum two-trait correlation among the set of four traits, 2) max trait correlation is the maximum two-trait correlation among the set of four traits, and 3) mean trait correlation is the average of all trait-trait correlations among the set of four traits. Next, we ran three models for each community: Metric ~ min trait correlation, Metric ~ max trait correlation, Metric ~ mean trait correlation. We explored which functional form of the predictor variables best fit the spread of the functional metric data by fitting null (e.g., intercept only), linear, and quadratic fits. We selected models based on best fit using AIC values. We accounted for repeated samples within plots by fitting plot as a random effect and using an autoregressive correlation structure. Raw model outputs for the total 1,152 models are generated by scripts n_traitStats_simplified.R, n_traitStats_euc_simplified.R, max_corrStats_simplified.R, max_corrStats_euc_simplified.R,min_corrStats_simplified.R, min_corrStats_euc_simplified.R, mean_corrStats_simplified.R, mean_corrStats_euc_simplified.R’ in this GitHub repository <https://github.com/kaitkimmel/FDiv/tree/master/R>.
Results
Data processing summary
Cedar Creek and Sevilleta had adequate trait coverage to proceed with the analyses. At Cedar Creek, all plots from all communities had 100% trait coverage. Overall, we included data from four different communities (defined as different nitrogen fertilization and carbon dioxide enrichment treatments) each with 16 total plots. At Sevilleta, the minimum plot-level trait coverage from the ‘Blue grama’ communities was 79.96%. We included all the plots as 79.96% was close to the 80% trait-coverage threshold. Thus, we had 28 individual plots within this community. The minimum plot-level trait coverage from the ‘Black grama’ communities at Sevilleta was 77.72%. We removed this one plot and had a total of 27 plot from the ‘Black grama’ community.
Sensitivity of functional diversity metrics to trait number
FDis calculated with Gower dissimilarity was insensitive to the number of traits used to calculate it across all six communities (Fig 1I; Table 3). KDE richness, KDE dispersion, and Rao’s Q with Gower dissimilarity were negatively correlated with the number of traits (Fig 1B, 1J and 1M). However, the rankings of communities from low to high values remained consistent. That is, community order was maintained within these three metrics across the range of trait numbers. Similarly, FRich, KDE richness, FDis, KDE dispersion, and Rao’s Q calculated with Euclidean dissimilarity maintained rankings among communities throughout the range of trait numbers, though these metrics all increased with the number of traits (Fig 1C, 1D, 1K, 1L and 1O; Table 3). For both Gower and Euclidean dissimilarity, FEve, KDE evenness, and FDiv had different rankings among communities depending upon the number of traits used to calculate them.
Each point represents the mean value for the given community for a certain number of traits used to calculate the metric. Solid lines are the predicted fits of the best model and shaded regions are +/- SE of the predicted fit. Different colors represent the six communities used in this study (four experimental communities at Cedar Creek Ecosystem Science Reserve, CDR, and two natural communities at Sevilleta National Wildlife Refuge, SEV). N = 6,024 observations for each Cedar Creek community (432 2-trait combinations, 1,008 3-trait combinations, 1,512 4-trait, 1,512 5 trait combinations, 1,008 6-trait combinations, 432 7-trait combinations, 108 8-trait combinations, 1 9-trait combination); n = 27,351 observations for SEV1; n = 28,364 observations for SEV2.
For six communities, each of eight metrics was calculated across a range of the four predictor variables using both Gower and Euclidean dissimilarity matrices. Three functional forms of models were tested: intercept only, linear, and quadratic. In this table, counts underneath those functional form columns display the number of communities (out of six) for which that functional form was the best model. For example, FRich predicted by trait number and calculated with Gower dissimilarity was best predicted by a linear model for three communities and best predicted by a quadratic model for three communities. In total, this table summarizes the results of 384 models.
Sensitivity of functional diversity metrics to trait correlations
Trait correlations (mean, maximum, and minimum trait-trait correlation for combinations of four traits) had limited power to predict the calculated metrics for both dissimilarity matrices and the relationships between metrics and trait correlation varied widely among communities (Fig 2, S1 and S2 Figs). Consequently, we observed inconsistent rankings among communities with respect to trait correlation (Fig 2, S1 and S2 Figs). For example, for FRich calculated with an Euclidean matrix at maximum correlation of 0.40, FRich of the CDR2 community was greater than that of CDR1, whereas at maximum correlation of 0.75, the reverse was true, FRich of CDR1 was greater than that of CDR2 (Fig 2C). Such reorganizations of community rankings were common across functional diversity metrics, distance matrices, and correlation metrics (max, min, mean) (Fig 2). In some cases, metrics were not responsive to trait correlations (e.g., null models were the best fit for 19% of all max correlation models, 20% of min correlation models, 18% of mean correlation models).
Each point represents the mean value for the given community at that correlation. Solid lines are the predicted fits of the best model and shaded regions are +/- SE of the predicted fit. Different colors represent the six communities used in this study (four experimental communities at Cedar Creek Ecosystem Science Reserve, CDR, and two natural communities at Sevilleta National Wildlife Refuge, SEV). N = 1,512 observations for each CDR community; n = 3,402 for SEV1; n = 3,528 for SEV2.
Distance matrices
Overall, metrics calculated with Gower and Euclidean distance matrices maintained consistent results with respect to the ranking of the six communities (four experimental planted communities from Cedar Creek and two natural communities at Sevilleta) for different metrics. The Euclidean distance matrix tended to amplify the differences among communities as the numbers of traits increased for KDE richness, FDis, Rao’s Q, and KDE dispersion. Certain functional diversity metrics were poor at maintaining the ranking of communities across ranges of the number and correlation of traits (e.g. FEve; Figs 1E, 1G, 2E and 2G). However, for instances in which the ranking of communities changed, neither Gower nor Euclidean distance improved the issues.
Discussion
Sensitivity to the number of traits
Our study aimed to understand the sensitivity of functional diversity metrics to trait inputs. Here, we found that FDis had consistent values when calculated with Gower dissimilarity such that, across all communities, there was no correlation with the number of traits used (e.g., it remained consistent regardless of trait number). This suggests that FDis provides reliable values across different communities and sets of traits, making it a potentially valuable tool for assessing patterns of functional diversity across communities and ecosystems. Similarly, Rao’s Q, KDE dispersion, and KDE richness maintained consistent ordered rankings of metrics among communities across the range of trait numbers for both Gower and Euclidean dissimilarity matrices along with FDis calculated with Euclidean dissimilarity. Other metrics (FEve, KDE evenness, FDiv) had less consistency across the range of traits used to calculate them as relative rankings of different communities changed with the number of traits used in constructing the metrics.
A previous simulation study found no sensitivity to the number of traits for FEve and FDiv and magnitude decreases with increasing trait number for FRich, KDE richness, FDis, and Rao’s Q calculated with Gower dissimilarity [20]. Our results were consistent with Legras et al. [20] for Rao’s Q and KDE richness, but we found FDis unresponsive to the number of traits and FDiv, FEve, and KDE evenness to have inconsistent slopes among communities. These differences in findings might be attributed to the inherent complexity and noise present in real-world data. Real communities often contain anomalous species with outlier trait values (e.g. a gymnosperm among angiosperms, a tree seedling among herbaceous plants, or other rare species outlier values), which can exert considerable influence on evenness indices.
We found further discrepancies with previous studies reporting results using Euclidean dissimilarity. Previous studies found no sensitivity of FEve and FDiv [20, 36], and increases with increasing trait number for KDE richness, Rao’s Q [35], and FRich [20] when calculated with Euclidean dissimilarity. Our results of increasing FDis with the number of traits matches Legras et al. [20] while Zhang et al. [36] reported no sensitivity. A potential explanation for this discrepancy is the difference in the number of traits considered. Our study and the simulated data in Legras et al. [20] were limited to a maximum of 10 traits while Zhang et al. [36] used a maximum of 34 traits. While there may indeed be no sensitivity of FDis calculated with Euclidean dissimilarity at numbers of traits as high as 34, studies employing these metrics more often use fewer than 10 traits, within the range in which we found sensitivity.
Some metrics may be unreliable measures for comparing functional diversity among communities since comparisons are dependent upon the number of traits used to calculate them. Specifically, FRich, FEve, and KDE evenness showed crossing slopes among communities (i.e. ranking of communities changed with the number of traits) for both Gower and Euclidean distance matrices. The inconsistency of communities’ relationships to one another across the range of the number of traits raises concerns. For example, community CDR4 had a greater FEve than community SEV1 when using four traits, but CDR4 had a smaller FEve than SEV1 when calculated with eight traits. Such discrepancies have the potential to introduce discordant results in the literature, even when otherwise identical studies have been conducted. This is particularly concerning given the often-arbitrary nature of selecting the number of traits used in a study. The number of traits is often dictated by the resources available to collect data or the completeness of publicly available data [49, 50]. Therefore, additional analyses are warranted when using these metrics to ensure that results are not merely an artifact of the number of traits used to calculate the metrics. Interestingly, the more consistent indices—such as KDE alpha, FDis, Rao’s Q, and KDE dispersion—measure similar ecological properties (e.g., the range of traits expressed in the community) as FRich [6, 33, 35] and therefore, these indices could be substituted for FRich in analyses. However, evenness metrics, both FEve and KDE evenness, quantify a different type of ecological property, the relative homogeneity of traits within a community [35]. These indices do not have obvious substitutes for measuring these ecological properties among the metrics studied here.
Trait correlation concerns in calculating metrics: Much ado about nothing?
We found inconsistent and null relationships between metrics and the correlation of traits used for their calculations. Despite suggestions in the literature that trait selection should minimize correlation [35], lower levels of trait-trait correlation did not result in more or less clear comparisons among communities. Though nonlinearity was prevalent in our analysis, most trait metrics demonstrated similarity across the entire range of maximum trait-trait correlation (i.e. the range of values was relatively small). Lefcheck et al. [29] utilized simulated data and reported insensitivity of Rao’s Q, FEve, and FDis to trait correlation except at very high levels of correlation (Pearson’s |R| > 0.95). However, they observed that FRich and FDiv decreased with trait correlation—a trend that was not evident in our study. Additionally, Lefcheck et al. [29] noted that sensitivity to trait correlation became most apparent for FDiv and FRich when larger numbers of traits were considered, whereas we only tested combinations of four traits. Notably, Mammola and Cardoso [35] recommended avoiding the use of highly correlated traits (Pearson |r| > = 0.8) when calculating KDE metrics. Though we did not find substantial support for such a cutoff, our set of collected traits also had very few combinations with correlation above 0.8. Overall, our results show no consistent link between trait correlation and the values of functional diversity metrics.
Gower and Euclidean dissimilarity
Both Gower and Euclidean distances performed similarly, though diversity metric values varied more with the number of traits under Euclidean distance. Metrics that preserved rankings among communities across the trait number and correlation gradients did so with both Gower and Euclidean matrices. The primary difference driving the use of these distance matrices in the literature is that unlike Euclidean distance, Gower distance can conveniently accommodate categorical data. Categorical traits, such as photosynthetic pathway, growth form, and nitrogen fixation capacity, are often easy to collect and more reliably scored than continuous traits. Moreover, trait databases typically have a great deal of missing data [50, 51] and categorical traits are more reliably gap-filled ad-hoc (e.g. growth form may be determined from a picture or nitrogen fixation capacity pulled from literature) than continuous traits. Given that both distance matrices performed similarly across our broad range of functional diversity indices, trait correlations, and trait numbers, there is no clear reason to favor use of a particular matrix other than the ability of Gower matrices to include categorical trait types.
Ecological significance of methodology and recommendations
Perhaps most important is to center ecological significance and interpretation when choosing traits and metrics. Though many metrics produced consistent results with respect to the rankings of communities from two to ten traits, there are many sensible reasons to include more than 2 traits in order to capture more dimensions of diversity [43, 52]. Similarly, though we found no obvious difference of results when including highly correlated traits, this may not be license to include the maximum number of traits available in all circumstances. One example of responsible use of traits is found in Griffin-Nolan et al. [53] in which traits were used to assess plant community responses to drought. In this case, the analyses focused solely on hydrological traits, ignoring many other traits commonly used in the literature (e.g. seed mass), but the authors correctly emphasized the importance of choosing only traits relevant to the particular treatments and plant functions of interest. While use of the maximum number of traits available may be justifiable when seeking to quantify diversity defined broadly, we discourage inclusion of traits without ecological rationale.
Based on our findings, we recommend use of FDis, KDE dispersion, and/or Rao’s Q in analyses of functional diversity as all of these measures provide consistent results among communities at all numbers of traits tested. Additionally, due to the inconsistency of evenness metrics with respect to community rankings, we strongly recommend that any use of FEve or KDE evenness metrics include supplemental analyses to test whether results are consistent with different numbers of traits used to calculate them. Surprisingly, we found no rationale to favor a particular distance matrix; we simply suggest that the number of traits used or correlation of traits need not be a consideration when choosing between Gower and Euclidean dissimilarity matrices. While functional diversity indices enrich the toolbox for exploring trait-based plant diversity, it remains important to ensure that our findings and inferences are rooted primarily in ecological principles rather than being solely reflective of the metrics employed in assessing functional diversity.
Supporting information
S1 Fig. The relationship between mean trait-trait correlation for each set of 4 traits and functional diversity metrics using both Gower (columns 1 & 2) and Euclidean (columns 3 & 4) dissimilarity matrices.
Each point represents the mean value for the given community for a specific number of traits. Solid lines are the predicted fits of the best model and shaded regions are +/- SE of the predicted fit. Different colors represent the six communities used in this study (four experimental communities at Cedar Creek Ecosystem Science Reserve, CDR and two natural communities at Sevilleta National Wildlife Refuge, SEV). N = 1,512 observations for each CDR community; n = 3,402 for SEV1; n = 3,528 for SEV2.
https://doi.org/10.1371/journal.pone.0306342.s001
(PDF)
S2 Fig. The relationship between the minimum trait-trait correlation for each set of 4 traits and functional diversity metrics using both Gower (columns 1 & 2) and Euclidean (columns 3 & 4) dissimilarity matrices.
Each point represents the mean value for the given community at that correlation. Solid lines are the predicted fits of the best model and shaded regions are +/- SE of the predicted fit. Different colors represent the six communities used in this study (four experimental communities at Cedar Creek Ecosystem Science Reserve, CDR, and two natural communities at Sevilleta National Wildlife Refuge, SEV). N = 1,512 observations for each CDR community; n = 3,402 for SEV1; n = 3,528 for SEV2.
https://doi.org/10.1371/journal.pone.0306342.s002
(PDF)
Acknowledgments
Thank you to two anonymous reviewers for providing feedback that improved the quality of this research.
References
- 1. Suding KN, Goldstein LJ. Testing the Holy Grail framework: using functional traits to predict ecosystem change. New Phytol. 2008;180: 559–562. pmid:19138225
- 2. Griffin-Nolan RJ, Bushey JA, Carroll CJW, Challis A, Chieppa J, Garbowski M, et al. Trait selection and community weighting are key to understanding ecosystem responses to changing precipitation regimes. Funct Ecol. 2018;32: 1746–1756.
- 3. Lavorel S, Garnier E. Predicting changes in community composition and ecosystem functioning from plant traits: revisiting the Holy Grail. Funct Ecol. 2002;16: 545–556.
- 4. Funk JL, Larson JE, Ames GM, Butterfield BJ, Cavender-Bares J, Firn J, et al. Revisiting the Holy Grail: Using plant functional traits to predict ecological processes. Biol Rev. 2016. pmid:27103505
- 5. Ackerly DD, Cornwell WK. A trait-based approach to community assembly: Partitioning of species trait values into within- and among-community components. Ecol Lett. 2007;10: 135–145. pmid:17257101
- 6. Boersma KS, Dee LE, Miller SJ, Bogan MT, Lytle DA, Gitelman AI. Linking multidimensional functional diversity to quantitative methods: a graphical hypothesis-evaluation framework. Ecology. 2016;97: 583–593. pmid:27197386
- 7. Cornwell WK, Schwilk DW, Ackerly DD. A trait-based test for habitat filtering: Convex hull volume. Ecology. 2006;87: 1465–1471. pmid:16869422
- 8. Villéger S, Mason H, Mouillot D. New multidimensional functional diversity indices for a multifacted framework in functional ecology. Ecology. 2008;89: 2290–2301.
- 9. Des Roches S, Post DM, Turley NE, Bailey JK, Hendry AP, Kinnison MT, et al. The ecological importance of intraspecific variation. Nat Ecol Evol. 2018;2: 57–64. pmid:29203921
- 10. Westerband AC, Funk JL, Barton KE. Intraspecific trait variation in plants: a renewed focus on its role in ecological processes. Annals of botany. 2021 Apr 1;127(4):397–410. pmid:33507251
- 11. Muscarella R, Uriarte M. Do community-weighted mean functional traits reflect optimal strategies? Proc R Soc B Biol Sci. 2016;283. pmid:27030412
- 12. Jost L. Partitioning diversity into independent alpha and beta components. Ecology. 2007;88: 2427–2439. pmid:18027744
- 13. Smith B, Wilson JB. A consumer’s guide to evenness indices. Oikos. 1996;76: 70–82.
- 14. Blonder B, Morrow CB, Maitner B, Harris DJ, Lamanna C, Violle C, et al. New approaches for delineating n‐dimensional hypervolumes. Methods in Ecology and Evolution. 2018 Feb;9(2):305–19.
- 15. Butterfield BJ, Suding KN. Single‐trait functional indices outperform multi‐trait indices in linking environmental gradients and ecosystem services in a complex landscape. Journal of Ecology. 2013 Jan;101(1):9–17.
- 16. Bílá K, Moretti M, de Bello F, Dias AT, Pezzatti GB, Van Oosten AR, et al. Disentangling community functional components in a litter‐macrodetritivore model system reveals the predominance of the mass ratio hypothesis. Ecology and evolution. 2014 Feb;4(4):408–16. pmid:24634725
- 17. Hutchinson CE. Concluding remarks, Coldspring Harbor Symposium. Quant. Biol. 1957;22:415–27.
- 18. Spasojevic MJ, Suding KN. Inferring community assembly mechanisms from functional diversity patterns: the importance of multiple assembly processes. Journal of Ecology. 2012 May;100(3):652–61.
- 19.
Chase JM, Leibold MA. Ecological niches: linking classical and contemporary approaches. University of Chicago Press; 2009 Aug 11.
- 20. Legras G, Loiseau N, Gaertner JC, Poggiale JC, Gaertner-Mazouni N. Assessing functional diversity: the influence of the number of the functional traits. Theor Ecol. 2020;13: 117–126.
- 21. Keddy PA. Assembly and response rules: two goals for predictive community ecology. Journal of vegetation science. 1992 Apr;3(2):157–64.
- 22. Tilman D. Niche tradeoffs, neutrality, and community structure: a stochastic theory of resource competition, invasion, and community assembly. Proceedings of the National Academy of Sciences. 2004 Jul 27;101(30):10854–61. pmid:15243158
- 23. Kraft NJ, Adler PB, Godoy O, James EC, Fuller S, Levine JM. Community assembly, coexistence and the environmental filtering metaphor. Functional ecology. 2015 May;29(5):592–9.
- 24. Götzenberger L, de Bello F, Bråthen KA, Davison J, Dubuis A, Guisan A, et al. Ecological assembly rules in plant communities—approaches, patterns and prospects. Biological reviews. 2012 Feb;87(1):111–27. pmid:21692965
- 25. Fukami T. Historical contingency in community assembly: integrating niches, species pools, and priority effects. Annual review of ecology, evolution, and systematics. 2015 Dec 4;46:1–23.
- 26. Li Y, Shipley B, Price JN, Dantas V de L, Tamme R, Westoby M, et al. Habitat filtering determines the functional niche occupancy of plant communities worldwide. J Ecol. 2018;106: 1001–1009.
- 27. Reich PB. The world‐wide ‘fast–slow’ plant economics spectrum: a traits manifesto. Journal of ecology. 2014 Mar;102(2):275–301.
- 28. Kong D, Wang J, Wu H, Valverde-Barrantes OJ, Wang R, Zeng H, et al. Nonlinearity of root trait relationships and the root economics spectrum. Nature Communications. 2019 May 17;10(1):1–9.
- 29. Lefcheck JS, Duffy JE. Multitrophic functional diversity predicts ecosystem functioning in experimental assemblages of estuarine consumers. Ecology. 2015;96: 2973–2983. pmid:27070016
- 30. Poos MS, Walker SC, Jackson DA. Functional-diversity indices can be driven by methodological choices and species richness. Ecology. 2009;90: 341–347. pmid:19323217
- 31. van der Plas F, Schröder-Georgi T, Weigelt A, Barry K, Meyer S, Alzate A, et al. Plant traits alone are poor predictors of ecosystem properties and long-term ecosystem functioning. Nat Ecol Evol. 2020. pmid:33020598
- 32. Osnas JLD, Lichstein JW, Reich PB, Pacala SW. Global leaf trait relationships: Mass, area, and the leaf economics spectrum. Science (80-). 2013;340: 741–744. pmid:23539179
- 33. Mouchet MA, Villéger S, Mason NWH, Mouillot D. Functional diversity measures: An overview of their redundancy and their ability to discriminate community assembly rules. Funct Ecol. 2010;24: 867–876.
- 34. Laliberte E, Legendre P. A distance-based framework for measuring functional diversity from multiple traits. Ecology. 2010;91: 299–305. 26. pmid:20380219
- 35. Mammola S., & Cardoso P. Functional diversity metrics using kernel density n‐dimensional hypervolumes. Methods in Ecology and Evolution. 2020; 11(8), 986–995.
- 36. Zihao Z, Jihua H, Nianpeng H. Predictability of Functional Diversity Depends on the Number of Traits. J Resour Ecol. 2021;12: 332–345.
- 37. Ohlert T, Kimmel K, Avolio M, Chang C, Forrestel E, Gerstner B, Hobbie SE, Komatsu K, Reich P, Whitney K. Exploring the inpact of trait number and tye of functional diversity metrics in real-world ecosystems. Plos One. 2022. 17(8): e0272791.
- 38. Peel MC, Finlayson BL, McMahon TA. Updated world map of the Köppen–Geiger climate classification. Hydrol Earth Syst Sci. 2007;11: 1633–1644.
- 39. Frenette‐Dussault C, Shipley B, Meziane D, Hingrat Y. Trait‐based climate change predictions of plant community structure in arid steppes. Journal of ecology. 2013 Mar;101(2):484–92.
- 40. Biswas SR, Mallik AU. Disturbance effects on species diversity and functional diversity in riparian and upland plant communities. Ecology. 2010 Jan;91(1):28–35. pmid:20380192
- 41. Kimmel K, Dee L, Tilman D, Aubin I, Boenisch G, Catford JA, et al. Chronic fertilization and irrigation gradually and increasingly restructure grassland communities. Ecosphere. 2019 Mar;10(3):e02625.
- 42. Laughlin DC. The intrinsic dimensionality of plant traits and its relevance to community assembly. Journal of Ecology. 2014 Jan;102(1):186–93.
- 43. Siefert A, Violle C, Chalmandrier L, Albert CH, Taudiere A, Fajardo A, et al. A global meta‐analysis of the relative extent of intraspecific trait variation in plant communities. Ecology letters. 2015 Dec;18(12):1406–19. pmid:26415616
- 44. Jung V, Violle C, Mondy C, Hoffmann L, Muller S. Intraspecific variability and trait‐based community assembly. Journal of ecology. 2010 Sep;98(5):1134–40.
- 45. Bolnick DI, Amarasekare P, Araújo MS, Bürger R, Levine JM, Novak M, et al. Why intraspecific trait variation matters in community ecology. Trends in ecology & evolution. 2011 Apr 1;26(4):183–92. pmid:21367482
- 46. Laliberté E, Legendre P, Bill Shipley. Measuring functional diversity (FD) from multiple traits, and other tools for functional ecology. 2015. pp. 1–28.
- 47. Botta‐Dukát Z. Rao’s quadratic entropy as a measure of functional diversity based on multiple traits. Journal of vegetation science. 2005;, 16(5), 533–540.
- 48. Pinheiro J, Bates D, DebRoy S, Sarkar D, R Core Team. nlme: Linear and nonlinear mixed effects models. 2020. p. R package version 3.1–148.
- 49. Schrodt F., Kattge J., Shan H., Fazayeli F., Joswig J., Banerjee A., et al. BHPMF–a hierarchical Bayesian approach to gap-filling and trait prediction for macroecology and functional biogeography. Global Ecology and Biogeography. 2015; 24(12), 1510–1521.
- 50. Májeková M., Paal T., Plowman N. S., Bryndová M., Kasari L., Norberg A., et al. Evaluating Functional Diversity: Missing Trait Data and the Importance of Species Abundance Structure and Data Transformation. PLOS ONE. 2015; 11(2), e0149270.
- 51. Pakeman R. J. Functional trait metrics are sensitive to the completeness of the species’ trait data?. Methods in Ecology and Evolution. 2014; 5(1), 9–15.
- 52. Eklöf A., Jacob U., Kopp J., Bosch J., Castro-Urgal R., Chacoff N. P., et al. The dimensionality of ecological networks. Ecology Letters. 2013; 16(5), 577–583. pmid:23438174
- 53. Griffin‐Nolan RJ, Blumenthal DM, Collins SL, Farkas TE, Hoffman AM, Mueller KE, et al. Shifts in plant functional composition following long‐term drought in grasslands. Journal of Ecology. 2019 Sep;107(5):2133–48.
- 54. de Vries FT, Bardgett RD. Plant community controls on short‐term ecosystem nitrogen retention. New Phytologist. 2016 May;210(3):861–74. pmid:26749302
- 55. de la Riva EG, Violle C, Pérez-Ramos IM, Marañón T, Navarro-Fernández CM, Olmo M, et al. A multidimensional functional trait approach reveals the imprint of environmental stress in Mediterranean woody communities. Ecosystems. 2018 Mar;21(2):248–62.
- 56. Lourenco J Jr, Newman EA, Ventura JA, Milanez CR, Thomaz LD, Wandekoken DT, et al. Soil‐associated drivers of plant traits and functional composition in Atlantic Forest coastal tree communities. Ecosphere. 2021 Jul;12(7):e03629.
- 57. Soares FC, de Lima RF, Palmeirim JM, Cardoso P, Rodrigues AS. Combined effects of bird extinctions and introductions in oceanic islands: Decreased functional diversity despite increased species richness. Global Ecology and Biogeography. 2022 Jun;31(6):1172–83.
- 58. Piano E, Doretto A, Mammola S, Falasco E, Fenoglio S, Bona F. Taxonomic and functional homogenisation of macroinvertebrate communities in recently intermittent Alpine watercourses. Freshwater Biology. 2020 Dec;65(12):2096–107.
- 59. Pavlek M, Mammola S. Niche‐based processes explaining the distributions of closely related subterranean spiders. Journal of Biogeography. 2021 Jan 1;48(1):118–33.
- 60. Bello FD, Lavorel S, Lavergne S, Albert CH, Boulangeat I, Mazel F, et al. Hierarchical effects of environmental filters on the functional structure of plant communities: a case study in the French Alps. Ecography. 2013 Mar;36(3):393–402.
- 61. Niu K, He JS, Zhang S, Lechowicz MJ. Grazing increases functional richness but not functional divergence in Tibetan alpine meadow plant communities. Biodiversity and Conservation. 2016 Nov;25(12):2441–52.
- 62. Biswas SR, Mallik AU, Braithwaite NT, Biswas PL. Effects of disturbance type and microhabitat on species and functional diversity relationship in stream-bank plant communities. Forest ecology and management. 2019 Jan 15;432:812–22.
- 63. Zuo X, Zhao S, Cheng H, Hu Y, Wang S, Yue P, et al. Functional diversity response to geographic and experimental precipitation gradients varies with plant community type. Functional Ecology. 2021 Sep;35(9):2119–32.
- 64. Shovon TA, Rozendaal DM, Gagnon D, Gendron F, Vetter M, Vanderwel MC. Plant communities on nitrogen‐rich soil are less sensitive to soil moisture than plant communities on nitrogen‐poor soil. Journal of Ecology. 2020 Jan;108(1):133–44.
- 65. Jäschke Y, Heberling G, Wesche K. Environmental controls override grazing effects on plant functional traits in Tibetan rangelands. Functional Ecology. 2020 Mar;34(3):747–60.
- 66. Ebeling A, Rzanny M, Lange M, Eisenhauer N, Hertzog LR, Meyer ST, et al. Plant diversity induces shifts in the functional structure and diversity across trophic levels. Oikos. 2018 Feb;127(2):208–19.
- 67. Thakur D, Chawla A. Functional diversity along elevational gradients in the high altitude vegetation of the western Himalaya. Biodiversity and Conservation. 2019 Jul;28(8):1977–96.
- 68. De Bello F, Thuiller W, Lepš J, Choler P, Clément JC, Macek P, et al. Partitioning of functional diversity reveals the scale and extent of trait convergence and divergence. Journal of Vegetation Science. 2009 Jun;20(3):475–86.
- 69. Ebeling A, Pompe S, Baade J, Eisenhauer N, Hillebrand H, Proulx R, et al. A trait-based experimental approach to understand the mechanisms underlying biodiversity–ecosystem functioning relationships. Basic and Applied Ecology. 2014 May 1;15(3):229–40.
- 70. Pillar VD, Blanco CC, Müller SC, Sosinski EE, Joner F, Duarte LD. Functional redundancy and stability in plant communities. Journal of Vegetation Science. 2013 Sep;24(5):963–74.
- 71. Wang C, Jiang K, Liu J, Zhou J, Wu B. Moderate and heavy Solidago canadensis L. invasion are associated with decreased taxonomic diversity but increased functional diversity of plant communities in East China. Ecological Engineering. 2018 Mar 1;112:55–64.
- 72. Greenop A, Woodcock BA, Outhwaite CL, Carvell C, Pywell RF, Mancini F, et al. Patterns of invertebrate functional diversity highlight the vulnerability of ecosystem services over a 45-year period. Current Biology. 2021 Oct 25;31(20):4627–34. pmid:34411527
- 73. Rao CR. Diversity and dissimilarity coefficients: a unified approach. Theoretical population biology. 1982 Feb 1;21(1):24–43.