Figures
Abstract
The R-package BIOdry allows to model and compare fluctuations of Tree-ring Width (TRW) and climate, or dendroclimatic fluctuations, while accounting for source variability. The package eases multilevel modeling and multivariate comparison in dendroclimatic analysis using the nlme and ecodist packages, respectively. For implementing such libraries, the in-package algorithms transform the dendroclimatic fluctuations into Multilevel Dendroclimatic Data Series and maintain categorical variables and time units in the outputs. The dendroclimatic modeling is developed with two functions: modelFrame and muleMan. The first function binds core-level cumulative TRWs to the processed data sets and subtracts trends in TRWs by fitting multilevel log-linear growth formulas or multilevel linear formulas. modelFrame can also model within-group fluctuations in dendroclimatic variables other than tree-radial increments such as aridity indices or allometric components of tree growth: e.g. diameters at breast height over bark, tree basal areas, total tree biomass, among other. The second function compares fluctuations in modelFrame objects that share outermost categorical variable and annual records. Here, we use BIOdry to model dendroclimatic relationships in northern and east-central Spain to illustrate future users in the implementation of the package for modeling ecological relationships in space and time.
Citation: Lara W, Bogino S, Bravo F (2018) Multilevel analysis of dendroclimatic series with the R-package BIOdry. PLoS ONE 13(5): e0196923. https://doi.org/10.1371/journal.pone.0196923
Editor: Ben Bond-Lamberty, Pacific Northwest National Laboratory, UNITED STATES
Received: August 4, 2017; Accepted: April 23, 2018; Published: May 17, 2018
Copyright: © 2018 Lara et al. This is an open access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited.
Data Availability: Code availability: The in-package code is available on https://cran.r-project.org/web/packages/BIOdry/index.html. Data sets processed here are contained in the BIOdry package.
Funding: This research has been supported through funds from the Colombian Government (Colciencias) within the framework of the National Program for Doctoral Studies Abroad (Year 2012, Number 568) to WL (URL: http://legadoweb.colciencias.gov.co/convocatoria/convocatoria-nacional-para-estudios-de-doctorado-en-el-exterior-o-2012-conformaci-n-de-). This study was also financed by the FORMIXING project (AGL2014-51964-C2-1-R) funded by the Spanish Ministry of Economy, Industry and Competitiveness to FB (URL: http://www.mineco.gob.es/). The funders had no role in study design, data collection and analysis, decision to publish, or preparation of the manuscript.
Competing interests: The authors have declared that no competing interests exist.
1 Introduction
Dendrochronological studies date and analyze tree-rings from woody plants to understand past and current environmental processes. Dendroclimatology uses tree rings to study past climate conditions. It is one of the most prominent dendrochronological application to study global climate change [1]. Most standard approaches to model Tree-ring Width (TRW) and climate fluctuations implement time-series decomposition at specific levels of variance (single-level modeling) [2, 3]. The TRW data extraction usually requires sampling with specialized hardware [4] and/or specific statistical software [2, 3, 5, 6]. These procedures have been efficiently implemented not only for climate change studies [1] but also in paleoclimatic reconstructions [1, 7–9], the assessment of forest response to water availability [10, 11], among others [12–15]. Even though these methods are widespread applied, they have not always considered the sampling design and the different levels of variation in tree-ring series. Sampling dendroclimatic variables in forest ecosystems results in hierarchical sources of variability from ecological factors. For instance, variability in core replicates (number of samples per each analyzed tree) of TRW from forest communities would have at least three hierarchical levels: tree-radial morphology [16], individual-tree genetics or phenotype [17], and stand quality [15]. Additional ecological factors such as site elevation [18], fire intensity [14], tree decay [19], or water regimes [10, 11] can further complicate hierarchical variance in TRW. Meteorological records, which are used to model climatic variables in dendroclimatic analyses, also contain temporal and spatial variability [20, 21].
Developing software for dendroclimatic analysis in forest ecosystems can help to address these sources variability. This new software should account for the effects of both ecological-factors and sampling schemes, and should integrate tree-growth/yield modeling into dendroclimatic analysis [22–24] based on multilevel modeling and multivariate comparison.
One example of multilevel modeling methods is mixed-effects regression [25–28], which can be implemented to detrend dendroclimatic data by considering random effects from ecological factors. Another example is dissimilarity analysis [29–32], which can be used to compare and organize dendroclimatic fluctuations into common ecological-factor levels. However, the implementation of these methods using statistical environments requires dendroclimatic inputs stored in special formats [33] such as Multilevel Dendroclimatic Data Series (MDDS), or sequences of observations ordered according to spatial/temporal hierarchies which are the result of sampling schemes, with sample variability confined to ecological factors.
The dendroclimatic modeling in forest ecosystems should also integrate information about tree allometry, growth, and yield. For example, allometric scaling can help transform TRW data into other serial components of tree growth [34–38]. Similarly, organic growth theory has provided simplified equations [39–41] that can be used to subtract ontogenetic trends from TRW data. Such equations are log-linear expressions of tree growth that are easily fitted to the multilevel data applying mixed-effects-modeling procedures: e.g. lme methods in R [42].
In addition, dendroclimatic modeling can also involve time-series transformations at specific levels in the ecological factors. Such modeling is important for evaluating allometric parameters for TRW [36–38], computing water-availability indexes from climatic MDDS [20, 43–45], developing statistical process control for TRW series [2, 46–48], or for time-series smoothing and decomposition [49, 50]. Procedures to evaluate this kind of routines should be efficient and preserve MDDS structures and be adaptable to new methods in dendroclimatic modeling. Consequently, efficient implementation of all these procedures requires of programming higher-order functions which can recursively evaluate diverse routines, control multilevel detrending, and compare fluctuations in MDDS.
In this work, we present the R-package BIOdry, a statistical package that processes MDDS using higher-order functions to integrate processes for input derivation, multilevel analysis, and multivariate comparison of dendroclimatic fluctuations. Here, we explain the functionality of the package by modeling relationships between TRW fluctuations and drought using MDDS from pine forests in the Iberian Peninsula. We also test the hypothesis that accounting for ecological factors improves statistical analysis for dendroclimatic modeling compared to other conventional alternatives, such as linear regression. To do so, we compare parameters from implementing two procedures: conventional single-level modeling and the multilevel procedures in the package. The examples of dendroclimatic modeling developed here are also intended as a guideline for future BIOdry users interested in implementing the package for modeling other ecological relationships in time and space.
2 Package installation and requerimients
The primary focus of this paper is on BIOdry 0.5 (R versions ≥ 3.4.2). This and further versions of the package can be easily installed in any R session using the install.packages command. Once installed, the package is loaded in the R environment with the library command. The in-package routines are programmed with generic functions in base, stats and rutils packages. Functions for developing the multilevel modeling and the multivariate comparison depend on two R packages in CRAN: nlme [42], and ecodist [30]. These are automatically loaded in R after the package has been requested. A further dependence, the nlme package, is used to extract fluctuations in MDDS, or Multilevel Dendroclimatic Data Series, with linear mixed effects models (detrending); while ecodist is used to compute multivariate correlations (Mantel correlograms) between two compared MDDS.
2.1 Data inputs
BIOdry processes three kind of data sets: TRW (mm), monthly cumulative precipitations (mm), and monthly average temperatures (°C). These can have wide or long formats. In the wide data format, the rows are ordered and labeled from earliest to latest year and the columns contain the dendroclimatic variables referred above. Names of the columns must be dot-separated codes representing the hierarchy of ecological factors, where higher ecological levels are defined first and lower levels after. For instance (truncations are indicated with ellipses):
> data(Pchron)
> tail(Pchron, 3)
P16106.17.a P16106.17.b… P44005.7.b
2003 0.926 0.882… 0.621
2004 1.205 0.638… 0.368
2005 0.964 0.295… 0.038
Pchron is a TRW (1861-2005) for Pinus pinaster Ait. forests (P. pinaster) in northern and east-central Spain [51]. These two regions have contrasting climate regimes with forests in central areas of the country being more affected by drought than forests in northern regions. To account for source variability, two trees were selected per site, and two cores were extracted from each tree. Consequently, the column names have within-plot levels representing variabilities in the plot qualities (alphanumerics), tree genetics/phenotypes (numbers), and tree-radial morphology (lowercase letters).
Meteorological records in wide formats have the rows labeled with years and the columns labeled with dot-separated codes of nearby plots and monthly abbreviations. For instance:
> data(Prec)
> data(Temp)
> names(Prec) # or names(Temp)
[1] “P16106.Apr” “P16106.Aug” “P16106.Dec” “P16106.Feb”…
The Prec sample data set, included in BIOdry, consists of cumulative precipitations, while Temp contains corresponding monthly temperatures. These were provided by the Spanish Meteorological Agency (AEMET).
On the other hand, long data formats should have the chronologies and time units in the initial columns, followed by ecological factors. In the factor hierarchy, columns of lower time units or classification factor levels are defined first, and columns of higher levels are specified later. For instance,
> data(Prings05)
> head(Prings05, 3)
x year sample tree plot
1 0.698 1897 a 17 P16106
2 0.878 1898 a 17 P16106
3 0.842 1899 a 17 P16106
where Prings05 is a long format of data stored in Pcrhon.
An alternative vector of tree radii can also be specified although this is not mandatory. In this case, the year of measurement of the radii should also be recorded. Until recently, core samples often lacked the initial rings of the pits producing truncated growths [22]; so these reference radii can be used to scale the cumulative growth. This vector should be named with dot separated codes and contain codes in the tree factor of the TRW.
> data(Pradii03)
> Pradii03
P16106.17 P16106.18 P44005.56 P44005.7
122.2343 139.7485 158.5184 114.9109
3 Package functionality
Dendroclimatic modeling has evolved from paleoclimatic sciences where the ecological variability is seen as noise [9]. Consequently, statistical procedures in dendroclimatic research traditionally consider ecological variability as irrelevant. These standard methods model signs by independently detrending TRW/climatic fluctuations (Fig 1, upper panels) and comparing the extracted signs with cross-correlation functions.
Standard approaches (upper panels) usually account for sample variability (gray areas). These implement cross-correlation analyses to compare the detrended fluctuations. On the other hand, the BIOdry package (lower panels) consider hierarchical structures from sample design (gray layers) to account for source variability: e.g. tree morphology. The package implements dissimilarity analysis to compare the detrended fluctuations.
Procedures in the BIOdry package can complement these approaches by modeling and comparing dencroclimatic fluctuations while accounting for ecological variability (Fig 1, lower panes). To do so, the package implements methods to derive, detrend and correlate multilevel dendroclimatic fluctuations while accounting for factors defined in MDDS (Appendix A). These are programmed using a variety of in-package routines (Tables 1 and 2) which could be implemented independently if users are interested in running specific calculations or better understanding the sequence of steps for dendroclimatic modeling. However, implementation of these can be burdensome. Consequently, the routines can also be recursively evaluated on MDDS using two higher-order functions: modelFrame and muleMan. Former modelFrame can either derive allometric components of tree-growth fluctuation from TRW data or compute aridity indexes from meteorologic records depending on whether defaults in the function are maintained or modified.
These are specified in the form of lists with the fn and lv arguments (see second argument in Table 3). Arguments of these one-level functions are specified in modelFrame in either a MoreArgs list argument or directly, depending on whether they are vectors (e.g. mp) or constants, respectively.
Implementation of these is similar to what was explained for tree growth modeling in Table 1.
3.1 Modeling features
3.1.1 Function defaults.
By default, the modelFrame function synchronizes, cumulates, and detrends TRW by evaluating three routines and a detrending formula (Table 3). The routines are defined in the fn argument and vectorized over factors in the lv argument. The synchronization circumvents biases causing artificial increases or decreases in the modeled tree growth: slow-grower survivorship, big-tree selection, among other [22] while enhancing convergence of parameters during the detrending procedure [42]. The cumulative TRWs are used to model ontogenetic growth fitting detrending formulas [41]. These are specified with the form argument in modelFrame. The defaults are maintained by only specifying the processed MDDS. For instance,
This function is used to model fluctuations of Tree-ring Widths (TRWs) (cm) and Annual Aridity Indexes (AAIs) (dimensionless).
> trwf <- modelFrame(Pchron)
On the other hand, muleMan compares TRW and aridity-Index fluctuations while maintaining hierarchical structures in the MDDS, see Section 7.1.
3.1.2 Detrending formulas.
Currently, the ‘tdForm’ or ‘lmeForm’ methods can be implemented automatically with the form argument to evaluate formulas with the same names. The ‘tdForm’ method is a linear generalization from growth theory (Appendix A.1) used here to subtract ontogenetic growth from TRW. This method considers either time units or classification factors in MDDS as random effects, see details about formulation of ‘tdForm’ method in Section A.1.2. The ‘lmeForm’ method implements a more flexible linear formula, which is implemented here to detrend normalized aridity indexes. See also structure of ‘lmeForm’ method in Section A.1.3.
3.1.3 Correlation functions and variance structures.
The package can implement procedures for serial normalization in the nlme package. Heteroscedasticity and serial autocorrelation of the detrended MDDS can be modeled with arguments in lme methods that are specified in modelFrame: e.g. weights and correlation arguments in the nlme package (Appendices A.3 and A.4). Likewise, autocorrelation in detrended MDDS can be assessed with empirical autocorrelation functions.
3.1.4 Multivariate comparison.
Mantel correlograms between two MDDS with a common classification factor are established by comparing distances in one of the MDDS with sets of binary model matrices that specify membership in the other MDDS classes (Appendix A.5). The multivariate comparison is implemented here to compare detrended aridity indexes with detrended TRW.
4 Tree-growth modeling and diagnostics
Parameters of tree growth can be enhanced by changing arguments of the routines in the modelFrame function or updating modelFrame objects with the new arguments. The new model updating can help for properly scaling truncated fluctuation [22] and correctly defining shape of tree ontogeny [41]. TRW lacking initial rings in the pits can be scaled around the vectors of reference radii specifying the tree radii and corresponding measurement year in sc.c and rf.t arguments, respectively. Likewise, the detrending formula can be modified from Chapman-Richards to Levakovic forms, changing default log.t = FALSE to log.t = TRUE respectively. The forms are simplified expressions of most growth equations used in forest sciences to model tree growth [41]. Here, we update the trwf object modifying these parameters with the update call:
> trwf <- update(trwf,
sc.c = Pradii03,
rf.t = 2003,
log.t = TRUE)
The trwf object contains a threefold list of model parameters in model, TRW fluctuations in fluc, and the model call in call. Parameters in trwf can be inspected calling the model element with currency or bracket operators and using plenty diagnostic functions in the nlme package [42]. Alternatively, we have developed five in-package S3 methods to directly inspect parameters in modelFrame objects with lme functions: summary, plot, getData, anova, and Empiric Autocorrelation Function (ACF). These are implemented here to inspect significances of fixed-effects parameters, compare models with similar fixed-effects structures, and detect serial patterns in normalized fluctuations.
4.1 Model diagnostics
The summary function provides information about the estimation method (default method = REML), information-based criteria (Akaike Information Criterion (AIC), Bayesian Information Criterion (BIC), and Log-Likelihood (logLik)), fixed-effects formula (default form = ‘tdForm’), random-effects standard deviations, and conditional t-tests, as shown in following summary output:
> summary(trwf)
Linear mixed-effects model fit by REML
Data: fixed
AIC BIC logLik
1425.193 1488.797 -699.5963
Random effects:
Formula: ˜log(csx) + log(time) | plot
Structure: Diagonal
(Intercept) log(csx) log(time)
StdDev: 0.9541355 0.00115044 0.226334
Formula: ˜log(csx) + log(time) | tree %in% plot
Structure: Diagonal
(Intercept) log(csx) log(time)
StdDev: 0.872989 2.42083e-07 0.2220615
Formula: ˜log(csx) + log(time) | sample %in% tree %in% plot
Structure: Diagonal
(Intercept) log(csx) log(time) Residual
StdDev: 0.09011541 6.108796e-24 0.02288596 0.4784995
Fixed effects: log(x) ˜ log(csx) + log(time)
Value Std.Error DF t-value p-value
(Intercept) -3.171448 1.4210373 978 -2.231784 0.0259
log(csx) 1.398107 0.3585760 978 3.899055 0.0001
log(time) -0.953834 0.2296603 978 -4.153239 0.0000
Correlation:
(Intr) lg(cs)
log(csx) -0.822
log(time) 0.423 -0.521
Standardized Within-Group Residuals:
Min Q1 Med Q3 Max
-6.9065514 -0.5052164 0.1119620 0.6316315 3.5025194
Number of Observations: 988
Number of Groups:
plot tree %in% plot
2 4
sample %in% tree %in% plot
8
The random effects of the summary output define the limits of the hierarchical levels and are expressed in standard deviations. The fixed effects show conditional t-tests testing marginal significances of fixed effects. These suggest that all the fixed-effects terms of multilevel model in trwf are significant (p ≤ 0.026). We can infer that tree-growth parameters derived from the TRW conforms to the theoretical growth patterns implied by the Levakovic expression of tdForm. Diagnostic plots for assessing the quality of the fit are obtained using the method for the plot function.
> plot(trwf) ## Fig (2)
Accounting for the within-plot variability affected patterns between the TRW fluctuations (Fig 2). Fluctuations from trees in similar plots were more comparable than these in different plots. This could be explained by the different biophysical conditions of the plots, for instance: genetic adaptations of the populations, local variations in nutrients and water availability, competition [52].
Residual autocorrelations in the chronology has been accounted for with an auto-regressive structure for lags ≥ 2 (ARMA(p = 1, q = 1)) after normalizing the fitted residuals via Choleski factorization.
To analyze improvements in model fitting by accounting for within-core variability, we ignore the grouping structure of the tdForm formula and fit a single linear regression without random effects to the TRW data. The model is fitted with lm regression in default R-package stats. The cumulative TRW are recycled from the trwf object with getData function. Parameters fitted with the new lm model are compared with parameters of the multilevel model in trwf doing a log-Likelihood ratio test(logLik) with the anova function.
> trwfl <- lm(log(x) ˜ log(csx) + log(time),
data = getData(trwf))
> anova(trwf,trwfl)
Model df AIC BIC logLik Test L.Ratio p-value
trwf 1 13 1425.193 1488.797 -699.5963
trwfl 2 4 1525.847 1545.418 -758.9235 1 vs 2 118.6543 <.0001
This suggests that accounting for within-core variability significantly reduces AIC, BIC, and logLik (p ≤ 10−4). Consequently, we assert that multilevel model in trwf fits better than the tdForm model with only fixed effects.
4.2 Allometric components of tree grotwth
Allometric relationships in components of tree growth other than TRW: e.g. tree diameters, basal areas, or tree biomasses, have not been fully explored to understand non-linear relationships between tree growth and drought [53]. The modelFrame function can help to explore such relationships by recursively evaluating allometric parameters in mp and formatting metric units of the TRW in to argument. However, this recursive evaluation of allometric parameters must be done with care because excessive iteration of parameters could exponentially increase errors of the allometric models. Here, we specify two pairs of parameters in mp to compute fluctuations in Diameters at Breast Height over bark (DBH). The first pair of parameters c(2, 1) is used to compute diameters at breast height inside bark (dib) from the TRW. Metric units of the TRW are transformed from mm to cm before the dibs are computed. This is done by writing to = ‘cm’. The other two parameters c(2.87, 0.85) are used to predict the DBHs (cm) from the dibs. The four allometric parameters are formulated in a common vector inside a MoreArgs list. This avoids the modelFrame function to vectorize over the allometric constants along the within-core levels.
> tdf <- update(trwf,
to = ‘cm’,
MoreArgs = list(mp = c(2, 1, 2.87, 0.85)))
> summary(tdf)
Linear mixed-effects model fit by REML
Data: fixed
AIC BIC logLik
1425.903 1489.507 -699.9515
#…
Fixed effects: log(x) ˜ log(csx) + log(time)
Value Std.Error DF t-value p-value
(Intercept) -3.185970 1.3154323 978 -2.421995 0.0156
log(csx) 1.469242 0.4220967 978 3.480818 0.0005
log(time) -0.954741 0.2298998 978 -4.152858 0.0000
#…
5 Random-effects structures
Hierarchical sources of variability from endogenous and exogenous disturbances in the stands are usually masked in the error terms of the TRW chronologies or the climatic fluctuations [54]. BIOdry implements methods to account for this variability using the modelFrame function. By default, this function constructs random effects with pdDiag constructor class in nlme [42], a primary covariate from the formula in form, and the factors in the MDDS. However, users of the package can specify other constructor classes, see p. 157 in [42], and different random-effects covariates while removing factors from the processed MDDS. These are specified in either the modelFrame or update functions with the random and lev.rm arguments. We update here the trwf object with new parameters to modify original random-effects structure in trwf by specifying the pdIdent constructor class and the ˜time covariate while removing the sample random effect.
> trwfr <- update(trwf,
random = pdIdent(˜ time),
lev.rm = ‘sample’)
> summary(trwfr)
Linear mixed-effects model fit by REML
Data: fixed
AIC BIC logLik
1510.969 1540.325 -749.4844
Random effects:
Formula: ˜time | plot
Structure: Multiple of an Identity
(Intercept) time
StdDev: 8.272046e-07 8.272046e-07
Formula: ˜time | tree %in% plot
Structure: Multiple of an Identity
(Intercept) time Residual
StdDev: 0.001311275 0.001311275 0.512138
#…
6 Serial modeling
Multilevel analysis of dendroclimatic data involves spatial and temporal pseudoreplication: i.e., series in the hierarchical levels are correlated introducing dependencies among the modeled fluctuations [55]. BIOdry can print ACF plots to inspect such dependencies. The inspection process is controlled with two arguments: maxLag and alpha. The first argument controls the number of lags for which the autocorrelations are computed. The second argument specifies the significance level for two side critical bounds.
> plot(ACF(trwf, maxlag = 10),
alpha = 0.01) ## Fig (3), upper panel
The plot shows that fluctuations in trwf are autocorrelated at three annual lags (Fig 3, upper panel). Part of this occurred because the core samples are replicated across the within-plot levels introducing dependencies among the residuals [55]. Another part of the autocorrelation could be explained by common climatic signals of the series [56].
Labels indicate level-codes in the plot factor.
The residual autocorrelation in multilevel models can be accounted for with standard corStruct classes in nlme, see p. 234 in [42]. Here, we account for the three annual-lag autocorrelation updating multilevel model in trwf with an auto-regressive structure for time lags ≥ 2, see Appendix A.3. This is developed with correlation argument in the update function. Adequacy of the fitted correlation model is tested with diagnostic plots of ACFs applied to normalized fluctuations. Choleski decomposition of lme is implemented to normalize the fluctuations, see Appendix A.2. This is controlled with resType=‘n’ argument in ACF. We also test improvements in parameters of the correlation model using the in-package anova method.
> trwfAC <- update(trwf,
correlation = corARMA(p = 1, q = 1))
> plot(ACF(trwfAC, maxlag = 10,
resType = ‘n’),
alpha = 0.01) ## Fig (3), middle panel
> anova(trwf, trwfAC)
Model df AIC BIC logLik Test L.Ratio p-value
trwf 1 13 1425.193 1488.797 -699.5963
trwfAC 2 14 1292.074 1360.571 -632.0371 1 vs 2 135.1184 <.0001
The ACF plot indicates fluctuations in trwfAC to be uncorrelated (Fig 3, middle panel), and the anova output evinces that accounting for the residual autocorrelation improves parameters of the multilevel model (p ≤10−4).
7 Aridity-index fluctuations
The Annual Aridity Index (AAI) is a good indicator of seasonal drought. This is constructed under the assumption that 1°C of monthly temperature amounts to 2mm month−1 of evaporation [57, 58] helping to identify periods of relative water surplus or deficit. modelFrame can model and detrend series of AAI processing the meteorological MDDS. Formulating parameters to model aridity indexes from Prec and Temp data sets using modelFrame is similar to what was explained for modeling tree-growth fluctuations. Here, new functions moveYr and wlai are specified in fn argument (Table 2). The moveYr function transforms time units into climatic variables for the years considered begin with October (default ini.mnt = ‘Oct’, which is the beginning of meteorological year in Spain). The wlai function calculates square-root annual aridity indexes (default sqt = TRUE). Both functions are evaluated at the year level with the lv argument. Fluctuations in the aridity-index are extracted using alternative lmeForm formula. This is a linear expression developed to detrend square-root aridity indexes, see Appendix A.1.3. Default Restricted Maximum Likelihood (REML) method is maintained.
> aif <- modelFrame(list(Prec, Temp),
fn = list(’moveYr’,’wlai’),
lv = list(’year’,’year’),
form = ‘lmeForm’)
> summary(aif)
Linear mixed-effects model fit by REML
Data: fixed
AIC BIC logLik
44.47932 57.40416 -17.23966
#…
Fixed effects: AI ˜ year
Value Std.Error DF t-value p-value
(Intercept) -18.928112 3.648494 97 -5.187925 0
year 0.009985 0.001842 97 5.420766 0
#…
> plot(aif) ## Fig (4)
In this case, the fixed effects terms of the aridity-index model are significant (p ≈ 0) suggesting that drought increased throughout the study area during the 50 years prior to sampling, even though the slope is very low (9.9 × 10−3). Detrended plot-level aridity indexes fluctuate around zero over time and display constant variance (Fig 4).
Labels indicate level-codes in the plot factor.
Similarly to the TRW modeling, we compare parameters of the multilevel model in aif with parameters of an lm model without grouping structure. We also inspect presence of serial patterns in the aridity-index fluctuations.
> aifl <- lm(AI ˜ year,
data = getData(aif))
> anova(aif, aifl)
Model df AIC BIC logLik Test L.Ratio p-value
aif 1 5 44.47932 57.40416 -17.23966
aifl 2 3 42.43269 50.18759 -18.21635 1 vs 2 1.953366 0.3766
> plot(ACF(aif, maxlag = 10, resType = ‘n’),
alpha = 0.01) ## Fig (3), lower panel
Parameters in the multilevel model in aif exhibit non-significant differences with regard to parameters in the lm model (p ≈ 0.38) suggesting that more extensive meteorological records are needed to properly account for plot-level variability within the multilevel model. On the other hand, the ACF plot of normalized fluctuations in aif indicates no significant autocorrelations (Fig 3, lower panel). Considering that the fixed-effects parameters in aif are significant and that the corresponding fluctuations are not autocorrelated, we use the aif model during the final process of multivariate comparison.
7.1 Multivariate comparison
The BIOdry package can measure relationships and patterns between classes of the dendroclimatic fluctuations by computing Mantel correlograms (Appendix A.5). This comparison is performed by evaluating the muleMan function. Here, we use this function to correlate dissimilarity matrices between fluctuations in the trwf and the aif objects which have common levels in the plot factor. We test significances of the multivariate correlations by repeatedly permuting the standardized dissimilarity matrices by rows and columns with the nperm argument.
> mcomp <- muleMan(trwf,
cd = aif,
nperm = 103)
> plot(mcomp) ## Fig (5)
The plot-level panels suggest that relationships between fluctuations in trwf and aif depend on the within-plot hierarchy, and plot location is the most important driver of dendroclimatic correlations (Fig 5). Within-plot variability affects the patterns of the computed correlations over time, while within-tree variability influences correlation trends and scales across the computed distance classes. Trends in the two sites exhibit oscillating pulses, and pulse frequency was significant for distance classes around 2 or 6. Although dendroclimatic interactions are stronger in East-Central Spain (P16106), they are more regular in Northern Spain (P44005) than in East-Central Spain (P16106). The Mantel correlogram also illustrates that dendroclimatic relationships are affected by within-core variability, such as radial-increment morphologies in this case. The core replicates of the same trees indicates both significant and no significant responses to drought (Fig 5, filled and empty circles, respectively), and discrepancies are more evident in P16106 than in P44005.
Red circles indicate significant correlations (p ≥ 0.05).
8 Conclusion
The BIOdry package is an useful tool for multilevel dendroclimatic modeling by integrating recursive evaluation of one-level functions, multilevel modeling, and multivariate comparison of dendroclimatic fluctuations. A larger community of researchers will find it relatively easy to use, as it only requires implementation of the modelFrame and muleMan functions. Most arguments in other functions of the same package, or in the nlme or the ecodist libraries, can be implemented with the same modelFrame and the muleMan functions. The implementation of the functionalities within the BIOdry package to study the effects of dendroclimatic factors on forest-ecosystem dynamics could be greatly relevant for managing natural forests or monocultures, or for understanding the role of species mixtures in forestry. In the latter case, effects from dendroclimatic factors representing complementary resource uses (root stratification, nutrients, facilitative improvements, etc.) or even commercial productivity might be modeled with the package. We hope that other scientists can implement the BIOdry package to study the effect of such factors on many species and at different successional stages.
A Appendix: Multilevel and multivariate analyses with the BIOdry package
A.1 Trend subtraction
Currently, the R-package BIOdry has two detrending formulas:‘tdForm’ and ‘lmeForm’. The first formula uses growth theory to subtract trends in organic variables. The second formula is a more flexible linear expression developed to detrend normalized aridity indexes and to formulate further linear relationships in MDDS: e.g. gropedData structures for panel plotting.
A.1.1 Growth theory.
Ontogenetic component of organic growth (e.g. tree growth) behaves like a mechanistic open system with inputs and outputs of energy [40, 59, 60]. Diverse growth equations have been postulated on this basis to model, for instance, forest growth [39, 61–63], with most of the proposed equations being particular cases of following log-linear expression [41]: (1) where ln is logarithm natural; is the relative growth of the organism; β are parameters to be fitted; Y is cumulative organic growth; and t is the time. f(t) is a function of time which can be ln(t) or t. When f(t) = t, Eq (1) is called a Time-Decline Form (TD), and when f(t) = ln(t) then the equation is a Logarithmic Time-Decline Form (LTD). Even though growth theory has provided theoretical balance equations, such as the TD or the LTD forms, to detrend tree growth series, conventional dendroclimatical software implements particular cases of growth or arbitrary equations, such as polynomials or even straight lines [2, 64]. These procedure has been successful in subtracting fluctuations and signals from dendrochronologic analysis [54]. Growth equations can help to enhance detrending process in dendrochronolgical research.
A.1.2 Multilevel detrending of MDDS.
The ‘tdForm’ method in BIOdry package is a linear generalization of Eq (1) which considers either time units or classification factors in MDDS as random effects. This method function has the following structure: (2) (3) (4) (5) (6) where the β are the model parameters, υ⋯ = (υ⋯,0, υ⋯,1, υ⋯,2)T are vectors of outermost levels of grouping, υ⋯i is a vector of tree random effects nested in the outermost levels of grouping, and υ⋯ic is a vector of core random effects nested in trees that are nested in the outermost levels. The vector ϵ⋯ic,t contains the within-group residual errors ϵ⋯ic,t and follows a multivariate normal distribution ϵ⋯ic,t ∼ N(0, R).
A.1.3 Trends in aridity indexes.
Algorithms in BIOdry can construct Walter-Lieth diagrams and compute annual aridity indexes as the ratios between dry and wet areas in the diagrams. The inputs are MDDS of monthly average temperatures (°C) and monthly precipitation sums (mm). This integrates seasonal precipitations and temperatures to identify periods of relative water surplus and deficit [65]. Square root transformation of the aridity indexes are usually stationary [20]. Transformed aridity indexes also exhibit weak linear trends [52] that can be subtracted with simple linear equations, without further filtering the low and high frequencies observed in the raw precipitation and temperature records [43]. Autocorrelation in climatic proxies has been a frequent problem in dendroclimatology [47]. The square root transformed indexes use are weakly autocorrelated [52].
The ‘lmeForm’ method implements a flexible linear formula, which was developed to enable users formulate their own multilevel linear expressions. For instance, form = ‘lmeForm’ can be used to detrend the aridity indexes. In such a case, the ‘lmeForm’ method formula would have the following structure: (7) (8) (9) (10) where the γ are the model parameters, τk = (τk,0, τk,1)T is the vector of location random effect. The vector ωk,t contains the location-level residual errors ωk,t and follows a multivariate normal distribution ωk,t ∼ N(0,R). These are ordered on time and depict climatic fluctuations.
A.1.4 Back-transformation of model parameters.
In some applications involving ecological analysis of dendroclimatic variables, it might be necessary to back-transform tree-growth and aridity-index predictions from Eqs (2) and (7) to the original scales. It is well known that exponential and power transformations could introduce biases and correction factors must be applied [66, 67]. However, it is not an objective of the BIOdry package to find how correction factors could best be incorporated in mixed effects models. Analyses with the package focus on comparing dendroclimatic fluctuations without the need of back-transforming variables and the mixed-effects models include no correction factors. Therefore, inferences with the package apply only to the untransformed values of the parameters.
A.2 Residual normalization
Multilevel fluctuations in modelFrame objects are normalized via Choleski decomposition of the residual variance-covariance matrix, see p. 239 in [42]. For the innermost level of Residuals of Annual Biomass Increment (RABI) the normalized residuals are defined as follows: (11) where is the lower triangle of the Cholesky decomposition of the variance-covariance matrix Vi, Xi is the design matrix, contains the parameter estimates and yi is the vector of observed responses.
A.3 Autocorrelation function
Autocorrelation of normalized fluctuations in multilevel models is assessed with ACF, see p. 183 in [28]. As an example, the autocorrelation model for the lowest level of ϵ⋯ic,t normalized residuals is (12) where l is the lag in years, and N(l) is the number of residual pairs defining the numerator of . Any correlation structure of nlme package can be implemented in modelFrame function to model residual autocorrelation. For the case of dendroclimatic data, autocorrelation in MDDS is modeled with an autoregressive-moving average models ARMA (1,1), which have exponentially decaying auto-correlation functions for lags ≥ 2. For the lowest level of the ϵ⋯ic,t normalized residuals the autocorrelation model is (13) where ϕ was the autoregressive parameter; θ was the moving average parameter; and ac,t was the noise term, see p. 128 in [50].
A.4 Variance function
Heteroscedasticity and serial autocorrelation of and Residuals of Annual Aridity Index (RAAI) can be modeled with correlation and weights arguments [42] in modelFrame function. However, such modeling would take a long time depending on the complexity of the classification factors in the MDDS. For example, the variance model for the innermost level of ϵ⋯ic,t normalized residuals is: (14) where σ2 was the variance; and were constants; νc,t had the same structure of the fixed effects from Eqs (2) and (7).
A.5 Multivariate comparison
Mantel correlograms between two MDDS with a common classification factor are established by comparing distances in one of the MDDS with sets of binary model matrices specifying membership in particular classes of the other MDDS. Euclidean distances from both MDDS are standardized to z-scores by subtracting the means from individual distances and then dividing the differences by the corresponding standard deviations. The Mantel statistic is (15) where d is the distance class from one of the MDDS; zij is the distance between each pair i and j from the other MDDS; wij is a weight for the pair: typically 1 if zij is in d and 0 if it is not. The number of classes d is calculated with the Sturges rule [29]: (16) where m is the number of distances in the upper-triangular binary model matrix.
Acknowledgments
This research has been supported through funds from the Colombian Government (Colciencias) within the framework of the National Program for Doctoral Studies Abroad (Year 2012, Number 568). This study was also financed by the FORMIXING project (AGL2014-51964-C2-1-R) funded by the Spanish Ministry of Economy, Industry and Competitiveness. Our thanks to the researchers at the UVa-INIA Sustainable Forest Management Research Institute for their assistance in core sampling and for their ideas related to the development of the algorithm; in particular Cristobal Ordoñez, Ana de Luca, José Riofrio, and Cristina Prieto. Many thanks to Andy Bunn (Western Washington University), Ignacio del Valle (National University of Colombia) and Victor Gutierrez-Velez (Temple University) for their valuable suggestions. Finally, we want to thank the anonymous reviewers, as well as Ben Bon-Lamberty (Academic Editor of PLOS ONE), for their suggestions and comments.
References
- 1.
Farmer GT, Cook J. Climate Change Science: A Modern Synthesis: Volume 1—The Physical Climate. Springer, Dordrecht Heidelberg New York London, 564 pp.; 2013.
- 2.
Cook E, Holmes R. Guide for computer program ARSTAN; 1996. Available from: http://www.esf.edu/for/bevilacqua/for496/.
- 3. Grissino-Mayer HD. Evaluating crossdating accuracy: a manual and tutorial for the computer program COFECHA. Tree-Ring Res. 2001;57(2):205–221.
- 4.
Voor Tech Consulting. Measure J2X; 2008. Available from: http://www.voortech.com/projectj2x/docs/V321/install.htm.
- 5. Kanatjev AG, Shumilov OI, Kasatkina EA. Software for dendrochronological measurements. Instrum Exp Tech. 2014;57(2):214–217.
- 6.
Regent Instruments Canada Inc. WINDENDRO for Tree-ring Analysis; 2009. Available from: http://www.regent.qc.ca/assets/windendro_about.html.
- 7. Boswijk G, Fowler AM, Palmer JG, Fenwick P, Hogg A, Lorrey A, et al. The late Holocene kauri chronology: assessing the potential of a 4500-year record for palaeoclimate reconstruction. Quaternary Sci Rev. 2014;90:128–142.
- 8. Petrillo M, Cherubini P, Fravolini G, Marchetti M, Ascher-Jenull J, Schärer M, et al. Time since death and decay rate constants of Norway spruce and European larch deadwood in subalpine forests determined using dendrochronology and radiocarbon dating. Biogeosciences. 2016;13(5):1537–1552.
- 9.
Hughes MK, Swetnam TW, Diaz HF. Dendroclimatology: progress and prospects. vol. 11. Springer, Dordrecht Heidelberg London New York; 2010.
- 10. Battipaglia G, Saurer M, Cherubini P, Siegwolf RTW, Cotrufo MF. Tree rings indicate different drought resistance of a native (Abies alba Mill.) and a nonnative (Picea abies (L.) Karst.) species co-occurring at a dry site in Southern Italy. For Ecol Manage. 2009;257(3):820–828.
- 11. Cocozza C, Marino G, Giovannelli A, Cantini C, Centritto M, Tognetti R. Simultaneous measurements of stem radius variation and sap flux density reveal synchronisation of water storage and transpiration dynamics in olive trees. Ecohydrology. 2014;8(2014):1–13.
- 12. Gerhart LM, McLauchlan KK. Reconstructing terrestrial nutrient cycling using stable nitrogen isotopes in wood. Biogeochemistry. 2014;120:1–21.
- 13. Nehrbass-Ahles C, Babst F, Klesse S, Nötzli M, Bouriaud O, Neukom R, et al. The influence of sampling design on tree-ring based quantification of forest growth. Glob Chang Biol. 2014;20:1–19.
- 14. Jones MD, Bowles ML. Eastern redcedar dendrochronology links hill prairie decline with decoupling from climatic control of fire regime and reduced fire frequency 1. J Torrey Bot Soc. 2016;143(3):239–253.
- 15. Metsaranta JM, Lieffers VJ. Using dendrochronology to obtain annual data for modelling stand development: a supplement to permanent sample plots. For. 2009;82(2):163.
- 16. De Micco V, Battipaglia G, Cherubini P, Aronne G. Comparing methods to analyse anatomical features of tree rings with and without intra-annual density fluctuations (IADFs). Dendrochronologia. 2014;32(1):1–6.
- 17. Troupin D, Nathan R, Vendramin G. Analysis of spatial genetic structure in an expanding Pinus halepensis population reveals development of fine-scale genetic clustering over time. Molecular Ecology. 2006;15(12):3617–3630. pmid:17032261
- 18. Touchan R, Shishov VV, Tychkov II, Sivrikaya F, Attieh J, Ketmen M, et al. Elevation-layered dendroclimatic signal in eastern Mediterranean tree rings. Environ Res Lett. 2016;11(4):044020.
- 19. Foster JR, D’Amato AW, Bradford JB. Looking for age-related growth decline in natural forests: unexpected biomass patterns from tree rings and simulated mortality. Oecologia. 2014;175(1):363–374. pmid:24442595
- 20. Manrique E, Fernandez-Cancio A. Extreme Climatic Events in Dendroclimatic Reconstructions from Spain. Clim Chang. 2000;44(1):123–138.
- 21. García-López JM, Allué C. A phytoclimatic-based indicator for assessing the inherent responsitivity of the European forests to climate change. Ecol Indic. 2012;18:73–81.
- 22. Bowman DMJS, Brienen RJW, Gloor E, Phillips OL, Prior LD. Detecting trends in tree growth: not so simple. Trends Plant Sci. 2013;18(1):11–17. pmid:22960000
- 23.
Sanogo K, Gebrekirstos A, Bayala J, Villamor GB, Kalinganire A, Dodiomon S. Potential of dendrochronology in assessing carbon sequestration rates of Vitellaria paradoxa in southern Mali, West Africa. Dendrochronologia. 2016;.
- 24. Wu Y, Wang X, Ouyang S, Xu K, Hawkins BA, Sun OJ. A test of BIOME-BGC with dendrochronology for forests along the altitudinal gradient of Mt. Changbai in northeast China. J Plant Ecol. 2016; p. rtw076.
- 25. Green P, MacLeod CJ. SIMR: an R package for power analysis of generalized linear mixed models by simulation. Methods Ecol Evol. 2016;7(4):493–498.
- 26. Bolker BM, Brooks ME, Clark CJ, Geange SW, Poulsen JR, Stevens MHH, et al. Generalized linear mixed models: a practical guide for ecology and evolution. Trends Ecol Evol. 2009;24(3):127–135. pmid:19185386
- 27. Oswald SA, Nisbet IC, Chiaradia A, Arnold JM. FlexParamCurve: R package for flexible fitting of nonlinear parametric curves. Methods Ecol Evol. 2012;3(6):1073–1077.
- 28.
Galecki A, Burzykowski T. Linear Mixed-Effects Models Using R: A Step-by-Step Approach (Springer Texts in Statistics). 2013th ed. Springer; 2013.
- 29.
Legendre P, Legendre LF. Numerical Ecology, Volume 24, Third Edition (Developments in Environmental Modelling). vol. 24. Elsevier; 2012.
- 30. Goslee SC, Urban DL. The ecodist package for dissimilarity-based analysis of ecological data. J Stat Softw. 2007;22:1–19.
- 31. Legendre P. Comparison of permutation methods for the partial correlation and partial Mantel tests. J Stat Comput Simul. 2000;67(1):37–73.
- 32.
Oksanen J. Multivariate Analysis of Ecological Communities in R: vegan tutorial; 2015. Available from: https://cran.r-project.org/web/packages/vegan/vignettes/intro-vegan.pdf.
- 33. Wickham H, et al. The split-apply-combine strategy for data analysis. J Stat Softw. 2011;40(1):1–29.
- 34.
Peters RL, Groenendijk P, Vlam M, Zuidema PA. Detecting long-term growth trends using tree rings: A critical evaluation of methods. Global change biology. 2014;.
- 35. Carrer M, von Arx G, Castagneri D, Petit G. Distilling allometric and environmental information from time series of conduit size: the standardization issue and its relationship to tree hydraulic architecture. Tree Physiol. 2015; p. tpu108.
- 36. West GB, Brown JH, Enquist BJ. A general model for the origin of allometric scaling laws in biology. Science. 1997;276(5309):122–126. pmid:9082983
- 37. King DA. Linking tree form, allocation and growth with an allometrically explicit model. Ecol Model. 2005;185(1):77–91.
- 38.
Stillwell R, Shingleton A, Dworkin I, Frankino W. Tipping the scales: Evolution of the allometric slope independent of average trait size. Evolution. 2016;
- 39. Lei YC, Zhang SY. Features and Partial Derivatives of Bertalanffy-Richards Growth Model in Forestry. Model Control. 2004;9(1):65–73.
- 40. West GB, Brown JH, Enquist BJ. A general model for ontogenetic growth. Nature. 2001;413(6856):628–631. pmid:11675785
- 41. Zeide B. Analysis of Growth Equations. For Sci. 1993;39(3):594–616.
- 42.
Pinheiro JC, Bates DM. Mixed-effects Models in S and S-PLUS. New York: Springer-Verlag, New York, Berlin, Heidelberg; 2000.
- 43.
Guijarro JA. User’s guide to Climatol: An R contributed package for homogenization of climatological series (and functions for drawing wind-rose and Walted&Lieth diagrams). Version 2.1, distributed under the GPL licence, version 2 or newer. Guide version 1.0; 2011. URL http://webs.ono.com/climatol/climatol.html.
- 44. Martinelli N. Climate from dendrochronology: latest developments and results. Glob Planet Chang. 2004;40(1-2):129–139.
- 45. Mishra AK, Singh VP. A review of drought concepts. J Hydrol. 2010;391(1-2):202–216.
- 46.
Fritts H. Tree rings and climate. Academic Press Inc., London, 582 pp.; 1976. Available from: http://store.elsevier.com/product.jsp?isbn=9780323145282&pagename=search.
- 47.
Cook ER, Pederson N. Uncertainty, Emergence, and Statistics in Dendrochronology. In: Hughes MK, Swetnam TW, Dias HF, editors. Dendroclimatology: Progress and Prospects. Volume 11. Springer; 2011. p. 77–112.
- 48. Bunn AG. Statistical and visual crossdating in R using the dplR library. Dendrochronologia. 2010;28(4):251–258.
- 49.
Shumway RH, Stoffer DS. Time Series Analysis and Its Applications: With R Examples, Second Edition. New York: Springer; 2006.
- 50.
Cowpertwait PSP, Metcalfe AV. Introductory Time Series with R (Use R!). Springer, Dordrecht Heidelberg London New York; 2009.
- 51. Bogino S, Bravo F. Growth response of Pinus pinaster Ait. to climatic variables in central Spanish forests. Ann For Sci. 2008;65:1–13.
- 52. Lara W, Bravo F, Maguire DA. Modeling patterns between drought and tree biomass growth from dendrochronological data: A multilevel approach. Agric For Meteorol. 2013;178-179(0):140–151.
- 53. van der Molen M, Dolman A, Ciais P, Eglin T, Gobron N, Law B, et al. Drought and ecosystem carbon cycling. Agric For Meteorol. 2011;151(7):765–773.
- 54.
Speer JH. Fundamentals of tree-ring research. The University of Arizona Press, Tucson; 2010.
- 55.
Crawley MJ. The R Book, Second Edition. Chichester: Wiley; 2013.
- 56. Hughes MK. Dendrochronology in climatology–the state of the art. Dendrochronologia. 2002;20(1):95–116.
- 57. Jia Z, Luo W. A modified climate diagram displaying net water requirements of wetlands in arid and semi-arid regions. Agric Water Manag. 2009;96(9):1339–1343.
- 58.
Walter H. Vegetation of the Earth and Ecological Systems of the Geo-biosphere. Berlin: Springer-Verlag; 1985.
- 59. Jørgensen SE, Nielsen SN, Fath BD. Recent progress in systems ecology. Ecol Model. 2016;319:112–118.
- 60. Von Bertalanffy L, et al. The Theory of Open systems in Physics and Biology. Science. 1950;111(2872):23–29. pmid:15398815
- 61. Pommerening A, Muszta A. Relative plant growth revisited: Towards a mathematical standardisation of separate approaches. Ecol Model. 2016;320(24):383–392.
- 62. Shi PJ, Chen L, Hui C, Grissino-Mayer HD. Capture the time when plants reach their maximum body size by using the beta sigmoid growth equation. Ecol Model. 2016;320(24):177–181.
- 63. Tjørve E, Tjørve K. A unified approach to the Richards-model family for use in growth analyses: Why we need only two model forms. J Theor Biol. 2010;267(3):417–425. pmid:20831877
- 64. Bunn AG. A dendrochronology program library in R (dplR). Dendrochronologia. 2008;26(2):115–124.
- 65. Kempes CP, Myers OB, Breshears DD, Ebersole JJ. Comparing response of Pinus edulis tree-ring growth to five alternate moisture indices using historic meteorological data. J Arid Environ. 2008;72(4):350–357.
- 66. Hurlbert SH, Lombardi CM, Quinn GP, Keough MJ. Design and analysis: uncertain intent, uncertain result. Ecology. 2003;84(3):810–812.
- 67. Stow CA, Reckhow KH, Qian SS. A Bayesian approach to retransformation bias in transformed regression. Ecology. 2006;87(6):1472–1477. pmid:16869423