Correction
23 Dec 2015: The PLOS ONE Staff (2015) Correction: Evaluating Multi-Level Models to Test Occupancy State Responses of Plethodontid Salamanders. PLOS ONE 10(12): e0145899. https://doi.org/10.1371/journal.pone.0145899 View correction
Figures
Abstract
Plethodontid salamanders are diverse and widely distributed taxa and play critical roles in ecosystem processes. Due to salamander use of structurally complex habitats, and because only a portion of a population is available for sampling, evaluation of sampling designs and estimators is critical to provide strong inference about Plethodontid ecology and responses to conservation and management activities. We conducted a simulation study to evaluate the effectiveness of multi-scale and hierarchical single-scale occupancy models in the context of a Before-After Control-Impact (BACI) experimental design with multiple levels of sampling. Also, we fit the hierarchical single-scale model to empirical data collected for Oregon slender and Ensatina salamanders across two years on 66 forest stands in the Cascade Range, Oregon, USA. All models were fit within a Bayesian framework. Estimator precision in both models improved with increasing numbers of primary and secondary sampling units, underscoring the potential gains accrued when adding secondary sampling units. Both models showed evidence of estimator bias at low detection probabilities and low sample sizes; this problem was particularly acute for the multi-scale model. Our results suggested that sufficient sample sizes at both the primary and secondary sampling levels could ameliorate this issue. Empirical data indicated Oregon slender salamander occupancy was associated strongly with the amount of coarse woody debris (posterior mean = 0.74; SD = 0.24); Ensatina occupancy was not associated with amount of coarse woody debris (posterior mean = -0.01; SD = 0.29). Our simulation results indicate that either model is suitable for use in an experimental study of Plethodontid salamanders provided that sample sizes are sufficiently large. However, hierarchical single-scale and multi-scale models describe different processes and estimate different parameters. As a result, we recommend careful consideration of study questions and objectives prior to sampling data and fitting models.
Citation: Kroll AJ, Garcia TS, Jones JE, Dugger K, Murden B, Johnson J, et al. (2015) Evaluating Multi-Level Models to Test Occupancy State Responses of Plethodontid Salamanders. PLoS ONE 10(11): e0142903. https://doi.org/10.1371/journal.pone.0142903
Editor: Robert F. Baldwin, Clemson University, UNITED STATES
Received: June 12, 2015; Accepted: October 28, 2015; Published: November 30, 2015
This is an open access article, free of all copyright, and may be freely reproduced, distributed, transmitted, modified, built upon, or otherwise used by anyone for any lawful purpose. The work is made available under the Creative Commons CC0 public domain dedication
Data Availability: All relevant data are within the paper and its Supporting Information files.
Funding: Oregon State University Fish and Wildlife in Managed Forests Program, Oregon Forest Industries Council, Port Blakely Tree Farms, and Weyerhaeuser provided funding to collect empirical data. The funders had no role in study design, data collection and analysis, decision to publish, or preparation of the manuscript. The funders provided support in the form of salaries for authors [AJK, TG, JJ, KD, BM, JJ, SP, MR], but did not have any additional role in the study design, data collection and analysis, decision to publish, or preparation of the manuscript. The specific roles of these authors are articulated in the 'author contributions' section.
Competing interests: The authors' commercial affiliation does not alter their adherence to PLOS ONE policies on sharing data and materials.
Introduction
Salamanders are a diverse, widespread, and often abundant group of organisms occurring in a broad range of habitat types and serve critical roles in ecosystem dynamics such as nutrient cycling and food webs [1, 2]. Evolutionary studies often use salamanders as subjects because of their novel morphological adaptations, broad geographic distributions, and ancient phylogenies [3–5]. However, life history traits such as limited dispersal and low reproductive rates, and preferences for cool, moist micro-habitats, can render some species vulnerable to habitat modification, exotic species, and disease, and, as a result, many taxa are threatened or imperiled [6–9].
Ecological roles of most salamander species are poorly documented [10, 11]. Due to their multi-stage life histories and occupancy of structurally complex aquatic and terrestrial habitats, studying salamanders poses a unique set of challenges to investigators [12–14]. Long-term studies, preferably with experimental manipulations, are required to understand mechanistic population responses to disturbances events and gradients [11, 15, 16]. However, in most cases only a portion of the population (e.g., in fossorial or in-stream species) is available for sampling [17–20] as many individuals may be sub-surface during sampling. Also, only a portion of these available individuals will be detected due to variation in environmental conditions (e.g., habitat complexity) and animal behavior [21]. Failure to account for this imperfect detection causes bias in naïve estimators of occupancy and abundance [22, 23]. In addition, regression coefficients for associations and/or effects will likely be biased, leading to ineffective or counter-productive conservation and management prescriptions [22]. Finally, disturbances induced by the sampling process can modify salamander habitat and behavior, and these additional sources of variation can bias inferences and future sampling efforts [17]. As a result, sampling methods and estimators of quantities of interest (e.g., occupancy or abundance) require careful thought and evaluation before deployment [24, 25].
We evaluated sampling designs and models for estimating occupancy and detection probabilities for plethodontid salamanders within the context of a planned before-after control-impact manipulative study, and applied them to empirical data on Oregon slender (Batrachoseps wrighti) and Ensatina (Ensatina eschscholtzii) salamanders as examples. The Oregon slender salamander (BAWR) is endemic to the Oregon Cascades, USA, and is distributed widely in Douglas-fir forests from ~200‒2500 feet in elevation, where the species demonstrates a strong reliance on decayed coarse woody debris [26]. Ensatina salamanders (ENES) are distributed broadly in coniferous and deciduous forests from British Columbia, Canada, to southern California, USA [27].
Occupancy is a state variable of interest in conservation and management studies (e.g., determining species distribution as a function of environmental covariates) and may be defined at different scales. An appropriate choice of scale will depend on factors such as species’ home range and dynamics, scientific or management questions of interest, and sampling constraints [28]. Further, scientific or management interest related to occupancy processes may exist at multiple spatial scales, for example at both local and regional scales [29]. Multi-scale occupancy models are a valuable extension to standard single-scale occupancy models, as they can be used with nested samples to evaluate patterns of habitat selection and use at different spatial scales while accounting for dependencies in occupancy status across scales (i.e., local scale occupancy depends on regional scale occupancy)[30]. Spatio-temporal models are yet another extension of occupancy models that account for variation across time. Several approaches exist to account for temporal variation, including implicit or explicit process dynamics, as well as empirical statistical models used for repeated measures [31–33].
We used simulations to evaluate estimator properties for single-scale and multi-scale occupancy models [29, 34], with empirical correlation structures to account for temporal dependencies, for the analysis of a planned before-after control-impact study. We generated empirical results for the hierarchical single-scale occupancy model using field data for both salamander species and discuss the relative importance of empirical and simulation results in the context of both observational and experimental studies. Researchers and managers can use results from our evaluations to design efficient studies on basic and applied aspects of salamander ecology, conservation, and management. Our results are likely applicable to a broad range of other taxa for which occupancy is evaluated.
Materials and Methods
Site Selection & Sampling
We sampled BAWR and ENES at 66 forested harvest units in the Cascade Range, OR, USA. Weyerhaeuser and Port Blakely Tree Farms LP owned the harvest units and provided permission for sampling to occur after reviewing all sampling procedures and experimental manipulations. Field sampling did not involved threatened or endangered species. Harvest unit age ranged from 35–90 ( = 60; SD = 8) years and size from 20–183 ha ( = 79; SD = 33). Clearcut harvests were used to regenerate all units. Harvest units occurred in one of two study blocks: Clackamas (Clackamas County, OR) or Snow Peak (Linn County, OR)(S1 Fig). We selected harvest units randomly for inclusion within a long-term experimental study of salamander responses to contemporary management prescriptions.
Within each harvest unit, we sampled seven 81 m2 (9×9 m) plots in 2013–2014. We chose plot sizes to incorporate multiple salamander home ranges [28] and to allow systematic searching of habitat features. We selected each 81 m2 plot randomly and sampled over three consecutive 10 minute intervals in a single day (sampling occurred between 0800–1600 and from April-June in 2013 and 2014). During each 10 minute interval, one observer surveyed the plot. Although the same harvest units were sampled in both years, we selected a new set of plots to sample in each year (i.e., each plot received a maximum of three 10 minutes surveys). Spatial and temporal replication was necessary to estimate and incorporate detection into estimates of occupancy [35]. Observers employed a light touch methodology [36, 37] in which all surface objects, including leaf litter and moss blankets on logs, were turned over to observe salamanders. All surface objects were then returned to as close to their original position as possible so as to not negatively impact the habitat quality within a plot for salamanders. Observers did not survey the same objects during each interval, but aimed to sample the entire plot over the course of the three intervals. Salamanders were not handled with this survey method. We followed a removal sampling protocol in which sampling stopped once both species were observed in a plot [31]. During sampling, observers quantified number of pieces of coarse woody debris (CWD; all logs > 25 cm DBH (small end) and > 1 m in length).
Statistical Models
The sampling design described above incorporates replication at two spatial scales (unit-level and plots within units) and two temporal scales (year and visit). Our objective in formulating a statistical model was to estimate patterns of occupancy at relevant scales of interest while accounting for the sampling design and adjusting for imperfect detection. To this end, we considered two different statistical models each representing a distinct occupancy process.
The first approach, which we refer to as the “hierarchical single-scale model”, is a modified version of the basic MacKenzie occupancy model [35], that incorporates random effects to account for both the nested spatial sampling design and repeated measurements from the same unit across time. Specifically, we let uijk denote the occupancy of plot k in year j from harvest unit i, which takes the value of 1 when the plot is occupied and 0 otherwise. We assume that plot-level occupancy follows a Bernoulli distribution uijk ∼ Bern(ψijk), where ψijk is the probability that plot k of harvest unit i during year j is occupied. Within-plot occupancy is assumed to be closed across all visits within a year. We further assume that occupancy probability is defined in terms of a mean model, μijk, and a unit-specific random intercept, b0i: logit(ψijk) = μijk + b0i. The mean model may be parameterized in terms of harvest unit level, year level and plot level covariates, while the random effect imparts a correlation among plots within a harvest unit and among measurements on the same harvest unit over time. We assume independence among plots in different harvest units, and conditional independence among plots within the same harvest unit, given the unit-level random effects. For the detection process, we let yijkl denote the detection status of plot k from harvest unit i during year j and visit l, taking the value of 1 when the species is detected and 0 otherwise. We considered species detection to also follow a Bernoulli distribution yijkl = Bern(uijk ⋅ pijkl), with detection probability pijkl.
The second approach, which we refer to as the “multi-scale” model, is a modified version of methods for multi-scale inference introduced in [29, 34]. In this approach, we allow for two nested but distinct occupancy processes, one acting at the harvest unit level, and the other acting at the plot level, conditional on harvest unit occupancy. We let zij denote the occupancy of harvest unit i in year j, which takes the value of 1 when the harvest unit is occupied and 0 otherwise. We assume that zij follows a Bernoulli distribution zij ∼ Bern(θij) with harvest unit occupancy probability θij. Similar to plot-level occupancy for the single-scale model, we parameterize unit-level occupancy probability in terms of a mean model μij and a unit-specific random intercept, b0i: logit(θij) = μij + b0i. The random intercept in this model accounts for repeated measurements on the same harvest unit across years. We represent the plot-level occupancy as a Bernoulli random variable uijk ∼ Bern(zij ⋅ ψijk). Under this parameterization, plot-level occupancy probability is equal to ψijk when zij = 1, and equal to 0 when zij = 0. The detection process is again considered to follow a conditional Bernoulli distribution yijkl = Bern(uijk ⋅ pijkl), with detection probability pijkl, and where yijkl = 1 if the species is detected and 0 otherwise. Covariates may be included in any of these models, but are restricted to harvest unit and year variables for θij, to plot, year or harvest unit variables for ψijk, and visit, plot, year, or harvest unit variables for pijkl. Under this model, we assume independence among harvest units and conditional independence on measurements of the same harvest unit over time, given the unit-specific random effects. Plots within a harvest unit are also considered conditionally independent given the partially latent occupancy status zij.
Both the hierarchical single-scale and multi-scale models may contain effects acting at different scales. The primary distinction between the two models is the scale at which an assumed occupancy process manifests: strictly at the plot level for the hierarchical single-scale model or separately at both the plot and harvest unit levels for the multi-scale model. For example, in the hierarchical single-scale model, we assume that habitat selection is taking place at the plot scale only, but that factors at a larger scale (the harvest unit) may influence this selection. For the multi-scale model, we assume that two different selection decisions are taking place: at the harvest unit first, and conditional on a harvest unit being selected, a second selection decision is made at the plot scale. Given this distinction, we cannot directly contrast the performance of these models. Instead, we present the results here for each model as two distinct options to be considered for analysis.
Simulation Study
We conducted a simulation study to evaluate the effectiveness of each occupancy model in the context of a Before-After Control-Impact (BACI) [38] experimental design with multiple levels of sampling. Our goal was to assess the impact of sample size on the consistency and efficiency of the treatment effect estimator of a BACI analysis, and to aid in the design of a planned manipulative experiment. We simulated data for both the hierarchical single-scale process and the multi-scale process discussed in the previous section. Models were then fit to data simulated under the assumed model (e.g., the multi-scale model was only fit to data simulated from the multi-scale process). The impact of model misspecification and checks for model misspecification were not considered in this study.
To simulate a BACI design, we assumed that N harvest units would be split evenly among two treatments (“control” and “impact”), and would be followed for two years: one year “before” and one year “after”. Furthermore, n plots were assumed to be placed within each of the N harvest units, with each plot visited m times. In this setting, harvest units represent our primary sampling units and are the experimental units with respect to the treatment. Plots within harvest units are a secondary sampling unit and represent sub-samples with respect to the applied treatment. As with our empirical study, we assumed that different plots within a harvest unit were sampled across years and could therefore be considered conditionally independent sampling units. Unlike our empirical study, the simulation study did not stop visits after the first detection.
The mean model used to simulate occupancy data from the hierarchical single-scale model was of the form: (1) where ψij is the expected plot-level occupancy of year j in harvest unit i, Yearij is an indicator variable for Year = “after”, and Treatmenti is an indicator variable for “impact” stands. The random intercept b0i is assumed to follow a normal distribution with mean = 0 and variance = and is included to account for correlation among plots within a harvest unit. Under this occupancy parameterization, exp(β3) is interpreted as the relative effect of the impact treatment on plot-level occupancy-odds, after controlling for year and stand-level variation. Thus, the parameter β3 can be considered an estimator for the impact treatment effect. For simplicity, no covariates, including treatment effects, were included in the detection probability model.
The mean model used to simulate data from the multi-scale model was of the form: (2) where θij is the expected harvest unit-level occupancy of harvest unit i in year j, Yearij is an indicator variable for Year = “after”, and Treatmenti is an indicator variable for “impact” harvest units. We used constant values for both the conditional plot-level occupancy, ψijk, and detection probability, pijkl, for plot k and visit l. As with the hierarchical single-scale model, the random intercept a0,i is assumed to follow a normal distribution with mean = 0 and variance = Under this parameterization, exp(α3) is interpreted as the relative effect of the impact treatment on harvest unit-level occupancy-odds, after controlling for year and harvest unit-level variation.
We draw attention to the different interpretations of the “treatment effect” in each of these models. Under the hierarchical single-scale model, the treatment effect is estimated for plot level occupancy, whereas under the multi-scale model, the effect is estimated for harvest unit-level occupancy. The two occupancy models correspond to distinct occupancy processes with different interpretations. Consequently, we used different parameter values in our simulation study for the two different models. In each case, we chose parameters to fall within an expected range for our application to BAWR and ENES (Table 1).
We evaluated all parameter combinations in Table 1, giving a total of 90 unique conditions for each model. Each unique condition was simulated 500 times. All datasets were simulated in R [39] and we fit all models using JAGS [40] called from R using package ‘R2jags’ [41]. For both models, we used 3 chains of length 10000 each, with a burnin of 5000 and 1/10 thinning. We checked convergence for occupancy and detection model parameters using the Gelman-Rubin statistic [42]. The results summary included only those simulations where all mean model convergence statistics were less than 1.1. We included the R code used to conduct the simulation study in S1 and S2 Files.
Empirical Analysis
We chose to fit the hierarchical single-scale model to the empirical data collected across two years on 66 harvest units in the Cascade Range, Oregon, USA. These data represent pre-treatment conditions in harvest units that will be included in a long-term BACI design to investigate the impact of timber harvest on BAWR and ENES occupancy. Our goals in this analysis were to obtain preliminary estimates of salamander plot-level occupancy in control conditions; to understand the association between occupancy and coarse woody debris; and to estimate detection probabilities. We could have chosen to fit the multi-scale model to our dataset. However, we selected randomly harvest units to be included in the study from a larger pool of harvest units, all of which were known to be occupied. Consequently, no variation existed in occupancy status at the harvest unit level, and we could not model the upper-level occupancy process. For both BAWR and ENES, we modeled plot-level occupancy as a function of CWD, Year, and Block: (3) where CWDij is the coarse woody debris count for plot j in unit i, Yearij is an indicator for whether plot j was sampled in 2014, and Blocki is an indicator for harvest units in the Clackamas tree farm. We sampled coarse woody debris on plots each year during salamander sampling.
Detection probability was modeled as a quadratic function of Julian date and an indicator variable for year: (4) where Julian Dateijk is the Julian date for visit k of plot j in harvest unit i, and Yearij is an indicator for whether plot j was sampled in 2014. We centered and scaled Julian date prior to analysis.
We fit all models in a Bayesian framework using N(μ = 0, σ2 = 3) priors for the plot-level occupancy and detection probability intercepts, and N(μ = 0, σ2 = 4) priors for occupancy and detection covariates. A Gamma(2,1) prior was used for the random effects standard deviation. We fit the models using JAGS, called from R, with three chains of length 55,000, a burnin of length 5,000, and 1/10 thinning. We assessed convergence using the Gelman-Rubin statistic and visual inspection of the chains [42]. The JAGS model used for this analysis is included in S3 File. The detection data and covariates for both BAWR and ENES are included in S1 and S2 Tables.
Results
Simulations
We used the results of the simulation study to focus on three quantities of interest related to the treatment effect estimator: bias, precision, and coverage probability. We calculated bias as the difference between the true value of the parameter and the expected value of the estimator, and (inverse) precision as the mean posterior standard deviation of the treatment estimator. Coverage probability was calculated as the proportion of times a 95% equal-tail credible interval contained the true value of the treatment effect parameter. Bayesian posterior credible intervals often show close to nominal coverage probabilities in both small and large sample problems [42, 43], and we examine their performance in our results.
Although we fit our models in a Bayesian framework, we would like to understand these frequentist properties of the model estimators, as they provide useful pre-experimental guidance on the use of the models and design considerations. Any Bayesian estimate will, to some extent, depend on the priors used in the analysis. Nevertheless, we consider it instructive to examine these pre-experimental properties of the model estimators, even under a limited set of conditions.
The hierarchical single-scale model showed consistent results for the average posterior mean of the treatment effect estimator for all sample sizes when detection probability was 0.5, but some evidence of bias when detection probability was 0.15 (Fig 1). Bias was generally small (<10%) for all detection probabilities when the number of harvest units was at least 40. Estimator bias for the multi-scale model was strongly dependent on both the number of simulated harvest units as well as the number of plots per harvest unit, particularly at lower detection probabilities (Fig 2). Although bias was generally low when detection probability was 0.5, substantial bias existed with low detection probabilities, particularly in cases with less than 50 harvest units (Fig 2). However, these trends suggest that with sufficient sample size at both the primary and secondary sampling units, approximately unbiased estimates can be obtained for the treatment effect estimator under either model in a BACI design, at least for the prior distributions and range of parameter values considered in this study.
Results are shown on the logit scale. Panels show the results for different combinations by simulated detection probability and post-treatment occupancy. Horizontal dashed lines show the true coefficient value.
Results are shown on the logit scale. Panels show the results for different combinations by simulated detection probability and post-treatment occupancy. Horizontal dashed lines show the true coefficient value.
We observed improved posterior estimator precision with increasing numbers of harvest units, as expected (Figs 3 and 4). Number of plots per harvest unit, which represent subsamples, also played a role in estimator precision by affecting the amount of information available to estimate occupancy of the primary sampling unit. These trends also suggest improved precision with increasing detection probability, but very little impact due to the size of the treatment effect.
Results are shown on the logit scale. Panels show the results for different combinations by simulated detection probability and post-treatment occupancy.
Results are shown on the logit scale. Panels show the results for different combinations by simulated detection probability and post-treatment occupancy.
Coverage probabilities for the single-scale model treatment effect estimator (Table 2, S3 Table) were close to nominal, but somewhat conservative with an average value of 0.97. Coverage probabilities for the multi-scale model treatment effect estimator were highly conservative, with 100% of all intervals containing the true value. This result indicates that the posterior standard deviation of the treatment effect parameter is not a good surrogate for the standard deviation of the estimator sampling distribution. The hierarchical single-scale model (Fig 5, S3 Table) showed generally good calibration between sampling variability and posterior estimates of variability when detection probability was 0.5 for all sample sizes considered in this study. As detection probability decreased, posterior variability tended to be greater than the sampling variability of the treatment estimator. Results from the multi-scale model show, across all conditions included in this study, that the average posterior variability is substantially greater than the sampling variability of the treatment estimator (Fig 6). The trends for the multi-scale model (S4 Table) suggest that posterior credible intervals for the treatment effect estimate will tend to be highly conservative, exceeding nominal coverage rates and limiting the efficiency of estimating the sign and magnitude of putative treatment effects.
Results are shown only for combinations with seven plots. Results for five and nine plots are not shown, but have trends similar to those shown here. Panels show the results for different combinations of detection probability and post-treatment occupancy.
Results are shown only for combinations with seven plots. Results for five and nine plots are not shown, but have trends similar to those shown here. Panels show the results for different combinations of detection probability and post-treatment occupancy.
Results in this table are limited to combinations with detection probability of 0.3 and seven plots. Results for other conditions are consistent with these trends (S3 and S4 Tables).
All of the above results are based on a single set of prior distributions, chosen beforehand to reflect subject matter knowledge regarding plethodontid salamander behavior. Although we think that this choice of priors is appropriate for our current application, these priors are potentially informative and have the potential to affect estimator properties such as bias and precision. At the suggestion of a reviewer, we repeated our simulation study with relaxed priors—N(μ = 0, σ2 = 10) for covariate parameters—for a reduced set of the conditions in Table 1. Results for this simulation study are summarized in S5 and S6 Tables. The hierarchical single-scale model with relaxed priors showed a negative bias in the treatment estimator expected value across most conditions (Fig 7). The magnitude of bias was affected by both sample size and detection probability, showing relatively small bias with moderate detection probabilities and larger sample sizes, and substantial bias with low detection probabilities. Results for the multi-scale model with relaxed priors showed a mix of positive and negative bias depending on the simulation conditions (Fig 8). Under both the single-scale and multi-scale models, estimator precision and coverage probabilities using the relaxed priors were substantially the same as with our original choice of priors (S5 and S6 Tables).
Note we ran only a subset of cases in Table 1 for this analysis. Results are shown on the logit scale. Panels show the results for different combinations by simulated detection probability and post-treatment occupancy. Horizontal dashed lines show the true coefficient value. Prior sensitivity can be assessed by comparing with Fig 1.
Note we ran only a subset of cases in Table 1 for this analysis. Results are shown on the logit scale. Panels show the results for different combinations by simulated detection probability and post-treatment occupancy. Horizontal dashed lines show the true coefficient value. Prior sensitivity can be assessed by comparing with Fig 2.
Empirical Evaluations
Estimated plot-level mean occupancies from the hierarchical single-scale model were higher in the Clackamas tree farm than the Snow Peak tree farm for BAWR (0.66 vs. 0.43), but lower for ENES (0.20 vs. 0.63)(Table 3). The results also showed a strong association between coarse woody debris count and BAWR plot-level occupancy. We estimated approximately 35% higher odds of occupancy for each additional piece of CWD found in a plot. In contrast, no clear association existed between ENES occupancy and CWD.
All estimates are reported on the logit scale to two significant digits. Estimates for coarse woody debris (CWD) and Julian date (JD) are based on standardized covariates.
Detection probabilities were estimated to be lower in 2014 than in 2013 for both BAWR (0.21 vs. 0.33) and ENES (0.09 vs. 0.14)(Table 3). Detection probabilities tended to increase later in the season for BAWR, but decrease later in the season for ENES.
Discussion
Diverse life history strategies and behavioral variation make Plethodontid salamanders challenging organisms to study [17, 19, 44]. Evaluation of sampling designs and estimators for responses of interest are critical to provide strong inference about Plethodontid ecology and responses to conservation and management activities. We used simulations to evaluate performance of two candidate occupancy models in an analysis of a BACI experiment, and to set appropriate sample sizes for the experiment. Our results indicated expected improvements in precision with increasing numbers of primary sampling units, but also highlighted the potential gains accrued when adding secondary sampling units. Importantly, our results identified conditions that could suffer from excessive estimator bias under particular choices of prior distributions. Both models showed evidence of estimator bias at low detection probabilities and low sample sizes, a problem that was also dependent on the choice of prior distribution. However, the results also suggested that sufficient sample size at both the primary and secondary sampling levels could ameliorate this issue.
Another insight from this study relates to the calibration between posterior uncertainty and sampling variability. Under some conditions, good calibration existed between these two quantities for the hierarchical single-scale model, suggesting that posterior credible intervals could have a similar pre-experimental interpretation as confidence intervals. However, in all cases studied for the multi-scale model and in many cases for the hierarchical single-scale model, the posterior variability was substantially greater than the sampling variability. Under these conditions, interpreting credible intervals as confidence intervals appears unjustified and Bayesian interpretation of results is preferred.
Based on the simulation results, we expanded the size of our multi-year BACI experiment from 45 to 66 harvest units and maintained seven plots per harvest unit. This change should help to reduce estimator bias and improve precision.
The results of the simulation study are limited in several ways. Our study did not fully address the sensitivity of results to the choice of priors. Our limited comparison of two different prior distributions highlights sensitivity of estimator properties to these choices, even with priors that may be considered ‘reasonable’ based on subject matter considerations. We therefore stress the importance of evaluating model performance within applied contexts prior to fitting the model to data, as well as the need for posterior model checking.
We did not consider issues of lack-of-fit, variable selection, or the impact of model misspecification on estimator performance. These issues are important, but we considered them to be outside the scope of the study. In our particular application, some of these issues are less of a concern, as certain aspects of the model will be determined by the sampling design. For example, all design-based factors will be included in the occupancy model(s), using a pre-determined functional form, regardless of statistical ‘significance’.
We did not address the question of how one might select between the two models that we explored in the simulation study. As the two models represent two distinct processes for occupancy, scientific judgment can and should be considered in the choice of model. In cases such as our empirical example, the choice of models may be guided by species distribution considerations apparent in the raw data. When limited background information is available to support one model over the other, or when a goal of the analysis is to infer which model is a better descriptor of the data, a range of options are available for Bayesian model selection and comparison [45, 46]. In all cases, it is incumbent on the analyst to perform appropriate diagnostic checks.
In the empirical analysis, we wanted to obtain preliminary estimates of salamander occupancy and detection under control conditions; examine the association between salamander occupancy and coarse woody debris; and determine suitability of models for evaluation of data collected within a BACI experimental design. Our results suggest higher plot-level occupancy of Oregon slender salamander than ENES at Clackamas; the reverse is true for Snow Peak. Our results also helped to quantify the positive association between BAWR occupancy and CWD count, a trend which had been observed during stand pre-selection. Oregon slender salamanders are thought to rely on CWD for foraging and nesting substrates, as opposed to the more generalist affinities of ENES [26]. However, we note that both BAWR and ENES appear to be relatively abundant on our study sites. For organisms occurring as relatively abundant and well-distributed populations, estimating effects of habitat change may be difficult if the response variable is occupancy as sample units may remain occupied even if population size changes substantially.
Finally, the detection probability estimates for both BAWR and ENES were similar to conditions examined in the simulation study, and suggest low bias for the BACI treatment-effect estimator under either model (and our choice of priors), given that we have included 66 harvest units in our study. However, due to relatively low detection probabilities, we can anticipate that the multi-scale model treatment-effect credible intervals for ENES will be substantially larger than the estimator sampling variability. Consequently, the intervals can be expected to exceed nominal coverage, and will be conservative if used to assess evidence of a putative treatment effect.
Supporting Information
Additional supporting information may be found in the online version of this article.
S1 Fig. Geographic distribution of Oregon slender (Batrachoseps wrighti) and Ensatina (Escholtzii ensatina) salamanders and ownership boundaries within study area, Oregon, USA, 2013–2014.
The Clackamas block occurs in Clackamas County; the Snow Peak block occurs in Linn County.
https://doi.org/10.1371/journal.pone.0142903.s001
(EPS)
S1 File. R code for the simulation study for 'hierarchical single-scale’ BACI occupancy model.
https://doi.org/10.1371/journal.pone.0142903.s002
(DOCX)
S2 File. R code for the simulation study for 'multi-scale' BACI occupancy model.
https://doi.org/10.1371/journal.pone.0142903.s003
(DOCX)
S3 File. R code for empirical analyses of Oregon slender and ensatina salamander occupancy, Oregon Cascades, US, 2013–2014.
https://doi.org/10.1371/journal.pone.0142903.s004
(DOCX)
S1 Table. Detection data and covariates for Oregon slender salamanders, Oregon Cascades, USA, 2013–2014.
https://doi.org/10.1371/journal.pone.0142903.s005
(CSV)
S2 Table. Detection data and covariates for Ensatina salamanders, Oregon Cascades, USA, 2013–2014.
https://doi.org/10.1371/journal.pone.0142903.s006
(CSV)
S3 Table. Full results of simulation study for ‘hierarchical single-scale’ BACI occupancy model.
https://doi.org/10.1371/journal.pone.0142903.s007
(CSV)
S4 Table. Full results of simulation study for ‘multi-scale’ BACI occupancy model.
https://doi.org/10.1371/journal.pone.0142903.s008
(CSV)
S5 Table. ‘Hierarchical’ BACI occupancy model simulation summary with alternative priors.
https://doi.org/10.1371/journal.pone.0142903.s009
(CSV)
S6 Table. ‘Multi-scale’ BACI occupancy model simulation summary with alternative priors.
https://doi.org/10.1371/journal.pone.0142903.s010
(CSV)
Acknowledgments
We thank Jack Giovanini, Kathryn Irvine, and two anonymous reviewers for helpful reviews of previous versions of this manuscript. Any use of trade, firm, or product names is for descriptive purposes only and does not imply endorsement by the U.S. Government.
Author Contributions
Conceived and designed the experiments: AJK TSG JEJ KD BM BB. Performed the experiments: JJ SP MR. Analyzed the data: AJK JEJ KD. Contributed reagents/materials/analysis tools: AJK TSG JEJ KD BM JJ SP BB MR. Wrote the paper: AJK TSG JEJ KD.
References
- 1. Best ML, Welsh HH. The trophic role of a forest salamander: impacts on invertebrates, leaf litter retention, and the humification process. Ecosphere. 2014;5(2):16.
- 2. Homyack JA, Sucre EB, Haas CA, Fox TR. Does Plethodon cinereus affect leaf litter decomposition and invertebrate abundances in mixed oak forest? Journal of Herpetology. 2010;44(3):447–56.
- 3. Vieites DR, Nieto-Román S, Wake DB. Reconstruction of the climate envelopes of salamanders and their evolution through time. Proceedings of the National Academy of Sciences. 2009;106:19715–22.
- 4. Hanken J, Wake DB. Miniaturization of body size: organismal consequences and evolutionary significance. Annual Review of Ecology and Systematics. 1993;24:501–19.
- 5. Reagan NL, Verrell PA. The evolution of plethodontid salamanders: did terrestrial mating facilitate lunglessness? The American Naturalist. 1991;138(5):1307–13.
- 6. Otto CRV, Roloff GJ, Thames RE. Comparing population patterns to processes: abundance and survival of a forest salamander following habitat degradation. PLOS ONE. 2014;9(4):e93859. pmid:24718498
- 7. Semlitsch RD, Conner CA, Hocking DJ, Rittenhouse TAG, Harper EB. Effects of timber harvesting on pond-breeding amphibian persistence: testing the evacuation hypothesis. Ecol Appl 2008;18(2):283–9. pmid:18488596
- 8. Maret TJ, Snyder JD, Collins JP. Altered drying regime controls distribution of endangered salamanders and introduced predators. Biol Cons. 2006;127:129–38.
- 9.
Petranka JW. Salamanders of the United States and Canada: Washington, D.C., USA.; 1998.
- 10. Davic RD, Welsh HH Jr. On the ecological role of salamanders. Annual Review of Ecology, Evolution, and Systematics. 2004;35:405–34.
- 11. Kroll AJ. Sources of uncertainty in stream-associated amphibian ecology and responses to forest management in the Pacific Northwest, USA: a review. For Ecol Manage. 2009;257:1188–99.
- 12. Kroll AJ, Hayes MH, MacCracken JG. Concerns regarding the use of amphibians as metrics of critical biological thresholds: a comment on Welsh and Hodgson 2008. Freshwater Biology. 2009;54:2364–73.
- 13. Grant EHC, Nichols JD, Lowe WH, Fagan WF. Use of multiple dispersal pathways facilitates amphibian persistence in stream networks. Proc Natl Acad Sci U S A. 2010;107(15):6936–40. pmid:20351269
- 14. Otto CRV, Kroll AJ, McKenny HC. Amphibian response to downed wood retention in managed forests: a prospectus for future biomass harvest in North America. For Ecol Manage. 2013;304:275–85.
- 15. Homyack JA, Kroll AJ. Slow lives in the fast landscape: conservation and management of Plethodontid salamanders in production forests of the United States. Forests. 2014;5:2750–72.
- 16. Lindström J, Reeve R, Salvidio S. Bayesian salamanders: analysing the demography of an underground population of the European plethodontid Speleomantes strinatii with state-space modelling. BMC Ecology. 2010;10:4. pmid:20122249
- 17. Otto CRV, Bailey LL, Roloff GJ. Improving species occupancy estimation when sampling violates the closure assumption. Ecography. 2013;36.
- 18. Tilley SG. Life histories and comparative demography of two salamander populations. Copeia. 1980;4:806–21.
- 19. O’Donnell KM, Thompson FR III, Semlitsch RD. Partitioning detectability components in populations subject to within-season temporary emigration using binomial mixture models. PLoS ONE. 2015;10(3):e0117216. pmid:25775182
- 20. Kroll AJ, Risenhoover KL, McBride T, Beach E, Kernohan BJ, Light J, et al. Factors influencing stream occupancy and detection probability parameters of stream-associated amphibians in commercial forests of Oregon and Washington, USA. For Ecol Manage. 2008;255:3726–35.
- 21. Bailey LL, Simons TR, Pollock KH. Estimating detection probability parameters for plethodon salamanders using the robust capture-recapture design. The Journal of Wildlife Management. 2004;68(1):1–13.
- 22. Archaux F, Henry P-Y, Gimenez O. When can we ignore the problem of imperfect detection in comparative studies? Methods in Ecology and Evolution. 2012;3:188–94.
- 23. Guillera-Arroita G, Lahoz-Monfort JJ, MacKenzie DI, Wintle BA, McCarthy MA. Ignoring imperfect detection in biological surveys is dangerous: a respond to 'Fitting and Interpreting Occupancy Models'. PLOS ONE. 2014;9(7):e99571. pmid:25075615
- 24. MacKenzie DI, Royle JA. Designing occupancy studies: general advice and allocation survey effort. Journal of Applied Ecology. 2005;42:1105–14.
- 25. McIntyre AP, Jones JE, Lund EM, Waterstrat FT, Giovanini JN, Duke SD, et al. Empirical and simulation evaluations of an abundance estimator using unmarked individuals of cryptic forest-dwelling taxa. For Ecol Manage. 2012;286:129–36.
- 26.
Clayton DR, Olson DH. Conservation assessment for the Oregon Slender Salamander (Batrachoseps wrighti), Version 1.0: USDA Forest Service, Region 6, and Oregon BLM, Special Status Species Program; 2007. Available from: http://www.fs.fed.us/r6/sfpnw/issssp/planning-documents/assessments.shtml.
- 27.
Jones LLC, Leonard WP, Olson DH, editors. Amphibians of the Pacific Northwest. Seattle, Washington, USA: Seattle Audubon Society; 2005.
- 28. Efford MG, Dawson DK. Occupancy in continuous habitat. Ecosphere. 2012;3(4):32.
- 29. Pavlacky DC Jr., Blakesley JA, White GC, Hanni DJ, Lukacs PM. Hierarchical multi-scale occupancy estimation for monitoring wildlife populations. J Wildl Manage. 2012;76(1):154–62.
- 30. Mordecai RS, Mattsson BJ, Tzilkowski CJ, Cooper RJ. Addressing challenges when studying mobile or episodic species: hierarchical Bayes estimation of occupancy and use. Journal of Applied Ecology. 2011;48:56–66.
- 31.
MacKenzie DI, Nichols JD, Royle JA, Pollock KJ, Bailey LL, Hines JE. Occupancy estimation and modeling: inferring patterns and dynamics of species occurrence. San Diego, California, USA: Elsevier; 2006.
- 32.
Diggle PJ, Heagerty PJ, Liang KY, Zeger SL. Analysis of longitudinal data. Oxford, England: Oxford University Press; 2002.
- 33. Pearson SF, Giovanini J, Jones J, Kroll AJ. Breeding bird community continues to colonize riparian buffers ten years after harvest. PLOS ONE. 2015;in press.
- 34. Nichols JD, Bailey LL, O’Connell AF Jr., Talancy NW, Grant EHC, Gilbert AT, et al. Multi-scale occupancy estimation and modelling using multiple detection methods. Journal of Applied Ecology. 2008;45:1321–9.
- 35. MacKenzie DI, Nichols JD, Lachman GB, Droege S, Royle JA, Langtimm CA. Estimating site occupancy rates when detection probabilities are less than one. Ecology. 2002;83(8):2248–55.
- 36. Hayes MP, Quinn T, Dugger DJ, Hicks TL, Melchiors MA, Runde DE. Dispersion of coastal tailed frog (Ascaphus truei): a hypothesis relating occurrence of frogs in non-fish-bearing headwater basins to their seasonal movements. Journal of Herpetology. 2006;40(4):533–45.
- 37. Kroll AJ, MacCracken JG, McBride TC, Bakke J, Light J, Peterson P, et al. Basin-scale surveys of stream-associated amphibians in intensively managed forests. J Wildl Manage. 2010;74(7):1580–7.
- 38. McDonald TL, Erickson WP, McDonald LL. Analysis of count data from Before-after control-impact studies. Journal of Agricultural, Biological, and Environmental Statistics. 2000;5(3):262–79.
- 39.
R Development Core Team. R: A language and environment for statistical computing: R Foundation for Statistical Computing, Vienna, Austria; 2010.
- 40.
Plummer M, editor JAGS: A program for analysis of Bayesian graphical models using Gibbs sampling. 3rd International Workshop on Distributed Statistical Computing (DSC 2003); 2003.
- 41.
Su Y-S, Yajima M. R2jags: a package for running Jags from R. http://CRAN.R-project.org/package=R2jags. R package version 0.03–08 ed2012.
- 42.
Gelman A, Carlin JB, Stern HS, Rubin DB. Bayesian data analysis, 2nd edition: Chapman & Hall/CRC, Boca Raton, FL, USA; 2004.
- 43.
Hoff PD. A first course in Bayesian statistical methods: Springer, New York, New York, USA; 2009.
- 44. Kroll AJ, Runge JP, MacCracken JG. Unreliable amphibian population metrics may obfuscate more than they reveal. Biol Cons. 2009;142:2802–6.
- 45. Hooten MB, Hobbs NT. A guide to Bayesian model selection for ecologists. Ecol Monogr. 2015;85(1):3–28.
- 46. Linden DW, Roloff GJ. Improving inferences from short-term ecological studies with Bayesian hierarchical modeling: white-headed woodpeckers in managed forests. Ecology and Evolution. 2015;5(16):3378–88. pmid:26380671