Alien Invasive Slider Turtle in Unpredicted Habitat: A Matter of Niche Shift or of Predictors Studied?

Background Species Distribution Models (SDMs) aim on the characterization of a species' ecological niche and project it into geographic space. The result is a map of the species' potential distribution, which is, for instance, helpful to predict the capability of alien invasive species. With regard to alien invasive species, recently several authors observed a mismatch between potential distributions of native and invasive ranges derived from SDMs and, as an explanation, ecological niche shift during biological invasion has been suggested. We studied the physiologically well known Slider turtle from North America which today is widely distributed over the globe and address the issue of ecological niche shift versus choice of ecological predictors used for model building, i.e., by deriving SDMs using multiple sets of climatic predictor. Principal Findings In one SDM, predictors were used aiming to mirror the physiological limits of the Slider turtle. It was compared to numerous other models based on various sets of ecological predictors or predictors aiming at comprehensiveness. The SDM focusing on the study species' physiological limits depicts the target species' worldwide potential distribution better than any of the other approaches. Conclusion These results suggest that a natural history-driven understanding is crucial in developing statistical models of ecological niches (as SDMs) while “comprehensive” or “standard” sets of ecological predictors may be of limited use.


Introduction
Alien invasive species are a concern in nature conservation as they may have a negative impact on native biodiversity [1]. To learn about the capability or risk of alien invasive plants and animals, Species Distribution Models (SDMs) are a powerful tool. A SDM characterizes the ecological niche of a species, based on ecological predictors recorded at the known distribution, and projects it into geographic space uncovering its potential distribution [2][3][4][5]. In recent times, there have been numerous examples in which SDMs were applied to identify areas which are suitable to certain alien invasive species. These generally aimed on climatic suitability, i.e. the species' climate envelopes [6][7][8][9][10]. In these studies, the climate envelope was understood as a part of a species' fundamental niche, which is the entirety of abiotic and biotic conditions under which it can persist [5,11]. The portion of the fundamental niche exploited by a species is commonly limited by interactions with other species (e.g. competition, predation) as well as by spatial accessibility (e.g. through presence/absence of physical barriers) ( Figure 1A) [12,13]. It is known that fundamental niches are subject to evolution. In a recent review, it has been shown that, independent of the taxonomic group, the fundamental niche can remain stable for tens of thousands of years or that it can substantially shift within only a few generations [14]. However, there is still a considerable lack of knowledge regarding the processes triggering niche shifts.
It has been pointed out that the establishment and geographic range extension alien invasive species can provide valuable insights into ecological and evolutionary processes [15]. Indeed, some recent studies have addressed the question of rapid ecological niche shifts during invasion processes. Using SDMs, it was found that in the Spotted knapweed (Centaurea maculosa) the climate envelopes in its native range (western North America) differed from its invasive range in Europe [16]. Similarly, it was demonstrated in a SDM approach that Fire ants (Solenopis invicta) can be ascribed to climate envelopes in their invaded range (North America) from which they are absent in their native South American range [17]. These observations could represent a shift either in the fundamental ( Figure 1B) or realized niches ( Figure 1C). Since alien invasive species, by definition, access areas they were absent from before, the 'new' climate envelope might most likely simply represent a better exploitation of the existing fundamental niche ( Figure 1C). To the best of our knowledge, information on the physiological limits of Centaurea maculosa and Solenopis invicta is sparse. Hence, it cannot be ruled out that the climate predictors chosen in previous approaches mentioned [16,17] are not physiologically limiting for the native range borders of these species.
The striking question arising is genetic novelty (niche evolution) versus a better insight into the existing fundamental niche breadth. We claim that this should be more properly addressed when applying SDMs. Some authors have argued that SDM approaches using observed distributions for model computation per se rather reflect the realized than fundamental niche [18]. That may per se cause errors when projecting SDMs into new areas, since suitable areas may be excluded although being physiologically suitable for the target species. Modeling should thus focus on the physiological limits of species for maximum predictions. Without this information, many of the observed mismatches (or 'niche shifts') might simply be artifacts caused by a choice of unsuitable predictors. We hypothesize that a selection of predictors aiming at a description or even at a complete depiction of the climatic conditions in the native range may be less useful for statistical model training than predictor selections based on a mechanistic understanding of physiologically limiting factors.
So far, only a few studies have tried to model the fundamental niche of a species without using distribution records. In a comprehensive study physiological measurements of the Australian gecko Heteronotia binoei were combined with high-resolution climatic data to calculate the species' climate envelope and to project it into geographic space [19]. A similar study was undertaken on Cane toads (Rhinella marina) in Australia where it is an alien invasive species [20]. Such mechanistic approaches, with no doubt, are superior to the commonly used empirical methods. However, detailed information on the physiology and natural history traits required to fully address the fundamental niches from a mechanistic point of view will remain unavailable for most of the species on our planet. At least the predictors with physiological relevance are known in some species. Accordingly, empirical records and statistical models will remain a starting point, with, as we hypothesize, predictor sets based on natural history providing the most successful calibrations.
In order to test this hypothesis, the Slider turtle (Trachemys scripta Schoepff) may be a suitable study organism. It is an alien invasive species in many parts of the world and its ecology has been thoroughly studied. Between 1989 and 1997, about 52 million individuals were produced in the United States for the foreign pet trade [21]. Released by pet owners, it has established feral populations in many different regions of the world [22][23][24][25]; see IUCN Invasive Species Specialist Group (http://www.issg.org, search for 'Trachemys scripta elegans'). At the same time, the natural history (including thermal tolerance, reproduction and physiology) of the Slider turtle has been the object of numerous studies [26][27][28][29][30][31][32][33][34][35][36][37], providing the basis for a natural history-driven modeling approach.

Climate Data
Our climate information stems from WorldClim 1.4 [39], which is based on climate conditions in the period 1950-2000 at a spatial resolution of about 161 km. It was created by interpolation using a thin-plate smoothing spline of observed climate at weather stations, with latitude, longitude and elevation as independent variables (http://fennerschool.anu.edu.au/publications/software/). The climate data set was downloaded from the DIVA-GIS homepage (http://www.diva-gis.org), i.e. 36 monthly mean variables (minimum temperature, maximum temperature and precipitation). Based on these data, we calculated 19 'bioclimate' variables for further processing with DIVA-GIS 5.4 [38]; see Figure 2 and Table  S1. DIVA-GIS provide the opportunity to plot the cumulative frequency of distribution records according to 'bioclimate' variables. This allowed us to compare the climatic tolerance between the native and invasive distributions of the Slider turtle for all 19 'bioclimate' variables.

Selection of Climate Predictors
We chose three sets of variables as predictors for building SDMs: 'comprehensive' set: all 19 'bioclimate' layers depicting the most comprehensive climatic pattern following the approach of different authors running SDM [6,16]; 'minimalistic' set: a subset of seven variables out of the 'comprehensive' data set defining the availability of thermal energy and water (e.g. the minimum, maximum and mean values at the species records) as applied to different taxa [17,[40][41][42]; 'natural history' set: a subset of five variables out of the 'comprehensive' set aiming at reflecting the physiological limits of the Slider turtle's climate envelope (see Results). To be clear, we did not use these variables to map the known physiological limits. The variables were used as predictors in the same way as the other sets. In addition, we analyzed each 100 random subsets of seven and five 'bioclimate' variables, respectively, to test the null hypotheses that our selected variable sets 'minimalistic' and 'natural history' do not predict the potential distribution of invasive populations better than any random set consisting of the same number of variables. All sets, including the 'minimalistic' and 'natural history' sets, were extracted from the same set of WorldClim variables.
The Slider turtle strongly depends on continuous availability of water throughout the year, whereby almost any kind of water body is suitable (http://nas.er.usgs.gov/queries/FactSheet.asp? speciesID = 1259). Therefore, it is not surprising that the southwestern limit of its native range border is found in the North American deserts, which are characterized by low precipitation throughout the year (http://nas.er.usgs.gov/queries/FactSheet. asp?speciesID = 1259). To take this into account, we included the 'annual precipitation' and the 'precipitation of the driest quarter' in our data set for SDMs. It has been demonstrated that the feeding behavior and digestive turnover rates in the Slider turtle are strongly temperature-dependent. At body temperature ,10uC the species does not feed anymore [25,43]. Hence, in accordance with a positive energetic balance over the year, we added the 'annual mean temperature' into our modeling approaches. The physiologically determined minimum equates with the minimum value recorded within the native range (8.3uC; see Table S1). It has been shown that the upper avoidance temperature is around 37uC [32] which is remarkably similar to the upper limit of the 'maximum temperature of the warmest month' recorded within the native range (i.e. 37.4uC; see Table S1). To account for this we included the 'maximum temperature of the warmest month' in SDM approaches.
Adult Slider turtles commonly hibernate at the bottom of icebound water bodies being largely insulated against cold air.
They maintain a body temperature of approximately 4uC, which makes the species insensitive to cold winters. Nevertheless, Slider turtle records from Illinois were compared to those from eastern Iowa with contours identifying locations where frost penetrates to a depth of 12 cm in 11 out of 14 winters and found a strong relationship [34]. In colder parts of the native range, Slider turtle neonates hibernate inside their nests and are sensitive to temperatures below 20.6uC, at which they die; see also [36]. As a consequence, adult Slider turtles hibernating in water may tolerate frost, but neonates in nests may be negatively affected by frost. The native range of our study species to the north is therefore reasonably defined by minimum temperatures during winter. Considering this relationship, we included 'minimum temperature of the coldest month' when computing SDMs.

Computation of SDMs
For the SDM building we used Maxent 3.2.19 [44] (http:// www.cs.princeton.edu/,shapire/maxent), a machine-learning algorithm following the principles of maximum entropy. It has been shown to reveal better SDM results than other comparable methods [2,45,46]. A disadvantage of Maxent is that it is a 'black box' method. Since results can remarkably vary between different algorithms, we compared Maxent results with those obtained from a second algorithm BIOCLIM [47,48], as implemented in DIVA-GIS. BIOCLIM develops SDMs by intersecting the ranges inhabited by the species along each environmental axis. An advantage of this method is that results are completely transparent for interpretation.
Clumped records can violate the statistical independence of observations and therefore assumptions of SDMs [49]. To account for this we extracted all 'bioclimate' values from the native distribution records and performed a cluster analysis with XLSTAT 2008 of Addinsoft (http://www.xlstat.com) in order to remove redundant information in the data set. XLSTAT allows to blunt cluster classes at a predefined threshold of similarity (herein 99.9%), and calculates mean values for each resulting class. These class means were used for further processing in SDMs.
DIVA-GIS allows for model testing by calculation of the Area Under the Curve (AUC), referring to the Receiver Operation Characteristic (ROC) curve by using a subset of data (commonly 25-30%) as test points and the remaining ones as training points [45,50]. Independent validation (i.e. with invasive records) was suggested to be superior to data splitting [2]; therefore, we used all invasive Slider turtle records as subsets and in a second run 25% of the native records. Because absence data are lacking, DIVA-GIS uses a set of random pseudo-absence points [38]. AUC calculation is recommended for ecological applications because it is nonparametric. Values of AUC range from 0.5 for models with no predictive ability to 1.0 for models giving perfect predictions and, according to a given classification [51], AUC values .0.9 describe 'very good', .0.8 'good' and .0.7 'useable' discrimination ability. However, the reliability of AUC validation in ecological modeling has recently been questioned [52]. AUC values depend on the predicted degree of sites occupied by a species within the study area (i.e. AUC values for models describing the potential distribution of generalists are commonly lower than those computed for specialists). Nevertheless, relative comparisons of AUC values are useful for comparisons within areas of the same extend and when applying the same set of random background points.
For thresholds derived from the natural history and physiological traits describing the climate envelope of the Slider turtle, it is important to reduce the contribution of variables to their upper or lower tails, respectively. This is reasonable considering the limiting function of the 'minimum temperature of the coldest month', which may kill neonates. Here, only the lower tail has a biological meaning, but warmer temperatures may provide no disadvantage for the species. In BIOCLIM this kind of function is implemented directly, but is unfortunately absent in Maxent. Therefore, we used grids of each variable containing categorical classes between the upper or lower limits and the mean of the variables within the native range of the Slider turtle for Maxent runs. For parts of a grid representing the biologically meaningless tail, values greater or smaller than the mean of the variable within the native range were combined into a single category. These procedures remove the influence of meaningless tails during Maxent runs.
The logistic output of Maxent is a continuous map which allows fine distinctions to be made between the modeled suitability of different areas. Maxent calculates a threshold value at each run [44]. Values greater than this threshold may be interpreted as reasonable approximation of a species' potential distribution, but the higher a Maxent value, the better the prediction and therefore the climatic suitability for a species.
Six types of areas are mapped in the BIOCLIM output: areas outside the 0-100 percentile climatic envelope of the species for one or more 'bioclimate' variables are considered unsuitable, grid cells within the 0-2.5 percentile have a 'low' climatic suitability, those within the 2.5-5 percentile a 'medium', those within the 5-10 percentile a 'high', those within the 10-20 percentile a 'very high' and cells within the 20-100 percentile an 'excellent' climatic suitability [38]. Figure 2 compares each of the 19 'bioclimate' variables of the native and invasive ranges of the Slider turtle, respectively. Ranges of variables observed in invasive populations which exceed those observed in native ones can be interpreted as shifts in niche dimension. Ranges in the following variables were most similar in native and invasive ranges: 'annual mean temperature', 'mean temperature of the wettest quarter', 'mean temperature of the driest quarter', 'annual precipitation', 'precipitation of the driest month', 'precipitation of the driest quarter' and 'precipitation of the coldest quarter'. The highest dissimilarity was found in 'isothermality', 'temperature seasonality', 'annual temperature range', 'minimum temperature of the coldest month' and 'mean temperature of the coldest quarter'. Lower temperature limits in the native and invasive ranges were almost equal for 'annual mean temperature', 'isothermality', 'minimum temperature of the coldest month', 'mean temperature of the wettest quarter' and 'mean temperature of the driest quarter', but the upper limits within the invasive range frequently exceeded those of the native range.

Results
Areas meeting all climatic requirements of the species according to the expected physiological limits of the Slider turtle are mapped in Figure 3. Areas where any of the proposed climatic variables are outside the physiological limit of the species were excluded. The remaining area is highly coincident with the native range as well as records of native and invasive populations (AUC native = 0.849; AUC invasive = 0.795).
Applying the 'comprehensive set' of 'bioclimate' variables to SDM calculation predicted the native range in a way which matched the known natural distribution of the Slider turtle in both Maxent and BIOCLIM models. However, the models largely failed to predict populations elsewhere in the world due to overfitting ( Figure 4A, see also Figure S1; Maxent AUC native = 0.991; AUC invasive = 0.716; BIOCLIM AUC native = 0.990; AUC invasive = 0.547). Using the 'minimalistic' subset of 'bioclimate' variables, SDM accuracy within the native range was reasonably met. However, predictions for invasion of the Slider turtle outside North America remained poor ( Figure 4B; see also Figure S1; Maxent AUC native = 0.989; AUC invasive = 0.702; BIOCLIM AUC native = 0.988; AUC invasive = 0.535). In contrast, only the results for the 'natural histroy' subset of variables met both native and invasive potential distributions of the Slider turtle ( Figure 4C; see also Figure S1; Maxent AUC native = 0.974; AUC invasive = 0.861; BIOCLIM AUC native = 0.974; AUC invasive = 0.757).
The randomly selected subsets of five and seven 'bioclimate' variables revealed that all models were 'very good' in describing the native range (AUC seven variables 0.987-0.994; AUC five variables 0.977-0.994; Figure 5, Figure 6), which is slightly better then our models derived from the 'natural history' set. Comparing the predictive performance of the models outside the Slider turtle's native range, selection of a lower number of variables was associated with a broader area classified as suitable in a limited number of models (,10%). The AUC value of our model for invasive records derived from natural history criteria was higher than all AUC values obtained from random variable selection confirming a better prediction ability (AUC ranges seven random variables: native: 0.987-0.994, invasive: 0.587-0.847, AUC ranges five random variables: native: 0.977-0.994, invasive: 0.569-0.855; AUC data set 'natural history' = 0.861). In both random iterations, invasive records were less frequently captured than invasive records at the same latitudes as the native records ( Figure 6A, B). This applies especially to records situated at lower Figure 3. Worldwide occurrence of the Slider turtle. Shown is presence in the native range (green dots) and in the invasive range where the Slider turtle is known to reproduce (red dots) whereas areas considered as suitable to the species with respect to physiological limits as described in the text are indicated in black. doi:10.1371/journal.pone.0007843.g003 latitudes (between 26u N and S corresponding to the southernmost known native records). This latitudinal decrease in predictive performance was confirmed when testing the models using only invasive records between 26u N and S as test points (N = 62; Figure 6D, E; AUC range seven random variables: 0.356-0.708; AUC range five random variables: 0.279-0.749), whereas our model derived from natural history criteria performed well here ( Figure 6C, AUC = 0.795). Thus, the vast majority of models did not capture the Slider turtle's actual climate envelope.

Discussion
Our results provide evidence that the observation of an apparent 'niche' (i.e. climate envelope) shift in the Slider turtle strongly depends on the choice of the variables applied during modeling. The observed range of a species reflects multiple determinants, including climatic tolerances, biotic interactions, equilibrium with climate and dispersal limitation. Hence, niche-based models derived from distribution alone will predict the geographic equivalent of the realized niche rather than the potential range of a species [18]. A SDM derived from the realized niche may therefore under-predict a species' fundamental niche because it does not consider biotic interactions and abiotic factors which may limit distributions. In our study species, one such abiotic factor is probably the ocean, which limits the native range south-and eastwards. This illustrates that not all range limits can be explained by climate alone what strongly affects the models herein by frequently excluding areas between 26u N and S.
Furthermore, when applying a data set depicting the complete climatic variation within the realized distribution of a species, the limits of all dimensions of its fundamental niche are unlikely to be reached because some niche dimensions may have a widereaching impact defining a large part of the native range border (as the 'minimum temperature of the coldest month' in the Slider turtle). Likewise, others may have no impact. However, the parameters without an actual limiting function could be treated as Although the SDM approach may provide insight into the fundamental niche of a species [3,7], it cannot provide a complete picture and might be poor in choosing the relevant determinants of distribution patterns. Our results imply that parameters, which are unrelated to a species' natural history and physiology albeit congruent with its range limits, are frequently used by the models as proxies for a species' climatic envelope. This becomes evident in comparing the predictive performance of our models in the invaded range computed with a random selection of variables and a model derived from natural history criteria ( Figure 5A, B). Only the model considering explicit natural history traits performed significantly better than models based on an equal number of randomly chosen variables ( Figure 4C, Figure 6C; see also Figure  S1). The vast majority of random models did not capture the Slider turtle's actual climate envelope although test statistics may suggest a reasonably high model quality. Hence, the observed mismatches may be misinterpreted as range shifts rather than as errors in the selection of variable ( Figure 4A, B, Figure 5A, B; see also Figure S1). It was found that the predictive power of models in respect of native and introduced distributions is strongly affected by the different environmental data sets applied [53]. These findings are congruent with our results, since different sets of predictor variables have a different chance of capturing a greater or smaller part of the niche dimensions restricting a species' native range, thus explaining their different prediction success.
Assuming a shift in the slider turtle's fundamental niche is not necessary to explain the range of invasive populations in SDM, as mismatches between climate envelopes in native and invasive ranges can simply be explained by the choice of variables in SDM. Before any conclusions on niche shifts are made, an assessment of a species' fundamental niche should be addressed based on a mechanistic understanding of the limiting factors of its range. Our results indicate that such an understanding of causal factors is essential when assessing the climatic suitability of a geographic area or potential range shifts in past or future scenarios.
Our study does not aim at a principle rejection of a fundamental niche shift occurring during invasion processes [16,17,54]. If in fact a niche shift had occurred in invasive populations of the Slider turtle our conclusions will be based on the false assumption (i.e. no niche shift). However, we are convinced that assuming inappropriate model selection instead of niche shift (evolutionary response) in a successful invader is a more parsimonious assumption, especially in the light of all the methodological uncertainties accompanying with SDMs [45]. First, within 30 years no more than two Slider turtle generations may have occurred which makes evolutionary change unlikely. Second, over this time span the species has conquered different parts of the world, so evolutionary change should have taken place multiple times. These observations raise some concerns about the simplistic approach of applying 'standard datasets' of predictors in climate envelope modeling.
In conclusion, the mismatch between 'very good' [51] model performance in a mere statistical sense and the model's ability to capture the climatic niche of an organism is of particular concern. Selection of variables must be conducted carefully and needs to be fitted to the ecological and physiological characteristics of each species. Unfortunately, the lack of physiological data for the vast Figure 5. Prediction accuracy of models calculated. One-hundred summed Maxent models for the Slider turtle converted to presence/absence maps each calculated with a random selection of seven (A) and five (B) variables out of the complete set of 19 'bioclimate' predictors. Note that the species' native range is well captured by all models whereby the invasive populations are not, especially between 26u N and S longitude. doi:10.1371/journal.pone.0007843.g005 majority of species and the application of 'standard' sets of environmental variables make predictions for whole species' communities and biodiversity loss questionable [55,56]. Thus, future research should place more emphasis on the evaluation of the physiological and ecological important characteristics which are important for each single species instead of being content with deductions from distributional information. Figure S1 Presence of the Slider turtle in its native range (green dots) and invasive range where it is known to reproduce (red dots), countries from which reproducing populations are known but no specific localities are available (hatched) and potential distribution derived from BIOCLIM SDM (colored): (A) using 19 'bioclimate' variables, approach 'comprehensive'; (B) using 7 'bioclimate' variables, approach 'minimalistic'; (C) using 5 'bioclimate' variables derived from physiological and natural history traits of the Slider turtle, approach 'natural history'. Found at: doi:10.1371/journal.pone.0007843.s001 (2.13 MB DOC)

Acknowledgments
Axel Hochkirch kindly made valuable comments on an earlier version of the manuscript. Rene Bonke, Douglas Hendrie, Ralf Hendrix, Claudia