Figures
Abstract
Remote sensing techniques are useful in the monitoring of woody plant species diversity in different environments including in savanna vegetation types. However, the performance of satellite imagery in assessing woody plant species diversity in dry seasons has been understudied. This study aimed to assess the performance of multiple Gray Level Co-occurrence Matrices (GLCM) derived from individual bands of WorldView-2 satellite imagery to quantify woody plant species diversity in a savanna environment during the dry season. Woody plant species were counted in 220 plots (20 m radius) and subsequently converted to a continuous scale of the Shannon species diversity index. The index regressed against the GLCMs using the all-possible-subsets regression approach that builds competing models to choose from. Entropy GLCM yielded the best overall accuracy (adjusted R2: 0.41−0.46; Root Mean Square Error (RMSE): 0.60−0.58) in estimating species diversity. The effect of the number of predicting bands on species diversity estimation was also explored. Accuracy generally increased when three–five bands were used in models but stabilised or gradually decreased as more than five bands were used. Despite the peak accuracies achieved with three–five bands, performances still fared well for models that used fewer bands, showing the relevance of few bands for species diversity estimation. We also assessed the effect of GLCM window size (3×3, 5×5 and 7×7) on species diversity estimation and generally found inconsistent conclusions. These findings demonstrate the capability of GLCMs combined with high spatial resolution imagery in estimating woody plants species diversity in a savanna environment during the dry period. It is important to test the performance of species diversity estimation of similar environmental set-ups using widely available moderate-resolution imagery.
Citation: Fundisi E, Musakwa W, Ahmed FB, Tesfamichael SG (2020) Estimation of woody plant species diversity during a dry season in a savanna environment using the spectral and textural information derived from WorldView-2 imagery. PLoS ONE 15(6): e0234158. https://doi.org/10.1371/journal.pone.0234158
Editor: Wang Li, Aarhus University, DENMARK
Received: February 13, 2020; Accepted: May 19, 2020; Published: June 8, 2020
Copyright: © 2020 Fundisi et al. This is an open access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited.
Data Availability: All data are within paper.
Funding: The University of Johannesburg (South Africa) provided the necessary financial and material support to undertake the study. We thank the National Research Foundation (NRF) of South Africa for supporting the first author through student scholarship programme (Reference SFH150803134516) and the second author through Grants No 110778 and 119288.
Competing interests: No potential conflict of interest was reported by the authors.
Introduction
Natural environments provide vital ecosystem services such as primary energy, habitat to fauna, maintaining of hydrological cycles, protection of biological diversity, medicinal benefits, recreation, aesthetic values, etc [1–4]. Maintaining the heterogeneity and complexity of vegetation composition in these environments is therefore critical to ensure the sustained functioning of such ecosystems [3]. Unfortunately, anthropogenic activities and natural processes are threatening biodiversity and associated ecosystem services, particularly in areas close to human settlements [4–7]. An essential first step in the management of biodiversity is through accounting and monitoring of vegetation composition existing in a given area of interest. In Africa, inventories on plant species diversity are often out-of-date and/or unavailable [8]. Field inventory for characterising vegetation composition is expensive, labour intensive and time-consuming, and thus is inefficient in complex heterogeneous ecosystems [3,5,9–12].
Remote sensing overcomes the difficulties of field-based inventory and has become a primary tool in species diversity estimation and ecosystem structure assessments [9–11]. In particular, optical remote sensing utilises the sensitivity of spectra to biochemical and structural characteristics to distinguish vegetation types [12]. Advances in spectral and spatial resolution of remote sensing have allowed for more efficient species diversity estimation in different environments including grasslands (e.g., [7,13], temperate forest (e.g., [14–16], wetlands (e.g., [17,18], tropical forest (e.g., [19–21] and savanna (e.g., [22–24].
The traditional remote sensing-based species classification approaches identify a defined number of classes, irrespective of the spatial resolution of remotely-sensed data [9,21,22,25]. Such approaches, therefore, can underestimate the number of species that potentially exist in a given environment. Statistical modelling approaches such as the Shannon diversity index offer an alternative approach by converting categorical species data into continuous diversity scale [25–27], thus eliminating the restriction on the number of species that can be estimated for a given area [13,17,28–30]. A number of studies have applied continuous-scale metrics derived from species count data to quantifying woody plant species diversity in the savanna vegetation type [24,27,31–35].
Vegetation indices (e.g., Normalised Vegetation Index, NDVI) have been used as the common source of data to quantify vegetation species diversity on a continuous scale [24,28,36,37]. It should be noted that Vegetation Indices (VIs) primarily capture vegetation vigour or amount rather than species diversity [9–11]. Thus, a plot with high species diversity but low vigour (low index value) can be misinterpreted as having low species diversity while a plot containing a single or few plants with high vigour can be misinterpreted as being species-rich [10,38]. It is therefore important to test the performance of individual spectral bands rather than relying on VIs, to quantify species diversity.
A further challenge associated with reliance on vegetation indices is that the indices logically work well in high-photosynthesis-activity periods of the year [38–40]. For instance, [15,24,41] compared species diversity estimation in the dry versus wet season and found better accuracies in the wet season. There is, therefore, a need to exploit the information offered by individual bands of satellite imagery to assess species diversity in the dry season within the savanna environment. While biodiversity assessment is ideal in the wet (vigorous seasons), placing focus on the dry season has operational and ecological benefits. Operationally, remotely-sensed data acquired during dry seasons are less affected by clouds and therefore provide readily interpretable information [38]. Ecologically, dry season biodiversity assessment is vital as part of continuous monitoring strategies such as phenological tracking [6,22] and land degradation facing the savanna ecosystem [23,42].
This study, therefore, aims to assess the performance of individual bands of WorldView-2 imagery in quantifying woody plant species diversity in a dry season within the savanna region in South Africa. The study uses various GLCMs of the WorldView-2 bands as predictors to estimate field-derived species diversity expressed in the Shannon diversity index. Our study builds on [33] who extracted all eight GLCMs but only used one band (near-infrared) of a Landsat image from the wet season of an American savanna to estimate plant species diversity. Our study also differs from [43] that used fewer GLCMs and coarser spatial resolution (Landsat data). We apply the all-possible-subset regression approach to exploit the large number of models that can be created using individual bands as predictors. This specific regression approach provides alternative competing models from which a reasonable model can be selected, therefore it was selected for this study. Satisfactory findings in this study have important implications for conservation efforts that require vegetation species monitoring, irrespective of the season [44].
Materials and methods
Study area
The Klipriviersberg Nature Reserve (KNR), south of Johannesburg, South Africa, was used for the study. The reserve was proclaimed for conservation purposes in 1984 and covers approximately 651 hectares (Fig 1). KNR, administered by the City Parks of the City of Johannesburg, is a public park open to visitors at no cost. Different activities such as hiking and research work are allowed in the reserve without the need or written permission from the management provided that the regulations governing the reserves’ natural resources are observed at all times. Vegetation types in the reserve include Andesite Mountain Bushveld and Clay Grassland which are associated with a savanna environment [45]. The altitude of the area ranges between 1540 m in the south and 1790 m in the north, with a mean annual rainfall of 624–802 mm [46]. The study area experiences warm to hot summers and cold nights in winter, with a mean annual temperature ranging between 17°C and 26°C in summer and 5°C and 7°C in winter [47]. The geology types found in the area, which lead to the floristic structure of the reserve, include quartzites, conglomerates and dolomites [48].
Field data
Fig 2 provides a summary of the methodology followed in the study to estimate woody plant species diversity. Initially, a grid of 240 points distributed at approximately 170 m intervals in the north-south and east-west directions were generated using the fishnet tool in ArcGIS (ESRI® ArcGIS 10.6, Redlands, California, USA). The point coverage was exported into a Global Position System (GPS) (Garmin, GPSMAP® 64, Kansas, USA) and located in the field. Points which did not have woody plant species in their vicinity were removed from the enumeration resulting in 220 number of points available for the survey. Field surveys were done between May and June 2017, representing the dry winter season in the area [47]. A buffer with a 20 m radius was created around each point; this size was specified to accommodate not only multiple pixels of WorldView-2 imagery, but also coarser-resolution remotely sensed data that will be used for further study. The plot size was also deemed large enough to contain as many woody plant species as possible.
Unique woody plant species ≥ 2 m height were recorded in each plot (Fig 2) guided by the KNR field surveyors and the reserve’s species inventory database. The total number of distinct species in all plots was 26, while the plot-level minimum and maximum count of distinct woody plant species were 1 and 9, respectively. In addition to species uniqueness, the survey recorded the total number of woody plants ≥ 2 m in each plot. The species diversity was then converted to a continuous scale by applying the Shannon diversity index [49] for use in regression modelling. The Shannon index (Eq 1) is advantageous in that it incorporates the relative abundance and evenness of species, and therefore does not favour common or rare species [33,50]. The minimum, maximum, standard deviation of the Shannon diversity index in the sampled plots were 0.15, 2.86 and 0.78, respectively.
(1)
where p is the number of plants of a species divided by the total number of all plants, ln is the natural log, s is the number of species.
WorldView-2 image
WorldView-2 imagery covering the visible to near-infrared range of the electromagnetic spectrum was obtained for the same dry winter period (May−June 2017) as the field survey period. WorldView-2 image has high spectral and spatial resolutions making it effective for fine-scale woody plant species diversity estimation [51]. Specifically, the image has eight multispectral bands between 0.40 and 1.04 μm at 1.8 m spatial resolution and a panchromatic band covering 0.45−0.80 μm at 0.5 m spatial resolution (DigitalGlobe, www.digitalglobe.com). The bands vary in width with the yellow (0.59 and 0.63 μm) and red edge (0.71 and 0.75 μm) bands being narrower than the others. The coastal band (0.40−0.46 μm) was excluded from the analysis due to its relative sensitivity to atmospheric interferences [50]. Following a comparison of Dark Object Subtraction (DOS), [52], Fast Line-of-sight Atmospheric Analysis of Hypercubes (FLASH), [53] and QUick Atmospheric Correction (QUAC), [54] methods that yielded similar reflectance values (Pearson’s correlation, r = 0.99)). We applied the DOS method implemented in ENVI 5.3 ©2015 (Exelis Visual Information Solution Inc., Boulder, Colorado). Subsequently, the spatial resolution of the atmospheric corrected multispectral bands was pan-sharpened to 0.5 m. (Fig 2). The reliability of our pan-sharpening was ascertained by comparing the results with the already pan-sharpened image provided by the data supplier (DigitalGlobe, www.digitalglobe.com).
Woody vegetation extraction from WorldView-2 image
Pixels representative of woody vegetation plants were extracted prior to derivation of GLCMs that were used in the regression analysis. Previous studies have shown the effectiveness of NDVI thresholding to separate woody and non-woody plants (e.g., [55–57]. The approach assumes that woody plants including trees, bushes and shrubs with relatively dense foliage have greater NDVI than non-woody plants [57–60]. In this study, we computed WorldView-2 Improved Vegetation Index (WV-VI) [61] that combines the near-infrared 2 and the red bands in a similar formula as the NDVI. The narrowly focused near-infrared 2 band (0.86−1.04 μm) used in WV-VI has a higher reflectance value than the traditional broad near-infrared band range used in the NDVI [62].
Parallel to WV-VI derivation, objects were created from the multispectral image of WorldView-2 using multiresolution segmentation, following the previous studies that showed good woody plants identification capability using object-based classification [63–66]. The segmentation was applied in eCognition Developer® 9.01 (Trimble Germany GmbH, Arnulfstrasse 126, 80636 Munich, Germany). The WV-VI values were subsequently averaged per segment [67]. After iterative and independent trials, segments with WV-VI ≥ 0.4 were determined as woody plants separating them from non-woody plants (Fig 2). Segments of woody plants and non-woody features were confirmed by consulting with plot observations that were taken during field surveys.
Gray Level Co-occurrence Matrix
GLCMs were quantified from individual bands of WorldView-2 imagery within woody segments. GLCM is a statistical method used to examine the texture of pixels within a specific neighbourhood [68]. Since the statistic quantifies how often unique combinations of pixel brightness values (gray levels) occur in an image, it signifies underlying physical variations in the image and thus reveals the structural arrangements of the surface and their relationship to the surrounding environment [69]. Typically, GLCM implements a matrix scenario and it is computed by considering four different directions (0°, 45°, 90°, and 135°) between neighbouring cells that are separated by a certain distance [68,69]. This study extracted eight GLCM statistics derived from each WorldView-2 band using ENVI 5.3 ©2015 (Exelis Visual Information Solutions. Inc Boulder, Colorado).
The GLCMs were quantified in three kernel sizes: 3×3, 5×5 and 7×7. These window sizes were specified considering the spatial resolution of the image (0.5 m) and the varying canopy sizes of woody vegetation observed during the field survey. In addition, plants were observed as isolated individuals or in patches–as is common in savanna environments [28,32,33]. It is unclear how such arrangements influence the detection capability of different window sizes. It is important to ascertain that these kernel sizes exhibit variation and thereby may warrant different species diversity estimations. A simple Pearson’s correlation analysis of GLCMs among the different kernel sizes using Entropy and three selected bands (Green, Yellow and Near-infrared) resulted in r ≤ 0.5 for a number of comparisons, indicating the difference between the window sizes.
Statistical analysis
This study used an all-possible-subsets regression modelling approach to estimate the Shannon diversity index derived from the field data using GLCM values (Table 1) derived from WorldView-2 bands at different window sizes (3×3, 5×5 and 7×7) as predictors. All-possible-subsets regression tests all possible combinations of explanatory variables to develop estimation models from which favourable models can be selected [70]. A key advantage of the approach is that it does not require significance-testing that can be influenced by certain values in the samples that may or may not be representative of the population [70]. Instead, the approach provides all alternative models from which decisions can be made based on knowledge of the population from which the samples were drawn [70]. The all-possible-subset regression approach builds 2n - 1 models from n number of explanatory variables [71]. In this study, the alternative models for estimating the Shannon diversity index were grouped and compared per GLCM window size. That is, the seven bands (explanatory variables) within the 3×3 window had 127 competing models; a similar principle applied to 5×5 and 7×7 windows respectively.
By selecting the best model from the alternative models, in this study sought to achieve good accuracy with few predictors [70] using a combination of statistical indicators. Firstly, the Akaike Information Criterion (AIC) that measures the distance between a model and an ideal but unobservable model that created the data was used to rank the models [72]. Secondly, models were compared using adjR2 and RMSE values, which respectively provide an absolute measure of the explanatory power of predictors and accuracy of a model. [5,73]. Thirdly, we compared the errors of models with the smallest AIC values and explanatory variables per GLCM and window size. This approach provides more insights than the above model-fit statistics (adjR2 and RMSE) since it compares the direction and magnitude of errors of individual samples. In doing so, it also allows the identification of samples that do not have comparable estimates by competing models. The all-possible-subsets regression analysis was implemented using lmSubsets package [74] for R [75].
Results
Effect of window size, predictors and GLCMs on species diversity estimation
Table 2 lists the variables of the model that returned the smallest AIC value per GLCM and window size. Each selection represented the best-case AIC out of 127 competing models. The best model using entropy GLCM and a window size of 3×3 contained red, near-infrared 2, red edge and yellow bands as predictors of Shannon diversity index. Within the same GLCM group, the model developed using pixels in the 5×5 window used five bands as predictors. Looking at the rest of the GLCMs and window sizes, the best model and second-best model contained three bands as predictors. Contrast had the least number of predicting bands (one), whilst dissimilarity recorded the greatest number of predictor bands (five). The best AIC models using dissimilarity GLCM had five predictors for each window size, while the best models using mean GLCM used the same bands as predictors (green, red edge, yellow) for all window sizes. It is also important to note from Table 2 the inclusion of the yellow band as a predictor in the best-case models in five of the GLCM statistics.
Note that, the results presented here are the best-case scenario (smallest AIC) of 127 models per GLCM and window size.
Unfortunately, the AIC values given in Table 2 cannot be used to evaluate the relative performances of the models across GLCM and window sizes. It is, therefore, necessary to use other statistics such as adjR2 and RMSE to assess model accuracies and comparisons (Fig 3). Entropy GLCM achieved the best accuracies in all three windows, according to adjR2 (> 0.40) and RMSE (< 0.6). Specifically, entropy GLCM with a window size of 5×5 yielded the highest adjR2 of 0.46 and RMSE of 0.58 when five variables were used as predictors. Dissimilarity GLCM and a 7×7 window had the closest accuracy level to the entropy GLCM (adjR2 = 0.40; RMSE = 0.62); however, the model used six bands as predictors. Homogeneity and mean GLCMs showed the lowest prediction accuracies for all the window sizes and across the different number of predictors.
Fig 3 also shows an important trend in terms of accuracy across the number of predictors. In general, estimation accuracy increases rapidly for models that consist of three to five predictors, after which the accuracies start to level off or decrease. For instance, accuracies of models using the entropy GLCM peaked when five bands were used as predictors for the image using a 5×5 window size while the accuracies remained almost constant for all models containing six or seven predictors. The peak accuracies using the same entropy GLCM using 3×3 and 7×7 window sizes were reached when four and three predictors, were used respectively. The variance GLCM statistic that showed the third-best estimation capability after the entropy and dissimilarity statistic had models that peaked when two or three predictors were used for the three window size kernels.
Focusing on species diversity estimation using entropy GLCM that showed the best accuracies (Fig 3), we illustrate the relationships between observed and predicted Shannon diversity index in Fig 4. Similarities can be noted among estimations using the three window sizes: 3×3 (Fig 4A), 5×5 (Fig 4B) and 7×7 (Fig 4C). One similarity is that the observed versus predicted correlations were generally comparable for the three images when compared against the ideal 1:1 correlation. The second similarity relates to the over and underestimation of the Shannon species diversity for low and high values, respectively. A closer look, however, shows the best correlation for 5×5 window size (Fig 4B). This is evidenced by the greater number of estimations deviating from the regression lines in the cases of 3×3 (Fig 4A) and 7×7 window size images (Fig 4C).
The predicted indices were estimated using entropy GLCM derived from images using 3×3 (a), 5×5 (b) and 7×7 (c) window sizes. Note that the best estimations shown in the fig used four, five and three predictors for 3×3, 5×5 and 7×7, respectively. Dashed lines show 1:1 correspondence.
Comparison of competing models derived from GLCMs
Entropy GLCM with 5×5 window size from WorldView-2 image provided better estimates of species diversity (Fig 4B compared to 4a and 4c), therefore it was used as the basis to evaluate the performances of other competing models. Three categories of comparison were made using estimation errors of all competing models as illustrated in Fig 5. The first category compared estimation errors across predictor size using the best GLCM statistic with five predictors as the reference. The results of this comparison clearly show strong similarities (r = 0.97–0.99) between the best model combining GLCM, 5×5 window size and five predicting bands with models containing fewer predictors, although the similarities show a decreasing trend as the predictor size decreases (Fig 5A).
Correlation of estimation errors between selected competing models: (a) correlation between best entropy GLCM model derived from 5×5 window with 5 bands against entropy GLCM models from different number of bands within 5×5 window, (b) correlation between the best model against models derived from 7×7 window size 5 competing models, (c) correlation between best model against models derived from 3×3 window size and 5 competing models, (d) correlation between best entropy GLCM model derived from 5×5 window against seven other GLCM statistics. Underlining shows the same number of predicting bands (5) in different windows. SM = second moment, Var = variance, Cor = correlation, Con = contrast, Hom = homogeneity, Dis = dissimilarity, Mn = mean, ba = bands.
The second category of comparison was intended mainly to assess the effect of window size on estimation residual (Fig 5B and 5C). The correlation of estimation errors of the best scenario (Entropy 5×5) was 0.93 when compared with 7×7 window size and the same predictor size (five predictor bands) while it improved when the predictors were fewer (r = 0.97 for three predictors) (Fig 5B). On the other hand, the 3×3 window size and five predictors correlated with the best scenario at r = 0.90 while the best similarity was observed when four predictors were used in the 3×3 window size (Fig 5C).
The third category of comparison was made between the overall best model (entropy GLCM using 5×5 window size image) against the best competing models from the other seven GLCM statistics as shown in Fig 5D. The correlation between entropy GLCM and the other GLCM statistics ranged between 0.70 (contrast GLCM) and 0.80 (second moment GLCM). These correlations were considerably weaker than those observed in Fig 5A–5C. This is noticeable from the error of entropy GLCM for 5×5 window size image falling mostly within the -0.65–1.5 range whilst most of the error ranged between -2 and 2 for the competing models developed from other GLCM statistics.
Discussion
This study aimed at assessing the performance of GLCM texture values derived from individual bands of WorldView-2 imagery in quantifying woody plant species diversity in a dry season. Many studies have focused on imagery at coarser resolutions (e.g., [33,35,37,43,59,76–79]) that may ignore processes occurring at finer spatial resolutions (i.e. within pixel variation). The present study, therefore, adds to the body of knowledge by utilising finer resolution imagery and smaller filed-plots, which may lead to different observations than studies using coarser data. We converted the categorical species diversity information acquired through field survey to the continuous Shannon diversity index scale. The all-possible-subset regression approach that creates alternative estimation models was used in this study to correlate field data (expressed in Shannon index) and remotely-sensed data. [51,79–81] utilised all-possible subsets in species diversity estimation and reported its efficacy in evaluating all possible combinations of explanatory variables to select the best model. We believe that this approach is useful for exploring the suitability of the multiple sets of predictors (spectral bands, GLCM statistics and image window levels) to estimate species diversity.
GLCMs vs. model performance
Generally, models with the best AIC ranking per window size and GLCM statistic had 3–5 bands as predictors (Table 2). Notably, the yellow band was part of the best model in most GLCM and window sizes. The ability of yellow band in discriminating plant species in dry seasons makes WorldView-2 imagery quite useful since a shortage of moisture in these seasons renders foliage yellowish [32,82]. Comparison of GLCM statistics showed entropy GLCM to be the best (Fig 4). This is not surprising given that difference in species types (mixed herbaceous−woodland plants) found in savanna creates heterogeneous environments that are capable of supporting diverse species [60].
Other better-performing GLCMs close to entropy in this study included dissimilarity, contrast and variance, all of which measure the degree of heterogeneity of the gray level and thus are capable of measuring diversity [83]. Such better performance as observed by [80,83] can be as a result of instantaneous changes in gray level values between neighbouring pixels; that portrays spatially contrasting pixel pairs. It is noteworthy to mention the weak estimation capability of some of the GLCMs, particularly homogeneity and mean (Fig 3) which do not specifically measure gray level dispersion. Homogeneity measures and represents the amount of local similarity in the image window [76] while mean measures the average gray level values in a window.
Importance of predictor size and window size on species diversity estimation
Best estimations peaked when models contained 3–5 predictors for most GLCMs and window sizes, after which accuracies remained unchanged or decreased slightly and progressively (Fig 3). Logically, multiple individual spectral bands, should be preferable for effective identification of plant species composition [80,83–85]. This is justified by the fact that different species respond differently to incident radiant energy across the electromagnetic spectrum; hence a large number of individual spectral bands leads to a higher chance of species discrimination [9]. In connection to this, the better performance in quantifying species diversity in our study compared to [31] whose study focused on southern Africa’s savanna region (in which our study area also belongs) is attributed to the additional bands of WorldView-2 (red edge and yellow bands). These additional bands are shown to be sensitive to variations in plant condition in dry periods [28].
Despite high accuracies achieved by entropy and dissimilarity GLCM statistics, it is worth noting the higher number of predicting bands used in the latter statistic (Fig 3). The number of predictors used in a model should, therefore, be taken into consideration in model selection [71]. It was, therefore, important to evaluate accuracy across predictor size using directly comparable models developed using a GLCM statistic and window size, as illustrated in Fig 5A. A specific comparison between the best model and those that contained fewer bands of entropy GLCM at 5×5 window size showed little difference in accuracy, indicating the potential of using simple models to estimate species diversity. In addition to model simplicity, knowledge of good models that use few predictors (such as those shown in Fig 5A) is significant since these models allow the use of suitable bands and they avoid those that show a high level of uncertainty due, for instance, to atmospheric interference.
Although the effects of the three window sizes (3×3, 5×5 and 7×7) were comparable for entropy GLCM-based analysis, the two smaller window sizes had a marginally better effect on the accuracy (Fig 4). This shows the importance of limiting the size in characterising species diversity at a localised scale such as one considered in this study. Overall, window sizes also did not have a consistent effect on species diversity estimation accuracy across GLCMs used in the study (Fig 3). This observation is also noted by [86] who reported that GLCM statistics have varied output with different window sizes. We illustrated a more focused comparison of the 5×5 window (which was taken as the best) with the 7×7 (Fig 5B) and 3×3 (Fig 5C) window sizes of the entropy GLCM. The better similarity of 5×5 with 7×7 than with 3×3, irrespective of the predictor size, shows the need to use fairly large window sizes to reduce the effect of noise commonly encountered in small windows [87]. It should be noted that the comparison of windows can be conditioned to the GLCM statistics used. For instance, we can deduct from Fig 3 that all predictor sizes of 3×3 window size result in significantly low accuracies than those of 5×5 and 7×7 when dissimilarity GLCM is used, while the three windows yield comparable accuracies for second moment and mean GLCMs.
Finally, the best model (namely, that used entropy, 5×5 and five bands) was compared against the best models of all the seven GLCM statistics. This was done to assess how much these statistics maintain the accuracy of the best model (Fig 5D). The comparisons generally showed low correspondence between the best entropy-based model and those developed from the seven GLCM statistics. However, this should not be construed as weak estimation accuracy for individual GLCM statistics as some of them showed good accuracies (Fig 3). For instance, errors of dissimilarity and variance GLCMs had low correlation with errors of entropy GLCM (r = 0.72); however, the three of them were the best predictors as shown in Fig 3. This implies that the models had inconsistent performance on each sample, but their overall accuracies remain similar. This, in turn, suggests the importance of identifying appropriate GLCM for species diversity assessment in a vegetation environment such as the one considered in the current study.
Conclusions
This study explored the performances of eight GLCM statistical measures derived from reflectance values of WorlView-2 individual bands to estimate species diversity during a dry season in a savanna vegetation type. An exhaustive analysis using the all-possible-subset regression approach showed that entropy GLCM statistic performed better than other statistics in capturing plot level species diversity expressed in the Shannon index scale. This finding agrees with other studies and with expectations since the entropy GLCM statistic exploits the complexity of pixel values within a particular window size. This suggests the preference of entropy in a vegetation environment where there is a good deal of complexity that should not be underestimated. Notably, the yellow band formed part of models that yielded best estimation accuracies in most cases, confirming the importance of this band in discriminating species diversity, particularly in the dry season.
The study further evaluated the accuracies of predictor and GLCM window sizes in estimating species diversity. Predictor size had a remarkable pattern of influence on estimation accuracy. Model accuracy increased until three–five predicting bands were used but stabilised or decreased as more predictors were used per model for most cases of GLCM. Accuracies of models that used fewer predicting variables compared to the best models that used three–five bands were fairly good, indicating the adequacy of limited bands in species diversity estimation. The effect of window size on species diversity estimation varied with GLCM type used in the extraction of representative value. This is due to the fact that the GLCM statistics inherently determine the type and level of similarity or contrast derived from a gray-level neighbourhood of pixels.
Although the findings might not be universal to all images and all vegetation environments, this study provides important observations on the performances of high spatial resolution imagery coupled with GLCM statistics for woody species diversity estimation in dry conditions within a savanna environment. Such an approach should be investigated in different savanna environments as well as other ecosystems that might have more diverse species types. Furthermore, it is worth factoring in external variables (e.g. topographic variables and climate data) in classifying species diversity. It is also important to extend such a study by using higher spatial resolution data than used in the current study. Availability of very resolution data are realized in particular with the development of unmanned aerial systems. In addition, GLCM statistics should also be tested for estimating species diversity at a broader scale using different moderate resolution imagery.
Acknowledgments
The authors thank Bishop Ngobeli (Manager of the Klipriviersberg Nature Reserve) for unlimited access to the reserve and for additional transportation support to and from the reserve. We are grateful to Tumelo Molaba for all the assistance during the field survey.
References
- 1. Bommarco R, Kleijn D, Potts SG. Ecological intensification: Harnessing ecosystem services for food security. Trends in Ecology and Evolution. 2013. pmid:23153724
- 2. Sandifer PA, Sutton-Grier AE, Ward BP. Exploring connections among nature, biodiversity, ecosystem services, and human health and well-being: Opportunities to enhance health and biodiversity conservation. Ecosystem Services. 2015.
- 3. Tehrany MS, Kumar L, Drielsma MJ. Review of native vegetation condition assessment concepts, methods and future trends. Journal for Nature Conservation. 2017;40: 12–23.
- 4. Deng W, Cheshmehzangi A. Eco-development in China. Eco-development in China. 2018.
- 5. Wang K, Wang T, Liu X. A review: Individual tree species classification using integrated airborne LiDAR and optical imagery with a focus on the urban environment. Forests. 2018;10: 1–18.
- 6. Sharma RP, Bílek L, Vacek Z, Vacek S. Modelling crown width–diameter relationship for Scots pine in the central Europe. Trees—Structure and Function. 2017.
- 7. Gao J, Meng B, Liang T, Feng Q, Ge J, Yin J, et al. Modeling alpine grassland forage phosphorus based on hyperspectral remote sensing and a multi-factor machine learning algorithm in the east of Tibetan Plateau, China. ISPRS Journal of Photogrammetry and Remote Sensing. 2019;147: 104–117.
- 8. Sosef MSM, Dauby G, Blach-Overgaard A, van der Burgt X, Catarino L, Damen T, et al. Exploring the floristic diversity of tropical Africa. BMC Biology. 2017. pmid:28264718
- 9.
Lillesand TM, Kiefer RW, Chipman JW. Remote sensing and image interpretation. Nev York Chichester Brisbane Toronto 6IS s. John Wiley & Sons; 2004. Available: http://www.osti.gov/energycitations/product.biblio.jsp?osti_id=6028047
- 10. Xue J, Su B. Significant remote sensing vegetation indices: A review of developments and applications. Journal of Sensors. 2017.
- 11. Xie Y, Sha Z, Yu M. Remote sensing imagery in vegetation mapping: a review. Journal of Plant Ecology. 2008.
- 12. Zhang C, Lu D, Chen X, Zhang Y, Maisupova B, Tao Y. The spatiotemporal patterns of vegetation coverage and biomass of the temperate deserts in Central Asia and their relationships with climate controls. Remote Sensing of Environment. 2016.
- 13. Gholizadeh H, Gamon JA, Townsend PA, Zygielbaum AI, Helzer CJ, Hmimina GY, et al. Detecting prairie biodiversity with airborne remote sensing. Remote Sensing of Environment. 2019;221: 38–49.
- 14. Marks CO, Muller-Landau HC, Tilman D. Tree diversity, tree height and environmental harshness in eastern and western North America. Ecology Letters. 2016. pmid:27146846
- 15. Arekhi M, Yılmaz OY, Yılmaz H, Akyüz YF. Can tree species diversity be assessed with Landsat data in a temperate forest? Environmental Monitoring and Assessment. 2017. pmid:29080961
- 16. Dieler J, Uhl E, Biber P, Müller J, Rötzer T, Pretzsch H. Effect of forest stand management on species composition, structural diversity, and productivity in the temperate zone of Europe. European Journal of Forest Research. 2017.
- 17. Castillo-Riffart I, Galleguillos M, Lopatin J, Perez-Quezada JF. Predicting vascular plant diversity in anthropogenic peatlands: Comparison of modeling methods with free satellite data. Remote Sensing. 2017;9.
- 18. Luize BG, Magalhães JLL, Queiroz H, Lopes MA, Venticinque EM, de Moraes Novo EML, et al. The tree species pool of Amazonian wetland forests: Which species can assemble in periodically waterlogged habitats? PLoS ONE. 2018. pmid:29813116
- 19. Almeida DRA, Stark SC, Chazdon R, Nelson BW, Cesar RG, Meli P, et al. The effectiveness of lidar remote sensing for monitoring forest cover attributes and landscape restoration. Forest Ecology and Management. 2019;438: 34–43.
- 20. Silveira EM de O, Terra M de CNS, ter Steege H, Maeda EE, Acerbi Júnior FW, Scolforo JRS. Carbon-diversity hotspots and their owners in Brazilian southeastern Savanna, Atlantic Forest and Semi-Arid Woodland domains. Forest Ecology and Management. 2019;452: 117575.
- 21. Schwartz NB, Budsock AM, Uriarte M. Fragmentation, forest structure, and topography modulate impacts of drought in a tropical forest landscape. Ecology. 2019;100. pmid:30825323
- 22. Khwaja ZH, Ahmad W, Williams RJ. An evaluation of alternative image classification techniques for the identification and mapping of tropical savanna landscape in Northern Australia. Geocarto International. 2003;18: 33–44.
- 23. Carrijo TF, Brandão D, De Oliveira DE, Costa DA, Santos T. Effects of pasture implantation on the termite (Isoptera) fauna in the Central Brazilian Savanna (Cerrado). Journal of Insect Conservation. 2009.
- 24. Madonsela S, Cho MA, Ramoelo A, Mutanga O, Naidoo L. Estimating tree species diversity in the savannah using NDVI and woody canopy cover. International Journal of Applied Earth Observation and Geoinformation. 2018;66: 106–115.
- 25. Rocchini D, Boyd DS, Féret JB, Foody GM, He KS, Lausch A, et al. Satellite remote sensing to monitor species diversity: potential and pitfalls. Remote Sensing in Ecology and Conservation. 2016.
- 26. Shannon CE. The Mathematical Theory of Communication. M.D. Computing. 1963.
- 27. Campos VE, Gatica GM, Cappa FM, Giannoni SM, Campos CM. Remote sensing data to assess compositional and structural indicators in dry woodland. Ecological Indicators. 2018;88: 63–70.
- 28. Mapfumo RB, Murwira A, Masocha M, Andriani R. Detection of subtle deforestation due to logging using satellite remote sensing in wet and dry savanna woodlands of Southern Africa. Geocarto International. 2017;32: 514–530.
- 29. Munyati C. Optimising multiresolution segmentation: Delineating savannah vegetation boundaries in the Kruger National Park, South Africa, using Sentinel 2 MSI imagery. International Journal of Remote Sensing. 2018;39: 5997–6019.
- 30. Rocchini D, Balkenhol N, Carter GA, Foody GM, Gillespie TW, He KS, et al. Remotely sensed spectral heterogeneity as a proxy of species diversity: Recent advances and open challenges. Ecological Informatics. 2010.
- 31. Madonsela S, Cho MA, Mathieu R, Mutanga O, Ramoelo A, Kaszta Ż, et al. Multi-phenology WorldView-2 imagery improves remote sensing of savannah tree species. International Journal of Applied Earth Observation and Geoinformation. 2017;58: 65–73.
- 32. Cho MA, Malahlela O, Ramoelo A. Assessing the utility WorldView-2 imagery for tree species mapping in South African subtropical humid forest and the conservation implications: Dukuduku forest patch as case study. International Journal of Applied Earth Observation and Geoinformation. 2015.
- 33. Wood EM, Pidgeon AM, Radeloff VC, Keuler NS. Image texture as a remotely sensed measure of vegetation structure. Remote Sensing of Environment. 2012;121: 516–526.
- 34. Samara e Silva Medeiros E, Machado CCC, Galvíncio JD, de Moura MSB, de Araujo HFP. Predicting plant species richness with satellite images in the largest dry forest nucleus in South America. Journal of Arid Environments. 2019;166: 43–50.
- 35. Karlson M, Ostwald M, Reese H, Sanou J, Tankoano B, Mattsson E. Mapping tree canopy cover and aboveground biomass in Sudano-Sahelian woodlands using Landsat 8 and random forest. Remote Sensing. 2015.
- 36. Brehaut L, Danby RK. Inconsistent relationships between annual tree ring-widths and satellite-measured NDVI in a mountainous subarctic environment. Ecological Indicators. 2018.
- 37. He KS, Zhang J, Zhang Q. Linking variability in species composition and MODIS NDVI based on beta diversity measurements. Acta Oecologica. 2009.
- 38. Rocchini D, Luque S, Pettorelli N, Bastin L, Doktor D, Faedi N, et al. Measuring β-diversity by remote sensing: A challenge for biodiversity monitoring. Methods in Ecology and Evolution. 2018.
- 39.
Rouse JW, Haas RH, Schell JA, Deeering D. Monitoring vegetation systems in the Great Plains with ERTS (Earth Resources Technology Satellite). In: Freden SC, Mercanti EP, Becker MA, editors. Third Earth Resources Technology Satellite-1 Symposium. Washington: Nasa; 1973. pp. 309–317.
- 40. Verrelst J, Schaepman ME, Koetz B, Kneubühler M. Angular sensitivity analysis of vegetation indices derived from CHRIS/PROBA data. Remote Sensing of Environment. 2008;112: 2341–2353.
- 41. Macintyre P, van Niekerk A, Mucina L. Efficacy of multi-season Sentinel-2 imagery for compositional vegetation classification. International Journal of Applied Earth Observation and Geoinformation. 2020;85: 101980.
- 42. Ludwig M, Morgenthal T, Detsch F, Higginbottom TP, Lezama Valdes M, Nauß T, et al. Machine learning and multi-sensor based modelling of woody vegetation in the Molopo Area, South Africa. Remote Sensing of Environment. 2019;222: 195–203.
- 43. Madonsela S, Cho MA, Ramoelo A, Mutanga O. Remote sensing of species diversity using Landsat 8 spectral variables. ISPRS Journal of Photogrammetry and Remote Sensing. 2017.
- 44. Cleland EE, Allen JM, Crimmins TM, Dunne JA, Pau S, Travers SE, et al. Phenological tracking enables positive species responses to climate change. Ecology. 2012;93: 1765–1771. pmid:22928404
- 45. Mucina L, Rutherford , Michael C. Powrie LW. Biomes and Bioregions of Southern Africa. The vegetation of South Africa, Lesotho and Swaziland Strelitzia 19. 2006.
- 46. Kruger AC, Nxumalo MP. Historical rainfall trends in South Africa: 1921–2015. Water SA. 2017;43: 285–297.
- 47. MacKellar N, New M, Jack C. Observed and modelled trends in rainfall and temperature for South Africa: 1960–2010. South African Journal of Science. 2014.
- 48. Molezzi MG, Hein KAA, Manzi MSD. Mesoarchaean-Palaeoproterozoic crustal-scale tectonics of the central Witwatersrand basin—Interpretation from 2D seismic data and 3D geological modelling. Tectonophysics. 2019.
- 49. Shannon CE. A Mathematical Theory of Communication. Bell System Technical Journal. 1948.
- 50. Stych P, Jerabkova B, Lastovicka J, Riedl M, Paluba D. A comparison of worldview-2 and landsat 8 images for the classification of forests affected by bark beetle outbreaks using a support vector machine and a neural network: A case study in the sumava mountains. Geosciences (Switzerland). 2019.
- 51. Waser LT, Küchler M, Jütte K, Stampfer T. Evaluating the potential of worldview-2 data to classify tree species and different levels of ash mortality. Remote Sensing. 2014;6: 4515–4545.
- 52. Chavez PS. Image-based atmospheric corrections—Revisited and improved. Photogrammetric Engineering and Remote Sensing. 1996.
- 53. Adler-Golden SM, Matthew MW, Bernstein LS, Levine RY, Berk A, Richtsmeier SC, et al. Atmospheric correction for shortwave spectral imagery based on MODTRAN4. Imaging Spectrometry V. 1999.
- 54. Bernstein LS, Adler-Golden SM, Sundberg RL, Levine RY, Perkins TC, Berk A, et al. Validation of the QUick atmospheric correction (QUAC) algorithm for VNIR-SWIR multi- and hyperspectral imagery. Algorithms and Technologies for Multispectral, Hyperspectral, and Ultraspectral Imagery XI. 2005.
- 55. Loranty MM, Davydov SP, Kropp H, Alexander HD, Mack MC, Natali SM, et al. Vegetation indices do not capture forest cover variation in Upland Siberian larch forests. Remote Sensing. 2018;10.
- 56. Gini R, Sona G, Ronchetti G, Passoni D, Pinto L. Improving tree species classification using UAS multispectral images and texture measures. ISPRS International Journal of Geo-Information. 2018;7.
- 57. Zhang S, Chen H, Fu Y, Niu H, Yang Y, Zhang B. Fractional vegetation cover estimation of different vegetation types in the Qaidam Basin. Sustainability (Switzerland). 2019.
- 58. Sobrino JA, Raissouni N. Toward remote sensing methods for land cover dynamic monitoring: Application to Morocco. International Journal of Remote Sensing. 2000;21: 353–366.
- 59. Ibrahim Balzter, Mathieu Tsutsumida. Impact of Soil Reflectance Variation Correction on Woody Cover Estimation in Kruger National Park Using MODIS Data. Remote Sensing. 2019.
- 60. Bunting P, Lucas R. The delineation of tree crowns in Australian mixed species forests using hyperspectral Compact Airborne Spectrographic Imager (CASI) data. Remote Sensing of Environment. 2006.
- 61. Wolf AF. Using WorldView-2 Vis-NIR multispectral imagery to support land mapping and feature extraction using normalized difference index ratios. Algorithms and Technologies for Multispectral, Hyperspectral, and Ultraspectral Imagery XVIII. 2012.
- 62. Jawak SD, Luis AJ, Fretwell PT, Convey P, Durairajan UA. Semiautomated detection and mapping of vegetation distribution in the antarctic environment using spatial-spectral characteristics of WorldView-2 imagery. Remote Sensing. 2019;11.
- 63. Lu D, Weng Q. A survey of image classification methods and techniques for improving classification performance. International Journal of Remote Sensing. 2007.
- 64. Lu D, Hetrick S, Moran E. Land cover classification in a complex urban-rural landscape with QuickBird imagery. Photogrammetric Engineering and Remote Sensing. 2010. pmid:21643433
- 65. Lu D, Li G, Moran E, Kuang W. A comparative analysis of approaches for successional vegetation classification in the Brazilian Amazon. GIScience and Remote Sensing. 2014.
- 66. Momeni R, Aplin P, Boyd DS. Mapping complex urban land cover from spaceborne imagery: The influence of spatial resolution, spectral band set and classification approach. Remote Sensing. 2016.
- 67.
Trimble. eCognition® Developer 7 reference book. Definiens AG, München. 2007.
- 68.
Gonzalez , Rafael C. and Woods RE. Digital_Image_Processing_2ndEd.pdf. New Jersey: Prentice Hall; 1992. pp. 1–793. Available: www.prenhall.com/gonzalezwoods
- 69. Haralick RM, Dinstein I, Shanmugam K. Textural Features for Image Classification. IEEE Transactions on Systems, Man and Cybernetics. 1973.
- 70. Hocking RR, Leslie RN. American Society for Quality Selection of the Best Subset in Regression Analysis Selection of the Best Subset in Regression Analysis. Source: Technometrics. 1967;9: 531–540. Available: http://www.jstor.org/stable/1266192%5Cnhttp://about.jstor.org/terms
- 71.
Burnham KP, Anderson DR. Model Selection and Multimodel Inference: A practical Information-theoretic Approach (2nd ed). Library of Congress Cataloging-in-Publication Data. Second Edi. Springer-Verlag. New York; 2002. https://doi.org/10.1016/j.ecolmodel.2003.11.004
- 72.
Akaike H. Information theory and the maximum likelihood principle. 2nd International Symposium on Information Theory. 1973.
- 73. Kadane JB, Lazar NA. Methods and criteria for model selection. Journal of the American Statistical Association. 2004;99: 279–290.
- 74. Hofmann M, Kontoghiorghes EJ, Colubi A. lmSubsets: Exact Variable-Subset Selection in Linear Regression for R. Journal of Statistical Software. 2019;In press.
- 75.
Team RC. R: A Language and Environment for Statistical Computing. Vienna, Austria. 2019.
- 76. Chaluvadi M, Raghunadh MV, Pullarao C, Kishore MS. Significance of Textural Features in Aerial Images. IETE Journal of Education. 2012;53: 9–20.
- 77. Ouma YO, Tetuko J, Tateishi R. Analysis of co-occurrence and discrete wavelet transform textures for differentiation of forest and non-forest vegetation in very-high-resolution optical-sensor imagery. International Journal of Remote Sensing. 2008.
- 78. Saatchi S, Buermann W, ter Steege H, Mori S, Smith TB. Modeling distribution of Amazonian tree species and diversity using remote sensing measurements. Remote Sensing of Environment. 2008.
- 79. Graf W, Kleinn C, Schall P, Nauss T, Detsch F, Magdon P. Analyzing the relationship between historic canopy dynamics and current plant species diversity in the herb layer of temperate forests using long-term Landsat time series. Remote Sensing of Environment. 2019.
- 80. George-Chacon SP, Dupuy JM, Peduzzi A, Hernandez-Stefanoni JL. Combining high resolution satellite imagery and lidar data to model woody species diversity of tropical dry forests. Ecological Indicators. 2019;101: 975–984.
- 81. Meng J, Li S, Wang W, Liu Q, Xie S, Ma W. Estimation of forest structural diversity using the spectral and textural information derived from SPOT-5 satellite images. Remote Sensing. 2016;8.
- 82. Gillon D, Dauriac F, Deshayes M, Valette JC, Moro C. Estimation of foliage moisture content using near infrared reflectance spectroscopy. Agricultural and Forest Meteorology. 2004.
- 83. Hall-Beyer M. Practical guidelines for choosing GLCM textures to use in landscape classification tasks over a range of moderate spatial scales. International Journal of Remote Sensing. 2017.
- 84. Warren SD, Alt M, Olson KD, Irl SDH, Steinbauer MJ, Jentsch A. The relationship between the spectral diversity of satellite imagery, habitat heterogeneity, and plant species richness. Ecological Informatics. 2014;24: 160–168.
- 85. Ozkan UY, Ozdemir I, Demirel T, Saglam S, Yesil A. Comparison of satellite images with different spatial resolutions to estimate stand structural diversity in urban forests. Journal of Forestry Research. 2017;28: 805–814.
- 86. Coburn CA, Roberts ACB. A multiscale texture analysis procedure for improved forest stand classification. International Journal of Remote Sensing. 2004;25: 4287–4308.
- 87. Silver M, Tiwari A, Karnieli A. Identifying vegetation in arid regions using object-based image analysis with RGB-only aerial imagery. Remote Sensing. 2019;11.