Figures
Abstract
Background
Advances in earth observation and machine learning techniques have created new options for forest monitoring, primarily because of the various possibilities that they provide for classifying forest cover and estimating aboveground biomass (AGB).
Methods
This study aimed to introduce a novel model that incorporates the atom search algorithm (ASO) and adaptive neuro-fuzzy inference system (ANFIS) into mangrove forest classification and AGB estimation. The Ca Mau coastal area was selected as a case study since it has been considered the most preserved mangrove forest area in Vietnam and is being investigated for the impacts of land-use change on forest quality. The model was trained and validated with a set of Sentinel-1A imagery with VH and VV polarizations, and multispectral information from the SPOT image. In addition, feature selection was also carried out to choose the optimal combination of predictor variables. The model performance was benchmarked against conventional methods, such as support vector regression, multilayer perceptron, random subspace, and random forest, by using statistical indicators, namely, root mean square error (RMSE), mean absolute error (MAE), and coefficient of determination (R2).
Citation: Pham MH, Do TH, Pham V-M, Bui Q-T (2020) Mangrove forest classification and aboveground biomass estimation using an atom search algorithm and adaptive neuro-fuzzy inference system. PLoS ONE 15(5): e0233110. https://doi.org/10.1371/journal.pone.0233110
Editor: Francisco Martínez-Álvarez, Pablo de Olavide University, SPAIN
Received: November 9, 2019; Accepted: April 28, 2020; Published: May 21, 2020
Copyright: © 2020 Pham et al. This is an open access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited.
Data Availability: Sentinel 1A is available for download at https://scihub.copernicus.eu/, and the other is available at DOI:10.5281/zenodo.3715333 (host by zenodo.org).
Funding: This research was funded by Vietnam Academy of Science and Technology under grant number VT-UD.08/18-20, which belongs to the National Program on Space Science and Technology (2016 - 2020).
Competing interests: The authors have no conflict of interest.
Introduction
Biomass, which includes above- and belowground biomass, is a critical component of carbon budget accounting and carbon monitoring, especially under the context of climate change [1,2]. Aboveground biomass (AGB) includes both live and dead material; estimation of the AGB of live trees has been more prominent in recent research. Accurate biomass quantifications are a prerequisite for a better understanding of the impacts of deforestation and environmental degradation on climate change [3]. The estimation of biomass is now crucial, as it is considered an essential source of energy in many countries. Technically, there are two biomass estimation methods. The destructive methods require tree cutting and further indoor weighing procedures [4]. These methods are limited to smaller areas and are usually employed to measure the biomass of sample plots that can also be used as ground truth samples. On the other hand, non-destructive measures take advantage of spatial technology to estimate AGB from a distance through backscatter or reflectance signals [1].
Mangrove forests cover a small portion of the global land area [5], but their carbon-rich ecosystems play a crucial role in sustaining the livelihoods of coastal communities [6], protecting coastal lines and inner land from storms and tsunamis [7], offsetting anthropogenically produced carbon dioxide, and contributing to carbon export to the ocean [8]. However, mangrove forests are threatened by human interactions that claim forest cover for aquaculture and agricultural activities. The estimated global loss of mangrove forests is approximately 0.16 to 0.39% [9], which poses a significant risk to the total carbon emission rate because of the considerable proportion of carbon storage in mangrove forests [9,10]. The surveillance of mangrove biomass is, therefore, crucial for estimating the potential carbon stored in these forests for global emission reduction programs.
Remote sensing (optical, radio detection and ranging- radar; light detection and ranging- Lidar) probably provides the best alternative for estimating AGB on a large scale and enables repetitive and rapid assessment of biomass over large areas relatively quickly and at a low cost, providing a more spatially comprehensive measure of forest biomass variation. Radar remote sensing enables surveying operations in all weather conditions [11–13], and it is usually used in combination with optical imageries to obtain complementary information about mangrove structure and biomass [1,14]. Among the radar bands, AGB can be effectively calculated by using the high-frequency L-band and P-band because of their penetration capability, as explained in the studies of [1,11,15] through the use of ALOS PALSAR data. From a global perspective, both active and passive remote sensing have become vital sources of data for mapping the spatial distribution of vegetation [16,17].
There is a growing body of research on the application of remotely sensed imageries, machine learning algorithms, and geospatial information technology in AGB estimation [18,19]. The underlying scientific background is to understand the correlation between backscatter and the reflectance of remotely sensed data and live biomass over a given area [20,21]. The methods for AGB estimation are diverse. For example, the application of inversion of the PROSAIL model estimates AGB by using leaf dry matter content and leaf area index [22]. The combination of vegetation indices with radar data has also been investigated in several works [1,23]. From reviewing the literature, several types of research were found that employed machine learning models to estimate the AGB. As examples, statistical and data-driven approaches have been proposed, such as multiple regression in the study of [13,24], geographically weighted regression [25], and support vector regression and random forest in [1,23,24].
Recently, metaheuristic algorithms have gained considerable popularity because they are capable of searching for the optimal parameters of classifiers in image classification and disaster susceptibility mapping by solving objective functions that are differently defined case to case. Three typical types consist of physically based, swarm intelligence, and evolutionary algorithms that mimic the behaviors or mechanisms of natural events to mathematically model artificial applications [26–28]. New models are being examined for their simplicity, flexibility, and ability to solve complex nonlinear problems. However, few of these models can be found for biomass estimation in such a way that the optimization algorithm supports the improvement of the performance of the classifiers that are generally trained by conventional methods. Currently, new networks and models are continuously being developed for various applications, many of which are available as open-source libraries [13]. To the best of our knowledge, the use of metaheuristic algorithms in biomass studies is still limited.
Although there is a vast number of studies on applications of machine learning algorithms in biomass estimation, there are no models that fit all problems. Moreover, the search for optimal machine learning models is crucial to contribute to global knowledge in the field of forest management. This study aimed to investigate a novel combination of atom search optimization algorithms and adaptive neuro-fuzzy inference systems in classifying mangrove forests and estimating AGB. Ca Mau Province, a coastal area in southern Vietnam, was selected as a case study because of its diverse ecosystems and its role in protecting the coastal zone. This hybrid model was validated by using common statistical indicators, namely, root mean square error (RMSE), mean absolute error (MAE), and coefficient of determination (R2) and was benchmarked by regression models that had been used for mangrove studies, such as multilayer perceptron (MLP), support vector regression (SVR), random subspace (RS) and random forest (RF). The geospatial database was processed by Quantum GIS (QGIS), the segmentation process, and SPOT-6, and its derived indices were analyzed by using PCI Geomatica 2018 Service Pack 2. The Sentinel-1A imagery was processed by Sentinel Application Platform (SNAP) from the European Space Agency, and the model was coded in MATLAB R2018b. SVR, RF and multilayer perceptron were implemented in Weka version 8.3.
Study area and data
Description of the study area
In Vietnam, mangrove forests are mainly distributed in the northeastern and northern delta and the central and southern delta, with the densest area in the U Minh National Park. As the importance of mangrove forests has become widely recognized, the area has been regrown and expanded through national and international projects. However, the transformation of mangrove forests into aquaculture and other economic activities continues in some areas, and it makes the coastal zone more susceptible to natural hazards such as salinity intrusion, drought, coastal erosion and flooding [2,3].
Ca Mau Province is on the southernmost coast of Vietnam, which is located on the Mekong Delta and extends between the latitudes of 8°34’N and 9°33’N and longitudes of 104°43’E and 105°25’E (Fig 1). Ca Mau is characterized by many rivers and canals and low and flat terrain, and it is periodically flooded. Influenced by the tropical monsoon climate near the equator, the regional weather can be divided into two seasons: the rainy (from May to November) and dry (from December to April) seasons. The study area is characterized by an average temperature of 26.5°C, an annual average rainfall of approximately 2,360 mm, an annual average evaporation of 1,022 mm and an annual average moisture of 85.6% (http://www.camau.gov.vn).
Background spatial data were collected from https://gadm.org/ and processed by the authors.
The Ca Mau mangrove forest is the second most pristine forest in Vietnam, both in terms of species composition and biomass, with an entire area of approximately 69,000 ha [2]. It is located mostly in the Ngoc Hien and Nam Can districts, and the remaining area is situated in the Dam Doi, Phu Tan, Tran Van Thoi, and U Minh districts. Most of the forest area is in the Ca Mau Cape Biosphere Reserve (41,862 ha). Surrounded by the sea and 249 km of coastline, the forest is considered an erosion barrier. The forest is also the green lung of the whole southeastern region, which plays a role in climate harmonization, ecological balancing, and environmental protection. Ca Mau has diverse mangrove species, among which the most saline-tolerant plants are in Avicenniaceae (Avicennia alba, Avicennia marina) and Rhizophoraceae (Rhizophora apiculata, Rhizophora mucronata, Bruguiera gymnorrhiza), Lumnitzera racemosa, and Excoecaria agallocha. Among these, Rhizophoraceae is the most popular species, so the forest is also called Rhizophora forest. Another aspect relating to the management of specific uses of certain types of mangrove forest includes the zoning of the mangrove forest into functional zones. In this regard, six distinct types of forest have been defined, namely, natural Rhizophora forest, natural mixed Avicennia/Rhizophora forest, naturally regenerated Avicennia forest, Rhizophora plantation forest (mainly Rhizophora apiculata and Rhizophora mucronata), Avicennia plantation forest (mainly Avicennia alba and Avicennia marina), and other mangroves forest and shrubs.
Data used
Predictor variables from Sentinel-1A and SPOT-6 imagery.
There are numerous studies on the uses of active radar at the L-band (1 GHz ÷ 2 GHz), C-band (4 GHz ÷ 8 GHz), and P-band (300 MHz ÷ 1 GHz) in estimating the AGB of mangrove forests. However, the selection of the radar bands is subject to the availability of input data from either commercial satellites or free sources. In this paper, the Sentinel-1A C-band (ground range detected product; interferometric wide swath mode; 250 km swath width; 5 × 20 m spatial resolution; VV: vertical transmit–vertical receive, and VH: vertical transmit–Horizontal receive dual polarization) was acquired on March 23, 2015. High-resolution SPOT-6 satellite imagery (Satellit Pour l’Observation de la Terre 6—February 8, 2015, with 6 m multispectral resolution capability) data were used to estimate the biomass of the mangrove forests in Ca Mau Province. The Sentinel-1A and SPOT-6 multispectral data were acquired in February/March 2015 at the same time as the field measurements were conducted. Since the spatial resolutions of Sentinel-1A (5 m x 20 m) and SPOT-6 (6 m) are technically different, the SAR data were resampled in the multispectral image to 6 m resolution by the Bilinear resampling technique, which computes new pixels using linear interpolation. This process was implemented after the preprocessing of the raw dataset by using PCI Geomatics 2018 software.
The Sentinel-1 data were preprocessed using the Sentinel-1 toolbox (S1TBX) embedded in the SNAP desktop application (version 6) from the European Space Agency (http://step.esa.int). The orbit file with accurate satellite and velocity information was also used for this step and included (i) radiometric calibration of the backscatter representation of the reflecting object (converted from digital number (DN) values to σo values); (ii) speckle filtering for speckle suppression using the Lee adaptive filter (with a window size 7×7) [13]; (iii) terrain correction using the DEM data at a 5 m spatial resolution from the Vietnam Ministry of Natural Resources and Environment to correct for the SAR geometric distortions; and projection of the images into the WGS 84 coordinate system in the UTM zone 48N projection.
The multispectral SPOT-6 data (blue, green, red, near-IR) were obtained from optical satellite sensors with a high spatial resolution of 6 meters. The study area had a cloud cover of less than 10%. The multitemporal satellite image was calibrated, and the radiation/atmospheric effects were removed by using the ATCOR (atmospheric correction) function integrated with PCI Geomatics 2018 software. The processing consisted of three parts: (i) top-of-the-atmosphere reflectance; (ii) haze removal and cloud masking; and (iii) ground reflectance atmospheric correction [29]. These images were projected into the WGS84 coordinate system/UTM zone 48N projection with ground-control points and orthorectified by using DEM data, which ensured a geometric correction accuracy of approximately ±0.5 pixels.
Even though the limitation of the C-band in interacting with the more profound components of the forest was mentioned, few works have investigated the potential uses of Sentinel-1A with a variety of optical image indices [30,31]. This study is a continuation of research on the C-band from Sentinel-1A in mangrove forest estimation, in which the combination of polarizations was proposed, such as HH, HV, HH-HV, and HH/HV, as has been suggested in many studies [1,13,15]. The structure of mangrove forests (open or closed) and the water level conditions on the acquisition day influence the volume backscatter, double-bounce backscatter and HH, HV, and VV polarizations. As a supplement, the optical imageries provide useful information about mangrove conditions by transforming spectral bands to enhance the contribution of the vegetation properties or chemical components of the leaves. A wide variety of vegetation indices that differ from each other in their transformation equations and required objectives were used in this study, as shown in (Table 1). Such indices have also been suggested to have significant contributions to the overall AGB estimation, as in the studies of [1,20,21,23]. Forty-two predictor variables were generated in Table 1 and the average values (based on the centers of sample plots and plot sizes) were used as an input database for the analysis workflow, as shown in Fig 3.
Field survey dataset.
The field survey was carried out from January to March 2015 in the Ca Mau district and was authorized by the Mui Ca Mau National Park and local administration. The investigations followed the guidelines issued by the Ministry of Agriculture and Rural Development with sampling plots with areas of 100 and 1,000 m2. The plot locations were randomly selected across the study area and provided the best description and measurement of the plot condition, canopy coverage rate, the total number of trees in each plot, average diameter, average cross-section, and tree heights. Fig 1 shows the coordinates of the center of each plot. Afterward, the AGB was statistically estimated for each plot by using single tree allometry and plot-aggregated allometry to quantify all measured trees in the plot. Two genera, Avicenniaceae (Avicennia alba, Avicennia marina) and Rhizophoraceae (Rhizophora apiculata, Rhizophora mucronata, Bruguiera gymnorrhiza), dominated the area, with an average density of 2,830 trees per ha, an average diameter varying from 6.9 cm to 19 cm and an estimated biomass between 40 and 340 Mg ha-1. The field estimation of AGB was calculated based on the estimation equations from [1,56–58], as specifically shown in Table 2.
Background of the algorithms used
Adaptive neuro-fuzzy inference system
Since the first study of [59], the ANFIS has been widely used to solve numerous problems, either in image classifications [27,60] or regression applications [61]. This method integrates fuzzy inference into conventional neural networks and inherits the benefits of both. A typical structure of the ANFIS is briefly described in Fig 2, and a more detailed explanation of each layer set can be found in various studies, such as [59].
Rule k: IF x1 is AND x2 is …. AND xm is THEN Pi can be estimated through several steps as described in the following layers:
Layer 1: This layer consisted of the training data and 42 associated predictor variables (as presented in Table 1) or the remaining variables after the feature selection process. As a preliminary step, a clustering method was used to define the optimal clusters for this training dataset. These data were fed into Layer 2 with the following membership function: (1) where i = 1:m, in which m is the number of input variables, and j = 1: k, where k is the number of clusters as well as the number of rules in this study. The determination of k is carried out by trial-and-error process; xi indicates the input variables (42 independent variables in Table 1 or the variables after the feature selection process); is the linguistic label, and is membership value that defines how much of factor (x) belongs to . Pi is the predicted value; and aij, bij, and cij are adaptive parameters to be adjusted by using the ASO.
Layer 3: The weights in this layer are calculated by using the following equation: (2)
Layer 4: Weights are normalized in this step by
Layer 5: This adaptive layer takes the sum of linear functions multiplied by the normalized weights in the previous step. The equations are as follows: (3) (4) (5) where and are to be adjusted at the same time as aij, bij, and cij by the ASO. Oi is the observed value (ground-truth value). n is the training size. Eq 5 is used as the objective function for the ASO search. In general, the number of adaptive parameters is calculated based on the number of input features and the number of clusters and is presented as follows: No. of parameters = m * k * 3 + (m + 1) * k, in which 3 represents aij, bij, cij of the membership functions as described in Eq 1. These parameters are tuned by the ASO as described in the next section.
Atom search optimization
First introduced by [62], the ASO is formulated based on the molecular dynamics in which all atoms in search spaces interact with each other through attraction and repulsion forces. The equilibrium stage is achieved when two forces are equal at a distance rij = 1.12σ. Technically, the mass of a defined atom represents a solution, where a heavier atom is better than a lighter one. Atom masses are influenced by the movement of all other atoms. The ASO can be simply described as follows:
The objective function was defined in Eq 5 to minimize the RMSE of the regression function. To solve this minimization problem, an atom population of n in d dimensional space was proposed. That was , i = 1…n. where rij is the Euclidean distance between atom i and atom j in d dimensional space. The positions of the atoms were randomly generated in the d dimensional space xi(i = 1..n), and d is equal to the adaptive parameters of the ANFIS. Atoms interact with each other by the force defined in Eq 6, and the total interaction is represented in Eq 7. (6) (7) where U(rij) is the Lennard-Jones (L-J) potential between atoms i and j and σ is the length scale that denotes the collision diameter. This potential U(rβ|) controls how the atoms interact and therefore determines the positions of the atoms after each iteration.
The mass of the atoms is recalculated after each iteration by using Eq 8 and Eq 9: (8) (9)
The interaction force Fi and acceleration are calculated after each iteration. From this stage, the positions and velocities of all atoms are updated by using the following: (10) (11) where are random values between [0] and [1].
The searching process terminates when the predefined maximum number of iterations is reached or the objective function reaches a smaller than desirable value. By integrating them into the ANFIS, the optimal parameters of the ANFIS were defined for calculating biomass for the whole study area.
Feature selection
Feature selection has considerable impacts on the performance of classification or regression models through the elimination of irrelevant variables, handling multicollinerity [63] and might effectively boost the operation [1,27]. It is also necessary to compare classifiers in an unbiased manner. The simple Spearman correlation, which assesses monotonic relationships, i.e., linear or nonlinear, has sometimes been used, but it has some specific limitations in the nonlinear relationship. In some cases, even the Spearman coefficient indicates that two features are highly correlated in low dimensional space; i.e., they are linear or nonlinear, which can provide very different information in high-dimensional space. In this study, several feature selection methods were examined, such as Relief Attribute Evaluation [64,65], subclass evaluation [65], Correlation Attribute Evaluation [26,65], and the genetic algorithm [27,65]. Generally, the features were chosen by gradually adding more features until the performance of the classifier started to drop. The use of cross-validation to calculate precision will probably reveal that different classifiers choose different feature combinations. The features that are selected by these methods will be examined by the proposed hybrid model, and the detailed assessment is in the next section.
Performance assessment
This study aimed to find the best fit model but to ensure that the model would not be overfitted. This was achievable by estimating the expected prediction error. Therefore, to eliminate overfitting problems, more training datasets were required, and different sampling methods were used. Typically, there are three common ways to train and validate a model: (1) The hold-out method randomly divides the points in the training set into roughly 70% for training and 30% for validation, and (2) k-fold cross-validation (CV) randomly divides the training set into k equal folds. In this case, previous studies have shown that ten-folds is the optimal number for this method [23]. (3) Leave p-out cross-validation with p equal to 1 is usually applied. Each method requires a specific size of the training set, and in this case study, the sampling size was large enough, so the hold out method was applied.
In general, for a regression study, the coefficient of determination (R2) is useful for explaining the explanatory power of independent predictor variables. It is a common indicator that has been used in all AGB estimation studies, such as in [1,13,15,23]. In other words, R2 gives a sense of how well the model can explain the input dataset. Initially, R2 ranges from 0 to 1, but negative values of (R2) are found in some cases. This situation occurs when the model is even worse than a linear regression.
On the other hand, the RMSE (Eq 5) is useful for understanding the accuracy and precision of the estimation model by comparing the predicted data (in this case, AGB) to the observation data (in situ measures in each plot). The RMSE can be used either in classification or regression as an optimal objective function of the optimization process [27]. The drawback of this value is that it is sensitive to large errors so that a preliminary screening of input data should be performed to remove any outliers. Similarly, MAE also provides an average prediction error with negatively oriented scores, which means lower values are better. Depending on the actual dataset, the MAE and RMSE might vary differently and should not be used as comparative indicators between estimation methods.
Finally, this hybrid model was benchmarked with machine learning methods that had been used in previous mangrove studies, including MLP, SVR, and RF [20]. For this current dataset, the parameters for SVR, including the kernel width (γ) and regularization (C), were defined through a grid search. On the other hand, the determination of several trees impacted the speed of search and accuracy of the result. From trial and error tests, 500 trees was the optimal parameter for this selected RF method.
Proposed methodology for mangrove forest classification and aboveground biomass estimation
This flowchart of Fig 3 shows the step-by-step procedure starting from image processing, which was followed by the vegetation index calculation, and then the classification and AGB estimation were implemented based on the input dataset from previous steps by using the proposed machine learning method. Detailed explanations are described as follows:
Step 1: This step involved the preprocessing of Sentinel-1A and SPOT-6 imagery, including atmospheric correction, noise removal, image rectification, and image index calculation. This step was implemented by using SNAP, which was developed by the European Space Agency, and PCI Geomatics, a program that was created by the Canada Centre for Remote Sensing. The output from this step was a layer stack of 42 predictor variables in the UTM-WGS 84 projection with a spatial resolution of 6 m for the SPOT-6 data and 10 m for the Sentinel-1A data, which is resampled to 6 m by using SPOT-6 as the spatial reference. A short description of this step was explained in the previous section.
Step 2: The inclusion or exclusion of certain predictor variables is subject to the nature of datasets (forest type, growing stage, and geographical locations) and specific algorithms. This means that, among all predictor variables that can be collected for AGB, different algorithms might result in different combinations of variables based on the observed dependent ground truth values. Therefore, this step is vital for filtering out redundant features that might have negative influences on the predictive performance of the regression methods. This step is the iteration process, in which features are alternatively selected by feature selection methods. Each selected subset is used to run the proposed hybrid model and benchmarked algorithms for comparison.
Step 3: The input data from Steps 1 and 2 were used for the AGB estimation and the classification of the mangrove forests. Since the long-term objective of this work was to quantify the structure of each forest type in a time-series manner, the spatial distribution of each species was required. The process was carried out with the use of the ASO-ANFIS for the forest cover classification into six mangrove types, as mentioned in the previous section. The classified map was overlaid on the AGB map to extract the AGB for each type.
Step 4: For any selected subset of features, the ASO searched for the optimal parameters of the ANFIS, in which RMSE (Eq 1) was used as the objective function. The maximum number of iterations and the dimensional space were defined by the structure of the ANFIS, as presented in Fig 2. The hybrid model started with the preliminary initialization of the model parameters of the model (determination of atoms). The algorithm observes the movement of atoms, calculates the fitness value for each, and identifies the best position of atoms. The process iterates until the desirable condition is met. This process resulted in the smallest RMSE, and the fine-tuned parameters of the ANFIS were used to estimate the AGB for the entire study area in Ca Mau. For benchmarking, other methods were also examined against similar training subsets for the performance comparison. A brief description of the ASO and ANFIS is provided in the next section.
Mangrove forest classification
The main focus of this work was to monitor the changes in AGB across the most preserved region in Vietnam, as mentioned in the previous section. For that reason, two independent processes were implemented in parallel, which were the classification of forest cover and the AGB estimation; the latter was the main focus. Therefore, the classification was a minor focus, and result accuracies could be tolerated. In this regard, PCI Geomatics was used for image segmentation with the determination of scale = 15, compactness = 0.5, and shape = 0.8 after several trials. A total of 998 polygons, which were randomly selected, were manually assigned to one of the six classes and used for model training, and 300 polygons were used for validation.
The classification was carried out by using the ASO-ANFIS, and the result was compared to those from the most common classifiers, such as RF and SVM. The training and validation set was built from segmented objects with associated attributes from SPOT data, such as multiple spectral bands, NDVI, and texture analysis images [66]. The overall classification accuracy for the ASO-ANFIS method was 86.7%, RF was 84.8%, and SVM was 82.6% (Table 3). The kappa (kappa coefficients) for the ASO-ANFIS was 0.84, that for RF was 0.82, and that for the SVM was 0.81. Fig 4 presents the results of the image classification by the ASO-ANFIS, including six mangrove classes with different densities. The natural Rhizophora forest area was 3,436.42 ha, the natural mixed Avicennia/Rhizophora forest area was 4,323.15 ha, the naturally regenerated Avicennia forest area was 1,026.21 ha, the Rhizophora plantation forest area was 26,463,05 ha, the Avicennia plantation forest area was 549.77 ha, and the other mangrove forest and shrub areas were 33,121.39 ha. The study area retains vast mangrove resources and is a prestigious World Biosphere Reserve in Vietnam.
Background data were collected from https://gadm.org/ and processed by the authors.
Feature selection and AGB estimation
The training data (110 samples) and validation (48 samples) data were randomly split from 158 samples, with basic descriptive statistics such as mean = 181.9 Mg ha-1 and SD = 99.33 Mg ha-1for the training set and mean = 159.9 Mg ha-1 and SD = 91 Mg ha-1 for the validation set. Initially, the mean of the training set was higher than that of the validation set, and there was more variation in the biomass for the training set than for the validation set.
On the other hand, the configuration of the proposed model, specifically the number of weights to be tuned, is subject to the number of features that will be fed into the ANFIS. Table 4 represents the features, which were selected by different feature selection methods (implemented by using Weka software), and the last column shows the number of parameters of the ANFIS to be tuned by the ASO-ANFIS. In this regard, the atoms (ASO algorithm) were initiated in 405-dimensional space as the result of the Relief Attribute Evaluation method, 65 as the result of CF subclass evaluation, 445 as the result of Correlation Attribute Evaluation, and 205 as the result of the GA. The search mechanism of the ASO iterated 200 times and ended up with a validated RMSE (objective function), as shown in Table 5. The results in Table 5 showed that the highest R2 (0.58) was produced by the ASO-ANFIS, with the selected features from the genetic algorithm, which outperformed the benchmarked methods, including RF and SVR.
The feature combinations from Table 4 were used in the first layer of the ANFIS (Fig 2), and the RMSE values from the use of different repressors are shown in Table 5. The results showed a highest R2 value of 0.58, which was produced by the ASO-ANFIS with the selected features from the genetic algorithm. This process resulted in ten features, including two from the Sentinel-1A and eight spectral indices.The scatter plots are shown in Fig 5. The selection of VV for AGB estimation also comes along with the works of [1,67], in which VV was found to be sensitive to the increase in biomass and mangrove forest structure. The TSAVI was calculated using a soil adjustment factor [46]. WDRVI is an NDVI type, but NIR (this band was used to measure WDRVI) was rescaled by a factor ranging from 0.1–0.5. This index increased the linearity between the biomass and NIR, thus reducing the sensor saturation [68]. The contribution of VH was reflected in the ratio between VH and VV, and the remaining indices, which had been proven useful in previous studies [1,14], had a significant contribution to the overall estimation.
Scatter plots of 10 selected features by GA against the AGB of the sample plots: a) CI, b) EVI, c) IF, d) VH/VV ratio, e) SIPI3, f) TSARVI, g) VIN, h) VV, i) WDRVI, and k) WDVI.
Table 5 shows the estimation accuracies from the machine learning methods with different combinations of features. With ten features from GA, the ASO-ANFIS generated the highest R2 at 0.577 (rounded up to 0.58) and the smallest RMSE and MAE values of 70.88 and 55.458, respectively, among other regression methods. The two ensemble algorithms were followed by the regression methods of RF (RMSE = 82.227, R2 = 0.503) and RS (RMSE = 84.406, R2 = 0.484). SVR (R2 = 0.24) and MLP (R2 = 0.34) had the worst performance among the regression methods. However, SVR (R2 = 0.33) and MLP (R2 = 0.40) performed better with 20 selected features from the Relief Attribute Evaluation method. In addition, a test of statistical significance was also implemented, as shown in Table 6, in which null hypothesis H0 was the equality of performance. The p-values were all smaller than 0.05 (5%), so the differences were significant.
The top three highest R2 values were from the ASO-ANFI, RS, and RF by using ten selected features with the GA method, and the scatter plots between the predicted AGB and observed AGB are shown in Fig 6. As shown in Fig 6(C), the RS underestimated the AGB for the entire validation dataset. It can also be seen that for values lower than 100 Mg ha-1, the ASO-ANFIS and RF were likely to overestimate the AGB, and after this value, the two methods seemed to underestimate the AGB. This situation occurred due to the saturation level of the C-bands, and the spectral reflectance can only partly offset the saturation effect.
a) validation dataset, b) training dataset, and c) estimated AGB in ascending order of observed AGB.
From the map (Fig 7), the AGB was grouped into the five following classes: (i) less than 50 Mg ha-1, (ii) from 50 to 100 Mg ha-1, (iii) from 100 to 150 Mg ha-1, (iv) from 150 to 200 Mg ha-1, and (v) over 200 Mg ha-1. With AGB values between 56.72 and 339.85 Mg ha-1 (average = 125.63 Mg ha-1), the predicted spatial pattern of the AGB value was consistent with actual observations [2,68]. The non-mangrove forested areas, such as bare land and agricultural land, were removed from the result of the AGB map.
Background spatial data were collected from https://gadm.org/ and processed by the authors.
As shown in Fig 7 and Table 7, the area with an AGB over 200 Mg ha-1 was approximately 16,066.84 ha (approximately 23.31% of the mangrove forests of the study area), which was mainly distributed in the Rhizophora plantation forest (8,277.09 ha), natural Rhizophora forest (3,217.85 ha), natural mixed Avicennia/Rhizophora forest (2,712.74 ha), and other mangroves forest and shrubs (1,032.42 ha). Approximately 12,020.95 ha within the AGB value of 150–200 Mg ha-1 was found in the Rhizophora plantation forest (6,960.27 ha), other mangrove forests, shrubs (3,326.72 ha), and natural mixed Avicennia/Rhizophora forest (1,044.14 ha). The largest areas of moderate and low AGB (smaller than 150 Mg ha-1) were predominantly observed in the Rhizophora plantation forests and other mangrove forests and shrubs.
The spatial distribution of the mangrove forest was observed in two regions: (i) in the western-southwestern area, this was the new alluvial land with low and flat terrain and a flooding depth of over 80 cm and (ii) the eastern coastal area of the Ngoc Hien, Nam Can and Don Doi districts. The mangrove forests here grow on acidic and highly compacted soil due to the erosion impacts from the influence of coastal currents and waves. The area was mainly Rhizophora plantation forest and mixed Avicennia/Rhizophora forest. Unlike in the western-southwestern area, in the eastern coastal area, the area of natural mangrove forests was in very low land, and there was no natural Avicennia forest (pioneering forest species encroaching on the sea).
Mapping AGB is a major concern at the global scale and for many developing countries as it is a challenging task because of the lack of field data [69]. For a given ecosystem, these maps can be used for forest monitoring, deforestation, forest degradation, and other forest-related industries, such as conservation, sustainable management, and increased carbon storage [70]. Carbon accumulation in mangrove forests is influenced by tree density, tree species, tree age, organic decomposition in soil, and regular submergence tides. Frequent tidal inundation and the degree of organic decomposition in the anaerobic environment are key factors enabling the mangrove forest in Ca Mau to become a greenhouse-gas reservoir. Therefore, the protection of the vast carbon storage in mangrove forests and on the peatlands in Ca Mau, Vietnam and throughout Asia, in general, is crucial to prevent the release of carbon dioxide and methane into the atmosphere.
Search strategy of the ASO
The hybrid model outperformed the benchmarked regression methods by comparing three common indicators, as shown in Table 5. This result was achieved through the robust search mechanism of the ASO in tuning parameters of the ANFIS. Indeed, the parameters of the ASO also influenced how the search operated. Fiver clusters were determined in the ANFIS and two parameters in the ASO were defined through the trial and error process. Fig 8(A) shows the RMSE variation curve after 2000 iterations. The exploration search can be realized by the sudden jumps in the graph (almost vertical lines in the curve) with large variation at the beginning stages.
a) Variation in the RMSE values of the validation plots with the use of the ASO and (b) variations in RMSE by the number of features using the ASO.
Fig 8(B) shows another aspect of the ASO operation in combination with feature selection by GA. A feature selection solution was represented by a 1 x 42 dimension vector with binary values of 1 or 0, in which the selected features were represented by 1 and vice versa. For each iteration step in the GA, the ASO-ANFIS was triggered to search for the best RMSE. The horizontal axis represents the number of features that were selected during the search, regardless of how the subsets were determined. The y-axis shows the best RMSE value among the values that were generated by the ASO-ANFIS model.
Potential uses of C bands for mangrove AGB estimation
There have been a large number of studies on the application of the L-band in AGB estimation that have investigated the relationship of backscatter signals, their transformed ratios, and the field estimation of AGB. The penetration capability of the L-band makes it useful for measuring tree cover, canopy height, and, consequently, in estimating AGB because of the correlation among these factors. The C-band has several drawbacks in comparison to the L-band in AGB applications because of its limited ability to penetrate forest canopies. The dependency of AGB estimation on the SAR wavelength has been reported in numerous studies, which have focused on the discussion of the saturation level, in which the saturation level of the C-band is typically low in AGB estimation (50–70 Mg ha-1) and the level of the L-band is approximately 100–150 Mg ha-1 [1,15]. In this study, the AGB mean of the sample plots was higher than the saturation level of the Sentinel-1A dataset, which might result in underestimation in the high AGB area. However, this situation can partly be minimized with the contribution of the optical data to the AGB estimation, as multispectral reflectance offsets the saturation effect [12,14]. The study of [71] successfully sampled plots with values higher than 300 Mg ha-1 by using SPOT-6 or the combination of Sentinel-1 and Sentinel-2 imagery to produce reliable AGB of approximately 200 Mg ha-1 [30]. The NIR, red and green bands play a key role among optical data for AGB estimation [14] because of their strong interaction with trees, and they were the main components for the measurements of the vegetation-derived features as described in Table 4. There have been limited studies on the combination of Sentinel-1A and SPOT-6-derived vegetation indices, and machine learning methods for AGB estimation and the uses of such datasets were the main objective of this study.
From the experiment, it could be noticed that the combination of vegetation indices from multispectral imageries with SAR improved the accuracy of AGB estimation, even though there was still underestimation (in plot samples of having AGB higher than around 120Mg ha-1) and overestimation (in plot samples of having AGB smaller than around 120Mg ha-1) of the AGB (Fig 6C).
This paper presented a novel model for advancing atom search optimization in searching for the parameters of the ANFIS by using 158 sample plots. The highest R2 (0.577) in this study was satisfactory compared to the (R2 = 0.28–0.44) in [72], (R2 = 0.596) in [1] with L-band PALSAR, (R2 = 0.46) in [3], even though the C-band was not as good as the L-band because of its limited ability to penetrate the canopy. Then, the proposed model (ASO-ANFIS) with ten selected features was used to generate the AGB map along the coastal area of Ca Mau Province (Fig 7).
Conclusions and further remarks
Ca Mau Province is the largest reserved area for the regrowth and expansion of mangrove forests in Vietnam. The estimation results from the proposed methods showed a spatial variation in biomass that ranged from less than 40 Mg ha-1 to 339.85 Mg ha-1. This result was consistent with the in situ plot samples, with a considerable correlation between the estimated values and observed values. As the AGB is an important estimation indicator in sustainable forest management, a timely estimation of it is crucial to monitor the surface changes or potential loss and degradation of the area’s mangrove ecosystems.
The ASO significantly improved the performance of the ANFIS regression through comparison with benchmarked functions by using common statistical indicators with different combinations of features. The best values were found at RMSE = 70.882, MAE = 55.458, and R2 = 0.577. Another essential concept is that feature selection played an essential role in defining the most critical predictor variables before running any regression methods. This study investigated the potential uses of both optical and radar datasets, and it was found that the combination of both types of data is crucial in eliminating the saturation effect and in improving the estimation accuracy. The backscatter information from radar data and vegetation indices were evaluated to determine how they drove the changes in tree structures and associated AGB. The optimal selections are subject to the frequency of the radar dataset (X, L, P, or C bands) and spectral information of the optical data (multiple spectral bands).
This paper investigates artificial intelligence as machine learning methods and has become a trending topic because of its broad applications in almost all research fields. The increase in computational capacity and multiple sensor platforms have made geolocated data overwhelmingly available for spatial analysis. For the understanding of carbon flux, machine learning is predominantly applied to the methods used in the regression of spectral reflectance and backscatter of remotely sensed satellite imagery to the in situ measurements of AGB. This paper aimed to investigate novel hybrid machine learning algorithms for data fusion and spatial and temporal modeling for biomass estimation in the coastal area of Ca Mau Province in Vietnam. The findings are practically relevant, and the methodology is scientifically sound. This research is a novel approach and contributes to global knowledge in the field of forest cover estimation.
Acknowledgments
The authors thank all anonymous reviewers for their critical and constructive comments, which have improved the quality of the manuscript.
References
- 1. Pham TD, Yoshino K, Le NN, Bui DT (2018) Estimating aboveground biomass of a mangrove plantation on the Northern coast of Vietnam using machine learning techniques with an integration of ALOS-2 PALSAR-2 and Sentinel-2A data. International Journal of Remote Sensing 39: 7761–7788.
- 2. Tue NT, Dung LV, Nhuan MT, Omori K (2014) Carbon storage of a tropical mangrove forest in Mui Ca Mau National Park, Vietnam. CATENA 121: 119–126.
- 3. Quoc Vo T, Kuenzer C, Oppelt N (2015) How remote sensing supports mangrove ecosystem service valuation: A case study in Ca Mau province, Vietnam. Ecosystem Services 14: 67–75.
- 4. Chabi A, Lautenbach S, Orekan VOA, Kyei-Baffour N (2016) Allometric models and aboveground biomass stocks of a West African Sudan Savannah watershed in Benin. Carbon Balance and Management 11: 16. pmid:27594897
- 5. Giri C, Ochieng E, Tieszen LL, Zhu Z, Singh A, et al. (2011) Status and distribution of mangrove forests of the world using earth observation satellite data. Global Ecology and Biogeography 20: 154–159.
- 6. Barbier EB, Hacker SD, Kennedy C, Koch EW, Stier AC, et al. (2011) The value of estuarine and coastal ecosystem services. Ecological Monographs 81: 169–193.
- 7. Danielsen F, Sørensen MK, Olwig MF, Selvam V, Parish F, et al. (2005) The Asian Tsunami: A Protective Role for Coastal Vegetation. Science 310: 643. pmid:16254180
- 8. Jennerjahn TC, Ittekkot V (2002) Relevance of mangroves for the production and deposition of organic matter along tropical continental margins. Naturwissenschaften 89: 23–30. pmid:12008969
- 9. Hamilton SE, Casey D (2016) Creation of a high spatio-temporal resolution global database of continuous mangrove forest cover for the 21st century (CGMFC-21). Global Ecology and Biogeography 25: 729–738.
- 10. Donato DC, Kauffman JB, Murdiyarso D, Kurnianto S, Stidham M, et al. (2011) Mangroves among the most carbon-rich forests in the tropics. Nature Geoscience 4: 293.
- 11. Lucas RM, Mitchell AL, Rosenqvist A, Proisy C, Melius A, et al. (2007) The potential of L-band SAR for quantifying mangrove characteristics and change: case studies from the tropics. Aquatic Conservation: Marine and Freshwater Ecosystems 17: 245–264.
- 12. Vafaei S, Soosani J, Adeli K, Fadaei H, Naghavi H, et al. (2018) Improving Accuracy Estimation of Forest Aboveground Biomass Based on Incorporation of ALOS-2 PALSAR-2 and Sentinel-2A Imagery and Machine Learning: A Case Study of the Hyrcanian Forest Area (Iran). Remote Sensing 10.
- 13. Van Pham M, Pham TM, Viet Du QV, Bui Q-T, Van Tran A, et al. (2019) Integrating Sentinel-1A SAR data and GIS to estimate aboveground biomass and carbon accumulation for tropical forest types in Thuan Chau district, Vietnam. Remote Sensing Applications: Society and Environment 14: 148–157.
- 14. Navarro AJ, Algeet N, Fernández-Landa A, Esteban J, Rodríguez-Noriega P, et al. (2019) Integration of UAV, Sentinel-1, and Sentinel-2 Data for Mangrove Plantation Aboveground Biomass Monitoring in Senegal. Remote Sensing 11.
- 15. Hamdan O, Khali Aziz H, Mohd Hasmadi I (2014) L-band ALOS PALSAR for biomass estimation of Matang Mangroves, Malaysia. Remote Sensing of Environment 155: 69–78.
- 16. Aslan A, Rahman AF, Warren MW, Robeson SM (2016) Mapping spatial distribution and biomass of coastal wetland vegetation in Indonesian Papua by combining active and passive remotely sensed data. Remote Sensing of Environment 183: 65–81.
- 17. Huang X, Ziniti B, Torbick N, Ducey JM (2018) Assessment of Forest above Ground Biomass Estimation Using Multi-Temporal C-band Sentinel-1 and Polarimetric L-band PALSAR-2 Data. Remote Sensing 10.
- 18. Sileshi GW (2014) A critical review of forest biomass estimation models, common mistakes and corrective measures. Forest Ecology and Management 329: 237–254.
- 19. Mohd Zaki NA, Abd Latif Z (2017) Carbon sinks and tropical forest biomass estimation: a review on role of remote sensing in aboveground-biomass modelling. Geocarto International 32: 701–716.
- 20. Pham LTH, Brabyn L (2017) Monitoring mangrove biomass change in Vietnam using SPOT images and an object-based approach combined with machine learning algorithms. ISPRS Journal of Photogrammetry and Remote Sensing 128: 86–97.
- 21. Ghosh SM, Behera MD (2018) Aboveground biomass estimation using multi-sensor data synergy and machine learning algorithms in a dense tropical forest. Applied Geography 96: 29–40.
- 22. He L, Li A, Yin G, Nan X, Bian J (2019) Retrieval of Grassland Aboveground Biomass through Inversion of the PROSAIL Model with MODIS Imagery. Remote Sensing 11.
- 23. Dang ATN, Nandy S, Srinet R, Luong NV, Ghosh S, et al. (2019) Forest aboveground biomass estimation using machine learning regression algorithm in Yok Don National Park, Vietnam. Ecological Informatics 50: 24–32.
- 24. Viljanen N, Honkavaara E, Näsi R, Hakala T, Niemeläinen O, et al. (2018) A Novel Machine Learning Method for Estimating Biomass of Grass Swards Using a Photogrammetric Canopy Height Model, Images and Vegetation Indices Captured by a Drone. Agriculture 8.
- 25. Chen L, Ren C, Zhang B, Wang Z, Xi Y (2018) Estimation of Forest Above-Ground Biomass by Geographically Weighted Regression and Machine Learning with Sentinel Imagery. Forests 9.
- 26. Bui Q-T, Pham Van M, Hang NTT, Nguyen Q-H, Linh NX, et al. (2019) Hybrid model to optimize object-based land cover classification by meta-heuristic algorithm: an example for supporting urban management in Ha Noi, Viet Nam. International Journal of Digital Earth 12: 1118–1132.
- 27. Bui Q-T, Pham MV, Nguyen Q-H, Nguyen LX, Pham HM (2019) Whale Optimization Algorithm and Adaptive Neuro-Fuzzy Inference System: a hybrid method for feature selection and land pattern classification. International Journal of Remote Sensing 40: 5078–5093.
- 28. Bui Q-T, Nguyen Q-H, Pham VM, Pham MH, Tran AT (2019) Understanding spatial variations of malaria in Vietnam using remotely sensed data integrated into GIS and machine learning classifiers. Geocarto International 34: 1300–1314.
- 29. Richter R, Schläpfer D, Müller A (2006) An automatic atmospheric correction algorithm for visible/NIR imagery. International Journal of Remote Sensing 27: 2077–2085.
- 30. Castillo JAA, Apan AA, Maraseni TN, Salmo SG (2017) Estimation and mapping of above-ground biomass of mangrove forests and their replacement land uses in the Philippines using Sentinel imagery. ISPRS Journal of Photogrammetry and Remote Sensing 134: 70–85.
- 31. Argamosa RJL, Blanco AC, Baloloy AB, Candido CG, Dumalag JBLC, et al. (2018) MODELLING ABOVE GROUND BIOMASS OF MANGROVE FOREST USING SENTINEL-1 IMAGERY. ISPRS Ann Photogramm Remote Sens Spatial Inf Sci IV-3: 13–20.
- 32. Bannari A, Morin D, Bonn F, Huete AR (1995) A review of vegetation indices. Remote Sensing Reviews 13: 95–120.
- 33. Baret F, Guyot G (1991) Potentials and limits of vegetation indices for LAI and APAR assessment. Remote Sensing of Environment 35: 161–173.
- 34. Hancock DW, Dougherty CT (2007) Relationships between Blue- and Red-based Vegetation Indices and Leaf Area and Yield of Alfalfa. Crop Science 47: 2547.
- 35. Escadafal R, Belghith A, Ben Moussa H. Indices spectraux pour la dégradation des milieux naturels en Tunisie aride; 1994 1994. pp. 253–259.
- 36. Gitelson AA, Viña A, Arkebauer TJ, Rundquist DC, Keydan G, et al. (2003) Remote estimation of leaf area index and green leaf biomass in maize canopies: REMOTE ESTIMATION OF LEAF AREA INDEX. Geophysical Research Letters 30: n/a-n/a.
- 37. Gitelson AA, Keydan GP, Merzlyak MN (2006) Three-band model for noninvasive estimation of chlorophyll, carotenoids, and anthocyanin contents in higher plant leaves. Geophysical Research Letters 33: L11402.
- 38. Vincini M, Frazzi E, D’Alessio P (2008) A broad-band leaf chlorophyll vegetation index at the canopy scale. Precision Agriculture 9: 303–319.
- 39. Schlerf M, Atzberger C, Hill J (2005) Remote sensing of forest biophysical variables using HyMap imaging spectrometer data. Remote Sensing of Environment 95: 177–194.
- 40. Huete A, Didan K, Miura T, Rodriguez EP, Gao X, et al. (2002) Overview of the radiometric and biophysical performance of the MODIS vegetation indices. Remote Sensing of Environment 83: 195–213.
- 41. le Maire G, Francois C, Dufrene E (2004) Towards universal broad leaf chlorophyll indices using PROSPECT simulated database and hyperspectral reflectance measurements. Remote Sensing of Environment 89: 1–28.
- 42. Gobron N, Pinty B, Verstraete MM, Widlowski JL (2000) Advanced vegetation indices optimized for up-coming sensors: Design, performance, and applications. Geoscience and Remote Sensing, IEEE Transactions on 38: 2489–2505.
- 43. Buschmann C, Nagel E (1993) In vivo spectroscopy and internal optics of leaves as basis for remote sensing of vegetation. International Journal of Remote Sensing 14: 711–722.
- 44. Sripada RP, Farrer DC, Weisz R, Heiniger RW, White JG (2007) Aerial Color Infrared Photography to Optimize In-Season Nitrogen Fertilizer Recommendations in Winter Wheat. Agronomy Journal 99: 1424–1435.
- 45. Rouse JW Jr., Haas RH, Schell JA, Deering DW(1973) Monitoring the vernal advancement and retrogradation (green wave effect) of natural vegetation. Prog Rep RSC 1978–1: 93p.
- 46. Valenzuela JL, Romero L (1988) Biochemical indicators and iron index for the appraisal of the mineral status in leaves of cucumber and tomato. Journal of Plant Nutrition 11: 1177–1184.
- 47. Wang F-M, Huang J-F, Tang Y-L, Wang X-Z (2007) New Vegetation Index and Its Application in Estimating Leaf Area Index of Rice. Rice Science 14: 195–203.
- 48. Richardson AJ (1977) Distinguishing Vegetation from Soil Background Information. 12.
- 49. Broge NH, Leblanc E (2000) Comparing prediction power and stability of broadband and hyperspectral vegetation indices for estimation of green leaf area index and canopy chlorophyll density. Remote Sensing of Environment 76: 156–172.
- 50. Escadafal R, Huete A (1991) Improvement in remote sensing of low vegetation cover in arid regions by correcting vegetation indices for soil "noise'. Comptes Rendus—Academie des Sciences, Serie II 11: 1385–1391.
- 51. Huete AR (1988) A soil-adjusted vegetation index (SAVI). Remote Sensing of Environment 25: 295–309.
- 52. Blackburn GA (1998) Spectral indices for estimating photosynthetic pigment concentrations: A test using senescent tree leaves. International Journal of Remote Sensing 19: 657–675.
- 53. Richardson AJ, Everitt JH (1992) Using spectral vegetation indices to estimate rangeland productivity. Geocartogr Int 1: 63–69.
- 54. Gitelson AA (2004) Wide Dynamic Range Vegetation Index for Remote Quantification of Biophysical Characteristics of Vegetation. Journal of Plant Physiology 161: 165–173. pmid:15022830
- 55. Komiyama A, Ong JE, Poungparn S (2008) Allometry, biomass, and productivity of mangrove forests: A review. Aquatic Botany 89: 128–137.
- 56. Fourqurean JW, Beverly Johnson J., Kauffman B., Kennedy H., Lovelock C., and Saintilan N. (2014) Field Sampling of Vegetative Carbon Pools in Coastal Ecosystems. Arlington, USA: Conservation International, Intergovernmental Oceanographic Commission of UNESCO, International Union for Conservation of Nature.
- 57. Komiyama A, Poungparn S, Kato S (2005) Common allometric equations for estimating the tree weight of mangroves. Journal of Tropical Ecology 21: 471–477.
- 58. Jang JR (1993) ANFIS: adaptive-network-based fuzzy inference system. IEEE Transactions on Systems, Man, and Cybernetics 23: 665–685.
- 59. Tien Bui D, Bui Q-T, Nguyen Q-P, Pradhan B, Nampak H, et al. (2017) A hybrid artificial intelligence approach using GIS-based neural-fuzzy inference system and particle swarm optimization for forest fire susceptibility modeling at a tropical area. Agricultural and Forest Meteorology 233: 32–44.
- 60. Khademi F, Akbari M, Jamal SM, Nikoo M (2017) Multiple linear regression, artificial neural network, and fuzzy logic prediction of 28 days compressive strength of concrete. Frontiers of Structural and Civil Engineering 11: 90–99.
- 61. Zhao W, Wang L, Zhang Z (2019) Atom search optimization and its application to solve a hydrogeologic parameter estimation problem. Knowledge-Based Systems 163: 283–304.
- 62. Katrutsa A, Strijov V (2017) Comprehensive study of feature selection methods to solve multicollinearity problem according to evaluation criteria. Expert Systems with Applications 76: 1–11.
- 63.
Kira K, Rendell LA (1992) The feature selection problem: traditional methods and a new algorithm. Proceedings of the tenth national conference on Artificial intelligence. San Jose, California: AAAI Press. pp. 129–134.
- 64.
Ian H. Witten EF, Hall Mark A. (2011) Data Mining: Practical Machine Learning Tools and Techniques with Java Implementations (The Morgan Kaufmann Series in Data Management Systems). USA: Morgan Kaufmann.
- 65. Roslani MA, Mustapha MA, Lihan T, Juliana WAW (2013) Classification of mangroves vegetation species using texture analysis on Rapideye satellite imagery. AIP Conference Proceedings 1571: 480–486.
- 66. Kuenzer C, Bluemel A, Gebhardt S, Quoc TV, Dech S (2011) Remote Sensing of Mangrove Ecosystems: A Review. Remote Sensing 3.
- 67. Rondeaux G, Steven M, Baret F (1996) Optimization of soil-adjusted vegetation indices. Remote Sensing of Environment 55: 95–107.
- 68. Van TT, Wilson N, Thanh-Tung H, Quisthoudt K, Quang-Minh V, et al. (2015) Changes in mangrove vegetation area and character in a war and land use change affected region of Vietnam (Mui Ca Mau) over six decades. Acta Oecologica 63: 71–81.
- 69. Urbazaev M, Thiel C, Cremer F, Dubayah R, Migliavacca M, et al. (2018) Estimation of forest aboveground biomass and uncertainties by integration of field measurements, airborne LiDAR, and SAR and optical satellite data in Mexico. Carbon Balance and Management 13: 5. pmid:29468474
- 70. Domingo D, Lamelas-Gracia MT, Montealegre-Gracia AL, de la Riva-Fernández J (2017) Comparison of regression models to estimate biomass losses and CO2 emissions using low-density airborne laser scanning data in a burnt Aleppo pine forest. European Journal of Remote Sensing 50: 384–396.
- 71. Motlagh MG, Kafaky SB, Mataji A, Akhavan R (2018) Estimating and mapping forest biomass using regression models and Spot-6 images (case study: Hyrcanian forests of north of Iran). Environmental Monitoring and Assessment 190: 352. pmid:29785643
- 72. Zhao P, Lu D, Wang G, Wu C, Huang Y, et al. (2016) Examining Spectral Reflectance Saturation in Landsat Imagery and Corresponding Solutions to Improve Forest Aboveground Biomass Estimation. Remote Sensing 8.