Skip to main content
Advertisement
Browse Subject Areas
?

Click through the PLOS taxonomy to find articles in your field.

For more information about PLOS Subject Areas, click here.

  • Loading metrics

Predicting coastal erosion susceptibility in Bangladesh under climate scenario via machine learning techniques

  • Sakib Hosan,

    Roles Data curation, Formal analysis, Methodology, Validation, Visualization, Writing – original draft, Writing – review & editing

    Affiliation Department of Urban and Regional Planning, Khulna University of Engineering & Technology (KUET), Khulna, Bangladesh

  • Sondipon Dey Pranta,

    Roles Data curation, Formal analysis, Methodology, Validation, Visualization, Writing – original draft, Writing – review & editing

    Affiliation Department of Urban and Regional Planning, Khulna University of Engineering & Technology (KUET), Khulna, Bangladesh

  • Tahdia Tahmid,

    Roles Data curation, Writing – original draft

    Affiliation Department of Urban and Regional Planning, Khulna University of Engineering & Technology (KUET), Khulna, Bangladesh

  • Mafrid Haydar ,

    Roles Conceptualization, Data curation, Methodology, Supervision, Writing – original draft, Writing – review & editing

    mafrid@urp.kuet.ac.bd

    Affiliation Department of Urban and Regional Planning, Khulna University of Engineering & Technology (KUET), Khulna, Bangladesh

  • Al Hossain Rafi

    Roles Data curation, Methodology, Writing – original draft, Writing – review & editing

    Affiliation Department of Urban and Regional Planning, Khulna University of Engineering & Technology (KUET), Khulna, Bangladesh

Abstract

Using advanced machine learning methods along with geospatial data and climate estimates, this study found areas in Bangladesh that are likely to experience coastal erosion. Twenty important factors were looked at, such as meteorological, geographical, hydrological, tropological, and land-use variables. The normalized difference vegetation index (NDVI) was found to be the most important factor. An ensemble machine learning technique was used to figure out how susceptible coastal areas are to erosion. Several types of boosting techniques were used, including Extreme Gradient Boosting (XGBoost), Light Gradient Boosting Machine (LightGBM), Categorical Boosting (CatBoost), Gradient Boosted Decision Trees (GBDT), and AdaBoost. Random Forest, Decision Tree, Treebag, Bagging, and Averaged Neural Network (avNNet) were also used. The area under the curve (AUC) and receiver operating characteristic (ROC) values were used to check how well the model worked. XGBoost had the best AUC, at 0.95, which means it did a very good job of classifying places that are likely to be washed away by erosion. The study of geography showed that most of the models showed moderate-risk areas, which made up 71.82% to 79.36% of the whole area. Some of the districts that were identified as mostly high-risk were Bhola (19.41%), Cox’s Bazar (26.20%), and Patuakhali (21.47%). A lot of low-risk zones were found in places like Jashore and Narail, on the other hand. Predictions of how likely erosion will be in the future based on different warming models to make Representative Concentration Pathways (RCPs 2.6, 4.5, 6.0, and 8.5) for the years 2040, 2060, 2080, and 2100, we used data from the Coupled Model Intercomparison Project (CMIP5). In line with RCP 8.5, the number of high-risk places is expected to rise to 50% by 2080 and to 40% by 2100. RCP 6.0 had a smooth shift, and in high-risk areas, there were only small rises. RCP 2.6, 4.5, and 6.0 all went up a little. These results show that Bangladesh’s shore is becoming more likely to be worn away by erosion as temperatures rise. They stress how important it is to quickly react, handle coastal areas, and use planning methods that are resilient to climate change.

Introduction

As waves continuously alter the morphology of the shore over time, coastal erosion gradually erodes away the land. The high rate of coastal erosion significantly raises coastal susceptibility at the national, regional, and international levels in addition to other coastal disasters such as tidal surge, cyclones, and flooding [1,2]. For people who live in coastal, deltaic, and riverine systems worldwide, erosion poses serious environmental problems [36]. It is typically the result of both natural and anthropogenic forces, which may work alone or in concert [7,8]. Anthropogenic factors, including building ports, coastal barriers, groins, waterways, tourist hotels, gardens, and dredging inlets, as well as natural ones like the wind, surges, currents, particles, active tectonics, and sea level rise, all contribute to coastal erosion [9]. The shape of the coasts is permanently altered when the sediment is removed from the sediment sharing system [10]. Despite the fact that no part of the planet is immune to environmental dangers, certain places are geographically more susceptible to their impacts than others. For instance, the low-lying coastal areas are extremely susceptible due to the effects of climate change [11]. Although the effects of coastal erosion are limited to coastal regions, these regions are home to more than 40% of the global population and a diverse spectrum of coastal ecosystems that offer a multitude of services [12]. The coastal areas of Bangladesh are no exception to this reality, as they cover 32% of the terrestrial area of Bangladesh and are home to 26% of the total population of the country [13]. Bangladesh loses 34 square kilometers of land per year, and this loss could get worse as sea levels rise [14]. Coastal regions, especially mangrove forests, are prime candidates for protection [15]. Along the coast, loss of land is common, displacing people, disrupting livelihoods, and imperiling valuable ecosystems like the Sundarbans, the world’s largest mangrove forest. Coastal resilience is one of the priority sustainable development issues of Bangladesh because of intensifying cyclones, increased salinity intrusion, and unpredictable weather patterns caused by climate change.

Various techniques have been used by the researchers, which include expert-based methods and real-time data analysis. As alternative methods for the prediction of beach morpho dynamics, different machine learning methods have been developed and tested over the last several years. In some studies, researcher have trained and validated models such as the Functional Trees (FT), Multi-Layer Perceptron (MLP), Naïve Bayes (NB), Support Vector Machines (SVM), Logistic Regression (LR) and Deep Learning (DL) with the physical and environmental parameters such as wave height and direction, horizontal flow magnitude, geology, and slope, along with coastal erosion inventories [1618]. The Random Forest (RF), Gradient Boosted Regression Tree (GBRT), and Tree Ensemble (TE) machine learning models were used by [1923] to identify the zones of maximum gully erosion (GES). Some other studies [2428] focused on three deep learning techniques: Convolutional Neural Network (CNN), Recurrent Neural Network (RNN), and Long-Short Term Memory (LSTM) for spatial forecasting. In research by Naceur et al. [29], outlined the N’fis River Basin in Morocco’s gully erosion using RF, AdaBoost, and GBDT models. With an AUC of 0.932, RF was the top-performing model, followed by GBDT (0.893) and AdaBoost (0.902). Eloudi et al. [20] likened six models in mapping the susceptibility of gully erosion: AdaBoost, XGBoost, GBM, TreeBag, RF, and C5.0. With an AUC of 90.80%, the C5.0 model performed optimally, followed closely by RF (90.10%), XGBoost, and AdaBoost (both 90%).

A number of studies have used traditional geospatial and statistical techniques to investigate erosion patterns and suggest conservation measures in Bangladesh. The Revised Universal Soil Loss Equation (RUSLE) with GIS has been used in a number of studies to predict soil erosion and detect erosion hotspots. Using the Global Soil Erosion Modeling Platform (GloSEM) Rahman et al. and Islam et al. [30,31] measured erosion rates and validated their findings. Despite yielding valuable information, these studies employed deterministic models assuming rigid links between erosion rates and input variables. This works against adaptive erosion management and reduces their potential to forecast future erosion scenarios in the face of changing climatic conditions. Socio-economic erosion research pays greater attention to adaptation and resilience compared to predictive modeling. Hasan et al. [32] used K-Nearest Neighbors (KNN), XGBoost, and Random Forest (RF) were utilized to evaluate flood risk in the coastal areas of Bangladesh. RF was the most accurate with an accuracy of 86.7%, followed closely by XGBoost (86.3%). Barguna, Bhola, and Patuakhali districts were found to be particularly susceptible by the research. While Mamun et al. and Islam et. al [31,33] took into account coping mechanisms and communal susceptibilities, they didn’t involve high-resolution spatial modeling for future hazard forecasting. However, there are a number of issues with existing approaches because deterministic models like RUSLE and GloSEM are based on static assumptions; they are unable to capture dynamic drivers like changing land use, extreme weather, and shifting climatic trends. Moreover, these models do not incorporate real-time data, thus limiting their use in adaptive erosion management. For better predictive accuracy, future research should emphasize the creation of hybrid models that combine machine learning and geographic methods. Furthermore, an integrated approach that combines socioeconomic risk analysis with physical erosion modeling in support of sustainable land management would provide holistic insight.

Thus, the main goal of this research is to utilize an exhaustive set of machine learning (ML) models to evaluate the geographical aspects of coastal erosion in Bangladesh’s dynamic coastline by combining geological, climatic, hydrological, topographical, and human-induced parameters. This study aims to: (i) identify and rank the most important drivers controlling the susceptibility of coastal erosion; (ii) compare the predictive capabilities of different individual and ensemble-based machine learning models for forecasting erosion-susceptible zones; (iii) generate high-resolution spatial susceptibility maps for the current scenario. The results aim to inform planners, environmental managers, and policymakers to implement well-informed, spatially explicit plans for the sustainable management of coastal zones in the face of present and future climate change.

Materials and methods

Overview of the study location

The research area (in Fig 1) is situated in the coastal region of Bangladesh, in the southern part of the country, lying between longitudes 80° 0’ E to 91° 0’ E and latitudes 21° 30’ N to 22° 30’ N [34]. The coastal region of Bangladesh covers 47,200 km², including land, islands, and several water bodies, constituting around 32% of the nation’s total area [35]. It is subdivided between the interior coast (23,265 km²) and the exposed coast (23,935 km²), with the latter being directly influenced by tidal surges and cyclonic activity from the Bay of Bengal [36]. The coastal zone of the nation is divided into three zones: western, central, and eastern, based on geomorphological characteristics, spanning approximately 27,150 km², 12,040 km², and 8,010 km² of coastal land area, respectively [37]. The eastern coast of Bangladesh is characterized by lofty, hilly land with urbanization and intricate flood dynamics; the middle portion is low-lying and extremely vulnerable to tidal and river floods; whereas the western region is covered with mangrove forests and dynamic delta landscape that is prone to erosion and saltwater inundation [35,37,38]. The study aims at the coastal belt of Bangladesh, where erosion threatens human habitation and natural ecosystems significantly [30]. The region’s susceptibility to tidal surges, storm effects, and sea level rise at high levels renders it a significant place for assessing erosion risk [39]. This study aims to enhance the predictive accuracy and establish a sustainable coastal erosion mitigation framework in Bangladesh through the application of advanced machine learning algorithms and GIS-based susceptibility modeling. The research employs high-resolution geographical data and climate change scenarios to more accurately identify locations susceptible to erosion. This integrative approach safeguards Susceptibility ecosystems and societies on the coast of Bangladesh by creating a scientific basis to formulate sustainable coastal management policies and long-term adaptation strategies, while facilitating evidence-based decision-making.

thumbnail
Fig 1. Study area map of the coastal region of Bangladesh.

https://doi.org/10.1371/journal.pone.0334347.g001

Description of data

The model parameters were chosen and identified following a comprehensive review of the literature. Twenty factors mentioned in Table 1 are employed for reducing coastal erosion in Bangladesh. Digital Elevation Models (DEMs) help pinpoint low-lying coastal regions susceptible to sea-level rise, tidal inundation, and wave erosion, where low-lying places are more Susceptibility to erosion because of their closeness to water and coastal dynamics, whereas elevated regions are safeguarded [24,4043]. The Shuttle Radar Topography Mission (SRTM) Digital Elevation Model (DEM) data are used for spatial analysis. The gradient of the terrain or slope influences hydrological and geomorphological processes [44]. The coastal slope influences surface runoff, sediment transport, and stability, and on the Senegal coast, gentle slopes are more susceptible to erosion due to inadequate water drainage and surface water accumulation, whereas steeper slopes may mitigate wave impact but heighten the risk of landslides [24,42,43,45]. The slope layer was generated from the DEM. Vertical Distance from Channel Network (VDCN) quantifies elevation of a location with adjacent drainage channels [46]. It indicates the accumulation of surface water and fluvial activity, where VDCN values that are lower show a higher risk of erosion from overland flow and waterlogging, and those that are increasing indicate a reduced risk [43]. DEM made it possible to derive VDCN. The Topographic Wetness Index (TWI) evaluates local slope and upstream contributing area in order to measure soil texture distribution [47]. High TWI values indicate water accumulation and saturation, which can compromise soil stability and enhance susceptibility to erosion [43]. Conversely, low TWI values depict dry, more stable conditions. The DEM was utilized to derive the TWI via hydrological modelling. The Topographic Position Index (TPI) quantifies the height of a point above or below the average height of its surroundings [48]. This index can be utilized to classify hills, valleys, and plains. Positive values of TPI express high landforms with lower susceptibility to erosion, whereas negative values of TPI signify depressions and valleys with high water retention capacity [43,49]. The TPI layer was developed based on the 30-meter resolution DEM data. The Topographic Ruggedness Index (TRI) quantifies height variation in a region to evaluate terrain heterogeneity [50]. Cliffs and dunes have elevated TRI values, which can either exacerbate or mitigate erosion based on substrate hardness where Rugged terrain with pliable substrates may erode more rapidly, whereas rocky regions may exhibit greater resistance [43,51]. TRI was derived from DEM. The flow of water over landforms is influenced by the curvature of the ground [52]. Concave landforms retain water and silt, heightening erosion risk, whereas convex parts facilitate drainage and diminish moisture retention [24,43,53]. The curvature layer was generated using GIS Curvature with DEM data. Precipitation and potential evapotranspiration dictate climate aridity in the Aridity Index [54]. An increased aridity index indicates a drier environment with diminished vegetation, rendering land more Susceptibility to wind and water erosion, whereas low-aridity regions possess abundant flora, which stabilizes the soil [43,55]. This index was developed using WorldClim datasets. Microclimatic elements such as wind patterns and sunlight exposure are influenced by slope aspect [56]. Coastal windward slopes are more susceptible to wave and wind forces, heightening the risk of erosion, whereas leeward slopes are protected [43]. DEM data yielded aspect values. The Normalized Difference Vegetation Index (NDVI) is a widely utilized remote sensing statistic for assessing vegetation density and health [57,58]. Elevated NDVI values indicate abundant vegetation, enhancing soil structure and mitigating erosion, and low NDVI signifies diminished vegetation or bare soil, which is more susceptible to erosion [24,43]. NDVI data were obtained from Sentinel-2 imagery at a resolution of 10 meters. Precipitation influences the hydrological cycle and erosion, where heavy rainfall enhances surface runoff and soil saturation, accelerating erosion [43,59]. CHIPS precipitation data were utilized in the study. The mean wind speed significantly influences coastal erosion by producing wave energy and transferring aeolian silt [60]. Wind velocity amplifies wave dynamics, resulting in coastal erosion and the inland transport of materials so calm circumstances diminish coastal erosion [43,61]. Wind speed data was used from ERA5 reanalysis datasets in order to examine these effects. Additionally, geomorphology examines landform classifications and their geological characteristics, which influence coastal erosion [62]. Rocky coastal formations such as cliffs and headlands exhibit superior erosion resistance compared to sandy beaches [42,63]. The geomorphological categories utilized in this study were gained from the Geological Survey of Bangladesh. Soil texture effect coastal erosion where Clay soils exhibit greater compaction and erosion resistance compared to sandy soils, which are readily moved by wind and water [42,64]. Soil texture data was obtained from the FAO Global Soil Map. The distance to the coastline quantifies the proximity of a location to the shore that influence coastal erosion [42]. Coastal regions are especially susceptible to wave dynamics, tidal influences, and saline intrusion, which intensify erosion [39]. Distance was ascertained via shape file data. The spatial distribution of anthropogenic and natural surface elements that influence coastal erosion, such as urban areas, forests, and agricultural lands, is depicted by Land Use and Land Cover (LULC) statistics [15,24,42]. The research employs Esri’s Sentinel-2 Global Land Cover 2023 datasets at a 10-meter resolution, a globally recognized dataset derived from high-resolution satellite imagery for land use and land cover analysis [65,66]. The distance to a river indicates a location’s closeness to fluvial systems that provide water and sediment that affect coastal erosion, where it occurs more frequently in proximity to rivers than at greater distances [67]. The river distance layer employs humanitarian vector datasets. Fault lines can influence ground stability and long-term erosion, where active tectonic faults may experience subsidence or uplift, hence augmenting coastal instability and sediment redistribution [16]. Stable locations are typically situated distant from fault lines. Increasing mean sea level, a long-term average of sea surface elevation, induces coastal erosion [39]. Elevated sea levels exacerbate coastal erosion, saline intrusion, and storm surges. Stable or declining sea levels mitigate these effects. The Permanent Service for Mean Sea Level utilized MSL data. The erosion resistance of an area is contingent upon its geological materials. Basalt and granite exhibit durability, but shale and sedimentary formations are more susceptible to erosion under hydrodynamic forces [68]. From the USGS Earth Explorer, utilized geological data.

thumbnail
Table 1. Data sources of parameters for mitigating coastal.

https://doi.org/10.1371/journal.pone.0334347.t001

Erosion inventory mapping

Fig 2 illustrates the coastal erosion map, demonstrating the spatial distribution of soil erosion intensity throughout the research area. This map was produced utilizing the Revised Universal Soil Loss Equation (RUSLE) model, which quantifies yearly soil loss in tons per hectare per year. The Revised Universal Soil Loss Equation (RUSLE) was developed by Renard [69] and is widely used and popular all over the world for average annual soil loss estimation. [70]. In this research, the model was used to analyze the potential soil erosion in the coastal zone of Bangladesh. As [71], said, the RUSLE model is represented by the equation A = R × K × LS × C × P, where A is the potential yearly soil loss in tons per hectare per year, R is the rainfall-runoff erosivity factor in megajoules per hectare per hour per year, K is the soil erodibility factor in tons per meter per hour, LS is the slope length and steepness factor, C is the cover and management factor, and P is the conservation support practice factor. CHIRPS (Climate Hazards Group InfraRed Precipitation with Station Data) daily precipitation records were utilized in the computation of the rainfall erosivity factor, R [72]. The K factor for soil erodibility was computed from data on several soil types where categorical reclassification was performed based on soil texture classes, and empirically estimated K values were allocated to each class [73]. The Topographic Factor (LS) is the compound effect of slope steepness and length, whereby the slope calculation was performed based on the Shuttle Radar Topography Mission (SRTM) data and the digital elevation model (DEM). Once the slope had been converted from degrees to percentages, the LS was then calculated once the conversion had been carried out. The Cover Management Factor (C) was derived by calculating the Normalized Difference Vegetation Index (NDVI) from the Landsat-8 photo surface reflectance bands [74]. A function of NDVI was applied to transform vegetation density to a C factor. This correction normalizes the values of C to between 0 and 1 and implies that areas with denser plant cover are less susceptible to erosion. Slope data and MODIS land cover type (LC_Type1) were utilized in developing the Support Practice Factor (P) [75]. Diverse P values were assigned to the distinct land cover types and slope categories, grounded in the methodologies defined in the literature. The application of effective soil conservation techniques resulted in reduced P values on gentle slopes of agricultural land. The final soil loss map was generated by multiplying the five RUSLE components (R, K, LS, C, and P) on a pixel-by-pixel basis. This yielded estimated yearly soil loss figures articulated in tons per acre. Following the estimation of annual soil loss in tons per hectare per year (t ha ⁻ ¹ year ⁻ ¹) over the study area, the results were visualized and categorized into two classes of erosion severity, employing a threshold-based classification method to establish the coastal erosion inventory. Based on [7679], all pixels exhibiting soil loss rates exceeding 30 t ha ⁻ ¹ year ⁻ ¹ were classified as erosion-prone locations and assigned a rating of 1 whereas, all other pixels with values equal to or below this threshold were classified as non-erosion zones and awarded a value of 0. This inventory not only offers reliable erosion occurrence data grounded in quantitative soil loss estimates but also acts as a fundamental input for further susceptibility modeling and spatial analysis.

Selection of explanatory variables

Multicollinearity, a prevalent problem in multiple regression, occurs when independent variables exhibit significant correlation, which may inflate the standard errors of coefficients and diminish statistical reliability [80]. Pearson correlation matrices provide independent variable correlation coefficients to discover multicollinearity. Multicollinearity is possible if two variables have a high correlation near 1 or −1 [81]. However, Variance Inflation Factor (VIF) and Pearson’s correlation coefficient (p-value) are employed to measure multicollinearity more precisely [80]. A VIF below 10 shows a significant correlation that may pose issues, while a VIF over 10 signifies severe multicollinearity, potentially distorting regression estimates [82].

Analytic methods, including boosting and bagging technique

This study employes five boosting and five bagging algorithms to predict coastal erosion in Bangladesh (Fig 3). XGBoost is one of the strong and robust methods that is utilized in the machine learning field is referred to as [8385]. For performing user-defined predictive tasks such as regression, classification, ranking, and supervised learning, XGBoost is a flexible and efficient ensemble learning framework that makes use of gradient boosting and tree topologies [86,87]. The dataset was normalized and divided into training and testing subsets using stratified sampling. Sample weights were utilized throughout training to address class imbalance. XGBoost classifier was trained with 100 trees (n_estimators = 100), a maximum depth of 5, and a learning rate of 0.03, utilizing complete subsampling and feature utilization (subsample = 1, colsample_bytree = 1). Robust L1 and L2 regularization (reg_alpha = 8, reg_lambda = 8) was employed to mitigate overfitting. In the current study, the XGBoost algorithm was used, and seventy percent of the database was allocated to training the model while thirty percent was allocated to testing the model.

AdaBoost employs adaptive boosting for classification trees [88]. Adaboost improves the classification precision of machine learning based on classification, and the non-parametric method efficiently identifies outliers without the need to identify bad learners [88,89]. AdaBoost training starts with the creation of a base decision tree through the equal weighting of the dataset [89,90]. AdaBoost is susceptible to noisy or corrupted data [91]. AdaBoost classifier was executed utilizing a decision stump (a decision tree with a maximum depth of one) as the foundational learner. The model was trained using 100 estimators and a learning rate of 0.03. The issue of class imbalance in the training data was mitigated by utilizing sample weights derived from balanced class weighting. The authors employed the AdaBoost model with a 70% and 30% training to test split.

Gradient Boosting Decision Trees, simply known as GBDT, is a sophisticated combination algorithm that iteratively enhances regression decision trees to reduce classification errors at each step along the process [92,93]. Numerous weak classifiers are constructed across different iterations, and learning rates are smaller, so more iterations need to be conducted to secure the same fit [94]. It is the correct tree algorithm on which the GBDT depends that scales up the loss function with the negative gradient as a form of residuals, and whose result results in the gradual diminution of residual values [95,96]. Because of its use of many weak decision trees, GBDT offers stability and accurate projections. Gradient Boosting classifier was trained with 100 estimators, a maximum tree depth of 5, a learning rate of 0.03, and subsample and feature sampling rates of 0.8 to enhance generalization. It is thus the best solution. 70% of the time is spent in training and 30% in testing here.

LightGBM is an open-source gradient boosting library founded by Microsoft that utilizes the histogram-based method to optimize training efficiency and reduce computational complexity [97,98]. In addition, it has enhanced network communication to enable additional parallel learning, for example, the parallel decision tree voting technique and grows trees by managing variance gain and sorting training examples in decreasing order based on absolute gradient values [99,100]. The LightGBM classifier was trained using 100 estimators, a maximum tree depth of 5, a learning rate of 0.03, and complete subsampling of both data and features (subsample = 1, colsample_bytree = 1). Robust regularization was implemented through L1 (reg_alpha = 8) and L2 (reg_lambda = 8) penalties, while class imbalance was mitigated by employing balanced class weights. Feature significance was derived and standardized to ascertain the most significant predictors. Thirty percent of the data was used for testing, and seventy percent was used for training for this study.

CatBoost is a very advanced machine learning method that has been effective in capturing a variety of factors influencing water potential as well as properly handling noisy and heterogeneous information [101]. As a Gradient Boosting Decision Tree-based technique, it is particularly well-suited to model complex relationships and heterogeneous feature sets [101,102]. Sample weights were calculated to address class imbalance, with class weights explicitly designated according to the imbalance ratio. A CatBoost classifier was trained with 100 iterations, a maximum depth of 5, a learning rate of 0.03, and an L2 regularization value of 10. Feature subsampling was established at 70% (rsm = 0.7) to improve generalization. According to this study, a statistically target-oriented approach is employed wherein seventy percent of the training data is randomly chosen, and thirty percent is reserved for testing.

Random Forest (RF) is a statistical ensemble machine learning method employed most of the time to forecast the target variable [24,103,104]. The method employs bootstrapping techniques along with stochastic binary trees for the purpose of using some portion of the data that provides a better representation. The model is constructed by the random selection of training from the initial set of observations [105]. The bagging classifier was trained with 100 decision trees, each with a maximum depth of 5, and minimum sample split and leaf sizes adjusted to mitigate overfitting. Probabilities were computed for predictions, and feature importance was averaged and normalized across all base estimators to find significant contributors. Within the experiment, the information is divided into two sets: thirty percent is held out for testing, and seventy percent for training.

The bagging strategy for an ensemble entails repeatedly training the same algorithm on numerous distinct subsets of the training set [105,106]. The average of all the projections from the sub-models forms the basis of the final forecast. This process minimizes heterogeneity in classification, which ultimately increases overall accuracy [107]. Bagging is an approach that can be used to improve accuracy in situations where big fluctuations in the training set significantly affect the subsequent prediction, where the outcome of a test sample is determined through the vote of the ensemble that received the highest vote [108,109]. A Bagging Regressor consisting of 100 decision trees, each with a maximum depth of 5, was trained, utilizing 1% of the training data per tree to mitigate overfitting while employing all features. Despite being intended for regression, the model’s continuous predictions were utilized as probabilities for binary classification. The significance of features was averaged throughout the ensemble and normalized to emphasize principal predictors. Two data sets were established for the study, thirty percent of which were reserved for testing and seventy percent reserved for training.

In order to execute the decision tree (DT) method, the data are first split into various regions according to the independent variables, and rules for each region are subsequently developed [110,111]. The primary application of decision tree models is to obtain the development of a model for predicting results on the basis of input characteristics. Decision trees are significantly effective in classification, regression, and prediction tasks [112,113]. This code trains a Decision Tree classifier on the specified dataset, utilizing features associated with environmental and geographic indices to predict a binary target variable. The data is divided into training and test sets based on the DT strategy, with thirty percent of the data left for testing and seventy percent left for training.

Treebag is an ensemble learning method that takes advantage of the benefits of decision trees and bagging in order to improve prediction accuracy that is achieved when multiple decision trees are made by constructing bootstrapped subsets of the training data, each of which was trained separately [114,115]. Through the mean of error averaging over many trees, Treebag improves the stability and robustness of decision tree implementations and this technology has been applied with high frequency to vast topics, including issues of classification and regression, and has achieved better performance than single decision tree models [116119]. With a thirty percent test split and seventy percent train split, A Treebag classifier with limited complexity (maximum depth of 5, minimum samples for split and leaf) was employed to mitigate overfitting. The Bagging Classifier was subsequently trained using 100 estimators, each utilizing the complete sample and feature set, while employing parallel processing for enhanced efficiency. Treebag is utilized in the present research.

Averaged Neural Network, or avNNet, is an ensemble learning method utilized for increasing predictive accuracy by averaging the output of numerous neural networks [120]. It is a method that does not rely on one trained network but one that utilizes predictions of multiple models trained independently and then averages the models’ output [121]. When overfitting problems must be resolved and model robustness has to be enhanced for hard learning tasks, such a method is quite helpful. Through the utilization of various neural networks trained on various subsets of the data or initially charged with various weights, avNNet has been found to enhance classification and regression task stability, as demonstrated by research [122,123]. This renders the process of averaging to be a stable method to employ in machine learning settings since it reduces the impact of individual network biases and improves the accuracy of the entire output [124]. The avNNet classifier was executed with a multilayer perceptron architecture featuring two hidden layers containing 100 and 50 neurons, respectively. The avNNet technique is employed within this study by splitting 70% of the dataset for training purposes and 30% for testing purposes, according to the dataset.

For all models, to rectify class imbalance, we utilized the class_weight = ‘balanced’ argument, which modifies weights according to class frequencies. Moreover, stratified sampling was employed in the train-test division to maintain class distribution.

Model validation

Better performance in machine learning models is associated with greater AUC values [125]. The study graphs ROC curves for all of the models through software in the programming language Python. The ROC curve is a plot of a classifier’s performance in distinguishing between numerous classes [126,127]. The Area Under the Curve (AUC) is a scalar measure of the model’s performance that originated from the Receiver Operating Characteristic (ROC) curve. The data was divided in a training-to-testing proportion of 70% and 30%, so that there was an equilibrium between model training and validation. The AUC score and ROC curve were utilized as primary measures of performance, with significantly larger AUC scores indicating superior classification capacity (Table 2). The best classifier will have a ROC curve that is clumped in the upper-left quadrant, which indicates that it has a high level of both sensitivity and specificity [86,126,127]. According to the set standards, values of area under the curve (AUC) range between 0 and 1, and high values indicate improved model discrimination capacities by a greater margin.

thumbnail
Table 2. Classification of Model Performance According to AUC (Area Under the Curve). Values [86,126,127].

https://doi.org/10.1371/journal.pone.0334347.t002

A confusion matrix evaluates the effectiveness of a classification model by displaying true positives (nodes accurately identified as positive), true negatives (nodes accurately identified as negative), false positives (nodes inaccurately identified as positive), and false negatives (nodes inaccurately identified as negative), thereby providing a thorough assessment of the model’s performance across different classes [128,129]. This matrix enables the assessment of model correctness, the detection of misclassifications, and the calculation of critical metrics such as the F1-score, which improves model performance. Accuracy represents the proportion of correct forecasts to total predictions, with increased accuracy indicating greater overall correctness; Precision measures the ratio of true positives to total predicted positives, with high precision signifying fewer false positives; recall evaluates the ratio of true positives to total actual positives, where high recall reduces false negatives; the F1-score integrates precision and recall as their harmonic mean, making it beneficial for assessing models, especially in imbalanced datasets [129].

Future climate change models

Forecasts of impending climate change are required for the description of mechanisms of coastal erosion, for making estimates of potential danger to coastal residents, and guiding the development of effective adaptation measures and mitigations. The fifth Coupled Model Intercomparison Project (CMIP5) phase will provide a cutting-edge multimodel dataset that will advance our knowledge of climate variability and climate change [130]. The framework provides an excellent platform for climate variability assessment and its effects on water resources [131,132]. Representative Concentration Paths (RCPs) establish several greenhouse gas concentration pathways, such as a set of four new pathways created for the use of the climate modelling community as a basis for long-term and short-term modelling studies [133]. These RCPs serve as the basis for CMIP5 based projections wherein RCP 2.5, RCP 4.5, RCP 6.0, and RCP 8.5 represent the four RCP scenarios applied in this study. The purpose of this study is to evaluate future climate conditions and their implications that may occur [134]. Particularly, the research assesses the sea level average pressure under various scenarios using projection data for 2040, 2060, 2080, and 2100. Sea level rise causes the acceleration of the coastal erosion process by intensifying the frequency of wave impact on the coastlines, especially during storms and periods of high tides [135]. This results in the shoreline erosion process happening frequently and more intensely [136]. In this study, climate projection data were retrieved from the website of the NCAR GIS Initiative Climate Change Scenario (https://gisclimatechange.ucar.edu), where projections of climate are provided for public access.

Future coastal erosion projection

A global climate model simulated projected coastal erosion risk zone maps for 2040, 2060, 2080, and 2100 using multiple climate change scenarios defined based on Representative Concentration Pathways (RCPs). A raster calculator was used to compute a spatial correlation between simulated coastal erosion effects and projected coastal erosion using outputs of a boosting model, i.e., XGBoost. The risk maps produced were then systematically put into various categories from low to highly vulnerable to coastal erosion. These projections provide useful insight into the expected increase in coastal vulnerability in Bangladesh and are a critical input to policymakers for formulating long-term adaptive shore protection and general disaster risk reduction strategies.

Results and discussion

Description of topographical factors

The topographical factors in Fig 4 exhibit clear spatial changes throughout the region. Aspect(a) and Curvature(b) exhibit a mixed pattern with no distinct predominance in any specific area. DEM(c), Slope(e), and TPI(f) demonstrate elevated values in the northern and eastern highland regions, whereas lower values are found in the southern and western lowlands, with intermediate values distributed throughout the center sections. DtC(d) has lowered values predominantly in the southern regions, progressively escalating toward the north. TRI(g) exhibits higher levels in the central and northeastern regions, while lower values are observed in the southwestern and western areas.

thumbnail
Fig 4. Topographical factors (a) Aspect, (b) Curvature, (c) DEM, (d) DtC, (e) Slope, (f) TPI, (g) TRI.

https://doi.org/10.1371/journal.pone.0334347.g004

Description of hydrological factors

The hydrological parameters depicted in Fig 5 exhibit diverse spatial distributions throughout the region. DtR(a) and VDCN(d) have elevated values in the northern and western upland regions, while demonstrating lower values in the middle and southeastern regions adjacent to rivers. Mean Sea Level (MSL)(b) reflects higher values in the northwestern and southwestern regions, whereas lower values are observed in the southern and coastal zones, with intermediate levels in the intervening areas. TWI(c) exhibits a comparable pattern, displaying low values in the eastern regions and higher values in the center and low-lying areas in the western regions.

thumbnail
Fig 5. Hydrological factors (a) DtR, (b) MSL, (c) TWI, (d) VDCN.

https://doi.org/10.1371/journal.pone.0334347.g005

Description of meteorological factors

According to Fig 6 the meteorological factors demonstrate varied regional distributions throughout the region. The Aridity Index(a) and Precipitation(c) exhibit higher levels mostly in the southeastern and northeastern regions, whilst lower values are observed in the southwestern and northwestern sectors. The Mean Wind Speed(b) has a contrasting pattern, with higher values in the southwestern areas and lower values in the eastern parts. Moderate values for all three parameters are typically concentrated in the central areas of the region.

thumbnail
Fig 6. Meteorological factors (a) Aridity Index, (b) Mean Wind Speed, (c) Precipitation.

https://doi.org/10.1371/journal.pone.0334347.g006

Description of geological factors

Fig 7 presents detailed scenarios of geological parameters, including DtF, Geology, Geomorphology, and Soil Texture. Low DtF(a) values are predominantly noted in the east, progressively rising towards the west. The geological map(b) indicates that alluvial predominates in the central and western regions. In terms of Geomorphology(c), prominent high and medium hills, together with ridges, are in the eastern and southeastern regions, while the plains and basins are mostly found in the middle and northern sections. In terms of soil texture(d), silt, loam, and clay soils are prevalent in the center region, whilst brown/gray soils are predominant in the eastern section.

thumbnail
Fig 7. Geological factors (a) DtF, (b) Geology, (c) Geomorphology, (d) Soil Texture.

https://doi.org/10.1371/journal.pone.0334347.g007

Description of land use & human activity factors

In Fig 8, according to LULC(a) map built-up areas are mostly found in the eastern and northwestern regions, whereas vegetation predominates in the southwestern regions. High NDVI(b) values are observed in the eastern and northeastern regions, characterized by extensive vegetation. Moderate NDVI values are allocated between vegetation and urban areas.

thumbnail
Fig 8. Land use & Human activity factors (a) LULC, (b) NDVI.

https://doi.org/10.1371/journal.pone.0334347.g008

Independent variable selection

Table 3 displays the results of the multicollinearity evaluation and statistical significance testing employed to identify the most impactful factors for the model. Parameters having p-values exceeding 0.05, including average curvature, DtR, LULC, TPI, and TWI, were excluded and denoted with a double star (**). MSL and Precipitation parameters were excluded because of high VIF values, denoted with triple star (***) (>10). The correlation matrix for every variable is shown in Fig 9. The pair MSL and DtF in this matrix exhibited a strong positive correlation (r = 0.89), surpassing the 0.80 threshold for multicollinearity. Consequently, MSL was eliminated from that pair because of its high correlation and excessively high VIF value. Notably, the Aridity Index exhibits a positive correlation with precipitation (r = 0.77), whereas precipitation shows a strong negative correlation with both DtF (r = −0.95) and MSL (r = −0.87). There were also moderate correlations found between DEM and Geology (r = 0.52) and between Slope and DEM (r = 0.77).

thumbnail
Table 3. Multicollinearity and Significance Testing for Model Variable Selection.

https://doi.org/10.1371/journal.pone.0334347.t003

Fig 10 indicates that NDVI is the predominant factor in AdaBoost, CatBoost, GBDT, LightGBM, XGBoost, Bagging, DT, RF and TreeBag, with scores of 19, 22, 23, 16, 26, 22, 20, 28 and 24, respectively. Nonetheless, with a value of 20 the Aridity Index is the predominant factor in the avNNet model. The scores of 14, 14, 12, 16, 18 and 14 for the Aridity Index have a significant impact on AdaBoost, GBDT, LightGBM, Bagging, RF, and TreeBag. With scores of 14, 15 and 15 in CatBoost, XGBoost and Decision Trees, respectively, DtC also has a significant influence. Significantly, excluding avNNet, Geology and Soil Texture demonstrate the lowest significance across all models.

thumbnail
Fig 10. Assessment of the importance of parameters that influence coastal erosion.

https://doi.org/10.1371/journal.pone.0334347.g010

Coastal erosion susceptibility zones distribution based on machine learning models

Table 4 and Fig 11 illustrate the coastal erosion vulnerability zones categorized into low, moderate, and high impact levels. Table 4 demonstrates that there are comparable patterns among AdaBoost, GBDT, avNNet, and LightGBM, with a high percentage of moderate-impact areas (77.81%–79.36%), low-impact areas (around 10%–12.6%), and high-impact areas (9.58%–12.03%). CatBoost and XGBoost likewise exhibit similar outcomes, with high impact coming in slightly lower at 8.12%–8.81%, low impact areas at about 18%, and moderate impact predominating at 72.68% and 73.58%, respectively. Between 71.82% and 74.36%, the Bagging and DT models have moderate impact; Bagging has the largest low-impact area (19.03%), while DT has a noteworthy high-impact proportion (13.23%). With a moderate impact of almost 76%, RF and TreeBag exhibit comparable patterns; however, RF’s high-impact area is smaller (7.9%) than TreeBag’s (9.04%).

thumbnail
Table 4. Coastal Erosion Susceptibility Zones Distribution Based on ML Models.

https://doi.org/10.1371/journal.pone.0334347.t004

thumbnail
Fig 11. Maps of coastal erosion generated through machine learning methods (a) AdaBoost, (b) CatBoost, (c) GBDT, (d) LightGBM, (e) XGBoost, (f) ANN, (g) Bagging, (h)DT, (i)RF, (j) TreeBag.

https://doi.org/10.1371/journal.pone.0334347.g011

Validation of models by ROC and AUC

Fig 12 depicts the results of model validation through ROC curves, confusion matrices, F1-scores, precision, and recall, highlighting significant performance disparities. XGBoost proved to be the most effective model, attaining a maximum AUC of 0.95 and an F1-score of 0.81, with precision and recall metrics surpassing 0.68. Close competitors included Bagging and CatBoost, both attaining robust results with AUCs about 0.94 and F1-scores of 0.79. Models including GBDT, Random Forest, and LightGBM demonstrated comparable efficacy, attaining AUCs of 0.93 and F1-scores of 0.78 and 0.80. Both ANN and Treebag demonstrated somewhat suboptimal performance, with AUCs of 0.92 and F1-scores between 0.73 and 0.78. AdaBoost demonstrated marginally inferior performance, achieving an AUC of 0.91 and an F1-score of 0.72. The Decision Tree model had the lowest evaluation, attaining an AUC of 0.78 and an F1-score slightly under 0.70.

thumbnail
Fig 12. Model validation with Receiver Operating Characteristic (ROC) curve and Confusion matrix for (a) AdaBoost, (b) CatBoost, (c) GBDT, (d) LightGBM, (e) XGBoost, (f) ANN, (g) Bagging, (h)DT, (i)RF, (j) Treebag.

https://doi.org/10.1371/journal.pone.0334347.g012

Coastal erosion susceptibility zones distribution by best model among districts

The geographical findings that are derived from the analysis of the XGBoost model presented in Fig 13 and Table 5 reveal that the Jashore and Narail districts possess the highest percentage of areas that are classified as low-risk. The districts take up 86.94% and 71.24% of the respective areas. Shariatpur (46.78%) and Gopalganj (52.71%) are primarily the contributors of this kind of low-risk investment. Districts like Barguna, Chandpur, Feni, and Lakshmipur are predominantly endowed with moderate-risk classes, with a high 91.45% of Barguna’s area falling into this category. Spatial patterns, however, indicate that districts like these are largely dominated by these classes. Bagerhat, Khulna, Pirojpur, and Jhalokati are typical districts with high coverage levels in moderate risks, with all of them exceeding 77%. The spatial pattern reveals that Bhola (19.41%), Cox’s Bazar (26.20%), and Patuakhali (21.47%) have the highest percentage of high-risk areas, thus being the most vulnerable districts. Moreover, districts like Noakhali (16.94%) and Satkhira (3.83%) have significant areas of high-risk areas, which signify their vulnerability in the local risk scenario.

thumbnail
Table 5. Coastal erosion Susceptibility zones distribution among districts in the Coastal region by the best model (XGBoost).

https://doi.org/10.1371/journal.pone.0334347.t005

thumbnail
Fig 13. District-wise coastal erosion Susceptibility zones by the best model (XGBoost).

https://doi.org/10.1371/journal.pone.0334347.g013

Future climate change scenarios

According to Fig 14, sea level pressure maps across several RCP scenarios demonstrate that the eastern region exhibits more sea level pressure compared to the western region. In 2040, RCP 2.6 demonstrates a slightly elevated sea level pressure relative to RCP 4.5, 6.0, and 8.5. In 2060, a universal decrease in sea level pressure is noted across all scenarios. By 2080, RCP 8.5 forecasts slightly reduced pressure compared to the other scenarios. By 2100, all Representative Concentration Pathways (RCPs) indicate a little upward trend in sea level pressure.

thumbnail
Fig 14. From 2040 to 2100, the annual average sea level pressure for the various RCP scenarios (RCP 2.6, RCP 4.5, RCP 6.0, and RCP 8.5).

https://doi.org/10.1371/journal.pone.0334347.g014

Coastal erosion vulnerability zones distribution based on different future climate change scenarios

Table 6 and Fig 15 present coastal erosion Susceptibility zones distributions under various climatic scenarios (RCP 2.6, 4.5, 6.0, and 8.5) for the years 2040, 2060, 2080, and 2100. In 2040, RCP 2.6 is characterized by a significant Low category (42%) and a minimal High category (3%), but RCP 4.5, 6.0, and 8.5 exhibit diverse distributions, with RCP 6.0 presenting the most substantial High category (35%). By 2060, RCP 2.6 and 8.5 indicate a substantial transition, with around 47% of the area classified as High, whilst RCP 4.5 and 6.0 also demonstrate an increasing trend in the High category; however, RCP 6.0 exhibits a more equitable distribution (13% Low, 53% Moderate, 34% High). By 2080, a comparable trend persists, with RCP 6.0 and 8.5 exhibiting the highest High category percentages at 48% and 50%, respectively, and RCP 2.6 and 4.5 are at 44% and 45%. By 2100, RCP 2.6, 4.5, and 6.0 exhibit a marked decrease in the High category (between 8% and 19%) and an increase in the Low category, especially in RCP 6.0 (31%). Simultaneously, RCP 8.5 with 40% of the area is still designated as High.

thumbnail
Table 6. Coastal erosion Susceptibility zones distribution according to Different RCP Scenarios (RCP 2.6, RCP 4.5, RCP 6.0, and RCP 8.5) from 2040 to 2100 with Best Model.

https://doi.org/10.1371/journal.pone.0334347.t006

thumbnail
Fig 15. Coastal Erosion Susceptibility for Different RCP Scenarios (RCP 2.6, RCP 4.5, RCP 6.0, and RCP 8.5) from 2040 to 2100 with Best Model (XGBoost).

https://doi.org/10.1371/journal.pone.0334347.g015

Discussion

This study analytically evaluated coastal erosion susceptibility in Bangladesh using sophisticated machine learning algorithms that combined topographical, hydrological, climatic, geological, and human elements. The Normalized Difference Vegetation Index (NDVI) was identified as the most significant variable by feature selection procedures, specifically Pearson correlation, variance inflation factor (VIF), and p-values. The significance of vegetation cover in reducing the danger of erosion was highlighted by the NDVI with up to 28% contribution in the Random Forest (RF) model and 26% in XGBoost. Another important component was the Aridity Index, which made up 20% of avNNet and 12–18% of ensemble configurations. With contributions of 15% in XGBoost and 14–16% in other models, Distance to Coast (DtC) continuously placed highly across all models. On the other hand, geological factors including soil texture and geology had lower significance ratings, usually less than 6%, suggesting that they had little bearing on the variability of erosion in space. These results are reliable with earlier research by R. Bera and R. Maiti [137] and Islam et al. [138], who also highlighted the supremacy of NDVI and coastal proximity in determining coastal susceptibility. The results, however, are in contrast to those of Gao et al. [139], whose modeling approach based on RUSLE gave geomorphological variables more weight. The deterministic nature of RUSLE models, which lack the adaptive learning capability found in the machine learning approaches employed here, is probably the cause of the discrepancy.

Ensemble machine learning methods consistently beat individual classifiers in terms of model performance. With an area under the curve (AUC) score of 0.95, XGBoost outperformed CatBoost and Bagging, which were closely followed by GBDT, RF, and LightGBM (0.93). With an AUC of only 0.78, simpler models such as Decision Tree performed poorly. High-risk zones ranged from 7.9% (RF) to 13.23% (DT), whereas moderate-risk zones, which ranged from 71.82% (Bagging) to 79.36% (GBDT), spanned the largest areas across all models. Particularly, the AUC values in this study surpass those reported by Islam et al. [138] for distinctive between high and low erosion susceptibility zones in the Chattogram coastal region, excellent model performance is indicated by the AUC value of 0.909. Additionally, Bammou et al. [140] documented an AUC of 0.96 for the XGBoost model in their soil erosion evaluation, corroborating the dependability of superior performance results in analogous situations. This suggests that the existing ensemble configurations for coastal erosion scenarios have a greater level of predictive dependability. A different result was reported by Rahmati et al. [141], for susceptibility mapping in Iran, Random Forest (AUC 0.91) performed somewhat better than XGBoost (AUC 0.89). This demonstrates how dataset properties and regional conditions can affect model performance.

According to the high-resolution susceptibility maps created for this study, Bhola (19.41%), Cox’s Bazar (26.20%), and Patuakhali (21.47%) are some of the districts that are most susceptible to erosion at the moment. Conversely, districts with large percentages of low-risk zones include Jashore (86.94%), Narail (71.24%), and Gopalganj (52.71%). This is probably because of things like higher elevation, more vegetation, and a more inland location. Many districts, such as Barguna (91.45%), and Feni (90.13%), are dominated by moderate-risk zones. The spatial distribution of erosion susceptibility resembles well with [142,143] who also recognized Patuakhali and Cox’s Bazar as areas for vulnerability. Although their research focused on tidal-driven riverbank erosion, the current findings point to a combined effect of low elevation, vegetation loss, and weather stresses. Anthropogenic and climatic factors, which are frequently missing from deterministic models like GloSEM and RUSLE, allow for this wider viewpoint. This makes the model a more complete tool for both current and future coastal erosion risk assessment.

This research is distinct from previous studies that it combines CMIP5-based climate scenarios. The risk of coastal erosion is predicted to significantly increase under future climate scenarios, especially under high-emission pathways. High-risk areas are expected to rise from 26% in 2040 to a peak of 50% by 2080, then slightly fall to 40% by 2100, according to RCP 8.5. RCP 6.0 shows similar patterns, with high-risk zones reaching a peak of 48% by 2080. Peak high-risk area coverage, on the other hand, rises more moderately at RCP 2.6 and 4.5, reaching 47% and 45%, respectively. These forecasts highlight how severe coastal damage could occur in the event of unchecked climate change. These patterns align with the work of [144], who documented faster coastline retreat in high sea-level rise scenarios. However, compared to the regional-scale estimates seen in previous work, the spatially explicit outputs of this analysis provide a more detailed view of risk distribution, increasing their usefulness for local adaptation planning.

By combining machine learning, climate modeling, and geospatial analysis, this study creates a solid, data-driven framework for evaluating the danger of coastal erosion. The models are useful tools for predicting erosion under present and future conditions because of their excellent predictive ability and good spatial resolution. This hybrid methodology provides more generalizability and actionable insights for resilient coastal management and governance as compared to traditional approaches used in coastal Bangladesh. Although the modeling framework employed in this study is technically transferable to other geographic locations, its predictability and interpretability will necessarily be region-dependent. Geomorphological variation including coral reefs, deltas, cliffs, and climatic regimes like monsoonal versus arid regimes, and land use necessitate extensive recalibration of the model using local input variables to yield significant and credible results. Not only this but also the lack of real-time erosion monitoring could compromise temporal accuracy. Future research should focus on the transfer and calibration of this framework across different geographic and climatic settings, employing local parameters, and also real-time monitoring to enhance model robustness, generalizability, and applicability across different coastal systems.

Conclusion

The security of Bangladesh’s shoreline is in danger because sea levels are rising faster, climate change is getting worse, and the land is becoming less stable. Using modern machine learning methods and 20 important meteorological, geographical, hydrological, tropological, and land-use factors, this study looked at how Susceptibility the coast is to erosion. The main things that affect erosion risk are the normalized difference vegetation index (NDVI). To find areas that are likely to erode, different ensemble learning methods were used, such as bagging (Random Forest, Bagging, Decision Tree, Treebag, averaging Neural Network) and boosting (XGBoost, LightGBM, CatBoost, GBDT, AdaBoost). XGBoost was used to get an area under the curve (AUC) of 0.95, which means it was better at predicting the future and finding high-risk areas. The results show big differences in geography, with most of the high-risk places being in Bhola, Cox’s Bazar, and Patuakhali. This was the breakdown of high-risk places in these three areas: 19.41%, 26.20%, and 21.47%. Barguna (91.45%), and Feni (90.13%) have a relatively high number of places that are considered to have moderate risk. RCP models 2.6, 4.5, 6.0, and 8.5 all predict that the number of high-risk areas will grow in the years 2040, 2060, 2080, and 2100. This is especially true for RCP 8.5, which says that shares will grow by 50% by 2080 and by 40% by 2100. When the RCP number is 2.6, 4.5, or 6.0, it means that the risk has changed, though only slightly. These results show how important it is to build buildings that can withstand climate change, use methods to stop erosion, and make policies that adapt to new situations. It is important to protect especially Susceptibility groups by setting up map early warning systems, enforcing zoning laws, building marine embankments, and planting trees. Our results also point to the ethical challenge of coastal erosion management, since a lot of the high-risk zones coincide with marginalized groups. Policymakers must utilize these projections to create inclusive plans that reduce displacement risks and promote equitable access to resources. Incorporating community-based adaptation and livelihood assistance will be crucial for equitable and sustainable coastal resilience planning. Even though this study is very thorough, it has some limitations. If scholars do not take into account changing socio-economic factors like changes in land use and the building of new infrastructure, the risk of erosion might be underrated. Mitigation strategies such as population relocation can also disproportionately impact underprivileged people, resulting in loss of livelihoods, displacement, or limited access to critical services. Therefore, evaluations of erosion risk and policy measures must take into account socioeconomic and environmental factors in order to achieve inclusive and equitable results. To enhance predictive accuracy, future studies ought to make use of integrated socio-environmental databases, real-time monitoring, and high-resolution remote sensing. The UN’s Sustainable Development Goals (SDG) 11 (Sustainable Cities and Communities) and 13 (Climate Action) are in line with this work. It gives useful information that could help with managing the coast and making plans for long-term protection in Bangladeshi places that are prone to erosion.

References

  1. 1. Burkett V, Davidson M. Coastal Impacts, Adaptation, and Vulnerabilities. Island Press/Center for Resource Economics. 2012. doi: https://doi.org/10.5822/978-1-61091-460-4
  2. 2. Islam MR, Khan NA, Reza MM, Rahman MM. Vulnerabilities of River Erosion–Affected Coastal Communities in Bangladesh: a Menu of Alternative Livelihood Options. Glob Soc Welf. 2020;7(4):353–66.
  3. 3. Roy S, Pandit S, Papia M, Rahman MM, Otto Rehder Ocampo JC, Razi MA, et al. Coastal erosion risk assessment in the dynamic estuary: The Meghna estuary case of Bangladesh coast. International Journal of Disaster Risk Reduction. 2021;61:102364.
  4. 4. Safra de Campos R, et al. Where people live and move in deltas. 2020.
  5. 5. Mallick A, Mallick B. Staying despite riverbank erosion: evidence of coastal Bangladesh. SN Soc Sci. 2021;1(6).
  6. 6. Rimi RA, Rafi AH, Sultana J. Factors affecting sustainable resource management in. 2024;(March).
  7. 7. Gantulga N, Iimaa T, Batmunkh M, Surenjav U, Tserennadmin E, Turmunkh T, et al. Impacts of natural and anthropogenic factors on soil erosion. Proc Mong Acad Sci. 2023;:3–18.
  8. 8. Marks D, Bayrak MM, Connell J. Increasing livelihood vulnerabilities to coastal erosion and wastewater intrusion: The political ecology of Thai aquaculture in peri‐urban Bangkok. Geographical Research. 2023;61(2):259–72.
  9. 9. Nguyen Minh Q, Vu Van H, Nguyen Thi M, Ngo Thi D, Nguyen Thi Thu C, Dang Minh T, et al. Holocene sedimentary facies in the incised valley of Ma River Delta, Vietnam. Vietnam J Earth Sci. 2023.
  10. 10. Prasad DH, Kumar ND. Coastal Erosion Studies—A Review. IJG. 2014;05(03):341–5.
  11. 11. Murshed S, Griffin AL, Islam MA, Wang XH, Paull D. Assessing multi-climate-hazard threat in the coastal region of Bangladesh by combining influential environmental and anthropogenic factors. Progress in Disaster Science. 2022;16:100261.
  12. 12. Martínez ML, Intralawan A, Vázquez G, Pérez-Maqueo O, Sutton P, Landgrave R. The coasts of our world: Ecological, economic and social importance. Ecological Economics. 2007;63(2–3):254–72.
  13. 13. Shampa MTA, Shimu NJ, Chowdhury KMA, Islam MM, Ahmed MK. A comprehensive review on sustainable coastal zone management in Bangladesh: Present status and the way forward. Heliyon. 2023;9(8):e18190. pmid:37554819
  14. 14. van Steenbergen F r a n k, Marandy J o h n. Loose and gain: coastal erosion in Bangladesh. The Water Channel. 2023. https://thewaterchannel.tv/thewaterblog/loose-and-gain-coastal-erosion-in-bangladesh/
  15. 15. Islam MdM, Rahman MdS, Kabir MdA, Islam MdN, Chowdhury RM. Predictive assessment on landscape and coastal erosion of Bangladesh using geospatial techniques. Remote Sensing Applications: Society and Environment. 2020;17:100277.
  16. 16. Thanh BN, Van Phong T, Trinh PT, Costache R, Amiri M, Nguyen DD, et al. Prediction of coastal erosion susceptible areas of Quang Nam Province, Vietnam using machine learning models. Earth Sci Inform. 2023;17(1):401–19.
  17. 17. V.M. P, Aldehim G, Negm N, Subathradevi S. Integrating geospatial techniques and machine learning for assessing soil erosion and associated geomorphic risks. Journal of South American Earth Sciences. 2025;156:105463.
  18. 18. Çelik Oİ, Gazioğlu C. Coast type based accuracy assessment for coastline extraction from satellite image with machine learning classifiers. The Egyptian Journal of Remote Sensing and Space Science. 2022;25(1):289–99.
  19. 19. Sarkar SK, Ansar SB, Ekram KMM, Khan MH, Talukdar S, Naikoo MW, et al. Developing Robust Flood Susceptibility Model with Small Numbers of Parameters in Highly Fertile Regions of Northwest Bangladesh for Sustainable Flood and Agriculture Management. Sustainability. 2022;14(7):3982.
  20. 20. Eloudi H, Hssaisoune M, Reddad H, Namous M, Ismaili M, Krimissa S, et al. Robustness of Optimized Decision Tree-Based Machine Learning Models to Map Gully Erosion Vulnerability. Soil Systems. 2023;7(2):50.
  21. 21. Ghosh A, Maiti R. Soil erosion susceptibility assessment using logistic regression, decision tree and random forest: study on the Mayurakshi river basin of Eastern India. Environ Earth Sci. 2021;80(8).
  22. 22. Saha S, Roy J, Arabameri A, Blaschke T, Tien Bui D. Machine Learning-Based Gully Erosion Susceptibility Mapping: A Case Study of Eastern India. Sensors (Basel). 2020;20(5):1313. pmid:32121238
  23. 23. Gayen A, Pourghasemi HR, Saha S, Keesstra S, Bai S. Gully erosion susceptibility assessment and management of hazard-prone areas in India using different machine learning algorithms. Sci Total Environ. 2019;668:124–38. pmid:30851678
  24. 24. Sadia H, Sarkar SK, Haydar M. Soil erosion susceptibility mapping in Bangladesh. Ecological Indicators. 2023;156:111182.
  25. 25. Rudra RR, Sarkar SK. Artificial neural network for flood susceptibility mapping in Bangladesh. Heliyon. 2023;9(6):e16459. pmid:37251459
  26. 26. Khosravi K, Rezaie F, Cooper JR, Kalantari Z, Abolfathi S, Hatamiafkoueieh J. Soil water erosion susceptibility assessment using deep learning algorithms. Journal of Hydrology. 2023;618:129229.
  27. 27. AlDahoul N, Essam Y, Kumar P, Ahmed AN, Sherif M, Sefelnasr A, et al. Suspended sediment load prediction using long short-term memory neural network. Sci Rep. 2021;11(1):7826. pmid:33837236
  28. 28. Rao DR, Raju PBN. Coastal sea line dynamics using machine learning and LSTM based deep learning models. 2021. https://papers.ssrn.com/sol3/papers.cfm?abstract_id=5060685
  29. 29. Ait Naceur H, Abdo HG, Igmoullan B, Namous M, Alshehri F, A Albanai J. Implementation of random forest, adaptive boosting, and gradient boosting decision trees algorithms for gully erosion susceptibility mapping using remote sensing and GIS. Environ Earth Sci. 2024;83(3).
  30. 30. Rahman M, Rafi MKH, Tasmim N, Islam MM, Raju MR, Karim MR. Unveiling the Wrath of Erosion: Assessing the Impacts of Soil Erosion on the Coastal Region of Bangladesh. IUBAT Rev. 2023;6(2):1–24.
  31. 31. Islam MdR, Imran HM, Islam MdR, Saha GC. A RUSLE-based comprehensive strategy to assess soil erosion in a riverine country, Bangladesh. Environ Earth Sci. 2024;83(6).
  32. 32. Hasan MdH, Ahmed A, Nafee KM, Hossen MdA. Use of machine learning algorithms to assess flood susceptibility in the coastal area of Bangladesh. Ocean & Coastal Management. 2023;236:106503.
  33. 33. Mamun AA, Islam ARMdT, Alam E, Chandra Pal S, Alam GMM. Assessing Riverbank Erosion and Livelihood Resilience Using Traditional Approaches in Northern Bangladesh. Sustainability. 2022;14(4):2348.
  34. 34. Akter J, Islam SN. Water resources management in the coastal region of Bangladesh. Water Resources Management in the Coastal Region of Bangladesh. 2022.
  35. 35. Ahmed A, Nawaz R, Drake F, Woulds C. Modelling land susceptibility to erosion in the coastal area of Bangladesh: A geospatial approach. Geomorphology. 2018;320:82–97.
  36. 36. Ministry of Environment and Forests G. Bangladesh: National programme of action for protection of the coastal and marine environment from land-based activities. 2006.
  37. 37. Akter J, Islam SN. Water resources management in the coastal region of Bangladesh. Water Resources Management in the Coastal Region of Bangladesh. 2022;(January 2010).
  38. 38. C. D. S. Proceeding. Draft Coastal Development Strategy (CDS): Highlights, no. October 2004. 2005. p. 2004–2007.
  39. 39. Roy B, Penha-Lopes GP, Uddin MS, Kabir MH, Lourenço TC, Torrejano A. Sea level rise induced impacts on coastal areas of Bangladesh and local-led community-based adaptation. International Journal of Disaster Risk Reduction. 2022;73:102905.
  40. 40. Alves DCL, Weschenfelder J, Ferreira-Cravo M, de A. Espinoza JM, da G. Albuquerque M. Advances in the Application of Digital Elevation Models (DEMs) for the Evaluation of Coastal Flooding. Mercator (Fortaleza). 2021;20(April):e20012.
  41. 41. Eakins BW, Grothe PR. Challenges in Building Coastal Digital Elevation Models. Journal of Coastal Research. 2014;297:942–53.
  42. 42. Ahmed N, Howlader N, Hoque MA-A, Pradhan B. Coastal erosion vulnerability assessment along the eastern coast of Bangladesh using geospatial techniques. Ocean & Coastal Management. 2021;199:105408.
  43. 43. Cissé COT, Marić I, Domazetović F, Glavačević K, Almar R. Derivation of Coastal Erosion Susceptibility and Socio-Economic Vulnerability Models for Sustainable Coastal Management in Senegal. Sustainability. 2024;16(17):7422.
  44. 44. Chen G, Zeng T, Liu D, Chen H, Wang L, Wang L, et al. Geomorphological and Geological Characteristics Slope Unit: Advancing Township-Scale Landslide Susceptibility Assessment Strategies. Land. 2025;14(2):355.
  45. 45. Dionísio António S, van der Werf J, Horstman E, Cáceres I, Alsina J, van der Zanden J, et al. Influence of Beach Slope on Morphological Changes and Sediment Transport under Irregular Waves. JMSE. 2023;11(12):2244.
  46. 46. Samia J, Temme A, Bregt A, Wallinga J, Guzzetti F, Ardizzone F, et al. Characterization and quantification of path dependency in landslide susceptibility. Geomorphology. 2017;292:16–24.
  47. 47. Winzeler HE, Owens PR, Read QD, Libohova Z, Ashworth A, Sauer T. Topographic Wetness Index as a Proxy for Soil Moisture in a Hillslope Catena: Flow Algorithms and Map Generalization. Land. 2022;11(11):2018.
  48. 48. De Reu J, Bourgeois J, Bats M, Zwertvaegher A, Gelorini V, De Smedt P, et al. Application of the topographic position index to heterogeneous landscapes. Geomorphology. 2013;186:39–49.
  49. 49. Al-Sababhah N. Topographic Position Index to Landform Classification and Spatial Planning, Using GIS, for Wadi Araba, South West Jordan. eer. 2023;11(1):79–101.
  50. 50. Riley SJ. (PDF) A Terrain Ruggedness Index that Quantifies Topographic Heterogeneity. Accessed: May 18, 2025. https://www.researchgate.net/publication/259011943_A_Terrain_Ruggedness_Index_that_Quantifies_Topographic_Heterogeneity
  51. 51. Sigren JM, Figlus J, Armitage AR. Coastal sand dunes and dune vegetation: Restoration, erosion, and storm protection. Shore & Beach. https://www.researchgate.net/publication/270822597
  52. 52. Stefano CD, Ferro V, Porto P, Tusa G. Slope curvature influence on soil erosion and deposition processes. Water Resources Research. 2000;36(2):607–17.
  53. 53. Chen T, Shu J, Han L, Tian G, Yang G, Lv J. Modeling the effects of topography and slope gradient of an artificially formed slope on runoff, sediment yield, water and soil loss of sandy soil. CATENA. 2022;212:106060.
  54. 54. Zomer RJ, Xu J, Trabucco A. Version 3 of the Global Aridity Index and Potential Evapotranspiration Database. Sci Data. 2022;9(1):409. pmid:35840601
  55. 55. FAO. Chapter I. The arid environments. Arid zone forestry: A guide for field tecnhicians. Accessed: May 18, 2025. https://www.fao.org/4/t0122e/t0122e03.htm
  56. 56. Bennie J, Huntley B, Wiltshire A, Hill MO, Baxter R. Slope, aspect and climate: Spatially explicit and implicit models of topographic microclimate in chalk grassland. Ecological Modelling. 2008;216(1):47–59.
  57. 57. Environmental Research. Normalized Difference Vegetation Index - an overview | ScienceDirect Topics. Accessed: May 18, 2025. https://www.sciencedirect.com/topics/earth-and-planetary-sciences/normalized-difference-vegetation-index
  58. 58. Rahman M, Haque T, Jahan Rakhi T, Hossain Rafi A, Islam Oishi S. Carbon consequences of deforestation in Khagrachhari of Chittagong Hill Tracts: a GIS and remote sensing approach. 2024;(February).
  59. 59. Ke Q, Zhang K. Interaction effects of rainfall and soil factors on runoff, erosion, and their predictions in different geographic regions. Journal of Hydrology. 2022;605:127291.
  60. 60. Kelpšaite L, Dailidiene I. Influence of wind wave climate change on coastal processes in the eastern Baltic Sea. Journal of Coastal Research. https://www.researchgate.net/publication/286004480_Influence_of_wind_wave_climate_change_on_coastal_processes_in_the_eastern_Baltic_Sea
  61. 61. Jarmalavičius D, Satkūnas J, Žilinskas G, Pupienis D. The influence of coastal morphology on wind dynamics. Estonian Journal of Earth Sciences. https://www.researchgate.net/publication/267409655_The_influence_of_coastal_morphology_on_wind_dynamics
  62. 62. Flemming BW, Hansom JD. Estuarine and Coastal Geology and Geomorphology – A Synthesis. Treatise on Estuarine and Coastal Science. Elsevier. 2011. p. 1–5. doi: https://doi.org/10.1016/b978-0-12-374711-2.00301-6
  63. 63. Vargas-T. VH, Uribe-P. E, Castellanos-A. OM, Ríos-R. CA. Coastal landforms caused by deposition and erosion along the shoreline between Punta Brava and Punta Betín, Santa Marta, Colombian Caribbean. Rev Acad Colomb Cienc Ex Fis Nat. 2016;40(157):664–82.
  64. 64. Finch HJS, Samuel AM, Lane GPF. Soils and soil management. Lockhart & Wiseman’s Crop Husbandry Including Grassland. Elsevier. 2014. p. 37–62. doi: https://doi.org/10.1533/9781782423928.1.37
  65. 65. Karra K, Kontgis C, Statman-Weil Z, Mazzariello JC, Mathis M, Brumby SP. Global land use / land cover with Sentinel 2 and deep learning. In: 2021 IEEE International Geoscience and Remote Sensing Symposium IGARSS. 2021. 4704–7. doi: https://doi.org/10.1109/igarss47720.2021.9553499
  66. 66. Xu P, et al. Comparative validation of recent 10 m-resolution global land cover maps. Remote Sens Environ. 2024;311:114316.,
  67. 67. Thonon I, Middelkoop H, van der Perk M. The influence of floodplain morphology and river works on spatial patterns of overbank deposition. Netherlands Journal of Geosciences. 2007;86(1):63–75.
  68. 68. Opengeology. 5 Weathering, Erosion, and Sedimentary Rocks – An Introduction to Geology. Accessed: May 18, 2025. https://opengeology.org/textbook/5-weathering-erosion-and-sedimentary-rocks/
  69. 69. Al Shoumik BA, Khan MZ, Islam MS. Soil erosion estimation by RUSLE model using GIS and remote sensing techniques: A case study of the tertiary hilly regions in Bangladesh from 2017 to 2021. Environ Monit Assess. 2023;195(9):1096. pmid:37626274
  70. 70. Pandey S, Kumar P, Zlatic M, Nautiyal R, Panwar VP. Recent advances in assessment of soil erosion vulnerability in a watershed. International Soil and Water Conservation Research. 2021;9(3):305–18.
  71. 71. Allafta H, Opp C. Soil Erosion Assessment Using the RUSLE Model, Remote Sensing, and GIS in the Shatt Al-Arab Basin (Iraq-Iran). Applied Sciences. 2022;12(15):7776.
  72. 72. Vrieling A, Sterk G, de Jong SM. Satellite-based estimation of rainfall erosivity for Africa. Journal of Hydrology. 2010;395(3–4):235–41.
  73. 73. Radziuk H, Świtoniak M. Soil erodibility factor (K) in soils under varying stages of truncation. Soil Sci Ann. 2021;72(1):1–8.
  74. 74. Erencin Z. C-Factor Mapping Using Remote Sensing and GIS. A case study of Lom Sak/Lom Kao, Thailand. Transformation. Accessed: May 17, 2025. https://www.researchgate.net/publication/30813051_C-Factor_mapping_using_Remote_Sensing_and_GIS_a_case_study_of_Lom_Sak_Lom_Kao_Thailand
  75. 75. Girma R, Gebre E. Spatial modeling of erosion hotspots using GIS-RUSLE interface in Omo-Gibe river basin, Southern Ethiopia: implication for soil and water conservation planning. Environ Syst Res. 2020;9(1).
  76. 76. Hernando D, Romana MG. Development of a Soil Erosion Classification System for Cut and Fill Slopes. Transp Infrastruct Geotech. 2015;2(4):155–66.
  77. 77. Nosrati K, Collins AL. A soil quality index for evaluation of degradation under land use and soil erosion categories in a small mountainous catchment, Iran. J Mt Sci. 2019;16(11):2577–90.
  78. 78. Jorgenson MT, Brown J. Classification of the Alaskan Beaufort Sea Coast and estimation of carbon and sediment inputs from coastal erosion. Geo-Mar Lett. 2004;25(2–3):69–80.
  79. 79. Qi L, Zhou Y, Van Oost K, Ma J, van Wesemael B, Shi P. High-resolution soil erosion mapping in croplands via Sentinel-2 bare soil imaging and a two-step classification approach. Geoderma. 2024;446:116905.
  80. 80. Shrestha N. Detecting Multicollinearity in Regression Analysis. AJAMS. 2020;8(2):39–42.
  81. 81. Wagavkar S. Correlation matrix. IEEE Transactions on Computers. https://builtin.com/data-science/correlation-matrix
  82. 82. O’brien RM. A Caution Regarding Rules of Thumb for Variance Inflation Factors. Qual Quant. 2007;41(5):673–90.
  83. 83. Ma M, Zhao G, He B, Li Q, Dong H, Wang S, et al. XGBoost-based method for flash flood risk assessment. Journal of Hydrology. 2021;598:126382.
  84. 84. Chen T, Guestrin C. XGBoost. In: Proceedings of the 22nd ACM SIGKDD International Conference on Knowledge Discovery and Data Mining. 2016. 785–94. doi: https://doi.org/10.1145/2939672.2939785
  85. 85. Haydar M, Rafi AH, Hasan Khan MdK, Hosan S, Sadia H. Mapping climate change induced forest fire susceptibility using tree-based machine learning algorithms in Bangladesh. Ecological Indicators. 2025;176:113657.
  86. 86. Haydar M, Hossain Rafi A, Sadia H, Tanvir Hossain M. Data driven forest fire susceptibility mapping in Bangladesh. Ecological Indicators. 2024;166:112264.
  87. 87. Arif Ali Z, H. Abduljabbar Z, A. Tahir H, Bibo Sallow A, Almufti SM. eXtreme Gradient Boosting Algorithm with Machine Learning: a Review. Acad J Nawroz Univ. 2023;12(2):320–34.
  88. 88. Huang X, Li Z, Jin Y, Zhang W. Fair-AdaBoost: Extending AdaBoost method to achieve fair classification. Expert Systems with Applications. 2022;202:117240.
  89. 89. Mosavi A, Hosseini FS, Choubin B. Ensemble boosting and bagging based machine learning models for groundwater potential prediction. 2020.
  90. 90. Hastie T, Rosset S, Zhu J, Zou H. Multi-class AdaBoost. Statistics and Its Interface. 2009;2(3):349–60.
  91. 91. Kégl B, Busa-Fekete R. Boosting products of base classifiers. In: Proceedings of the 26th Annual International Conference on Machine Learning, 2009. 497–504. doi: https://doi.org/10.1145/1553374.1553439
  92. 92. Haidong C, Shangqi D, Xingke G, Shuangde H, Tao W, Debin X, et al. Multi-temporal remote sensing fire detection based on GBDT in Yunnan area. In: 2020 2nd International Conference on Machine Learning, Big Data and Business Intelligence (MLBDBI), 2020. 469–73. doi: https://doi.org/10.1109/mlbdbi51377.2020.00099
  93. 93. Wen Z, He B, Kotagiri R, Lu S, Shi J. Efficient Gradient Boosted Decision Tree Training on GPUs. In: 2018 IEEE International Parallel and Distributed Processing Symposium (IPDPS). 2018. 234–43. doi: https://doi.org/10.1109/ipdps.2018.00033
  94. 94. Shu Y, Shi C, Yi B, Zhao P, Guan L, Zhou M. Influence of Climatic Factors on Lightning Fires in the Primeval Forest Region of the Northern Daxing’an Mountains, China. Sustainability. 2022;14(9):5462.
  95. 95. Friedman JH. Greedy function approximation: A gradient boosting machine. Ann Statist. 2001;29(5).
  96. 96. Seto H, Oyama A, Kitora S, Toki H, Yamamoto R, Kotoku J, et al. Gradient boosting decision tree becomes more reliable than logistic regression in predicting probability for diabetes with big data. Sci Rep. 2022;12(1):15889. pmid:36220875
  97. 97. Ileri K. Comparative analysis of CatBoost, LightGBM, XGBoost, RF, and DT methods optimised with PSO to estimate the number of k-barriers for intrusion detection in wireless sensor networks. Int J Mach Learn & Cyber. 2025;16(9):6937–56.
  98. 98. Omotehinwa TO, Oyewola DO, Moung EG. Optimizing the light gradient-boosting machine algorithm for an efficient early detection of coronary heart disease. Informatics and Health. 2024;1(2):70–81.
  99. 99. Machado MR, Karray S, de Sousa IT. LightGBM: an Effective Decision Tree Gradient Boosting Method to Predict Customer Loyalty in the Finance Industry. In: 2019 14th International Conference on Computer Science & Education (ICCSE). 2019. 1111–6. doi: https://doi.org/10.1109/iccse.2019.8845529
  100. 100. Gan M, Pan S, Chen Y, Cheng C, Pan H, Zhu X. Application of the Machine Learning LightGBM Model to the Prediction of the Water Levels of the Lower Columbia River. JMSE. 2021;9(5):496.
  101. 101. Hancock JT, Khoshgoftaar TM. CatBoost for big data: an interdisciplinary review. J Big Data. 2020;7(1):94. pmid:33169094
  102. 102. Zhang Y, Zhao Z, Zheng J. CatBoost: A new approach for estimating daily reference crop evapotranspiration in arid and semi-arid regions of Northern China. Journal of Hydrology. 2020;588:125087.
  103. 103. Liaw A, Wiener M. The R Journal: Classification and regression by randomForest. R Journal. 2002;2(3):18–22.
  104. 104. Conesa A, Hernández R. Omics Data Integration in Systems Biology. Comprehensive Analytical Chemistry. Elsevier. 2014. p. 441–59. doi: https://doi.org/10.1016/b978-0-444-62650-9.00016-6
  105. 105. Richman R, Wüthrich MV. Nagging Predictors. Risks. 2020;8(3):83.
  106. 106. Ngo G, Beard R, Chandra R. Evolutionary bagging for ensemble learning. Neurocomputing. 2022;510:1–14.
  107. 107. Chan JC-W, Paelinckx D. Evaluation of Random Forest and Adaboost tree-based ensemble classification and spectral band selection for ecotope mapping using airborne hyperspectral imagery. Remote Sensing of Environment. 2008;112(6):2999–3011.
  108. 108. Korecki JN, Banfield RE, Hall LO, Bowyer KW, Kegelmeyer WP. Semi-supervised learning on large complex simulations. In: 2008 19th International Conference on Pattern Recognition. 2008. 1–4. doi: https://doi.org/10.1109/icpr.2008.4761797
  109. 109. Chollet F. Machine learning 분야 소개 및 주요 방법론 학습 기본 machine learning 알고리즘에 대한 이해 및 응용 관련 최신 연구 동향 습득. Mach Learn. 2017;45(13):40–8.
  110. 110. Cho JH, Kurup PU. Decision tree approach for classification and dimensionality reduction of electronic nose data. Sensors and Actuators B: Chemical. 2011;160(1):542–8.
  111. 111. Packwood D, Nguyen LTH, Cesana P, Zhang G, Staykov A, Fukumoto Y, et al. Machine Learning in Materials Chemistry: An Invitation. Machine Learning with Applications. 2022;8:100265.
  112. 112. Murthy SK. Automatic Construction of Decision Trees from Data: A Multi-Disciplinary Survey. Data Mining and Knowledge Discovery. 1998;2(4):345–89.
  113. 113. Ahmed AM, Rizaner A, Ulusoy AH. A Decision Tree Algorithm Combined with Linear Regression for Data Classification. In: 2018 International Conference on Computer, Control, Electrical, and Electronics Engineering (ICCCEEE). 2018. 1–5. doi: https://doi.org/10.1109/iccceee.2018.8515759
  114. 114. Scornet E. On the asymptotics of random forests. Journal of Multivariate Analysis. 2016;146:72–83.
  115. 115. Altman N, Krzywinski M. Ensemble methods: bagging and random forests. Nat Methods. 2017;14(10):933–4.
  116. 116. Breiman L. Bagging Predictors. Machine Learning. 1996;24(2):123–40.
  117. 117. Oza NC. Online bagging and boosting. Conf Proc IEEE Int Conf Syst Man Cybern. 2005;3(2001):2340–2345.
  118. 118. Kleinberg EM. On the algorithmic implementation of stochastic discrimination. IEEE Trans Pattern Anal Machine Intell. 2000;22(5):473–90.
  119. 119. Quinlan JR. Combining Instance-Based and Model-Based Learning. Machine Learning Proceedings 1993. Elsevier. 1993. p. 236–43. doi: https://doi.org/10.1016/b978-1-55860-307-3.50037-x
  120. 120. Yang S, Browne A. Neural network ensembles: combining multiple models for enhanced performance using a multistage approach. Expert Systems. 2004;21(5):279–88.
  121. 121. Burton TA. Averaged neural networks. 1993;6(3):677–80.
  122. 122. Hansen LK, Salamon P. Neural network ensembles. IEEE Trans Pattern Anal Machine Intell. 1990;12(10):993–1001.
  123. 123. Krogh A, Vedelsby J. Neural network ensembles, cross validation, and active learning Anders. 2010. 6–7.
  124. 124. Zhou Z-H, Wu J, Tang W. Corrigendum to “Ensembling neural networks: Many could be better than all” Artificial Intelligence 137 (1–2) (2002) 239–263. Artificial Intelligence. 2010;174(18):1570.
  125. 125. Tien Bui D, Pham BT, Nguyen QP, Hoang N-D. Spatial prediction of rainfall-induced shallow landslides using hybrid integration approach of Least-Squares Support Vector Machines and differential evolution optimization: a case study in Central Vietnam. International Journal of Digital Earth. 2016;9(11):1077–97.
  126. 126. Richardson E, Trevizani R, Greenbaum JA, Carter H, Nielsen M, Peters B. The receiver operating characteristic curve accurately assesses imbalanced datasets. Patterns (N Y). 2024;5(6):100994. pmid:39005487
  127. 127. Haydar M, Hosan S, Rafi AH. Assessment of urban expansion susceptibility in major urban units of Bangladesh leveraging machine learning and geostatistical approach. Journal of Urban Management. 2025;14(2):451–67.
  128. 128. Mazumder T, Saroar MdM. Lightning-induced vulnerability assessment in Bangladesh using machine learning and GIS-based approach. Progress in Disaster Science. 2025;25:100406.
  129. 129. Riehl K, Neunteufel M, Hemberg M. Hierarchical confusion matrix for classification performance evaluation. Journal of the Royal Statistical Society Series C: Applied Statistics. 2023;72(5):1394–412.
  130. 130. Taylor KE, Stouffer RJ, Meehl GA. An Overview of CMIP5 and the Experiment Design. Bulletin of the American Meteorological Society. 2012;93(4):485–98.
  131. 131. Sarkar SK, Rudra RR, Talukdar S, Das PC, Nur MS, Alam E, et al. Future groundwater potential mapping using machine learning algorithms and climate change scenarios in Bangladesh. Sci Rep. 2024;14(1):10328. pmid:38710767
  132. 132. Rupp DE, Abatzoglou JT, Hegewisch KC, Mote PW. Evaluation of CMIP5 20th century climate simulations for the Pacific Northwest USA. JGR Atmospheres. 2013;118(19).
  133. 133. van Vuuren DP, Edmonds J, Kainuma M, Riahi K, Thomson A, Hibbard K, et al. The representative concentration pathways: an overview. Climatic Change. 2011;109(1–2):5–31.
  134. 134. Ghazi B, Jeihouni E, Kouzehgar K, Haghighi AT. Assessment of probable groundwater changes under representative concentration pathway (RCP) scenarios through the wavelet–GEP model. Environ Earth Sci. 2021;80(12).
  135. 135. FitzGerald DM, Fenster MS, Argow BA, Buynevich IV. Coastal Impacts Due to Sea-Level Rise. Annu Rev Earth Planet Sci. 2008;36(1):601–47.
  136. 136. Dong WS, Ismailluddin A, Yun LS, Ariffin EH, Saengsupavanich C, Abdul Maulud KN, et al. The impact of climate change on coastal erosion in Southeast Asia and the compelling need to establish robust adaptation strategies. Heliyon. 2024;10(4):e25609. pmid:38375273
  137. 137. Bera R, Maiti R. An assessment of coastal vulnerability using geospatial techniques. Environ Earth Sci. 2021;80(8).
  138. 138. Islam MdK, Jahan I, Azad A. Spatial Assessment of Coastal Erosion Vulnerability: Natural and Anthropogenic Factors in Chattogram, Bangladesh. Natural Hazards Research. 2025.
  139. 139. Gao G, Liang Y, Liu J, Dunkerley D, Fu B. A modified RUSLE model to simulate soil erosion under different ecological restoration types in the loess hilly area. International Soil and Water Conservation Research. 2024;12(2):258–66.
  140. 140. Bammou Y, Benzougagh B, Abdessalam O, Brahim I, Kader S, Spalevic V, et al. Machine learning models for gully erosion susceptibility assessment in the Tensift catchment, Haouz Plain, Morocco for sustainable development. Journal of African Earth Sciences. 2024;213:105229.
  141. 141. Rahmati O, Falah F, Naghibi SA, Biggs T, Soltani M, Deo RC, et al. Land subsidence modelling using tree-based machine learning algorithms. Sci Total Environ. 2019;672:239–52. pmid:30959291
  142. 142. Anwar MdS, Rahman K, Bhuiyan MAE, Saha R. Assessment of Sea Level and Morphological Changes along the Eastern Coast of Bangladesh. JMSE. 2022;10(4):527.
  143. 143. Toa SS, Ali AAB. A Remote Sensing-Based Approach to Identifying Spatio-Temporal Changes in Coastal Morphology in Bhola District, Bangladesh. J eng adv. 2022:76–90.
  144. 144. Jevrejeva S, Williams J, Vousdoukas MI, Jackson LP. Future sea level rise dominates changes in worst case extreme sea levels along the global coastline by 2100. Environ Res Lett. 2023;18(2):024037.