Figures
Abstract
Accurate monthly runoff forecasting is vital for water management, flood control, hydropower, and irrigation. In glacierized catchments affected by climate change, runoff is influenced by complex hydrological processes, making precise forecasting even more challenging. To address this, the study focuses on the Lotschental catchment in Switzerland, conducting a comprehensive comparison between deep learning and ensemble-based models. Given the significant autocorrelation in runoff time series data, which may hinder the evaluation of prediction models, a novel statistical method is employed to assess the effectiveness of forecasting models in detecting turning points in the runoff data. The performance of Extreme Gradient Boosting (XGBoost) was compared with long short-term memory (LSTM) and random forest (RF) models for one-month-ahead runoff forecasting. The study used 20 years of runoff data (2002–2021), with 70% (2002–2015) dedicated for training and calibration, and the remaining data (2016–2021) for testing. The findings for the testing phase results show that the XGBoost model achieves the best accuracy, with R² of 0.904, RMSE of 1.554 m³/sec, an NSE of 0.797, and Willmott index (d) of 0.972, outperforming both the LSTM and RF models. The study also found that the XGBoost model estimated turning points more accurately, obtaining forecasting improvements of up to 22% to 34% compared to LSTM and RF models. Overall, the study’s findings are essential for global water resource management, providing insights that can inform sustainable practices to support societies impacted by climate change.
Citation: Hameed MM, Masood A, hamid A, Elbeltagi A, Razali SFM, Salem A (2025) Forecasting monthly runoff in a glacierized catchment: A comparison of extreme gradient boosting (XGBoost) and deep learning models. PLoS One 20(5): e0321008. https://doi.org/10.1371/journal.pone.0321008
Editor: Shicheng Li, KTH Royal Institute of Technology: Kungliga Tekniska Hogskolan, SWEDEN
Received: September 11, 2024; Accepted: February 28, 2025; Published: May 23, 2025
Copyright: © 2025 Hameed et al. This is an open access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited.
Data Availability: All relevant data are within the paper and its Supporting Information files.
Funding: The author(s) received no specific funding for this work.
Competing interests: The authors have declared that no competing interests exist.
Abbreviations: AAETP, Average absolute error of turning points; ANN, Artificial Neural Network; AR, Autoregressive; ARIMA, Autoregressive Integrated Moving Average; ARMA, Autoregressive Moving Average; BiLSTM, Bidirectional Long Short-Term Memory; CEEMDAN, Complete Ensemble Empirical Mode Decomposition with Adaptive Noise; EANN, Emotional Artificial Neural Network; ELM, Extreme Learning Machine; EOF, Edge-Of-Field; FOEN, Swiss Federal Office for the environment; GBM, Gradient Boosting Machines; GMM, Gaussian Mixture Model; HBV, Hydrologiska Byråns Vattenbalansavdelning; LSSVM, Least Squares Support Vector Machine; LSTM, Long Short-Term Memory; MAE, Mean Absolute Error; NSE, Nash-Sutcliffe Efficiency Index; PACF, Partial Autocorrelation Function; R2, Coefficient of Determination; RF, Random Forest; RMSE, Root Mean Square Error; RNN, Recurrent Neural Network; RNN, Recurrent Neural Network; SHAP, SHapley Additive exPlanations; SSA, Singular Spectrum Analysis; SVM, Support Vector Machine; SVR; U95, Uncertainty at a 95% Confidence Interval; VMD, Variational Mode Decomposition; XGBoost, Extreme Gradient Boostig.
1. Introduction
Hydrological Runoff Forecasting plays a crucial role in today’s water resource management, regional water resource system, controlling flood water for agriculture and hydrological power projects and environmental impact assessment [1,2]. Runoff denotes the fraction of precipitation or snowmelt that flows over the land surface and eventually confluences with streams, rivers, and other types of fluvial systems. River runoff constitutes a substantial replenishment source for the groundwater basin within the region. Accurate runoff forecasting is a vital engineering endeavor for mitigating the impacts of flood disasters. Improving the precision of runoff predictions remains a formidable challenge in the realm of water resources management research [3]. Various models have been created to predict runoff in Glacierized catchments. For example, the HBV (Hydrologiska Byråns Vattenbalansavdelning) model was utilized for studying the effect of climate change on the runoff and glacier changes in Yushugou Basin, China [4] and to calculate the snowmelt and its share to the overall streamflow in the Panchachuli glacier region of Central Himalaya [5].
Accurate runoff prediction is vital for managing water resources. It also allows for better flood and drought control, reservoir management, and hydropower generation, ultimately leading to more efficient water use [6]. In particular, monthly-scale runoff forecasting is essential for effectively monitoring natural hydrological disasters such as droughts and floods, as these forecasts enable the establishment of early warning systems [7–9]. Also, this allows necessary measures to be taken in advance, mitigating their socio-economic impacts as well as improving water resources management. In general, the models used for runoff prediction are mainly divided into two sections: Process driven, and Data driven models [10]. These models are also widely used for runoff estimation and prediction ([11–13]). Several studies have reported that process-driven models are primarily constructed based on an understanding of runoff development processes [14–16]. These models aim to derive physical parameters that can be used for simulation or prediction tasks. On the other hand, data-driven models solely focus on the input-output relationship without making explicit causal inferences [17]. Also, the process of driven models relies on linear and partial differential equations [18,19]. Runoff prediction methods are primarily categorized into process-driven and data-driven models. Process-driven models like Soil and Water Assessment Tool (SWAT) [20] and Hydrological Simulation Program-Fortran (HSPF) [21] simulate hydrological processes using physical principles. They provide interpretable results but often being computationally intensive and require lots of hydrological and climate variables. While some researchers point out that runoff prediction is a multifaceted hurdle [22], it is influenced by numerous uncertain factors, which include high complexity, non-linear factors, non-stationarity, and dynamism, hence making it difficult to accurately predict runoff by process-driven methods. Moreover, conceptual models require extensive hydrological data, which may not be available in certain regions. Additionally, the implementation of such models is often more expensive and time-consuming. Therefore, various researchers inferred the input-output relationship of data-driven models to be linear and hence predicted runoff by methods like an autoregressive moving average (ARMA) model [23], autoregressive integrated moving average (ARIMA) model [24] and autoregressive (AR) model [25]. Also, other researchers inferred the prediction result could be made better by taking into consideration the non-linear nature [26–28].
Moreover, a variety of other classical machine learning models such as artificial neural networks (ANN) [29], SVM [30], and Random Forest (RF) [31] have been applied for studying runoff prediction. Advanced models based on deep learning have also been developed for various activities involving prediction applications, which include Runoff-prediction [32], prediction of precipitation [33], properties and composition of soil [34], water table [35], and streamflow forecasting [36]. In recent years, machine learning and deep learning models have garnered significant interest from researchers in constructing highly computationally intensive and accurate forecasting systems. Some researchers compared three boosting models (Extreme Gradient Boosting (XGBoost), Light Gradient-Boosting Machine (Light GBM) & Categorical Boosting (CatBoost)) to forecast daily streamflow in mountainous catchment [37]. The study concluded that the gradient boosting algorithms used in models such as XGBoost, are simple to implement, fast, and robust.
It is important to note that both long short-term memory (LSTM), and XGBoost are commonly used to model several hydrological parameters (e.g., runoff) due to their efficiency in handling time series data [38–40]. The LSTM algorithm is highly effective for modeling time series data, as it captures both short- and long-term dependencies within sequential data. Furthermore, RF is a robust ensemble method that captures complex relationships in runoff data (e.g., reduces) overfitting by averaging across multiple decision trees [41–43]. Besides, the XGBoost model is a powerful gradient-boosting algorithm that excels at modeling non-linear relationships and performs exceptionally well in time series forecasting, particularly with lagged variables due to its advanced optimization and ability to handle structured data.
Some investigations constructed a hybrid model which is formulated by using extreme gradient boosting and Gaussian mixture model, which they named as GMM-XGBoost [29]. They applied this model to predict monthly streamflow at Cuntan & Hankou stations on Yangtze River and the result concluded that the XGBoost-based models performed well and are applicable for streamflow forecasting and are superior to SVM. In a similar work, [44] combined machine learning (ML) models XGBoost and SHAP to develop an explainable ML-based model (KXGBoost) and its results demonstrate a robust data-driven model for runoff prediction. Other scholars investigated the hybrid model created by integration of Fourier transform with LSTM to predict monthly runoff in Brahmani Rive, India [45]. The researchers found that the suggested model outperforms several classical and deep learning models. Some investigations have confirmed that classical ML models can be enhanced by incorporating bio-inspired algorithms and data preprocessing techniques [46–49], thereby improving the models’ capability in forecasting runoff. An advanced model such as LSTM with AM (Attention mechanism) was developed for estimating runoff in Hun River, China [50]. It was reported that the model presented high accuracy, indicating its potential in hydrological modeling. Moreover, other researchers [51] demonstrated the framework efficacy of daily edge-of-field (EOF) runoff prediction using the XGBoost algorithm and concluded that the effectiveness of the XGBoost model in runoff prediction is well-supported by various studies emphasizing its accuracy, robustness, and efficiency. Its capability to minimize overfitting while maintaining high predictive accuracy makes it an invaluable tool in hydrological modeling and other environmental prediction tasks. As this field keeps on growing, XGBoost is likely to remain a cornerstone model for researchers and practitioners aiming to enhance predictive accuracy and efficiency in runoff prediction.
Despite the advancement of various models for runoff forecasting conducting in previous research, a significant knowledge gap remains regarding the comprehensive comparative effectiveness of machine learning approaches such as XGBoost, RF, and advanced deep learning models like LSTM. Moreover, the comparison of deep learning models such as LSTM, XGBoost offers an easier interpretation of the significance of predictors. Also, previous research indicates that very few studies have focused on modeling runoff in the Lotschental catchment in Switzerland. A comprehensive study of such catchments is hydrologically significant due to their provision of essential freshwater through glacial melt, particularly in the warmer months. Such catchments are important indicators of climate change, influencing local hydrology and downstream water availability, and they also pose flood risks during melt periods, necessitating effective management strategies. In prior works, researchers interested in modeling runoff time series data have often overlooked the importance of analyzing how well these predictive models can capture turning points. The turning points are essential for understanding significant shifts in hydrological dynamics.
In our study, we propose using the XGBoost model to forecast monthly rainfall one month ahead in the Lotschental catchment, Switzerland, and compared its performance with other AI model such as LSTM and regression tree model like RF. Runoff generation in mountain regions is affected by cyclogenetic processes [52] and the development of mesoscale circulations [53], which makes the process of runoff prediction extremely challenging. Therefore, developing a reliable projection model for runoff prediction is a difficult task. In order to address this challenge of limited accuracy and reliability in runoff prediction, we developed and compared three state-of-the-art machine learning models to determine the most effective technique for runoff forecasting. Additionally, tuning points in the runoff time series have been detected using a new algorithm. These tuning points are very important in analyzing time series data, as they identify significant shifts in trends (peaks and troughs) within the runoff data, providing valuable insights into the system’s underlying dynamics. Thus, the efficiency of the applied forecasting models has been evaluated based on their ability to adequately capture these tuning points. By capitalizing the findings from these studies, this study aims to contribute to the existing knowledge on monthly runoff forecasting in glacierized catchments. Notably, this study is among the first to compare XGBoost, LSTM, and RF models for runoff prediction in glacier catchments, utilizing turning point analysis. The study aims to:
- 1) Develop models using XGBoost and LSTM for forecasting runoff, focusing on selecting appropriate features using partial autocorrelation function.
- 2) Compare the performance of XGBoost and LSTM with other ML models based on statistical evaluation metrics such as RMSE, MAE, R², and other metrics to assess their predictive accuracy and reliability.
- 3) Identify and analyze the turning points in monthly runoff time series data and assess the efficiency of forecasting models in accurately predicting these points.
- 4) Investigate the potential of XGBoost for forecasting runoff by analyzing its predictive capabilities, robustness, and scalability in different hydrological scenarios and datasets.
The comparative analysis between XGBoost and Deep Learning models will provide insights into the strengths and limitations of each approach, ultimately guiding decision-makers in selecting the most effective modeling technique for accurate and reliable runoff predictions in glacier-influenced watersheds.
To achieve the above-mentioned research objectives, four key research questions (RQs) have been developed:
- RQ1: How effective are XGBoost and LSTM models in forecasting runoff when using the Partial Autocorrelation Function (PACF) to select relevant features?
- RQ2: How does the performance of XGBoost and LSTM models compare to other machine learning models in terms of RMSE, MAE, and R² for runoff forecasting?
- RQ3: Can XGBoost and LSTM models accurately identify and predict turning points in monthly runoff time series data?
- RQ4: How robust and scalable are XGBoost models for runoff forecasting across various hydrological scenarios and datasets?
2. Case study location
The Lӧtschental catchment occupies a unique transitional zone between the Mediterranean climate of the Southern Alps and the maritime climate of the Northern Alps. Located at 46° 24’ 59.99“ N latitude and 7° 49’ 59.99” E longitude (See Fig 1), the catchment sits within the Jungfrau-Aletsch-Bietschhorn region of the Bernese Alps at an elevation of 2624 meters above sea level [54]. The Lonza River, fed by its primary tributaries, the Lang and Anun Glaciers, flows through the valley, with the glaciers themselves forming the eastern and northeastern boundaries of the Lӧtschental [55]. In recent years, the valley has experienced significant flooding due to increased discharge from mountain rivers in the Bernese Alps, highlighting the region’s susceptibility to climate change-induced floods [56]. Previous studies have documented substantial changes in and around the study area due to the climate change [57]. Furthermore, active fault lines underpinning the region give rise to a series of earthquakes. The faults detected in the Canton of Bern potentially influenced by localized stress conditions, pose a threat to regional stability and enhance the susceptibility of landslides [58]. Also, the 2005 floods drastically affected the region, due to severe destruction in 900 municipalities and complete infrastructure damage in others, as well as some documented displacements within the Canton of Bern [59]
The surging tributaries of the Lonza River have caused substantial erosion and debris flow, underscoring the critical need for accurate runoff forecasting to mitigate flood risks in this area. Daily precipitation and mean air temperature data were acquired from MeteoSwiss for the Blatten, Lötschental station located at 1538 meters above sea level. Corresponding daily runoff (Q) measurements for the Lonza-Blatten hydrometric station were obtained from the Swiss Federal Office for the Environment (FOEN). This station sits at 1520 meters and drains an upstream area of approximately 77 km², with 24.6% glacier coverage. Table 1 summarizes the key statistics of the meteorological and hydrometric data used, while Fig 2 presents a bar chart depicting the monthly distribution of measured runoff during the study period. It is evident that the applied case study experienced the highest runoff rates in June, July, and August, with values of 11.919 m3/sec, 12.854 m3/sec, and 11.126 m3/sec respectively. As shown in Fig 2, runoff values demonstrate a clear seasonal pattern, with the highest values recorded in July (likely due to snowmelt) and the lowest values occurring in February. Notably, the data regarding the mean monthly runoff used in this study for Lӧtschental station are provided in the S Appendix (S1 Table) [60].
3. Machine learning models
This study employs three advanced machine learning models: Long Short-Term Memory (LSTM) networks, Extreme Gradient Boosting (XGBoost), and Random Forest (RF) to forecast runoff in the Lotschental catchment in Switzerland. These models are particularly well-suited for runoff forecasting due to their ability to capture complex, nonlinear relationships within hydrological data, handle long-term dependencies, and mitigate overfitting.
The LSTM algorithm is well-suited for modeling time series data as it effectively captures both short- and long-term dependencies in sequential data. Furthermore, RF is a robust ensemble method that captures complex relationships in runoff data and reduces overfitting by averaging across multiple decision trees. Lastly, XGBoost is a powerful gradient-boosting algorithm that efficiently models non-linear relationships. It performs exceptionally well in time series forecasting, especially with lagged variables, due to its advanced optimization and ability to handle structured data.
3.1 LSTM
LSTM is a type of recurrent neural network that processes sequential data using memory cells within the hidden layer [61]. These cells control information flow through forget, input, and output gates (Fig 3). The forget gate discards irrelevant past information, the input gate selects new data to remember, and the output gate determines what processed information affects the next cell. This gating mechanism allows LSTMs to learn long-term dependencies in the sequential data. The LSTM’s gated configuration enables long-term information retention over multiple time steps, effectively mitigating the vanishing gradient problem common in classical RNN models [62]. Mathematically, the operation of a memory cell in an LSTM can be described as follows [63]:
Where, ,
and
represent forget gate, input gate, and output gate,
represents the input,
. is the cell state,
and tanh are the sigmoid and hyperbolic tangent activation functions,
,
,
represent the biases in the network and
,
and
are the corresponding weigh for the forget gate, input gate, output gate, and cell state respectively.
In this study, a deep Long Short-Term Memory (LSTM) neural network architecture was utilized, as depicted in Fig 3b. This architecture comprises an input layer, two LSTM layers (LSTM-1 and LSTM-2), a dense layer, and a lambda layer.
The initial LSTM layer (LSTM-1) incorporates numerous units and employs a Rectified Linear Unit (ReLU) activation function to generate a sequence output. Subsequently, the second LSTM layer (LSTM-2) utilizes several units and a ReLU activation function, but without producing a sequence. The dense layer, consisting of n units, employs a sigmoid activation function. Finally, a lambda layer used to scale the output of the dense layer.
3.2 RF
Random forest, a supervised machine learning approach, leverages ensemble learning to improve prediction accuracy. They combine numerous estimator decision trees, each making individual predictions [64] (Fig 4). Training utilizes bagging, where subsets of data are used to grow each tree. This technique, called bootstrap aggregation, reduces overfitting and improves model robustness [65]. In this study, we employ random forest regression, where each tree node considers a random subset of input variables. The final prediction is the average of all individual tree predictions [66]. As the number of trees increases, so does the overall prediction accuracy.
3.3 XGBoost
The XGBoost model employs the concept of boosting, which aggregates the predictions of multiple weak learners to construct a potent and strong ensemble learner through an additive training approach [67]. This modeling technique, unlike other ML models, is complex with many tunable parameters [68]. Strategically designed to reduce overfitting and prediction variability while enhancing overall prediction accuracy, the technique is widely used in hydrology [69] for various applications, including modeling groundwater salinity, [70] water quality [71], ice phenomena, stream temperatures [37], and runoff prediction [10].
The following outlines the general algorithm for XGBoost applied to regression tasks:
- i. Set
,
i in the training set (X, y).
- ii. For T = 1: N
- 1. Development of cost function
- Train a decision tree (
) to minimize the objective function (
) on the training data (X, R)
- Update O by incorporating a new tree, scaled by the shrinkage factor γ.
+
(x) (7)
- Residual update
- iii. The resulting boosted model is
Where, =
+
, l represents a convex loss function, that quantifies the difference between the predicted (
) and the target value (
),
is the Tth decision tree,
denotes the ith sample, R represents the residual, N and I represent the tree and leaf count,
are coefficients of regularization.
3.4 Model Construction
The monthly runoff in the Lotschental catchment, located in Switzerland, has been predicted using various AI-based models, including XGBoost, RF, and a deep learning algorithm called LSTM. This study utilized runoff data spanning twenty years, from January 2002 to December 2021. The majority of the data, constituting 70% and covering the years 2002–2015, was employed for training the models and optimizing their calibrations (See Fig 5). The remaining data, approximately from 2016 to 2022, was reserved for testing the models’ accuracies.
To identify the most suitable input data, partial autocorrelation was employed for selecting the best input lags. This method provides valuable insights into the time series properties, including stationarity, trend patterns, seasonality, and randomness [72]. The partial autocorrelation function (PACF) analysis, depicted in Fig 6 led to the establishment of three different input combinations, as illustrated in Table 2.
3.5 Statistical metrics
In this research, various statistical metrics commonly employed in the hydrological field were used to evaluate the forecasting reliability of each model [73]. These metrics include root mean square error (RMSE), mean absolute error (MAE), correlation coefficient and determination (R, and R2), Maximum absolute relative error (erMAX), Nash-Sutcliffe efficiency index (NSE), Willmot index (d), and uncertainty at a 95% confidence interval (U95) [74]. Also, the current research used a combined accuracy (CA) [75,76] which considered as efficient and global metrics, taking advantages of both error, and accuracy metrics. The mathematical equations for these metrics are provided in the following equations [77,78]:
Where for this study, ,
are the observed runoff at ith data point and forecasted monthly runoff (Q), n is the total data points. The means of observed and forecasted Q values are respectively,
,
.
3.6 Turning points in monthly runoff time series data
Turning points are essential in time series analysis as they indicate significant changes in the data’s trend or direction. They mark transitions from one phase to another, highlighting shifts in behavior, growth, or decline within the dataset. These points provide critical insights into major events, changes in underlying processes, and the rise of new trends. Identifying turning points, such as peaks, troughs, cycles, and anomalies, is crucial for decision-making. Since time series data exhibit substantial autocorrelation (See Fig 6), forecasting models often achieve high accuracy (R > 0.9) [79] and low forecasting errors, according to metrics like RMSE and MAE [80]. However, special attention should be devoted to accuracy, specifically at turning points when evaluating forecasting models.
This study employed the Average Absolute Error of Turning Points (AAETP) to assess the accuracy of forecasting models in predicting turning points within a runoff dataset. The AAETP is a performance metric that can be used in time series analysis, focusing on the models’ abilities to replicate significant shifts and turning points in the data. To identify turning points, the first and second derivatives of the original time series data were computed. The first derivative captures the rate of change or slope of the data, while the second derivative identifies the points where the sign of the first derivative changes, indicating turning points. By comparing the forecasted monthly runoff values with the actual observed values at the identified turning points, the average absolute difference was calculated independently for each model. For more details, please refer to [63] where the mathematical equation for AAETP can be found in Equation 18. The analysis of error magnitudes specifically at turning points provides valuable insights into the models’ ability to accurately capture data shifts and transitions, thereby improving the assessment of their overall forecasting capabilities.
Where, Ei represents the forecasting error for turning points. It is computed by subtracting the forecasted values of the critical points from their corresponding actual runoff values.
The theoretical optimum value for AAETP is zero, indicating that the model perfectly captures turning points. However, in practice, achieving a value of zero is very difficult. Therefore, it is essential to identify the minimum values that are closest to zero when comparing models. These values serve as indicators of the most effective models, facilitating a more reliable evaluation of how well they capture turning points
4 . Results and discussion
4.1 Forecasting runoff using AI-based models
This study aimed to develop runoff forecasting models for the Lotschental catchment for one-month ahead timescale through three particular techniques Random Forest (RF), Long short-term Memory (LSTM), and XGBoost. In order to determine the optimal number of input lags, a backward selection strategy called Partial Autocorrelation Function (PACF) was applied. This resulted in three different sets of input variables. During the training stage, rigorous evaluation of each model employed several statistical measures such as Mean Absolute error (MAE), Root Mean Square Error (RMSE), Willmot index (d), Correlation Coefficient (R), Maximum absolute relative error (erMAX) and Nash-Sutcliffe efficiency index (NSE), and Combined accuracy(CA) to represent forecasting of models, error measurement and the degree of correlation and proportion of variance between the independent variables and the dependent variable respectively (refer to Table 3). These parameters played a key role in assessing the relative performance of the models in predicting runoff for Lotschental catchment
This study utilized three forecasting models (RF, LSTM, and XGBoost) to forecast runoff one month ahead for Lotschental catchment. The input lags were selected using PACF, resulting in three different combinations. The performance of each forecasting model during the training phase was evaluated using various statistical metrics (refer to Table 3).
The results demonstrated that all the models exhibited good forecasting accuracy, with the second and third combinations appearing to be more suitable, although the second combination shows a lower forecasting error. For example, XGBoost presented a higher MAE of 0.937 m3/sec in Comb1, whereas this value was reduced to 0.752 m3/sec and 0.776 m3/sec in Comb2 and Comb3, respectively. Similarly, for LSTM, the MAE values for the three combinations were 1.357 m3/sec, 0.784 m3/sec, and 0.821 m3/sec during the training phase. RF, on the other hand, exhibited slightly better performance than XGBoost across all three combinations, with lower MAE values of 0.727 m3/sec, 0.613 m3/sec, and 0.582 m3/sec.
with lowest MAE and CA values of 0.727 m3/sec, 0.613 m3/sec, and 0.582 m³/sec, and 0.658, 0.572, and 0.540, respectively. However, XGBoost and RF showed similar accuracy parameters, such as R, during the training phase. It’s important to note that the training phase alone is not crucial for selecting the best model since the models are trained with available input and output runoff data. The testing phase, where the model is only provided with input data, determines the generalization capacity and efficiency of the model.
The hyperparameters of the applied models are selected using a trial-and-error method, with the objective function being RMSE. The models are trained on calibration data, and hyperparameters are chosen based on their performance in reducing RMSE. For example, the LSTM model is very sensitive to hidden units and dropout values, with hidden units ranging from 15 to 35 and dropout rates from 0.09 to 0.2. In the RF model, sensitivity was noted regarding the number of trees, with the optimum found to be 18 and the minimum number of samples required at a leaf node set at 5. Finally, for the XGBoost model, the best learning rate was determined to be 0.15, while the maximum tree depth was set to 7. S2 Table presents the hyperparameters for all models. Notably, the hyperparameters for each forecasting model used to forecast monthly runoff are listed in the S Appendix (S2 Table).
Table 4 presents the performance of the forecasting models during the testing phase. Based on statistical parameters, the combination labeled Comb2, which involves seven previous runoff lags, exhibits the most favorable scenario for forecasting models. Within this particular scenario, XGBoost demonstrates superior performance compared to the other models, yielding the lowest values for RMSE (1.554 m3/sec), MAE (0.976 m3/sec), erMax (1.080), and CA (0.871), as well as the highest values for d (0.972), NSE (0.797), and R (0.956). Additionally, the XGBoost model achieves higher prediction accuracy by reducing RMSE by 14.57% and 17.03% compared to the LSTM and RF models, respectively. Overall, the statistical results from Tables 3 and 4 indicate that the XGBoost model outperformed the other models for all three experimental combinations. The model was the one to have the best performance metrics throughout, followed by the LSTM network and the Random Forest (RF) model, which presented relatively below-par performance. It is important to note that the RF model is prone to overfitting, as it demonstrates excellent forecasting performance during the training phase but performs less effectively during the testing phase. These findings underscore the efficiency of XGBoost in forecasting runoff and its superior generalization capacity when compared to LSTM and RF.
To comprehensively assess the predictive accuracy of the applied model, it is vital to verify its generalization capacity and accuracy in forecasting runoff across different seasons. Such evaluation provides further insights into how a model performs in each season, allowing us to understand the significant differences in model performance under varying conditions. According to Table 5, XGBoost is the best performing model, offering predicted values that closely align with those measured over the seasonal period. Notably, XGBoost delivers the highest forecasting accuracy during the summer months (July to September). Additionally, the other models demonstrate very good forecasting accuracy from July to September. XGBoost has the lowest overall relative percentage error (4.15%), followed by LSTM (6.16%), while RF generates the lowest accuracy (7.07%).
4.2 Graphical assessment
This chapter aims to visually demonstrate the effectiveness of each model in simulating the runoff time series data for each month. The line graph presented in Fig 7(a, b, c) illustrates that, in general, the models have successfully forecasted the runoff from May 2016 to approximately March 2019, capturing the overall pattern of the measured values. However, during this period, only the RF model exhibits relatively poorer forecast performance in terms of peak values (refer to Fig 6c).
Panels a) XGBoost (Line Graph), a1) XGBoost (scatter Plot), b) LSTM (Line Graph), b1) LSTM (Scatter Plot), c) RF (Line Graph), c1) RF (Scatter Plot).
From March 2019 to December 2021, the efficiency of the models varies significantly due to the presence of several peak points and substantial changes in the time series data. The XGBoost model demonstrates the closest fit to the measured values, with its forecasted values closely aligning with the measured values throughout the time series, particularly during peak points. Similarly, the LSTM model shows a good fit, with its forecasted values generally following the measured values, although there are some instances where slight deviations occur (refer to Fig 6b). On the other hand, the RF model exhibits a slightly wider range of forecasted values compared to the other two models, suggesting that it is less precise in its forecasts (refer to Fig 7c). According to the figure, the highest positive peak value is 16.887 m³/sec, which is simulated very well by XGBoost and LSTM, with predicted values of 15.8861 m³/sec and 15.4421 m³/sec, respectively. However, RF exhibits lower prediction performance for these peak points, with a value of 15.3793 m³/sec. With regard to the lowest runoff point, which is measured at 0.444 m³/sec, only XGBoost simulates this point with reasonable accuracy at 0.6851 m³/sec, while the other models generate values ranging from 1 m³/sec to 1.68 m³/sec. The accurate simulation of both maximum and minimum runoff points demonstrates the adaptability of the XGBoost model in providing reliable forecasts under varying hydrological conditions, including both drought and flood events.
The scatter plots in Fig 6 a1, b1, and c1 illustrate the relationship between measured runoff and the forecasted values from the models in Comb2. The scatter plot for XGBoost (Fig 7 a1) demonstrates a strong positive correlation with a tight clustering of data points around the diagonal line, indicating accurate predictions (R2 = 0.901). The LSTM plot (Fig 6b1) shows slightly more spread data points compared to XGBoost but less than RF, suggesting some errors in predictions (R2 = 0.869). Similarly, the RF plot ((Fig 6 c1) exhibits a wider spread of data points, indicating deviations from the actual values (R2 = 0.861). Overall, The XGBoost model is the best-performing model, followed by LSTM and then RF.
Based on the results displayed in Fig 8, which presents the seasonal performance of each model, it is evident that all models exhibit varying accuracy across seasons, reflecting fluctuations in the runoff pattern over time. Overall, XGBoost demonstrates strong performance throughout all three seasonal periods, yielding the lowest absolute residuals of 0.325 m3/sec, 0.3288 m3/sec, and 0.1241 m3/sec, respectively. Notably, during the period from April to June, XGBoost consistently outperforms the other models. Overall, XGBoost consistently demonstrates the highest accuracy in modeling seasonal runoff patterns and effectively captures variations across various time periods.
A violin plot represents the distribution of numerical data across one or more groups through density curves. The width of each curve indicates the relative frequency of data points within specific intervals. As depicted in Fig 9, violin plot has been employed to facilitate a comprehensive comparison between the measured runoff and the forecasted runoff obtained from the applied models. In general, all models capture the overall trend and distribution of the runoff data. However, variations among the models become evident, particularly in the peak points. Notably, XGBoost demonstrates favorable forecasting performance for the peak runoff points when compared to the LSTM and RF models.
Moreover, the Taylor diagram has been used as a valuable tool to identify the most efficient model. These plots are mathematical constructs that graphically illustrate the relative accuracy of various model representations of a system, process, or phenomenon. The diagrams facilitate comparative analyses and provide a comprehensive visualization by summarizing the correlation, standard deviation ratio, and root mean square difference between the reference dataset (measured runoff) and multiple model simulations in a single plot. Fig 10 demonstrates that XGBoost exhibits the closest proximity to the reference points (measured runoff), followed by the LSTM and RF models. Finally, uncertainty analysis, specifically in terms of U95, was performed to determine which model produces the least uncertainty. According to Fig 11, XGBoost demonstrates the lowest U95 value of 1.231, which is lower than LSTM with 1.251 and RF with 1.259. Overall, the analysis of the Taylor plot, violin plot, and U95 indicates that XGBoost is the most effective model for predicting monthly runoff in the study area, demonstrating the lowest level of uncertainty.
4.3 Analyzing forecasting models with turning point detection
This section of the paper focused on the assessment focused on the applied model’s efficiency in detecting runoff turning points. Fig 12a visually depicts how the algorithm identifies turning points in the time series runoff data. The graph illustrates a cyclical pattern in the time series runoff data, with clearly defined peaks and valleys. The turning points denoted on the graph highlight substantial changes in the data. It is important to note that the graph enables an assessment of the accuracy of the applied models in predicting these critical turning points.
a) illustration of the algorithm’s capability in detecting turning points in the runoff time series data. b) calculated results of AAETP.
The bar chart in Fig 12b showcases the accuracy of each applied model in forecasting the identified turning points. The comparison of AAETP values reveals that XGboost performs the best among the models, with an AAETP of 1.579 m3/sec. In contrast, the LSTM model has an AAETP of 1.924 m3/sec, and the RF model has an AAETP of 2.119 m3/sec. This indicates that XGboost outperforms LSTM and RF in accurately forecasting the turning points of runoff. Specifically, the accuracy of turning point forecast improves by approximately 21% and 34% compared to LSTM and RF models, respectively.
4.4 Discussion
The current study investigated the effectiveness of three different models, namely LSTM, RF, and XGBoost, in forecasting monthly runoff for the Lotschental catchment in Switzerland. Among these models, XGBoost demonstrated superior accuracy, achieving a forecasting accuracy of R = 0.956. These findings contribute to advancing the accuracy of previous studies in this field. In a similar vein, some researchers [81] have employed the hybrid model ELM-GWO, which combines the extreme learning machine (ELM) with the grey wolf optimizer (GWO), to forecast runoff in the Three Gorges region of China. During the testing phase, the hybrid model (ELM-GWO) exhibited good accuracy, achieving an R-value of 0.9228. Another study explored the application of the Emotional Artificial Neural Network (EANN) for monthly runoff forecasting in the Kallada River basin, located in Kerala [82]. The findings of this research demonstrated that the EANN model yielded satisfactory results, achieving a forecasting accuracy with an R-value of 0.877. Furthermore, machine learning models have also been utilized in the semi-arid Central Andes of Argentina to predict the runoff [83]. The study concluded that support vector regression (SVR) demonstrated effective forecasting of the runoff, achieving R2 values ranging from 0.75 to 0.89 during the testing phase. Moreover, the XGBoost model developed in this study demonstrates good accuracy compared to similar studies conducted to forecast runoff in glacierized catchments. For example, some researchers conducted a study forecasting runoff using a conceptual hydrological model called Glacial Snow Melt (GSM) and SVR based on several hydrological and meteorological parameters [84]. The results showed that the GSM achieved a good accuracy with an R2 of 0.77, while the standalone SVR reached an R2 of 0.83. Similarly, another study developed a conceptual hydrological model for modeling runoff in glacierized alpine areas and found that the model provided good accuracy, with R2 values ranging from 0.74 to 0.89 [85]. Among the models examined in the literature, XGBoost demonstrated superior forecasting accuracy compared to others in forecasting monthly runoff for the Lotschental catchment, highlighting its effectiveness in this context. Furthermore, it is important to highlight that the forecasting models developed in previous research did not include turning point analysis. However, this research demonstrates that such a metric is highly significant in evaluating and comparing models.
As the XGBoost model in this study provides good forecasting accuracy for modeling monthly runoff compared to previous works, it may also be suitable for forecasting runoff over other time scales, such as daily and weekly. In shorter time scales (daily and weekly), the autocorrelation in time series data is typically higher than that in monthly scales, which may increase the opportunity for achieving better results. Additionally, in shorter time scales, the number of data points is greater than in the monthly scale, which may help models to be trained effectively with sufficient data.
In this work, a deep learning model called LSTM is used for modeling time series data because of its ability to remember past events and capture patterns over time. While LSTM addresses the vanishing gradient problem faced by traditional models like recurrent neural network (RNNs) [86], it did not outperform XBGoost in modeling runoff. This is largely because LSTM requires a significant amount of data to train effectively and may need additional features to capture complex temporal dependencies in the runoff data. The results of the current study indicate that deep learning model (LSTM) were not superior to machine learning models (XGBoost). Thus, the findings of this research align with another study [87] on runoff prediction that found deep learning models gave an acceptable result but less accurate than their machine learning counterparts.
The selection of training and testing samples for model training and validation is a crucial step in developing predictive models. In this study, runoff measurements from 2002 to 2015 are utilized for model development, while data from 2016 to 2021 serve to assess the accuracy of the model. Ensuring balanced data and minimizing bias between the training and testing sets play a significant role in creating a robust model with strong generalization capacity [88]. Fig 13 summarizes the key statistical metrics for both datasets. Notably, there is a strong similarity in the training characteristics. For instance, the minimum, maximum, average, and standard deviation values for the training and testing datasets are as follows: 0.461 m3/sec vs. 0.444 m3/sec, 16.778 m3/sec vs. 16.887 m3/sec, 4.5328 m3/sec vs. 4.962 m3/sec, and 4.5997 m3/sec vs. 5.0549 m3/sec. These statistics indicate that both datasets are homogeneous and well-balanced, further supporting the validity of the model.
The choice of input lag combinations plays a critical role in time series forecasting (e.g., runoff) as it can significantly affect the performance of forecasting models such as LSTM, XGBoost, and RF. Also, using various combinations of input lags helps to evaluate their impact on the performance of prediction models. In this work, three combinations are used based on PACF. The first combination involves only three input variables, while the second includes seven lags, and the last one involves a large number of variables, up to ten. The performance of the models varies from one combination to another. The models attempt to capture the overall temporal dependencies in the testing data and require optimum input choices. It can be seen that none of the models provided good prediction accuracy using the first combination due to the lack of significant information needed to effectively capture the patterns of the time series data. In this case, XGBoost achieved the lowest forecasting error with MAE of 1.252 m3/sec, followed by RF (1.351 m3/sec) and LSTM (1.560 m3/sec). Regarding the second input combination, all models presented significant improvement, with XGBoost providing an MAE of 0.976 m3/sec, followed by RF (1.063 m3/sec) and LSTM (1.065 m3/sec). However, the forecasting accuracy of all the models deteriorated with the third input combination, with the MAE for XGBoost, RF, and LSTM being higher than those in the second combination. The main reason may be related to the introduction of a large number of features, which could lead to redundancy in the data and significantly affect the generalization of the models, complicating the training process. Once the input combination contained fewer variables, as seen in the first scenario, the models exhibited lower prediction capacity because the available inputs did not contain significant information that could be mapped by the models to mimic the patterns of the time series data and effectively capture the temporal dependencies in the runoff.
The melting of snow and glaciers is increasingly important for runoff generation due to the impacts of climate change and global warming. Enhancing the accuracy of runoff forecasting models offers tangible benefits for managing water resources in cold regions affected by such environmental changes. This is particularly crucial for the effective allocation and conservation of water resources. In our study, we found that traditional models had limited effectiveness in forecasting peak flows, which are critical for water management and flood mitigation in climate-affected regions. Conversely, the XGBoost model demonstrated notable improvements, producing the most accurate runoff simulations. The model’s performance metrics, including RMSE of 1.554 m3/s, MAE of 0.976 m3/s, U95 of 1.231, R2 of 0.91, and d of 0.972, reflect its proficiency. Besides, the model exhibited superior performance in forecasting runoff turning points, achieving the lowest AAAET (1.579 m3/sec) among the compared models. These results underscore the potential of XGBoost models in areas with glacierized catchments, which are particularly vulnerable to rapid climate change and glacial melt. The study offers evidence that the proposed model adeptly handles the complex, nonlinear interplay between historical and projected runoff data, thus enabling more reliable water resource management in these challenging environments.
Furthermore, as shown in Fig 14, the model demonstrates excellent monthly runoff performance across nearly all months, generating minimal errors as indicated by the RMSE values. The lowest forecasting errors occurred in January and December, with RMSE values of 0.068 m3/sec and 0.124 m3/sec, respectively. In contrast, the highest forecasting error was recorded in June, with an RMSE of 3.354 m3/sec. The increase in forecasting error can be attributed to the instability and significant variability of the flow during this month. As shown in Fig 2, this period corresponds to the highest runoff into the basin, which leads to greater discrepancies in the model’s predictions and subsequently higher forecasting errors.
5. Conclusion
The current study provides a comprehensive comparison between LSTM, XGBoost, and RF models for forecasting mean monthly runoff one month in advance. This study was conducted in the Lötschental catchment, Switzerland, which is classified as a glacierized catchment. The glacierized catchments are highly sensitive to climate change, where surface runoff undergoes dynamic hydrological processes influenced by climatic conditions and the formation and melting of ice. Historical data spanning from January 2002 to December 2021 was utilized, with input lag values determined using PACF and supplied to the runoff forecasting models. The data was divided into two phases: a training phase, which included runoff data from 2002 to 2015, and a testing phase, which used data from 2016 to 2021. The results indicated that XGBoost provided superior forecasting results compared to the other models, enhancing forecasting accuracy by reducing RMSE by 14.57% and 17.03% compared to the LSTM and RF models, respectively.
The current study also analyzes the capacity of the applied models to accurately forecast turning points in runoff data. Since the time series data of monthly runoff records contain significant autocorrelation, most AI models tend to provide in general good forecasts according to classical statistical metrics. Therefore, a new statistical criterion has been introduced in this research called AAETP. The quantitative results indicate that the XGBoost model was the most effective in accurately identifying turning points in the runoff time series data, outperforming the LSTM and RF models by approximately 21% and 34%, respectively. The effectiveness of the XGBoost model in capturing the complex runoff dynamics of glacierized catchments may have practical implications for water resource management decision-making. Also, its accurate forecasts may provide decision-makers with valuable insights to optimize strategies, allocate resources, and promote sustainable practices in response to changing hydrological conditions.
However, the proposed study still has some major constraints. Models’ limitation to predict peak values of monthly runoff concentrations with high accuracy will be improved by using data decomposition techniques such as reliable wavelet transform techniques. Future studies could also investigate the potential of incorporating ensemble and hybrid models, such as CNN-LSTM (combining Convolutional Neural Network with LSTM), ELM-MHA (combining metaheuristic algorithms with ELM) and other hybrid techniques for predicting monthly runoff rates. Also, it is recommended to incorporate longer runoff and climate datasets from observed stations or satellites to enhance runoff forecasting and better understand the impacts of climate change.
Supporting information
S1 Table. Monthly runoff data that is used in this study.
https://doi.org/10.1371/journal.pone.0321008.s001
(DOCX)
S2 Table. Hyperparameters for the applied models.
https://doi.org/10.1371/journal.pone.0321008.s002
(DOCX)
References
- 1. Zhang S, Kang Y, Gao X, Chen P, Cheng X, Song S, et al. Optimal reservoir operation and risk analysis of agriculture water supply considering encounter uncertainty of precipitation in irrigation area and runoff from upstream. Agric Water Manag 2023;277:108091.
- 2. Guo S, Wen Y, Zhang X, Chen H. Runoff prediction of lower Yellow River based on CEEMDAN-LSSVM-GM(1,1) model. Sci Rep. 2023;13(1):1511. pmid:36707680
- 3. Wu J, Wang Z, Hu Y, Tao S, Dong J. Runoff forecasting using convolutional neural networks and optimized bi-directional long short-term memory. Water Resour Manag. n.d.;37:937–53.
- 4. Zhou P, Zhang H, Li Z. Impact of climate change on the glacier and runoff of a glacierized basin in Harlik Mountain, Eastern Tianshan Mountains. Remote Sens. 2022;14(14):3497.
- 5. Rautela KS, Kuniyal JC, Alam MA, Bhoj AS, Kanwar N. Assessment of daily streamflow, sediment fluxes, and erosion rate of a pro-glacial stream basin, Central Himalaya, Uttarakhand. Water Air Soil Pollut. 2022;233:136.
- 6. Zhang X, Wang X, Li H, Sun S, Liu F. Monthly runoff prediction based on a coupled VMD-SSA-BiLSTM model. Sci Rep. 2023;13(1):13149. pmid:37573389
- 7. Hameed MM, Mohd Razali S, Wan Mohtar W, Yaseen ZM. Improving multi-month hydrological drought forecasting in a tropical region using hybridized extreme learning machine model with Beluga Whale Optimization algorithm. Stoch Environ Res Risk Assess. 2023;37:4963–89.
- 8. Gan R, Gu S, Tong X, Lu J, Tang H. A nonparametric standardized runoff index for characterizing hydrological drought in the Shaying River Basin, China. Nat Hazards. 2023;120(3):2233–53.
- 9. Wang WC, Gu M, Li Z, Hong YH, Zang HF, Xu DM. A stacking ensemble machine learning model for improving monthly runoff prediction. Earth Sci Inform. 2025;18(1):1–27.
- 10. Kratzert F, Klotz D, Brenner C, Schulz K, Herrnegger M. Rainfall–runoff modelling using Long Short-Term Memory (LSTM) networks. Hydrol Earth Syst Sci. 2018;22(11):6005–22.
- 11. Morales Y, Querales M, Rosas H, Allende-Cid H, Salas R. A self-identification Neuro-Fuzzy inference framework for modeling rainfall-runoff in a Chilean watershed. J Hydrol. 2021;594:125910.
- 12. Bartoletti N, Casagli F, Marsili-Libelli S, Nardi A, Palandri L. Data-driven rainfall/runoff modelling based on a neuro-fuzzy inference system. Environ Model Softw. 2018;106:35–47.
- 13. Safari MJS, Rahimzadeh Arashloo S, Danandeh Mehr A. Rainfall-runoff modeling through regression in the reproducing kernel Hilbert space algorithm. J Hydrol. 2020;587:125014.
- 14. Han H, Morrison RR. Data-driven approaches for runoff prediction using distributed data. Stoch Environ Res Risk Assess. 2021;36(8):2153–71.
- 15. Bi X, Li B, Lu W, Zhou X. Daily runoff forecasting based on data-augmented neural network model. J Hydroinform. 2020;22(4):900–15.
- 16. Vilaseca F, Castro A, Chreties C, Gorgoglione A. Daily rainfall-runoff modeling at watershed scale: A comparison between physically-based and data-driven models. Comput Sci Its Appl – ICCSA 2021. 2021:18–33.
- 17. Özdoğan-Sarıkoç G, Dadaser-Celik F. Physically based vs. data-driven models for streamflow and reservoir volume prediction at a data-scarce semi-arid basin. Environ Sci Pollut Res Int. 2024;31(27):39098–119. pmid:38811456
- 18. Partington D, Brunner P, Simmons CT, Werner AD, Therrien R, Maier HR, et al. Evaluation of outputs from automated baseflow separation methods against simulated baseflow from a physically based, surface water-groundwater flow model. J Hydrol. 2012;458–459:28–39.
- 19. Bittelli M, Tomei F, Pistocchi A, Flury M, Boll J, Brooks ES, et al. Development and testing of a physically based, three-dimensional model of surface and subsurface hydrology. Adv Water Resour. 2010;33(1):106–22.
- 20. Al-Kubaisi MHD. Surface runoff estimation in kubaisa watershed using SWAT, Western Desert, Iraq. IGJ. 2024;57(2B):286–97.
- 21. Javan K, Lialestani MRFH, Nejadhossein M. A comparison of ANN and HSPF models for runoff simulation in Gharehsoo River watershed, Iran. Model Earth Syst Environ. 2015;1(4).
- 22. Yoon H, Jun S-C, Hyun Y, Bae G-O, Lee K-K. A comparative study of artificial neural networks and support vector machines for predicting groundwater levels in a coastal aquifer. J Hydrol. 2011;396(1–2):128–38.
- 23. Mehdizadeh S, Fathian F, Adamowski JF. Hybrid artificial intelligence-time series models for monthly streamflow modeling. Appl Soft Comput. 2019;80:873–87.
- 24. Valipour M. Long-term runoff study using SARIMA and ARIMA models in the United States. Meteorol Appl. 2015;22(4):592–8.
- 25. Pulukuri S, Keesara VR, Deva P. Flow forecasting in a watershed using autoregressive updating model. Water Resour Manag. 2018;32(1):2701–16.
- 26. Vishwakarma DK, Kumar P, Yadav KK, Ali R, Markuna S, Chauhan S, et al. Evaluation of CatBoost method for predicting weekly pan evaporation in subtropical and sub-humid regions. Pure Appl Geophys. 2024;181:719–47.
- 27. Elbeltagi A, Al-Mukhtar M, Kushwaha NL, Al-Ansari N, Vishwakarma DK. Forecasting monthly pan evaporation using hybrid additive regression and data-driven models in a semi-arid environment. Appl Water Sci. 2022;13:42.
- 28. Amiri E. Forecasting daily river flows using nonlinear time series models. J Hydrol. 2015;527:1054–72.
- 29. Ni L, Wang D, Wu J, Wang Y, Tao Y, Zhang J, et al. Streamflow forecasting using extreme gradient boosting model coupled with Gaussian mixture model. J Hydrol. 2020;586:124901.
- 30. Feng Z, Niu W, Tang Z, Jiang Z, Xu Y, Liu Y, et al. Monthly runoff time series prediction by variational mode decomposition and support vector machine based on quantum-behaved particle swarm optimization. J Hydrol. 2020;583:124627.
- 31. Behrouz MS, Yazdi MN, Sample DJ. Using Random Forest, a machine learning approach to predict nitrogen, phosphorus, and sediment event mean concentrations in urban runoff. J Environ Manage. 2022;317:115412. pmid:35649331
- 32. Yilmaz AG, Muttil N. Runoff estimation by machine learning methods and application to the Euphrates Basin in Turkey. J Hydrol Eng. 2014;19:1015–25.
- 33. Lin GF, Jhong BC, Chang CC. Development of an effective data-driven model for hourly typhoon rainfall forecasting. J Hydrol. 2013;495:52–63.
- 34. Feng Y, Cui N, Hao W, Gao L, Gong D. Estimation of soil temperature from meteorological data using different machine learning models. Geoderma. 2019;338:67–77.
- 35. Sahoo S, Russo TA, Elliott J, Foster I. Machine learning algorithms for modeling groundwater level changes in agricultural regions of the U.S. Water Resour Res. 2017;53(5):3878–95.
- 36. Le X-H, Nguyen D-H, Jung S, Yeon M, Lee G. Comparison of deep learning techniques for river streamflow forecasting. IEEE Access. 2021;9:71805–20.
- 37. Szczepanek R. Daily streamflow forecasting in mountainous catchment using XGBoost, LightGBM and CatBoost. Hydrology. 2022;9(12):226.
- 38. Wang S, Peng H. Multiple spatio-temporal scale runoff forecasting and driving mechanism exploration by K-means optimized XGBoost and SHAP. J Hydrol. 2024;630:130650.
- 39. Guo J, Liu Y, Zou Q, Ye L, Zhu S, Zhang H. Study on optimization and combination strategy of multiple daily runoff prediction models coupled with physical mechanism and LSTM. J Hydrol. 2023;624:129969.
- 40. Xu Y, Lin K, Hu C, Wang S, Wu Q, Zhang J, et al. Interpretable machine learning on large samples for supporting runoff estimation in ungauged basins. J Hydrol. 2024;639:131598.
- 41. Alomar MK, Khaleel F, Aljumaily MM, Masood A, Razali SFM, AlSaadi MA, et al. Data-driven models for atmospheric air temperature forecasting at a continental climate region. PLoS One. 2022;17(11):e0277079. pmid:36327280
- 42. Döll P, Abbasi M, Messager ML, Trautmann T, Lehner B, Lamouroux N. Streamflow intermittence in Europe: Estimating high‐resolution monthly time series by downscaling of simulated runoff and random forest modeling. Water Resour Res. 2024;60(8).
- 43. Zema DA, Parhizkar M, Plaza-Alvarez PA, Xu X, Lucas-Borja ME. Using random forest and multiple-regression models to predict changes in surface runoff and soil erosion after prescribed fire. Model Earth Syst Environ. 2023;10(1):1215–28.
- 44. Wang S, Peng H, Hu Q, Jiang M. Analysis of runoff generation driving factors based on hydrological model and interpretable machine learning method. J Hydrol: Reg Stud. 2022;42:101139.
- 45. Swagatika S, Paul JC, Sahoo BB, Gupta SK, Singh PK. Improving the forecasting accuracy of monthly runoff time series of the Brahmani River in India using a hybrid deep learning model. J Water Clim Change. 2023;15(1):139–56.
- 46. Guo H, Chen L, Fang Y, Zhang S. Model and application of annual river runoff prediction based on complementary set empirical mode decomposition combined with particle swarm optimization adaptive neuro-fuzzy system. Water Supply. 2023;23(5):1760–74.
- 47. Kumar K, Singh V, Roshni T. Application of the PSO–Neural network in rainfall–runoff modeling. Water Pract Technol. 2022;18(1):16–26.
- 48. Katipoğlu OM, Sarıgöl M. Improving the accuracy of rainfall-runoff relationship estimation using signal processing techniques, bio-inspired swarm intelligence and artificial intelligence algorithms. Earth Sci Inform. 2023;16(4):3125–41.
- 49. Katipoğlu OM, Yeşilyurt SN, Dalkılıç HY, Akar F. Application of empirical mode decomposition, particle swarm optimization, and support vector machine methods to predict stream flows. Environ Monit Assess. 2023;195(9).
- 50. Wang Y, Wang W, Xu D, Zhao Y, Zang H. A compound approach for ten-day runoff prediction by coupling wavelet denoising, attention mechanism, and LSTM based on GPU parallel acceleration technology. Earth Sci Inform. 2024;17(2):1281–99.
- 51. Hu Y, Ghosh C, Malakpour-Estalaki S. A methodological framework for improving the performance of data-driven models: A case study for daily runoff prediction in the Maumee domain, USA. Geosci Model Dev. 2023;16(7):1925–36.
- 52.
Mesinger F, Pierrehumbert R. Alpine lee cyclogenesis: Numerical simulation and theory. WMO Proceedings of the Conference on the Scientific Results of the Alpine Experiment(ALPEX), 1986. (ALPEX). 1986.
- 53.
Wallén CC. Climates of northern and western Europe. (No Title). 1970.
- 54. Rößler O, Löffler J. Analyzing spatio-temporal hydrological processes and related gradients to improve hydrological modeling in high mountains. Lecture Notes in Earth Sciences. 2010;115: 243–256.
- 55. Leys M, Keller I, Robinson CT, Räsänen K. Cryptic lineages of a common alpine mayfly show strong life-history divergence. Mol Ecol. 2017;26(6):1670–86. pmid:28099770
- 56. Rössler O, Froidevaux P, Börst U, Rickli R, Martius O, Weingartner R. Retrospective analysis of a nonforecasted rain-on-snow flood in the Alps – A matter of model limitations or unpredictable nature?. Hydrol Earth Syst Sci. 2014;18(6):2265–85.
- 57.
Frey E, Prof S, Zischg A, Prof A, Wunderle S. Methodological framework for long-term satellite-based forest monitoring in the canton of Bern: Combining geospatial forest data with 35 years of Landsat 5 and 7 imagery products. 2024.
- 58. Diehl T, Deichmann N, Clinton J, Kästli P, Cauzzi C, Kraft T, et al. Earthquakes in Switzerland and surrounding regions during 2014. Swiss J Geosci. 2015;108(2–3):425–43.
- 59.
Synthesis report on the event analysis the floods of 2005 in Switzerland. 2005:1–50.
- 60. Mohammadi B, Vazifehkhah S, Duan Z. A conceptual metaheuristic-based framework for improving runoff time series simulation in glacierized catchments. Eng Appl Artif Intell. 2024;127:107302.
- 61. Moghar A, Hamiche M. Stock market prediction using LSTM recurrent neural network. Procedia Comput Sci. 2020;170:1168–73.
- 62. Landi F, Baraldi L, Cornia M, Cucchiara R. Working memory connections for LSTM. Neural Netw. 2021;144:334–41. pmid:34547671
- 63. Hameed MM, Mohd Razali SF, Wan Mohtar WHM, Ahmad Alsaydalani MO, Yaseen ZM. Deep learning versus hybrid regularized extreme learning machine for multi-month drought forecasting: A comparative study and trend analysis in tropical region. Heliyon. 2023;10(1):e22942. pmid:38187234
- 64. Rokach L. Decision forest: Twenty years of research. Inf Fusion. 2016;27:111–25.
- 65. Jollans L, Boyle R, Artiges E, Banaschewski T, Desrivières S, Grigis A, et al. Quantifying performance of machine learning methods for neuroimaging data. Neuroimage. 2019;199:351–65. pmid:31173905
- 66. Hameed MM, Mohd Razali SF, Wan Mohtar WHM, Yaseen ZM. Examining optimized machine learning models for accurate multi-month drought forecasting: A representative case study in the USA. Environ Sci Pollut Res Int. 2024;31(39):52060–85. pmid:39134798
- 67. Fan J, Wu L, Zhang F, Cai H, Zeng W, Wang X, et al. Empirical and machine learning models for predicting daily global solar radiation from sunshine duration: A review and case study in China. Renew Sustain Energy Rev. 2019;100:186–212.
- 68. Masood A, Hameed MM, Srivastava A, Pham QB, Ahmad K, Razali SFM, et al. Improving PM2.5 prediction in New Delhi using a hybrid extreme learning machine coupled with snake optimization algorithm. Sci Rep. 2023;13(1):21057. pmid:38030733
- 69. Zounemat-Kermani M, Batelaan O, Fadaee M, Hinkelmann R. Ensemble machine learning paradigms in hydrology: A review. J Hydrol. 2021;598:126266.
- 70. Sahour H, Gholami V, Vazifedan M. A comparative analysis of statistical and machine learning techniques for mapping the spatial distribution of groundwater salinity in a coastal aquifer. J Hydrol. 2020;591:125321.
- 71. Lu H, Ma X. Hybrid decision tree-based machine learning models for short-term water quality prediction. Chemosphere. 2020;249:126169. pmid:32078849
- 72. Alomar MK, Khaleel F, Aljumaily MM, Masood A, Razali SFM, AlSaadi MA, et al. Data-driven models for atmospheric air temperature forecasting at a continental climate region. PLoS One. 2022;17(11):e0277079. pmid:36327280
- 73. Hameed MM, AlOmar MK, Al-Saadi AAA, AlSaadi MA. Inflow forecasting using regularized extreme learning machine: Haditha reservoir chosen as case study. Stoch Environ Res Risk Assess. 2022;36(12):4201–21.
- 74. Mamat N, Mohd Razali SF. Comparisons of various imputation methods for incomplete water quality data: A case study of The Langat River, Malaysia. jkukm. 2023;35(1):191–201.
- 75. Adnan RM, Liang Z, Trajkovic S, Zounemat-Kermani M, Li B, Kisi O. Daily streamflow prediction using optimally pruned extreme learning machine. J Hydrol. 2019;577:123981.
- 76. Eray O, Mert C, Kisi O. Comparison of multi-gene genetic programming and dynamic evolving neural-fuzzy inference system in modeling pan evaporation. Hydrol Res. 2017;49(4):1221–33.
- 77. Hameed MM, Khaleel F, AlOmar MK, Mohd Razali SF, AlSaadi MA. Optimising the selection of input variables to increase the predicting accuracy of shear strength for deep beams. Complexity. 2022;2022(1).
- 78. Hameed MM, Abed MA, Al-Ansari N, Alomar MK. Predicting compressive strength of concrete containing industrial waste materials: Novel and hybrid machine learning model. Adv Civ Eng. 2022;2022(1).
- 79. Mahmood OA, Sulaiman SO, Al-Jumeily D. Forecasting for Haditha reservoir inflow in the West of Iraq using Support Vector Machine (SVM). PLoS One. 2024;19(9):e0308266. pmid:39240996
- 80. Hameed MM, Razali SFM, Mohtar WHMW, Rahman NA, Yaseen ZM. Machine learning models development for accurate multi-months ahead drought forecasting: Case study of the Great Lakes, North America. PLoS One. 2023;18(10):e0290891. pmid:37906556
- 81. Cheng X, Feng Z-K, Niu W-J. Forecasting monthly runoff time series by single-layer feedforward artificial neural network and grey wolf optimizer. IEEE Access. 2020;8:157346–55.
- 82. Reddy BSN, Pramada SK, Roshni T. Monthly surface runoff prediction using artificial intelligence: A study from a tropical climate river basin. J Earth Syst Sci. 2021;130(1).
- 83. Korsic SAT, Notarnicola C, Quirno MU, Cara L. Assessing a data-driven approach for monthly runoff prediction in a mountain basin of the Central Andes of Argentina. Environ Chall. 2023;10:100680.
- 84. Mohammadi B, Vazifehkhah S, Duan Z. A conceptual metaheuristic-based framework for improving runoff time series simulation in glacierized catchments. Eng Appl Artif Intell. 2024;127:107302.
- 85. Koboltschnig GR, Schöner W, Zappa M, Kroisleitner C, Holzmann H. Runoff modelling of the glacierized Alpine Upper Salzach basin (Austria): Multi‐criteria result validation. Hydrol Process. 2008;22(19):3950–64.
- 86. Li Y, Zhu Z, Kong D, Han H, Zhao Y. EA-LSTM: Evolutionary attention-based LSTM for time series prediction. Knowl-Based Syst. 2019;181:104785.
- 87. Yan L, Lei Q, Jiang C, Yan P, Ren Z, Liu B. Climate-informed monthly runoff prediction model using machine learning and feature importance analysis. Front Environ Sci. 2022;10:1049840.
- 88. Hameed MM, Masood A, Srivastava A, Abd Rahman N, Mohd Razali SF, Salem A, et al. Investigating a hybrid extreme learning machine coupled with Dingo Optimization Algorithm for modeling liquefaction triggering in sand-silt mixtures. Sci Rep. 2024;14(1):10799. pmid:38734717