Skip to main content
Advertisement
Browse Subject Areas
?

Click through the PLOS taxonomy to find articles in your field.

For more information about PLOS Subject Areas, click here.

  • Loading metrics

Application of seasonal-adjusted hybrid models for forecasting Discomfort Index in a heat-prone region of Bangladesh

Abstract

Extreme heat and humidity pose an increasing threat to human health, labor productivity, and overall well-being, particularly in heat-vulnerable and rapidly urbanizing regions such as Rajshahi, Bangladesh. As rising temperatures and elevated humidity levels intensify exposure to heat stress, accurate forecasting has become essential for effective early warning systems and climate-resilient urban planning. However, modeling thermal discomfort is challenging due to the need to analyze long-term, high-frequency meteorological time series data with complex seasonal and nonlinear structures. Therefore, this study applies and evaluates seasonal-adjusted, machine-learning-based hybrid models to forecast thermal discomfort using 40 years (1985–2024) of daily temperature and humidity data. The Thom’s Discomfort Index (DI), a standard measure of thermal stress, was calculated and decomposed using the STL (Seasonal-Trend decomposition based on LOESS) method to separate trend, seasonality, and residual components. A total of 128 hybrid model combinations were implemented by integrating traditional time series models (ARIMA, TBATS, ETS, and GARCH) with machine learning techniques (ANN, Prophet, SVR, Decision Trees, Random Forests, XGBoost, LSTM, and GRU). Among all models, the STL-TBATS-LSTM hybrid achieved the best performance, with MAE = 0.4810, MAPE = 2.1230, RMSE = 0.6381, and MASE = 0.6644, followed closely by STL-TBATS-DTR. Historical analysis from 1985 to 2024 revealed strong seasonal peaks in discomfort from June to August, along with a clear long-term increase in both the frequency and intensity of high-discomfort days. Forecasts for 2025–2027 project a substantial rise in thermal stress, with approximately 39.8% of days falling under “High Discomfort” and 1.2% under “Severe Discomfort.” These findings highlight the escalating burden of heat stress in Bangladesh and underscore the urgency of STL-based hybrid forecasting models in supporting climate adaptation strategies and enhancing public health preparedness.

1. Introduction

Climate and prevailing weather conditions have a direct impact on human comfort [1]. Environmental factors play a key role in regulating the heat exchange of the body with its surroundings and can contribute to the sensations of heat-related stress [2]. Understanding local thermal discomfort conditions is essential due to their strong association with increased morbidity and mortality rates [36]. Prolonged exposure to high levels of thermal discomfort can lead to a higher incidence of heat-related health issues and fatalities [7]. Various biometeorological indices quantify thermal discomfort by considering environmental factors like air temperature, relative humidity, and wind speed to assess human heat stress [810]. Solar radiation and surface temperatures further affect thermal perception [11]. However, individual experiences of discomfort can vary based on physiological, psychological, and behavioral factors [12].

Among the various indices, Thom’s Discomfort Index (DI) has emerged as a widely adopted measure due to its simplicity and dependence on two readily available meteorological variables: air temperature and relative humidity [13]. It has a strong correlation with perceived human discomfort, making it a valuable tool, especially in regions like Bangladesh, where access to comprehensive meteorological data may be limited. Since temperature and humidity data are easily obtainable and exhibit significant variability across time and space, DI serves as a pragmatic choice for large-scale assessments. The index quantifies thermal discomfort by using these parameters and categorizes the results into easily interpretable levels of human comfort or discomfort [14]. This straightforward interpretability enhances DI’s utility for public health advisories and early warning systems, particularly in resource-constrained settings.

In recent years, the global impacts of climate change have brought increased attention to the rise in thermally uncomfortable days, particularly in low-income and tropical regions. Zhang et al. (2023) project that low-income regions, particularly those in warmer latitudes, will experience a significant rise in thermally uncomfortable days, intensifying existing vulnerabilities [15]. Ullah et al. (2024) report substantial increases in both the frequency and intensity of heat stress across urban centers in the Arabian Peninsula [16]. In a study of a tropical megacity, Gupta et al. (2024) estimated that nearly 0.9 million individuals were exposed to moderate to severe heat stress, with mid-rise buildings contributing notably to elevated thermal discomfort [17]. Interestingly, Golshan et al. (2021) found that students across various global locations experienced higher levels of thermal discomfort during the winter months, highlighting the complex interplay between climate, behavior, and infrastructure [18]. Together, these findings underscore the multifaceted nature of thermal discomfort and the urgency for implementing adaptive urban strategies, such as green pavements, water spray cooling, and heat-reflective infrastructure [15,17].

Bangladesh, with its hot and humid climate, is already experiencing marked increases in thermal discomfort. Ekra et al. (2024) reported significant increases in Heat Discomfort Index (DI), Humidex, and wet-bulb temperature across the country, with coastal regions experiencing more severe changes [19]. Talukdar et al. (2017) found that over 50% of the population in Mymensingh experienced discomfort from April to October [20]. Sharmin et al. (2020) modeled outdoor thermal sensation in Dhaka using both meteorological and personal factors [21]. Rahman et al. (2021) projected a continuous rise in heat index (HI) for the mid-century, with northern, northeastern, and south-central regions expected to face more significant increases [22]. These region-specific findings highlight the urgent need for localized adaptation strategies, such as promoting green urban spaces, implementing cool roof technologies, and improving public awareness around heat-related risks [23,24].

To meet the increasing demand for accurate forecasting, recent research has concentrated on hybrid models that merge traditional statistical methods with machine-learning algorithms to enhance predictive precision. Susanti et al. (2017) developed a hybrid model integrating time series regression, autoregressive integrated moving average (ARIMA), and artificial neural networks (ANN) to forecast data with trend, seasonal, and calendar variation patterns [25]. In the same way, Yunis et al. (2024) proposed hybrid models combining support vector regression (SVR), seasonal autoregressive integrated moving average (SARIMA), Long Short-Term Memory (LSTM), and Prophet for air pollution prediction, with LSTM-Prophet achieving the highest accuracy [26]. Thakur et al. (2021) surveyed hybrid models used in hydrological time series forecasting, highlighting the benefits of combining neural networks with conventional methods [27]. Sulandari et al. (2020) introduced singular spectrum analysis (SSA)-based hybrid models, two-level seasonal neural network (TLSNN) and two-level complex seasonal neural network (TLCSNN), which incorporate flexible trend functions, harmonics, and neural networks to improve forecasting performance for complex time series patterns [28]. These studies consistently demonstrated that hybrid models tend to outperform individual forecasting methods in terms of accuracy and flexibility.

Expanding upon these worldwide progressions, researchers in Bangladesh have recently started implementing hybrid models in diverse domains. Qureshi et al. (2025) introduced two novel algorithms for developing machine learning (ML)-based hybrid models, including seasonal-adjusted variants, to forecast heatwave warnings. These models are designed to manage high frequency datasets and effectively capture complex seasonal patterns [29]. Saif et al. (2024) compared ARIMA, neural networks, and a hybrid model to forecast fish production and concluded that the hybrid approach yielded the highest accuracy [30]. In the context of flood prediction, Dipro et al. (2023) evaluated ARIMA, SARIMAX, and Prophet models using inputs such as temperature, rainfall, and river water-level data [31]. Similarly, Ferdoush et al. (2021) demonstrated that a hybrid machine learning model combining random forest regression (RFR) and bidirectional LSTM outperformed standard LSTM models in short-term electricity load forecasting in Bangladesh [32]. In another study focused on precipitation forecasting in northern Bangladesh, a hybrid model combining M5 Prime (M5P) model tree and SVR achieved high predictive accuracy, with R² values reaching 0.87 and 0.92 for Rangpur and Sylhet, respectively, outperforming the individual models [33].

Despite this progress, limited research has explored the forecasting of thermal discomfort using seasonally adjusted hybrid models in the heat-vulnerable regions of Bangladesh, such as Rajshahi. Although DI is a widely used biometeorological indicator, it remains underutilized in long-term forecasting efforts aimed at informing public health and urban planning. Conventional models often struggle to fully capture the complex, multi-scale patterns in climatic data, which include strong seasonality, non-stationary trends, and non-linear residuals—a limitation that can be critical in a rapidly warming region like Bangladesh. To address this critical gap, the present study introduces a novel forecasting approach by applying the seasonal-adjusted hybrid algorithms of Qureshi et al. (2025) [29]. The novelty of our method lies in the strategic integration of seasonal-trend decomposition using Loess (STL) for effective seasonal decomposition, an advanced time series model (like Trigonometric Seasonality, Box-Cox Transformation, ARMA Errors, Trend, and Seasonal Components (TBATS)) for managing complex and multiple seasonal patterns, and an ML model (like LSTM) to capture nonlinear temporal dynamics. This integrated hybrid model surpasses conventional methods by delivering enhanced accuracy in forecasting thermal discomfort, specifically tailored to Bangladesh’s highly variable climate. The resulting high-precision projections are not only a technical advancement but also play a crucial role in enabling proactive public health interventions, optimizing early warning systems, and guiding data-driven urban planning. Such improvements are essential for strengthening climate resilience among vulnerable populations in heat-prone regions like Rajshahi.

2. Materials and methods

2.1. Study area and data collection

This study focuses on Rajshahi, a northwestern district of Bangladesh characterized by a hot and dry climate, particularly during the pre-monsoon and summer seasons [34]. Daily meteorological data, including average temperature and relative humidity, were obtained from the Bangladesh Meteorological Department (BMD) for the period from January 1, 1985, to December 31, 2024. Fig 1 presents the spatial distribution of the average annual number of severe discomfort days across 34 meteorological stations in Bangladesh, providing an overview of historical thermal stress patterns. To ensure data integrity and continuity, all missing values in the meteorological variables were addressed using time series-based imputation techniques before analysis. These variables were then utilized to calculate the DI, which is a well-known composite thermal indicator for evaluating human thermal stress. The resulting DI time series laid the groundwork for all the subsequent forecasting analyses. All data processing, modeling, and visualization tasks were performed using the R programming language.

thumbnail
Fig 1. Spatial distribution of the average annual number of severe discomfort days across 34 meteorological stations in Bangladesh.

The map was generated by the authors using QGIS version 3.30. Administrative boundary data were sourced from the Humanitarian Data Exchange (HDX), provided by OCHA Bangladesh (available at: https://data.humdata.org/dataset/cod-ab-bgd). This is an original figure created for illustrative purposes; it contains no proprietary basemap layers and is compliant with CC BY 4.0 licensing.

https://doi.org/10.1371/journal.pone.0344556.g001

2.2. Discomfort Index (DI) definition and classification

The DI is a widely used empirical indicator that estimates the level of thermal discomfort experienced by individuals due to the combined effects of temperature and humidity [35]. It plays a crucial role in evaluating human comfort, particularly in the context of climate monitoring, public health, and energy management [36]. In this study, Thom’s Discomfort Index was used, which is computed using the following formula [14]:

(1)

where DI is the Discomfort Index, T is the Average Air Temperature (°C), and RH is the Relative Humidity (%). Based on the calculated DI values, thermal stress is classified into six levels, ranging from no discomfort to extreme heat stress, as presented in Table 1. These categories help in interpreting the severity of heat stress and its potential impact on public health.

thumbnail
Table 1. Discomfort Index (DI) classification and associated thermal stress levels [13,14].

https://doi.org/10.1371/journal.pone.0344556.t001

2.3. Hybrid model’s forecasting framework

The hybrid modeling strategies used in this study are based on the earlier work [29]. To evaluate model performance, the DI time series was partitioned into 75% training data and 25% test data. As illustrated in Fig 2, the first approach applies a time series (TS) or ML model to the original data, followed by modeling the residuals using a complementary method. The final forecast is obtained by combining the primary forecast and the forecasted residuals. The second approach incorporates a seasonal adjustment step using STL decomposition to isolate the seasonal component from the original DI data. A model was then fitted to the seasonal-adjusted data and the residuals were further modeled. The final forecast was generated by aggregating the seasonal component with both the forecasted components from the adjusted model and its residuals. These workflows allow for better handling of trend, seasonality, and nonlinearity in the DI time series.

thumbnail
Fig 2. Workflow of hybrid forecasting models for Discomfort Index (DI) prediction.

https://doi.org/10.1371/journal.pone.0344556.g002

2.4. Forecasting time series and machine learning models

We utilized a diverse set of forecasting models, including classical time-series, machine learning, ensemble, and deep learning models. These models were selected to capture the linear, nonlinear, seasonal, and volatile characteristics of the DI time series. Each model was either applied individually or integrated into a hybrid framework to enhance the predictive performance. Detailed descriptions of the models used are provided in the following subsections (2.4.1–2.4.12).

2.4.1. Autoregressive integrated moving average (ARIMA).

ARIMA models are a powerful statistical tool used for time series analysis and forecasting [37]. These models are especially valuable when the data exhibit underlying patterns such as trends, seasonality, and autocorrelation. An ARIMA model consists of three main components: the autoregressive (AR) component captures the influence of past observations on the current value, the integrated (I) component addresses non-stationarity in the data by applying differencing to stabilize the mean, and the Moving Average (MA) component reflects the relationship between the current observation and past forecast errors. The structure of an ARIMA model is defined by three parameters—p, d, and q—which correspond to the orders of the AR, I, and MA components, respectively [3841]. The entire model can be specified as [42]:

(2)

where . A seasonal ARIMA known as SARIMA model defined as process with seasonal period s,

(3)

where , and the polynomial , , , and are defined as , , , and , respectively.

2.4.2. Trigonometric seasonality, Box-Cox transformation, ARMA errors, trend, and seasonal components (TBATS).

The TBATS model is a state-of-the-art time series forecasting method that effectively handles complex trends and multiple seasonal components [43]. To address non-linearity and non-normality in the data, the model applies the Box-Cox transformation, which enhances its ability to capture underlying trends and seasonal structures more accurately [44,45]. After extracting these components, the model fits an ARMA process to the residuals, to account for any remaining autocorrelation. This additional step helps to de-correlate the series and improve the accuracy of forecasts for future periods [46].

2.4.3. Exponential smoothing state space (ETS).

The ETS model provides a versatile and dependable method for forecasting univariate time series, especially when the data show trends and/or seasonal variations. This model decomposes the series into three fundamental components: error, trend, and seasonality. Each of these components can be specified as additive (A), multiplicative (M), none (N), or automatic (Z), with the latter allowing the model to select the most appropriate form based on the data [47]. For example, an ETS (A, M, N) model applies an additive form for the error term, a multiplicative form for the trend, and assumes no seasonal variation. The flexibility of the ETS framework in accommodating various structural features makes it especially appropriate for numerous time series forecasting tasks. [48].

2.4.4. Generalized autoregressive conditional heteroskedasticity (GARCH).

The GARCH model, which was first introduced by Bollerslev [49], is commonly utilized to analyze and predict time series data that exhibit changing volatility. This model comprises two elements: the autoregressive conditional heteroskedasticity (ARCH) component, which addresses the variance of the error term, and the GARCH component, which accounts for the relationship between the current and previous error variances. GARCH models are especially proficient at illustrating volatility clustering and long-memory effects within time series data. Nonetheless, they can be computationally demanding and typically need substantial datasets for accurate estimation [50].

2.4.5. Artificial neural network (ANN).

Artificial Neural Networks (ANNs) are adaptable, nonlinear modeling systems that excel at identifying intricate relationships within data, which makes them particularly useful for time series forecasting applications [51]. Numerous ANN architectures have been designed and effectively utilized in various fields, such as climate science, finance, and engineering [52]. Among these architectures, the multilayer perceptron (MLP) featuring a single hidden layer is one of the most widely employed structures for time series forecasting [53]. This configuration generally consists of an input layer, a hidden layer, and an output layer, with the connections between the nodes organized as feedforward, acyclic links [52].

2.4.6. Facebook prophet (FP).

Prophet is a powerful forecasting model for time series, created by Facebook’s Core Data Science team, intended to effectively manage data that shows significant trends and various seasonal influences [54,55]. It breaks down the time series into four components utilizing the additive model [56]:

(4)

where is the predicted value; captures the trend (linear or logistic growth); and accounts for seasonal effects (daily, weekly, yearly, etc.); models the holiday-related irregularities; and represents the error term. Although holiday effects were not considered relevant in this context, Prophet was selected for its strength in modeling long-term trends and complex seasonal structures. The model has been widely used in environmental and climatological applications for its flexibility, interpretability, and computational efficiency in handling interannual variation, long-term seasonality, and anomaly detection [57].

2.4.7. Support vector regression (SVR).

SVR is an effective method for forecasting time series, based on the principles of Support Vector Machines (SVM) [58]. It functions by transforming input features into a higher-dimensional space using kernel functions, which allows it to capture complex dependencies that typical linear models might overlook [59]. This characteristic makes SVR especially well-suited for forecasting tasks that involve non-stationary and nonlinear time series. Additionally, SVR is robust against outliers and has demonstrated the ability to deliver consistent and dependable forecasts across a range of applications [60].

2.4.8. Decision tree regression (DTR).

DTR is a supervised learning model that does not rely on parametric assumptions and is used to forecast continuous outcomes by systematically dividing the data into subsets based on feature values. At every internal node within the tree, a decision rule is employed to partition the data in a manner that reduces prediction error, usually by utilizing metrics like mean squared error. This process continues until a defined stopping condition is met, resulting in a tree structure where each leaf node represents a predicted numerical outcome [61]. This method can effectively capture nonlinear relationships and interactions among variables without needing data normalization or assumptions about the underlying distribution. While single decision trees may be susceptible to overfitting, their clarity and computational efficiency make them valuable for foundational forecasting in time series analysis [62].

2.4.9. Random forest regression (RFR).

RFR is an ensemble learning technique that builds several decision trees during the training phase and averages their outputs to perform regression tasks [63]. This technique reduces the likelihood of overfitting often seen with single decision trees by combining predictions from trees that are formed using various bootstrap samples and subsets of features [64]. It enhances model stability and prediction accuracy, especially in cases with nonlinear relationships, outliers, or high-dimensional datasets. Moreover, RF models are robust against noise and require limited parameter adjustments, without making strong assumptions regarding the underlying distribution of the data [65,66]. Due to these attributes, RFR has been extensively utilized in time series prediction, environmental modeling, and other fields that require dependable forecasting capabilities.

2.4.10. Extreme gradient boosting (XGBoost).

The XGBoost algorithm is an advanced ensemble learning method that enhances gradient boosting. It combines the predictions of multiple weak learners to achieve greater predictive accuracy than individual models. This technique is widely utilized in various scientific and engineering fields [67]. It is a decision tree-based ensemble machine learning approach that is extensively utilized in data science because of its high degree of accuracy and efficiency. By adopting a gradient boosting framework that consolidates the predictions from multiple constituent trees, the model enhances performance incrementally and generates precise forecasts [68]. The XGBoost model uses a gradient descent-based optimization method to reduce the loss function, leading to increased prediction accuracy through a process of iterative model improvement [69]. Given its capability to effectively manage nonlinear relationships, feature interactions, and large datasets, XGBoost is particularly effective for complex forecasting challenges, such as time series predictions.

2.4.11. Long short-term memory (LSTM).

LSTM networks are a specialized form of recurrent neural network (RNN) designed to handle sequential and time-dependent data [70]. They utilize memory cells along with gating mechanisms to effectively capture both short-term and long-term relationships, overcoming the vanishing gradient issue often faced by conventional RNNs. This makes LSTM models especially adept at time series forecasting tasks characterized by nonlinearity and temporal dependencies. LSTM networks feature gating mechanisms—including input, output, and forget gates—that allow the model to selectively keep, update, or eliminate information, enabling it to identify important temporal patterns in time series data [29,71].

2.4.12. Gated recurrent unit (GRU).

GRUs are a type of recurrent neural network designed to model sequential data. They are a simplified version of LSTM networks and can effectively learn long-term patterns in time series. GRUs use gating mechanisms to decide what information to keep or discard, helping the model focus on important trends over time. Compared to LSTMs, GRUs are faster to train and require fewer computational resources, while still offering similar prediction accuracy. These features make them well-suited for time series forecasting tasks [72]. In practice, GRU models involve several hyperparameters that can influence prediction accuracy, including the number of hidden layers, batch size, and learning rate schedule. The number of hidden layers defines the model’s depth, batch size determines the frequency of weight updates, and the learning rate schedule (e.g., learning rate drop) adjusts how the learning rate changes over time [73,74].

2.5. Hyperparameter tuning for ML time series models

The hyperparameters for all ML and deep learning models were optimized using Time Series Cross-Validation (TSCV) with grid search methodology [75]. This approach ensures robust parameter selection while maintaining the temporal integrity of the time series data. The tuning process employed rolling window cross-validation to prevent data leakage and ensure generalizability. For classical time-series models (ARIMA, ETS, TBATS, GARCH, etc.), automatic optimization techniques were utilized to tune the models through Akaike information criterion (AIC) minimization. ML models underwent extensive TSCV with grid search optimization. The SVR model was fine-tuned with a polynomial degree of 3, cost parameter of 45.69, nu value of 0.5, and tight tolerance and epsilon values of 0.00001 to ensure precise convergence. The RFR model was optimized with 100 trees, balancing computational efficiency with predictive performance. XGBoost parameters were carefully calibrated with “reg:squarederror” objective function, RMSE evaluation metric, learning rate of 0.1, maximum tree depth of 6, and subsample ratios of 0.8 for both instances and features, trained over 100 rounds with 10-round early stopping.

Deep learning architectures received specialized attention in their hyperparameter configuration. The LSTM network was structured with two hidden layers of 50 units each using ReLU activation, trained with Adam optimizer for 10 epochs with batch size of 30 and early stopping patience of 5 epochs. The GRU model employed a larger architecture with 100 units in both layers, similarly trained with Adam optimizer for 10 epochs with batch size of 30. Both recurrent neural networks utilized a consistent lookback period of 30 time steps to capture temporal dependencies effectively. The Prophet model maintained its default parameter settings, leveraging its inherent optimization capabilities for time series forecasting and the ANN model’s network structure was automatically determined by the nnetar() function. This comprehensive hyperparameter tuning strategy ensured that each model was optimally configured for the specific characteristics of the time series data, balancing model complexity with generalization capability while respecting the chronological dependencies inherent in the dataset.

2.6. Seasonal-trend decomposition using loess (STL decomposition)

In time series analysis, decomposition techniques are essential tools for isolating underlying components such as trend, seasonality, and irregular fluctuations from observed data. One such widely used method is the STL, introduced by Cleveland et al. (1990) [76]. Unlike classical decomposition methods such as SEATS (Seasonal Extraction in ARIMA Time Series) and X-11, STL is highly flexible and capable of handling a wide range of seasonal patterns, including daily, sub-daily, monthly, and quarterly data [56,77]. This adaptability makes STL particularly suitable for complex or non-standard seasonal structures often found in climatological and environmental time series.

2.7. Model performance evaluation metrics

The model’s accuracy was assessed using the following performance metrics: Mean Absolute Error (MAE), Mean Absolute Percentage Error (MAPE), Mean Absolute Scaled Error (MASE), and Root Mean Squared Error (RMSE). The formulas for calculating each of these metrics are provided below:

(5)(6)(7)(8)

where represents the total number of data points, refers to the actual observed values, and denotes the predicted values. The symbol represents the seasonal period. MAE, RMSE, MAPE, and MASE are common metrics used to evaluate the accuracy of forecasting models. MAE and RMSE are affected by the scale of the data, while MAPE and MASE are scale-independent.

2.8. Ethical statement

Ethical approval was not required for this study, as the dataset used does not involve any human or animal subjects.

3. Results

3.1. Data description

Rajshahi, situated in northwestern Bangladesh, is among the hottest regions in the country. To assess thermal discomfort in this heat-prone area, we utilized daily average air temperature (°C) and relative humidity (%) data from the Bangladesh Meteorological Department (BMD), spanning January 1985 to December 2024. The DI was computed using Thom’s formula, offering a standardized indicator of heat-related stress. Out of 14,610 days analyzed, 4,692 were classified as “High Discomfort,” where most of the population would feel discomfort. Additionally, 806 days fell under the “Severe Discomfort” category, suggesting widespread stress. Notably, no days met the criteria for “Extreme Heat Stress,” implying that although discomfort is common, life-threatening thermal extremes were not observed during this period.

Fig 3 illustrates the daily changes in the DI for Rajshahi from January 1985 to December 2024. The blue spikes represent the daily DI values, which follow a consistent seasonal pattern, rising sharply during the warmer months and declining in the cooler periods each year. This pattern emphasizes the regular cycles of thermal discomfort experienced in the region. The maroon line represents the seasonal-adjusted values, which smooth out the predictable fluctuations caused by the changing seasons. This adjustment allows us to observe short-term variations and irregularities that may result from unusual weather or broader climate changes. Notably, the trend line (gray) shows no significant upward or downward movement, indicating that there have been no abrupt long-term changes in DI during the 40 years.

thumbnail
Fig 3. Daily discomfort index in Rajshahi with seasonal adjustment (maroon) and trend components (gray) from January 1985 to December 2024.

https://doi.org/10.1371/journal.pone.0344556.g003

Fig 4 displays the annual distribution of the DI in Rajshahi from 1985 to 2024. While Fig 3 indicates consistent seasonal patterns, Fig 4 shows a gradual increase in the median DI values over time. Most years after 2005 have median values consistently above 25. This implies that while the overall pattern of seasonal discomfort remains stable, the intensity of heat stress has progressively increased, leading to more frequent extreme discomfort events in recent decades. This long-term trend aligns with the broader warming patterns observed in the region.

thumbnail
Fig 4. Boxplot shows the distribution of yearly thermal discomfort variation in Rajshahi from January 1985 to December 2024.

https://doi.org/10.1371/journal.pone.0344556.g004

Fig 5 illustrates the monthly distribution of the DI in Rajshahi throughout the entire study period. Each boxplot represents the range of daily DI values for every month, highlighting the seasonal fluctuations in human thermal discomfort over the year. The findings indicate a clear seasonal trend, with discomfort levels gradually rising from January to June, peaking between June and September, and then decreasing as the cooler months of November and December approach. During the summer and early autumn months (June to September), the DI remains consistently elevated, with median values hovering around or surpassing 27, signifying an extended period of increased thermal stress. Furthermore, these months display narrower interquartile ranges, implying that the higher discomfort levels are more stable and persistent during this period. Conversely, the winter and early spring months (December to March) are characterized by lower DI values and greater variability, reflecting cooler and more variable conditions.

thumbnail
Fig 5. Boxplot shows the distribution of monthly thermal discomfort variation in Rajshahi from January 1985 to December 2024.

https://doi.org/10.1371/journal.pone.0344556.g005

Fig 6 complements this pattern by showing the monthly distribution of discomfort categories based on the discomfort index. Throughout the winter months (December to February), almost every day is categorized as “No discomfort,” reflecting thermally pleasant conditions. As temperatures begin to rise in March, levels of discomfort also increase, resulting in a higher percentage of days classified as “Mild discomfort” and “Moderate discomfort.” From April through September, thermal stress intensifies, especially between June and August, where the majority of days are categorized as “High discomfort” and there is also a notable number of “Severe discomfort” days. Interestingly, there are no days rated as “Extreme Heat Stress,” signifying that such conditions are absent. In October, the trend starts to change, with a reduction in high discomfort levels, leading to a return to more comfortable conditions by November. This seasonal cycle illustrates a transition from winter thermal comfort to widespread discomfort in the summer months.

thumbnail
Fig 6. Monthly proportion of population discomfort levels in Rajshahi based on daily Discomfort Index from January 1985 to December 2024, categorized by defined thermal stress thresholds.

https://doi.org/10.1371/journal.pone.0344556.g006

Fig 7 summarizes these seasonal dynamics by plotting the monthly average DI values against established comfort thresholds. The lowest DI values are recorded in January (~16.1) and December (~17.8), both within the no-discomfort zone. Starting in March, DI levels rise sharply, reaching 23.0 (Mild Discomfort), and remaining above 26 from April to September. The peak occurs in June (DI ≈ 28.1), falling within the “High Discomfort” category. No monthly average exceeds the extreme stress threshold, which aligns with the findings from previous figures. Overall, this pattern highlights extended periods of moderate to high discomfort during the pre-monsoon and monsoon months.

thumbnail
Fig 7. Monthly average Discomfort Index (DI) overlaid on comfort zones in Rajshahi from January 1985 to December 2024, illustrating typical thermal stress levels experienced in each month.

https://doi.org/10.1371/journal.pone.0344556.g007

3.2. Forecasting Discomfort Index (DI) without seasonal adjustment

To forecast the DI without adjusting for seasonality in Rajshahi, a city known for its intense heat in northwestern Bangladesh, we examined high-frequency time series data spanning from 1985 to 2024. Due to the non-stationary and volatile characteristics of DI—especially without seasonal decomposition—accurate forecasting presents significant challenges. To address this issue, we conducted an extensive evaluation of over 76 model configurations, including traditional time series models (ARIMA, ETS, TBATS, GARCH), eight standalone ML models (e.g., ANN, SVR, XGBoost, LSTM, GRU), and a diverse array of hybrid combinations – including ARIMA-based hybrids (e.g., ARIMA-SVR), TBATS-based hybrids, ETS-based hybrids, GARCH-based hybrids, and cross-hybrids such as SVR-ARIMA, LSTM-TBATS, etc. A complete listing of all hybrid models without seasonal adjustment and their performance metrics is provided in Table S1 in S1 File.

The forecasting performance of the standalone models and the top five best performing hybrid models for daily DI without seasonal adjustment in Rajshahi is presented in Table 2. Among the standalone models, tree-based algorithms demonstrated superior capability in handling the raw, high-frequency data. XGBoost emerged as the top performer (MAE: 0.7132, MAPE: 3.13%), slightly outperforming SVR and Random Forest (RFR), while traditional models like ARIMA and ETS performed poorly (MAPE > 18%), highlighting their limitations with non-stationary series. The hybrid model evaluation revealed that combinations leveraging the decomposition strength of TBATS with a subsequent machine learning regressor were most effective (Table 2). The top-performing models in this category were TBATS-DTR and TBATS-LSTM, which achieved the lowest errors (e.g., MAE < 0.68, MAPE ≈ 2.95%). This superior performance underscores the advantage of first using TBATS to model the series’ inherent trend and seasonality, then applying a non-linear model (DTR or LSTM) to capture complex patterns in the residuals (Table 2). In contrast, hybrids that lacked a coherent decomposition strategy or combined incompatible components (e.g., GARCH-GRU, LSTM-TBATS) exhibited significantly higher error rates, confirming that a strategic integration of models is critical for success (Table S1 in S1 File). To enhance predictive accuracy, we implemented a seasonal-adjusted forecasting framework using STL decomposition. This approach isolates the trend, seasonal, and residual components, allowing models to be applied to the adjusted series. We developed a comprehensive suite of seasonal-adjusted single and hybrid models, with full results of seasonal-adjusted hybrid models available in Table S2 in S1 File.

thumbnail
Table 2. Forecasting performance of single models and the top five hybrid models for daily Discomfort Index (DI) without seasonal adjustment in Rajshahi (1985–2024).

https://doi.org/10.1371/journal.pone.0344556.t002

Among the single models applied to the STL-adjusted data in Table 3, STL-TBATS and STL-GRU were the most accurate, achieving low MAE values of 0.7957 and 0.8100, respectively. This indicates that even after STL decomposition, components like complex seasonality and non-linear residuals persist, which these models are well-suited to handle. The seasonal-adjusted hybrid models demonstrated a further significant improvement in performance. The top-performing model overall was the STL-TBATS-LSTM hybrid in Tables 3 and S2 in S1 File, which achieved the lowest scores across all metrics (MAE: 0.4809, MAPE: 2.12%). The STL-TBATS-DTR model was a close competitor. The success of the STL-TBATS-LSTM architecture can be attributed to its complementary, multi-stage design: the STL decomposition first removes the dominant seasonality; TBATS then expertly models any remaining complex seasonal and trend components in the adjusted series; finally, the LSTM captures the intricate non-linear and long-term temporal dependencies present in the residuals of the TBATS model. This layered approach effectively addresses both linear and non-linear patterns, making it exceptionally powerful for forecasting the DI. In summary, while several models showed competence, the strategic hybrid combination of STL, TBATS, and LSTM proved to be the most accurate and reliable method for forecasting thermal discomfort in Rajshahi. This model’s architecture is particularly adept at handling the complex, multi-component nature of the DI time series.

thumbnail
Table 3. Forecasting performance of seasonal-adjusted single models and the top five seasonal-adjusted hybrid models for daily Discomfort Index (DI) in Rajshahi (1985–2024).

https://doi.org/10.1371/journal.pone.0344556.t003

Fig 8 shows the daily DI in Rajshahi, both observed (1985–2024) and forecasted (2025–2027), categorized by thermal stress levels. Fig 8a presents the long-term DI trends, while the bottom panel zooms in on the forecast period to highlight seasonal discomfort patterns. Historically, DI values generally stayed within the “No Discomfort” to “Moderate Discomfort” range, indicating relatively comfortable conditions. However, the forecast indicates a significant increase in thermal stress. Fig 8b reveals frequent spikes into the “High Discomfort” and “Severe Discomfort” zones, especially during the pre-monsoon and summer months. This suggests a growing climate risk, with more days likely to surpass critical heat thresholds, posing challenges to health and well-being.

thumbnail
Fig 8. Observed (1985–2024) and forecasted (2025–2027) daily Discomfort Index (DI) in Rajshahi using the STL-TBATS-LSTM model, overlaid with categorized thermal stress levels in (a) and the forecasted DI (2025–2027) with 80% and 95% confidence intervals to highlight the uncertainty ranges with seasonal and extreme discomfort patterns in (b).

https://doi.org/10.1371/journal.pone.0344556.g008

Fig 9, the donut chart, summarizes this trend quantitatively by showing the percentage distribution of forecasted days across categorized discomfort levels. A large portion of days (39.8%) are predicted to reach “High Discomfort” (27 ≤ DI < 29), meaning most people will experience hot and uncomfortable conditions. This is followed by “No Discomfort” (26%), “Moderate Discomfort” (20.3%), and “Mild Discomfort” (12.7%). Notably, “Severe Discomfort” (29 ≤ DI < 32) is expected on 1.2% of days, indicating significant heat stress for nearly everyone. Although no days are projected to reach the “Extreme Heat Stress” threshold (DI ≥ 32), the increasing frequency of days in the higher discomfort categories points to a concerning upward trend. Combining Figs 8 and 9, we observed that demonstrate a clear intensification of heat-related discomfort in Rajshahi’s near future. This pattern calls for proactive climate adaptation strategies, including improved heat warning systems, urban greening, and targeted health interventions to protect vulnerable populations.

thumbnail
Fig 9. Donut chart illustrates percentage distribution of forecasted daily Discomfort Index (DI) categories in Rajshahi from January 2025 to December 2027.

https://doi.org/10.1371/journal.pone.0344556.g009

4. Discussion

Globally, climate change is intensifying heat-related stress, especially in tropical and low-income countries [78]. Thermal discomfort, measured by indicators such as the DI, serves as a more reliable measure of heat-related stress that better represents human experience than temperature alone [79]. As rising temperatures and humidity levels interact with changing climatic regimes, DI is becoming an important tool for assessing public health risk [80]. In Bangladesh, particularly in northwestern regions such as Rajshahi, growing evidence indicates a rise in both the frequency and intensity of thermally uncomfortable days [22]. This pattern is especially pronounced during the pre-monsoon and monsoon seasons, indicating that there is an immediate need for forecasting technologies to aid climate-sensitive planning and safeguard susceptible communities.

This study proposed and evaluated 64 ML-based hybrid models for forecasting the Discomfort Index (DI) in Rajshahi, a heat-prone region of Bangladesh. To enhance forecasting accuracy, these combinations were also analyzed with seasonal adjustment using STL decomposition, resulting in a total of 128 model variations. Among the non-seasonally adjusted models, the TBATS-DTR hybrid yielded the best performance, with an MAE of 0.6709, MAPE of 2.9508, RMSE of 0.8940, and MASE of 0.9268. The TBATS-LSTM model followed closely, achieving nearly identical metrics, indicating its high effectiveness. After applying STL decomposition, the STL-TBATS-LSTM model achieved the highest overall accuracy (MAE = 0.4810, MAPE = 2.1230, RMSE = 0.6381, MASE = 0.6644), with the STL-TBATS-DTR model showing comparable results. These findings confirm that STL-based decomposition significantly enhances model performance and that hybridization, particularly involving LSTM, consistently improves forecasting accuracy. Furthermore, the study observed a notable increase in the frequency and intensity of thermally uncomfortable days, particularly during the pre-monsoon and monsoon seasons. This trend, supported by both historical data and model projections, signals an escalating thermal discomfort burden in Rajshahi. As shown in Fig 8, while historical data mostly fell within the “No Discomfort” to “Moderate Discomfort” categories, future forecasts for 2025–2027 indicate a significant shift toward more extreme conditions. Specifically, “High Discomfort” days are projected to dominate, with 39.8% forecasted days, while “Severe Discomfort” is predicted for 1.2% days (Fig 9). These projections align with recent findings on the pronounced impact of seasonal variation on health outcomes, particularly for vulnerable populations [81], underscoring that accurate seasonal forecasting is critical for managing these escalating heat-related health risks.

The forecasted increase in “High Discomfort” days has profound implications for public health and urban management. To translate these model outputs into actionable strategies, stakeholders can use these forecasts for targeted planning. For urban and health authorities in Bangladesh, we recommend the following specific policy and adaptation strategies: (i) Enhanced Early Warning Systems, (ii) Adaptive Work Scheduling, and (iii) Informed Infrastructure Design. (i) Enhanced Early Warning Systems: Integrate the STL-TBATS-LSTM model forecasts into a city-level heat-health alert system. Timely DI-based warnings can trigger public health advisories, directing citizens to cooling centers and initiating hydration programs, especially for the elderly, children, and those with pre-existing conditions who are most susceptible to seasonal health shocks [81,82]. This study’s model can play a crucial role in shaping early warning systems, allowing public health authorities to issue timely alerts ahead of discomfort peaks. (ii) Adaptive Work Scheduling: Labor and industry departments can use seasonal forecasts to mandate adjusted work schedules during peak discomfort periods, reducing heat exposure for outdoor workers in construction and agriculture. In practice, these forecasts can inform the establishment of cooling centers, hydration stations, targeted outreach programs, and work/rest schedules for vulnerable populations [83]. Furthermore, (iii) Informed Infrastructure Design: Urban planners should use long-term discomfort projections to guide investments in heat mitigation infrastructure. This includes prioritizing green spaces, cool roofs, and improved building insulation in areas projected to experience the most severe discomfort, thereby mitigating the urban heat island effect [84]. By embedding these forecasts into operational frameworks, authorities can proactively protect public health and enhance urban resilience.

The findings of this study align closely with prior research on thermal discomfort, climate variability, and hybrid forecasting models. The increasing frequency and intensity of thermally uncomfortable days in Rajshahi reflect regional trends identified by Ekra et al. (2024) and Rahman et al. (2021), who reported rising heat stress indicators across Bangladesh, particularly in the coastal and northern regions [19,22]. Similarly, Talukdar et al. (2017) and Sharmin et al. (2020) emphasized the seasonal prevalence of discomfort in Mymensingh and Dhaka, respectively, highlighting the nationwide impact of climate-induced thermal stress [20,21]. Our findings further support global projections made by Zhang et al. (2023) and Ullah et al. (2024), which indicate that low-income tropical regions are expected to experience increased thermal discomfort due to climate change. [15,16].

Our results emphasize the effectiveness of hybrid forecasting models that combine ML with classical time series methods. The improved performance of the STL-TBATS-LSTM model in this study aligns with the findings of Qureshi et al. (2025), who reported that their STL-ARIMA-LSTM hybrid model outperformed others in predicting heatwaves in Bangladesh [29]. Both studies demonstrate the importance of incorporating seasonal decomposition (STL) to capture patterns in high-frequency environmental data more effectively. Similarly, Saif et al. (2024) found that a hybrid model combining ARIMA and neural networks achieved the highest forecasting accuracy for fish production in the country [30]. Ferdoush et al. (2021) also showed that a hybrid model integrating Random Forest with bidirectional LSTM outperformed standalone LSTM models in short-term electricity load forecasting in Bangladesh [32]. Collectively, these studies highlight the growing success of hybrid approaches in addressing complex forecasting challenges across various domains in the region. Furthermore, our findings are consistent with earlier international research by Susanti et al. (2017), Sulandari et al. (2020), and Yunis et al. (2024), which demonstrated that hybrid models that integrate statistical and machine learning techniques provide higher forecasting accuracy across a range of fields, including air pollution, electricity load, and hydrology [25,26,28].

This study presents several key strengths that enhance the credibility and applicability of its findings. First, it employs a high-frequency dataset consisting of daily observations over 40 years (1985–2024). This extensive temporal resolution allows for an accurate capture of both short-term fluctuations and long-term trends in thermal discomfort, ensuring greater statistical reliability and sensitivity to seasonal and interannual variability. Second, the study systematically evaluates a diverse array of hybrid model combinations (128 in total), encompassing both traditional and machine learning approaches. By integrating models such as ARIMA, TBATS, and ETS with advanced learners like DTR, RFR, and LSTM, it thoroughly investigates the forecasting potential of both linear and nonlinear structures. Third, the consistent outperformance of STL-adjusted hybrid models—especially those utilizing LSTM—demonstrates the practical advantages of combining statistical rigor with deep learning capabilities. This strategy not only improves prediction accuracy but also enhances the model’s adaptability to complex and evolving patterns in climate data. Finally, the study goes beyond model evaluation; it interprets the results in terms of human impact. By linking forecasted discomfort levels to public health risks and seasonal exposure patterns, it provides actionable insights for urban planners and policymakers in heat-prone regions such as Rajshahi.

The proposed models demonstrated strong performance, but several limitations should be acknowledged. First, the DI was calculated using only air temperature and relative humidity, excluding other influential variables like wind speed and solar radiation due to data unavailability. Second, although the models showed excellent accuracy for Rajshahi, their applicability to other regions may be limited without further validation. Third, while STL decomposition improved forecast performance, it assumes a stable seasonal pattern over time, which may not fully capture nonlinear shifts driven by climate change [85,86]. External validation against independent datasets, such as ERA5 reanalysis or nearby weather stations, would further strengthen the robustness of the findings; however, this was beyond the scope of the current study and is recommended as an important direction for future research. Moreover, hybrid models that incorporate deep learning components like LSTM are computationally intensive, requiring longer training times and greater resources. This could limit their real-time operational use [87]. However, these limitations are somewhat mitigated by the use of high-frequency (daily) data collected over a robust 40-year period, which enhances the statistical reliability and temporal sensitivity of the models. Importantly, none of these limitations undermines the validity of the key findings, which remain sound and valuable for forecasting and planning in similar heat-prone contexts.

Recognizing these limitations creates opportunities for future work. First, future studies could explore additional decomposition methods like Empirical Mode Decomposition (EMD), robust STL, or wavelet transform [88] to capture more complex patterns in climate data. Integrating these techniques with newer deep learning architectures, like Transformer models [89], may further enhance prediction accuracy. Second, while this study focused on the DIin Rajshahi, the methodology could be applied to other regions of Bangladesh or similar heat-prone areas worldwide. Comparative studies across different climatic zones would help evaluate the generalizability and adaptability of the proposed hybrid models. Third, incorporating additional meteorological and socioeconomic variables could refine DI calculations and enable more targeted public health strategies. Additionally, while this study primarily focused on error metrics for model evaluation, the incorporation of comprehensive temporal validation strategies and sensitivity analyses remains beyond the current scope and will be addressed in future research to further enhance model robustness and reliability. Finally, collaboration with health data systems could facilitate the development of early warning systems that connect DI forecasts with hospital admission trends, supporting real-time decision-making and climate-resilient urban planning in vulnerable regions like Rajshahi.

5. Conclusion

This study demonstrates that hybrid forecasting models integrating seasonal-trend decomposition (STL), advanced time series modeling (TBATS), and deep learning (LSTM) substantially improve the accuracy of thermal discomfort predictions in Rajshahi, Bangladesh. With climate change intensifying heat stress, our findings reveal a marked increase in the frequency and severity of discomfort days, underscoring a growing public health challenge. The STL-TBATS-LSTM model provides actionable forecasts that can inform heat-health early warning systems, adaptive labor policies, and climate-resilient urban planning, empowering stakeholders to mitigate heat-related risks effectively. While some limitations, such as variable selection and regional generalizability, remain, this work lays a critical foundation for enhancing climate adaptation strategies in heat-prone regions through robust, interpretable, and operational forecasting tools.

Supporting information

S1 File. Supplementary tables showing forecasting performance of hybrid models for daily Discomfort Index (DI) with and without seasonal adjustment in Rajshahi (1985–2024).

https://doi.org/10.1371/journal.pone.0344556.s001

(DOCX)

S1 Data. Dataset of Discomfort Index (1985–2024).

https://doi.org/10.1371/journal.pone.0344556.s002

(XLSX)

Acknowledgments

The authors are grateful to the editor and the anonymous reviewers for their constructive comments and suggestions, which have substantially strengthened the quality of this manuscript. The authors also acknowledge the Climate Division of the Bangladesh Meteorological Department (BMD) for generously providing the data utilized in this analysis.

References

  1. 1. Stathopoulos T, Wu H, Zacharias J. Outdoor human comfort in an urban climate. Build Environ. 2004;39:297–305.
  2. 2. Tselepidaki I, Santamouris M, Moustris C, Poulopoulou G. Analysis of the summer discomfort index in Athens, Greece, for cooling purposes. Energy Build. 1992;18(1):51–6.
  3. 3. Fouillet A, Rey G, Laurent F, Pavillon G, Bellec S, Guihenneuc-Jouyaux C, et al. Excess mortality related to the August 2003 heat wave in France. Int Arch Occup Environ Health. 2006;80(1):16–24. pmid:16523319
  4. 4. Nastos PT, Matzarakis A. Weather impacts on respiratory infections in Athens, Greece. Int J Biometeorol. 2006;50(6):358–69. pmid:16596366
  5. 5. Nastos PT. Weather, ambient air pollution and bronchial asthma in Athens, Greece. Adva Glob Change Res. 2008;30:173–88.
  6. 6. Panagiotakos DB, Chrysohoou C, Pitsavos C, Nastos P, Anadiotis A, Tentolouris C, et al. Climatological variations in daily hospital admissions for acute coronary syndromes. Int J Cardiol. 2004;94(2–3):229–33. pmid:15093986
  7. 7. Heat and health. [cited 11 May 2025]. Available from: https://www.who.int/news-room/fact-sheets/detail/climate-change-heat-and-health
  8. 8. Becker S, Potchter O, Yaakov Y. Calculated and observed human thermal sensation in an extremely hot and dry climate. Energy Build. 2003;35(8):747–56.
  9. 9. Conti S, Meli P, Minelli G, Solimini R, Toccaceli V, Vichi M, et al. Epidemiologic study of mortality during the Summer 2003 heat wave in Italy. Environ Res. 2005;98(3):390–9. pmid:15910795
  10. 10. Höppe P. The physiological equivalent temperature - a universal index for the biometeorological assessment of the thermal environment. Int J Biometeorol. 1999;43(2):71–5. pmid:10552310
  11. 11. Ning S, Jing W, Ge Z. Sunlight perception and outdoor thermal comfort in college campuses: a new perspective. Sci Rep. 2023;13(1):16112. pmid:37752195
  12. 12. Nurmaya EM, Abidin AU, Hasanah NAI, Asmara AA. Heat stress analysis using discomfort index method: impact on macro-environmental in Yogyakarta. J Ecol Eng. 2022;23(1):286–95.
  13. 13. Siami L, Ramadhani A. Climatology of Discomfort Index for decade in Bandar Lampung, Indonesia. KnE Soc Sci. 2019.
  14. 14. Poupkou A, Nastos P, Melas D, Zerefos C. Climatology of Discomfort Index and air quality index in a large urban Mediterranean Agglomeration. Water Air Soil Pollut. 2011;222(1–4):163–83.
  15. 15. Zhang J, You Q, Ren G, Ullah S, Normatov I, Chen D. Inequality of global thermal comfort conditions changes in a warmer world. Earths Future. 2023;11:e2022EF003109.
  16. 16. Ullah S, Aldossary A, Ullah W, Al-Ghamdi SG. Augmented human thermal discomfort in urban centers of the Arabian Peninsula. Sci Rep. 2024;14(1):3974. pmid:38368465
  17. 17. Gupta A, De B, Shukla AK, Pignatta G. Vulnerability assessment of a highly populated megacity to ambient thermal stress. Sustainability. 2024;16(8):3395.
  18. 18. Golshan T, Lande S, Nickfardjam K, Cohensedgh S, Roitblat Y, Nehuliaieva L, et al. Thermal comfort in school classes in the era of global warming: a prospective multicenter study. J Sch Health. 2021;91(2):146–54. pmid:33404108
  19. 19. Ekra AT, Hamed MM, Ali Z, Muhammad MKIB, Khan MMH, Kamruzzaman M, et al. Changes in human heat discomfort and its drivers in Bangladesh. Urban Climate. 2024;55:101884.
  20. 20. Talukdar M, Hossen M, Baten M. Trends of outdoor thermal discomfort in mymensingh: an application of Thoms’ Discomfort index. J Environ Sci Nat Resourc. 2018;10(2):151–6.
  21. 21. Sharmin T, Steemers K. Effects of microclimate and human parameters on outdoor thermal sensation in the high-density tropical context of Dhaka. Int J Biometeorol. 2020;64(2):187–203. pmid:30209615
  22. 22. Rahman MB, Salam R, Islam ARMT, Tasnuva A, Haque U, Shahid S, et al. Appraising the historical and projected spatiotemporal changes in the heat index in Bangladesh. Theor Appl Climatol. 2021;146(1–2):125–38. pmid:34334853
  23. 23. Rahman MdM, Hasan J. Evaluating the impact of green spaces on urban heat reduction in Rajshahi, Bangladesh Using the InVEST Model. Land. 2024;13(8):1284.
  24. 24. Khorat S, Das D, Khatun R, Aziz SM, Anand P, Khan A, et al. Cool roof strategies for urban thermal resilience to extreme heatwaves in tropical cities. Energy Build. 2024;302:113751.
  25. 25. Susanti A, Jati Setyadi H, Zhang YR, Huang JW, Zhou FJ. Hybrid model for forecasting time series with trend, seasonal and salendar variation patterns. J Phys Conf Ser. 2017;890:012160.
  26. 26. Yunis R, Andri A, Djoni D. Hybridization model for air pollution prediction using time series data. CogITo Smart Journal. 2024;10(1):422–35.
  27. 27. Thakur S, Pandey M. A survey on hybrid models used for hydrological time-series forecasting. 2021;247–59. https://doi.org/10.1007/978-981-15-7533-4_19
  28. 28. Sulandari W, Subanar S, Suhartono S, Utami H, Hisyam Lee M, Rodrigues PC. SSA-based hybrid forecasting models and applications. Bull Electrc Eng Inform. 2020;9(5):2178–88.
  29. 29. Qureshi MMU, Ahmed AB, Dulmini A, Khan MMH, Rois R. Developing a seasonal-adjusted machine-learning-based hybrid time‑series model to forecast heatwave warning. Sci Rep. 2025;15(1):8699. pmid:40082574
  30. 30. Saif MB, Khanam M. Comparing ARIMA, neural network and hybrid models for forecasting fish production in Bangladesh. Dhaka Univ J Sci. 2024;72(1):71–6.
  31. 31. Dipro N-A-M, Sultana Z, Jui MS, Faisal BA, Shovon MSH, Shin J, et al. Forecasting Flood in Bangladesh Using Different Time Series Models: A Comparative Analysis. In: 2023 International Conference on Innovation and Intelligence for Informatics, Computing, and Technologies (3ICT). 2023. pp. 107–14.
  32. 32. Ferdoush Z, Mahmud BN, Chakrabarty A, Uddin J. A short-term hybrid forecasting model for time series electrical-load data using random forest and bidirectional long short-term memory. Int J Electric Comput Eng. 2021;11(1):763.
  33. 33. Di Nunno F, Granata F, Pham QB, de Marinis G. Precipitation forecasting in Northern Bangladesh using a hybrid machine learning model. Sustainability. 2022;14(5):2663.
  34. 34. Rashid GM, Hossain MMT, Akhter MAE, Mallik MAK. A study on the heat wave conditions over Bangladesh During 1990 - 2019. J of Eng Sci. 2024;14(2):59–67.
  35. 35. Epstein Y, Moran DS. Thermal comfort and the heat stress indices. Ind Health. 2006;44(3):388–98. pmid:16922182
  36. 36. Thom discomfort index – UrbanSIS. [cited 13 May 2025]. Available from: https://urbansis.eu/thom-discomfort-index/
  37. 37. What are ARIMA Models? | IBM. [cited 5 May 2025]. Available from: https://www.ibm.com/think/topics/arima-model
  38. 38. ARIMA for Time Series Forecasting: A Complete Guide | DataCamp. [cited 5 May 2025]. Available from: https://www.datacamp.com/tutorial/arima
  39. 39. Hyndman RJ, Khandakar Y. Automatic time series forecasting: The forecast package for R. J Stat Softw. 2008;27:1–22.
  40. 40. Autoregressive Integrated Moving Average (ARIMA) Prediction Model. [cited 5 May 2025]. Available from: https://www.investopedia.com/terms/a/autoregressive-integrated-moving-average-arima.asp
  41. 41. 4.1 Seasonal ARIMA models | STAT 510. [cited 5 May 2025]. Available from: https://online.stat.psu.edu/stat510/lesson/4/4.1
  42. 42. Ho SL, Xie M. The use of ARIMA models for reliability forecasting and analysis. Comput Ind Eng. 1998;35:213–6.
  43. 43. de Livera AM, Hyndman RJ, Snyder RD. Forecasting time series with complex seasonal patterns using exponential smoothing. J Am Stat Assoc. 2011;106:1513–27.
  44. 44. Thayyib PV, Thorakkattle MN, Usmani F, Yahya AT, Farhan NHS. Forecasting Indian goods and services tax revenue using TBATS, ETS, neural networks, and hybrid time series models. Cogent Econ Fin. 2023;11.
  45. 45. Bhattacharyya A, Chattopadhyay S, Pattnaik M, Chakraborty T. Theta Autoregressive Neural Network: A Hybrid Time Series Model for Pandemic Forecasting. In: 2021 International Joint Conference on Neural Networks (IJCNN). 2021. pp. 1–8.
  46. 46. Chakraborty T, Ghosh I, Mahajan T, Arora T. Nowcasting of COVID-19 confirmed cases: foundations, trends, and challenges. Stud Syst Decision Control. 2022;366:1023–64.
  47. 47. Jofipasi CA, Miftahuddin, Hizir. Selection for the best ETS (error, trend, seasonal) model to forecast weather in the Aceh Besar District. IOP Conf Ser: Mater Sci Eng. 2018;352:012055.
  48. 48. Naher S, Rabbi F, Hossain MM, Banik R, Pervez S, Boitchi AB. Forecasting the incidence of dengue in Bangladesh-Application of time series model. Health Sci Rep. 2022;5(4):e666. pmid:35702512
  49. 49. Bollerslev T. Generalized autoregressive conditional heteroskedasticity. J Econom. 1986;31:307–27.
  50. 50. (23) ARIMA vs GARCH (Time series forecasting) | LinkedIn. [cited 6 May 2025]. Available from: https://www.linkedin.com/pulse/arima-vs-garch-time-series-forecasting-mahdi-navaei/
  51. 51. Khashei M, Bijari M. An artificial neural network (p,d,q) model for timeseries forecasting. Exp Syst Appl. 2010;37(1):479–89.
  52. 52. Panigrahi S, Behera HS. A hybrid ETS–ANN model for time series forecasting. Eng Appl Artif Intell. 2017;66:49–59.
  53. 53. Zhang GP. Time series forecasting using a hybrid ARIMA and neural network model. Neurocomputing. 2003;50:159–75.
  54. 54. Aditya Satrio CB, Darmawan W, Nadia BU, Hanafiah N. Time series analysis and forecasting of coronavirus disease in Indonesia using ARIMA model and PROPHET. Procedia Comput Sci. 2021;179:524–32.
  55. 55. Hasnain A, Sheng Y, Hashmi MZ, Bhatti UA, Hussain A, Hameed M. Time series analysis and forecasting of air pollutants based on prophet forecasting model in Jiangsu Province, China. Front Environ Sci. 2022;10:945628.
  56. 56. Hyndman RJ, Athanasopoulos G. Forecasting: Principles and Practice. 3rd ed. 2021. [cited 6 May 2025]. Available from: https://otexts.com/fpp3/
  57. 57. Elneel L, Zitouni MS, Mukhtar H, Al-Ahmad H. Examining sea levels forecasting using autoregressive and prophet models. Sci Rep. 2024;14(1):14337. pmid:38906913
  58. 58. Time Series Forecasting with Support Vector Regression (SVR) | by Hassan Ishaq | Medium. [cited 6 May 2025]. Available from: https://medium.com/@hassanishaq/time-series-forecasting-with-support-vector-regression-svr-aab08da8cd77
  59. 59. Sapankevych N, Sankar R. Time Series Prediction Using Support Vector Machines: A Survey. IEEE Comput Intell Mag. 2009;4(2):24–38.
  60. 60. Time Series Forecasting with Support Vector Regression | GeeksforGeeks. [cited 6 May 2025]. Available: https://www.geeksforgeeks.org/time-series-forecasting-with-support-vector-regression/
  61. 61. CART (Classification And Regression Tree) in Machine Learning | GeeksforGeeks. [cited 6 May 2025]. Available: https://www.geeksforgeeks.org/cart-classification-and-regression-tree-in-machine-learning/
  62. 62. Spiliotis E. Decision Trees for Time-Series Forecasting. Available from: https://www.researchgate.net/publication/359865759
  63. 63. Breiman L. Random Forests. Mach Learn. 2001;45(1):5–32.
  64. 64. Random Forest Regression in Python | GeeksforGeeks. [cited 6 May 2025]. Available from: https://www.geeksforgeeks.org/random-forest-regression-in-python/
  65. 65. Yaqoob A, Verma NK, Mir MA, Tejani GG, Eisa NHB, Mamoun Hussien Osman H, et al. SGA-Driven feature selection and random forest classification for enhanced breast cancer diagnosis: A comparative study. Sci Rep. 2025;15(1):10944. pmid:40159513
  66. 66. Sun Z, Wang G, Li P, Wang H, Zhang M, Liang X. An improved random forest based on the classification accuracy and correlation measurement of decision trees. Exp Syst Appl. 2024;237:121549.
  67. 67. Lv C-X, An S-Y, Qiao B-J, Wu W. Time series analysis of hemorrhagic fever with renal syndrome in mainland China by using an XGBoost forecasting model. BMC Infect Dis. 2021;21(1):839. pmid:34412581
  68. 68. Fang Z-G, Yang S-Q, Lv C-X, An S-Y, Wu W. Application of a data-driven XGBoost model for the prediction of COVID-19 in the USA: a time-series study. BMJ Open. 2022;12(7):e056685. pmid:35777884
  69. 69. Luo J, Zhang Z, Fu Y, Rao F. Time series prediction of COVID-19 transmission in America using LSTM and XGBoost algorithms. Results Phys. 2021;27:104462. pmid:34178594
  70. 70. Siami-Namini S, Tavakoli N, Siami Namin A. A Comparison of ARIMA and LSTM in Forecasting Time Series. In: 2018 17th IEEE International Conference on Machine Learning and Applications (ICMLA). 2018. pp. 1394–401.
  71. 71. Hochreiter S, Schmidhuber J. Long short-term memory. Neural Comput. 1997;9(8):1735–80.
  72. 72. Time-Series Forecasting Using GRU: A Step-by-Step Guide | by Sharmasaravanan | Medium. [cited 6 May 2025]. Available from: https://sharmasaravanan.medium.com/time-series-forecasting-using-gru-a-step-by-step-guide-b537dc8dcfba
  73. 73. Arfianti UI, Candra D, Novitasari R, Widodo N, Hafiyusholeh M, Dianita Utami W. Sunspot Number Prediction Using Gated Recurrent Unit (GRU) Algorithm. Indonesian J Comput Cybernetics Syst. 2021;15:141–52.
  74. 74. Ridwan M, Sadik K, Afendi FM. Comparison of ARIMA and GRU Models for High-Frequency Time Series Forecasting. 2024. https://doi.org/10.15294/sji.v10i3.45965
  75. 75. Bergmeir C, Benítez JM. On the use of cross-validation for time series predictor evaluation. Inf Sci (N Y). 2012;191:192–213.
  76. 76. Cleveland RB, Cleveland WS, McRae JE, Terpenning I. STL: A Seasonal-Trend Decomposition Procedure Based on Loess. J Off Stat. 1990;6:3–73.
  77. 77. Kolassa S, Rostami-Tabar B, Siemsen E. Demand Forecasting for Executives and Professionals. 2023; 1–254.
  78. 78. Amoadu M, Ansah EW, Sarfo JO, Hormenu T. Impact of climate change and heat stress on workers’ health and productivity: A scoping review. J Climate Change Health. 2023;12:100249.
  79. 79. Matzarakis A. Comments on the quantification of thermal comfort and heat stress with thermal indices. Atmosphere. 2024;15(8):963.
  80. 80. Baldwin JW, Benmarhnia T, Ebi KL, Jay O, Lutsko NJ, Vanos JK. Humidity’s role in heat-related health outcomes: a heated debate. Environ Health Perspect. 2023;131(5):55001. pmid:37255302
  81. 81. Abdulrazaq MA, Fan WD. Temporal dynamics of pedestrian injury severity: a seasonally constrained random parameters approach. Int J Transp Sci Technol. 2025;20:237–57.
  82. 82. Münzel T, Khraishah H, Schneider A, Lelieveld J, Daiber A, Rajagopalan S. Challenges posed by climate hazards to cardiovascular health and cardiac intensive care: implications for mitigation and adaptation. Eur Heart J Acute Cardiovasc Care. 2024;13(10):731–44. pmid:39468673
  83. 83. Grossman E, Nayak SG, Roach M. The Use of Cooling Centers to Prevent Heat-Related Illness: Summary of Evidence and Strategies for Implementation. 2017. https://doi.org/10.13140/RG.2.2.32267.59688
  84. 84. Predictive Modeling in Urban Planning: Urban Growth and Infrastructure Needs - Terra Drone Arabia. [cited 11 May 2025]. Available from: https://terra-drone.com.sa/predictive-modeling-in-urban-planning-urban-growth-and-infrastructure-needs/
  85. 85. Das P, Barman S, Das P, Barman S. Perspective chapter: An overview of time series decomposition and its applications. Applied and theoretical econometrics and financial crises. 2025. [cited 10 May 2025].
  86. 86. Master the Power of Seasonal Decomposition of Time Series (STL): Unveiling the Essence of Time | by Tushar Aggarwal | Medium. [cited 10 May 2025]. Available from: https://medium.com/@tushar_aggarwal/master-the-power-of-seasonal-decomposition-of-time-series-stl-unveiling-the-essence-of-time-26c19a910314
  87. 87. Sonani MS, Badii A, Moin A. Stock Price Prediction Using a Hybrid LSTM-GNN Model: Integrating Time-Series and Graph-Based Analysis. 2025 [cited 10 May 2025]. Available from: https://arxiv.org/pdf/2502.15813v1
  88. 88. Kaleem M, Guergachi A, Krishnan S. Comparison of Empirical Mode Decomposition, Wavelets, and Different Machine Learning Approaches for Patient-Specific Seizure Detection Using Signal-Derived Empirical Dictionary Approach. Front Digit Health. 2021;3:738996. pmid:34966902
  89. 89. Rühling Cachay S, Mitra P, Hirasawa H, Kim S, Hazarika S, Hingmire D. ClimFormer: A Spherical Transformer Model for Long-term Climate Projections.