An AI-driven fire risk forecasting framework for urban villages using IGWO-optimized LSTM with incremental learning

Jiangxue Tian; Handong Li; Shuran Lv

doi:10.1371/journal.pone.0350182

Abstract

Artificial intelligence (AI) is reshaping decision-support systems across multiple domains, including risk management and urban safety. Urban villages, characterized by high population density and informal infrastructure, are particularly vulnerable to fire hazards. This study presents an AI-driven fire risk forecasting framework based on an Improved Grey Wolf Optimizer (IGWO) and a Long Short-Term Memory (LSTM) neural network, further enhanced by an incremental learning strategy. IGWO improves hyperparameter convergence and avoids local optima, while the incremental component allows real-time model updates without full retraining. Using real fire incident data from 55 urban villages in Beijing, the proposed IGWO-LSTM-IL model achieves a 92.57% reduction in mean squared error compared to baseline LSTM. The model demonstrates high predictive accuracy, stability, and adaptability, making it a practical tool for intelligent fire risk monitoring and urban safety systems within the scope of AI-transforming urban infrastructure.

Citation: Tian J, Li H, Lv S (2026) An AI-driven fire risk forecasting framework for urban villages using IGWO-optimized LSTM with incremental learning. PLoS One 21(6): e0350182. https://doi.org/10.1371/journal.pone.0350182

Editor: Peng Wu, Anhui University, CHINA

Received: July 2, 2025; Accepted: May 7, 2026; Published: June 2, 2026

Copyright: © 2026 Tian et al. This is an open access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited.

Data Availability: All relevant data are within the manuscript and its Supporting Information files.

Funding: The author(s) received no specific funding for this work.

Competing interests: The authors have declared that no competing interests exist.

1 Introduction

Urban villages refer to areas located within or on the outskirts of a city that have not yet fully urbanized during the process of urban development. They possess some characteristics of both urban and rural areas, such as a lack of planning for building facilities [1], a large number of self built houses, dense population, and chaotic government management. These areas have been formed during the process of urbanization, preserving their original social and cultural characteristics. Urban villages are widely present worldwide, especially in rapidly developing cities in Asia and urbanized areas in Europe and America, reflecting the transitional state between urban expansion and rural traditions [2].

The structure and functional layout of urban villages have a high degree of spontaneity, and the diverse characteristics of different regions enhance the structural and functional diversity of urban villages, making them play a unique role in the process of urbanization construction [3]. At the same time, they bring many safety hazards, such as dense buildings, complex personnel composition, and weak fire protection facilities. Specifically urban villages’ self-built houses have even higher fire risk due to lack of formal design and regulation, using low standard of building material, etc [4]. Beside these, complex composition of residents, lack of safety awareness and necessary protective skills push the fire risk to a incredible high level [5]. As a result urban villages fire’s casualties are more serious than those in cities. Therefore, carrying out research specifically on urban villages’ fire risks benefits building out prediction scheme, and potential risks and accidents are more likely to be eliminated as a result of that [6].

Currently researches on fire risk prediction are mainly focuses on the fields of urban buildings and forest, and these researches have drawbacks of poor timeliness and strong subjectivity since they rely on on-site inspections and empirical formulas. For example, [7] presented an approach to generating a forest fire risk map by integrating geographic information system–based multiple criteria decision analysis (GIS-MCDA) with the analytic hierarchy process (AHP) and a statistical index (SI). In paper [8], it constructed a novel approach to predicting fire risk in buildings by leveraging advanced machine learning techniques and integrating diverse datasets. a fire prediction and evacuation system based on cellular automata for large buildings with dense population. Besides these, more studies focus on fire risks in special scenarios, such as goafs in coal mines [9], inside mines [10], and specific city districts [11], etc. However, the research on urban villages, which are areas with a high incidence of fires, is relatively scarce.

Although some scholars focus on multi-factor influences on urban fire risks, the disaster-causing factors of fires in urban villages are special, and are much more complex than those in urban areas. Conventional analysis methods such as AHP, Entropy Method and Markov model, etc. are difficult to be applied in urban village scenarios. For example, [12] carried out a hierarchical integrated spatial risk assessment in India’s core city using GIS and AHP. This method has issues such as strong subjectivity and inability to handle time-series data.

Some researchers have explored more advanced prediction technologies like machine learning and big data analysis. [13] developed a deep learning model based on the UNet architecture to achieve fast and accurate prediction of fire temperature field under the ceiling in complex planar rooms. However, this model is limited to special scenarios and not suitable for general cases. [14] obtained forest fire susceptibility maps of the Babolrood Watershed in the Mazandaran Province of Iran from random forest, artificial neural network and logistic regression models. Although the random forest model used in this study performs well in terms of prediction accuracy, it does not have the ability to update in real time. The timeliness of the model is limited by the time range of data collection, thus it could not reveal the current fire risk status in real time. [15] designed three kinds of network models based on Long Short-Term Memory (LSTM) to predict fire spread rate, exploring the interaction between fire and wind. Recent studies have further demonstrated the effectiveness of LSTM-based models in complex time-series prediction tasks. For example, Kong et al. [16] applied LSTM networks for short-term load forecasting, while Qin et al. [17] proposed a dual-stage attention-based recurrent neural network for time-series prediction. However, LSTM still suffers from issues such as long training time, sensitivity to hyperparameters, and potential overfitting.

Intelligent swarm optimization algorithm, as a machine learning algorithm, is commonly used in various decision support models due to its powerful optimization ability. At the same time, it can also optimize the parameters of neural networks to improve the performance of models. [18] optimized the hyperparameters in the eXtreme Gradient Boosting (XGBoost) model using Grey Wolf Optimizer (GWO) algorithm to create a fire growth rate warning map for the Liangshan Prefecture in Sichuan Province, China. [19] used a Neural Fuzzy inference system (NF) to establish the forest fire model whereas Particle Swarm Optimization (PSO) was adopted to investigate the best values for the model parameters. However, traditional intelligent swarm optimization algorithms have problems such as being prone to falling into local optima, slow convergence speed, and poor robustness. Recent studies have also emphasized the importance of evidence-based forecasting principles in predictive modeling. Armstrong and Green [20] proposed forecasting method checklists to improve the transparency and reliability of forecasting practices. In addition, large-scale forecasting evaluations such as the M4 competition [21] highlight the importance of rigorous model comparison in time-series forecasting research.

To solve the above issue, we propose a grey wolf optimizer (IGWO) by introducing nonlinear convergence factors and Gaussian mutation operators [22]. By optimizing the hyperparameters of LSTM using the proposed IGWO, a time-series data fire prediction model is constructed. LSTM is a special type of recurrent neural network (RNN) designed to solve the problems of gradient vanishing and exploding in traditional RNNs when processing long sequence data [23]. LSTM introduces input gates, forget gates, and output gates to control the flow of information, enabling it to learn and remember long-term dependency information and more effectively process and predict time-series data [24]. However, LSTM has shortcomings such as long training time, sensitivity to hyperparameters, and susceptibility to overfitting. Thus, an improved GWO (IGWO) algorithm is proposed to improve LSTM in this paper. The Grey Wolf Optimization Algorithm (GWO) is an optimization algorithm that simulates the social behavior and predation strategy of grey wolves [25]. Due to its simple structure, few parameters, and ease of adjustment, it is often applied to optimize the parameters of neural network models and usually exhibits good convergence speed and solution accuracy [26]. The proposed IGWO can further improve the performance of the algorithm by introducing dynamic weights and mutation operators to enhance the LSTM prediction model.

Based on the time-series data of fire-related factors in urban villages, the model can effectively process real-time data and accurately analyze the complex nonlinear relationships between fire influencing factors and fire probability in such areas. At the same time, the incremental learning mechanism enables the model to update in real time and adapt to environmental changes, so that the accuracy and practicality of fire risk diagnosis is significantly improved. This study provides a scientific basis and technical support for fire prevention and control in urban villages. The main contributions and advantages of the proposed method could be given as follows.

An improved Grey Wolf Algorithm (IGWO) is proposed to solve the problems of premature convergence, poor robustness, and slow convergence speed in traditional intelligent swarm optimization algorithms.
A urban village fire risk prediction model based on the Improved Grey Wolf Optimization Algorithm (IGWO) and Long Short Term Memory Network (LSTM) is constructed. The proposed model could realize fire risk prediction in urban villages based on time-series data.
Incremental learning (IL) strategy is integrated to the model to continuously update the model and achieve real-time prediction of fire risk in urban villages.
A full-scale fire experiment is conducted to analyze the fire risk and verify the effectiveness of the proposed method.

The rest of this paper is structured as follows. Sect 1 introduces the basic concepts and related work, including LSTM, the GWO improvement process, and the incremental learning algorithm. Sect 2 describes the construction of the fire risk prediction model based on the proposed IGWO improved LSTM and incremental learning, including the determination of fire risk indicators and the model’s implementation details. Sect 3 presents the experimental setup and case study analysis, verifying the model’s performance. Finally, Sect 4 concludes the achievements of the paper and discusses future research directions.

2 Related work

2.1 LSTM model

LSTM is a recurrent neural network that designed specifically to handle long-term dependencies in time series data effectively [27]. While the convolutional neural network (RNN) can address time sequence issues and retain short-term memory, it lacks the capability for long-distance time sequence retention [28]. Additionally, recurrent neural networks (RNNs) often grapple with the issue of gradient disappearance or divergence during the training process, a phenomenon that frequently precipitates challenges in achieving convergence or the emergence of suboptimal local solutions, thus posing certain constraints [29]. The LSTM neural network incorporates memory cells into its hidden neurons, which control the state of memory cells through three gating units, including the forget gate, input gate, and output gate, to discard and transfer information and update states, ultimately outputting information for the next time step [30]. Thus, LSTM is now able to solve the issues caused by gradient vanishing and gradient explosion since it has the long-term memory capability for time-series data.

The information flow inside an LSTM unit, as illustrated in Fig 1, proceeds through the following four steps:

Download:

Fig 1. Structure of the Long Short-Term Memory (LSTM) unit.

The architecture illustrates the memory cell C_t, hidden state h_t, and the three gating mechanisms (input gate, forget gate, and output gate) that regulate information flow within the LSTM network.

https://doi.org/10.1371/journal.pone.0350182.g001

Step 1: Forget gate.

The forget gate determines what portion of the previous memory cell state m_t−1 should be retained. It uses the current input x_t and the previous hidden state s_t−1:

(1)

where is the sigmoid activation function, W_f is the weight matrix for the forget gate, and b_f is the bias term.

Step 2: Input gate and candidate memory update.

The input gate determines how much new information is written to the memory cell. First, a candidate memory update is generated using the hyperbolic tangent function:

(2)

Then, the input gate value i_t is calculated as:

(3)

Step 3: Memory cell state update.

The current memory cell state m_t is updated by combining the retained memory and the new candidate values:

(4)

Step 4: Output gate and hidden state update.

The output gate determines the new hidden state s_t, based on the updated memory state:

(5)

(6)

This structure allows LSTM networks to selectively retain relevant temporal information and effectively model long-range dependencies. In the context of urban village fire risk prediction, the LSTM’s ability to capture dynamic, nonlinear time-series patterns between fire risk indicators proves highly valuable. However, LSTM performance heavily depends on hyperparameter tuning, such as the learning rate, hidden units, dropout rate, and training epochs. Suboptimal settings may cause overfitting, slow convergence, or entrapment in local minima. To address this, we integrate an Improved Grey Wolf Optimizer (IGWO) to perform global search over the hyperparameter space, thereby enhancing LSTM prediction accuracy and stability.

2.2 Improved Grey Wolf Optimizer (IGWO)

The Grey Wolf Optimizer (GWO) is a metaheuristic inspired by the social hierarchy and hunting strategies of grey wolves [31]. The population is divided into four roles based on fitness: (leader), (sub-leader), (third leader), and (follower). The optimization process is guided by the top three wolves, which simulate the encircling and attacking of prey, as illustrated in Fig 2.

Download:

Fig 2. Social hierarchy and hunting mechanism of the Grey Wolf Optimizer (GWO).

The population is divided into four levels (alpha, beta, delta, and omega), which collaboratively guide the search process toward the global optimum.

https://doi.org/10.1371/journal.pone.0350182.g002

At each iteration, the wolves update their positions based on the estimated location of the prey:

(7)

(8)

Here, X_p(t) is the prey’s position (best solution), and X(t) is the current position. The vectors A and C are defined as:

(9)

(10)

where are random vectors, and a is the convergence factor linearly decreasing over iterations:

(11)

To improve search performance and avoid premature convergence, we propose two enhancements:

First, we replace the linear convergence factor with a nonlinear, sigmoid-based strategy to better balance exploration and exploitation:

(12)

The maximum convergence factor is set to the standard value in this stud.

Second, we modify the position update by simultaneously considering the three best wolves. The combined influence is computed as:

(13)

To further enhance global search ability and maintain population diversity, we integrate a Gaussian mutation mechanism. Given the best solution , a gene x_m (1 ≤ m ≤ n) is randomly selected for mutation with probability p_m:

(14)

where

(15)

Here, lb_m and ub_m are the bounds for the m-th dimension, f_m is the mutation intensity which set to 0.1 provides a moderate perturbation strength to balance exploration and convergence, and is a standard Gaussian noise term. The mutation probability p_m is set to 0.1, which is a commonly used default value in evolutionary optimization algorithms to preserve population diversity while avoiding excessive randomness.

These modifications yield the Improved Grey Wolf Optimizer (IGWO), which achieves stronger exploration in early iterations and better convergence stability in complex, high-dimensional search spaces.

2.3 Benchmark evaluation and analysis

To evaluate the performance of the proposed Improved Grey Wolf Optimizer (IGWO), eight standard benchmark functions from the CEC2017 test suite [32] are employed in Fig 3. These functions represent a diverse set of characteristics, encompassing unimodal, multimodal, separable, and non-separable landscapes. Such diversity ensures that the simulation results are both comprehensive and scientifically robust. For comparative analysis, three widely recognized metaheuristic algorithms are selected: the original Grey Wolf Optimizer (GWO), Particle Swarm Optimization (PSO), and the Great Wall Construction Algorithm (GWCA), as in [33–35]. GWO, PSO, and GWCA are representative algorithms in metaheuristic algorithms, which have been widely applied and studied in academia and industry. This means that there are a large number of research and experimental results that can be used as references, which helps to make fair and comprehensive comparisons [36,37].

Download:

Fig 3. Search landscapes of eight benchmark functions used to evaluate the optimization algorithms.

(a) Sphere (F₁); (b) Schwefel 2.22 (F₂); (c) Schwefel 1.2 (F₃); (d) Schwefel 2.21 (F₄); (e) Quartic with noise (F₅); (f) Rastrigin (F₆); (g) Ackley (F₇); (h) Griewank (F₈).

https://doi.org/10.1371/journal.pone.0350182.g003

As shown in Fig 4, for the quadratic convex function F₁, IGWO quickly converges to the global minimum; When dealing with non convex functions F₂ and F₃, IGWO also exhibits fast and stable convergence characteristics; For the discontinuous function F₄ with a large number of local minima, the IGWO algorithm can effectively avoid getting stuck in local minima and quickly find the global optimal solution; When dealing with polynomial function F₅ with random noise, IGWO exhibits good robustness, although slightly fluctuating, it can still converge to the extremum; For complex functions F₆, F₇, and F₈ containing trigonometric functions, IGWO can effectively handle the non convexity and complexity of the functions, and quickly reach a stable state. These results indicate that the IGWO algorithm has significant advantages over traditional GWO, PSO, and GWCA algorithms in terms of global search capability, convergence speed, adaptability, and robustness. Especially when dealing with non convex function optimization problems with a large number of local minima and complex patterns, its performance is significantly better than other algorithms.

Download:

Fig 4. Convergence curves of four optimization algorithms on the benchmark functions.

(a) F₁: Sphere; (b) F₂: Schwefel 2.22; (c) F₃: Schwefel 1.2; (d) F₄: Schwefel 2.21; (e) F₅: Quartic (with noise); (f) F₆: Rastrigin; (g) F₇: Ackley; (h) F₈: Griewank.

https://doi.org/10.1371/journal.pone.0350182.g004

To balance exploration capability with computational efficiency, the population size for all algorithms is fixed at 10, and the maximum number of iterations is set to 500. Each algorithm is independently executed 30 times to assess convergence stability and robustness under stochastic initialization. The average and standard deviation of the optimal fitness values obtained across these runs are reported for each test function. A detailed overview of the benchmark functions is provided in Table 1, while their search space visualizations and convergence behaviors are depicted in Fig 3 and Fig 4, respectively. To better illustrate comparative performance, Fig 5 and Fig 6 present normalized heatmaps for the 30-dimensional and lower-dimensional (3D and 10D) cases. From the visual analysis in Fig 5, IGWO consistently achieves the lowest normalized average error across most 30-dimensional test functions, highlighting its superior convergence accuracy and stability compared to traditional GWO, PSO, and GWCA. Similarly, Fig 6 confirms that IGWO maintains its optimization effectiveness in lower-dimensional spaces, with particularly strong performance on precision-oriented functions such as F₁ through F₄.

Download:

Table 1. Definitions and properties of the benchmark functions used in this study.

https://doi.org/10.1371/journal.pone.0350182.t001

Download:

Fig 5. Normalized performance comparison of the optimization algorithms on 30-dimensional benchmark functions.

https://doi.org/10.1371/journal.pone.0350182.g005

Download:

Fig 6. Normalized performance comparison of the optimization algorithms on 3-dimensional and 10-dimensional benchmark functions.

https://doi.org/10.1371/journal.pone.0350182.g006

2.4 Incremental learning strategy

Traditional batch learning algorithms operate under the assumption that the complete training dataset is available prior to model training. Once training is completed, the model ceases to learn, and no further updates are incorporated. However, in real-world scenarios, data is typically collected over time rather than in a single batch. Furthermore, the underlying distribution or context of the data may evolve dynamically [38]. In such settings, re-training the entire model from scratch with each new data instance is computationally inefficient and impractical, especially for time-sensitive applications.

To address this challenge, incremental learning algorithms have been proposed. These methods enable continuous learning by updating model parameters as new data arrives. This process allows the model to revise, reinforce, and extend previously acquired knowledge without the need to access or retrain on the full historical dataset [39]. In this study, to enhance the real-time adaptability and responsiveness of the fire risk prediction model for urban villages, we integrate the optimized LSTM model with an incremental learning mechanism. This integration facilitates dynamic updates to the model as new fire-related data becomes available, ensuring timely adaptation to changing risk factors.

The proposed incremental learning strategy proceeds as follows:

Data Acquisition: Collect new characteristic data related to fire hazards in urban villages in real-time.
Model Prediction and Error Calculation: Feed the newly acquired data into the optimized LSTM model to compute the prediction output and the corresponding error.
Parameter Update: Fine-tune the model’s weights and biases using the computed error via gradient-based optimization.
Validation: Evaluate the updated model on a validation set to ensure that its predictive performance remains accurate and stable.

The core of the incremental learning approach lies in efficient parameter adjustment. This work adopts an online gradient descent strategy, where the model parameters are iteratively updated based on the prediction loss. The update rule is defined as:

(16)

(17)

where denotes the model parameters at iteration t, is the learning rate, y_t is the true value, is the predicted value, and represents the gradient operator with respect to the model parameters.

This strategy allows the model to quickly adapt to new conditions in urban fire risk environments, ensuring that predictions remain accurate and up-to-date without incurring the computational cost of full retraining. Specifically, in this study,the main settings are as the Table 2 as: (1) Learning rate: A small constant learning rate (1e-4) is used during incremental updates to ensure stable parameter adaptation. (2) Update frequency: Model updates are performed in a mini-batch manner when new data become available. (3) Drift detection: A performance-based drift detection mechanism is introduced. When the prediction error (RMSE) on recent data exceeds 110% of the historical average, the model update is triggered. (4) Catastrophic forgetting mitigation: To alleviate forgetting, a small portion of historical samples is retained and jointly used with new data during fine-tuning.

Download:

Table 2. Configuration of the incremental learning framework.

https://doi.org/10.1371/journal.pone.0350182.t002

3 Proposed algorithm methodology

3.1 IGWO-LSTM-IL model

The integration of the Improved Grey Wolf Optimizer (IGWO) into the LSTM optimization framework is centered on leveraging IGWO’s global search capability to automatically fine-tune key hyperparameters of the LSTM model—specifically, the number of hidden units, learning rate, and training epochs. Within the IGWO-LSTM architecture, the optimizer is designed to minimize the prediction error of the LSTM on a validation set, thereby enhancing model accuracy and robustness while significantly reducing manual trial-and-error in hyperparameter tuning. Given the complex and nonlinear nature of fire risk factors in urban villages, conventional LSTM models often suffer from limitations such as slow convergence, gradient vanishing/explosion, overfitting, and susceptibility to local optima. By contrast, the IGWO-based optimization approach facilitates an efficient global exploration of the hyperparameter space, enabling the discovery of near-optimal configurations in fewer iterations.

The optimization process begins by initializing a population of grey wolves, where each wolf encodes a unique LSTM hyperparameter configuration. The fitness of each individual is evaluated based on the LSTM model’s prediction error. The positions of the wolves are then iteratively updated according to IGWO’s hierarchical hunting mechanism, in which the top-ranked wolves (, , and ) guide the others toward better solutions. Once the optimal set of hyperparameters is identified, it is used to train the final LSTM model. The overall training and update process of the proposed IGWO-LSTM-IL model is illustrated in Fig 7, while the corresponding implementation logic is outlined in Algorithm 1. The workflow begins with the construction of the LSTM network, followed by hyperparameter optimization using the Improved Grey Wolf Optimizer (IGWO). Once the optimal configuration is identified, the LSTM model is trained and incrementally updated in real-time using newly arriving data. This hybrid approach ensures that the model continuously adapts to dynamic fire risk patterns while maintaining high prediction accuracy and computational efficiency.

Download:

Fig 7. Flowchart of the proposed IGWO-LSTM-IL fire risk prediction framework.

The workflow includes data preprocessing, Bayesian network-based fire probability estimation, parameter optimization using the Improved Grey Wolf Optimizer (IGWO), LSTM model training, and incremental learning for model updating.

https://doi.org/10.1371/journal.pone.0350182.g007

Algorithm 1. IGWO-LSTM Training with Incremental Learning.

The proposed prediction framework is built upon a five-layer LSTM network, which includes: an input layer, an LSTM layer, a dropout layer, a fully connected (dense) layer, and a regression output layer. The input layer receives multivariate time-series data representing fire risk factors in urban villages. The LSTM layer captures temporal dependencies and dynamic patterns through gated recurrent units. To prevent overfitting, the dropout layer randomly disables a portion of neurons during training. The fully connected layer transforms the LSTM outputs to match the dimensionality of the target variable, while the regression layer computes the prediction loss and enables backpropagation during training.

To optimize the LSTM model’s performance, the Improved Grey Wolf Optimizer (IGWO) is employed to tune three key hyperparameters: the number of hidden units, learning rate, and number of training epochs. Each grey wolf represents a unique combination of these hyperparameters, encoded as a position vector in a three-dimensional search space. These wolf individuals are initialized randomly within predefined bounds.

The fitness of each wolf is evaluated using the root mean square error (RMSE) between the predicted and actual values on the training dataset:

(18)

where denotes the number of samples, x_t is the expected output, and is the predicted value of the model.

Through iterative optimization guided by the hierarchy of , , and wolves, IGWO converges to the optimal hyperparameter configuration. This configuration is then used to construct and train the final LSTM model. Once the model achieves satisfactory accuracy on the training set, its performance is evaluated on the testing dataset. To enable real-time adaptation to newly available fire risk data, the model is further refined using an incremental learning strategy. New input samples are standardized and used to update the trained model via online learning, without requiring complete retraining. This strategy not only maintains computational efficiency but also ensures model relevance in dynamic urban environments.

In summary, the IGWO-LSTM model demonstrates strong capabilities in processing time series data and capturing complex interactions between fire risk variables. When integrated with incremental learning, it becomes a robust and adaptive tool for predicting fire risk in real-time in urban village settings.

3.2 Determination of fire risk factor indicators

The causes of fires in urban villages are multifaceted, involving a combination of environmental, demographic, and management-related factors. To systematically identify relevant fire risk indicators for urban villages, we consulted multiple authoritative sources, including the China Fire and Rescue Yearbook, government-issued fire accident investigation reports, and research publications on urban village fire incidents from academic and institutional sources [40].

In addition, on-site field investigations were conducted in collaboration with local fire safety authorities, including the Chaoyang District Fire and Rescue Detachment (2024). A statistical analysis of reported fire incidents over the past five years was also performed to inform indicator selection.The fire risk indicators in this study were constructed to focus on ignition-related (hazard-triggering) factor within urban village environments, based on statistical analysis of 101 real fire incidents in Beijing. The final set of fire risk factors identified for suburban towns is summarized in Table 3.

Download:

Table 3. Fire Risk Factors for Suburban Towns.

https://doi.org/10.1371/journal.pone.0350182.t003

3.3 Risk Prediction and analysis

It should be noted that two independent datasets are used in this study for different purposes. The first dataset consists of real fire incident records collected from 55 urban villages in Chaoyang District during 2023, including 101 fire events. These data are used to statistically analyze the distribution of fire causes and to construct the Bayesian network. The second dataset is a questionnaire-based time-series dataset obtained from 100 representative urban villages over a 30-day observation period. Therefore, the dataset provides 3,000 temporal observations. Expert scoring is applied to evaluate the fire risk indicators, and the Bayesian network is used to generate the corresponding fire probability, which serves as the target variable for training the prediction model. To clarify the overall workflow of the proposed method, the complete data processing and modeling framework is illustrated in Fig 8.

Download:

Fig 8. Data processing pipeline used in the study.

The pipeline illustrates the transformation from expert questionnaire data and fire risk indicators to Bayesian probability estimation and the subsequent preparation of time-series inputs for the IGWO-LSTM-IL prediction model.

https://doi.org/10.1371/journal.pone.0350182.g008

Based on field surveys conducted in collaboration with local government departments, including the Chaoyang District Fire and Rescue Detachment (2024), a total of 101 fire incidents were recorded across 55 urban villages in Chaoyang District during the year 2023. This section analyzes the statistical distribution of fire causes and assesses the predictive performance of the proposed model on these real-world data.

Based on expert opinions, relevant literature [41–43], and local government documents, 12 fire risk factors in urban villages were identified. According to the statistical analysis of 101 fires in urban villages in Chaoyang District, the number of fires caused by each fire factor was obtained, as shown in Table 4.

Download:

Table 4. Statistics of Fire Risk Factors in Urban Villages.

https://doi.org/10.1371/journal.pone.0350182.t004

To quantify the intermediate nodes A₁, A₂, A₃, and A₄ in the Bayesian network shown in Fig 9, conditional probabilities are computed based on the states of their corresponding parent nodes B. For instance, node A₁, representing fire risks related to electrical systems, is influenced by four parent factors: electrical circuit faults (B₁), electrical equipment faults or improper use (B₂), electric vehicle faults or improper use (B₃), and fuel vehicle faults or improper operation (B₄).

Download:

Fig 9. Bayesian network representing the relationships between fire causes and urban village fire risk categories.

https://doi.org/10.1371/journal.pone.0350182.g009

The weight of each parent node contributing to an intermediate node A_j is defined by:

(19)

where is the set of parent nodes for A_j, and is the original proportion of fire cause B_i from the statistics.

If the state (or score) of node B_i is denoted as , the probability of node A_j is computed as:

(20)

The final fire probability is calculated by aggregating the contributions from all intermediate nodes A_j, weighted by their total proportions:

(21)

(22)

4 Case study

According to recent research conducted by the Beijing Municipal Institute of City Planning and Design, there are currently 501 urban villages in Beijing, predominantly located between the 5th and 6th ring roads. In a targeted study conducted over a period of 30 days, 100 representative urban villages were selected. Six experienced experts in fire prevention and control assessed fire risk indicators through structured questionnaires. Participation in the questionnaire survey was voluntary, and informed consent was obtained from all participants prior to participation. The questionnaire collected only professional assessments of fire risk indicators and did not involve personal or sensitive information. Therefore, formal ethical approval was not required for this study. So, a fuzzy–Bayesian integrated framework was adopted to reduce subjectivity in expert judgment. Experts from relevant fields (fire rescue, emergency management, architecture, and safety science) evaluated each fire risk indicator using a 0–100 scale. Their assessments were mapped to triangular fuzzy numbers corresponding to five linguistic levels (Very Low to Very High). Expert weights (0.2 for fire rescue and emergency management experts, 0.1 for the architecture expert, and 0.3 for the senior safety science professor) were assigned according to domain relevance and experience. The weighted fuzzy assessments were aggregated using -cut sets ( = 0.1) and defuzzified using the integral value method to obtain crisp scores () within the interval [0,1]. These scores were then used as inputs to the Bayesian network. The detailed questionnaire design, representative example data, and implementation code used in this study are available at: https://github.com/handongli2019/Urban-Village-Fire-Risk-Prediction-Using-IGWO-Optimized-LSTM-with-Incremental-Learning. The scores were then organized into 100 samples, each with a 30-day time step and 12 fire risk feature inputs for the IGWO-LSTM-IL model. The output mode is set to “last,” meaning each sample outputs the fire probability at the final time step derived from the Bayesian network. The dataset is divided into training and testing sets for model development and validation. Samples 1–75 are used for training and samples 76–100. All reported results are based on strict out-of-sample testing (75/25 train/test split) to ensure the model’s predictive validity and generalization ability. The random seed is set to 42 to ensure reproducibility. To evaluate the practical performance of the proposed fire risk prediction model, this case study was conducted using the fire probability outputs derived from the Bayesian network in conjunction with the IGWO-LSTM-IL model.

To establish a baseline for fire risk prediction in urban village environments, a Long Short-Term Memory (LSTM) network was trained on the dataset without optimization by metaheuristic algorithms. The performance of this model is illustrated in Fig 10 and Fig 11, which highlight the prediction error characteristics and the correspondence between predicted and actual risk values, respectively. Fig 10 presents the distribution of prediction errors. The histogram demonstrates a generally symmetric distribution centered near zero, with a slight left skew (skewness = −0.524), indicating that the model occasionally underestimates fire risk. The superimposed normal distribution curve confirms that the residuals approximate a Gaussian distribution, with a kurtosis of approximately 0.007, suggesting neither pronounced peaks nor heavy tails. The standard deviation of the error is 0.146, and the mean error is −0.050, implying a minor overall overestimation by the model. Fig 11 compares the predicted and actual fire risk values across the sample index. The shaded area represents the error band, defined as ± the absolute error around the predicted value. The visual alignment between the predicted and actual curves confirms the model’s ability to capture the temporal dynamics of risk variation, although the width of the error band in some segments highlights regions with higher uncertainty.

Download:

Fig 10. Distribution of prediction errors for the baseline LSTM model.

https://doi.org/10.1371/journal.pone.0350182.g010

Download:

Fig 11. Comparison between predicted and actual fire risk values for the LSTM model.

The shaded region represents the prediction error band.

https://doi.org/10.1371/journal.pone.0350182.g011

Overall, the LSTM baseline provides reasonable predictive capacity with moderate variance in error. However, the observed bias and residual spread suggest potential for improvement, which motivates the introduction of enhanced models optimized by intelligent algorithms in subsequent sections.

In the simulation, both the Grey Wolf Optimizer (GWO) and the Improved Grey Wolf Optimizer (IGWO) have a population size of 30, with a maximum iteration count of 10, and operate in a 3-dimensional search space. A single-layer LSTM network is adopted in this study, with the number of hidden layers fixed to one. It should be noted that LSTM performance is highly sensitive to hyperparameter settings. While metaheuristic algorithms such as PSO or GWO can theoretically be used to optimize LSTM weights, such an approach would significantly increase the computational burden due to the high dimensionality of the weight space. Therefore, in this study, the IGWO algorithm is employed only for hyperparameter optimization, while the LSTM weights are trained using standard gradient-based backpropagation to ensure computational efficiency and stable convergence.The key LSTM hyperparameters—number of hidden units, number of training epochs, and learning rate—are optimized using the GWO and IGWO algorithms. The batch size is set to 32 based on the dataset characteristics (number of features, time steps, and samples). To ensure robustness, each model was independently executed 30 times with different random initializations, and the average results are reported as the final prediction outcomes. The search space of LSTM hyperparameters was defined as follows: the number of hidden units was set within [16, 128], the learning rate within [0.0001, 0.01], and the number of epochs within [50, 200]. These ranges were selected based on common practices in deep learning applications and prior related studies. The IGWO algorithm was then employed to search within this bounded space to identify the optimal configuration. All comparison models adopted the same search ranges to ensure fairness. Regarding additional baselines such as GRU, ARIMA, or XGBoost, we acknowledge their value as forecasting benchmarks. However, the primary objective of this study is to evaluate the effectiveness of the proposed optimization framework (IGWO) and incremental learning strategy (IL) in enhancing LSTM performance. Therefore, our baseline comparisons are designed as controlled experiments (LSTM vs. GWO-LSTM vs. IGWO-LSTM vs. IGWO-LSTM-IL) to isolate and quantify the contribution of each proposed component. A comprehensive comparison with other forecasting models is an important direction for future research and has been noted in the Conclusion section.

4.1 GWO-LSTM model performance

To enhance the predictive capability of the baseline LSTM model, the Grey Wolf Optimizer (GWO) algorithm was applied to optimize the initial weights and biases of the LSTM network. The resulting hybrid GWO-LSTM model is evaluated in this subsection. Fig 12 and Fig 13 illustrate the distribution of prediction errors and the correspondence between predicted and actual values, respectively. As shown in Fig 12, the prediction errors of the GWO-LSTM model are distributed closely around zero and follow a nearly normal distribution. The histogram reveals a slightly left-skewed profile (skewness = −0.287), indicating occasional underestimations of fire risk, but less so than the baseline model. The kurtosis is approximately 0.014, suggesting the error distribution is neither sharply peaked nor heavy-tailed. Compared to the LSTM model, the GWO-LSTM shows a smaller error spread, with a standard deviation of 0.108 and a mean error of −0.012, reflecting improved precision and reduced bias. Fig 13 depicts the time series comparison between predicted and actual fire risk values, including an error band representing the absolute prediction error. The GWO-LSTM model demonstrates a tighter alignment between the predicted and actual curves, and the reduced width of the error band confirms increased confidence in the model’s output across most data points.

Download:

Fig 12. Distribution of prediction errors for the GWO-LSTM model.

https://doi.org/10.1371/journal.pone.0350182.g012

Download:

Fig 13. Comparison between predicted and actual fire risk values for the GWO-LSTM model.

The shaded region represents the prediction error band.

https://doi.org/10.1371/journal.pone.0350182.g013

The GWO-LSTM model significantly improves prediction accuracy and consistency over the baseline. The integration of the Grey Wolf Optimizer leads to better-initialized network parameters, contributing to more stable and reliable learning dynamics.

4.2 IGWO-LSTM model performance

Building upon the GWO-LSTM model, the Improved Grey Wolf Optimizer (IGWO) is integrated to further refine the initialization and convergence of the LSTM network. This improvement introduces adaptive dynamic coefficients and leader selection strategies, enabling better exploration and exploitation during the training process.

As observed in Fig 14, the prediction errors of IGWO-LSTM are highly concentrated around zero, with a nearly symmetrical distribution (skewness = −0.054) and minimal kurtosis (≈ 0.025). The error histogram closely matches the superimposed normal distribution, suggesting that the residuals are not only small but also statistically well-behaved. The mean prediction error is −0.002, indicating virtually no systemic bias, and the standard deviation of 0.093 represents the lowest error spread among all tested models.

Download:

Fig 14. Distribution of prediction errors for the IGWO-LSTM model.

https://doi.org/10.1371/journal.pone.0350182.g014

Fig 15 illustrates the predicted and actual fire risk values across the sample indices, along with an error band denoting absolute deviations. The error band in the IGWO-LSTM model is notably narrower and more consistent compared to previous models, especially in regions of complex risk fluctuation. This signifies a more stable and accurate predictive response.

Download:

Fig 15. Comparison between predicted and actual fire risk values for the IGWO-LSTM model.

The shaded region represents the prediction error band.

https://doi.org/10.1371/journal.pone.0350182.g015

In conclusion, the IGWO-LSTM model achieves superior performance in terms of both accuracy and stability. The enhancements provided by the IGWO algorithm allow the LSTM network to converge more effectively, reducing both the bias and variance of the predictions. These results demonstrate the value of metaheuristic-driven optimization in fine-tuning deep learning models for fire risk prediction.

4.3 IGWO-LSTM with incremental learning

To further improve the robustness and adaptability of the predictive model, incremental learning (IL) was integrated into the IGWO-LSTM framework. The incorporation of IL enables the model to continually adjust its parameters using new data without retraining from scratch, making it well-suited for dynamic environments such as urban fire risk monitoring.

As depicted in Fig 16, the prediction errors exhibit a near-perfect normal distribution centered around zero. The skewness of the residuals is negligible (≈ −0.006), and the kurtosis is close to zero (≈ 0.021), suggesting the errors are symmetrically and evenly distributed. The model achieved the smallest mean error (−0.002) and the lowest standard deviation (0.086) among all tested approaches, indicating minimal bias and extremely stable predictions.

Download:

Fig 16. Distribution of prediction errors for the proposed IGWO-LSTM-IL model.

https://doi.org/10.1371/journal.pone.0350182.g016

In Fig 17, the predicted values closely track the actual fire risk measurements across all sample indices. The error band surrounding the predictions is consistently narrow, reflecting high prediction confidence and low variance even in complex or volatile conditions. This highlights the effectiveness of incremental learning in maintaining model performance over time.

Download:

Fig 17. Comparison between predicted and actual fire risk values for the proposed IGWO-LSTM-IL model.

The shaded region represents the prediction error band.

https://doi.org/10.1371/journal.pone.0350182.g017

Overall, the IGWO-LSTM-IL model demonstrates superior predictive accuracy, stability, and adaptability. By combining an improved metaheuristic initializer with an incremental learning mechanism, this model delivers the best generalization performance and is particularly well-suited for continuous, real-time fire risk assessment in evolving urban village scenarios.

4.4 Model comparison

Fig 18 illustrates a grouped bar chart comparing the three standard regression metrics root mean squared error (RMSE), mean absolute error (MAE), and R² across all four models. The IGWO-LSTM-IL model achieves the best overall performance, with the lowest RMSE (0.040), lowest MAE (0.034), and highest R² score (0.87). In contrast, the baseline LSTM shows the poorest performance with an RMSE of 0.149 and R² of 0.27, indicating its limited predictive accuracy and generalization. To highlight relative improvements, the IGWO-LSTM-IL model achieved an RMSE reduction of 72.27%, MAE reduction of 68.54%, and R² increase of 217.70% compared to the LSTM baseline. Against the intermediate models (GWO-LSTM and IGWO-LSTM), IGWO-LSTM-IL still achieved notable error reductions of 49.99% in RMSE and 48.59% in MAE, along with an R² improvement of 85.81%. All reported metrics (MSE, RMSE, MAE, R²) are calculated exclusively on the out-of-sample test set as Table 5.

Download:

Fig 18. Performance comparison of different models in terms of RMSE, MAE, and normalized R².

https://doi.org/10.1371/journal.pone.0350182.g018

Download:

Table 5. Comparison of prediction performance for different models.

https://doi.org/10.1371/journal.pone.0350182.t005

Fig 19 provides a direct sample-wise comparison between the actual values and the predicted outputs of all models. The IGWO-LSTM-IL curve shows the closest alignment with the actual data points, demonstrating its ability to follow dynamic trends and avoid large deviations. Meanwhile, the LSTM and GWO-LSTM predictions deviate significantly in several regions, reflecting their relatively weaker learning and generalization capabilities.

Download:

Fig 19. Comparison between predicted and actual fire risk values for multiple models.

https://doi.org/10.1371/journal.pone.0350182.g019

To further assess the statistical significance of the performance improvements, Wilcoxon signed-rank tests were conducted between the proposed IGWO-LSTM-IL model and the baseline models (LSTM, GWO-LSTM, and IGWO-LSTM). The Wilcoxon signed-rank test is a non-parametric statistical test suitable for paired comparisons without assuming normality of the error distribution. The results, summarized in Table 6, indicate that the proposed IGWO-LSTM-IL model significantly outperforms all baseline models (p < 0.01 for all comparisons).

Download:

Table 6. Wilcoxon signed-rank test results comparing IGWO-LSTM-IL with baseline models.

https://doi.org/10.1371/journal.pone.0350182.t006

5 Conclusion

To address the limitations of the traditional Grey Wolf Optimizer (GWO), including slow convergence and premature convergence, this study proposes an enhanced variant known as the Improved Grey Wolf Optimizer (IGWO). The IGWO incorporates a nonlinear convergence factor to accelerate and refine the convergence process, while the integration of a Gaussian mutation operator effectively mitigates the risk of premature convergence. Experimental results confirm that IGWO significantly outperforms traditional GWO, Particle Swarm Optimization (PSO), and Grey Wolf with Chaotic Algorithm (GWCA) in terms of both convergence speed and accuracy, thereby enhancing overall algorithmic robustness and stability.

Building upon the IGWO framework, an IGWO-optimized Long Short-Term Memory (LSTM) network is developed and further integrated with an incremental learning (IL) strategy to construct a fire risk prediction model for urban villages. This hybrid IGWO-LSTM-IL model demonstrates strong capabilities in processing time-series data, capturing the dynamic interdependencies among fire risk factors, and adapting to evolving environmental conditions through real-time model updates. A case study involving fire incident data from urban villages in Chaoyang District, Beijing demonstrates that the proposed IGWO-LSTM-IL model achieves a 92.57% reduction in mean squared error compared to standard LSTM models and a 64.52% improvement over IGWO-LSTM without incremental learning.

Future research could expand the indicator system to include additional factors such as building material density, vegetation distribution, and climate conditions, and validate the model’s generalizability to other cities with different regional characteristics. Moreover, a comprehensive comparison with other forecasting models is an important direction for future research.

Supporting information

S1 Checklist. Inclusivity in Global Research Questionnaire.

https://doi.org/10.1371/journal.pone.0350182.s001

(PDF)

S2 File. Expert questionnaire used for evaluating fire risk indicators in urban villages.

https://doi.org/10.1371/journal.pone.0350182.s002

(DOCX)

Inclusivity in global research

Additional information regarding the ethical, cultural, and scientific considerations specific to inclusivity in global research is included in the Supporting Information (S1 Checklist).

References

1. Zain M, Keawsawasvong S, Thongchom C, Sereewatthanawut I, Usman M, Prasittisopin L. Establishing efficacy of machine learning techniques for vulnerability information of tubular buildings. Eng Sci. 2023;27(2):1008.
- View Article
- Google Scholar
2. Pan W, Du J. Towards sustainable urban transition: a critical review of strategies and policies of urban village renewal in Shenzhen, China. Land Use Policy. 2021;111:105744.
- View Article
- Google Scholar
3. Wu Y, Chen S, Wang D, Zhang Q. Fire risk assessment of heritage villages: a case study on Chengkan Village in China. Fire. 2023;6(2):47.
- View Article
- Google Scholar
4. Rebellon HE, Henao OFP, Gutierrez-Velasquez EI, Amell AA, Colorado HA. Thermoelectric modules: applications and opportunities in building environments for sustainable energy generation: from biomass, municipal waste, and other sources. Eng Sci. 2024.
- View Article
- Google Scholar
5. Wei L, Duan W, Dong S. Research on leased space of urban villages in large cities based on fuzzy kano model evaluation and building performance simulation: a case study of Laojuntang village, Chaoyang District, Beijing. Buildings. 2024;14(1):120.
- View Article
- Google Scholar
6. Yuan D, Yau Y, Bao H. Urban village redevelopment in China: Conflict formation and management from a neo-institutional economics perspective. Cities. 2024;145:104710.
- View Article
- Google Scholar
7. Sivrikaya F, Küçük Ö. Modeling forest fire risk based on GIS-based analytical hierarchy process and statistical analysis in Mediterranean region. Ecological Informatics. 2022;68:101537.
- View Article
- Google Scholar
8. Ahn S, Won J, Lee J, Choi C. Comprehensive building fire risk prediction using machine learning and stacking ensemble methods. Fire. 2024;7(10):336.
- View Article
- Google Scholar
9. Brodny J, Felka D, Tutak M. Applying an automatic gasometry system and a fuzzy set theory to assess the state of gas hazard during the coal mining production process. Eng Sci. 2023.
- View Article
- Google Scholar
10. Wang F, Tan B, Chen Y, Fang X, Jia G, Wang H, et al. A visual knowledge map analysis of mine fire research based on CiteSpace. Environ Sci Pollut Res Int. 2022;29(51):77609–24. pmid:35680744
- View Article
- PubMed/NCBI
- Google Scholar
11. Cetin M, Isik Pekkan Ö, Ozenen Kavlak M, Atmaca I, Nasery S, Derakhshandeh M, et al. GIS-based forest fire risk determination for Milas district, Turkey. Nat Hazards. 2022;119(3):2299–320.
- View Article
- Google Scholar
12. Rani G, Siddiqui NA, Yadav M, Ansari S. Hierarchical integrated spatial risk assessment model of fire hazard for the core city areas in India. Land Use Policy. 2023;126:106536.
- View Article
- Google Scholar
13. Zeng Y, Li Y, Du P, Huang X. Smart fire detection analysis in complex building floorplans powered by GAN. Journal of Building Engineering. 2023;79:107858.
- View Article
- Google Scholar
14. Eslami R, Azarnoush M, Kialashki A, Kazemzadeh F. GIS-based forest fire susceptibility assessment by random forest, artificial neural network and logistic regression methods. JTFS. 2021;33(2):173–84.
- View Article
- Google Scholar
15. Li X, Gao H, Zhang M, Zhang S, Gao Z, Liu J, et al. Prediction of forest fire spread rate using UAV images and an LSTM model considering the interaction between fire and wind. Remote Sensing. 2021;13(21):4325.
- View Article
- Google Scholar
16. Kong W, Dong ZY, Jia Y, Hill DJ, Xu Y, Zhang Y. Short-term residential load forecasting based on LSTM recurrent neural network. IEEE Trans Smart Grid. 2019;10(1):841–51.
- View Article
- Google Scholar
17. Qin Y, Song D, Chen H, Cheng W, Jiang G, Cottrell GW. A dual-stage attention-based recurrent neural network for time series prediction. In: Proceedings of the Twenty-Sixth International Joint Conference on Artificial Intelligence. 2017. p. 2627–33. https://doi.org/10.24963/ijcai.2017/366
18. Zhang L, Shi C, Zhang F. Predicting forest fire area growth rate using an ensemble algorithm. Forests. 2024;15(9):1493.
- View Article
- Google Scholar
19. Tien Bui D, Bui Q-T, Nguyen Q-P, Pradhan B, Nampak H, Trinh PT. A hybrid artificial intelligence approach using GIS-based neural-fuzzy inference system and particle swarm optimization for forest fire susceptibility modeling at a tropical area. Agricultural and Forest Meteorology. 2017;233:32–44.
- View Article
- Google Scholar
20. Armstrong JS, Green KC. Forecasting methods and principles: evidence-based checklists. Journal of Global Scholars of Marketing Science. 2018;28(2):103–59.
- View Article
- Google Scholar
21. Makridakis S, Spiliotis E, Assimakopoulos V. The M4 competition: 100,000 time series and 61 forecasting methods. International Journal of Forecasting. 2020;36(1):54–74.
- View Article
- Google Scholar
22. Alruwais N, Elhessewi GMohS, Saeed MK, Alshammeri M, Alrusaini O, Alkharashi A, et al. Federated learning and GWO-enabled consumer-centric healthcare internet of things for pancreatic tumour. Alexandria Engineering Journal. 2025;122:344–54.
- View Article
- Google Scholar
23. Xu X, Guo C, Wan P, Xu H, Yu Y, Fan J. WT-DSE-LSTM: a hybrid model for the multivariate prediction of dissolved oxygen. Alexandria Engineering Journal. 2025;124:285–96.
- View Article
- Google Scholar
24. Landi F, Baraldi L, Cornia M, Cucchiara R. Working memory connections for LSTM. Neural Netw. 2021;144:334–41. pmid:34547671
- View Article
- PubMed/NCBI
- Google Scholar
25. Hou Y, Gao H, Wang Z, Du C. Improved grey wolf optimization algorithm and application. Sensors (Basel). 2022;22(10):3810. pmid:35632219
- View Article
- PubMed/NCBI
- Google Scholar
26. Wang R, Xu H, Yi D, Song C, Che Y. Automatic detection of Alzheimer’s disease from EEG signals using hybrid PSO-GWO algorithm. Biomedical Signal Processing and Control. 2025;107:107798.
- View Article
- Google Scholar
27. Bharatheedasan K, Maity T, Kumaraswamidhas LA, Durairaj M. Enhanced fault diagnosis and remaining useful life prediction of rolling bearings using a hybrid multilayer perceptron and LSTM network model. Alexandria Engineering Journal. 2025;115:355–69.
- View Article
- Google Scholar
28. Zha W, Liu Y, Wan Y, Luo R, Li D, Yang S, et al. Forecasting monthly gas field production based on the CNN-LSTM model. Energy. 2022;260:124889.
- View Article
- Google Scholar
29. Shiri FM, Perumal T, Mustapha N, Mohamed R. A comprehensive overview and comparative analysis on deep learning models: CNN, RNN, LSTM, GRU. arXiv preprint. 2023. https://arxiv.org/abs/2305.17473
- View Article
- Google Scholar
30. Wen X, Li W. Time series prediction based on LSTM-attention-LSTM model. IEEE Access. 2023;11:48322–31.
- View Article
- Google Scholar
31. Negi G, Kumar A, Pant S, Ram M. GWO: a review and applications. Int J Syst Assur Eng Manag. 2020;12(1):1–8.
- View Article
- Google Scholar
32. Salgotra R, Singh S, Singh U, Kundu K, Gandomi AH. An adaptive version of differential evolution for solving CEC2014, CEC 2017 and CEC 2022 test suites. In: Proceedings of the 2022 IEEE Symposium Series on Computational Intelligence (SSCI). IEEE; 2022. p. 1644–9.
33. Guan Z, Ren C, Niu J, Wang P, Shang Y. Great wall construction algorithm: a novel meta-heuristic algorithm for engineer problems. Expert Systems with Applications. 2023;233:120905.
- View Article
- Google Scholar
34. Jain M, Saihjpal V, Singh N, Singh SB. An overview of variants and advancements of PSO algorithm. Applied Sciences. 2022;12(17):8392.
- View Article
- Google Scholar
35. Makhadmeh SN, Al-Betar MA, Doush IA, Awadallah MA, Kassaymeh S, Mirjalili S, et al. Recent advances in grey wolf optimizer, its versions and applications: review. IEEE Access. 2024;12:22991–3028.
- View Article
- Google Scholar
36. Tang J, Liu G, Pan Q. A review on representative swarm intelligence algorithms for solving optimization problems: applications and trends. IEEE/CAA J Autom Sinica. 2021;8(10):1627–43.
- View Article
- Google Scholar
37. Tang J, Duan H, Lao S. Swarm intelligence algorithms for multiple unmanned aerial vehicles collaboration: a comprehensive review. Artif Intell Rev. 2022;56(5):4295–327.
- View Article
- Google Scholar
38. van de Ven GM, Tuytelaars T, Tolias AS. Three types of incremental learning. Nat Mach Intell. 2022;4(12):1185–97. pmid:36567959
- View Article
- PubMed/NCBI
- Google Scholar
39. Vandenhaute S, Cools-Ceuppens M, DeKeyser S, Verstraelen T, Van Speybroeck V. Machine learning potentials for metal-organic frameworks using an incremental learning approach. npj Comput Mater. 2023;9(1).
- View Article
- Google Scholar
40. Luo Y, Li Q, Jiang L, Zhou Y. Analysis of Chinese fire statistics during the period 1997–2017. Fire Safety Journal. 2021;125:103400.
- View Article
- Google Scholar
41. Andri Hermawan Y, Warlina L, Mohd M. GIS-based urban village regional fire risk assessment and mapping. INJIISCOM. 2021;2(2):31–43.
- View Article
- Google Scholar
42. Wang Y, Xia T, Xu M, Fang Z, Zhang M, Ruan H. Lai’an fire tests: influence of opening condition on the fire dynamics of real urban village dwellings. Fire Technol. 2023;61(3):1269–85.
- View Article
- Google Scholar
43. Liu Z, Li Z, Lin X, Xie L, Jiang J. Study on fire prevention in dong traditional villages in the western hunan region: a case study of Gaotuan Village. Fire. 2023;6(9):334.
- View Article
- Google Scholar

[ref1] 1. Zain M, Keawsawasvong S, Thongchom C, Sereewatthanawut I, Usman M, Prasittisopin L. Establishing efficacy of machine learning techniques for vulnerability information of tubular buildings. Eng Sci. 2023;27(2):1008.
View Article
Google Scholar

[2] View Article

[3] Google Scholar

[ref2] 2. Pan W, Du J. Towards sustainable urban transition: a critical review of strategies and policies of urban village renewal in Shenzhen, China. Land Use Policy. 2021;111:105744.
View Article
Google Scholar

[5] View Article

[6] Google Scholar

[ref3] 3. Wu Y, Chen S, Wang D, Zhang Q. Fire risk assessment of heritage villages: a case study on Chengkan Village in China. Fire. 2023;6(2):47.
View Article
Google Scholar

[8] View Article

[9] Google Scholar

[ref4] 4. Rebellon HE, Henao OFP, Gutierrez-Velasquez EI, Amell AA, Colorado HA. Thermoelectric modules: applications and opportunities in building environments for sustainable energy generation: from biomass, municipal waste, and other sources. Eng Sci. 2024.
View Article
Google Scholar

[11] View Article

[12] Google Scholar

[ref5] 5. Wei L, Duan W, Dong S. Research on leased space of urban villages in large cities based on fuzzy kano model evaluation and building performance simulation: a case study of Laojuntang village, Chaoyang District, Beijing. Buildings. 2024;14(1):120.
View Article
Google Scholar

[14] View Article

[15] Google Scholar

[ref6] 6. Yuan D, Yau Y, Bao H. Urban village redevelopment in China: Conflict formation and management from a neo-institutional economics perspective. Cities. 2024;145:104710.
View Article
Google Scholar

[17] View Article

[18] Google Scholar

[ref7] 7. Sivrikaya F, Küçük Ö. Modeling forest fire risk based on GIS-based analytical hierarchy process and statistical analysis in Mediterranean region. Ecological Informatics. 2022;68:101537.
View Article
Google Scholar

[20] View Article

[21] Google Scholar

[ref8] 8. Ahn S, Won J, Lee J, Choi C. Comprehensive building fire risk prediction using machine learning and stacking ensemble methods. Fire. 2024;7(10):336.
View Article
Google Scholar

[23] View Article

[24] Google Scholar

[ref9] 9. Brodny J, Felka D, Tutak M. Applying an automatic gasometry system and a fuzzy set theory to assess the state of gas hazard during the coal mining production process. Eng Sci. 2023.
View Article
Google Scholar

[26] View Article

[27] Google Scholar

[ref10] 10. Wang F, Tan B, Chen Y, Fang X, Jia G, Wang H, et al. A visual knowledge map analysis of mine fire research based on CiteSpace. Environ Sci Pollut Res Int. 2022;29(51):77609–24. pmid:35680744
View Article
PubMed/NCBI
Google Scholar

[29] View Article

[30] PubMed/NCBI

[31] Google Scholar

[ref11] 11. Cetin M, Isik Pekkan Ö, Ozenen Kavlak M, Atmaca I, Nasery S, Derakhshandeh M, et al. GIS-based forest fire risk determination for Milas district, Turkey. Nat Hazards. 2022;119(3):2299–320.
View Article
Google Scholar

[33] View Article

[34] Google Scholar

[ref12] 12. Rani G, Siddiqui NA, Yadav M, Ansari S. Hierarchical integrated spatial risk assessment model of fire hazard for the core city areas in India. Land Use Policy. 2023;126:106536.
View Article
Google Scholar

[36] View Article

[37] Google Scholar

[ref13] 13. Zeng Y, Li Y, Du P, Huang X. Smart fire detection analysis in complex building floorplans powered by GAN. Journal of Building Engineering. 2023;79:107858.
View Article
Google Scholar

[39] View Article

[40] Google Scholar

[ref14] 14. Eslami R, Azarnoush M, Kialashki A, Kazemzadeh F. GIS-based forest fire susceptibility assessment by random forest, artificial neural network and logistic regression methods. JTFS. 2021;33(2):173–84.
View Article
Google Scholar

[42] View Article

[43] Google Scholar

[ref15] 15. Li X, Gao H, Zhang M, Zhang S, Gao Z, Liu J, et al. Prediction of forest fire spread rate using UAV images and an LSTM model considering the interaction between fire and wind. Remote Sensing. 2021;13(21):4325.
View Article
Google Scholar

[45] View Article

[46] Google Scholar

[ref16] 16. Kong W, Dong ZY, Jia Y, Hill DJ, Xu Y, Zhang Y. Short-term residential load forecasting based on LSTM recurrent neural network. IEEE Trans Smart Grid. 2019;10(1):841–51.
View Article
Google Scholar

[48] View Article

[49] Google Scholar

[ref17] 17. Qin Y, Song D, Chen H, Cheng W, Jiang G, Cottrell GW. A dual-stage attention-based recurrent neural network for time series prediction. In: Proceedings of the Twenty-Sixth International Joint Conference on Artificial Intelligence. 2017. p. 2627–33. https://doi.org/10.24963/ijcai.2017/366

[ref18] 18. Zhang L, Shi C, Zhang F. Predicting forest fire area growth rate using an ensemble algorithm. Forests. 2024;15(9):1493.
View Article
Google Scholar

[52] View Article

[53] Google Scholar

[ref19] 19. Tien Bui D, Bui Q-T, Nguyen Q-P, Pradhan B, Nampak H, Trinh PT. A hybrid artificial intelligence approach using GIS-based neural-fuzzy inference system and particle swarm optimization for forest fire susceptibility modeling at a tropical area. Agricultural and Forest Meteorology. 2017;233:32–44.
View Article
Google Scholar

[55] View Article

[56] Google Scholar

[ref20] 20. Armstrong JS, Green KC. Forecasting methods and principles: evidence-based checklists. Journal of Global Scholars of Marketing Science. 2018;28(2):103–59.
View Article
Google Scholar

[58] View Article

[59] Google Scholar

[ref21] 21. Makridakis S, Spiliotis E, Assimakopoulos V. The M4 competition: 100,000 time series and 61 forecasting methods. International Journal of Forecasting. 2020;36(1):54–74.
View Article
Google Scholar

[61] View Article

[62] Google Scholar

[ref22] 22. Alruwais N, Elhessewi GMohS, Saeed MK, Alshammeri M, Alrusaini O, Alkharashi A, et al. Federated learning and GWO-enabled consumer-centric healthcare internet of things for pancreatic tumour. Alexandria Engineering Journal. 2025;122:344–54.
View Article
Google Scholar

[64] View Article

[65] Google Scholar

[ref23] 23. Xu X, Guo C, Wan P, Xu H, Yu Y, Fan J. WT-DSE-LSTM: a hybrid model for the multivariate prediction of dissolved oxygen. Alexandria Engineering Journal. 2025;124:285–96.
View Article
Google Scholar

[67] View Article

[68] Google Scholar

[ref24] 24. Landi F, Baraldi L, Cornia M, Cucchiara R. Working memory connections for LSTM. Neural Netw. 2021;144:334–41. pmid:34547671
View Article
PubMed/NCBI
Google Scholar

[70] View Article

[71] PubMed/NCBI

[72] Google Scholar

[ref25] 25. Hou Y, Gao H, Wang Z, Du C. Improved grey wolf optimization algorithm and application. Sensors (Basel). 2022;22(10):3810. pmid:35632219
View Article
PubMed/NCBI
Google Scholar

[74] View Article

[75] PubMed/NCBI

[76] Google Scholar

[ref26] 26. Wang R, Xu H, Yi D, Song C, Che Y. Automatic detection of Alzheimer’s disease from EEG signals using hybrid PSO-GWO algorithm. Biomedical Signal Processing and Control. 2025;107:107798.
View Article
Google Scholar

[78] View Article

[79] Google Scholar

[ref27] 27. Bharatheedasan K, Maity T, Kumaraswamidhas LA, Durairaj M. Enhanced fault diagnosis and remaining useful life prediction of rolling bearings using a hybrid multilayer perceptron and LSTM network model. Alexandria Engineering Journal. 2025;115:355–69.
View Article
Google Scholar

[81] View Article

[82] Google Scholar

[ref28] 28. Zha W, Liu Y, Wan Y, Luo R, Li D, Yang S, et al. Forecasting monthly gas field production based on the CNN-LSTM model. Energy. 2022;260:124889.
View Article
Google Scholar

[84] View Article

[85] Google Scholar

[ref29] 29. Shiri FM, Perumal T, Mustapha N, Mohamed R. A comprehensive overview and comparative analysis on deep learning models: CNN, RNN, LSTM, GRU. arXiv preprint. 2023. https://arxiv.org/abs/2305.17473
View Article
Google Scholar

[87] View Article

[88] Google Scholar

[ref30] 30. Wen X, Li W. Time series prediction based on LSTM-attention-LSTM model. IEEE Access. 2023;11:48322–31.
View Article
Google Scholar

[90] View Article

[91] Google Scholar

[ref31] 31. Negi G, Kumar A, Pant S, Ram M. GWO: a review and applications. Int J Syst Assur Eng Manag. 2020;12(1):1–8.
View Article
Google Scholar

[93] View Article

[94] Google Scholar

[ref32] 32. Salgotra R, Singh S, Singh U, Kundu K, Gandomi AH. An adaptive version of differential evolution for solving CEC2014, CEC 2017 and CEC 2022 test suites. In: Proceedings of the 2022 IEEE Symposium Series on Computational Intelligence (SSCI). IEEE; 2022. p. 1644–9.

[ref33] 33. Guan Z, Ren C, Niu J, Wang P, Shang Y. Great wall construction algorithm: a novel meta-heuristic algorithm for engineer problems. Expert Systems with Applications. 2023;233:120905.
View Article
Google Scholar

[97] View Article

[98] Google Scholar

[ref34] 34. Jain M, Saihjpal V, Singh N, Singh SB. An overview of variants and advancements of PSO algorithm. Applied Sciences. 2022;12(17):8392.
View Article
Google Scholar

[100] View Article

[101] Google Scholar

[ref35] 35. Makhadmeh SN, Al-Betar MA, Doush IA, Awadallah MA, Kassaymeh S, Mirjalili S, et al. Recent advances in grey wolf optimizer, its versions and applications: review. IEEE Access. 2024;12:22991–3028.
View Article
Google Scholar

[103] View Article

[104] Google Scholar

[ref36] 36. Tang J, Liu G, Pan Q. A review on representative swarm intelligence algorithms for solving optimization problems: applications and trends. IEEE/CAA J Autom Sinica. 2021;8(10):1627–43.
View Article
Google Scholar

[106] View Article

[107] Google Scholar

[ref37] 37. Tang J, Duan H, Lao S. Swarm intelligence algorithms for multiple unmanned aerial vehicles collaboration: a comprehensive review. Artif Intell Rev. 2022;56(5):4295–327.
View Article
Google Scholar

[109] View Article

[110] Google Scholar

[ref38] 38. van de Ven GM, Tuytelaars T, Tolias AS. Three types of incremental learning. Nat Mach Intell. 2022;4(12):1185–97. pmid:36567959
View Article
PubMed/NCBI
Google Scholar

[112] View Article

[113] PubMed/NCBI

[114] Google Scholar

[ref39] 39. Vandenhaute S, Cools-Ceuppens M, DeKeyser S, Verstraelen T, Van Speybroeck V. Machine learning potentials for metal-organic frameworks using an incremental learning approach. npj Comput Mater. 2023;9(1).
View Article
Google Scholar

[116] View Article

[117] Google Scholar

[ref40] 40. Luo Y, Li Q, Jiang L, Zhou Y. Analysis of Chinese fire statistics during the period 1997–2017. Fire Safety Journal. 2021;125:103400.
View Article
Google Scholar

[119] View Article

[120] Google Scholar

[ref41] 41. Andri Hermawan Y, Warlina L, Mohd M. GIS-based urban village regional fire risk assessment and mapping. INJIISCOM. 2021;2(2):31–43.
View Article
Google Scholar

[122] View Article

[123] Google Scholar

[ref42] 42. Wang Y, Xia T, Xu M, Fang Z, Zhang M, Ruan H. Lai’an fire tests: influence of opening condition on the fire dynamics of real urban village dwellings. Fire Technol. 2023;61(3):1269–85.
View Article
Google Scholar

[125] View Article

[126] Google Scholar

[ref43] 43. Liu Z, Li Z, Lin X, Xie L, Jiang J. Study on fire prevention in dong traditional villages in the western hunan region: a case study of Gaotuan Village. Fire. 2023;6(9):334.
View Article
Google Scholar

[128] View Article

[129] Google Scholar

Figures

Abstract

1 Introduction

2 Related work

2.1 LSTM model

Step 1: Forget gate.

Step 2: Input gate and candidate memory update.

Step 3: Memory cell state update.

Step 4: Output gate and hidden state update.

2.2 Improved Grey Wolf Optimizer (IGWO)

2.3 Benchmark evaluation and analysis

2.4 Incremental learning strategy

3 Proposed algorithm methodology

3.1 IGWO-LSTM-IL model

3.2 Determination of fire risk factor indicators

3.3 Risk Prediction and analysis

4 Case study

4.1 GWO-LSTM model performance

4.2 IGWO-LSTM model performance

4.3 IGWO-LSTM with incremental learning

4.4 Model comparison

5 Conclusion

Supporting information

S1 Checklist. Inclusivity in Global Research Questionnaire.

S2 File. Expert questionnaire used for evaluating fire risk indicators in urban villages.

Inclusivity in global research

References