Figures
Abstract
As power systems evolve towards greater intelligence and flexibility, flexible interconnection technology has emerged as a critical means to enhance operational reliability and economic performance. This paper presents a data-model dual-driven planning methodology for flexible interconnection systems, integrating a multi-scale spatio-temporal cross-enhanced CNN-LSTM model for load forecasting with a Chaotic Evolutionary Optimization (CEO) algorithm to optimize system design. The proposed framework first constructs an improved CNN-LSTM hybrid architecture, trained on historical load data and simulated feature sets, to predict future load profiles. A novel Tunable Relative Entropy (TRE) metric is introduced as a complementarity quantification index, forming a multi-objective function that incorporates system balance, reliability, economy, and spatio-temporal complementarity. The CEO algorithm is then employed to solve the optimization model, determining the optimal system configuration and operational parameters. Experimental evaluations demonstrate that the forecasting module achieves high accuracy, with a Mean Squared Error (MSE) of 0.000368 and a Mean Absolute Error (MAE) of 0.006334. Moreover, the TRE index improves complementarity efficiency by 3.8%. By leveraging the predictive capability of the hybrid neural network and the CEO algorithm’s optimization efficacy, the proposed approach not only reduces load fluctuation indices but also enhances planning efficiency and operational economy, offering a viable pathway for intelligent power system development.
Citation: Zhao X, Xing X, Guo X, Chao J, Hu D, Zhuan X (2026) An optimization method for flexible interconnection planning based on improved CNN-LSTM prediction and tunable relative entropy-driven chaotic evolution. PLoS One 21(6): e0351324. https://doi.org/10.1371/journal.pone.0351324
Editor: Renier Mendoza, University of the Philippines Diliman, PHILIPPINES
Received: November 11, 2025; Accepted: May 26, 2026; Published: June 12, 2026
Copyright: © 2026 Zhao et al. This is an open access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited.
Data Availability: All relevant data are within the manuscript and in Supporting Information files.
Funding: China Southern Power Grid Company Limited Technology Project (030400KC23110018/GDKJXM20231172). The funders had no role in study design, data collection and analysis, decision to publish, or preparation of the manuscript.
Competing interests: NO authors have competing interests.
1 Introduction
Against the backdrop of the global energy transition and power market reform, modern power systems face multi-faceted challenges, including heightened load volatility and increasing penetration of renewable energy sources [1,2]. As a key solution to these issues, flexible interconnection technologies enable spatio-temporal optimization of power resource allocation by establishing multi-regional power interaction networks [3–5]. However, the planning and operation optimization of these devices involve high-dimensional, nonlinear decision-making problems, where conventional methods often exhibit limitations in solving efficiency and global optimization capability [6].
Load forecasting, as a fundamental link in the planning of flexible interconnection systems, directly affects the subsequent optimization effect. The application of traditional forecasting techniques, such as time series analysis and Kalman filtering, is constrained by inherent deficiencies in addressing nonlinearities and high-dimensionality within datasets, thus failing to adequately capture the differentiated patterns inherent to regional loads [7]. In recent years, deep learning technology has accounted for an increasing proportion in the field of time series forecasting, such as Convolutional Neural Network (CNN); however, deep learning methods generally have problems such as difficulty in determining internal parameters and slow calculation speed [8]. In response to this, hybrid forecasting methods have emerged, combining the characteristics of different types of neural networks to improve the overall optimization performance [9–14].
In recent years, further research on deep learning technology has improved forecasting performance [10]. The CNN-LSTM hybrid model combines the spatial feature extraction capability of CNN and the time series modeling capability of LSTM, and has achieved remarkable results in power load forecasting. For example, Reference [15] proposes using a hybrid CNN and GRU neural network for load forecasting, which helps to reduce waste and address issues related to demand-side management. In addition, there are load forecasting methods that combine traditional forecasting methods with neural networks; Reference [16] employs stacking to integrate base models such as ANN, XGBoost, and LSTM, and uses Lasso regression as the meta-model to fuse the prediction results. Due to the uncertainty of load forecasting itself, the forecasting is affected by various factors; Reference [17] combines feature engineering with Fourier transform and wavelet transform, and through enriching the data and optimizing the model performance, significantly improves the accuracy of power load forecasting. Furthermore, Reference [18] integrates empirical mode decomposition into the neural network model, and combines the Bayesian optimization strategy. The feature decomposition is used to enhance the accuracy of the model.
In addition to load forecasting, the scheduling of the power grid itself is also extremely important. However, research on the installation and scheduling of flexible interconnection devices is still not comprehensive [19–21]. In terms of the overall optimization of flexible interconnection systems, the objective function of flexible interconnection usually involves multi-objective optimization problems such as balance, reliability, and economy [22–24]. Reference [25] conducts adaptive scheduling for flexible interconnection stations in multi-level operating states and proposes an adaptive weight coefficient adjustment method. The work in Reference [26] focuses on flexible interconnection planning within low-voltage distribution networks to improve power supply capacity. However, conventional mathematical programming techniques, including linear and dynamic programming, often face challenges related to prohibitive computational complexity and inadequate global exploration, especially in large-scale or non-convex problems. Consequently, heuristic algorithms have gained prominence as a prominent alternative, favored for their flexibility and robustness in navigating complex, constrained search spaces.
Recent studies have further advanced interconnection planning. For instance, robust planning methods for hybrid microgrids have been proposed to fortify systems against renewable uncertainty [27], and risk-based data-driven approaches have been applied to integrated energy management [28], highlighting the trend towards intelligent optimization.
The optimization of flexible interconnection systems involves equipment configuration, operation scheduling, energy management, and other aspects. Traditional studies mostly adopt mathematical programming methods, such as Mixed Integer Linear Programming (MILP) and Nonlinear Programming (NLP); however, these methods have high computational costs when dealing with large-scale complex systems. Heuristic algorithms such as Genetic Algorithm (GA), Particle Swarm Optimization (PSO), and Ant Colony Optimization (ACO) have become important tools for power system optimization due to their global search capabilities and adaptability to nonlinear problems. For example, Reference [29] suggests using heuristic algorithms to optimize the configuration of flexible interconnections. Based on this research, this paper adopts the Chaotic Evolutionary Optimization (CEO) algorithm combined with chaotic evolution theory, which has high exploration capability and convergence speed, making it suitable for solving such problems [30].
Based on the described context, this paper proposes a data-model dual-driven planning method for flexible interconnection systems. The approach employs an improved CNN-LSTM model to achieve high-precision load forecasting, and utilizes the CEO algorithm to optimize system configuration and operational strategies. This integrated methodology enhances both the planning efficiency and economic performance of flexible interconnection systems. The main innovations of this work are as follows: (1) an enhanced CNN-LSTM hybrid model is designed, incorporating multi-scale convolution, bidirectional LSTM, and an attention mechanism to improve load forecasting accuracy; (2) a novel Tunable Relative Entropy (TRE) index is introduced as a measure of complementarity. A multi-objective function is constructed that simultaneously considers balance, reliability, economy, and spatio-temporal complementarity. Efficient optimization is achieved through a chaotic optimization algorithm.
2 Methodology
2.1 Improved CNN-LSTM load forecasting model
The proposed CNN-LSTM architecture introduces three critical enhancements over baseline models: (1) Multi-scale 1D convolutions (kernels 3 and 5) for hierarchical spatial feature extraction; (2) Bidirectional LSTM with residual connections to mitigate vanishing gradients and capture long-term dependencies; (3) 8-head attention mechanism focused on peak-load time points. These modifications specifically address the spatio-temporal complexity of regional load patterns.
To achieve high-precision load forecasting, this paper designs an improved CNN-LSTM hybrid model, which introduces an attention mechanism and multi-task branch forecasting on the basis of the original hybrid model [31]. Its structure is shown in Fig 1. First, it receives normalized regional feature data, including area, population, electricity consumption type, and climate characteristics. The input matrix is:
where Z is the input matrix, R represents the input features after linear transformation introduced through residual connection, B is the batch size, I is the sequence length, and dmodel denotes the model dimension.
In the multi-scale CNN module, short-kernel and long-kernel convolutions are used to extract spatial features of different scales, and nonlinear transformation is performed through the ReLU activation function:
where s1 is the short-convolution path, l2 is the long-convolution path, represents the one-dimensional convolution operation with the number of output channels C, convolution kernel size K, and padding P, and c is the concatenated feature tensor.
To retain the original input information and alleviate the gradient vanishing problem, residual connection is introduced. First, the input dimension is adjusted through a linear layer to expand r into and add it to the Transformer output. A bi-directional LSTM module is adopted to capture the long-term dependence of the load curve and improve the time series modeling effect. Among them, Bi-LSTM is the bi-directional LSTM network, and its output is
, where 512 is the dimension after concatenation of forward and backward hidden states. An 8-head attention mechanism is used to enhance the focus on key time points, followed by multi-head segmentation:
where indexes the attention head, H is the number of attention heads, and
is the dimension per head. First, linear transformation is performed to generate query Q, key K, and value V. Adaptive average pooling is used to compress the features into a fixed-size vector. Then, the fully connected layer outputs the load curve prediction values and peak prediction at 96 time points:
where p denotes the feature vector after adaptive average pooling.
The model training adopts the adamw optimizer (learning rate = 0.001, weight decay = 10–5) and cosine annealing learning rate scheduler. The loss function is Mean Squared Error (mse), supplemented by the auxiliary loss of peak prediction.
2.2 Chaotic evolutionary optimization (CEO) algorithm
The algorithm flow consists of the following steps: initialization of the population, calculation of fitness using the objective function Cost, update of positions, and iteration until convergence is achieved, culminating in the output of the optimal configuration parameters X and power allocation Pn. The overall algorithm is shown in Fig 2.
Population initialized via Latin Hypercube Sampling within [lb, ub] bounds; iter = current iteration index; n = individual index in population (), Bestt = individual with minimal Cost in current population (applies to both x and y trajectories); the convergence criterion is
or max iterations reached.
The overall CEO algorithm is a population-based evolutionary algorithm, and its unified search framework is first given as:
where xiter and xiter+1 are the current individual and mutated individual, respectively; represents the search step size; and diter is the evolutionary direction.
The CEO algorithm adopts the hyperchaotic characteristics of the two-dimensional heterogeneous group hyperchaotic map, where two individuals xiter and yiter must be mapped to the range of the following equations respectively:
where lb and ub are the lower and upper bounds of the population variables, respectively. Then, inverse mapping is performed on them:
Multiple evolutionary directions are generated according to the mapped individuals:
where , and
and
are the evolutionary directions of the corresponding individuals, respectively.
where ,
are the positions of individuals mutated by chaotic mapping, and a is the search step size of the algorithm, which acts on the mutated individuals generated from the original individuals. To further improve the local development capability of the algorithm, the search for nearby optimal solutions is continued to accelerate the convergence speed of the algorithm:
where Bestiter is the optimal solution of the current population.
2.3 Tunable relative entropy (TRE) for complementarity quantification
To enhance the coherence of the index system in flexible interconnection planning, it is proposed to introduce Tunable Relative Entropy (TRE) as a complementarity quantification index, which dynamically measures the spatio-temporal complementarity of load curves between feeders from the perspective of probability distribution.
For the load curves Pa(tk) and Pb(tk) of two feeders, the time-varying relative entropy is defined as:
where is a smoothing constant (to avoid division by zero). A tunable parameter
is introduced to construct a bidirectional complementarity measure:
where a larger Ccomp(tk) indicates a more obvious negative difference between the two feeders at time tk.
Aggregation in the time dimension is performed to generate a normalized evaluation index; a value closer to 1 indicates strong complementarity in the full time period:
where Cnorm,comp is the normalized complementarity assessment index, and the closer its value is to 1, the stronger the all-time spatio-temporal complementarity between feeders is, k indexes the K discrete time intervals.
3 Establishment of the overall model
3.1 Flexible interconnection model
In this study, the performance indicators of flexible interconnection are divided into three parts, fully considering the role and application requirements of flexible interconnection devices. This subsection defines three normalized performance indicators for flexible interconnection evaluation: (1) Load balance degree, (2) Device utilization, (3) Mutual aid balance strength. The indicators are summarized as follows:
3.1.1 Daily average balance degree of interconnected feeders (normalized).
where t is continuous time in hours ();
is the load rate of feeder i at time tk; Pi,k is the power of feeder i in the time period tk; Pi,rated is the rated capacity of feeder i; n is the number of feeders for flexible interconnection (generally 2); K is the number of time intervals; Cbal and Cbal,norm are the daily average value of load balance difference and the normalized daily average balance degree of the lines interconnected by flexible interconnection devices, respectively.
3.1.2 Daily utilization hours of flexible interconnection devices.
A day (24 hours) is divided into K = 96 time periods, with a time interval of 15 minutes. The mutual aid rule requires that the load rate of at least one end of the feeder is greater than the threshold Pt0 = 40%, and the mutual aid direction is guaranteed to be from the feeder with a larger load rate to the one with a smaller load rate:
where Pmutual,k is the mutual aid power in the k-th time period; Pa,k and Pb,k are the powers of feeders a and b in the k-th time period, respectively; Pn is the rated power of the Flexible Interconnection Device (FID);Pla, Plb represent the rated capacity of feeder a and b; Tu is the daily utilization hours of the FID; and Tu,norm is the normalized daily utilization hours of the FID, hours (15-minute interval).
3.1.3 Mutual aid balance strength of flexible interconnection devices.
The positive and negative mutual aid times are counted respectively when the mutual aid conditions are met:
where
where Eq represents the mutual balancing strength of the flexible interconnection device, Cp and Cn are the positive and negative mutual aid times of the FID, respectively; and
are the load rates of feeders a and b in the k-th time period (before mutual aid); Pt0 is the minimum load rate of the feeder for starting power mutual aid; The input of this model mainly includes relevant data of different lines, mainly the maximum load of the area corresponding to the line, which is solved using the maximum load and the relevant typical load rate curve.
In addition, the comparison of characteristic values before and after adding flexible devices is used to prove the necessity of flexible interconnection devices. This study adopts the Inter-Regional Peak-Valley Difference Standard Deviation (IPVSD) to indicate the characteristic value of load fluctuation in each region:
where M represents the total number of regions involved in the calculation; PVi denotes the peak-valley difference of region i, and is the mean peak-valley difference across M regions.
The mathematical expression for two lines is:
where X denotes the spatial coordinates of nodes, which is distinct from the input matrix X in Eq. (1) and the solution vector X in the CEO algorithm; N represents the number of nodes; X1 represents the first line; X2 represents the second line; and P represents the maximum load of different nodes on the corresponding line. The influence degree of the flexible interconnection device on the two lines can be judged using the indicators calculated above. To ensure the balance of the three indicators, corresponding weight coefficients are set to integrate the indicators into the objective function of the model for the final algorithm optimization:
where represent the weight coefficients of the corresponding indicators, and Cost represents the final optimization objective.
The objective function Cost is constructed as a weighted penalty function to convert the multi-objective problem into a single-objective one. The weights to
are determined using the Analytic Hierarchy Process (AHP), prioritizing the reliability (Cbal) and economy (Tu) of the system. This setup physically represents the total operational “dissatisfaction” of the system, where a lower cost implies a more balanced, economical, and stable grid operation. Cost aggregates normalized deviations from ideal states (1.0 = perfect balance/utilization/complementarity). Minimizing Cost determines a Pareto front over different sets of weight values—a practical single-objective surrogate for multi-objective optimization in engineering deployment.
3.2 Model data classification
In this paper, load data of different types from existing research results are used for data simulation. The main regional classifications include five types: residential areas, commercial areas, industrial areas, residential-commercial mixed areas, and new communities with electric vehicle charging piles. The typical load curves for different regions are shown in Fig 3. The dataset is generated based on their corresponding load rates; multiplying the load rate by the maximum load of different areas finally obtains the complete load data of different areas, which is used for further analysis and calculation [32].
The training dataset for the hybrid neural network comprises 3000 samples. To enhance model robustness and prevent overfitting caused by pure simulation data, we introduce and augment five real-world load samples obtained from a public utility dataset. These real-world samples capture the stochastic volatility characteristics inherent to actual power grid operation. The forecasting module is validated via a hold-out strategy to ensure its generalization capability. The model input features cover regional areas from 0.2 to 3 km2, populations from 5000 to 100000 people, five categories of electricity consumption types, and climate conditions with temperatures ranging from 0 to 40 °C; the model target output is the load curve across 96 time points. To improve the reliability of the overall dataset, this study integrates simulated data with real measured data, as presented in Fig 3.
Additionally, this paper performs data fitting and simulation on loads of different feature types, considering that the electricity load conditions of different regions are different, especially the corresponding peak electricity consumption periods [33].
3.2.1 Residential areas.
It consists of a base value of 0.3 and two Gaussian peaks, centered at 7.5 hours (morning) and 20 hours (evening), respectively, to simulate the morning and evening electricity consumption peaks of residents. In Sections 2.2.1–2.2.5, t is continuous time in hours ().
3.2.2 Commercial areas.
where I[a,b](t) is an indicator function, which is 1 when and 0 otherwise. The base load of 0.15 is superimposed with the active load during working hours (9:00–17:00), and the active load decreases by 15% during the noon period (12:00–13:00). The noise standard deviation increases to 0.03, reflecting the volatility of commercial activities.
3.2.3 Industrial areas.
A base value of 0.35 is superimposed with a sine fluctuation with an amplitude of 0.05 and a period of 24 hours. The sine term simulates the slight fluctuation of continuous day and night production, and the noise characteristics are the same as those of the residential type.
3.2.4 Residential-commercial mixed areas.
It is the weighted average of residential and commercial curves (50% each), retaining the composite characteristics of residential morning and evening peaks and commercial daytime activities, with the noise level consistent with that of the residential type.
4 Simulation and verification
The experiment implements the improved CNN-LSTM model using Python and PyTorch, and the optimization algorithm is implemented in MATLAB. The improved CNN-LSTM model captures different spatial patterns through short and long kernel convolutions, uses the BiLSTM module to handle the long-term fluctuation of the load curve, and finally adopts an 8-head attention mechanism to enhance the prediction accuracy of key time points (such as peaks). Finally, the MSE of the improved CNN-LSTM model on the test set is 0.000368, and the MAE is 0.006334, which is superior to the traditional Multi-Layer Perceptron (MLP) (MSE = 0.000571, MAE = 0.008857) and CNN (MSE = 0.000528, MAE = 0.007349) models, verifying its advantages in spatio-temporal feature extraction. The prediction results of the improved CNN-LSTM and the performance comparison of different neural networks are shown in Figs 4 and 5, respectively.
Using the predicted load curve, the CEO algorithm is used for optimization. The CEO algorithm generates diverse evolutionary directions through hyperchaotic mapping to avoid local optimality, and then adopts a local development mechanism to balance exploration and exploitation. Finally, convergence is achieved after 4 iterations. Compared with other algorithms, the CEO algorithm has improvements in both convergence speed and optimization effect. This paper compares different neural networks using the Crested Porcupine Optimizer (CPO) [34], Tornado Optimizer with Coriolis force (TOC) [35], Rime Optimization Algorithm (RIME) [36], and Fox Optimizer (FOX) [37]. The overall comparison is shown in Fig 6, the iteration data is shown in Table 1, and the performance indicators of the model are shown in Tables 2–5. The optimization process data and final results are provided in S1 File.
The daily average balance degree of interconnected feeders reflects the load difference between feeders, which is calculated by the spatio-temporal standard deviation of load rates; a larger value indicates better load balance. The daily utilization rate of the device is calculated based on the ratio of mutual aid power to rated power to activate the market share; a larger value indicates more sufficient use of the device. The mutual aid balance strength represents the bidirectional energy flow; a larger value indicates more balanced energy flow. From the comparison between Tables 2 and 3, it can be seen that the normalized indicators of flexible interconnection after CEO optimization using the data predicted by the improved CNN-LSTM neural network are larger, indicating a better flexible interconnection effect.
In addition, the IPVSD in Table 2 decreases from 0.08515 to 0.05924 (a decrease of 30.5%). Fig 7 shows the optimization results using the data predicted by the improved CNN-LSTM neural network, with higher benefits of flexible interconnection. In addition, compared with other optimization methods in Tables 3–5, this optimization method achieves the best effect and the highest reduction rate.
5 Conclusion
With the development of new power systems toward intelligence and flexibility, aiming at the key issue of how flexible interconnection technology can improve system reliability and economy, this paper proposes a flexible interconnection planning optimization method based on improved CNN-LSTM prediction and Chaotic Evolutionary Optimization (CEO). The proposed improved CNN-LSTM balances computational efficiency and accuracy, outperforming more complex models like Transformers. As shown in the results, it significantly outperforms standard MLP and CNN baselines. Comparison with the standard CNN-LSTM (Table 5) further validates the efficacy of the proposed multi-scale and attention mechanisms. This method has the advantages of high solution efficiency and strong global optimization capability. By adding flexible interconnection, the inter-regional peak-valley difference in the original power grid is reduced, and the overall operation efficiency of the power grid is improved. In addition, the following conclusions are obtained through theoretical research and simulation experiments:
- (1) Aiming at the load forecasting problem, an improved CNN-LSTM network with multi-scale spatio-temporal cross-enhancement is designed, which integrates multi-scale convolution, bi-directional LSTM, and attention mechanism to improve load forecasting accuracy. High-precision prediction at 96 points is realized on the load dataset of 5 types of regions. Simulation results show that this method achieves low Mean Squared Error (MSE = 0.000368) and Mean Absolute Error (MAE = 0.006334) in load forecasting, providing a high-precision data basis for optimization.
- (2) A flexible interconnection objective function integrating multiple indicators is constructed, which comprehensively considers multi-dimensional indicators such as balance, reliability, and economy, and realizes efficient optimization through the chaotic optimization algorithm. Simulation results show that the chaotic optimization algorithm can effectively reduce the value of the optimization objective function within a fast number of iterations, and compared with other methods, the reduction is more than 5%, achieving a better optimization effect.
- (3) The superior performance of the CEO algorithm can be attributed to the ergodic property of the hyperchaotic map. Unlike random initialization in traditional evolutionary algorithms, chaotic mapping allows the solution to traverse the search space more thoroughly in the early stages, preventing the algorithm from falling into local optima in this high-dimensional non-convex planning problem. Both Cbal,norm and Tu,norm are better the larger they are. From Tables 2–5, it can be seen that compared to other algorithms, CEO achieves the best optimization based on the optimal neural network, with improvements over other algorithms. Cbal,norm increases by more than 0.095%, and Tu,norm increases by more than 22%. The method proposed in this paper has good engineering practicality in load forecasting and system optimization. This study adopts Tunable Relative Entropy (TRE) as a complementarity quantification index, among which the CEO algorithm achieves the best indicator, which is at least 3.8% higher than other algorithms, dynamically measuring the spatio-temporal complementarity of load curves between feeders from the perspective of probability distribution. The future work of this study will explore a multi-objective optimization framework, introduce environmental and social benefit indicators, and further improve the practical value of the method.
References
- 1. Bunster JO, Palavicino CA, Nasirov S. Adaptation and evolution of policy processes to support an emerging renewable market: The case of distributed generation in Chile. Energy Research & Social Science. 2025;125:104127.
- 2. Etanya TF, Tsafack P, Ngwashi DK. Grid-connected distributed renewable energy generation systems: Power quality issues, and mitigation techniques – A review. Energy Reports. 2025;13:3181–203.
- 3. Han J, Zhang D, Jia B, Xie L, Wan W, Tan J, et al. Power loss minimization-oriented reactive power control for wind farm equipped with distributed energy storages using clustering-based data-driven method. Energy. 2025;328:136599.
- 4. Kim I. Optimal allocation of distributed generation and storage systems using multi-dimensional weighted least distance optimization and water droplet algorithms. Energy. 2025;320:135371.
- 5. Li Y, Li Y. Semi-supervised federated learning for collaborative security threat detection in control system for distributed power generation. Engineering Applications of Artificial Intelligence. 2025;148:110374.
- 6. Liu F, Gu B, Qin S, Zhang K, Cui L, Xie G. Power grid partition with improved biogeography-based optimization algorithm. Sustainable Energy Technologies and Assessments. 2021;46:101267.
- 7. Ma K, Nie X, Yang J, Zha L, Li G, Li H. A power load forecasting method in port based on VMD-ICSS-hybrid neural network. Applied Energy. 2025;377:124246.
- 8. Zhang W, Cai Y, Zhan H, Yang M, Zhang W. Multi-energy load forecasting for small-sample integrated energy systems based on neural network Gaussian process and multi-task learning. Energy Conversion and Management. 2024;321:119027.
- 9. Dhanasekaran S, Gopal D, Logeshwaran J, Ramya N, Salau AO. Multi-Model Traffic Forecasting in Smart Cities using Graph Neural Networks and Transformer-based Multi-Source Visual Fusion for Intelligent Transportation Management. Int J ITS Res. 2024;22(3):518–41.
- 10. Ullah K, Ahsan M, Hasanat SM, Haris M, Yousaf H, Raza SF, et al. Short-Term Load Forecasting: A Comprehensive Review and Simulation Study With CNN-LSTM Hybrids Approach. IEEE Access. 2024;12:111858–81.
- 11. Hasanat SM, Ullah K, Yousaf H, Munir K, Abid S, Bokhari SAS, et al. Enhancing Short-Term Load Forecasting With a CNN-GRU Hybrid Model: A Comparative Analysis. IEEE Access. 2024;12:184132–41.
- 12. Gao Z, Yang S, Yu J, Zhao A. Hybrid forecasting model of building cooling load based on combined neural network. Energy. 2024;297:131317.
- 13. Asiri MM, Aldehim G, Alotaibi FA, Alnfiai MM, Assiri M, Mahmud A. Short-Term Load Forecasting in Smart Grids Using Hybrid Deep Learning. IEEE Access. 2024;12:23504–13.
- 14. Hu X, Liu W, Huo H. An intelligent network traffic prediction method based on Butterworth filter and CNN–LSTM. Computer Networks. 2024;240:110172.
- 15. Chiu M-C, Hsu H-W, Chen K-S, Wen C-Y. A hybrid CNN-GRU based probabilistic model for load forecasting from individual household to commercial building. Energy Reports. 2023;9:94–105.
- 16. Guo F, Mo H, Wu J, Pan L, Zhou H, Zhang Z, et al. A Hybrid Stacking Model for Enhanced Short-Term Load Forecasting. Electronics. 2024;13(14):2719.
- 17.
Meng X, Shen H, Liu Z, Li B, Ji D, Yang F, et al. An Enhanced Electric Power Load Forecasting Model Considering Data Feature Extraction. In: 2024 IEEE 8th Conference on Energy Internet and Energy System Integration (EI2), 2024. 779–https://doi.org/10.1109/ei264398.2024.10991786
- 18. Lotfipoor A, Patidar S, Jenkins DP. Deep neural network with empirical mode decomposition and Bayesian optimisation for residential load forecasting. Expert Systems with Applications. 2024;237:121355.
- 19. Yang L, Li Y, Zhang Y, Xie Z, Chen J, Qu Y. Optimal allocation strategy of SOP in flexible interconnected distribution network oriented high proportion renewable energy distribution generation. Energy Reports. 2024;11:6048–56.
- 20. Zhu Z, Nan D, Duan Q, Liu Z, Wu X. Flexible DC Optimization Control Technology Based on Zonal Interconnection of Urban Power Grids. Electronics. 2024;13(23):4679.
- 21. Yan X, Wang Q, Bu J. High penetration PV active distribution network power flow optimization and loss reduction based on flexible interconnection technology. Electric Power Systems Research. 2024;226:109839.
- 22. Jiao M, Li P, Yang M, Wang Q, Han L. Coordinated Flexibility Provision Strategy for Multi-Area Interconnected IEHS. IEEE Trans on Ind Applicat. 2025;61(1):1675–82.
- 23. Yang J, Zhao X. Cascade model predictive control strategy for medium voltage flexible interconnected systems with arc suppression capability under single phase grounding fault. Electric Power Systems Research. 2024;235:110511.
- 24. Tang X, Zheng J, Yang Z, He X, Min H, Zhou S, et al. A Fully Decentralized Optimal Dispatch Scheme for an AC–DC Hybrid Distribution Network Formed by Flexible Interconnected Distribution Station Areas. Sustainability. 2023;15(14):11338.
- 25. Pang D, Wang Z, Shi X, Ge J, Wang Z, Yi H, et al. Multi-Level Dynamic Weight Optimization Scheduling Strategy for Flexible Interconnected Distribution Substations Based on Three-Port SNOPs. Energies. 2025;18(10):2421.
- 26. Li J, Zhang L, Zhang B, Tang W. Coordinated planning for flexible interconnection and energy storage system in low-voltage distribution networks to improve the accommodation capacity of photovoltaic. Global Energy Interconnection. 2023;6(6):700–13.
- 27. Liang Z, Chung CY, Wang Q, Chen H, Yang H, Wu C. Fortifying Renewable-Dominant Hybrid Microgrids: A Bi-Directional Converter Based Interconnection Planning Approach. Engineering. 2025;51:130–43.
- 28. Tan B, Liang Z, Chung CY, Tan H, Wang H, Yang H. Risk-based data-driven energy management for integrated electrical and hydrogen microgrids with improved hydrogen vehicle charging prediction. Applied Energy. 2025;398:126410.
- 29. Ayrir W, Helmi AM, Ramadan HS. Interconnected microgrids optimization via reconfiguration-based modular approach. Applied Energy. 2024;375:124050.
- 30. Dong Y, Zhang S, Zhang H, Zhou X, Jiang J. Chaotic evolution optimization: A novel metaheuristic algorithm inspired by chaotic dynamics. Chaos, Solitons & Fractals. 2025;192:116049.
- 31. Khorram S, Jehbez N. A Hybrid CNN-LSTM Approach for Monthly Reservoir Inflow Forecasting. Water Resour Manage. 2023;37(10):4097–121.
- 32. Aguiar-Pérez JM, Pérez-Juárez MÁ. An Insight of Deep Learning Based Demand Forecasting in Smart Grids. Sensors (Basel). 2023;23(3):1467. pmid:36772509
- 33. Azeem A, Ismail I, Jameel SM, Harindran VR. Electrical Load Forecasting Models for Different Generation Modalities: A Review. IEEE Access. 2021;9:142239–63.
- 34. Abdel-Basset M, Mohamed R, Abouhawwash M. Crested Porcupine Optimizer: A new nature-inspired metaheuristic. Knowledge-Based Systems. 2024;284:111257.
- 35. Braik M, Al-Hiary H, Alzoubi H, Hammouri A, Azmi Al-Betar M, Awadallah MA. Tornado optimizer with Coriolis force: a novel bio-inspired meta-heuristic algorithm for solving engineering problems. Artif Intell Rev. 2025;58(4).
- 36. Su H, Zhao D, Heidari AA, Liu L, Zhang X, Mafarja M, et al. RIME: A physics-based optimization. Neurocomputing. 2023;532:183–214.
- 37. Mohammed H, Rashid T. FOX: a FOX-inspired optimization algorithm. Appl Intell. 2022;53(1):1030–50.