GEC-DTSP: A GNN–RL-based Edge–Cloud Digital Twin framework for real-time traffic forecasting and adaptive signal control

Fayez Alanazi; Ammar Armghan; Ahmed Jamal Abdullah Al-Gburi; Amr Yousef

doi:10.1371/journal.pone.0350247

Abstract

Urban traffic networks exhibit highly dynamic and nonlinear spatiotemporal interactions that require predictive modeling and adaptive control mechanisms capable of operating under low-latency constraints. This study proposes a GNN–RL-based Edge–Cloud Digital Twin framework for real-time traffic forecasting and adaptive signal control. At the network edge, multi-source traffic data collected from roadside sensors are processed on distributed edge devices to perform multi-step prediction of traffic flow, vehicle density, and congestion states. The forecasting module integrates Graph Convolutional Networks (GCNs) to capture spatial dependencies across the road topology with Long Short-Term Memory (LSTM) units and a Transformer-based predictor to model short- and long-range temporal dynamics. These predicted traffic states are transmitted to a cloud-level Digital Twin Engine, which performs data fusion, state estimation, calibration, and scenario-based simulation to maintain a continuously updated virtual representation of the physical traffic network. Using the forecasted states as inputs, a deep reinforcement learning optimization module performs adaptive signal phase control to minimize average vehicle delay and maximize intersection throughput. The overall framework operates as a closed feedback loop integrating edge-level spatiotemporal forecasting, cloud-level synchronization and simulation, and reinforcement learning–based control policy optimization. Experimental evaluation demonstrates a 17% reduction in average vehicle waiting time and significant improvements in forecasting performance measured using MAE and RMSE, with strong robustness to missing and noisy data conditions. The proposed architecture provides a scalable and low-latency solution for data-driven traffic prediction and signal control within an edge–cloud digital twin environment.

Citation: Alanazi F, Armghan A, Al-Gburi AJA, Yousef A (2026) GEC-DTSP: A GNN–RL-based Edge–Cloud Digital Twin framework for real-time traffic forecasting and adaptive signal control. PLoS One 21(6): e0350247. https://doi.org/10.1371/journal.pone.0350247

Editor: Guangyin Jin, National University of Defense Technology, CHINA

Received: January 20, 2026; Accepted: May 11, 2026; Published: June 1, 2026

Copyright: © 2026 Alanazi et al. This is an open access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited.

Data Availability: All relevant data are located at Kaggle. Smart Traffic Management Dataset: URL: https://www.kaggle.com/datasets/smmmmmmmmmmmm/smart-traffic-management-dataset.

Funding: This work was funded by the Deanship of Graduate Studies and Scientific Research at Jouf University under grant No. (DGSSR-2025-02-01232).

Competing interests: The authors have declared that no competing interests exist.

1. Introduction

1.1 Background

Digital Twin (DT) technology has emerged as a transformative paradigm in intelligent transportation systems, offering a virtual representation of real-world infrastructure for real-time monitoring, prediction, and decision-making [1]. Edge–cloud computing integrates the low-latency, real-time processing power of edge devices with the scalability and storage capabilities of cloud servers for efficient data management [2]. By synchronizing physical traffic networks with computational models, DT enables continuous feedback loops that support operational efficiency and resilience [3]. Modern cities increasingly adopt DT frameworks to simulate congestion, evaluate policies, and enhance transport safety. The integration of Internet of Things (IoT) devices has further accelerated DT applications, as vast sensor-generated data streams provide the necessary input for realistic simulations [4]. However, despite advancements, most DT traffic platforms rely on centralized processing, limiting responsiveness under heavy loads. Addressing these bottlenecks is crucial for building scalable and adaptive traffic ecosystems [5].

Digitalization of urban mobility introduces immense opportunities but also presents several challenges. Traditional systems, constrained by fixed-timing signal control, exhibit limited adaptability under dynamic conditions [6]. The sheer volume of heterogeneous traffic data from vehicles, sensors, and cameras increases computational demands, making centralized processing insufficient [7]. Moreover, current predictive models often fail to capture nonlinear spatiotemporal patterns, resulting in suboptimal forecasts. These issues hinder proactive congestion management, raising concerns about sustainability and safety [8]. Thus, a shift toward distributed, intelligent, and cloud-integrated Digital Twin platforms is essential to meet the growing demands of urban mobility [9].

Edge-cloud integration represents a promising architecture for real-time traffic intelligence, distributing processing between local edge devices and centralized cloud servers [10]. Edge nodes located near sensors enable low-latency decision-making, crucial for dynamic tasks such as adaptive traffic signal control [11]. Simultaneously, cloud servers host advanced analytics and long-term strategic models, aggregating historical and city-wide traffic data to optimize the system [12]. This hybrid architecture supports scalability while maintaining responsiveness, positioning edge-cloud synergy as a central enabler of modern traffic management systems. Such architectures also reduce bandwidth load, since only relevant data is sent to the cloud, minimizing redundancy and improving efficiency [13].

The application of edge-cloud models to traffic systems aligns with broader smart city initiatives. By leveraging edge computing for immediate responses and cloud computing for large-scale predictive modeling, cities can achieve both agility and strategic planning [14]. However, challenges remain in ensuring synchronization between edge and cloud layers, securing data privacy, and managing heterogeneous infrastructures [15]. Furthermore, integrating machine learning and reinforcement learning algorithms within edge-cloud platforms requires robust orchestration frameworks [16]. These challenges underscore the need for carefully designed architectures that can deliver accuracy, scalability, and resilience in traffic operations.

1.2 Challenges

Urban traffic systems are becoming increasingly complex due to rapid urbanization, rising vehicle ownership, and growing demand for sustainable mobility. Traditional traffic management approaches, which rely on centralized processing and static control strategies, struggle to adapt under dynamic conditions. These systems suffer from high latency, limited predictive capabilities, and poor scalability, resulting in congestion, delays, and inefficiency. Furthermore, current predictive models often struggle to capture complex, nonlinear spatial and temporal relationships within road networks, thereby limiting the accuracy of traffic forecasts. A lack of privacy-preserving collaborative learning further limits decentralized adaptation across intersections. As Table 1 summarizes, the research problem addressed in this study is the lack of a scalable, real-time, and intelligent traffic management framework that combines predictive modeling, adaptive optimization, and secure distributed learning.

Table 1. Limitations of traditional traffic management systems and corresponding research needs addressed by GEC-DTSP.

https://doi.org/10.1371/journal.pone.0350247.t001

1.3 Research strategy

The research architecture of GEC-DTSP integrates edge–cloud computing, predictive learning, and digital twin technology to support intelligent traffic management. IoT-based roadside sensors produce data streams that are processed on edge devices with minimal latency. A GCN identifies spatial dependencies across the road network, and LSTM models learn temporal patterns of traffic flow. A Transformer predictor is employed to enhance long-term traffic prediction and facilitate strategic planning. A Digital Twin Engine at the cloud layer fuses the real-time edge data with historical data, keeping the virtual model of the traffic system current for monitoring and simulation. Building upon these predictive layers, an RL module dynamically optimizes traffic signal control, adaptive routing, and congestion mitigation by interacting with both physical and virtual environments. Experimental testing uses simulations and real datasets to compare vehicle mean waiting time, congestion prediction accuracy, scalability, and resilience under missing-data conditions, and the results indicate that GEC-DTSP is an intelligent and sustainable traffic system.

1.4 Contributions

The main contributions of the study are:

An edge–cloud integrated digital twin framework enabling coordinated traffic data processing and real-time system synchronization.
A hybrid spatiotemporal learning model combining Graph Convolutional Networks (GCN), LSTM networks, and Transformer-based forecasting for multi-scale traffic prediction.
A reinforcement learning-based control module for adaptive traffic signal optimization based on dynamic traffic states.
A unified architecture for integrating spatial, temporal, and control-level intelligence in urban traffic systems.

The proposed framework centers on three validated components: (i) spatiotemporal traffic forecasting, (ii) adaptive traffic signal control utilizing reinforcement learning, and (iii) digital twin-based synchronization for real-time traffic state alignment. Conceptual advancements, including privacy-preserving collaborative learning and adaptive routing, are incorporated into the system design but are not practically implemented in the present work.

1.5 Research questions

RQ1: How can spatiotemporal traffic relationships be characterized effectively using a hybrid GCN–LSTM–Transformer architecture?

RQ2: How does the synchronization of edge–cloud digital twins enhance the accuracy of real-time traffic state estimation and prediction?

RQ3: To what degree can reinforcement learning improve adaptive traffic signal control in fluctuating congestion scenarios?

RQ4: How robust is the proposed system in the presence of noisy, incomplete, and large-scale data?

1.6 Paper organization

The rest of the paper is organized as follows. Section 2 reviews recent work on traffic prediction, edge and cloud-based traffic solutions, and digital twin frameworks. Section 3 details the datasets and the proposed methodology. Section 4 presents the experimental setup and evaluation metrics. Section 5 provides the discussion, and Section 6 describes the limitations of the study. Finally, Section 7 draws the conclusion.

2. Related works

2.1 Conventional traffic prediction models

Sattarzadeh et al. (2025) [17] introduced a hybrid traffic flow forecasting model that combined ARIMA, Conv-LSTM, and a shuffle attention layer. The approach integrated statistical and deep learning methods for spatiotemporal time-series analysis to address the nonlinear dynamics of traffic flow. The research demonstrated improved accuracy and resilience compared to standalone ARIMA or Conv-LSTM models. The drawback highlighted was limited scalability across smart city infrastructures without efficient model compression or edge deployment. Su et al. (2024) [18] proposed a low-cost hybrid attention network for traffic forecasting in 5G transport systems. The approach used deep learning in combination with attention layers to capture temporal dynamics at lower computational expense. The results indicated higher accuracy and cost savings at moderate communication capacity, supporting the model’s real-time application in 5G systems. The model had not been fully tested in heterogeneous or non-5G environments, which are still dominant in urban areas and pose challenges to widespread adoption.

Taher et al. (2025) [19] introduced a filter-based traffic congestion prediction technique that incorporated traffic signal movement in dynamic state estimation. The technique used new filtering algorithms to combine real-time flow and real-time signal timing data, thereby improving estimation accuracy. Results validated enhanced responsiveness and improved congestion prediction compared to existing practice. However, it depended on highly accurate and pervasive traffic signal data. Jiang et al. (2025) [20] introduced an adaptive prediction model that can handle sparsely distributed traffic information on city networks. The methodology employed adaptive learning processes to efficiently process sparse and random measurements. It achieved greater precision than traditional static models under incomplete or randomly sampled data. Precision, however, declined under heavy traffic fluctuations, where the adaptive updates were unable to detect sudden interruptions and demand changes. Sengupta et al. (2024) [21] developed a Bayesian model to quantify uncertainty and improve the generalizability of traffic forecasting models. The method integrated Bayesian inference with deep learning, providing prediction intervals alongside point forecasts. The presented results demonstrated greater robustness, reliability, and interpretability in the presence of noisy or uncertain data. However, the drawbacks included high computational cost and reduced inference speed, making it less suitable for real-time applications.

2.2 Edge and cloud-based traffic solutions

Reza et al. (2025) [22] proposed a city-level traffic signal control system based on TD-learning for autonomous vehicle deployment. The approach combined reinforcement learning and SUMO simulations to adjust traffic signal phases dynamically. Outcomes demonstrated a significant reduction in vehicle waiting time and improved traffic flow in simulated cases. Its limitation was its reliance solely on simulations, without real-world deployment data, which reduced its near-term relevance to smart urban infrastructure. Medvei et al. (2025) [23] proposed DeepSIGNAL-ITS, an adaptive traffic signal control system for smart transportation based on deep learning. It utilized neural networks to process multi-source traffic signals and adaptively adjust signals. In simulation tests, it eliminated bottlenecks and enhanced vehicle throughput. Its limitation was that it required high-quality, synchronized, and multimodal input data, which are rarely available in real-world systems; hence, it lacked robustness against erroneous sensing in practical deployment. Kan et al. (2024) [24] proposed optimizing urban traffic management by integrating YOLOv5 for real-time vehicle detection and DeepSORT for tracking within a digital twin framework. The framework analyzed real-time traffic video streams to ensure proper vehicle movement tracking, facilitating dynamic traffic analysis and sophisticated congestion management.

Liu et al. (2021) [25] proposed a smart traffic monitoring system based on computer vision and edge computing, enabling the real-time processing of video streams. It supported traffic density, congestion, and anomaly detection at edge devices with lower latency and lower network load. The model enhanced traffic observation responsiveness in urban areas, enabling timely interventions. However, its performance was limited by camera location, time-varying lighting, and occlusions. Accuracy decreased when traffic was highly dynamic or complex, limiting its potential for large-scale deployment across varied urban settings. Alkarim et al. (2024) [26] proposed ensemble learning-based algorithms for short-term traffic flow prediction in intelligent traffic systems. By integrating several prediction models, the method achieved greater predictive precision and robustness against failures in individual models. It presented enhanced traffic intelligence for real-time planning and management. The approach, however, increased computational complexity, necessitating greater computational capacity.

2.3 Digital Twin applications in urban traffic

Khadka et al. (2025) [27] developed Automated Traffic Signal Performance Measures (ATSPMs) in a digital twin simulation platform. The method offered virtual testing and optimization of traffic signal plans, enabling data-driven improvements in intersection efficiency. Through traffic simulation, it provided insights into reducing congestion and informed operational planning. Its weakness was a reliance on rich, high-fidelity input data and iterative road network models, which are not always readily accessible. Llagostera-Brugarola et al. (2025) [28] proposed a Digital Twin framework for smart transportation in intercity settings, enabling predictive traffic simulation, incident response, and operational optimization. The framework combined sensor information with simulation models to improve traffic observation and planning. Though effective in intercity corridors, it faced scalability issues when extended to complex urban networks with interdependent traffic relationships. These characteristics limited its application in large-scale city deployments, where data heterogeneity and changing conditions are the norm.

Li et al. (2024) [29] developed a driver risk-conscious smart mobility analytics platform using a digital twin to manage traffic in cities. It predicted hazardous driver behavior and optimized traffic flow to make roads safer and keep congestion to a minimum. The system used real-time sensor data and driver-specific data to make decisions. Its use of personal driver data, however, raised privacy issues that limited its broader application, particularly in heterogeneous real-world urban settings. Fu et al. (2024) [30] proposed a digital twin platform for pedestrian safety warnings at an individual city intersection. The platform combined real-time sensor data with predictive modeling to warn pedestrians and drivers of potential collisions, enhancing safety. Simulation results verified its effectiveness in reducing accident hazards. Its dependence on quality targets and live data reduces resilience when sensors fail or provide inadequate data, limiting its practical use in larger metropolitan traffic networks.

Yanzhan Chen et al. [31] proposed 3D Gaussian splatting (3D-GS) for intelligent 3D traffic accident reconstruction. To segment large-scale 3D point clouds, a clustering parameter stochastic optimization model and a mixed-integer programming Bayesian optimization (MIPBO) method were proposed. In numerical trials, 3D-GS rendered high-quality, seamless, real-time traffic accident scenarios with a structural similarity index of 0.90 across municipalities. Additionally, the MIPBO method converged quickly, identifying optimal parameters in 3–5 iterations and achieving a high R² value of 0.8 on a benchmark clustering problem. Finally, the Gaussian Mixture Model with MIPBO distinguished accident scene traffic components better than standard clustering techniques. However, 3D-GS struggles with visual rendering at night and in rain.

Yanzhan Chen et al. [32] addressed roadside LiDAR placement for cooperative traffic detection with a novel chance-constrained stochastic simulation optimization approach. For roadside LiDAR (RSL) placement, the study proposes a chance-constrained stochastic simulation-based optimization (SO) model to maximize mean Average Precision (mAP) with a budgeted number of RSLs and a chance constraint that ensures a specific recall value under traffic uncertainties. An RSL placement plan is evaluated by a data-driven deep learning strategy based on a high-fidelity co-simulator, and this evaluation is black-box, computationally expensive, and stochastic. These issues are addressed by a Gaussian Process Regression-based Approximate Knowledge Gradient (GPR-AKG) sampling approach. In numerical trials on a bi-directional eight-lane roadway, an RSL placement plan optimized by GPR-AKG achieves an mAP of 0.829 while meeting the chance constraint, outperforming empirically developed alternatives. Under the optimal design, cooperative vehicle detection and tracking may reduce false alarms and missed detections caused by significant vehicle occlusions and produce comprehensive and smooth vehicle trajectories. Analysis of detection coverage and average effective work time supports selecting center-mounted RSLs in the optimum plan. Deploying 20 RSLs in the optimum design is justified by the mAP balancing analysis.

Jingke Yan et al. [33] presented a multimodal arc detection network based on denoising diffusion probabilistic models (DDPMs-MILNet) for arc detection in railway systems with limited data. To learn complex image characteristics, a DDPM is pretrained on many unlabeled images. This model extracts features and fine-tunes a hierarchical variation semantic decoder to improve performance under small-sample settings and reduce dependency on labeled datasets. Based on this, an audiovisual semantic decoder uses audio signals as semantic cues to provide visual characteristics with modality information. This method decreases the model’s dependence on visual input and allows it to detect the arc’s visual target even when the item is not seen and heard, relieving the problems of small sample numbers. DDPM-MILNet performs well with little data in complicated railway settings, suggesting it might be used for railway system condition monitoring and anomaly identification.

Xin Wang et al. [34] introduced the adaptive fused domain-cycling variational generative adversarial network (AFDVGAN) for machine fault diagnosis under data scarcity. First, a smooth-regularized variational framework stabilizes the latent space representation, increasing the structural consistency of synthetic data and training stability. Second, a ratio-controlled domain-cycling mechanism dynamically coordinates feature transfer between spatial, time-frequency, and frequency domains to improve multi-domain feature modeling and data synthesis. Finally, a multi-metric guided adaptive data fusion technique fuses synthetic and real data using statistical and time-frequency metrics to improve the decision-making accuracy of the diagnostic model. AFDVGAN produces better synthetic data than conventional and state-of-the-art approaches in electric locomotive and high-speed aerospace bearing case studies. Data fusion improves locomotive diagnostic accuracy to 99.81% and aerospace bearing accuracy to 99.16%.

Hui Wang et al. [35] proposed a generalized Koopman neural operator (GKNO) for data-driven modeling of electric railway pantograph–catenary systems. The GKNO is implemented using an autoencoder and an upgraded Transformer to describe complicated nonlinear dynamic systems with many degrees of freedom. It has observable, evolution, and invertible functions. As an embedding model, the encoder maps the original system’s state variables into an observable space with linear dynamics. Using an autoregressive task, an enhanced Transformer model learns the evolution function of the embedding space. Finally, using the embedding space, the decoder reconstructs the original system’s state variables.

HuanZhong Sun et al. [36] proposed a Spatio-Temporal Graph Neural Network for Traffic Prediction Based on Adaptive Neighborhood Selection (STGNN-ANS). STGNN-ANS filters undesirable neighbors to create a new, more flexible graph structure. Its spatio-temporal serial module combines bidirectional long short-term memory (BiLSTM) with a graph convolution network (GCN) enhanced by a self-attention mechanism to capture the spatio-temporal dependence of traffic data and achieve superior prediction accuracy in both short- and long-range scenarios.

Qinyao Luo et al. [37] developed the Long-Short Term Transformer-based spatiotemporal neural network for traffic flow forecasting (LSTTN). The model learns compressed and contextual subseries temporal representations from long historical series by pretraining a masked subseries Transformer to infer the content of masked subseries from a small portion of unmasked subseries and their temporal context. After learning representations, stacked 1D dilated convolution layers extract long-term trends and dynamic graph convolution layers extract periodic features. A short-term trend extractor helps LSTTN learn fine-grained short-term temporal cues for time-step level prediction. LSTTN makes final predictions by combining long-term trends, periodic characteristics, and short-term features. Experimental results on four real-world datasets reveal that the LSTTN model improves 60-minute-ahead long-term forecasting by 5.63% to 16.78% over baseline models.

Saira Karim et al. [38] investigated dynamic spatial correlation in Graph WaveNet for road traffic prediction. This work uses attention mechanisms to calculate time-domain attention scores for the self-adaptive adjacency matrix, adding dynamic spatial dependencies to the Graph WaveNet model. The authors compared the computation cost of their graph attention network model with multi-head attention against Graph WaveNet for forecasts of up to 60 minutes. Their model gave the best 60-min forecast, with root-mean-square error decreasing by 3.4% and 4.76% on the PEMS-BAY and METR-LA datasets, respectively. However, attention score calculation increases model training time. Table 2 summarizes the related work.

Table 2. Summary of the related works.

https://doi.org/10.1371/journal.pone.0350247.t002

2.4 Research gap

Despite progress in intelligent transport systems, current traffic management methods remain limited by static signal timing, centralized processing, and inadequate predictive modeling. Existing studies using hybrid deep models, such as ARIMA–ConvLSTM [17] and attention-based neural networks [18], achieve better short-term predictions but lack the ability to model nonlinear spatiotemporal dependencies, which are critical for real-time adaptation. Reinforcement-based and signal intelligence architectures [22,23] provide localized control without global coordination or scalability. Digital twin-based deployments [27–29] offer virtual monitoring but are hindered by synchronization delays and reliance on high-latency cloud infrastructure. Therefore, the most significant research gap is the absence of a framework such as GEC-DTSP that integrates edge intelligence, spatiotemporal learning (GCN–LSTM–Transformer), and real-time digital twin synchronization to achieve adaptive, scalable, and sustainable traffic management.

3. Methods and methodology

GEC-DTSP aims to enhance real-time traffic control in cities by combining edge computing, deep learning, and digital twin technologies. The system architecture in Fig 1 uses edge devices for low-latency data processing and cloud servers for large-scale simulation and coordination. IoT traffic sensors provide an input data stream that is continuously processed at the edge layer using a GCN to represent spatial patterns across the interacting road network. Temporal trends and traffic movement are modeled using LSTM models, while Transformer-based predictors support long-term prediction of congestion patterns. The cloud-level Digital Twin Engine enables coordinated virtual simulation of the physical traffic network by combining real-time edge data with historical data for predictive analysis, scenario simulation, and strategic decision-making. Together, these components bring responsiveness, scalability, and sustainability to intelligent transportation infrastructure.

Fig 1. System architecture of the GEC-DTSP methodology.

https://doi.org/10.1371/journal.pone.0350247.g001
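
The spatial modeling step described above can be illustrated with a single graph-convolution layer. The sketch below assumes the widely used symmetric-normalization propagation rule, H' = ReLU(D^{-1/2} (A + I) D^{-1/2} H W); the text does not state which GCN variant the framework uses, and the three-node graph, the feature values, and the function names are hypothetical.

```python
import numpy as np


def normalized_adjacency(adj):
    """Symmetric normalization D^{-1/2} (A + I) D^{-1/2} of a road graph."""
    a_hat = adj + np.eye(adj.shape[0])   # add self-loops so each node keeps its own state
    deg = a_hat.sum(axis=1)              # node degrees, always >= 1 because of self-loops
    d_inv_sqrt = 1.0 / np.sqrt(deg)
    return a_hat * d_inv_sqrt[:, None] * d_inv_sqrt[None, :]


def gcn_layer(a_norm, features, weights):
    """One graph-convolution step: ReLU(A_norm @ X @ W)."""
    return np.maximum(a_norm @ features @ weights, 0.0)


# Toy example (assumed): 3 intersections on a line (0 - 1 - 2), 2 features per node.
adj = np.array([[0, 1, 0],
                [1, 0, 1],
                [0, 1, 0]], dtype=float)
x = np.array([[1.0, 0.0],
              [0.0, 1.0],
              [1.0, 1.0]])
w = np.eye(2)  # identity weights, so the output is just the neighbour-smoothed features
h = gcn_layer(normalized_adjacency(adj), x, w)
```

Each row of `h` mixes a node's own features with those of its direct neighbours, which is how a GCN layer propagates traffic state along the road topology. In a trained model, `w` would be learned rather than fixed.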

3.1 Dataset explanation

The Kaggle Smart Traffic Management Dataset [39] is a small yet useful dataset comprising approximately 2,000 rows and 12 features that describe city traffic conditions. It includes variables such as vehicle count, signal timing, lane occupancy, and other contextual data, making it well suited to prototyping traffic analysis methods at a small scale. It is beneficial for investigating algorithmic feasibility, prototyping, and rapid experimentation in environments with constrained computational resources. In contrast, the PEMS-BAY dataset [40], available on Zenodo, is a real-world benchmark of high-volume data collected from loop detectors in the San Francisco Bay Area freeway network. It comprises high-resolution speed and volume observations from hundreds of sensors, along with an adjacency graph representing the road network’s spatial topology.

The proposed framework uses a dual-dataset approach to independently support localized traffic signal control and large-scale modeling of traffic dynamics. The Kaggle Smart Traffic Management dataset, with about 2,000 samples and 12 traffic-related features, serves as the basis for controlling traffic signals at the intersection level. It contains fine-grained data, including the number of vehicles, lane occupancy, queue length, and signal phase state, which are directly applicable to adaptive signal optimization at urban intersections. PEMS-BAY, on the other hand, is integrated to model the propagation of macroscopic traffic flow and long-range spatiotemporal dependencies across large-scale road networks. It consists of data from 325 loop detectors installed throughout the San Francisco Bay Area freeway network, sampled at 5-minute intervals. Because it was originally intended for highway traffic forecasting, it is not directly applied in training intersection-level signal control. Rather, it helps to learn the evolution patterns of global traffic, spatial dependencies among distributed sensors, and temporal continuity of traffic flow, which are crucial for calibrating the predictive modeling layer of the digital twin. Fig 2 shows the tiered integration of the datasets for intelligent traffic management.

Fig 2. Layered architecture for Edge-Cloud Digital Twin Traffic Management using Kaggle Smart Traffic Dataset and PEMS-BAY Dataset.

https://doi.org/10.1371/journal.pone.0350247.g002

In the Edge Layer, the Kaggle Smart Traffic Management Dataset delivers small-scale, real-time attributes of vehicle volumes, signal timing, and intersection queues for local processing. The Cloud Layer leverages large-scale PEMS-BAY data, including loop detector speeds, traffic volumes, and occupancy rates, across a broad sensor network. Lastly, the Digital Twin Layer synchronizes edge attributes with historical cloud-based data in real time to support adaptive traffic signal control, congestion forecasting, and dynamic urban mobility planning.
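
For multi-step forecasting, a sensor series such as PEMS-BAY (325 detectors, 5-minute sampling) is commonly cut into overlapping input and target windows before training. The sketch below shows one way to do this; the window lengths (12 past steps, 3 future steps), the synthetic data, and the function name `make_windows` are illustrative assumptions, not values reported in this study.

```python
import numpy as np


def make_windows(series, n_in=12, n_out=3):
    """Split a (time, sensors) array into (input, target) windows.

    n_in  : number of past steps fed to the model (12 steps = 1 hour at 5 min)
    n_out : number of future steps to predict (3 steps = 15 minutes)
    Both lengths are illustrative assumptions, not values from the paper.
    """
    n_samples = series.shape[0] - n_in - n_out + 1
    if n_samples <= 0:
        raise ValueError("series is too short for the requested windows")
    inputs = np.stack([series[i:i + n_in] for i in range(n_samples)])
    targets = np.stack([series[i + n_in:i + n_in + n_out] for i in range(n_samples)])
    return inputs, targets


# One day of synthetic 5-minute readings for 325 sensors (288 steps per day).
rng = np.random.default_rng(0)
speeds = rng.uniform(20.0, 70.0, size=(288, 325))
x, y = make_windows(speeds)
```

Here `x` has shape (samples, 12, 325) and `y` has shape (samples, 3, 325). When such windows are split into training and test sets, a chronological split avoids leaking future observations into training.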

3.2 Data acquisition

The Data Acquisition module serves as the foundational layer that gathers multimodal traffic data from IoT-enabled infrastructure deployed across intersections. The sensing network consists of vehicular detectors, camera-based vision sensors, and signal controllers, all integrated through an edge gateway. These devices continuously monitor critical traffic parameters, including vehicle count, speed, flow rate, and signal phase, serving as the primary inputs for real-time modeling and control. The raw data stream from each sensor node i at time can be represented as a multivariate feature vector, as shown in equation 1.

(1)

where refers to total number of vehicles detected at the intersection i at time . is the average vehicle speed, is the traffic flow rate (vehicles per minute), denotes the current signal phase, and corresponds to environmental factors such as weather or illumination at the time . is the total detection zones or camera frames at the intersection i. refers to the instantaneous velocity of the vehicle at the intersection i at time . is the flow rate (vehicles per unit time). denoted time window or observation interval (e.g., 30 s or 1 min). is the temperature at the location i, is the humidity, is the rainfall intensity, refers to the normalization coefficients or learned weights. The sampling frequency defines the data acquisition rate , ensuring temporal resolution consistency, , where denotes the sampling interval between consecutive measurements. To maintain synchronization across heterogeneous sensors, a timestamp alignment mechanism is applied, ensuring that, . This synchronization ensures spatiotemporal consistency across different data sources before they are forwarded to the edge layer for preprocessing. To ensure robust temporal synchronization across heterogeneous traffic sensors, a tolerance-window-based alignment strategy is adopted. Instead of selecting the minimum timestamp difference directly, a bounded temporal matching function is used: , where represents the maximum allowable synchronization window (e.g., 5–10 seconds depending on sensor type). If no match exists within this window, linear interpolation is applied between adjacent timestamps to preserve continuity in the data stream.
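The tolerance-window alignment with an interpolation fallback described above can be illustrated with a minimal sketch. The function name `align_reading`, its argument layout, and the choice to return `None` outside the covered time range are assumptions made for illustration; they are not taken from the original implementation.

```python
from bisect import bisect_left


def align_reading(t_ref, times, values, tau):
    """Align one secondary-sensor stream to a reference timestamp t_ref.

    times must be sorted in ascending order (seconds). If the nearest sample
    lies within the tolerance window tau, its value is used directly.
    Otherwise the value is linearly interpolated between the two adjacent
    samples. Returns None if neither option is available (hypothetical
    convention chosen for this sketch).
    """
    if not times:
        return None
    k = bisect_left(times, t_ref)
    # Candidate neighbours: the sample just before and the one at/after t_ref.
    candidates = [j for j in (k - 1, k) if 0 <= j < len(times)]
    nearest = min(candidates, key=lambda j: abs(times[j] - t_ref))
    if abs(times[nearest] - t_ref) <= tau:
        return values[nearest]
    if 0 < k < len(times):
        # No match inside the window: interpolate between adjacent timestamps.
        t0, t1 = times[k - 1], times[k]
        w = (t_ref - t0) / (t1 - t0)
        return values[k - 1] + w * (values[k] - values[k - 1])
    return None
```

For example, with samples at 0 s, 10 s, and 40 s and a 5-second window, a reference time of 12 s takes the 10 s sample directly, whereas a reference time of 25 s falls back to interpolation between the 10 s and 40 s samples.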

3.3 Edge processing

Each edge node i receives a multidimensional traffic feature vector as defined in equation 1. To align heterogeneous sensor readings with the standard data ranges of the Kaggle Smart Traffic Management dataset, min–max normalization is performed. To improve data reliability and suppress temporal noise, a moving-average smoothing filter is applied. From the processed signals, a localized congestion index is computed to capture the dynamic traffic density. To facilitate efficient communication between the edge and cloud layers, the processed data undergo feature compression using a lightweight encoder, as in equation 2.

(2)

where is the raw feature value, are feature-wise minimum and maximum dataset-calculated values, and is the normalised value that provides equal scaling across various intersections. represents the smoothing window size and is the smoothed observation. represents the congestion degree at the node i, are weight coefficients indicating the contribution of vehicle count, speed, and flow rate, respectively, while and denote their respective maximum observed values. A higher value implies increased congestion severity. represents the encoding function, often implemented through a compact CNN or autoencoder, and is the latent representation transmitted to the cloud-based Digital Twin Engine. The spatiotemporal feature encapsulation reduces bandwidth usage significantly without compromising the critical spatiotemporal aspects of traffic.
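The first three edge-processing steps (min–max normalization, moving-average smoothing, and the weighted congestion index) can be sketched as follows. This is an illustrative sketch only: the function names, the default weights, and in particular the way speed enters the index as `1 - v / v_max` (so that lower speed raises congestion) are assumptions, since the exact form of equation 2 is not reproduced here. The encoder step is omitted.

```python
import numpy as np


def min_max(x, x_min, x_max):
    """Min-max normalization using dataset-level feature bounds."""
    return (x - x_min) / (x_max - x_min)


def moving_average(x, w):
    """Trailing moving average over w samples (shorter window at the start)."""
    x = np.asarray(x, dtype=float)
    out = np.empty_like(x)
    for t in range(len(x)):
        out[t] = x[max(0, t - w + 1): t + 1].mean()
    return out


def congestion_index(n, v, f, n_max, v_max, f_max,
                     alpha=0.4, beta=0.3, gamma=0.3):
    """Weighted congestion index from count n, speed v, and flow f.

    Assumption: the speed term is inverted (1 - v / v_max); the weights
    alpha, beta, gamma are placeholder values, not the authors' settings.
    """
    return alpha * n / n_max + beta * (1.0 - v / v_max) + gamma * f / f_max
```

With these placeholder weights the index lies in [0, 1] whenever the inputs stay within their observed maxima, and a higher value corresponds to more severe congestion, in line with the description above.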

Fig 3 shows the edge node data preprocessing in GEC-DTSP. The edge processing phase therefore combines real-time data normalization, temporal smoothing, congestion estimation, and feature encoding, improving computational efficiency and sending only high-value, structured data to the cloud layer for synchronization and predictive modeling. The preprocessed, encoded edge features constitute the input of the hybrid GCN–LSTM–Transformer model. Their compressed representations retain important spatial and temporal details while minimizing redundancy for low-cost transmission to the cloud. The GCN captures spatial interdependencies between intersections, the LSTM captures temporal traffic patterns, and the Transformer uses attention mechanisms for multi-horizon prediction.

Fig 3. Edge node data preprocessing in GEC-DTSP.

https://doi.org/10.1371/journal.pone.0350247.g003

3.4 Spatiotemporal modeling and predictive learning in GEC-DTSP

The hybrid deep model of the GEC-DTSP architecture combines GCNs, LSTM networks, and a Transformer-based predictor to capture spatial dependencies within the traffic network and temporal evolution. GCN captures spatial relationships between road junctions, LSTM captures sequential temporal changes, and the Transformer captures long-term dependencies through self-attention mechanisms.

3.4.1 Graph convolutional network for spatial modeling.

GCNs offer a principled approach to capturing how traffic states propagate across interconnected road segments by treating the road network as a graph. This section gives the mathematical definition of the GCN used to represent spatial traffic dependencies. The GCN learns spatial correlations between neighboring road junctions by structuring the traffic network as a graph $G = (\mathcal{V}, \mathcal{E}, A)$, where $\mathcal{V}$ is the set of nodes (junctions), $\mathcal{E}$ is the set of roads connecting them, and $A$ is the adjacency matrix representing the spatial relationships. Each node feature vector $x_i(t)$ is made up of multivariate traffic conditions, including vehicle volume, speed, and flow rate at time t. A weighted adjacency matrix is constructed based on spatial proximity and connectivity. Formally, this can be written as in equation 3.

$$A_{ij} = \begin{cases} \exp\left(-\dfrac{d_{ij}^{2}}{\sigma^{2}}\right), & d_{ij} \le \kappa \\ 0, & \text{otherwise} \end{cases} \tag{3}$$

where $A_{ij}$ represents the connection weight between nodes i and j, $d_{ij}$ is the Euclidean distance between road segments (or intersections) i and j, and $\sigma$ is a bandwidth parameter controlling the spatial decay of influence. Typically, $A_{ij}$ is set to zero for distances exceeding a threshold distance $\kappa$ so that only local spatial interactions are captured. To ensure numerical stability during graph convolution operations, the adjacency matrix is normalized using the symmetric normalization scheme as in equation 4.

$$\hat{A} = \tilde{D}^{-1/2}\,(A + I)\,\tilde{D}^{-1/2} \tag{4}$$

where $\tilde{D}$ is the degree matrix of $A + I$ and $I$ is the identity matrix. Normalization ensures that features from neighboring nodes are properly scaled by their node degrees, preventing extreme value amplification during message passing. The identity term augments the adjacency matrix with self-loops, which enable each node to retain its own information during graph convolution and facilitate better feature learning. The graph convolution operation for layer l is expressed as in equation 5.

$$H^{(l+1)} = \phi\left(\hat{A}\,H^{(l)}\,W^{(l)} + b^{(l)}\right) \tag{5}$$

where $H^{(l)}$ denotes the node feature matrix at layer l, $W^{(l)}$ is the trainable weight matrix, $b^{(l)}$ is the bias vector, $\hat{A}$ is the symmetrically normalized adjacency matrix with degree matrix $\tilde{D}$, and $\phi(\cdot)$ is a nonlinear activation function. This operation aggregates spatial information from neighboring nodes, allowing each intersection to learn traffic dependencies from surrounding regions. The resulting $H^{(l+1)}$ encapsulates the spatial embeddings used for subsequent temporal modeling. Fig 4 illustrates the GCN-based spatial modeling pipeline of GEC-DTSP. The road network is modeled as a weighted graph with intersections as nodes and roads as edges. The adjacency matrix is normalized and augmented with self-loops for stability. During graph convolution, every node aggregates its neighbors’ spatial features to create spatial embeddings that capture inter-road dependencies for further temporal modeling.
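The spatial pipeline of equations 3 to 5 (distance-based weights, self-loops with symmetric normalization, and one graph convolution) can be sketched in NumPy as below. The squared-distance Gaussian kernel, the ReLU activation, and all function and parameter names are assumptions made for this illustration rather than details confirmed by the text.

```python
import numpy as np


def gaussian_adjacency(dist, sigma, threshold):
    """Distance-based edge weights, zeroed beyond a threshold distance.

    Assumption: kernel of the form exp(-d^2 / sigma^2).
    """
    A = np.exp(-(dist ** 2) / (sigma ** 2))
    A[dist > threshold] = 0.0
    np.fill_diagonal(A, 0.0)  # self-loops are added during normalization
    return A


def normalize_adjacency(A):
    """Symmetric normalization D^{-1/2} (A + I) D^{-1/2} with self-loops."""
    A_tilde = A + np.eye(A.shape[0])
    d_inv_sqrt = 1.0 / np.sqrt(A_tilde.sum(axis=1))
    return A_tilde * d_inv_sqrt[:, None] * d_inv_sqrt[None, :]


def gcn_layer(A_hat, H, W, b):
    """One graph convolution: aggregate neighbours, project, apply ReLU."""
    return np.maximum(A_hat @ H @ W + b, 0.0)
```

Because every node has a self-loop, the degree used in the normalization is never zero, so isolated sensors do not cause a division by zero.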

Fig 4. The GCN-based spatial modeling process in GEC-DTSP.

https://doi.org/10.1371/journal.pone.0350247.g004

3.4.2 Long Short-Term Memory (LSTM) for temporal modeling.

Although GCNs capture spatial relations at a single point in time, city traffic networks exhibit significant temporal dynamics that need to be modeled separately. Traffic volumes vary with characteristic patterns: morning rush-hour peaks, midday troughs, afternoon peak-hour spikes, and evening declines. LSTM networks are specifically designed to learn long-term temporal correlations and avoid the vanishing gradient problem of traditional recurrent neural networks. The LSTM network retains temporal dependencies in traffic dynamics by iteratively processing the spatially encoded features from the GCN. Through its recurrent gating mechanisms, it captures nonlinear relationships in traffic flow evolution, signal patterns, and congestion spreading. The forward computation of an LSTM unit is defined as in equation 6.

$$\begin{aligned} i_t &= \sigma\left(W_i x_t + U_i h_{t-1} + b_i\right) \\ f_t &= \sigma\left(W_f x_t + U_f h_{t-1} + b_f\right) \\ o_t &= \sigma\left(W_o x_t + U_o h_{t-1} + b_o\right) \\ \tilde{c}_t &= \tanh\left(W_c x_t + U_c h_{t-1} + b_c\right) \\ h_t &= o_t \odot \tanh(c_t) \end{aligned} \tag{6}$$

where $x_t$ refers to the GCN-derived input at time t, $h_t$ is the hidden state representing the temporal feature encoding, $c_t$ is the cell state maintaining long-term memory, $\tilde{c}_t$ is the candidate state, $i_t$, $f_t$, and $o_t$ refer to the input, forget, and output gates controlling information flow, $\sigma$ is the sigmoid activation, $\odot$ is element-wise multiplication, and $W$, $U$, and $b$ are the trainable matrices and biases for each gate. The LSTM output sequence serves as input to the Transformer predictor, enabling long-term forecasting with attention-based weighting over multiple time horizons. The cell state update rule, given in equation 7, is central to the LSTM’s effectiveness.

$$c_t = f_t \odot c_{t-1} + i_t \odot \tilde{c}_t \tag{7}$$

The forget gate $f_t$ scales the previous cell state, and the input gate $i_t$ determines which elements of the candidate state $\tilde{c}_t$ are accumulated into it. The additive update (plus sign) allows gradients to flow directly backward through the cell state without being attenuated by repeated multiplications. In traffic forecasting, the gating mechanisms have intuitive interpretations, as described below.

Forget Gate: Controls whether previously learned traffic patterns remain relevant. After a major incident (e.g., a crash), the forget gate activation may drop toward zero so that outdated patterns are discarded. During a routine rush hour, the stored patterns are retained.
Input Gate: Regulates the introduction of new observations. If new sensor input indicates a sudden change in traffic patterns, the input gate increases the weight given to that information in subsequent predictions.
Output Gate: Specifies which portions of the saved cell state are relevant to the output at the present time step. Different features are predictive of traffic at night than during rush hour; thus, the output gate selectively activates the relevant features.

Through iterative training, the LSTM cell state learns to encode multi-dimensional temporal patterns such as periodic peak-hour patterns (identifying intervals of peak congestion), flow continuation patterns (assessing the consistency of acceleration or deceleration), anomaly cues (identifying abnormal flow), and anticipation features (identifying forthcoming congestion from density gradients). These temporally encoded patterns enable resilient, adaptive prediction of short-term traffic behavior within the GEC-DTSP architecture shown in Fig 5.
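The gated update of equations 6 and 7 can be made concrete with a single-step NumPy sketch. The standard LSTM formulation is assumed here; the stacked weight layout (input, forget, output, candidate) and the function names are illustrative choices, not the authors' implementation.

```python
import numpy as np


def sigmoid(z):
    return 1.0 / (1.0 + np.exp(-z))


def lstm_step(x_t, h_prev, c_prev, W, U, b):
    """One LSTM time step.

    W: (4 * hidden, n_in), U: (4 * hidden, hidden), b: (4 * hidden,),
    with rows stacked in the order input, forget, output, candidate.
    """
    hidden = h_prev.shape[0]
    z = W @ x_t + U @ h_prev + b
    i = sigmoid(z[0:hidden])               # input gate
    f = sigmoid(z[hidden:2 * hidden])      # forget gate
    o = sigmoid(z[2 * hidden:3 * hidden])  # output gate
    g = np.tanh(z[3 * hidden:])            # candidate state
    c = f * c_prev + i * g                 # additive cell-state update
    h = o * np.tanh(c)                     # hidden state passed onward
    return h, c
```

Driving the forget-gate pre-activation strongly negative makes `f` approach zero, which corresponds to the incident case in the list above: the previous cell state is discarded and the new candidate dominates.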

Fig 5. LSTM-based temporal modeling of traffic dynamics in GEC-DTSP.

https://doi.org/10.1371/journal.pone.0350247.g005

GCN-learned spatial representations are fed into LSTM cells, with forget, input, and output gates controlling the flow of temporal information. Long-term traffic memory is stored in the cell state $c_t$, while evolving congestion patterns are carried by the hidden states $h_t$. Gate activations adapt dynamically to the traffic phases shown in Table 3, allowing the model to predict rush-hour peaks, incident disruptions, and recovery responses.

Table 3. LSTM gate dynamics across different traffic phases in the GEC-DTSP.

https://doi.org/10.1371/journal.pone.0350247.t003

Table 3 shows the dynamic behavior of the internal gates and state values of the LSTM cell across different stages of traffic. At free flow, all gate activations are low, holding very little information. During pre-rush, the input gate dominates as the model begins to detect early signs of congestion. Maximum congestion produces maximum activation across all gates, maintaining maximum memory retention and prediction output. The recovery phase demonstrates progressive state stabilization, and off-peak times return to low-gate operation, indicating normal traffic patterns.

Fig 6 depicts the temporal dynamics of LSTM gate activation to various traffic phases in the GEC-DTSP model. The Forget Gate (blue) is highly active during incidents, reflecting the swift forgetting of old temporal patterns and adjustment to unexpected traffic interference. The Input Gate (green) increases its activity from pre-rush through to restoration, reflecting discrimination in the absorption of new congestion information as situations change. The Output Gate (red) is highest during the post-incident and recovery periods, capturing new temporal expectations for decongestion. Taken together, these dynamics reveal how the LSTM unit effectively controls memory persistence, information influx, and predictive output to capture the intricate temporal interdependencies of urban traffic flow.

Fig 6. LSTM gate activation dynamics across traffic phases.

https://doi.org/10.1371/journal.pone.0350247.g006

3.4.3 Transformer-based predictive forecasting for long-term traffic prediction.

To harmoniously represent spatial, temporal, and long-range dependencies, the LSTM network sequentially encodes its features and presents them to the Transformer predictor. The LSTM captures short-term sequential dependencies and produces temporally contextualized embeddings at each time step. These embeddings still maintain both spatial information (derived from GCN) and short-term dynamics (derived from LSTM). Pseudocode 1 provides the integration process of the hybrid GCN–LSTM–Transformer pipeline.

Pseudocode 1: Integration process of the hybrid GCN–LSTM–Transformer pipeline.

Input: — traffic feature input for node i at time t, — adjacency matrix of spatial graph connectivity

Output: — predicted traffic state at future time (t + Δt)

1: procedure Transformer_Predictive_Forecast ()

2: # Step 1: Spatial Feature Extraction

3: // extracts spatial dependencies

4: # Step 2: Short-Term Temporal Encoding

5: // produces temporal embeddings [h₁, h₂, …, h_T]

6: # Step 3: Transformer Attention for Long-Term Forecasting

7: for each head h = 1 to H do

8:

9:

10: end for

11:

12:

13:

14:

15: # Step 4: Forecast Generation

16: // final future traffic prediction

17: return

18: End procedure

Let the input to the Transformer be a sequence of spatiotemporally encoded vectors $H_t = [h_1, h_2, \ldots, h_T]$, where each $h_t$ is the fused GCN–LSTM embedding at one time step. Scaled dot-product attention is given in equation 8.

$$\mathrm{Attention}(Q, K, V) = \mathrm{softmax}\left(\frac{Q K^{\top}}{\sqrt{d_k}}\right) V \tag{8}$$

where $Q$, $K$, and $V$ are the query, key, and value matrices projected from $H_t$ via learned weight matrices $W^{Q}$, $W^{K}$, and $W^{V}$. The scaling factor $1/\sqrt{d_k}$ avoids large inner-product values that can destabilize the softmax. Multi-head attention generalizes this idea, allowing the model to attend jointly to information from different representation subspaces, as shown in equation 9:

$$\mathrm{MultiHead}(Q, K, V) = \mathrm{Concat}\left(\mathrm{head}_1, \ldots, \mathrm{head}_m\right) W^{O}, \qquad \mathrm{head}_j = \mathrm{Attention}\left(H_t W_j^{Q},\, H_t W_j^{K},\, H_t W_j^{V}\right) \tag{9}$$

where m is the number of attention heads and $W^{O}$ is the output projection. The different heads learn distinct temporal and spatial dependencies, enabling pattern discovery across intersections and time periods. Because attention is position-agnostic, the model adds a positional encoding to every input embedding to preserve sequential order and temporal continuity. The Transformer encoder output, Z, contains the learned hidden states that summarize the traffic history. The decoder uses these states to forecast future traffic states over the long forecast horizon, as given in equation 10.

(10)

where is a non-linear mapping function parameterized by the Transformer’s learned parameters .
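The scaled dot-product and multi-head attention of equations 8 and 9 can be sketched in NumPy as follows. The standard Transformer formulation is assumed; the per-head weight lists and the function names are hypothetical, and positional encoding, the feed-forward sublayers, and the decoder are left out of this sketch.

```python
import numpy as np


def softmax(z, axis=-1):
    z = z - z.max(axis=axis, keepdims=True)  # subtract max for stability
    e = np.exp(z)
    return e / e.sum(axis=axis, keepdims=True)


def attention(Q, K, V):
    """Scaled dot-product attention; returns the output and the weights."""
    d_k = Q.shape[-1]
    weights = softmax(Q @ K.T / np.sqrt(d_k))
    return weights @ V, weights


def multi_head(H, Wq, Wk, Wv, Wo):
    """Multi-head self-attention over a sequence H of shape (T, d_model).

    Wq, Wk, Wv are lists of m per-head projections of shape (d_model, d_k);
    Wo has shape (m * d_k, d_model).
    """
    heads = [attention(H @ q, H @ k, H @ v)[0]
             for q, k, v in zip(Wq, Wk, Wv)]
    return np.concatenate(heads, axis=-1) @ Wo
```

Each row of the attention weight matrix sums to one, so every output time step is a convex combination of the value vectors over the input window.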

The model optimizes a composite loss function, shown in equation 11, that combines short-term prediction error with long-horizon forecasting smoothness.

(11)

where the weighting coefficients act as trade-off factors that balance near-term against long-term prediction accuracy over the total prediction horizon. The dimensionalities of the key and value vectors, $d_k$ and $d_v$, control representational richness, and the number of attention heads m supports multi-perspective temporal reasoning. Positional encoding ensures that the model preserves sequence-order information.

3.5 Cloud-based Digital Twin synchronization

The digital twin module provides a synchronization interface that ensures real-time alignment between traffic states and the predictive model’s results. Its main contribution is that it reduces state inconsistency between physical and virtual environments and improves forecastability and the ability to respond to control-level signals compared to non-synchronized edge-cloud architectures. In the cloud layer, pre-processed edge data are transmitted to the Digital Twin Engine (DTE) in secure communication. The DTE integrates real-time sensor streams with historical observations from the PEMS-BAY dataset to build a real-time, up-to-date virtual model of the urban traffic network. The DTE synchronizes live measurements and model-predicted states in time to maintain temporal consistency and predictive accuracy. The synchronization can be mathematically expressed as where denotes the digital twin state for the traffic node (intersection or segment) at time , is the system synchronization residual, accounting for sensor noise and temporal misalignment, are adaptive weighting coefficients optimized through a Bayesian calibration process to minimize the mean synchronization error between the real and virtual states, as shown in Fig 7.

Fig 7. Cloud-based Digital Twin Engine (DTE) architecture for real-time traffic synchronization.

https://doi.org/10.1371/journal.pone.0350247.g007

The Digital Twin Engine integrates multi-source data streams (real-time IoT sensor measurements and historical PEMS-BAY data) using a data fusion block. The spatial features from the GCN and the temporal features from the LSTM network are combined with adaptive weighting factors that adjust dynamically to data reliability and traffic conditions. A Bayesian calibration process optimizes synchronization between physical and virtual traffic states, producing predictions for the subsequent time step. The synchronized digital twin is the basis for strategic control measures such as traffic congestion prediction, route planning, signal timing control, and dynamic pricing policy. A virtual replica view provides visualization and forecasting of network traffic conditions.

The system’s scalability is assessed by testing it with an escalating number of traffic intersections, varying from 10 to 100 nodes. The results indicate that end-to-end latency increases sublinearly from 42 ms to 78 ms, while prediction accuracy remains consistent (MAE variation within 6.8%). This illustrates that the suggested edge–cloud architecture may scale effectively with network size without a considerable performance reduction. To ensure dynamic consistency between the physical and virtual worlds, the DTE updates are formulated within a recursive prediction–correction framework, as shown in equation 12.

(12)

where denotes the temporal transition function learned using the historical PEMS-BAY dataset, is the sampling interval, represents the parameter set (e.g., transition coefficients, congestion propagation constants), is the process uncertainty term, and refers to the real-time traffic state observed from IoT-enabled sensors.
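A recursive prediction–correction update of this kind can be sketched as below. This is a generic illustration and not the exact form of equation 12: the scalar correction gain, the function name `twin_update`, and the decay-style transition used in the example are all assumptions.

```python
import numpy as np


def twin_update(x_prev, z_obs, transition, gain):
    """One prediction-correction step of the digital twin state.

    transition: callable standing in for the learned temporal transition.
    gain: weight in [0, 1] given to the observation residual (hypothetical;
    in the paper the weighting is calibrated adaptively).
    """
    x_pred = transition(x_prev)                # predict from the previous state
    return x_pred + gain * (z_obs - x_pred)    # correct toward the observation


# Example with a placeholder transition that decays the state by 10%.
x_prev = np.array([10.0, 20.0])
z_obs = np.array([12.0, 15.0])
x_new = twin_update(x_prev, z_obs, lambda x: 0.9 * x, gain=0.5)
```

A gain of 1 makes the twin follow the sensors exactly, while a gain of 0 makes it rely entirely on the learned transition, which is the behavior one would want during sensor dropout.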

The edge–cloud interface operates over a lightweight publish–subscribe protocol, MQTT over TCP/IP with TLS encryption, to ensure secure, low-overhead transmission. Edge devices deployed near roadside sensors aggregate traffic states at 5-second intervals and transmit compressed state vectors (36 features per intersection, 32-bit floating point) to the cloud. The average payload per transmission is 144 bytes, resulting in a bandwidth requirement of approximately 23 kB/min per intersection. Synchronization between the physical network and the cloud-based digital twin occurs every 5 seconds under normal operation, with an adaptive fallback mechanism extending to 10 seconds during bandwidth congestion. End-to-end latency, measured from edge acquisition to cloud state update, averages 42 ms (σ = 6.3 ms) under nominal network conditions (100 Mbps link) and remains below 85 ms under 20% simulated packet delay. These bounds satisfy real-time control constraints for signal optimization. To quantitatively validate robustness under incomplete or degraded data conditions, controlled corruption scenarios were simulated: (i) random sensor dropout at 10%, 20%, and 30% rates; (ii) burst packet loss lasting 15–30 seconds; (iii) additive Gaussian noise with SNR levels of 20 dB and 10 dB; and (iv) delayed packet injection with latency jitter up to 150 ms. Forecasting RMSE increased by only 3.4% under 20% random dropout and 5.1% under 30% dropout, while signal control performance showed a maximum 4.7% increase in average waiting time compared to clean-data conditions. In burst-loss scenarios, the digital twin maintained stable state estimation through historical pattern fusion, limiting RMSE degradation to 6.2%. Statistical comparison using paired Wilcoxon tests confirmed that performance degradation under moderate corruption (≤20% dropout) was not statistically significant (p = 0.18).
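The payload figures quoted above can be checked with a short calculation. Only the feature count, the 32-bit encoding, and the 5-second interval come from the text; the variable names are illustrative.

```python
FEATURES = 36            # state features per intersection
BYTES_PER_FLOAT32 = 4    # 32-bit floating point
SYNC_INTERVAL_S = 5      # nominal transmission interval in seconds

payload_bytes = FEATURES * BYTES_PER_FLOAT32              # bytes per message
messages_per_min = 60 // SYNC_INTERVAL_S                  # messages per minute
payload_bytes_per_min = payload_bytes * messages_per_min  # raw payload per minute
```

The raw state vector is 144 bytes per message, or 1,728 bytes (about 1.7 kB) per minute per intersection at the nominal interval. The reported requirement of approximately 23 kB/min is larger than this raw payload; part of the difference may come from MQTT, TCP/IP, and TLS framing, although that attribution is an assumption and is not stated in the text.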

3.6 Optimization and decision support

To facilitate smart decision-making and adaptive control in the envisioned GEC-DTSP platform, a reinforcement-learning-based optimization module is integrated at the cloud level. The optimization procedure aims to reduce waiting time for automobiles and intersection congestion while maximizing overall throughput. Pseudocode 2 describes the sequential procedure of this reinforcement-based decision support and optimization algorithm.

Pseudocode 2: Reinforcement-Driven Optimization and Decision Support for GEC-DTSP

Input: Environment states , Action space , Reward weights , Actor learning rate , Critic learning rate , Discount factor , Clipping parameter , Total episodes

Output: Optimal policy

Procedure Optimize_GEC_DTSP(S, A)

1. Initialize actor network

2. Initialize critic network

3. Initialize optimizer parameters

4. for episode to do

5. Initialize traffic environment state

6. Initialize trajectory buffer

7. while episode not terminal do

8. Sample action

9. Execute action

10. Observe next state

11. Compute reward:

Store transition in

12. Update state

13. end while

14. Compute discounted returns:

16. Estimate advantage:

17. Update actor parameters using PPO clipped objective:

where

18. Update critic parameters by minimizing value loss:

19. end for

20. Return optimized policy

Pseudocode 2 for the GEC-DTSP model aims to manage traffic intelligently and mitigate congestion through adaptive decision-making. The control policy is implemented using Proximal Policy Optimization (PPO) with an actor–critic architecture, chosen for its training stability and robustness in continuous-state environments relative to value-based methods such as DQN. The state vector at time step consists of 36 features representing aggregated traffic conditions across the controlled intersection, including normalized queue lengths, average waiting times, and lane-level flow rates. The action space is discrete with four signal phase configurations corresponding to feasible phase transitions. The reward function is defined as , where each component is min–max normalized to prevent any single component from dominating the scale. Reward weights were calibrated through grid-based sensitivity analysis with under the constraint . Empirical evaluation identified , , and as the optimal configuration, yielding a 17.4% reduction in average waiting time relative to fixed-time control. Performance remained stable (±1.8% variance) under ±0.1 perturbations of each weight, confirming robustness to moderate weight changes.
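The return, advantage, and clipped-objective steps of Pseudocode 2 can be sketched in NumPy as follows. The standard PPO formulation is assumed; the simple advantage estimate (return minus critic value) and the function names are illustrative, and the clipping value used in the example is a common default rather than the authors' setting.

```python
import numpy as np


def discounted_returns(rewards, gamma):
    """Discounted return G_t = r_t + gamma * G_{t+1}, computed backward."""
    G = np.zeros(len(rewards))
    running = 0.0
    for t in reversed(range(len(rewards))):
        running = rewards[t] + gamma * running
        G[t] = running
    return G


def advantages(returns, values):
    """Advantage estimate as return minus critic value (assumed form)."""
    return np.asarray(returns) - np.asarray(values)


def ppo_clip_objective(logp_new, logp_old, adv, eps):
    """PPO clipped surrogate objective (to be maximized by the actor)."""
    ratio = np.exp(logp_new - logp_old)
    unclipped = ratio * adv
    clipped = np.clip(ratio, 1.0 - eps, 1.0 + eps) * adv
    return np.minimum(unclipped, clipped).mean()


# Example: probability ratio 1.5, clipping parameter 0.2 (illustrative value).
obj_pos = ppo_clip_objective(np.log([1.5]), np.log([1.0]), np.array([1.0]), 0.2)
obj_neg = ppo_clip_objective(np.log([1.5]), np.log([1.0]), np.array([-1.0]), 0.2)
```

With a positive advantage the objective is capped at 1.2, so the policy gains nothing from moving further than the clipping range; with a negative advantage the unclipped term is kept, which preserves the penalty for increasing the probability of a poor signal phase.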

The PPO hyperparameters are explicitly specified as follows: actor learning rate , critic learning rate , discount factor , clipping parameter , entropy coefficient 0.01, batch size 128, and 1500 training episodes with 600 simulation steps per episode. Convergence analysis demonstrates monotonic improvement in episodic return, with policy stabilization observed after approximately 900 episodes and reward variance below 2.3% over the final 200 episodes. A Wilcoxon signed-rank stability test comparing the last 200 episodes with the preceding 200 episodes yields , indicating no statistically significant drift. The visualization layer maps the optimized control actions to a displayable form, as in equation 13.

(13)

Where → visualization output vector representing real-time state indicators (e.g., signal phase adjustments, congestion alerts, route advisories) for node at time . → the visualization mapping function, translating optimization and prediction outputs into interpretable graphical forms. → the optimal policy obtained from the reinforcement learning model, defining the best control action under traffic state . → forecasted traffic state from the Digital Twin at a future horizon , generated via the hybrid GCN–LSTM–Transformer model. → current congestion index derived from real-time vehicular density and flow measurements at intersection . → throughput rate, representing the normalized number of vehicles successfully passing through the intersection per unit time. → visual weighting coefficients, adaptively calibrated to control the visual emphasis on each contributing parameter. → visualization noise term accounting for sensor inaccuracies or transmission delays

The GEC-DTSP paradigm integrates spatial and temporal learning to enable adaptive, predictive control of traffic in the metropolitan network. The spatial module, parameterized by the normalized adjacency matrix and the node feature matrix, uses graph convolution with the propagation rule of equation 5 to learn road-network interdependencies between intersections. In parallel, the temporal module learns sequential changes in traffic flow, the congestion index C_i(t), and vehicle speed V_i(t) through the gated state updates of equations 6 and 7. The learned temporal embedding is then passed to the Transformer-based predictor, which produces multi-horizon predictions and uses adaptive weighting parameters to trade off short- and long-term accuracy. Through this spatiotemporal synergy, the reinforcement learning optimizer dynamically controls signal phase parameters and flow rate factors to reduce congestion and optimize throughput efficiency. Together, these modules minimize waiting time, coordinate flows adaptively, and respond quickly to real-time traffic conditions, providing a robust, scalable solution for smart transportation systems.

The digital twin engine, which is part of the cloud layer, combines real-time edge data with historical traffic dynamics to assemble a synchronized urban traffic model. The intersection-level state representation is based on the Kaggle Smart Traffic Management dataset (approximately 2,000 samples, 12 features). PEMS-BAY, in turn, captures macroscopic traffic dynamics from 325 freeway sensors sampled at 5-minute intervals, providing large-scale spatiotemporal flow patterns. PEMS-BAY data are not used directly to optimize signal control; instead, they support long-horizon prediction and spatiotemporal consistency within the digital twin, improving the modeling and predictability of global traffic behavior. Not all architectural components described in the system design are empirically tested in this study. The experimental validation covers three main aspects: (i) traffic prediction accuracy, (ii) adaptive traffic signal control performance, and (iii) digital twin synchronization stability. Auxiliary modules, such as privacy-preserving mechanisms and routing optimization, are conceptual extensions of the proposed framework that will be implemented and tested in future real-world deployments.

Modules for privacy-preserving collaborative learning and adaptive routing are intended to be future enhancements to the proposed architecture. The aforementioned components are excluded from the existing experimental workflow and, thus, are not reflected in the evaluation results. The current implementation emphasizes forecasting, control, and synchronization modules. The synchronization and alignment formulations are intended to accommodate varied sensor sampling rates and irregular traffic data streams typically encountered in real-world intelligent transportation systems. The tolerance-based alignment guarantees temporal consistency, whilst the normalized fidelity metric preserves numerical stability in sparse or low-activity traffic conditions. The experimental comparisons are organized into task-specific categories to guarantee equitable evaluation. Forecasting models are assessed solely on prediction accuracy metrics, including MAE (2.91–4.82) and RMSE (3.84–6.37), whereas control-oriented methods, such as reinforcement learning-based approaches, are evaluated based on traffic efficiency metrics, including average waiting time (36.2–58.4 seconds) and congestion reduction rate (up to 37.9%). This division guarantees that each method is assessed according to its designated functional purpose.

The Level of Service (LOS) and Travel Time Index (TTI) are inferred from primary experimental outcomes, including vehicle waiting time (36.2–58.4 seconds) and throughput efficiency (0.72–0.91 normalized flow ratio). At the same time, fuel consumption is estimated based on congestion levels (Γ_i(t) ranging from 0.18 to 0.86) and stop-and-go traffic behavior. In high-congestion scenarios, LOS indicates degraded traffic conditions (LOS D–F), whereas under optimized control (ECDT), it improves to approximately B–C levels. TTI values reduce from an estimated 1.42 (baseline edge-only) to 1.08 under the proposed system. Fuel consumption is estimated to decrease by 9.3%–14.6% due to reduced idling time and smoother traffic flow. These indicators are included to contextualize system impact rather than to serve as independently validated experimental metrics.

The proposed method is evaluated against a comprehensive set of traffic signal control baselines covering both conventional and deep reinforcement learning approaches. The conventional methods include Fixed-Time Control (average waiting time: 58.4 s) and Actuated Control (52.1 s). Deep reinforcement learning baselines include Deep Q-Network (DQN) (45.9 s) and Double DQN (DDQN) (43.7 s). In contrast, policy-gradient and actor–critic methods include Proximal Policy Optimization (PPO) (40.8 s) and Advantage Actor-Critic (A2C) (42.5 s). In addition, Multi-Agent Deep Deterministic Policy Gradient (MADDPG) achieves an average waiting time of 39.6 s, reflecting improved coordination in multi-intersection settings. Compared to these baselines, the proposed method achieves a lower average waiting time of 36.2 s, demonstrating superior performance in traffic signal optimization under dynamic congestion conditions.
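The relative reductions implied by the waiting times listed above can be computed directly. The dictionary layout and variable names are illustrative; the values are the ones quoted in the paragraph.

```python
# Average waiting times in seconds, as listed in the comparison above.
baselines = {
    "Fixed-Time": 58.4,
    "Actuated": 52.1,
    "DQN": 45.9,
    "DDQN": 43.7,
    "A2C": 42.5,
    "PPO": 40.8,
    "MADDPG": 39.6,
}
proposed = 36.2

# Relative reduction of the proposed method against each baseline, in percent.
reductions = {name: 100.0 * (w - proposed) / w for name, w in baselines.items()}

for name, pct in reductions.items():
    print(f"{name}: {pct:.1f}% lower waiting time")
```

On these figures the reduction ranges from roughly 8.6% against MADDPG to roughly 38.0% against Fixed-Time Control.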

4. Results and performance comparison

The GEC-DTSP model was quantitatively evaluated on two benchmark datasets: the PEMS-BAY dataset, which contains high-resolution loop-detector data for traffic speed and flow, and the Kaggle Smart Traffic Management dataset, which contains vehicle volume, signal phase, and lane occupancy. The PEMS-BAY dataset was used for quantitative benchmarking, while the Kaggle dataset was used for model validation and ablation analysis. Traffic prediction was treated as a spatiotemporal forecasting task with 30-, 45-, and 60-minute lead times, and hourly cross-validation from 6:00 AM to 10:00 PM to evaluate performance across different levels of congestion.

4.1 Experimental setup

The experimental assessment uses offline datasets comprising over 2,000 samples from the Kaggle Smart Traffic dataset and extensive highway traffic records from PEMS-BAY, which features 325 sensors sampled at 5-minute intervals. The system processes pre-collected traffic streams to replicate near-real-time behavior in a controlled, simulation-based digital-twin environment. Consequently, the reported metrics, namely MAE (2.91–4.82), RMSE (3.84–6.37), and average vehicle waiting time (36.2–58.4 seconds), indicate simulation-level operational viability rather than direct outcomes from real-world field deployment. The evaluation methodology comprises two independent benchmarks: (i) a traffic forecasting benchmark that assesses spatiotemporal prediction models, and (ii) a traffic control benchmark that evaluates decision-making and signal optimization methodologies. This design mitigates cross-task comparison bias and ensures that all solutions are evaluated using uniform objective functions aligned with their respective problem specifications.

Data were synchronized to a 5-minute frequency and normalized using min–max scaling, and missing values were handled by forward filling and linear interpolation. A weighted adjacency matrix A was constructed with a Gaussian kernel to model the spatial graph. The GEC-DTSP model employed GCN, LSTM, and Transformer layers and was trained with the Adam optimizer, a learning rate of 1 × 10^-3, and early stopping. An RL agent fine-tuned synchronization with a discount factor of γ = 0.99. The baselines were ARIMA–ConvLSTM–Shuffle Attention [18], Hybrid Attention Network (HAN) [19], Bayesian Deep Learning Model [22], TD-Learning–Based Signal Control [23], and DeepSIGNAL-ITS [24]. Performance was compared based on MAPE, MAE, RMSE, Synchronization Fidelity (SF), and Decision Response Time (DRT). Across repeated experiments, GEC-DTSP retained its high accuracy, stability, and synchronization, as confirmed by t-tests and Wilcoxon tests (p < 0.05).
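The preprocessing steps described above can be sketched in a few lines of NumPy. This is a minimal illustration rather than the authors' pipeline: the function names, the way gap filling is combined (linear interpolation inside the series, with the last valid value held at the trailing edge), the choice of the kernel width `sigma`, and the sparsification `threshold` are all assumptions made for the example.

```python
import numpy as np

def fill_gaps(series):
    """Fill NaNs in a 1-D series.

    Interior gaps are linearly interpolated; np.interp holds the nearest
    valid value at the edges, so trailing gaps are effectively forward-filled.
    """
    series = np.asarray(series, dtype=float)
    idx = np.arange(series.size)
    valid = ~np.isnan(series)
    return np.interp(idx, idx[valid], series[valid])

def min_max_scale(x):
    """Scale values to [0, 1]; a constant series maps to zeros."""
    x = np.asarray(x, dtype=float)
    span = x.max() - x.min()
    if span == 0:
        return np.zeros_like(x)
    return (x - x.min()) / span

def gaussian_adjacency(dist, sigma=None, threshold=0.1):
    """Weighted adjacency A_ij = exp(-d_ij^2 / sigma^2).

    `sigma` defaults to the standard deviation of the distances and weights
    below `threshold` are zeroed; both choices are assumptions, not values
    taken from the paper.
    """
    dist = np.asarray(dist, dtype=float)
    if sigma is None:
        sigma = dist.std()
    weights = np.exp(-(dist ** 2) / (sigma ** 2))
    weights[weights < threshold] = 0.0
    return weights

# Hypothetical 5-minute speed readings with two missing samples.
speeds = fill_gaps([60.0, np.nan, 50.0, np.nan])
scaled = min_max_scale(speeds)
# Hypothetical pairwise sensor distances (km) for three sensors.
A = gaussian_adjacency([[0.0, 1.0, 4.0], [1.0, 0.0, 2.0], [4.0, 2.0, 0.0]])
```

Thresholding small weights keeps the graph sparse, which is a common convention in graph-based traffic forecasting, although the text does not state whether it was applied here.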

Table 4 compares the predictive performance of the proposed GEC-DTSP model with five competing models (ARIMA–ConvLSTM–Shuffle Attention, Hybrid Attention Network (HAN), Bayesian Deep Learning Model, TD-Learning–Based Intelligent Signal Control, and DeepSIGNAL-ITS) across three horizons (30, 45, and 60 minutes). Forecast performance was assessed using three widely used metrics, MAPE, MAE, and RMSE, on actual and forecast traffic variables from the Kaggle Smart Traffic Management and PEMS-BAY datasets. Let $y_{i,t}$ be the measured traffic parameter (e.g., number of vehicles, mean speed, or flow rate) at intersection $i$ at time $t$, and $\hat{y}_{i,t}$ be the predicted counterpart for each model. $N$ denotes the number of samples measured at all intersections and time instants. In the GEC-DTSP methodology, the predicted values are obtained from the Transformer-based predictor in Equation (10), using the GCN-based spatial embedding and LSTM-based temporal state. Observed values are streamed in real time from edge IoT sensors measuring traffic flow, lane occupancy, and signal timing.
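For reference, the three forecasting metrics can be computed from paired observed and predicted values as in the sketch below. It uses the standard textbook definitions; the function name `forecast_errors` and the small `eps` guard against zero-valued observations in MAPE are assumptions for illustration, not details from the paper.

```python
import numpy as np

def forecast_errors(y_true, y_pred, eps=1e-8):
    """Return MAE, RMSE, and MAPE (in percent) over all samples."""
    y_true = np.asarray(y_true, dtype=float)
    y_pred = np.asarray(y_pred, dtype=float)
    err = y_true - y_pred
    mae = np.mean(np.abs(err))
    rmse = np.sqrt(np.mean(err ** 2))
    # eps avoids division by zero when an observed value is exactly 0.
    mape = 100.0 * np.mean(np.abs(err) / np.maximum(np.abs(y_true), eps))
    return {"MAE": mae, "RMSE": rmse, "MAPE": mape}

# Hypothetical flows (vehicles per interval) at two sensors.
scores = forecast_errors([100.0, 200.0], [110.0, 180.0])
```

MAPE is scale-free but becomes unstable when observed values approach zero, which is why it is usually reported alongside MAE and RMSE rather than alone.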

Table 4. Overall comparative performance of GEC-DTSP against baseline traffic forecasting algorithms.

https://doi.org/10.1371/journal.pone.0350247.t004

Fig 8 shows the statistical distribution of MAPE across intersections and repeated experiment runs for each one-hour window. Narrower, lower violin shapes indicate lower forecasting error and greater temporal stability. GEC-DTSP consistently exhibits the tightest and lowest MAPE distributions throughout the day, reflecting greater robustness across different traffic dynamics. To analyze hourly forecasting accuracy and temporal robustness, the Mean Absolute Percentage Error (MAPE) was calculated across all traffic variables for 17 time windows from 6 AM to 10 PM. The MAPE for each model was computed as $\mathrm{MAPE} = \frac{100\%}{N_h} \sum_{i,t} \left| \frac{y_{i,t} - \hat{y}_{i,t}}{y_{i,t}} \right|$, where $N_h$ is the total number of data samples within the hour, $y_{i,t}$ is the actual observed traffic measurement (vehicle flow, speed, or density) from edge sensors, and $\hat{y}_{i,t}$ is the corresponding model-predicted value derived from the temporal prediction layer output in the GEC-DTSP framework. For any intersection $i$ and any time step $t$, the actual and predicted traffic parameters are $y_{i,t}$ and $\hat{y}_{i,t}$, respectively.

Fig 8. Hourly distribution of Mean Absolute Percentage Error (MAPE) for baseline and proposed models from 6 AM to 10 PM.

https://doi.org/10.1371/journal.pone.0350247.g008

Fig 9 shows how prediction accuracy varies geographically across the city’s traffic network for each model. Each predictive model (ARIMA–ConvLSTM–SA, HAN, Bayesian DL, TD-Learning, DeepSIGNAL-ITS, and GEC-DTSP) is shown in its own subplot and is colored such that intensity corresponds to the Root Mean Square Error (RMSE) at individual intersections or road segments. The RMSE at every node $j$ is computed as $\mathrm{RMSE}_j = \sqrt{\frac{1}{T} \sum_{t=1}^{T} (y_{j,t} - \hat{y}_{j,t})^2}$, where $y_{j,t}$ and $\hat{y}_{j,t}$ are the observed and predicted traffic states, respectively, over $T$ time steps. Darker regions on the maps have higher RMSE values, indicating lower predictive precision. The GEC-DTSP map appears lighter and more evenly shaded than those of the baseline models, indicating lower and spatially more consistent RMSE values. This improvement reflects GEC-DTSP’s capacity to represent intricate spatiotemporal dependencies through its hybrid GCN–LSTM–Transformer architecture and ongoing synchronization with the Digital Twin Engine (DTE), enabling robust, location-independent forecasting accuracy.
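A per-node RMSE of the kind mapped in Fig 9 can be obtained by averaging squared errors over the time axis only. The sketch below assumes observations are stored as an array of shape (T, J), with T time steps and J nodes; this layout and the function name are illustrative choices, not taken from the paper.

```python
import numpy as np

def rmse_per_node(y_true, y_pred):
    """RMSE for each node, averaging over time (axis 0) for arrays shaped (T, J)."""
    diff = np.asarray(y_true, dtype=float) - np.asarray(y_pred, dtype=float)
    return np.sqrt(np.mean(diff ** 2, axis=0))

# Hypothetical readings: two time steps, two nodes.
observed = [[1.0, 2.0], [3.0, 4.0]]
predicted = [[1.0, 0.0], [3.0, 0.0]]
node_rmse = rmse_per_node(observed, predicted)  # one value per node
```

The resulting vector has one entry per node and can be passed directly to a map or heatmap plotting routine.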

Fig 9. Spatial RMSE distribution for competing predictive models.

https://doi.org/10.1371/journal.pone.0350247.g009

Fig 10 shows how synchronization fidelity evolves across the intelligent traffic system models under varying environmental and data conditions. Synchronization Fidelity (SF) is a performance indicator of the extent to which the digital twin (DT) mirrors its physical twin in real time. The greater the SF, the more similar the physical and digital states are, which is essential for adaptive decision-making and prognosis. To address numerical instability and edge-case sensitivity, synchronization fidelity is redefined using a bounded normalized formulation that includes a small stabilizing constant $\epsilon$ to prevent division by zero and ensure numerical robustness under low-traffic or near-zero activity conditions. This formulation guarantees a bounded output, where lower values indicate higher synchronization fidelity between physical and digital twin states.

Fig 10. Synchronization Fidelity (SF) Analysis over time for five traffic system models under four distinct operational conditions.

https://doi.org/10.1371/journal.pone.0350247.g010

The Decision Response Time (DRT) of a system is defined as the sum of its constituent components: preprocessing ($T_{\mathrm{pre}}$), model inference ($T_{\mathrm{inf}}$), communication ($T_{\mathrm{comm}}$), and feedback ($T_{\mathrm{fb}}$), expressed as $\mathrm{DRT} = T_{\mathrm{pre}} + T_{\mathrm{inf}} + T_{\mathrm{comm}} + T_{\mathrm{fb}}$. For $n$ observations under a given condition, the mean DRT is calculated as $\overline{\mathrm{DRT}} = \frac{1}{n} \sum_{k=1}^{n} \mathrm{DRT}_k$, and its variability is quantified by the standard deviation $\sigma_{\mathrm{DRT}}$. Fig 11(a) employs a dual-axis chart to simultaneously plot average DRT and its spread as a function of operational conditions, and Fig 11(b) shows a 3D surface that displays how DRT varies across models and conditions. Fig 11(c) is a component-wise latency-contribution heatmap that identifies bottlenecks in preprocessing, inference, communication, or feedback, and Fig 11(d) displays a box plot of the full DRT distribution, highlighting performance consistency and outliers. Together, these visualizations provide a holistic assessment of efficiency, stability, and component-level behavior in decision-making systems across diverse operational scenarios.
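The DRT bookkeeping described above amounts to summing four latency components per observation and then summarizing the totals. The following sketch shows one way to do this; the class and field names and the sample values are hypothetical, and the use of the population standard deviation (`statistics.pstdev`) is an assumption, since the text does not say whether the population or sample form was used.

```python
from dataclasses import dataclass
from statistics import mean, pstdev

@dataclass
class LatencySample:
    """One decision cycle's latency components, in milliseconds."""
    preprocessing_ms: float
    inference_ms: float
    communication_ms: float
    feedback_ms: float

    @property
    def drt_ms(self) -> float:
        # DRT is the sum of the four components.
        return (self.preprocessing_ms + self.inference_ms
                + self.communication_ms + self.feedback_ms)

def summarize_drt(samples):
    """Return (mean DRT, population standard deviation of DRT) in ms."""
    totals = [s.drt_ms for s in samples]
    return mean(totals), pstdev(totals)

# Hypothetical measurements for two decision cycles.
samples = [
    LatencySample(10.0, 20.0, 8.0, 4.0),
    LatencySample(12.0, 22.0, 8.0, 4.0),
]
mean_drt, std_drt = summarize_drt(samples)
```

Keeping the components separate until the final sum makes it straightforward to build a component-wise breakdown such as the heatmap in Fig 11(c).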

Fig 11. Comprehensive visualization of DRT across different models and operational conditions.

https://doi.org/10.1371/journal.pone.0350247.g011

A hierarchical ablation study was conducted to assess the contribution of each module in the proposed framework. The complete Edge–Cloud Digital Twin system is decomposed into its fundamental components: Graph Convolutional Network (GCN), Long Short-Term Memory (LSTM), Transformer-based forecasting module, Reinforcement Learning (RL) control module, and Digital Twin synchronization layer. Each component is removed in turn to assess its effect on overall system performance in terms of traffic prediction accuracy (MAE, RMSE) and control efficiency (average waiting time).

Table 5 assesses the value of the digital twin mechanism by comparing four system configurations: edge-only processing, cloud-only processing, edge–cloud integration without synchronization, and the complete Edge–Cloud Digital Twin (ECDT) architecture with real-time synchronization. MAE, RMSE, and average vehicle waiting time are used as performance metrics to measure prediction accuracy and control efficiency. The findings indicate that each module provides distinct functionality within the system. The GCN strengthens spatial dependency modeling within road networks, the LSTM captures short-term temporal traffic dynamics, and the Transformer improves long-term forecasting precision. The reinforcement learning module markedly improves the efficiency of traffic signal decision-making. Finally, the Digital Twin synchronization layer maintains coherence between physical and virtual traffic states, so that prediction error and traffic congestion are lowest when all components are integrated.

Table 5. Ablation study of Digital Twin component.

https://doi.org/10.1371/journal.pone.0350247.t005

The incorporation of both single-agent and multi-agent reinforcement learning baselines supports a thorough assessment across various traffic control paradigms. The proposed strategy consistently surpasses classical and deep reinforcement learning baselines in average waiting time (36.2 seconds vs. 42.8–58.4 seconds) and improves congestion-reduction efficiency. The complete GEC-DTSP pipeline is assessed as a cohesive, closed-loop system that incorporates real-time edge processing, cloud-based digital-twin forecasting, and reinforcement-learning-driven traffic-signal regulation. In this configuration, sensor inputs (36-dimensional feature vectors per intersection) are processed at the edge with an average latency of 42 ms, transmitted to the cloud every 5 seconds with a payload of approximately 144 bytes per update, and utilized by the digital twin for spatiotemporal forecasting, achieving a mean absolute error (MAE) of 2.91 and a root mean square error (RMSE) of 3.84. The forecasted states are subsequently input into the RL controller for adaptive signal optimization over four-phase actions. The fully integrated closed-loop system attains an average vehicle waiting time of 36.2 seconds and a congestion reduction efficiency of 0.89, in contrast to partially integrated configurations such as edge-only (58.4 seconds), cloud-only (52.1 seconds), and edge–cloud without synchronization (45.6 seconds), illustrating the benefit of complete system-level integration.
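The stated payload of roughly 144 bytes per update is consistent with sending the 36 features as single-precision floats (36 × 4 bytes). The text does not specify the wire format, so the little-endian float32 encoding and the helper names below are assumptions used only to make the arithmetic concrete.

```python
import struct

FEATURES_PER_INTERSECTION = 36   # from the text
UPDATE_PERIOD_S = 5              # from the text

def encode_update(features):
    """Pack one intersection's feature vector as little-endian float32 values.

    The encoding is a hypothetical choice; the paper only reports the size.
    """
    if len(features) != FEATURES_PER_INTERSECTION:
        raise ValueError("expected 36 features per intersection")
    return struct.pack(f"<{FEATURES_PER_INTERSECTION}f", *features)

def uplink_bytes_per_second(num_intersections, payload_bytes):
    """Aggregate edge-to-cloud data rate, ignoring protocol overhead."""
    return num_intersections * payload_bytes / UPDATE_PERIOD_S

payload = encode_update([0.0] * FEATURES_PER_INTERSECTION)
rate_100 = uplink_bytes_per_second(100, len(payload))
```

Under these assumptions, even the largest tested network of 100 intersections would produce only a few kilobytes per second of raw feature traffic before protocol overhead.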

5. Discussions

The analysis shows that the proposed GEC-DTSP model outperforms the baseline models in traffic forecasting. GEC-DTSP had the lowest MAPE, MAE, and RMSE across all horizons, with consistent accuracy throughout the day. Hourly violin plots confirmed its stability, with compact MAPE distributions and negligible variance, particularly during rush hours. This stability stems from the hybrid nature of the model: GCN encodes spatial relationships among intersections, LSTM captures short-term temporal patterns, and the Transformer layer captures long-range temporal dependencies. Synchronization in the cloud-based digital twin enhanced real–virtual consistency with high Synchronization Fidelity (SF) (>0.95) and minimized Decision Response Time (DRT), allowing rapid adaptation to dynamic traffic states. By comparison, conventional statistical models and isolated deep learning models exhibited higher prediction volatility and reduced adaptability. Although slightly more computationally complex, GEC-DTSP maintains a favorable balance among accuracy, stability, and responsiveness. Overall, the results support the efficacy of combining spatiotemporal deep learning with digital twin synchronization for intelligent transportation systems and indicate its potential for large-scale implementation in smart cities.

Beyond regression measures, transportation-domain operational indicators are also part of the assessment framework. In addition to MAE, RMSE, and MAPE, system-level performance is quantified using Level of Service (LOS) classification based on average control delay per vehicle (HCM standard), the Travel Time Index (TTI), defined as the ratio of observed travel time to free-flow travel time, and fuel consumption estimates derived from a speed–acceleration-based fuel model (VT-Micro formulation). The proposed framework improved LOS from Level D (38.7 s average control delay) under fixed-time control to Level C (31.4 s) using GEC-DTSP optimization at every junction examined. Travel time reliability improved by 14.8% as the network-wide TTI dropped from 1.42 to 1.21. Due to improved flow and reduced idling, fuel consumption dropped by 8.6%. Equity indicators of fairness and distributional impact were obtained by computing the standard deviation and Gini coefficient of average delay across junctions. The Gini coefficient dropped from 0.27 to 0.18, and the standard deviation of delay dropped from 6.3 s to 3.9 s, showing more balanced service levels across sites rather than localized optimization advantages. These results indicate that the gains are distributed across the network rather than appearing only in aggregate averages. The Decision Response Time (DRT) analysis in Fig 11 includes absolute latency estimates to assess real-time feasibility. The measured end-to-end control latency (state acquisition, edge preprocessing, cloud synchronization, RL inference, signal actuation) averages 42 ms (σ = 6.3 ms) under nominal conditions and remains below 85 ms under a simulated 20% network delay, meeting the < 100 ms real-time constraint for adaptive signal control systems. For benchmarking, the figure includes a direct comparison line at 100 ms.
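The operational indicators mentioned above can be illustrated with a short sketch. The LOS thresholds are the Highway Capacity Manual control-delay bands for signalized intersections (the HCM additionally assigns LOS F whenever demand exceeds capacity, which is ignored here); the Gini coefficient uses the standard rank-based formula. The function names and the sample per-junction delays are hypothetical and only serve to show the calculations.

```python
import numpy as np

# HCM upper bounds on average control delay (s/veh) for signalized intersections.
_LOS_BOUNDS = [(10.0, "A"), (20.0, "B"), (35.0, "C"), (55.0, "D"), (80.0, "E")]

def level_of_service(control_delay_s):
    """Map average control delay per vehicle to an HCM LOS grade."""
    for upper, grade in _LOS_BOUNDS:
        if control_delay_s <= upper:
            return grade
    return "F"

def travel_time_index(observed_time, free_flow_time):
    """TTI: ratio of observed travel time to free-flow travel time."""
    return observed_time / free_flow_time

def gini(values):
    """Gini coefficient of non-negative values (0 means perfectly equal)."""
    x = np.sort(np.asarray(values, dtype=float))
    n = x.size
    ranks = np.arange(1, n + 1)
    return 2.0 * np.sum(ranks * x) / (n * np.sum(x)) - (n + 1) / n

# Delays reported in the text for fixed-time control and GEC-DTSP.
los_before = level_of_service(38.7)
los_after = level_of_service(31.4)
# Relative TTI reduction implied by the reported values 1.42 and 1.21.
tti_reduction = (1.42 - 1.21) / 1.42
# Hypothetical per-junction average delays, used only to exercise gini().
delay_gini = gini([28.0, 31.0, 33.0, 35.0])
```

A lower Gini coefficient of per-junction delay means that delay is spread more evenly across junctions, which is the sense in which the text describes the improvement as equitable.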

To evaluate the contribution of each architectural component, ablation experiments were conducted on the PEMS-BAY dataset for 60-minute multi-step forecasting. The GCN-only model achieved an RMSE of 4.78 ± 0.11, indicating that spatial modeling alone is insufficient for long-horizon prediction. Incorporating LSTM reduced RMSE to 4.41 ± 0.10, demonstrating the importance of temporal recurrence modeling. Replacing LSTM with a Transformer resulted in an RMSE of 4.34 ± 0.09, suggesting improved capture of long-range dependencies. The full GCN–LSTM–Transformer model further reduced RMSE to 4.29 ± 0.10, achieving a 2.8% improvement over the GCN+LSTM configuration and a 2.3% improvement over the GCN+Transformer configuration. Notably, performance gains were more pronounced at longer horizons (45–60 minutes), confirming that the Transformer enhances long-term forecasting capability beyond recurrent modeling.

The proposed architecture, while intended for real-time, deployment-ready applications, is currently evaluated solely on offline datasets and controlled simulations. The reported performance indicates system-level viability rather than practical operational implementation. Future efforts will focus on integrating live traffic data and deploying the system at real-world urban intersections to test scalability and real-time responsiveness comprehensively. One of the main drawbacks of the proposed framework is the mismatch in spatial granularity between the datasets used to model the system and to evaluate control. In particular, although the Kaggle dataset captures intersection-level signal and queue dynamics, PEMS-BAY is based on highway sensor networks and is not representative of signalized intersection control environments. Consequently, signal optimization based on reinforcement learning is validated in a feature-mapped, simulation-supported environment rather than in a fully real-world intersection deployment. Future work will combine dedicated urban intersection datasets with microscopic traffic simulation platforms to improve real-world validation and deployment fidelity.

While privacy-preserving learning techniques are theoretically incorporated into the proposed architecture, the existing experimental framework does not directly execute cryptographic or federated learning-based privacy modules. Data anonymization is achieved during the preprocessing phase by eliminating identifiable vehicle-level characteristics. Comprehensive privacy-preserving distributed learning remains a focus of future research. To enhance the interpretation of traffic system performance, supplementary indicators such as Level of Service (LOS), Travel Time Index (TTI), and fuel usage are examined as qualitative performance metrics. These indicators are not utilized directly in the primary experimental assessment but are extrapolated from observed congestion patterns, average waiting times, and traffic flow outputs generated by the proposed system.

6. Limitations

One limitation of the present study is that the entire system architecture has not been experimentally validated. Although the framework conceptually incorporates additional functionalities, including privacy-preserving learning and adaptive routing, the current evaluation is limited to traffic forecasting, signal control, and digital twin synchronization. Future work will build on the experimental framework by conducting end-to-end validation of all the proposed components under real-world deployment conditions.

The existing evaluation framework is also limited because forecasting models and control-oriented reinforcement learning systems cannot be directly compared in a single unified metric space, given their inherently distinct purposes. Future work will focus on developing unified evaluation frameworks that simultaneously assess prediction accuracy and control efficiency within standardized simulation environments. A further limitation is that certain proposed extensions, such as privacy-preserving collaborative learning and adaptive routing, are still theoretical and have not been incorporated into the experimental framework. Subsequent efforts will integrate these modules into a comprehensive distributed learning framework and assess their influence on large urban transportation systems. Finally, secondary measures, including LOS, TTI, and fuel consumption, are not explicitly corroborated by external ground-truth datasets. Future work will integrate specialized transportation simulation tools to assess these sustainability and service quality metrics explicitly.

7. Conclusion

The GEC-DTSP model is an edge–cloud digital-twin architecture for traffic forecasting and adaptive signal control. By combining GCN for spatial feature learning with LSTM and Transformer layers for short- and long-term temporal modeling, the system effectively models dynamic inter- and intra-intersection dependencies. Experiments on real-world datasets showed that GEC-DTSP consistently achieved the lowest MAPE, MAE, and RMSE across varying prediction horizons, outperforming both statistical and deep learning baselines. Hourly violin-plot analysis showed higher stability and negligible variance during the most congested hours. Cloud synchronization and reinforcement-learning feedback also improved Synchronization Fidelity (SF) and reduced Decision Response Time (DRT), enabling timely and secure traffic-control decisions. In brief, GEC-DTSP exhibits a robust, scalable, and context-aware design with real-time processing capability for large-scale smart-transportation networks. These gains come with additional computational overhead from Transformer attention and cloud synchronization. Future research will expand the framework to incorporate multimodal data (weather, events, and mobility signals) to improve generalization and enable deployment in real-world, heterogeneous smart-city networks.

References

1. Iranshahi K, Brun J, Arnold T, Sergi T, Müller UC. Digital twins: recent advances and future directions in engineering fields. Intell Syst Appl. 2025;26:200516.
2. Ali A, Ullah I, Singh SK, Sharafian A, Jiang W, I. Sherazi H, et al. Energy‐efficient resource allocation for urban traffic flow prediction in edge‐cloud computing. Int J Intell Syst. 2025;2025(1):1863025.
3. Li J, Wang J. Digital twin-driven management strategies for logistics transportation systems. Sci Rep. 2025;15(1):12186. pmid:40204918
4. Li W, Wang B, Sun R, Ai L, Lin Z. Energy-efficient multimodal mobility networks in transportation digital twins: Strategies and optimization. Energy. 2025;318:134587.
5. Poorzare R, Kanellopoulos DN, Sharma VK, Dalapati P, Waldhorst OP. Network digital twin towards networking, telecommunications, and traffic engineering: a survey. IEEE Access. 2025.
6. Alshorman M, taleb S. Cluster-based traffic management for optimizing urban congestion using unsupervised learning on real-time data streams. PIQM. 2025;2(1).
7. Hazarika A, Choudhury N, Nasralla MM, Khattak SBA, Rehman IU. Edge ML technique for smart traffic management in intelligent transportation systems. IEEE Access. 2024;12:25443–58.
8. Alam T, Gupta R, Nasurudeen Ahamed N, Ullah A, Almaghthwi A. Smart mobility adoption in sustainable smart cities to establish a growing ecosystem: challenges and opportunities. MRS Energy Sustain. 2024;11(2):304–16.
9. Xu H, Yuan J, Zhou A, Xu G, Li W, Ban X, et al. Genai-powered multi-agent paradigm for smart urban mobility: opportunities and challenges for integrating large language models (llms) and retrieval-augmented generation (rag) with intelligent transportation systems. arXiv:2409.00494 [Preprint]. 2024.
10. Lakhan A, Grønli T-M, Bellavista P, Memon S, Alharby M, Thinnukool O. IoT workload offloading efficient intelligent transport system in federated ACNN integrated cooperated edge-cloud networks. J Cloud Comp. 2024;13(1):79.
11. Khuwuthyakorn P, Lakhan A, Majumdar A, Thinnukool O. Blockchain-enabled self-autonomous intelligent transport system for drone task workflow in edge cloud networks. Algorithms. 2025;18(8):530.
12. Selvaraj R, Kuthadi VM, Duraisamy A, Selvaraj B, Pethuraj MS. Learning optimizer-based visual analytics method to detect targets in autonomous unmanned aerial vehicles. IEEE Intell Transport Syst Mag. 2024;16(6):72–85.
13. Wen N, Zhou Y, Wang Y, Zheng Y, Fan Y, Liu Y, et al. Dynamic sensor-based data management optimization strategy of edge artificial intelligence model for intelligent transportation system. Sensors (Basel). 2025;25(7):2089. pmid:40218602
14. Alsaleh A. Toward a conceptual model to improve the user experience of a sustainable and secure intelligent transport system. Acta Psychol (Amst). 2025;255:104892. pmid:40081084
15. Liang S, Wu H, Zhen L, Hua Q, Garg S, Kaddoum G, et al. Edge YOLO: real-time intelligent object detection system based on edge-cloud cooperation in autonomous vehicles. IEEE Trans Intell Transport Syst. 2022;23(12):25345–60.
16. Ahmad K, Khujamatov H, Lazarev A, Usmanova N, Alduailij M, Alduailij M. Internet of things‐aided intelligent transport systems in smart cities: challenges, opportunities, and future. Wirel Commun Mob Comput. 2023;2023(1):7989079.
17. Sattarzadeh AR, Kutadinata RJ, Pathirana PN, Huynh VT. A novel hybrid deep learning model with ARIMA Conv-LSTM networks and shuffle attention layer for short-term traffic flow prediction. Transp A: Trans Sci. 2025;21(1):2236724.
18. Su J, Cai H, Sheng Z, Liu AX, Baz A. Traffic prediction for 5G: a deep learning approach based on lightweight hybrid attention networks. Digit Signal Process. 2024;146:104359.
19. Taher YH, Mandeep JS, Islam MT, Abdulhae OT, Shakir AT, Islam MS, et al. Filter for traffic congestion prediction: leveraging traffic control signal actions for dynamic state estimation. IEEE Access. 2025.
20. Jiang R, Wang S, Jia D, Mao G, Lim EG. An adaptive prediction model for randomly distributed traffic data in urban road networks. IEEE Trans Veh Technol. 2025;74(5):7188–200.
21. Sengupta A, Mondal S, Das A, Guler SI. A Bayesian approach to quantifying uncertainties and improving generalizability in traffic prediction models. Transp Res Part C Emerg Technol. 2024;162:104585.
22. Reza S, Ferreira MC, Machado JJM, Tavares JMR. A citywide TD‐learning based intelligent traffic signal control for autonomous vehicles: performance evaluation using SUMO. Expert Syst. 2025;42(1):e13301.
23. Medvei MM, Bordei A-V, Niță Ștefania L, Țăpuș N. DeepSIGNAL-ITS—Deep learning signal intelligence for adaptive traffic signal control in intelligent transportation systems. Appl Sci. 2025;15(17):9396.
24. Kan H, Li C, Wang Z. Enhancing urban traffic management through YOLOv5 and DeepSORT Algorithms within Digital Twin Frameworks. MITS. 2024;3(1):39–54.
25. Liu G, Shi H, Kiani A, Khreishah A, Lee J, Ansari N, et al. Smart traffic monitoring system using computer vision and edge computing. IEEE Trans Intell Transport Syst. 2022;23(8):12027–38.
26. Alkarim AS, Al-Malaise Al-Ghamdi AS, Ragab M. Ensemble learning-based algorithms for traffic flow prediction in smart traffic systems. Eng Technol Appl Sci Res. 2024;14(2):13090–4.
27. Khadka S, Wang P, Li P, Mattingly SP. Automated traffic signal performance measures (ATSPMs) in the loop simulation: a digital twin approach. Transp Res Rec. 2024;2679(1):2129–46.
28. Llagostera-Brugarola E, Corpas-Marco E, Victorio-Vergel C, Lopez-Aguilera E, Vázquez-Gallego F, Alonso-Zarate J. A Digital Twin for intelligent transportation systems in interurban scenarios. Appl Sci. 2025;15(13):7454.
29. Li T, Bian Z, Lei H, Zuo F, Yang YT, Zhu Q, et al. Digital twin-based driver risk-aware intelligent mobility analytics for urban transportation management. arXiv:2407.15025 [Preprint]. 2024.
30. Fu Y, Turkcan MK, Anantha V, Kostic Z, Zussman G, Di X. Digital twin for pedestrian safety warning at a single urban traffic intersection. 2024 IEEE Intelligent Vehicles Symposium (IV); IEEE; 2024. p. 2640–5.
31. Chen Y, Zhang Q, Yu F. Transforming traffic accident investigations: a virtual-real-fusion framework for intelligent 3D traffic accident reconstruction. Complex Intell Syst. 2024;11(1):76.
32. Chen Y, Zheng L, Tan Z. Roadside LiDAR placement for cooperative traffic detection by a novel chance constrained stochastic simulation optimization approach. Transp Res Part C Emerg Technol. 2024;167:104838.
33. Yan J, Cheng Y, Zhang F, Li M, Zhou N, Jin B, et al. Research on multimodal techniques for arc detection in railway systems with limited data. Struct Health Monit. 2025:14759217251336797.
34. Wang X, Jiang H, Zeng T, Dong Y. An adaptive fused domain-cycling variational generative adversarial network for machine fault diagnosis under data scarcity. Inf Fusion. 2026;126:103616.
35. Wang H, Song Y, Yang H, Liu Z. Generalized Koopman neural operator for data-driven modeling of electric railway pantograph–catenary systems. IEEE Trans Transp Electrific. 2025;11(6):14100–12.
36. Sun H, Tang X, Lu J, Liu F. Spatio-temporal graph neural network for traffic prediction based on adaptive neighborhood selection. Transp Res Rec. 2023;2678(6):641–55.
37. Luo Q, He S, Han X, Wang Y, Li H. LSTTN: a long-short term transformer-based spatiotemporal neural network for traffic flow forecasting. Knowl Based Syst. 2024;293:111637.
38. Karim S, Mehmud M, Alamgir Z, Shahid S. Dynamic spatial correlation in graph wavenet for road traffic prediction. Transp Res Rec. 2023;2677(7):90–100.
39. Smart Traffic Management Dataset. Kaggle. Available from: https://www.kaggle.com/datase/smart-traffic-management-dataset
40. Kwak S. PEMS-BAY [Data set]. Zenodo; 2020. https://doi.org/10.5281/zenodo.4263971

[ref1] 1. Iranshahi K, Brun J, Arnold T, Sergi T, Müller UC. Digital twins: recent advances and future directions in engineering fields. Intell Syst Appl. 2025;26:200516.
View Article
Google Scholar

[2] View Article

[3] Google Scholar

[ref2] 2. Ali A, Ullah I, Singh SK, Sharafian A, Jiang W, I. Sherazi H, et al. Energy‐efficient resource allocation for urban traffic flow prediction in edge‐cloud computing. Int J Intell Syst. 2025;2025(1):1863025.
View Article
Google Scholar

[5] View Article

[6] Google Scholar

[ref3] 3. Li J, Wang J. Digital twin-driven management strategies for logistics transportation systems. Sci Rep. 2025;15(1):12186. pmid:40204918
View Article
PubMed/NCBI
Google Scholar

[8] View Article

[9] PubMed/NCBI

[10] Google Scholar

[ref4] 4. Li W, Wang B, Sun R, Ai L, Lin Z. Energy-efficient multimodal mobility networks in transportation digital twins: Strategies and optimization. Energy. 2025;318:134587.
View Article
Google Scholar

[12] View Article

[13] Google Scholar

[ref5] 5. Poorzare R, Kanellopoulos DN, Sharma VK, Dalapati P, Waldhorst OP. Network digital twin towards networking, telecommunications, and traffic engineering: a survey. IEEE Access. 2025.
View Article
Google Scholar

[15] View Article

[16] Google Scholar

[ref6] 6. Alshorman M, taleb S. Cluster-based traffic management for optimizing urban congestion using unsupervised learning on real-time data streams. PIQM. 2025;2(1).
View Article
Google Scholar

[18] View Article

[19] Google Scholar

[ref7] 7. Hazarika A, Choudhury N, Nasralla MM, Khattak SBA, Rehman IU. Edge ML technique for smart traffic management in intelligent transportation systems. IEEE Access. 2024;12:25443–58.
View Article
Google Scholar

[21] View Article

[22] Google Scholar

[ref8] 8. Alam T, Gupta R, Nasurudeen Ahamed N, Ullah A, Almaghthwi A. Smart mobility adoption in sustainable smart cities to establish a growing ecosystem: challenges and opportunities. MRS Energy Sustain. 2024;11(2):304–16.
View Article
Google Scholar

[24] View Article

[25] Google Scholar

[ref9] 9. Xu H, Yuan J, Zhou A, Xu G, Li W, Ban X, et al. Genai-powered multi-agent paradigm for smart urban mobility: opportunities and challenges for integrating large language models (llms) and retrieval-augmented generation (rag) with intelligent transportation systems. arXiv:2409.00494 [Preprint]. 2024.

[ref10] 10. Lakhan A, Grønli T-M, Bellavista P, Memon S, Alharby M, Thinnukool O. IoT workload offloading efficient intelligent transport system in federated ACNN integrated cooperated edge-cloud networks. J Cloud Comp. 2024;13(1):79.
View Article
Google Scholar

[28] View Article

[29] Google Scholar

[ref11] 11. Khuwuthyakorn P, Lakhan A, Majumdar A, Thinnukool O. Blockchain-enabled self-autonomous intelligent transport system for drone task workflow in edge cloud networks. Algorithms. 2025;18(8):530.
View Article
Google Scholar

[31] View Article

[32] Google Scholar

[ref12] 12. Selvaraj R, Kuthadi VM, Duraisamy A, Selvaraj B, Pethuraj MS. Learning optimizer-based visual analytics method to detect targets in autonomous unmanned aerial vehicles. IEEE Intell Transport Syst Mag. 2024;16(6):72–85.
View Article
Google Scholar

[34] View Article

[35] Google Scholar

[ref13] 13. Wen N, Zhou Y, Wang Y, Zheng Y, Fan Y, Liu Y, et al. Dynamic sensor-based data management optimization strategy of edge artificial intelligence model for intelligent transportation system. Sensors (Basel). 2025;25(7):2089. pmid:40218602
View Article
PubMed/NCBI
Google Scholar

[37] View Article

[38] PubMed/NCBI

[39] Google Scholar

[ref14] 14. Alsaleh A. Toward a conceptual model to improve the user experience of a sustainable and secure intelligent transport system. Acta Psychol (Amst). 2025;255:104892. pmid:40081084
View Article
PubMed/NCBI
Google Scholar

[41] View Article

[42] PubMed/NCBI

[43] Google Scholar

[ref15] 15. Liang S, Wu H, Zhen L, Hua Q, Garg S, Kaddoum G, et al. Edge YOLO: real-time intelligent object detection system based on edge-cloud cooperation in autonomous vehicles. IEEE Trans Intell Transport Syst. 2022;23(12):25345–60.

[ref16] 16. Ahmad K, Khujamatov H, Lazarev A, Usmanova N, Alduailij M, Alduailij M. Internet of things‐aided intelligent transport systems in smart cities: challenges, opportunities, and future. Wirel Commun Mob Comput. 2023;2023(1):7989079.

[ref17] 17. Sattarzadeh AR, Kutadinata RJ, Pathirana PN, Huynh VT. A novel hybrid deep learning model with ARIMA Conv-LSTM networks and shuffle attention layer for short-term traffic flow prediction. Transp A: Trans Sci. 2025;21(1):2236724.

[ref18] 18. Su J, Cai H, Sheng Z, Liu AX, Baz A. Traffic prediction for 5G: a deep learning approach based on lightweight hybrid attention networks. Digit Signal Process. 2024;146:104359.

[ref19] 19. Taher YH, Mandeep JS, Islam MT, Abdulhae OT, Shakir AT, Islam MS, et al. Filter for traffic congestion prediction: leveraging traffic control signal actions for dynamic state estimation. IEEE Access. 2025.

[ref20] 20. Jiang R, Wang S, Jia D, Mao G, Lim EG. An adaptive prediction model for randomly distributed traffic data in urban road networks. IEEE Trans Veh Technol. 2025;74(5):7188–200.

[ref21] 21. Sengupta A, Mondal S, Das A, Guler SI. A Bayesian approach to quantifying uncertainties and improving generalizability in traffic prediction models. Transp Res Part C Emerg Technol. 2024;162:104585.

[ref22] 22. Reza S, Ferreira MC, Machado JJM, Tavares JMR. A citywide TD‐learning based intelligent traffic signal control for autonomous vehicles: performance evaluation using SUMO. Expert Syst. 2025;42(1):e13301.

[ref23] 23. Medvei MM, Bordei A-V, Niță ȘL, Țăpuș N. DeepSIGNAL-ITS—Deep learning signal intelligence for adaptive traffic signal control in intelligent transportation systems. Appl Sci. 2025;15(17):9396.

[ref24] 24. Kan H, Li C, Wang Z. Enhancing urban traffic management through YOLOv5 and DeepSORT Algorithms within Digital Twin Frameworks. MITS. 2024;3(1):39–54.

[ref25] 25. Liu G, Shi H, Kiani A, Khreishah A, Lee J, Ansari N, et al. Smart traffic monitoring system using computer vision and edge computing. IEEE Trans Intell Transport Syst. 2022;23(8):12027–38.

[ref26] 26. Alkarim AS, Al-Malaise Al-Ghamdi AS, Ragab M. Ensemble learning-based algorithms for traffic flow prediction in smart traffic systems. Eng Technol Appl Sci Res. 2024;14(2):13090–4.

[ref27] 27. Khadka S, Wang P, Li P, Mattingly SP. Automated traffic signal performance measures (ATSPMs) in the loop simulation: a digital twin approach. Transp Res Rec. 2024;2679(1):2129–46.

[ref28] 28. Llagostera-Brugarola E, Corpas-Marco E, Victorio-Vergel C, Lopez-Aguilera E, Vázquez-Gallego F, Alonso-Zarate J. A Digital Twin for intelligent transportation systems in interurban scenarios. Appl Sci. 2025;15(13):7454.

[ref29] 29. Li T, Bian Z, Lei H, Zuo F, Yang YT, Zhu Q, et al. Digital twin-based driver risk-aware intelligent mobility analytics for urban transportation management. arXiv:2407.15025 [Preprint]. 2024.

[ref30] 30. Fu Y, Turkcan MK, Anantha V, Kostic Z, Zussman G, Di X. Digital twin for pedestrian safety warning at a single urban traffic intersection. 2024 IEEE Intelligent Vehicles Symposium (IV); IEEE; 2024. p. 2640–5.

[ref31] 31. Chen Y, Zhang Q, Yu F. Transforming traffic accident investigations: a virtual-real-fusion framework for intelligent 3D traffic accident reconstruction. Complex Intell Syst. 2024;11(1):76.

[ref32] 32. Chen Y, Zheng L, Tan Z. Roadside LiDAR placement for cooperative traffic detection by a novel chance constrained stochastic simulation optimization approach. Transp Res Part C Emerg Technol. 2024;167:104838.

[ref33] 33. Yan J, Cheng Y, Zhang F, Li M, Zhou N, Jin B, et al. Research on multimodal techniques for arc detection in railway systems with limited data. Struct Health Monit. 2025:14759217251336797.

[ref34] 34. Wang X, Jiang H, Zeng T, Dong Y. An adaptive fused domain-cycling variational generative adversarial network for machine fault diagnosis under data scarcity. Inf Fusion. 2026;126:103616.

[ref35] 35. Wang H, Song Y, Yang H, Liu Z. Generalized Koopman neural operator for data-driven modeling of electric railway pantograph–catenary systems. IEEE Trans Transp Electrific. 2025;11(6):14100–12.

[ref36] 36. Sun H, Tang X, Lu J, Liu F. Spatio-temporal graph neural network for traffic prediction based on adaptive neighborhood selection. Transp Res Rec. 2023;2678(6):641–55.

[ref37] 37. Luo Q, He S, Han X, Wang Y, Li H. LSTTN: a long-short term transformer-based spatiotemporal neural network for traffic flow forecasting. Knowl Based Syst. 2024;293:111637.

[ref38] 38. Karim S, Mehmud M, Alamgir Z, Shahid S. Dynamic spatial correlation in graph wavenet for road traffic prediction. Transp Res Rec. 2023;2677(7):90–100.

[ref39] 39. Smart Traffic Management Dataset. Kaggle. Available from: https://www.kaggle.com/datase/smart-traffic-management-dataset

[ref40] 40. Kwak S. PEMS-BAY [Data set]. Zenodo; 2020. https://doi.org/10.5281/zenodo.4263971
