Figures
Abstract
Given the ongoing expansion of urban traffic signal control coverage, it is imperative to systematically delineate sub-regions based on the topology of road networks and traffic flow characteristics to enhance the stability of traffic management systems. To enhance the rationality of dynamic control sub-region division, this paper establishes a traffic-mechanism-driven edge weight construction framework integrated with modilarity-based community detection. This model integrates traffic volume, signal cycle, and traffic density to compute a comprehensive correlation degree between adjacent intersections. Furthermore, an entropy-weighted TOPSIS framework is proposed to objectively evaluate the proximity of neighboring nodes. This proximity measure is subsequently utilized as the edge weight within an improved Newman fast partitioning algorithm, thereby augmenting modularity and guiding the delineation of sub-regions. Empirical results indicate that while the conventional Newman algorithm exhibits limited adaptability to complex traffic patterns, the proposed model yields more distinct and precise sub-region divisions that more accurately reflect actual traffic conditions. Consequently, this methodology provides a robust foundation for the development of more effective signal control strategies.
Citation: Tian X, Ding J, Liu H, Deng H (2026) An improved Newman fast division algorithm based on multi-factor correlation for dynamic traffic sub-region control. PLoS One 21(3): e0343245. https://doi.org/10.1371/journal.pone.0343245
Editor: Rui Cheng, Guilin University of Electronic Technology, CHINA
Received: September 23, 2025; Accepted: February 3, 2026; Published: March 12, 2026
Copyright: © 2026 Tian et al. This is an open access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited.
Data Availability: All relevant data are within the paper and its Supporting Information files.
Funding: This work was supported by the Jilin Province Science and Technology Development Plan Project under Grant YDZJ202301ZYTS291 and the Jilin Provincial Education Department of 2024 Outstanding Youth Project under Grant JJKH20240386KJ. The funders had no role in study design, data collection and analysis, decision to publish, or preparation of the manuscript.
Competing interests: NO authors have competing interests Enter: The authors have declared that no competing interests exist.
1 Introduction
In response to the persistent process of urbanization, traffic congestion has become increasingly prevalent. The continual expansion of roadway infrastructure, coupled with the sustained rise in vehicle numbers, necessitates the implementation of regionally coordinated traffic control strategies. However, traditional global unified signal control exhibits limitations when applied to contemporary traffic networks. Partitioning signal control into smaller sub-regions can significantly enhance traffic management effectiveness. The concept of partitioning signal control into sub-regions was initially proposed by Walinchus et al. [1]. In circumstances where the network layout is irregular, and traffic patterns within the region exhibit heterogeneity, network decomposition may be justified. Consequently, sub-regions are categorized as either fixed sub-regions or real-time sub-regions.
Generally, signal control sub-regions are classified into static and dynamic categories. While static sub-regions rely on fixed standards or historical data, they often exhibit limited applicability in real-time management. Consequently, current research predominantly emphasizes dynamic sub-regions. Hu Yawen et al. [2] employed traffic data to analyze the interconnection relationships between intersections, thereby reducing delays and enhancing the efficiency of primary roads. Liu Chang et al. [3] introduced a two-layer framework utilizing trajectory data, resulting in more precise and effective clustering. Zhang Man et al. [4] proposed a Dirichlet-based algorithm that redefines density parameters for dynamic segmentation, outperforming traditional static methods. Lin Shu et al. [5] developed a demand balance model aimed at optimizing traffic flow and facilitating real-time delineation of sub-regions.
The correlation among intersections encapsulates their spatial arrangement and traffic dynamics, thereby playing a crucial role in regional coordination and delineation. Key variables such as traffic volume, signal signal cycle, queue lengths, and traffic density substantially influence correlation metrics. For example, Shen et al. [6] utilized fuzzy logic techniques to estimate correlations based on road distance, vehicle flow, and signal cycle, achieving reductions in travel time and stoppages. Bie et al. [7] identified variations in signal cycles, link lengths, and flow discrepancies as critical factors, which informed the development of a robust correlation model for regional delineation. Yagoda et al. [8] proposed real-time correlation computation methods to synchronize traffic signal phases and dynamically adjust boundary offsets. Zhou et al. [9] introduced a rapid regional partitioning approach employing modular optimization, demonstrating empirical efficacy. Wang et al. [10] proposed an adaptive framework incorporating a correlation index (CI), which exhibited strong performance in regional division tasks. Zhai et al. [11] calculated path flows and correlation degrees based on predefined thresholds, effectively reducing network delays.
Researchers have developed advanced algorithms for regional road network segmentation based on graph theory principles. Shiling Li et al. [12] used the Louvain algorithm for more accurate sub-region division. Duanyang Liu et al. [13] combined Pearson correlation and normalization to enhance partitioning with the snake algorithm. Yan Xing et al. [14] integrated a boundary index model with the enhanced Newman algorithm and spectral clustering, improving metrics like delay and stop count. Jing Luo et al. [15] proposed a dynamic partitioning method merging static division with traffic flow density, validated through modeling. Lu Zhang et al. [16] introduced a dynamic control sub-region division based on an improved Label Propagation Algorithm, reducing delays and travel times. pan et al. [17] applied k-means clustering and the METIS algorithm for data-driven traffic segmentation. Fei Yan et al. [18] partitioned traffic networks into sub-regions using coherence measures, α-Cut, and Moran's I, capturing spatiotemporal variations. Xiaohui Lin et al. [19] used Canopy-K-Means for real-time segmentation with high performance. Xiujuan Tian et al. [20] employed node importance and an improved Newman algorithm for better segmentation.
Research primarily differentiates between static and dynamic segmentation techniques. Static methods employ fixed criteria but exhibit reduced effectiveness in scenarios characterized by rapidly fluctuating traffic conditions. Consequently, advanced approaches such as trajectory refinement, density peak analysis, or the solution of Dirichlet problems are employed. However, challenges persist, including dependence on a limited set of similarity factors, reliance on manually set thresholds, and inadequate modeling of complex roadway networks. Graph-based methodologies, notably Louvain and spectral clustering algorithms, encounter difficulties related to subjective weighting schemes, insufficient integration of multiple influencing factors, and suboptimal modeling of node importance. These limitations contribute to boundary instability and impede application in real-time traffic management.
Addressing the aforementioned limitations, this research focuses on the construction of edge weights driven by traffic mechanisms and their integration into a modularity-based Newman fast partitioning algorithm for dynamic sub-region management. The proposed correlation model integrates multi-dimensional features from adjacent intersections, effectively eliminating the reliance on simplistic single-factor similarity measures. An entropy-weighted TOPSIS framework is employed to objectively determine weights and evaluate proximity across multiple factors, thus reducing dependence on manual threshold calibration and subjective weighting schemes. Additionally, the incorporation of node importance into the construction of edge weights and the optimization process significantly enhances the algorithm's sensitivity and interpretability concerning topologically critical nodes. The research further examines the computational complexity of the proposed algorithm and validates its effectiveness through comparative analyses against alternative clustering methods, including the improved Louvain, Label Propagation, and spectral clustering algorithms, utilizing real-world road network data. Evaluation metrics encompass computational efficiency, stability, robustness to noise, and control effectiveness, exemplified by measures such as delay reduction and minimization of stoppages.
The core innovation of this paper lies in the traffic-mechanism-driven edge weight construction integrated with modularity-based community detection, with the primary contributions summarized as follows:
- (1). This paper proposes an enhanced correlation model that integrates multi-source traffic features, employing entropy-weighted TOPSIS for objective weighting of factors and proximity analysis to mitigate the subjectivity associated with manual threshold settings.
- (2). The Newman fast partitioning framework strategically modifies edge weights based on node significance and incorporates a module optimization mechanism. It employs an enhanced algorithm aimed at increasing accuracy and protecting essential nodes.
- (3). The quality of partitioning, enhancements in traffic management, and overall efficiency of the proposed method were empirically validated using real-world case studies. Additionally, its robustness and adaptability were evidenced through comprehensive ablation analyses.
The following section explains the construction and implementation of the improved correlation model and the Newman algorithm, compares and evaluates the method's performance using simulations and real-world cases, and discusses limitations and deployment recommendations.
2 Research on the improved correlation degree model
This study quantifies the interaction strength between adjacent intersections by integrating invariant spatial distance with dynamic traffic parameters: volume, signal cycle, and density. These factors establish a logical framework for sub-region partitioning. As illustrated in Fig 1, these connectivity parameters fluctuate according to real-time traffic states. Equation (3) calculates the aggregate correlation using a weighted geometric mean, incorporating normalization and lower-bound smoothing to ensure numerical stability across varying conditions.
To ensure conceptual clarity, this study establishes a hierarchical transformation pipeline for the core metrics. The Correlation Degree () quantifies the interaction strength between adjacent intersections derived from individual traffic factors, including volume, signal cycle, and density. These multi-dimensional factors are subsequently fused into the Relative Closeness (
) through an entropy-weighted TOPSIS framework, providing an objective evaluation of node pairs relative to an ideal coordination state. This comprehensive index is ultimately formulated as the Edge Weight (
) through a symmetric function, serving as the primary graph-theoretic input for the improved Newman algorithm to optimize modularity and delineate dynamic traffic sub-regions.
This hierarchical structure ensures that this methodology objectively transforms raw traffic dynamics into stable structural weights, facilitating the integration of traffic mechanisms into modularity-based network partitioning.
2.1 Construction of comprehensive correlation calculation model
Step 1. Perform min–max normalization on each raw indicator , refer to Equation (1).
where denotes the normalized correlation for the
-th feature (traffic volume, signal cycle, or density), with values closer to 1 indicating stronger correlation. The terms
and
represent the maximum and minimum values of the
-th indicator within the time window, respectively.
Step 2. To safeguard against the overall correlation approaching zero in instances where a subterm is exactly zero during multiplication, and to improve numerical stability and robustness to measurement noise, we introduce a lower truncation (smoothing term). Selecting the smoothing parameter is vital for model fidelity. Values exceeding 0.15 diminish discriminative power, while those below 0.01 compromise numerical stability. Adopting
balances sensitivity to traffic variations with mathematical robustness, refer to Equation (2).
where represents the truncated sub-association degree of category k, with a range of
.
denotes the smoothing lower bound (safety buffer), to avoid numerical instability caused by weights approaching zero, a smoothing term [21] is introduced, with a value range of
.
Step 3. Construction of the comprehensive cascade correlation model.
where is the intersection
and
the degree of traffic correlation between them. For intersections, the signal cycle correlation degree between them.
is the correlation degree of traffic flow density between intersections
,
.
,
,
is the correlation degree weight coefficients are respectively, and their values are determined by the entropy weight method [22]. The method allocates feedback weights based on the index's information entropy. Specifically, lower values of information entropy correspond to higher feedback weights, whereas higher values of information entropy are associated with reduced weights.
Equation (3) employs a multiplicative model to capture the bottleneck effect across traffic demand , signal cycle feasibility
, and spatial connectivity
. Unlike additive frameworks where high volume may mask cycle incompatibility, this research utilizes a product approach to ensure aggregate correlation
is dictated by the weakest functional link. This logic provides a rigorous filter for sub-region aggregation by excluding intersections with fundamentally incompatible operational states.
In systems engineering, a product relationship accurately measures the effectiveness of such serial structures. Unlike a weighted sum, this model ensures that if any critical factor approaches zero—for instance, due to insufficient traffic volume, mismatched signal cycles, or sparse vehicle density—the overall coordination benefit nullifies. This captures the cascading effect where the performance of the interconnected micro-system is limited by its weakest component. Consequently, intersections lacking any of these essential correlations are correctly excluded from the same sub-region.
The diagram depicting the two-way roadway segment between successive intersections. In cases where one direction of the two-way segment necessitates coordination and control, it is imperative that both directions are subjected to coordination and control measures. Therefore, the two-way road section system can be divided into two subsystems, denoted as and
, as shown in Fig 2. The road section correlation from intersection
to
is
, and from
to
is
. The overall correlation degree is taken as the maximum correlation degree between the two subsystems, as shown in Equation (4).
2.2 Traffic correlation degree
Traffic flow is inherently linked to intersection correlation. An increase in traffic volume reduces available driving space and prolongs queue lengths downstream, thereby exerting a negative influence on upstream conditions. The Whitson model, as outlined in the American Traffic Control System manual [23], quantitatively assesses this correlation, as represented in Equation (5).
Here, represents the travel time between intersections (in minutes).
defines the maximum traffic volume (vehicles per hour) at the upstream intersection
,
including
,
,
.
is the traffic volume of the lane group of the entrance lane at the downstream intersection
(vehicles per hour).
represents the number of associated flow directions at the upstream intersection
of the entrance lane of the intersection.
Accurately assessing the correlation between adjacent intersections requires accounting for the discrete nature of traffic flow as it propagates downstream. While the standard Whitson model characterizes the relationship between correlation degree, inter-intersection distance, and flow rate, it lacks an explicit mechanism for flow discreteness. To bridge this gap, we enhance the Whitson model by integrating a queue dispersion model and the interconnection expectation index [24]. Furthermore, as formulated in Equation (6), the temporal variable is redefined as either the travel duration from the upstream intersection to the downstream queue tail or the mean travel time under free-flow conditions at the entry point.
where represents the flow correlation degree between intersections
and
.
is the number of inflow and flow directions at the upstream intersection.
represents the maximum traffic volume (vehicles per hour) at the upstream intersection
, and
includes
,
,
.
represents the traffic volume (vehicles per hour) of the lane group at the
entrance of the downstream intersection.
represents the time from the upstream intersection to the end of the queue at the downstream intersection or the average driving time (in minutes) when no vehicle queues at the entrance. The calculation of time
is as shown in Equation (7).
where is the distance (m) between adjacent intersections.
represents the average queue length (m) at the downstream intersection.
represents the average speed of traffic flow on the road section.
To account for the bidirectional nature of traffic between adjacent intersections, the correlation degree must incorporate both flow directions. As formulated in Equation (8), we select the maximum correlation value between the two directions to represent the definitive coupling strength between neighboring intersections.
where represents the traffic flow direction from the upstream intersection
to the downstream intersection
.
represents the traffic flow direction from downstream intersection
to upstream intersection
.
Traffic volume is a primary determinant of accuracy in both grey correlation and probabilistic graphical models. Fig 3 illustrates the 3D distribution of these parameter interrelations, providing the quantitative and visual evidence necessary to substantiate the model analysis.
2.3 Signal period correlation degree
Signal cycle disparities between adjacent nodes significantly impact network connectivity. To maintain stable phase relationships, sub-region nodes should be synchronized with the primary node or the regional maximum cycle length. Although Webster’s formula suggests that extending cycles enhances capacity, it simultaneously increases delays; therefore, uniform cycles are essential for stabilizing phase differences and optimizing coordinated control. These variations are critical parameters when delineating sub-divisions. The relationship between intersection correlation and the period duration factor [25] is formalized in Equation (9), which incorporates cycle length as a fundamental variable.
where represents the correlation degree of the signal cycle duration.
and
are the signal cycle durations of intersection
and intersection
respectively.
is the maximum of the signal cycle durations of the two.
is the one with the smallest signal cycle duration between the two.
As a fundamental parameter, the signal cycle facilitates both correlation modeling and sub-region delineation. It provides the essential input for regional coordinated control, directly influencing the roadway network's operational efficiency. Fig 4 presents a waterfall chart mapping these signal cycle dynamics, illustrating its behavior across spatial and temporal dimensions.
2.4 Correlation degree of traffic flow density
Sub-region delineation requires an integrated analysis of traffic flow, signal cycle duration, and road segment density. As a primary indicator of congestion, segment density determines the necessity for signal coordination between adjacent intersections. The correlation degree relative to this density [26] is quantified through Equations 10‒11.
where represents the density correlation degree between intersections
and
.
represents the traffic flow density of the section from intersection
to
.
represents the traffic flow density of the section from intersection
to intersection
in the saturated state.
represents the total flow from intersection
to intersection
within a specific period of time.
represents the number of lanes from intersection
to intersection
.
represents the length (m) of the section of road between adjacent intersections
and
.
Traffic flow density is a fundamental parameter for quantifying road space utilization, saturation, and the operational efficiency of segments between intersections. To analyze the interrelationships among density, volume, and cycle duration, we developed the mapping presented in Fig 5. This waterfall chart provides a precise visualization of the underlying spatiotemporal correlation dynamics within the model.
3 Construction of edge weights via entropy–TOPSIS correlation model
The preceding section analyzed the correlation levels between intersections utilizing variables such as traffic volume, signal cycle, and density. This section introduces the entropy weight-TOPSIS methodology to establish a composite index for graph-based partitioning. The model computes a closeness measure functioning as an edge weight, thereby facilitating the enhanced application of the Newman fast division algorithm.
Nodes constitute fundamental components within the network architecture, with their interconnections fundamentally influencing the network's efficiency and structural configuration. The ranking of node importance facilitates the identification of pivotal nodes through methodologies such as PageRank [27], NI-LPA [28], and importance matrices [29]. These algorithms, which depend on topological properties, face limitations when applied to real-time data and are often incapable of managing high-frequency dynamism. In transport networks, the strength of the links (edges) is of paramount importance. This research endeavors to develop dynamic, real-time edge weights that precisely mirror traffic conditions, addressing the inflexibility inherent in traditional static methods. The study utilizes the entropy weight-based TOPSIS algorithm, incorporating multi-dimensional traffic data to enable real-time updates. Fig. 6 depicts the model workflow.
3.1 Edge weight determination in complex networks: A multidimensional correlation entropy-weighted TOPSIS model
This research utilizes the entropy-weighted TOPSIS methodology not only as a comprehensive evaluation framework but also specifically to ascertain edge weights between adjacent intersections, quantifying their interrelations. By incorporating traffic flow, signal cycle, and traffic density as evaluation metrics, the approach applies objective weighting procedures and calculates the relative closeness to illustrate the mutual influence of traffic conditions across different intersections through edge weights. This procedure establishes a foundation for subsequent correlation analyses and network modeling. The principal objective of the model is to compute a relative closeness metric, which functions as the weight of the edge linking two intersections (nodes), providing critical input for the subsequent application of the refined Newman fast division algorithm.
The methodology for determining edge weights utilizing the entropy weight-based TOPSIS model is delineated as follows.
- 1). Step 1. Construct the evaluation matrix. Using adjacent intersections as samples, a decision matrix
is built, where
represents the number of samples and
(corresponding to the correlation degrees of traffic flow, signal cycle, and traffic density). All three indicators are viewed as positive attributes, meaning higher values indicate stronger correlations. Construct decision matrix values as shown in Table 1.
where denotes the performance value of the
-th alternative with respect to the
-th criterion. The matrix functions as the foundational dataset for the model, with each row representing a pair of neighboring intersections and each column denoting a correlation indicator. The first column indicates traffic volume correlation
, the second column indicates signal cycle correlation
, and the third column reflects traffic density correlation
.
- 2). Step 2. Construction of a standardized matrix. Standardization procedures are implemented to ensure data comparability and to differentiate between positive and negative indicators. The matrix undergoes standardization, and subsequent processing involves the application of equations such as (14) and (15).
where is the standard matrix in the equation.
is the parameter in the
-th row and
-th column of the standard matrix.
represents the process of normalizing the parameters of the
-th row and
-th column of the evaluation matrix is carried out, and the normalized data
is brought into the entropy weight method. Given that the three correlation indicators may differ in scale and magnitude—such as traffic volume, which is generally higher than density—this phase involves the standardization process to eliminate dimensional effects and facilitate comparability among indicators. This procedure ensures that, in subsequent comprehensive assessments, no single indicator exerts disproportionate influence on the results due to its raw numerical value, thereby promoting a more balanced evaluation.
- 3). Step 3. Determination of the entropy index weight. Initially, the feature proportion pertaining to the objective function is computed. Consistent with the principles of entropy [30–32], an increase in informational content corresponds to a reduction in entropy value. Accordingly, the entropy weight serves as an indicator of the amount of information conveyed. The relevant calculation formula is presented in Equation (16).
where represents the feature proportion under the
-th row and the
-th column.
is the parameter in the
-th row and
-th column of the standard matrix.
Secondly, calculate the entropy value under the
indicator. The calculation formula is as shown in Equation (17).
Here, represents the information entropy value.
represents the feature proportion under the
-th row and the
-th column.
Finally, calculate the weights and the calculation formula, such as equations (18)~(20).
where is the information entropy value in the formula.
is the weight,
.
is the entropy weight matrix.
The entropy weight method objectively determines indicator weights by quantifying dispersion via entropy, which reflects the amount of information conveyed by an indicator. Indicators exhibiting high variability, such as traffic volume, possess low entropy and consequently are assigned higher weights, whereas more stable indicators, like signal signal cycle, have higher entropy and thus lower weights. For example, when traffic volume demonstrates significant variation across intersections while signal cycles remain relatively constant, the method assigns a greater weight to “traffic volume” due to its superior capacity to differentiate among intersections. The entropy weight table is presented in Table 2.
- 4). Step 4. Construction of the Weighted Average Evaluation Matrix. The standardized evaluation matrix, in conjunction with the entropy weight matrix derived via the entropy weight method, is employed to assign weights to the matrix, thereby computing the weighted evaluation matrix. The corresponding calculation procedures are detailed in Equations 21~22.
where in the formula is the weighted evaluation matrix.
is the parameter of the
-th row and
-th column of the weighted matrix.
is the parameter in the
-th row and
-th column of the standard matrix.
is the weight,
.This step combines the standardized data with objective weights. Each element
in matrix
, this metric denotes the weighted normalized correlation value, reflecting both the positional relationship of the intersection relative to a specific indicator and the indicator's significance within the comprehensive evaluation of the current road network.
- 5). Step 5. Calculate the weighted standardized ratings to determine the positive and negative ideal decision schemes
and
respectively. The calculation formulas are as shown in equations (23)~(24).
where represents a positive ideal decision scheme in the formula.
represents a negative ideal decision scheme.
. The positive ideal solution
represents an ideally optimal adjacent intersection pair that achieves the best values across all three correlation indicators, unlike the negative ideal solution.
corresponds to an ideal worst-case adjacent intersection pair with the lowest values in all three indicators.
- 6). Step 6. The Euclidean distance is employed to quantify the separation between each node and the positive and negative ideal solutions. This methodological approach facilitates the calculation of the closeness coefficient. The corresponding mathematical expressions are delineated in equations (25)~(27) .
where represents the relative closeness of node
, which is defined as the edge weight
for the connection between node
and its adjacent nodes.
represents the gap between the
-th pair of actual intersections and the ideal intersection pair in terms of comprehensive traffic characteristics. A smaller distance indicates better performance of the intersection pair.
denotes the gap between the i-th pair of actual intersections and the worst intersection pair. A larger distance indicates the intersection pair is less unsatisfactory. The relative closeness
is a value between 0 and 1 indicating how close the intersection pair is to perfection or the worst case.
is a value near 1 indicates a stronger intersection correlation and closer to the ideal.
- 7). Step 7. Construct Edge Weights
. Define the weight
of any connecting edge
between two adjacent intersections
as a symmetric function of the relative proximity
and
between the two nodes, using the arithmetic mean as follows:
Relative closeness measures how close a scheme is to optimal or worst outcomes [33]. A higher edge weight indicates greater similarity between intersections regarding traffic, signals, and density, reflecting their compatibility. Grouping these in the same control zone with coordinated strategies like green wave synchronization can enhance traffic management. This metric is vital in defining community boundaries in the enhanced Newman algorithm, as shown in Table 3.
Relative closeness evaluates the proximity of a configuration to the ideal solution, serving here as the basis for edge weights. These weights quantify the similarity between adjacent intersections by integrating variations in traffic flow, signal cycles, and density. Within our enhanced Newman algorithm, higher edge weights drive the modularity calculation, signaling stronger node connections and a higher probability of sub-region aggregation. By functioning as a dynamic similarity measure, this weighted approach improves partitioning accuracy over static methods by effectively capturing real-time traffic dynamics.
4. Dynamic control sub-region division based on the improved Newman fast division algorithm
Utilizing a graph-theoretic abstraction, intersections are represented as nodes, while the connecting road segments between adjacent intersections are depicted as edges within the network. The correlation degree index is incorporated into the TOPSIS model, which employs the entropy weight method to quantify relative closeness. This measure of relative closeness is subsequently used as the edge weight between nodes in an enhanced version of the Newman fast partitioning algorithm. Consequently, this paper articulates an improved sub-region division methodology centered on traffic-mechanism-driven edge weights and modularity optimization.
4.1 Newman's Fast Partitioning algorithm
Conventional algorithms such as the Kernighan-Lin (K-L) method [34] and spectral bisection [35] require prior knowledge of the number of communities, which limits their applicability in practical scenarios. The Girvan-Newman algorithm demonstrates high computational complexity, thereby restricting its utility in analyzing complex networks. The Fast Newman (FN) algorithm, developed by Newman et al. [36,37] and based on the Girvan-Newman (GN) approach [38], employs a greedy local search strategy aimed at maximizing modularity, a quantitative metric used to evaluate the quality of a given partition. A higher modularity value signifies a more optimal community division.
The FN methodology assesses variations in the modularity-added value both prior to and subsequent to the formation of network communities. Nodes are successively aggregated based on associated increases in modularity, with the iterative process ceasing once the entire set of nodes has been unified. A dendrogram, generated at the point of peak modularity, depicts the connectivity among nodes and thus delineates the optimal partitioning of sub-regions. The procedural steps of the algorithm are detailed below.
- 1). Step 1.The preliminary stage entails abstracting the road network into a topological graph. The intersections within the network are quantitatively characterized, and the area is subdivided into m discrete communities, each represented by a node symbolizing a sub-region.
- 2). Step 2. Construct the
-order symmetric auxiliary matrix
, where element
is the proportion of the number of connections between communities
and
to the total number of edges in the network. Establish an array
, whose element
is the proportion of the number of edges connected between the internal nodes and external nodes of sub-region
to the total number of edges in the network, and the number of elements in array
is equal to the total number of nodes in the network. Where initialize
and
to satisfy equations (29)~(30), and construct the matrix
and array
as equations (31)~(32).
where represents the proportion of the number of connections between communities
and
to the total number of edges in the network.
represents the proportion of the number of edges connected between the internal nodes and external nodes of sub-region
to the total number of edges in the network.
is the degree of node
, that is, the number of edges connected to node
.
is the total number of edges of the connected nodes in the network.
- 3). Step 3. Merge two communities according to their similarity or Euclidean distance. The equation used to calculate the modularity increment is presented in Equation (33).
where in the formula represents the increment of modularity.
- 4). Step 4. Repeat the process from Step 2 to Step 3 until the entire network merges into a single sub-region. The algorithm concludes, and no more than one merge is necessary.
The process of aggregating nodes into communities within the network is depicted through a hierarchical tree diagram. The partitioning of this tree, which illustrates the node connection relationships corresponding to the maximum modularity Q, signifies the optimal method for segmenting the network.
4.2 Improved Newman fast partitioning algorithm
The Newman fast partitioning algorithm, originally designed for unweighted networks, has been improved for weighted networks by considering the relative closeness degree derived from the correlation degree index input into the TOPSIS algorithm with entropy weights to construct edge weights for dividing sub-regions. Assuming that the network is divided into communities, denoted as
, the symmetric matrix
is defined, as equation (34), and the modularity calculation formula is redefined, as equation (35).
where is the sum of the edge weights of communities
and
, indicating the ratio of edge weights within the sub-region to those in the network.
is the sum of the elements in each row of the matrix, representing the ratio of the edge weights of the connections between the internal nodes and external nodes of sub-region
to the network edge weights.
is the sum of the elements on the diagonal of matrix
, representing the proportion of edge weights within the sub-region to all edge weights.
is the sum of all the elements of matrix
.
The improved Newman fast partitioning algorithm considers the correlation degree between network nodes and uses TOPSIS's relative closeness degree, based on entropy weights, as edge weight. Therefore, the elements and
in matrix
are improved, as shown in equations (36)~(37).
where is the weight of the edge connecting nodes
and
.
The enhanced Newman fast partitioning algorithm advances the conventional approach by integrating multiple correlation indicators into the TOPSIS framework with entropy-based weighting, thereby replacing the reliance on a single correlation measure and facilitating a more objective allocation of weights.
4.3 Dynamic subdivision process
An accurate and precise partitioning of the transportation network is essential for the effective governance of sub-regions. The traffic network, characterized as a complex system, is segmented into control sub-regions through an enhanced Newman fast division algorithm that utilizes a measure of relative closeness as the edge weight. This approach ensures that the resulting sub-regions more accurately represent the operational characteristics of the road network. The detailed division procedure, as illustrated in Fig 7, comprises several sequential steps.
- 1). Step 1. Represent intersections as nodes and road segments connecting them as edges. Assign a unique label to each node.
- 2). Step 2. Utilize traffic flow data, including metrics such as traffic volume and signal timings, collected through traffic detection devices, along with fundamental road network information, to compute the correlation between adjacent intersections using the correlation degree model.
- 3). Step 3. Input the computed correlation coefficient between adjacent intersections into the TOPSIS algorithm employing the entropy weight method. The entropy weight method is used to determine the weights of the indices, while TOPSIS calculates the relative closeness degree. Finally, construct the edge weight matrix utilizing the obtained relative closeness degrees.
- 4). Step 4. Utilize intersections as nodes and their spatial proximity as edge weights, subsequently inputting this data into the advanced Newman fast partitioning algorithm. Calculate the modularity, iteratively merge nodes to optimize it, and select the partition that maximizes modularity as the basis for traffic control.
5 Model verification
Model validation was conducted using traffic flow data from the Changchun Traffic Management Platform. The study network comprises 18 signalized intersections and 38 segments, featuring grid and radial configurations with significant spatiotemporal heterogeneity and tidal fluctuations. For analysis, the network was abstracted into a topological graph where signalized intersections are represented as nodes and road segments as edges, as illustrated in Fig 8.
5.1 Correlation degree calculation
Utilizing open network data and Autonavi Map, information such as the distances between adjacent intersections and the number of lanes is systematically collected for validation purposes, as detailed in Table 4. Publicly available data are also analyzed to determine the cycle durations and traffic volumes at intersections during the period from 17:00–18:00 on a specific day within the region, along with the distances between successive intersections, as depicted in Table 5. The correlations among intersections are quantified using a correlation degree model, and the corresponding weight coefficients are calculated via the entropy weight method implemented in MATLAB. The resulting correlation degree values are compiled in Table 6.
5.2 Intersection relative proximity value
Relative closeness is determined by calculating Euclidean distances to positive and negative ideal solutions, with node-specific results detailed in Table 7. These values function as edge weights in both the standard and enhanced Newman fast algorithms for sub-region partitioning. The heatmap in Fig 9 visualizes the spatial distribution of node performance relative to the network average of 0.52. In this representation, cooler colors indicate higher closeness with minimal variance, while warmer colors denote higher closeness with significant disparity. Specifically, Node 11 achieves the maximum closeness value of 0.592, identifying it as a pivotal junction with the strongest overall correlation in the network.
5.3 Results of traffic control sub-region division
To validate the proposed methodology, regional roadway networks were partitioned using the Newman fast division algorithm and its enhanced variant. This process utilized an intersection correlation degree model as input within a MATLAB environment. As shown in Fig 10, the traditional Newman algorithm divides the road network into four distinct control sub-regions. Sub-region 1 includes intersections 1, 2, and 3. Sub-region 2 encompasses intersections 4, 5, 10, 11, 14, and 15. Sub-region 3 consists of intersections 6, 7, and 18, while sub-region 4 contains intersections 8, 9, 12, 13, 16, and 17. The modularity values generated during this merging process are detailed in Fig 11, where the optimal partition occurs near node 12 with a modularity value of 0.215.
The study further introduces an improved Newman algorithm that incorporates an entropy weight-based TOPSIS model to calculate dynamic edge weights. These weights integrate multi-dimensional traffic features such as volume, signal timing, and density. The resulting network segmentation is illustrated in Fig 12, which also defines four sub-regions. In this refined model, sub-region 1 comprises intersections 1–4, sub-region 2 contains intersections 6, 7, and 18, sub-region 3 includes intersections 8, 9, 16, and 13, and sub-region 4 consists of intersections 10–15. The corresponding modularity variation curves are provided in Fig 13.
Analysis of the modularity metrics reveals that the enhanced algorithm achieves a maximum value of 0.666, representing a substantial improvement over the traditional value of 0.215. This absolute increase of 0.451 reflects a relative enhancement of nearly 210 percent, indicating a superior structural characterization of the traffic network. From a traffic engineering perspective, higher modularity signifies that sub-regions possess greater internal cohesion and fewer inter-regional connections. High internal homogeneity is vital for green-wave progression, ensuring sub-regions maintain correlated traffic rhythms. This alignment optimizes phase synchronization and reduces boundary delays by preventing control unit fragmentation during transient peak periods.
Furthermore, the integration of dynamic edge weights allows the algorithm to maintain robust interactions between nodes, preventing the artificial segmentation often seen in purely topological approaches. A final comparative analysis of the division outcomes is presented in Figs 14 and 15. These results demonstrate that the improved algorithm provides higher granularity by effectively delineating small-scale control units that capture local traffic similarity patterns. By subdividing large, heterogeneous clusters into more precise units—such as the allocation of nodes 10, 11, 14, and 15 to a specific sub-region—the methodology facilitates more responsive and decentralized traffic management in complex urban environments.
An analysis of the traffic control zones generated by both algorithms reveals that the enhanced Newman fast division algorithm yields more detailed and realistic results. The proposed algorithm optimizes network structure and alleviates congestion. Enhanced modularity fosters sub-region stability, while partitioning at minimal-correlation boundaries ensures consistent signal control and mitigates performance losses from signal offsets.
5.4 Evaluation of the division results of traffic control sub-regions
Various metrics are employed to evaluate sub-region segmentation algorithms, including modularity, computational complexity (both temporal and spatial), and benchmarking against graph datasets. Nonetheless, these metrics exhibit several limitations, such as limited adaptability and dependence on single evaluation criteria. In contrast, metrics such as division density and normalized mutual information (NMI) demonstrate greater flexibility and rigor. Given the absence of an unequivocal “true” traffic subdivision, a benchmark partition derived from multi-source information fusion was utilized as an NMI reference. This benchmark integrates link saturation clustering during peak traffic hours, road network connectivity, and traffic management insights. Comparing algorithm outputs with this benchmark allows NMI to serve as an effective measure of practical accuracy.
To distinguish between the results of the Newman Fast and the enhanced Newman Fast partitioning algorithms, this study utilizes normalized mutual information (NMI) and partition density as quantitative evaluation metrics.
5.4.1 Normalization information.
Normalized Mutual Information (NMI [39]) serves as an entropy-based metric for assessing the similarity between the outputs of algorithms and true sub-regions. Elevated entropy values correspond to increased fragmentation, with the metric ranging from 0 to 1. Values approaching 1 denote a strong correspondence with actual sub-regions, as delineated in Formula.
where represents the sub-region label output by the algorithm and the real sub-region label.
represents mutual information, and
and
measure the degree of information overlap.
and
are the entropies of two label sets.
5.4.2 Division density.
Partition density PD [40] quantifies the edge density within a sub-region, functioning as a weighted average of the edge densities across individual sub-regions. It serves as a metric for assessing sub-region quality: High PD indicates strong internal links, supporting efficient signal control like green waves, and sparse external connections, enabling independent optimization and simpler, more robust systems. Higher PD reflects dividing areas with high internal cohesion and low external coupling, aligning with traffic control goals. The corresponding formula is presented in (49).
The number of edges and nodes within the network and its sub-regions is a significant parameter. Partition density (PD) serves as a metric to evaluate the quality of the partitioning, where a higher PD reflects stronger internal connectivity, thereby facilitating efficient signal management such as green wave coordination. Additionally, a higher PD indicates fewer interconnections between sub-regions, which diminishes traffic flow coupling and promotes autonomous control strategies. Increased partition density minimizes inter-zonal coupling, facilitating independent signal optimization and preventing spatial delay propagation. This approach offers a decentralized management framework that effectively balances local control with network-wide stability.
5.4.3 Evaluation and analysis.
This study utilizes normalized mutual information (NMI) and partition density (PD) as metrics for evaluating partitioning results. NMI measures the similarity between algorithm-generated partitions and the true sub-region structure, as depicted in Fig 16. Upon the inclusion of node 16, the optimized Newman fast partitioning algorithm attains its maximum performance relative to the standard implementation. Prior to the addition of node 16, the enhanced algorithm more accurately corresponds to the actual sub-region partitioning than the original Newman algorithm, thereby demonstrating its superior efficacy.
Secondly, the division density PD evaluates the quality of sub-regional outcomes through the analysis of internal edges and connections. As depicted in Fig 17, following the merging to node 16, the improved Newman Fast Division algorithm attained its peak performance relative to the original algorithm. Prior to reaching node 16, it demonstrated consistent superiority over the original, thus indicating enhanced community detection capabilities. Consequently, the refined algorithm exhibits increased effectiveness.
By comparing the Newman Fast partitioning algorithm with its enhanced variant, which incorporates normalized mutual information (NMI) and partitioning density (PD) indices, it becomes apparent from the elevated NMI and PD values displayed in Figs 16 and 17 that the improved algorithm offers a more precise and higher-quality delineation of sub-region structures than the original Newman Fast algorithm. This refined delineation aligns more accurately with the actual boundaries of sub-regions and more effectively identifies high-quality community structures. These results confirm that superior partition density significantly reduces inter-regional coordination complexity. Such improvements foster stable, autonomous traffic management, allowing sub-regions to maintain optimal cycle lengths with minimal external interference.
5.5 Algorithm comparison analysis
To validate the efficacy of the enhanced Newman fast division algorithm, a comparative analysis was conducted involving four benchmark algorithms: an association-enhanced Louvain method, label propagation, spectral clustering utilizing boundary metrics, and Spinglass. These algorithms were evaluated under identical data and environmental conditions within a sub-region of a road network, with performance assessed through the Normalized Mutual Information (NMI) and Partition Density (PD) metrics. The findings are summarized in the accompanying Table 8.
Table 8 presents a comparative analysis of the enhanced Newman fast division algorithm against baseline methodologies—including the traditional Newman algorithm, the improved Louvain method, the Label Propagation Algorithm, and spectral clustering—evaluated using metrics such as modularity Q, normalized mutual information (NMI), and partition density (PD). All techniques employ identical edge weights derived from entropy weight-TOPSIS, ensuring that observed differences in partition quality are attributable solely to the underlying algorithms.
– illustrate that the improved Newman Fast Algorithm outperforms the baseline in metrics such as modularity, NMI, and PD, achieving a maximum Q value of 0.666 in comparison to 0.215. This substantiates the benefits of employing traffic-aware weighted edges.
To evaluate the robustness of the methodology, Gaussian noise with a variance of ±5% was introduced to the input traffic features. Subsequent experiments were conducted, as depicted in Fig 21. The findings suggest that the degradation in performance of the improved Newman algorithm, measured by NMI, PD, and modularity, is less significant than that observed in the baseline, thus indicating its superior stability under measurement noise conditions.
6 Conclusion
This paper presents a systematic methodology for partitioning traffic control sub-regions is presented to enhance urban arterial efficiency and reflect real-world flow patterns. This dynamic division technique integrates an advanced Newman fast partitioning algorithm with a multi-factor correlation model. By analyzing volume, signal timing, and density at adjacent nodes, a correlation framework utilizing entropy-weighted TOPSIS determines the relative closeness between intersections. These closeness values serve as edge weights for the community detection process.
Empirical results confirm superior accuracy in sub-region delineation and higher consistency with actual traffic conditions. Specifically, the enhanced Newman algorithm achieves a maximum modularity of 0.666, representing a significant improvement over the 0.215 benchmark obtained through traditional methods. This absolute increase of 0.451 validates the effectiveness of the integrated modeling strategy.
While the enhanced Newman algorithm demonstrates superior sub-region partitioning, future research will assess its scalability in larger urban networks and integrate more complex dependencies into the correlation model. Subsequent validation via VISSIM or SUMO simulations will facilitate rigorous comparative analyses against conventional methods, evaluating performance through metrics such as vehicle delay and throughput. These efforts will further substantiate the algorithm's practical efficacy and robustness in real-world urban traffic management.
Future research addresses current limitations by incorporating spatiotemporal dependencies among non-adjacent nodes to refine macro-level traffic pattern representations. Algorithm optimization involves investigating approximate methods and distributed computing frameworks to mitigate computational overhead in extensive networks. Comprehensive empirical validation on diverse, large-scale urban infrastructures ensures metropolitan-scale utility and real-world applicability.
Supporting information
S1 Data. Traffic indicators and algorithm performance evaluation data.
This file contains the raw traffic indicators for various intersections, modularity (Q) comparison results, and performance evaluation metrics including NMI and PD.
https://doi.org/10.1371/journal.pone.0343245.s001
(XLSX)
S2 Data. Calculation procedures and results of the Entropy Weight-TOPSIS model.
This file provides the details of weight allocation, normalization matrices, and edge weight calculation results.
https://doi.org/10.1371/journal.pone.0343245.s002
(XLSX)
S3 Data. Source data for traffic parameter visualizations.
This file includes the numerical datasets used to generate the waterfall charts for traffic volume, signal cycle, and traffic flow density.
https://doi.org/10.1371/journal.pone.0343245.s003
(XLSX)
S1 Code. MATLAB source code for the proposed algorithm and analysis.
This compressed archive contains the complete MATLAB scripts used to implement the improved Newman fast partitioning algorithm.
https://doi.org/10.1371/journal.pone.0343245.s004
(ZIP)
References
- 1. Walinchus R. Real-time network decomposition and subnetwork interfacing. Highw Res Rec. 1971.
- 2. Hu Y, Wang Y, Zhang J, Yu H. Correlation degree analysis of arterial adjacent intersections for coordinated control subunit partition. AI & M. 2018;10(1):1687814017748748.
- 3. Liu C, Niu X, Ma Y, Shao S, Wang B. TL-TSD: A two-layer traffic sub-area division framework based on trajectory clustering. Eng Appl Artif Intell. 2024;133:108365.
- 4. Zhang M, Yan F, Yan G, Li P. Dynamic partition of control sub-regions in road network based on Dirichlet problem. Comput Eng. 2020;46(12):21–26,35.
- 5. Lin S, Kong Q, Huang Q. A model‐based demand‐balancing control for dynamically divided multiple urban subnetworks. J Adv Transport. 2016;50(6):1046–60.
- 6. Shen G, Yang Y. A dynamic signal coordination control method for urban arterial roads and its application. Front Inf Technol Electron Eng. 2016;17(9):907–18.
- 7. Bie Y, Wang D, Qu X. Modelling correlation degree between two adjacent signalised intersections for dynamic subarea partition. IET Intelligent Trans Sys. 2013;7(1):28–35.
- 8. Yagoda HN, Principe EH, Vick CE, Leonard BJTE. Subdivision of signal systems into control areas. Inst Traffic Engr. 1973;43(12).
- 9.
Zhou Z, Lin S, Xi Y. A dynamic network partition method for heterogenous urban traffic networks. In: 2012 15th International IEEE Conference on Intelligent Transportation Systems. 2012. p. 820–5. https://doi.org/10.1109/itsc.2012.6338712
- 10. Wang L, Bie YJ. An adaptive model for calculating the correlation degree of multiple adjacent signalized intersections. JMPiE. 2013;2013(1):247184.
- 11. Zhai C, Wu W, Xiao Y. Modeling continuous traffic flow with the average velocity effect of multiple vehicles ahead on gyroidal roads. Digit Transp Saf. 2023;2(2):124–38.
- 12.
Li S, Luo J, Xu J. Study on regional traffic sub-area division based on improved Louvain algorithm and correlation degree. In: 2022 China Automation Congress (CAC). IEEE; 2022.
- 13. Liu D, Wang M, Shen G. A New Combinatorial Characteristic Parameter for Clustering-Based Traffic Network Partitioning. IEEE Access. 2019;7:40175–82.
- 14. Xing Y, Li W, Liu W, Li Y, Zhang Z. A Dynamic Regional Partitioning Method for Active Traffic Control. Sustainability. 2022;14(16):9802.
- 15. Luo J, Zhang Q. Subdivision of urban traffic area based on the combination of static zoning and dynamic zoning. Discrete Dyn Nat Soc. 2021;2021(1):9954267.
- 16. Zhang L, Pang MJ. Dynamic division of control subareas for highway networks based on improved label propagation algorithm. J Traffic Eng. 2024;150(9):04024048.
- 17. Pan F, Yang J, Zhang L, Ma C, Yang J, Zhang P. Impact of countdown signals on traffic safety and efficiency: a review and proposal. Digit Transp Saf. 2023;2(3):200–10.
- 18. Yan F, Zhang M, Shi Z. Dynamic partitioning of urban traffic network sub-regions with spatiotemporal evolution of traffic flow. Nonlinear Dyn. 2021;105(1):911–29.
- 19. Lin X, Xu J. Road network partitioning method based on Canopy-Kmeans clustering algorithm. AoT. 2020;54(2):95–106.
- 20. Tian X, Liang C, Feng TJ. Dynamic control subarea division based on node importance evaluating. JMPiE. 2021;2021(1):9923514.
- 21.
Press WH, Teukolsky SA, Vetterling WT, Flannery BP. Numerical recipes 3rd edition. 2007.
- 22.
Biró C, editor. Structural Sensitivity in Graphs: An Entropy-Based $ k $-Hop Metric and its Applications. 2025 IEEE 19th International Symposium on Applied Computational Intelligence and Informatics (SACI). IEEE; 2025.
- 23.
Gordon RL, Reiss RA, Haenel H, Case E, French RL, Mohaddes A. Traffic control systems handbook. United States: Federal Highway Administration; 1996.
- 24.
Chang ECP, editor. How to decide the interconnection of isolated traffic signals. In: Proceedings of the 17th conference on winter simulation. 1985.
- 25. Li P, Dong B, Li S. A Study of Adjacent Intersection Correlation Based on Temporal Graph Attention Network. Entropy (Basel). 2024;26(5):390. pmid:38785638
- 26. Bie Y-M, Wang L-H, Wang D-H, Song X-M. Strategy of dynamic traffic control subarea partition in urban road network. China J Highw Transp. 2013;26(6):157–68.
- 27. Shen Z-L, Jiao Y-H, Wei Y-K, Wen C, Carpentieri B. Efficient hybrid PageRank centrality computation for multilayer networks. Chaos Soliton Fract. 2025;192:116018.
- 28. Ben El Kouni I, Karoui W, Romdhane LB. Node Importance based Label Propagation Algorithm for overlapping community detection in networks. Exp Syst Appl. 2020;162:113020.
- 29. Zhou X, Zhang F-M, Li K-W, Hui X-B, Wu H-S. Finding vital node by node importance evaluation matrix in complex networks. Acta Phys Sin. 2012;61(5):050201.
- 30. Shannon CE. A mathematical theory of communication. Bell Syst Tech J. 1948;27(3):379–423.
- 31. Yan H, Li H, Sun Q, Jiang Y. Propagation and control of congestion risk in scale-free networks based on information entropy. PLoS One. 2024;19(3):e0300422. pmid:38517877
- 32. Wang D, Tian F, Wei DJ. Correlation analysis of combined layers in multiplex networks based on entropy. J P O. 2022;17(10):e0276344.
- 33. Xu X, Wang Q, Ding X, Chen T, Deng R. Evaluation approach of prefabricated components based on multi-layer complex network model combined with improved topsis method. PLoS One. 2025;20(5):e0322236. pmid:40333809
- 34. Kernighan BW, Lin S. An Efficient Heuristic Procedure for Partitioning Graphs. Bell Syst Tech J. 1970;49(2):291–307.
- 35. Arthur R. Detectability constraints on meso-scale structure in complex networks. PLoS One. 2025;20(1):e0317670. pmid:39841660
- 36. Newman ME, Girvan MJ. Finding and evaluating community structure in networks. Phys Rev E. 2004;69(2):026113.
- 37. Newman MEJ. Fast algorithm for detecting community structure in networks. Phys Rev E Stat Nonlin Soft Matter Phys. 2004;69(6 Pt 2):066133. pmid:15244693
- 38. Girvan M, Newman MEJ. Community structure in social and biological networks. Proc Natl Acad Sci USA. 2002;99(12):7821–6.
- 39.
Jerdee M, Kirkley A, Newman MJNC. Normalized mutual information is a biased measure for classification and community detection. 2025.
- 40. Cheng Q, Liu Z, Huang J, Cheng G. Community detection in hypernetwork via Density-Ordered Tree partition. Appl Math Comput. 2016;276:384–93.