Skip to main content
Advertisement
Browse Subject Areas
?

Click through the PLOS taxonomy to find articles in your field.

For more information about PLOS Subject Areas, click here.

  • Loading metrics

Evaluating global measures of network centralization: Axiomatic and numerical assessments

Abstract

Network centralization, driven by hub nodes, impacts communication efficiency, structural integration, and dynamic processes such as diffusion and synchronization. Although numerous centralization measures exist, a major challenge lies in determining measures that are both theoretically sound and empirically reliable across different network contexts. To resolve this challenge, we normalize 11 measures of network centralization and assess them systematically using an axiomatic framework and numerical simulations. Our axiomatic assessment tests each measure against the six postulates of centralization, ensuring consistency with minimal theoretical requirements. In addition, our numerical assessment examines the behavior of normalized centralization measures over different random graphs. Our results indicate major differences among the measures, despite their common aim of quantifying centralization. Together, our assessments point to the relative suitability of three measures: normalized betweenness centralization, normalized closeness centralization, and normalized degree centralization. Applying these three measures to real-world networks from diverse domains reveals meaningful variation in the organization of the networks with respect to hubs. Normalized betweenness centralization highlights path-based dominance; normalized closeness centralization reflects accessibility and efficiency of reach; and normalized degree centralization captures degree-based hub concentration. When used jointly, the three measures demonstrate the required sensitivity to varying levels of centralization and provide complementary aspects of network centralization that no single measure can offer alone. Our dual evaluation framework clarifies conceptual differences among existing measures and offers practical guidance for selecting reliable centralization metrics.

Introduction

Centralization is a fundamental structural property of complex networks, reflecting the extent to which one or a few nodes dominate in terms of centrality. Highly centralized networks tend to be organized around one or a few hubs: nodes with substantially higher degrees than the average node. This plays a critical role in shaping network structure and dynamics [15]. Such hubs can maintain efficient communication pathways, regulate system-level behavior, and influence diffusion processes including information spread, epidemic outbreaks, and synchronization dynamics [6,7]. Network hubs emerge in diverse contexts: influential users in social media [8,9], essential proteins in protein–protein interaction networks [10,11], hub airports in global transportation [12,13], and integrative brain regions in neural systems [14,15].

While node-level centrality measures such as degree centrality, betweenness centrality, or closeness centrality are effective for identifying individual hubs [5], they do not capture the global property of a network to have such dominant nodes. This network-level property, often referred to as centralization, is essential for understanding the extent to which the structure exhibits dominant hubs [1,16,17]. It is an open challenge to quantify centralization in a way that is theoretically sound, interpretable, and comparable across networks of varying size and density.

Several existing global centralization measures are based on centrality measures. Degree centralization [1], the most widely used centralization metric, compares the node degrees to that of a maximally centralized network. Betweenness centralization [1] assess domination in terms of shortest-path control, while closeness centralization [18] relies on a notion of centralization related to geodesic proximity.

Other existing centralization measures include the degree (distribution) variance [1921], hub dominance [22], the Gini coefficient [23,24], hub formation tendency [15], spectral centralization [25,26], and entropy-based centralization [27]. Although these metrics have been used widely for measuring centralization, they differ in their mathematical foundations, normalizations, interpretability, and values. Some measures, such as degree variance or the Gini coefficient, provide statistical descriptions but lack explicit topological reference points. Other measures, such as hub dominance, are simple and interpretable but may not fully reflect nuanced patterns of centralization.

A star network has been described by numerous authors to represent the extreme centralization [1,16,28]. There are multiple existing measures that are aligned with such characterization of maximum centralization. However, without a formal set of requirements, any proposed metric could be labeled a centralization measure, making systematic comparison difficult. Palak and Nguyen have proposed a set of six axioms specifying minimal theoretical requirements for such measures, providing a principled framework for evaluation [16]. To the best of our knowledge, no other study has systematically assessed both the axiomatic validity and numerical behavior of centralization measures across different network types.

In this study, we conduct a comparative evaluation of 11 normalized centralization measures, including centrality-based metrics, statistical, and spectral measures. We take a two-pronged approach:

  1. Axiomatic assessment: We evaluate each measure against the six axioms of Palak and Nguyen [16], identifying their theoretical strengths and weaknesses based on their alignments with the axioms.
  2. Numerical assessment: We analyze the behavior of the measures across synthetic networks for which a desirable behavior is expected. This involves testing measures for expected patterns on star graphs, ring graphs, complete graphs, and their perturbed variants.

We integrate these results into a combined performance score, identifying the three most robust measures out of 11 measures. We then apply the three measures to a diverse set of real-world networks from neuroscience, biology, social systems, and ecology, illustrating their individual strengths and their complementarity. This unified framework clarifies the conceptual and practical differences among centralization measures, enabling researchers to make informed choices when analyzing network centralization in different contexts and disciplines.

Definitions of centralization measures

Centralization measures quantify the presence and intensity of hubs in networks. It is customary that measures are normalized to the range [0,1], where larger values indicate higher centralization. If an existing measure was not normalized already, we normalize it to ensure consistency and comparability across measures.

Let be a simple undirected graph with nodes and links and let di denote the degree of node .

Assortativity-Based Hubness (ABH) can be defined for graphs with n > 1 and m > 0 as:

where r(G) is the degree assortativity coefficient of graph G [29]. In star graphs, as n increases, while in regular graphs . This measure ranges from 0 (maximally assortative) to 1 (maximally disassortative). For n < 2 or m = 0, we define .

Eigenvector Centrality Dispersion (ECD) is a normalized measure of inequality in eigenvector centrality among nodes [30]. For graphs with n > 1, it is defined as:

where CE(i) is the L2-normalized eigenvector centrality of node i and is its mean across all nodes. The denominator represents the maximum possible value of the numerator over graphs with n nodes. The maximum value occurs in certain sparse, disconnected graphs with degree-1 nodes (pendant nodes). For n < 2, .

Normalized Betweenness Centralization (NBC) for graphs with n > 2, measures domination of central nodes in terms of shortest-path control [18]:

where CB(i) is the normalized betweenness centrality of node i and is its maximum value over all nodes of G. The denominator equals the maximum value of the numerator over all graphs with n nodes, which is realized in star graphs. For graphs with n < 3, we define .

Normalized Closeness Centralization (NCC) is defined for n > 2 based on the closeness centrality of nodes [1]:

where CC(i) is the normalized closeness centrality of node i and is its maximum over all nodes of G. The denominator equals the maximum value of the numerator over all graphs with n nodes, realized in star graphs. For n < 3, we define .

Normalized Degree Centralization (NDC), also known as Freeman centralization or Palak-Wojtkiewicz centralization [1,31], is defined for n > 2 as:

where is the maximum node degree and is the average node degree. The denominator corresponds to the maximum possible value of the numerator, realized in star graphs. For n < 3, .

Normalized Degree Entropy (NDE) is defined for graphs with n > 1, based on the Shannon entropy of the degree distribution [32]:

where nk is the number of nodes of degree k. The maximum occurs when all degrees are unique. For n < 2, .

Normalized Degree Variance (NDV) is defined for graphs with n > 2 [19,20]:

The denominator term is defined based on the observation that the maximum variance of degrees occurs in a star graph for n < 7 and in a two-hub graph for . For n < 3, .

Normalized Gini Coefficient (NGC) is defined for graphs with n > 2 [33]:

The denominator equals the maximum value of the numerator over graphs with n nodes, achieved in a one-edge graph with n–2 isolated nodes. For n < 3, .

Normalized Hub Dominance (NHD) is defined for n > 1 [22]:

For n < 2 or m = 0, . While the NHD is particularly simple, it is often used in the context of quantifying hubness for a community [22]. In that case, the measure is based on dividing the highest degree among nodes of a community by (n–1) for the subgraph induced by that community. In such a usage, the measure may take values larger than 1.

Normalized Hub Formation Tendency (NHT) is defined for n > 1 and m > 0 [15]:

The denominator equals the maximum value of the numerator over graphs with m edges, achieved in a star topology. For n < 2 or m = 0, .

Normalized Natural Connectivity (NNC) is defined for graphs with m > 0 [34]:

where are the adjacency matrix eigenvalues of graph G. For m = 0, we define .

Axiomatic framework for centralization measures

Let be a simple undirected graph, with nodes and edges. C(G) denotes the centralization value of graph G based on an arbitrary centralization measure. A saturated node is one with degree , meaning it is connected to all other nodes. Saturating a node refers to adding edges to a non-saturated node so that it gets connected to all other nodes. The six postulates of Palak and Nguyen (2021) [16] can be expressed as follows:

  • P1a: If , then .
  • P1b: If G is complete, then .
  • P1c: If , then .
  • P2: If G is a star graph, then .
  • P3: If G1 and G2 are isomorphic, then .
  • P4: If G has no node of degree , then C(G)<1.
  • P5: If G has at least one saturated node and at least one non-saturated node, then saturating a node of G to obtain graph Y will not increase the centralization in the resulting graph Y, i.e., .
  • P6: If G has no saturated node, then saturating a node of G to obtain graph Y will not decrease the centralization in the resulting graph Y, i.e., .

These six postulates are intuitive and are justified in Palak and Nguyen (2021) [16]. They mainly focus on graphs with a single highly central node (rather than multiple hubs), with the star graph serving as the extremal configuration. The three parts of P1 define minimal centralization and calibrate it to the value 0. P2 defines maximal centralization to be required for a start graph, and calibrates it to the value 1. A star graph has been consistently identified as the graph with the highest centralization by several researchers [1,16,28]. P3 follows directly from the equivalence of degree distributions in isomorphic graphs. P4 specifies maximal centralization to require the presence of a node with the maximum degree. P5 describes a centralization decrease principle: in a network that already has at least one saturated node, saturating another node cannot increase centralization because the existing saturated nodes become less unique and less dominant. P6 describes a centralization increase principle: in a network with no saturated nodes, creating the first saturated node cannot decrease centralization because it makes that node structurally unique and dominant. Further details on these postulates and the six theorems derived from them can be found in Palak and Nguyen (2021) [16].

Axiomatic assessment of centralization measures

We use the six postulates of Palak and Nguyen (2021) [16] as axioms to assess the suitability of each of the 11 centralization measures in our work. Table 1 summarizes the results of these assessments. Proof ideas and counterexamples for our axiomatic assessments are provided in a narrative form as follows.

thumbnail
Table 1. Compliance of centralization measures with the six axioms proposed by Palak and Nguyen (2021) for evaluating network centralization.

A check mark (✓) indicates that the measure satisfies the corresponding axiom, whereas a cross (✓) indicates a violation. The final column reports the total number of satisfied axioms (maximum = 6). Abbreviations: ABH, Assortativity-Based Hubness; ECD, Eigenvector Centrality Dispersion; NBC, Normalized Betweenness Centralization; NCC, Normalized Closeness Centralization; NDC, Normalized Degree Centralization; NDE, Normalized Degree Entropy; NDV, Normalized Degree Variance; NGC, Normalized Gini Coefficient; NHD, Normalized Hub Dominance; NHT, Normalized Hub Formation Tendency; NNC, Normalized Natural Connectivity.

https://doi.org/10.1371/journal.pone.0341630.t001

ABH satisfies P1 because it behaves as expected for graphs related to P1. It satisfies P2 because the degree assortativity of a star graph is always –1. Since degree sequences and edge connections are preserved under isomorphism, ABH satisfies P3. It violates P4 because of counterexamples, including a graph with four nodes and two edges incident on the same node, which yields ABH = 1 despite having no saturated node. The graph with edge list [(0,1),(0,2),(0,3),(0,4),(2,3),(3,4)] shows that ABH violates P5. The five-node graph with two edges incident on the same node shows that ABH violates P6.

ECD satisfies P1 (returns 0 for all graphs considered in P1) but violates P2 because star graphs do not realize the maximum standard deviation of eigenvector centrality. Since eigenvector centralities are preserved under isomorphism, ECD satisfies P3. It violates P4 because it can take value 1 in graphs without saturated nodes. ECD also violates P5, as demonstrated by the graph with the edge list [(0,3),(0,4),(0,2),(1,3),(1,2),(1,4),(2,3),(3,4)]. It violates P6 as shown by the 5-node graph with the edge list [(0,1),(1,2),(2,3)].

NBC is claimed by Palak and Nguyen (2021) [16] to satisfy all six postulates. However, there is one error in that claim: the graph with edge list [(0,1),(0,3),(0,4),(0,5),(1,2)] serves as a counterexample with respect to P6. For this graph, NBC = 0.82. Saturating node 3 reduces NBC to 0.36, thereby violating P6.

NCC satisfies P1 (returns 0 for all P1 cases) and P2 (returns 1 for star graphs). Since closeness centralities are preserved under isomorphism, it also satisfies P3. Moreover, it satisfies P4 because in graphs without saturated nodes, the numerator is strictly smaller than the denominator. NCC satisfies P5, as saturating a node monotonically decreases its value. However, the 5-node graph with edge list [(0,1),(0,2),(0,3)] shows that NCC violates P6.

NDC is claimed in Theorem 5 of Palak and Nguyen (2021) [16] to satisfy all six postulates. However, there is one error in that claim: the graph with edge list [(0,1),(0,3),(0,4),(0,5),(1,2)] has NDC = 0.7. Saturating node 3 yields new edges [(1,3),(2,3),(3,4),(3,5)] and reduces NDC to 0.6, showing that it violates P6.

NDE satisfies P1 but violates P2, returning particularly low values for large star graphs. It satisfies P3 because degree sequences are preserved under isomorphism. It satisfies P4 because the theoretical maximum of Shannon entropy cannot be reached in simple graphs without self-loops. NDE violates P5, as shown by the graph with edge list [(0,3),(0,2),(1,3),(1,4),(2,3),(3,4)]. The 5-node graph that has one edge also shows that NDE violates P6.

NDV satisfies P1 but violates P2 because its normalization does not depend on star graphs. It satisfies P3 due to invariance over isomorphism. It also satisfies P4 because the maximum degree variance occurs in star graphs for n < 7 and in two-hub graphs for , both with at least one saturated node. NDV violates P5 as shown by the 7-node, 7-edge graph that is made up of a star structure with one extra edge. It violates P6, as shown by the graph with edge list [(0,1),(0,2),(1,3),(1,4)].

NGC satisfies P1 but violates P2, taking value 0.5 for star graphs. It satisfies P3, as Gini coefficients are invariant under isomorphism. It violates P4, as it reaches 1 for one-edge graphs with isolated nodes. It also violates P5 as shown by the graph whose degree sequence is . The 5-node graph that has one edge shows that NGC also violates P6.

NHD violates P1b, returning value 1 for complete graphs. It satisfies P2. The isomorphism invariance makes it satisfy P3. It satisfies P4, because graphs without saturated nodes always yield a numerator smaller than the denominator. It satisfies P5 because it returns value 1 for any graph with at least one saturated node. For the same reason, it also satisfies P6.

NHT violates P1b because it does not return 0 for complete graphs. It satisfies P2 and P3. It satisfies P4 because the only graphs where it returns 1 are star graphs and 3-node cycles of saturated nodes. It violates P5 as shown by the graph whose edge list is [(0,1),(0,3),(0,2),(1,2),(1,3),(1,4),(2,4),(3,4)]. The 5-node graph that has one edge shows that NHT also violates P6.

NNC satisfies P1 but violates P2, because the natural connectivity of star graphs is not exactly zero. NNC satisfies P3 because natural connectivity is isomorphism invariant. It satisfies P4 because it returns values below 1 when a saturated node is not present. It satisfies P5 because saturating a node increases natural connectivity and therefore decreases NNC. For the same reason, it violates P6.

Numerical assessment of centralization measures

To illustrate and compare the behavior of centralization measures under different network structures as n grows, we analyze synthetic networks for which a desirable behavior is expected. These networks include star graphs, ring graphs, and complete graphs, along with their perturbed variants (same graph but with one edge rewired or removed). Fig 1 presents the values of the 11 centralization measures for one network type in each panel as n grows. Table 2 summarizes whether the measures returns values consistent with the expected behavior.

thumbnail
Fig 1. Centralization values computed using various measures across different network sizes for three graph configurations: star, ring, and complete graphs (top row), and their perturbed variants with a single edge rewired or removed (bottom row).

Each plot shows how each measure responds to changes in network order. Measures that yield constant values of 0 or 1 are not shown and are instead annotated in subplots. Colors and point styles indicate different centralization measures. Abbreviations: ABH, Assortativity-Based Hubness; ECD, Eigenvector Centrality Dispersion; NBC, Normalized Betweenness Centralization; NCC, Normalized Closeness Centralization; NDC, Normalized Degree Centralization; NDE, Normalized Degree Entropy; NDV, Normalized Degree Variance; NGC, Normalized Gini Coefficient; NHD, Normalized Hub Dominance; NHT, Normalized Hub Formation Tendency; NNC, Normalized Natural Connectivity.

https://doi.org/10.1371/journal.pone.0341630.g001

thumbnail
Table 2. Performance of centralization measures across canonical graph topologies and their perturbed variants. Measures were evaluated on star, ring, and complete graphs, as well as perturbed versions with a single edge rewired or removed.

A check mark (–) indicates expected behavior for the given topology, whereas a cross (‗) indicates deviation from expectations. The final column reports the number of topologies for which each measure performed as expected (maximum = 6). Abbreviations: ABH, Assortativity-Based Hubness; ECD, Eigenvector Centrality Dispersion; NBC, Normalized Betweenness Centralization; NCC, Normalized Closeness Centralization; NDC, Normalized Degree Centralization; NDE, Normalized Degree Entropy; NDV, Normalized Degree Variance; NGC, Normalized Gini Coefficient; NHD, Normalized Hub Dominance; NHT, Normalized Hub Formation Tendency; NNC, Normalized Natural Connectivity.

https://doi.org/10.1371/journal.pone.0341630.t002

The star graph is expected to get values reflecting maximal centralization, consistent with the common assumptions in the literature [1,16,28]. In the top left panel of Fig 1, the six measures, ABH, NBC, NCC, NDC, NHD, and NHT, correctly return the value of 1, reflecting maximal centralization regardless of the network order. ECD and NNC also return high values as network order increases but do not reach 1. Conversely, NDV and NGC produce values distinctly below 1. NDE shows values substantially below 1 for small network sizes and unexpectedly trending toward zero as network order increases, indicating its crucial weakness in quantifying maximal centralization.

In the top center panel of Fig 1, we move on to the ring graph where all nodes have identical degrees and centralities. In this case, we expect a centralization value of zero due to the lack of a distinct or dominant node. This expectation is confirmed by eight measures, ABH, ECD, NBC, NCC, NDC, NDE, NDV, and NGC, all returning zero for all ring networks. However, NHD and NHT show a gradual decay toward zero as network order increases. NNC behaves unintuitively, failing to return zero and instead increasing to large values as network order grows, indicating its unsuitability from this perspective.

The complete graph represents maximal connectivity among structurally identical nodes. Therefore, centralization measures are expected to return a value of 0 for complete graphs in the top right panel of Fig 1. This is indeed the case for nine measures, ABH, ECD, NBC, NCC, NDC, NDE, NDV, NGC, and NNC, across all network orders, reaffirming their validity under axiom P1b. NHT exhibits values with a decreasing trend toward zero as complete graphs grow. NHD remains fixed at 1, highlighting a critical limitation in its ability to reflect an intuitive value for complete graphs.

Rewiring a single edge in the star graph slightly disrupts the hub configuration, but this alteration is expected to become increasingly negligible as the network order grows. The seven measures, ABH, NBC, NCC, NDC, NHD, NHT, and NNC, reflect this expected trend in the bottom left panel of Fig 1. In contrast, NDV and NGC consistently remain far from 1 and are largely insensitive to the rewiring and its interaction with network order. NDE incorrectly shows a steady decline with increasing network order, and ECD shows an increasing trend that does not seem to converge to 1, indicating a poor response to such a perturbation.

In the perturbed ring graph, rewiring one edge introduces a slight increase in centralization, which is expected to diminish as the network order increases. All measures except ABH, ECD, and NNC capture these subtle changes in the bottom center panel of Fig 1. They correctly reflect the existence of nodes that are slightly more hub-like than others (against an otherwise random-regular structure without any distinct nodes). ABH, ECD, and NNC incorrectly exhibit a constant non-zero value or an increasing trend toward 1, revealing their limitations in accurately representing intuitive values for centralization in the rewired ring graphs.

Removing a single edge from the complete graph introduces minimal asymmetry, as the graph remains nearly fully connected. The bottom right panel of Fig 1 shows that despite the subtlety of this perturbation, measures such as ABH and NHD fail to approach zero. They are insensitive to slight deviations from complete connectivity. Other measures, however, show an appropriate decline toward zero, indicating a suitable capacity to reflect the centralization of such perturbed complete graphs.

In summary, across all the graph types and perturbations considered, NBC, NCC, and NDC demonstrate the most robust comparative performance, consistently reaching expected boundary values and responding smoothly and intuitively to structural changes. Other measures show expected behavior in some aspects but behave inconsistently in others. Table 2 summarizes how each measure behaves for the six network types as n increases.

Overall evaluation of centralization measures

To combine the axiomatic and numerical assessments, we define a simple additive overall performance score as a weighted sum of the Axiomatic Score SA (number of axioms satisfied, maximum = 6) and the Numerical Score SN (number of numerical simulations passed, maximum = 6):

(1)

where wA and wN are the weights assigned to the axiomatic score and numerical score respectively. The two weights add up to 1 wA  +  wN = 1. Different weightings can be chosen based on the researcher’s priorities, possibly putting different emphasis on theoretical validity in foundational work or on numerical robustness in applied analyses.

In our assessment study, we set , assigning equal importance to both criteria. Under this weighting, the measures NBC, NCC, and NDC emerge as the strongest overall performers (aggregate score of 5.5 each), achieving near-perfect performance and demonstrating both strong axiomatic compliance and robust numerical behavior.

NHD (scoring 4) performs well on the axiomatic criteria but shows certain weaknesses in numerical behavior, particularly in regular graphs. ABH, NDE, NDV, NHT, and NNC (each scoring 3.5), as well as NGC (scoring 3), display some elements of intuitiveness, showing partial numerical consistency but weak theoretical alignment with the axioms. At the lower end, ECD (scoring 2.5) performs poorly in both domains, indicating substantial limitations in capturing centralization consistently.

This simple evaluation framework provides a flexible and transparent decision-making tool for selecting centralization measures that align with the priorities of a given study, whether focused on theoretical validity, numerical robustness, or a balanced combination of both.

Application to real-world networks

Now that the existing measures are refined down to three measures, we can demonstrate their practical utility. We computed NBC, NCC, and NDC for a diverse set of real-world networks spanning different domains (Table 3). Given that NBC and NCC rely on path-based distances, they cannot be used in disconnected graphs. Therefore, we use the largest connected component of multi-component real networks in this section to demonstrate the utility of all three measures NBC, NCC, and NDC.

thumbnail
Table 3. Normalized Betweenness Centralization (NBC), Normalized Closeness Centralization (NCC), and Normalized Degree Centralization (NDC) for diverse real-world networks.

For each network, the category, description, number of nodes (n), and number of links (m), as well as minimum, average, and maximum degrees () are provided. For multi-component networks, values are computed for the largest connected component of the network.

https://doi.org/10.1371/journal.pone.0341630.t003

These networks vary substantially in size, density, and structural characteristics, providing a robust demonstration of each measure’s behavior beyond synthetic benchmarks. For networks Condensed Matter Collaboration and Yeast Protein Interactions, all three measures yield relatively low values (, , ), reflecting the absence of a single dominant hub and a relatively even distribution of connectivity. These networks are relatively large and sparse. In contrast, networks with more hierarchical structures, such as the Food Web (Michigan) and Zachary’s Karate Club, exhibit substantially higher centralization: NBC (0.196/0.405), NCC (0.701/0.298), and NDC (0.642/0.399). These networks are smaller and more dense. The values indicates the presence of central nodes and their relative extent of dominance.

The relatively lower NBC value in the Food Web (Michigan) suggests that different centralization measures are not necessarily associated and instead capture distinct properties. Interestingly, these three measures often diverge in their sensitivity to network structures, highlighting that they capture complementary aspects of centralization. For example, the Facebook Ego Network shows a markedly higher NBC value (0.480) compared to NDC (0.248), with NCC lying in between (0.367). This suggests that path-based centralization in this network dominate over degree-based centralization, while closeness captures an intermediate notion of reachability in this specific social media network. Conversely, the Brain Network (HCP) shows moderate NBC (0.062) but relatively high NCC (0.288) and NDC (0.291), consistent with a densely connected hub structure in functional brain connectivity [14,15].

These results highlight that while NBC, NCC, and NDC often agree in identifying low- or high-centralization networks, their differing definitions capture potentionally complementary aspects of centralization. NBC reflects path-based control, NCC emphasizes reachability and distance efficiency, and NDC quantifies degree-based hub concentration. When applied together, they can reflect a richer and more reliable picture of network centralization from different aspects compared to what any single measure can reflect alone.

We also examined a temporal friendship network among high-school students, observed across four survey waves during one academic year [43]. The values of NBC, NCC, and NDC fluctuated over time, revealing dynamic shifts in centralization. In Month 1, NBC (0.22), NCC (0.25), and NDC (0.22) were low, reflecting a relatively decentralized and fragmented structure without clear hubs. By Month 2, NBC (0.52), NCC (0.85), and NDC (0.75) peaked, indicating the emergence of a dominant hub both in terms of control over network paths, geodesic proximity, and direct connectivity. In Month 3, NBC dropped sharply (0.10), while both NCC (0.66) and NDC (0.54) remained moderately high, suggesting a transition toward well-connected nodes that preserved degree- and closeness-based centralization without a single dominant path bottleneck. Finally, in Month 4, NBC stabilized at an intermediate level (0.36), NCC rose again (0.82), and NDC increased (0.71), highlighting the persistence of degree- and closeness-based hub concentration alongside more distributed path-based centralization. Taken together, these results show that tracking the three measures reveals a multi-faceted profile of centralization for each snapshot of a network evolving over time (Fig 2).

thumbnail
Fig 2. Temporal snapshots of a high-school friendship network across four survey waves.

Nodes represent individuals, and links denote reciprocated friendship ties. Node size is proportional to degree, and node color intensity reflects betweenness centrality (darker red = higher values), while node border color reflects closeness centrality (darker blue = higher values). Normalized Betweenness Centralization (NBC), Normalized Closeness Centralization (NCC), and Normalized Degree Centralization (NDC) values are reported below each panel.

https://doi.org/10.1371/journal.pone.0341630.g002

All simulations, network analyses, and visualizations were conducted using R, utilizing several packages including igraph [44] for network modeling and ggplot2 [45] for visualization. Code to replicate the results and extend the research is available at: https://github.com/majidsaberi/Centralization.

Discussion

Summary of key findings

This study reviewed and normalized 11 centralization measures from existing approaches and provided an evaluation of them using axioms and numerical simulations. The evaluation combined axioms based on the six postulates of Palak and Nguyen (2021) [16], with numerical assessments on synthetic networks exhibiting extreme centralization behaviors. This dual evaluation revealed that Normalized Betweenness Centralization (NBC), Normalized Closeness Centralization (NCC), and Normalized Degree Centralization (NDC) consistently behaved better than other measures. These three metrics not only satisfied most axioms but also demonstrated stable behavior consistent with the expected behavior across different network types.

Application of these best-performing measures to a diverse set of real-world networks revealed new insights into the complementary nature of the three measures. In several cases, NBC, NCC, and NDC aligned closely in identifying highly centralized systems but from three distinct perspectives. However, in networks such as the Facebook Ego Network and Brain Network, the values of the three measures diverged substantially, reflecting their distinct underlying definitions. NBC highlights path-based dominance, NCC captures closeness-based accessibility, and NDC emphasizes degree-based hub concentration. Furthermore, analysis of a temporal friendship network showed that NBC, NCC, and NDC capture different dynamic expressions of centralization over time.

These results confirm that no single measure fully captures the multi-faceted notion of centralization that appears from a joint usage of selective suitable measures. While our results point to three measures, we argue that the narrower choice of measures can be guided by the definition of centralization and dominance (path-based, closeness-based, or degree-based) that is most relevant to the research question.

Methodological implications

Our results highlight the benefits of assessing centralization measures using a framework that integrates axiomatic validation and numerical performance testing. The axiomatic framework of Palak and Nguyen (2021) [16] provides a theoretical baseline, ensuring that a measure behaves consistently under well-defined structural conditions such as maximal centralization, minimal centralization, and changes in centralization. Notably, some measures satisfied several axioms yet displayed counterintuitive trends on synthetic networks. Therefore, axiomatic compliance can be enhanced by numerical simulations.

Our assessment enables researchers to identify metrics that are both conceptually valid and operationally reliable, while also allowing flexibility to diverge from our specific choices to better satisfy the constraints of their study. Our approach is fairly general and can be adapted to other network contexts, including weighted, directed, or temporal networks, where similar concerns about measure validity and stability are likely to arise.

Choosing between NBC, NCC, and NDC based on network context

Our findings indicate that among the 11 existing measures, NBC, NCC, and NDC are arguably more robust choices for quantifying centralization in undirected unweighted networks, and they capture distinct aspects of centralization.

NBC is particularly suitable for studying networked systems where control over network paths or flow bottlenecks is of primary importance. These systems may include transport networks, communication infrastructures, or networks of organizational information flow. NCC, by contrast, is especially relevant in contexts where accessibility and efficiency of reach are central concerns, such as emergency response networks, social contact networks, or information dissemination systems, where certain nodes minimize average distances to others. NDC is most appropriate for contexts where the concentration of direct connectivity (degree) is a defining factor, such as biological hub proteins, brain network hubs, or models of critical junctions in infrastructure networks.

The interpretation of centralization in empirical settings benefits from the complementary deployment of multiple well-behaved measures (possibly NBC, NCC, and NDC). This is evident in the Facebook Ego Network, where NBC yielded a markedly higher score than NDC. This suggests that while no single node dominated in degree, the structure allows several nodes to exert strong influence over communication flow. Meanwhile, NCC provided an additional perspective on how central nodes reduced distances and improved accessibility across the network. Conversely, in the Brain Network (HCP), NCC and NDC were substantially higher than NBC. This reflected dense connectivity around key hub regions and the efficiency of those hubs in reaching other regions, without equivalent dominance in terms of path-based control. These contrasts illustrate how NBC, NCC, and NDC can together provide complementary insights into the subtle, nuanced, and multi-faceted notion of network centralization.

Beyond static networks, the complementary use of NBC, NCC, and NDC also extends to temporal settings, where they can highlight changes to a dynamic profile of centralization over time.

Extending centralization to weighted, directed, and sparse networks

Many real-world networks are inherently weighted (e.g., transportation intensity, communication frequency, and synaptic strength) or directed (e.g., citation network, neuronal signaling, and trade relations). Therefore, extending centralization measures beyond undirected and unweighted graphs is an essential next step toward a more general and realistic framework. Although substantial research has focused on extending node-level centrality metrics to weighted and directed cases [7], global centralization measures have received far less systematic attention or formal generalization. Extending our axiomatic–numerical framework to weighted and directed networks presents both conceptual and computational challenges:

First, directionality introduces asymmetry, where incoming and outgoing influences can differ between nodes. In this context, all three measures require distinct adjustments. NDC can be divided into in-degree and out-degree centralization, each normalized against the theoretical maximum for a directed star graph (one node receiving from or sending to all others). For NBC, directionality alters shortest-path dependencies. So, paths must be computed with directional constraints, capturing asymmetries in information or flow control. Likewise, NCC in directed networks must account for reachability asymmetry, often necessitating separate in- and out-closeness variants to capture accessibility versus broadcasting efficiency.

Second, weights complicate the notion of maximal centralization because dominance can now arise either through the number of links or through their strength. Extending NDC involves computing degree strength (sum of edge weights) rather than simple counts, normalized against the strongest possible star configuration. For NBC, shortest paths should be computed using weighted distances (e.g., inverse weights) while maintaining normalization relative to a weighted-star reference. NCC can similarly treat weights as inverse distances, thus reflecting the efficiency of strong versus weak ties.

Third, a fundamental theoretical issue concerns the definition of a reference configuration for normalization. In unweighted networks, the star graph uniquely represents maximal centralization. In weighted or directed systems, however, multiple “weighted-star” or “asymmetric-star” structures could yield the same level of dominance depending on how weights are distributed. This ambiguity highlights the need to redefine reference models that preserve both total strength and connectivity constraints, ensuring that normalization remains scale-consistent.

The same axiomatic approach deployed in this study can guide generalizations to weighted and directed networks. Each extended measure should be examined for compliance with the six postulates under asymmetric and weighted conditions. This will reveal whether measures such as NBC, NCC, and NDC retain desirable behaviors when the underlying assumptions of symmetry and binary connectivity are relaxed.

Beyond directionality and weighting, another dimension of generalizability concerns sensitivity in sparse networks. Many empirical systems, especially social, biological, and infrastructural networks, are sparsely connected and subject to small perturbations. Sparse networks tend to amplify centralization effects: the removal or addition of a single hub connection can disproportionately shift global measures. Among the three selected metrics, NDC is generally more sensitive to small link changes because it depends directly on degree distribution, while NBC exhibits moderate sensitivity due to path recalculation in shortest routes. NCC tends to respond more smoothly, as changes in a few links have limited influence on global accessibility unless they occur near key bottlenecks. Quantifying such perturbation sensitivity could be formalized by measuring stability under random edge perturbations, temporal rewiring, or bootstrapped network re-sampling.

A further computational and interpretive challenge arises in multi-component networks, which are common in sparse real-world systems. For such networks, path-based centralization measures like NCC and NBC cannot be directly computed for the entire graph because path lengths between disconnected components are undefined. This issue can distort normalization and lead to misleadingly low or undefined centralization scores. Future extensions of the framework could explore component-weighted aggregation schemes, penalized path-length approximations, or effective distance formulations that preserve interpretability while enabling valid comparison across disconnected networks.

Limitations and future directions

While this study provided a comparative evaluation of 11 normalized centralization measures, several limitations should be acknowledged. First, our analyses focused exclusively on undirected and unweighted and connected networks. The present framework requires major adjustments before being applicable to weighted or directed networks as discussed earlier.

Second, our numerical assessment was inherently binary, each measure was considered to either satisfy or fail a given numerical assessment. In practice, violations may vary in magnitude, and a more nuanced scoring system could provide finer-grained insight into the partial compliance that a measure demonstrates for a given graph type.

Third, although NBC and NCC were identified alongside NDC as relatively strong measures, both rely on distance-based properties of networks. This makes their interpretation problematic in disconnected or multi-component graphs, since shortest paths between components are undefined. In this study, we addressed this challenge by computing NBC and NCC only on the largest connected component of each real-world network. While this ensures comparability and interpretability, it may exclude hub-like structures that exist in smaller disconnected components. Future work should therefore examine the robustness of these measures under broader conditions, including sparse or fragmented networks, or by exploring alternative path-based formulations that account for multi-component networks.

Fourth, as discussed, sparse and near-sparse networks amplify centralization effects: small perturbations, such as the removal or addition of a single hub link, can substantially alter global measures. Systematic sensitivity analyses, for instance, through random edge perturbations or temporal snapshots, could help quantify each measure’s stability under minimal structural change.

Fifth, our numerical assessments were based on three simple random graphs and one perturbed variation of each. While this made the expected behavior more straightforward, future research can investigate the behavior of the centralization measures on more realistic graph families, as well as graphs with multiple hubs of similar degree as a richer stress test for the measures.

Finally, although this study focused on theoretical validity and empirical stability, computational efficiency was not the primary objective in our evaluation. For extremely large-scale networks or time-constrained analyses, algorithmic efficiency may become an important consideration in narrowing down the selection of measures among the well-behaved measures that we have identified.

Conclusion

Our results show that NBC, NCC, and NDC emerged as the three most robust metrics for quantifying network centralization among 11 existing approaches. NBC effectively captures path-based dominance, highlighting the influence of nodes that control communication flow. NCC captures accessibility-based dominance, reflecting the ability of certain nodes to minimize distances and improve efficiency of reach across the network. NDC quantifies degree-based hub concentration and the dominance of nodes with exceptional connectivity. When applied jointly, these three measures provide complementary insights into centralization compared to what any single measure can capture alone.

Our dual evaluation framework benefited from combining theoretical soundness with empirical robustness in metric selection. This framework clarifies conceptual distinctions among measures and assists researchers in choosing appropriate centralization metrics. Our findings encourage a shift away from reliance on single, conventional measures toward evaluating centralization as a nuanced multi-faceted notion, ultimately improving the our understanding of a topological feature that emerges in networks across different fields.

Acknowledgments

The authors thank the anonymous reviewers and Rafał Palak for their helpful comments.

References

  1. 1. Freeman LC. Centrality in social networks conceptual clarification. Social Networks. 1978;1(3):215–39.
  2. 2. Borgatti SP. Centrality and network flow. Social Networks. 2005;27(1):55–71.
  3. 3. Wasserman S. Social network analysis: methods and applications. Cambridge University Press; 1994.
  4. 4. Newman MEJ. Networks: an introduction. Oxford University Press; 2010.
  5. 5. Newman MEJ. The structure and function of complex networks. SIAM Rev. 2003;45(2):167–256.
  6. 6. Barabási AL. Network science. Cambridge University Press; 2016.
  7. 7. Boccaletti S, Latora V, Moreno Y, Chavez M, Hwang D. Complex networks: structure and dynamics. Physics Reports. 2006;424(4–5):175–308.
  8. 8. Wang Q, Miao F, Tayi GK, Xie E. What makes online content viral? The contingent effects of hub users versus non–hub users on social media platforms. J Acad Mark Sci. 2019;47(6):1005–26.
  9. 9. Fan C, Jiang Y, Yang Y, Zhang C, Mostafavi A. Crowd or hubs: information diffusion patterns in online social networks in disasters. International Journal of Disaster Risk Reduction. 2020;46:101498.
  10. 10. Tsai C-J, Ma B, Nussinov R. Protein-protein interaction networks: how can a hub protein bind so many different partners?. Trends Biochem Sci. 2009;34(12):594–600. pmid:19837592
  11. 11. Vallabhajosyula RR, Chakravarti D, Lutfeali S, Ray A, Raval A. Identifying hubs in protein interaction networks. PLoS One. 2009;4(4):e5344. pmid:19399170
  12. 12. Bombelli A, Santos BF, Tavasszy L. Analysis of the air cargo transport network using a complex network theory perspective. Transportation Research Part E: Logistics and Transportation Review. 2020;138:101959.
  13. 13. Rodríguez-Déniz H, Suau-Sanchez P, Voltes-Dorta A. Classifying airports according to their hub dimensions: an application to the US domestic network. Journal of Transport Geography. 2013;33:188–95.
  14. 14. van den Heuvel MP, Sporns O. Network hubs in the human brain. Trends Cogn Sci. 2013;17(12):683–96. pmid:24231140
  15. 15. Saberi M, Khosrowabadi R, Khatibi A, Misic B, Jafari G. Topological impact of negative links on the stability of resting-state brain network. Sci Rep. 2021;11(1):2176. pmid:33500525
  16. 16. Palak R, Nguyen SV. Centralization measures for social networks. Cybernetics and Systems. 2021;52(5):398–418.
  17. 17. Krnc M, Škrekovski R. Group degree centrality and centralization in networks. Mathematics. 2020;8(10):1810.
  18. 18. Freeman LC. A set of measures of centrality based on betweenness. Sociometry. 1977;40(1):35.
  19. 19. Snijders TAB. The degree variance: an index of graph heterogeneity. Social Networks. 1981;3(3):163–74.
  20. 20. Smith KM, Escudero J. Normalised degree variance. Appl Netw Sci. 2020;5(1):32. pmid:32626822
  21. 21. Estrada E. Quantifying network heterogeneity. Phys Rev E Stat Nonlin Soft Matter Phys. 2010;82(6 Pt 2):066102. pmid:21230700
  22. 22. Lancichinetti A, Kivelä M, Saramäki J, Fortunato S. Characterizing the community structure of complex networks. PLoS One. 2010;5(8):e11976. pmid:20711338
  23. 23. van Mierlo T, Hyatt D, Ching AT. Employing the Gini coefficient to measure participation inequality in treatment-focused Digital Health Social Networks. Netw Model Anal Health Inform Bioinform. 2016;5(1):32. pmid:27840788
  24. 24. Atkinson AB, Bourguignon F. Handbook of income distribution. Elsevier; 2014.
  25. 25. Estrada E, Higham DJ. Network properties revealed through matrix functions. SIAM Rev. 2010;52(4):696–714.
  26. 26. Negre CFA, Morzan UN, Hendrickson HP, Pal R, Lisi GP, Loria JP, et al. Eigenvector centrality for characterization of protein allosteric pathways. Proc Natl Acad Sci U S A. 2018;115(52):E12201–8. pmid:30530700
  27. 27. Tutzauer F, Elbirt B. Entropy-based centralization and its sampling distribution in directed communication networks. Communication Monographs. 2009;76(3):351–75.
  28. 28. Butts CT. Exact bounds for degree centralization. Social Networks. 2006;28(4):283–96.
  29. 29. Noldus R, Van Mieghem P. Assortativity in complex networks. J Complex Netw. 2015;3(4):507–42.
  30. 30. Hashem SQ, Giudici P. Systemic risk of conventional and islamic banks: comparison with graphical network models. AM. 2016;07(17):2079–96.
  31. 31. Palak R, Wojtkiewicz K. A centralization measure for social networks assessment. Cybernetics and Systems. 2023;55(3):654–67.
  32. 32. Cao S, Dehmer M, Shi Y. Extremality of degree-based graph entropies. Information Sciences. 2014;278:22–33.
  33. 33. Hu H-B, Wang L. The GINI coefficient’s application to general complex networks. Advs Complex Syst. 2005;08(01):159–67.
  34. 34. Jun W, Barahona M, Yue-Jin T, Hong-Zhong D. Natural connectivity of complex networks. Chinese Phys Lett. 2010;27(7):078902.
  35. 35. Van Essen DC, Smith SM, Barch DM, Behrens TEJ, Yacoub E, Ugurbil K, et al. The WU-Minn human connectome project: an overview. Neuroimage. 2013;80:62–79. pmid:23684880
  36. 36. Leskovec J, Kleinberg J, Faloutsos C. Graph evolution: densification and shrinking diameters. ACM Transactions on Knowledge Discovery from Data. 2007;1(1):2.
  37. 37. Leskovec J, Lang KJ, Dasgupta A, Mahoney MW. Community structure in large networks: natural cluster sizes and the absence of large well-defined clusters. Internet Mathematics. 2009;6(1):29–123.
  38. 38. Klimt B, Yang Y. Introducing the Enron corpus. In: CEAS; 2004. p. 92–6.
  39. 39. Leskovec J, McAuley J. Learning to discover social circles in ego networks. Advances in Neural Information Processing Systems. 2012;25:1–9.
  40. 40. McAuley JJ, Leskovec J. From amateurs to connoisseurs: modeling the evolution of user expertise through online reviews. In: Proceedings of the 22nd International Conference on World Wide Web; 2013. p. 897–908.
  41. 41. von Mering C, Krause R, Snel B, Cornell M, Oliver SG, Fields S, et al. Comparative assessment of large-scale data sets of protein-protein interactions. Nature. 2002;417(6887):399–403. pmid:12000970
  42. 42. Zachary WW. An information flow model for conflict and fission in small groups. Journal of Anthropological Research. 1977;33(4):452–73.
  43. 43. Snijders TAB, van de Bunt GG, Steglich CEG. Introduction to stochastic actor-based models for network dynamics. Social Networks. 2010;32(1):44–60.
  44. 44. Csardi G, Nepusz T. The igraph software package for complex network research. InterJournal Complex Systems. 2006;1695.
  45. 45. Wickham H. ggplot2: elegant graphics for data analysis. Springer; 2016.