Skip to main content
Advertisement
  • Loading metrics

Centrality in complex networks under incomplete data

Abstract

The concept of centrality is one of the essential tools for analyzing complex systems. Over the years, a large number of centrality indices have been proposed that account for different aspects of a network. Unfortunately, most real networks are substantially incomplete, which affects the results of the centrality measures. This article aims to evaluate the sensitivity of 16 centrality measures to the presence of errors or incomplete information about the structure of a complex network. Our experiments are performed across 113 empirical networks. As a result, we identify centrality indices that are highly vulnerable to incomplete data.

Author summary

The robustness of centrality measures is a fundamental problem for the correct identification of important nodes in many real networks, which are partially observed in most cases. Existing studies do not fully address this issue because they are usually limited to a small number of both centrality measures and graphs, while the graph perturbations are performed at random. Our work investigates the robustness of 16 centrality measures by analyzing the variation in the relative ranking of nodes under a set of appropriately defined network perturbations. To draw meaningful and robust conclusions about the average sensitivity of a specific centrality measure, we perform our experiments on a large set of networks. Our findings demonstrate that certain centrality measures may be misinterpreted or misapplied when used on specific classes of networks, while the results of these measures require a cautious interpretation in the presence of missing or incorrect data.

Introduction

Many complex systems, including social, infrastructural, biological and economic systems, can be described as networks with nodes representing the entities and links representing interactions between them. A fundamental challenge in studying these networks is to understand the impact (or importance) of each node. The notion of importance can be defined in different ways depending on the nature of a network or features that a researcher wants to consider while ranking nodes. Therefore, many centrality measures have been introduced in the literature. The total number of proposed indices over the graph history is overwhelming [1], ranging from classical centralities [2] to measures that take into account specific features of a network [35]. These measures have shown great value in understanding many complex networks such as citation networks, computer networks and biological networks. In general, the choice of the most appropriate centrality measure depends on the type of network and the interpretation of important elements.

Unfortunately, most real networks are substantially incomplete or inaccurate due to the high cost of network measurements, data collection errors, the highly dynamic nature of networks, or data privacy issues. For instance, Ficara et al. [6] examine criminal networks that suffer from data incompleteness (due to the nature of the network), data incorrectness (unintentional data collection errors and intentional deception by criminals) and data inconsistency (misleading information from different sources). Aleskerov et al. [7] analyze the banking foreign claims network, which covers about 94% of total foreign claims as some countries do not report their statistics. Meshcheryakova [8] investigates the asymmetry in trade networks as many countries report their own versions of a trade flow between them due to different commodity classification systems, different costs calculations (including/excluding transportation and insurance costs) or a time delay. Hence, some centrality measures, which are very sensitive to small changes in the graph structure, can be misused and lead to the wrong interpretation of important within these networks.

The effects of missing or incorrect data on centrality measures have been extensively studied in the literature. Most studies examine the sensitivity of centrality in artificial networks such as Erdős–Rényi (ER) random graphs, Barabási–Albert (scale-free) graphs, Watts–Strogatz (small-world) graphs and other classical graph structures [915]. These studies are mainly limited to the sensitivity analysis of only 4-6 classical centrality measures to a small number of structural changes (link/node removal or addition) in a graph. Furthermore, all the perturbations in the structure of a network are performed at random, which might be meaningless for real-world networks.

Some studies examine the robustness of centrality measures in real-world networks. Bolland [16] examines the performance of 4 classical centrality measures under conditions of random variations in Chillicothe data. Herland et al. [17] consider 3 classical centrality measures and their robustness to random changes in 4 real networks. Niu et al. [18] evaluate the stability of 5 centrality measures on 9 real datasets towards random link addition/removal/rewiring and have evaluated the Spearman correlation between centrality rankings. Segarra and Ribeiro [11] investigate the effect of random changes on the air traffic network and the network of interactions between sectors of the US economy. Meshcheryakova and Shvydun [19] examine 13 centrality measures in criminal and food trade networks. Costenbader and Valente [20] determine how random sampling from 59 empirical directed networks affects the stability of 11 centrality measures. While previous studies have made significant contributions to the analysis of centrality measures, several limitations remain. First, many of these studies consider only a limited number of empirical networks and, thus, do not allow to generalize the results and draw meaningful conclusions about the class of centrality measures that are most sensitive to incomplete data. Second, most studies explore only classical centrality measures (e.g., degree, betweenness, closeness, and eigenvector centralities) and, consequently, do not provide a comparative analysis with other existing measures. Furthermore, the sensitivity of centrality measures is often assessed under the assumption that data are missing at random, which is not true for many empirical networks.

This article aims to evaluate the robustness of centrality measures on a large set of different benchmark network topologies. Fig 1 illustrates the methodology of our research. We consider 113 networks of varying sizes and domains. For each network, we sequentially perform multiple changes in the structure of the graph by removing existing or adding new links, thus, simulating the most common effects of incomplete or incorrect data. Next, we evaluate the ranking of nodes in initial and modified graphs using a particular centrality measure. Finally, we compare the rankings across different networks and evaluate the robustness of 16 centrality measures. We would like to emphasize that we focus on the problem of incomplete data. Hence, we do not investigate the targeted addition or deletion of links intended to affect the network’s robustness and resilience [21, 22].

Materials and methods

Preliminaries

We consider a graph , where is a set of nodes, , and is a set of links, . The graph G is described by an adjacency matrix A whose elements aij are either one or zero depending on whether there is a link between nodes i and j or not and aii = 0 for . The graph is called undirected if A is symmetric ( for ) and directed, otherwise. Additionally, the graph G can be described by an non-negative weight matrix W, where each element wij represents the intensity of a link between nodes i and j. Given a graph G, a centrality measure associates a real number with each node , which is usually interpreted as follows: the larger is, the more central node i should be. We focus on undirected graphs and examine centrality measures that are applicable to these structures.

Centrality measures

In this subsection, we describe the centrality measures that we study in the paper.

  1. Degree centrality [23]. For undirected graphs, the degree of node i is equivalent to the total number of adjacent links. For weighted graphs, the intensity of a link is considered. High values of the degree centrality indicate nodes with the most connections to other nodes, making it easier for them to access and influence other nodes locally.
  2. k-shell centrality [24]. The centrality of node i assigns the highest order of a core that contains this node. A k-core is a maximal subgraph of G in which each node has at least degree k. Nodes with high k-shell levels tend to be located at the center of the network, giving them greater spreading capability.
  3. Collective Influence (CollInf) [25]. The centrality of node i depends on its degree ki and the degrees of its neighbours at a particular distance l [25], i.e.,(1)
    Nodes with high collective influence have strong connections to their neighbors and are located within dense, well-connected network structures at radius l.
  4. Eigenvector centrality [2, 26]. The eigenvector centrality assigns the relative importance of node i by giving greater weight to more important neighbors, thereby contributing more to the centrality of node i than less important neighbors. The eigenvector centrality is defined by the principal eigenvector of the adjacency matrix A with the leading eigenvalue , i.e.,(2)
  5. Subgraph centrality [27]. It computes the number of closed walks (walks where the first and the last nodes are the same) of different lengths in G, i.e.,(3)
    Nodes with high subgraph centrality scores are involved in many connected subgraphs and, consequently, contribute significantly to the overall connectivity and functionality of the network.
  6. PageRank [28]. The importance of a node depends on the probability to be visited by a random walker, i.e.,(4)where is the probability to continue the walk. Nodes with high PageRank scores have the highest probability of being visited by the random walker and, consequently, play an important role in the network.
  7. Laplacian centrality [29]. The Laplacian centrality of a node measures the network’s ability to respond to the deactivation of that node. It measures the drop in the Laplacian energy after deleting a node from the graph, i.e.,(5)where Gi is the graph obtained by deleting node i from G and , are the eigenvalues of G’s Laplacian matrix.
  8. Betweenness centrality [30]. It measures how often a node lies on the shortest paths between other nodes, i.e.,(6)where is the number of the shortest paths from node j to node k and is the number of the shortest paths from node j to node k that pass through i. Nodes with high betweenness centrality play a vital role in facilitating the flow of information and resources throughout the network.
  9. Stress [31]. A variant of the betweenness centrality where the centrality of node i counts all of the shortest paths which pass through node i, i.e.,(7)where is the number of the shortest paths from j to k that pass through i.
  10. Current-flow betweenness [32]. The extension of the betweenness centrality where all paths between nodes are considered. Nodes with high current-flow betweenness play a vital role in facilitating the flow of information that does not necessarily follow the shortest paths.
  11. Egocentric betweenness [33]. The betweenness centrality of node i within its egocentric network (subgraph of G with node i and its 1-hop neighbors). Nodes with high egocentric betweenness play a vital role in facilitating the flow of information among their neighbors.
  12. Closeness centrality [23]. It evaluates how close each node is to other nodes in the network, i.e.,(8)where dijis the length of the shortest path from node i to node j. Nodes with high closeness centrality are key players in quickly transmitting information along the shortest paths in the network.
  13. Harmonic centrality [34]. A variant of the closeness centrality where the centrality of node i is computed as the sum of inverse distances to other nodes, i.e.,(9)
  14. Decay [35]. Decay centrality reflects the importance of nodes by accounting for the diminishing effects of distance. It is a variant of closeness centrality in which the distance dij between nodes i and j is subject to a decay parameter (10)
  15. Current-flow Closeness [32]. A variant of the closeness centrality, which utilizes the idea of electrical current in a network(11)where pij(i) is the absolute electrical potential of node i based on the electrical current supply from node i to node j. Nodes with high closeness centrality are key players in rapidly transmitting information that does not necessarily follow the shortest paths across the network.
  16. LRIC [3]. LRIC is based on the concept that nodes are not homogeneous, with each node i having an individual threshold of influence qi, while its neighbors may form groups to influence node i. Nodes with high LRIC scores have the greatest direct and indirect group influence on other nodes in the network.

For weighted networks, we consider only weighted degree, eigenvector, PageRank, Laplacian and LRIC centralities, because other betweenness-based and closeness-based measures require the definition of the shortest paths, which may differ across the datasets.

Datasets

The robustness of centrality measures is evaluated on 113 undirected networks. Table 1 provides a short description of the networks. We examine 113 research-quality networks of varying sizes and domains from the Index of Complex Networks (ICON) [36] and our previous study [19]. These networks describe various types of interactions, including social interactions between members of the US Congress (congress-Twitter [37]), members of a university karate club (karate [38]), university employees (CS-Aarhus [39]), movie characters [40], terrorists (the 9/11 terrorist network [41], Madrid2004-terrorists [42], Mali terrorist network [43], Noordin [44], RHODESBOMBING [45]), gangs (Italian gangs [46], London gang [47], Montreal street gangs [48]), criminals (e.g., Siren and Togo operations [49], cocaine smuggling [50], Sicilian Mafia [51]), households (Ugandan village networks [52]) and animals (dolphins [53] and zebras [54]). Additionally, we include one biological network (Fullerene C60 [55]), one economic network (Medieval Russian Trade [56]), 24 technological networks (Agis, Bbnplanet, Bics, Biznet, Cesnet, Chinanet, CrlNetworkServices, DeutscheTelekom, Dfn, Evolink, Forthnet, GtsCzechRepublic, GtsPoland, HurricaneElectric, Integra, Janetbackbone, Litnet, Niif, Psinet, Renater2010, Rnp, Sanet, Xeex, Xspedius [57]), two transportation networks (Amsterdam metro [58], PATH rail system [59]), and one word adjacency network (word-adj [60]). Most of the networks suffer from incompleteness. Therefore, the sensitivity analysis of centrality measures toward graph modifications is reasonable for these networks.

There are some limitations in our study related to the choice of datasets. Due to the high computational complexity of some classical centrality measures, we focus on networks with a relatively small number of nodes (most networks containing fewer than 100 nodes). Hence, the results of the study may not be generalizable to large networks.

Second, approximately 44% of the networks in our study are fictional, as they represent relationships between movie characters. Movie networks are the one-mode projections of bipartite networks, where character co-appearance in the same scene serves as a proxy for connectedness [40]. When n characters appear together in a scene, they form a fully connected clique of size n in the movie network. This results in an overlapping clique structure (a union of densely connected subgroups), which is often unnatural for real-world networks and can notably impact certain centrality measures. For instance, this overlapping clique structure leads to shorter average path lengths, resulting in high closeness centrality for many nodes. It also causes reduced betweenness scores for nodes within cliques, while nodes acting as bridges between cliques tend to have disproportionately high betweenness scores. A similar issue of overlapping clique structures occurs in certain terrorist and criminal networks, which are also one-mode projections derived from bipartite individual-event, individual-membership or individual-location networks [4244, 46].

Next, many movie networks are highly centralized, as primary actors tend to appear in most scenes. Fig 2 illustrates that only 16% of movie networks have low degree centralization [61] values (<0.4), while 59% exhibit degree centralization scores above 0.6. While movie networks serve as a valuable example of incomplete data—where many actor connections remain unobserved, similar to social network studies centered on specific individuals [62]—they may not fully capture the complexity of real-world social interactions.

thumbnail
Fig 2. Degree assortativity and centralization across different datasets.

https://doi.org/10.1371/journal.pcsy.0000042.g002

Finally, our datasets primarily consist of movie (), social () and technological () networks, with other types accounting for approximately 7% of the total number of networks. Fig 2 also illustrates that approximately 91% of the networks exhibit negative degree assortativity values [63]. Therefore, our findings on the average sensitivity of classical centrality measures may not be generalizable to networks from other domains (financial, brain, biological, etc.) or to different network structures, such as those with positive degree assortativity.

The correlation between centrality measures across empirical datasets is discussed in S1 Appendix. For each of the networks, we apply various data imputation strategies and evaluate the sensitivity of centrality measures.

Data imputation

The effect of missing data on the results of centrality measures is hard to estimate as there are various causes of data incompleteness. For instance, trade networks suffer from data incompleteness because some countries do not report their trade flows (missing links), use distinct commodity classifications or evaluate bilateral trade costs differently (data inaccuracy). Protein networks have missing links as there are many undiscovered or unobserved interactions between proteins. Hence, there is no single best strategy how to modify the structure of a network and address all the issues of incomplete data. In other words, perturbation analysis should be performed with respect to the nature of a network of interest and the type of errors in the data.

Since nodes are often easier to observe in many empirical networks (e.g., social or international trade networks), we do not examine node removal or addition and instead focus solely on the relationships between nodes. Table 2 illustrates the actions that we apply to modify the structure of the networks. These actions take into account the two most common types of errors in the data: measurement errors and incomplete information about links. For unweighted networks, the RR scenario examines the robustness of centrality measures if some of the links have been identified incorrectly. For weighted networks, the RC scenario studies the effect of errors in the flow data. Other graph modifications examine the problem of missing links. The main difference between adding links strategies is the probability of choosing unobserved links between nodes. In RA, there is an equal probability of choosing the missing link. However, since missing links are not uniformly distributed in most empirical networks, we consider the DA, PA, and AA scenarios, which assume that the probability of missing links depends on the network’s structure. In DA, the probability of choosing links between nodes is proportional to the product of their degrees. PA is driven by the idea of similarity between nodes, which can be estimated by the geodesic distance between them. Finally, the AA scenario is based on the Adamic-Adar index, which is often used to predict links in social networks [64]. The Adamic-Adar index between nodes i and j is(12)where denotes the set of neighbors of node i. The difference between RR, RA, DA, PA and AA scenarios is discussed in S2 Appendix.

thumbnail
Table 2. List of modifications in the structure of a network.

https://doi.org/10.1371/journal.pcsy.0000042.t002

For each network G and graph perturbation strategy, we generate 1,000 perturbed graphs and then compare the similarity between centrality measures in G and . The parameter k varies from 1% to 30% of the total links in the initial graph G. For weighted graphs, k is based on the total weight of links in G.

The list of graph changes, which is presented in Table 2, is not exhaustive. The choice of the appropriate imputation method depends on the nature of the missing data as well as the type of a network. A discussion of various data sampling and imputation techniques and the effect of missing data on network measurement are provided in [19, 6574].

Performance evaluation

In this subsection, we describe various metrics that are employed in the paper to assess the stability of the centrality measures. As illustrated in Fig 1, we first define the centrality of the nodes in the initial graph G and its modified version . For each centrality measure and graph modification, we rank all the nodes according to their centrality and then compare the rankings as well as the set of the most central elements. The higher the similarity between the rankings of the nodes in G and is, the less sensitive the centrality is to a certain modification of the graph. The list of performance metrics is presented below.

  1. Correlation: the Kendall rank correlation coefficient [75], which measures the similarity between two rankings of nodes as(13)where Nc and Nd are the numbers of concordant and discordant pairs.
  2. TOP1: the Jaccard index, which measures the similarity between two sets of the most important nodes in G and as(14)where C and denote sets of the most important nodes in G and . The most important nodes are the nodes with the highest centrality score.
  3. TOP3: the Jaccard index between TOP-3 nodes in G and .
  4. TOP5: the Jaccard index between TOP-5 nodes in G and .
  5. TOP10: the Jaccard index between TOP-10 nodes in G and .

Fig 3 illustrates an example of the performed experiment for the betweenness centrality. Starting with the initial graph G, we construct a new network (one possible realization of ) by randomly adding three links: (2,3), (5,7) and (6,8). We then evaluate and compare the betweenness centrality in graphs G and . The Kendall rank correlation coefficient between nodes rankings in G and is 0.72, as node 2 in has a higher betweenness score than node 7, node 5 ranks higher than node 1, and node 3 surpasses nodes 1 and 8. The Jaccard index for the TOP3 nodes is 1, indicating that nodes 6, 7, and 2 remain the most important in both graphs G and .

thumbnail
Fig 3. Perturbation analysis for the betweenness centrality: an example.

https://doi.org/10.1371/journal.pcsy.0000042.g003

Since the robustness of centrality measures is evaluated in 1,000 realizations of graph , derived from the corresponding graph G, it is important to aggregate the result of our experiments across all the networks. We employ two methods: the average score and the Copeland score. For instance, the average correlation score of a centrality is the average correlation between graphs G and . The Copeland score is a social choice rule that measures the difference between the cardinality of dominating and dominated sets [76, 77]. The dominating set of centrality x includes centrality measures, which are more sensitive than x to the graph modification in most experiments. Similarly, the dominated set of x contains centralities that are less sensitive than x in most experiments. Therefore, the Copeland score performs a pairwise comparison of the centralities and identifies the measures that are less sensitive to perturbations in most of the experiments.

Fig 4 shows an example where 3 centrality measures are compared in 5 experiments by some performance metric X. Centralities A and B have the same average correlation coefficient, although A outperforms B in 80% of experiments. By contrast, A has the highest Copeland score as it is better than both B and C in most cases. Since our experiments explore possible realizations of the initial partially-observed network, the Copeland score provides additional insights into the ranking of centrality measures.

thumbnail
Fig 4. The difference between the average value and the Copeland score.

https://doi.org/10.1371/journal.pcsy.0000042.g004

Results

In this subsection, we discuss the robustness of the centrality measures with respect to each graph modification. We present the results for the correlation coefficient as well as for the TOP-5 most important nodes. The confidence intervals associated with the reported estimates are provided in S3 Appendix. Through our empirical analysis of 1,000 modified graphs, we observe that the robustness of centrality measures remains consistent across the sampled datasets. We also observe that the 95% confidence intervals are quite narrow, indicating a high level of precision in our estimates. The results for the TOP-1, TOP-3 and TOP-10 nodes agree with the results of this subsection and therefore are provided in S3-S4 Appendices.

Robustness of centrality measures in unweighted graphs

Random removal of links (RR).

Fig 5 illustrates the robustness of the centrality measures with respect to the Kendal rank correlation if links are removed from the initial graph G. The degree centrality, which is the local measure, is the most robust measure that has a very strong correlation coefficient (0.78) with the observed graph G even if 30% of links are removed from the graph. By contrast, the majority of betweenness-based centralities are the most sensitive measures. The removal of links from G has a large impact on the shortest paths between the nodes, therefore, the ranking of nodes may change dramatically. At the same time, closeness, decay and harmonic centralities are more robust than the betweenness centrality because the removal of links has a higher impact on the shortest paths rather than on their lengths. Interestingly, some measures may outperform other centralities with the increase of k. For instance, the egocentric betweenness, which measures a local betweenness of nodes, becomes more robust than the closeness-based centralities for . There are also centralities (LRIC and current-flow betweenness) that provide relatively more robust results for large k.

thumbnail
Fig 5. The average Kendal rank correlation and its Copeland score for RR.

https://doi.org/10.1371/journal.pcsy.0000042.g005

Next, we compare the sets of TOP-5 nodes with Jaccard index in Fig 6. Again, most betweenness-based centralities demonstrate the largest difference in the set of TOP-5 nodes. However, the k-shell centrality, which provides, on average, high Kendal rank correlation coefficient (see Fig 5), has a low Jaccard index for TOP-5 nodes. This observation may be caused by the fact that in many cases the set C of the most important nodes may be very large while the removal of links from G decreases the set C significantly. We can also conclude that the degree, the current-flow betweenness, PageRank, the egocentric betweenness and LRIC provide stable sets of the most important nodes.

thumbnail
Fig 6. The average Jaccard index for TOP-5 nodes and its Copeland score for RR.

https://doi.org/10.1371/journal.pcsy.0000042.g006

Finally, we observe that degree, PageRank, LRIC and current-flow betweenness centralities are the least sensitive measures to the presence of random incorrect links in the initial undirected unweighted graph G. The current-flow betweenness centrality provides a robust set of TOP-5 nodes for any observed while its correlation coefficient (see Fig 5) is more stable than most other measures only for .

Addition of new links (RA, DA, PA and AA).

Assume now that links are not present in a given graph G. Fig 7 illustrates the sensitivity of the centrality measures with respect to the average correlation coefficient and the link addition strategy.

thumbnail
Fig 7. The average Kendal rank correlation of centrality measures for RA, DA, PA, and AA.

https://doi.org/10.1371/journal.pcsy.0000042.g007

First, we notice that the addition of links has almost the same impact on the robustness of centrality measures as the removal of links. However, the ranking of centrality measures is different for link addition and link removal strategies. The addition of links has the largest effect on all betweenness-based measures (stress, betweenness, egocentric betweenness and current-flow betweenness centralities) and the LRIC index. The k-shell and the degree centrality are the most robust measures for any link addition strategy. The ranking of the remaining centralities depends on the link addition strategy. Overall, we observe that the ranking of centrality measures does not differ significantly between RA, DA, PA, and AA scenarios.

Second, we compare the average Jaccard index between sets of TOP-5 nodes. Fig 8 shows that the stress centrality is the most sensitive index for all link addition scenarios. In contrast to the removal of links, the k-shell centrality provides the most robust results with respect to both the correlation score and the TOP-5 nodes for the link addition. We observe that the set of the most central nodes is relatively large and does not change significantly with the addition of new links. Among the other measures, we also observe a high Jaccard index (>0.82) for the degree and the PageRank centralities. The Copeland score of the centrality measures is provided in S3 Appendix.

thumbnail
Fig 8. The comparison of TOP-5 nodes for RA, DA, PA and AA.

https://doi.org/10.1371/journal.pcsy.0000042.g008

Next, we compare different link addition strategies. Fig 7 shows that the addition of random links (RA) to the graph G has the largest effect on the average correlation coefficient. The addition of links with a probability that is inversely proportional to the shortest path distance between nodes (PA) is similar to RA, which may be caused by the small diameter of real networks and, consequently, the fact that almost any link between nodes has almost the same probability. By contrast, degree-based (DA) and Adamic-Adar-based (AA) strategies have the lowest impact on the average correlation coefficient for most centralities. AA also provides the lowest sensitivity of centrality measures in terms of the TOP-5 nodes.

Finally, we emphasize that the presented results demonstrate the average sensitivity of centrality measures across 110 unweighted networks from Table 1. In principle, the robustness of centrality measures depends on the network and the graph perturbation strategy. For instance, Fig 9 illustrates how the mean and the variance of the Kendal rank correlation vary across datasets for DA (). We consider degree (left) and k-shell (right) centralities, which are the most robust measures according to Fig 7. Each point corresponds to a particular dataset. Fig 9 proves that the robustness of the centrality measures depends on the dataset. For instance, the robustness of the k-shell centrality in some transportation and technological networks varies significantly, which can be attributed to their smaller size and lower density, where changes in the network structure have a greater impact on centrality values, depending on where links are added.

thumbnail
Fig 9. The mean and the variance of the Kendal rank correlation for degree and k-shell centralities across datasets.

https://doi.org/10.1371/journal.pcsy.0000042.g009

Robustness of centrality measures in weighted graphs

Random change of link weights (RC).

Fig 10 illustrates the robustness of centrality measures if the graph G contains errors in link weights. The confidence intervals associated with the reported estimates are provided in S4 Appendix.

thumbnail
Fig 10. The average Kendal rank correlation and the average Jaccard index of centrality measures for RC.

https://doi.org/10.1371/journal.pcsy.0000042.g010

Laplacian and eigenvector centralities demonstrate the highest robustness of nodes rankings. However, the sensitivity of centrality measures in terms of r most central elements varies with respect to r. Both LRIC and weighted degree provide quite robust sets of r = 3 most central nodes (TOP-3) but for r = 10 the most stable results are given by the Laplacian and eigenvector centralities. In other words, for LRIC and weighted degree, the perturbation of G has the largest effect on the ranking of TOP-10 nodes, which are not in TOP-3 and TOP-5. By contrast, for Laplacian and eigenvector centralities, the graph perturbation changes the ordering of TOP-10 nodes but not the list of these nodes. Overall, we observe that all centrality measures have a very strong average correlation coefficient (0.89) between rankings of nodes in graphs G and even if there is up to 30% inaccuracy in the weights of the links. Moreover, random changes in link weights (RC) do not significantly influence the centrality of nodes in the network.

Addition of new links (RA and DA).

The effect of the link addition to weighted graphs is shown in Fig 11. The average Kendal rank correlation is higher in DA (degree-based link addition) than in RA (random addition of new links) for all centrality measures. LRIC and PageRank are the most sensitive to missing links in terms of the average correlation coefficient and the set of TOP-5 nodes under RA strategy. Other centrality measures are quite stable: the average Jaccard similarity between TOP5 nodes in graphs G and is greater than 0.9 while the average Kendall rank correlation of nodes rankings is strong (>0.8).

thumbnail
Fig 11. The average Kendal rank correlation and the average Jaccard index of centrality measures for RA and DA.

https://doi.org/10.1371/journal.pcsy.0000042.g011

For DA strategy, weighted degree centrality provides the most stable results as DA favors high-degree nodes. On the contrary, the eigenvector centrality, which is among the most robust measures in RA, becomes more vulnerable in DA compared to other centrality measures. We also conclude that the LRIC index is the most sensitive measure for both RA and DA.

Discussion

The problem of data inaccuracy and incompleteness is a serious challenge for the analysis of complex systems. Since most real networks are partially observed, some centrality measures can be misused and lead to wrong interpretation. In this regard, the choice of the most appropriate centrality measure requires a careful examination.

We discussed the sensitivity of 16 centrality measures to different data imputation techniques. To draw meaningful and robust conclusions about the average sensitivity of different centrality measures, we have performed experiments on a large set of different benchmark network topologies.

Our main observation for unweighted networks is that the addition or the removal of new links has the largest impact on the betweenness-based centralities. Degree and PageRank centralities are among the least sensitive measures to link addition/removal. We have also identified some measures that are more robust to the link removal than to the link addition and vice versa. In general, there is also no evidence of considerable changes in the relative ranking of centrality measures in all the link addition strategies discussed.

For weighted networks, inaccuracy in link weights does not significantly affect the centralities of the nodes (except LRIC). The existence of missing links has a greater influence on the robustness of the centrality measures than the presence of errors in the weights of the links. PageRank and Laplacian centrality are the most robust measures with respect to random link addition (RA) and degree-based link addition (DA) scenarios. Finally, we observe that the perturbations in weighted networks have a lower impact on node centrality than the perturbations in unweighted networks.

The results of our experiments provide an overall picture of the robustness of centrality measures. More precisely, we demonstrate how the centrality measures behave on average (in most experiments on real networks) to a specific type of graph modification. Our experiments do not aim to demonstrate that there exists the most robust measure for all networks because, in general, the perturbation analysis should be conducted with respect to the nature of errors in the network of interest. However, our study emphasizes the problem of the sensitivity of centrality measures in the presence of incomplete data.

As a limitation, our analysis is performed on small empirical networks due to the high computational complexity of some centrality measures. Hence, our findings from small networks may not fully generalize to larger networks, which typically exhibit more complex structures. Our experiments are conducted on a large set of networks, primarily consisting of social, technological, and movie networks. Many of these networks are fictional or represent one-mode projections derived from bipartite co-appearance, co-event, or co-location networks, and this structure affects certain centrality measures. As a result, the findings from the sensitivity analysis may not necessarily generalize to networks from other domains. Comparing centrality measures across larger networks, as well as networks from a wider variety of empirical domains, is an important next step in our future research.

Finally, our work is not intended to demonstrate the deficiency of some centrality indices, but rather to show that some centralities require a cautious interpretation in the presence of missing or incorrect links in the networks.

Supporting information

S1 Appendix. Correlation of centrality measures across empirical datasets.

https://doi.org/10.1371/journal.pcsy.0000042.s001

(PDF)

S3 Appendix. Comparison of TOP-k nodes and confidence intervals with respect to link addition/removal strategies in unweighted graphs.

https://doi.org/10.1371/journal.pcsy.0000042.s003

(PDF)

S4 Appendix. Comparison of TOP-k nodes and confidence intervals with respect to link addition/removal strategies in weighted graphs.

https://doi.org/10.1371/journal.pcsy.0000042.s004

(PDF)

Acknowledgments

Sergey Shvydun sincerely thank Natalia Meshcheryakova from the National Research University Higher School of Economics (HSE University) for her invaluable support, insightful comments and contributions, particularly in visualization and manuscript review.

References

  1. 1. Mahdi J. Centiserver: the most comprehensive centrality resource and web application for centrality measures calculation. 2024. Available from: https://www.centiserver.ir/centrality/list/
  2. 2. Newman MEJ. Networks: an introduction. Oxford, UK: Oxford University Press; 2010.
  3. 3. Aleskerov F, Shvydun S, Meshcheryakova N. New centrality measures in networks: how to take into account the parameters of the nodes and group influence of nodes to nodes. 1st edn. Chapman and Hall/CRC; 2021. https://doi.org/10.1201/9781003203421
  4. 4. Myerson RB. Graphs and cooperation in games. Math OR. 1977;2(3):225–9.
  5. 5. Chanhyun Kang, Molinaro C, Kraus S, Shavitt Y, Subrahmanian VS. Diffusion centrality in social networks. In: 2012 IEEE/ACM International Conference on Advances in Social Networks Analysis and Mining. 2012. p. 558–64.
  6. 6. Ficara A, Cavallaro L, Curreri F, Fiumara G, De Meo P, Bagdasar O, et al. Criminal networks analysis in missing data scenarios through graph distances. PLoS One. 2021;16(8):e0255067. pmid:34379625
  7. 7. Aleskerov F, Andrievskaya I, Nikitina A, Shvydun S. Key borrowers detected by the intensities of their interactions. In: Handbook of Financial Econometrics, Mathematics, Statistics, and Machine Learning (In 4 volumes), vol. 1, chap. 9. Singapore: World Scientific; 2020. p. 355–89.
  8. 8. Meshcheryakova N. Network analysis of bilateral trade data under asymmetry. In: IEEE/ACM International Conference on Advances in Social Networks Analysis and Mining (ASONAM), The Hague, Netherlands; 2020. p. 379–83.
  9. 9. Borgatti SP, Carley KM, Krackhardt D. On the robustness of centrality measures under conditions of imperfect data. Social Networks. 2006;28(2):124–36.
  10. 10. Frantz TL, Cataldo M, Carley KM. Robustness of centrality measures under uncertainty: examining the role of network topology. Comput Math Organ Theory. 2009;15(4):303–28.
  11. 11. Segarra S, Ribeiro A. Stability and continuity of centrality measures in weighted graphs. IEEE Trans Signal Process. 2016;64(3):543–55. https://doi.org/10.1109/ICASSP.2015.7178599
  12. 12. Martin C, Niemeyer P. Influence of measurement errors on networks: estimating the robustness of centrality measures. Net Sci. 2019;7(2):180–95.
  13. 13. Murai S, Yoshida Y. Sensitivity analysis of centralities on unweighted networks. In: The World Wide Web Conference. 2019. p. 1332–42.
  14. 14. Meshcheryakova N, Shvydun S. Perturbation analysis of centrality measures. In: Proceedings of the International Conference on Advances in Social Networks Analysis and Mining. IEEE; 2023. p. 407–14.
  15. 15. Meshcheryakova N, Shvydun S. A comparative analysis of centrality measures in complex networks. ARC. 2024;85(8):773–84.
  16. 16. Bolland JM. Sorting out centrality: an analysis of the performance of four centrality models in real and simulated networks. Soc Netw. 1988;10(3):233–53.
  17. 17. Herland M, Pastran P, Zhu X. An empirical study of robustness of network centrality scores in various networks and conditions. In: 2013 IEEE 25th International Conference on Tools with Artificial Intelligence. 2013. p. 221–8.
  18. 18. Niu Q, Zeng A, Fan Y, Di Z. Robustness of centrality measures against network manipulation. Phys A: Statist Mech Appl. 2015;438:124–31.
  19. 19. Jović M, Amir-Haeri M, Rimfeld K, Ensink JBM, Lindauer RJL, Vrijkotte TGM, et al. Harmonization of SDQ and ASEBA phenotypes: measurement variance across cohorts. J Psychopathol Behav Assess. 2025;47(1):27. pmid:40062209
  20. 20. Costenbader E, Valente TW. The stability of centrality measures when networks are sampled. Soc Netw. 2003;25(4):283–307.
  21. 21. Albert R, Jeong H, Barabasi A. Error and attack tolerance of complex networks. Nature. 2000;406(6794):378–82. pmid:10935628
  22. 22. Artime O, Grassia M, De Domenico M, Gleeson JP, Makse HA, Mangioni G, et al. Robustness and resilience of complex networks. Nat Rev Phys. 2024;6(2):114–31.
  23. 23. Freeman LC. Centrality in social networks conceptual clarification. Soc Netw. 1978;1(3):215–39.
  24. 24. Batagelj V, Zaversnik M. Generalized cores, arXiv:0202039 [Preprint]. 2002 [cited 2024 May 30]. Available from: https://arxiv.org/abs/cs/0202039
  25. 25. Morone F, Makse HA. Influence maximization in complex networks through optimal percolation. Nature. 2015;524(7563):65–8. pmid:26131931
  26. 26. Bonacich P. Power and centrality: a family of measures. Am J Sociol. 1987;92(5):1170–82.
  27. 27. Estrada E, Rodríguez-Velázquez JA. Subgraph centrality in complex networks. Phys Rev E Stat Nonlin Soft Matter Phys. 2005;71(5 Pt 2):056103. pmid:16089598
  28. 28. Brin S, Page L. The anatomy of a large-scale hypertextual web search engine. Comput Netw ISDN Syst. 1998;30(1–7):107–17.
  29. 29. Qi X, Fuller E, Wu Q, Wu Y, Zhang C-Q. Laplacian centrality: a new centrality measure for weighted networks. Inf Sci. 2012;194:240–53.
  30. 30. Freeman LC. A set of measures of centrality based on betweenness. Sociometry. 1977;40(1):35–41.
  31. 31. Shimbel A. Structural parameters of communication networks. Bullet Math Biophys. 1953;15(4):501–7.
  32. 32. Kumar N, Gond C, Singh JD, Datta A. Molecular docking, pharmacological profiling, and MD simulations of glycolytic inhibitors targeting novel SARS CoV-2 main protease and spike protein. In Silico Pharmacol. 2025;13(1):44. pmid:40093584
  33. 33. Freeman LC. Centered graphs and the structure of ego networks. Math Soc Sci. 1982;3(3):291–304.
  34. 34. Beauchamp MA. An improved index of centrality. Behav Sci. 1965;10:161–3. pmid:14284290
  35. 35. Jackson M. Social and economic networks. Princeton: Princeton University Press; 2008.
  36. 36. Clauset A, Tucker E, Sainz M. The Colorado index of complex networks. 2016. Available from: https://icon.colorado.edu/
  37. 37. Fink CG, Omodt N, Zinnecker S, Sprint G. A Congressional Twitter network dataset quantifying pairwise probability of influence. Data Brief. 2023;50:109521. pmid:37701709
  38. 38. Zachary WW. An information flow model for conflict and fission in small groups. J Anthropol Res. 1977;33(4):452–73.
  39. 39. Magnani M, Micenkova B, Rossi L. Combinatorial analysis of multiple networks. arXiv:1303.4986 [Preprint]. 2013 [cited 2024 May 30]. Available from: https://arxiv.org/abs/1303.4986
  40. 40. Kaminski J, Schober M, Albaladejo R, Zastupailo O, Hidalgo C. Moviegalaxies-social networks in movies. 2012. Available from: http://moviegalaxies.com
  41. 41. Krebs VE. Mapping networks of terrorist cells. Connections. 2002;24(3):43–52.
  42. 42. Hayes B. Connecting the dots. Am Scientist. 2006;94(5):400–4.
  43. 43. Walther OJ, Christopoulos D. Islamic Terrorism and the Malian Rebellion. Terrorism Political Violence. 2014;27(3):497–519.
  44. 44. Roberts N. Roberts and everton terrorist data: Noordin top terrorist network (subset). Machinereadable data file. 2011. Available from: https://www.thearda.com/data-archive?fid=TERRNET
  45. 45. Rhodes CJ, Jones P. Inferring missing links in partially observed social networks. J Oper Res Soc. 2009;60(10):1373–83.
  46. 46. Coutinho J. Italian Gangs. 2016. Available from: https://sites.google.com/site/ucinetsoftware/datasets/covert-networks/italiangangs
  47. 47. Grund TU, Densley JA. Ethnic homophily and triad closure. J Contemp Criminal Justice. 2014;31(3):354–70.
  48. 48. Descormiers K, Morselli C. Alliances, conflicts, and contradictions in Montreal’s street gang landscape. Int Criminal Justice Rev. 2011;21(3):297–314.
  49. 49. Morselli C. Inside criminal networks. New York: Springer; 2009.
  50. 50. Giménez-Salinas A, Framis A. Illegal networks or criminal organizations: power, roles and facilitators in four cocaine trafficking structures. In: Morselli C, editors. Crime and Networks. Routledge; 2011. p. 131–47.
  51. 51. Ficara A, Cavallaro L, De Meo P, Fiumara G, Catanese S, Bagdasar O et al. Social network analysis of Sicilian Mafia interconnections. In: Cherifi H, Gaito S, Mendes J, Moro E, Rocha L, editors.Complex networks and their applications VIII: Volume 2 Proceedings of the Eighth International Conference on Complex Networks and Their Applications; 2020: p. 440–50.
  52. 52. Chami GF, Ahnert SE, Kabatereine NB, Tukahebwa EM. Social network fragmentation and community health. Proc Natl Acad Sci U S A. 2017;114(36):E7425–31. pmid:28739906
  53. 53. Lusseau D, Schneider K, Boisseau OJ, Haase P, Slooten E, Dawson SM. The bottlenose dolphin community of Doubtful Sound features a large proportion of long-lasting associations. Behav Ecol Sociobiol. 2003;54(4):396–405.
  54. 54. Sundaresan SR, Fischhoff IR, Dushoff J, Rubenstein DI. Network metrics reveal differences in social organization between two fission-fusion species, Grevy’s zebra and onager. Oecologia. 2007;151(1):140–9. pmid:16964497
  55. 55. Aref S, Wilson MC. Balance and frustration in signed networks. J Complex Netw. 2018;7(2):163–89.
  56. 56. Pitts FR. The medieval river trade network of Russia revisited. Soc Netw. 1978;1(3):285–92.
  57. 57. Knight S, Nguyen HX, Falkner N, Bowden R, Roughan M. The internet topology zoo. IEEE J Select Areas Commun. 2011;29(9):1765–75.
  58. 58. Gemeente Amsterdam Klaas-Bindert de Haan. (n.d.). Maps Amsterdam. https://maps.amsterdam.nl/
  59. 59. PATH Rail System: Schedules and Maps. [Internet]. Available from: https://www.panynj.gov/path/en/schedules-maps.html
  60. 60. Newman MEJ. Finding community structure in networks using the eigenvectors of matrices. Phys Rev E Stat Nonlin Soft Matter Phys. 2006;74(3 Pt 2):036104. pmid:17025705
  61. 61. Freeman LC. Centrality in social networks conceptual clarification. Soc Netw. 1978;1(3):215–39.
  62. 62. Mapping the Trump World. Trump World. [Internet]. Available from: https://sharmila-velamur.github.io/trumpworld/home.html
  63. 63. Newman MEJ. Mixing patterns in networks. Phys Rev E Stat Nonlin Soft Matter Phys. 2003;67(2 Pt 2):026126. pmid:12636767
  64. 64. Liben-Nowell D, Kleinberg J. The link prediction problem for social networks. In: Proceedings of the Twelfth International Conference on Information and Knowledge Management. 2003. p. 556–9.
  65. 65. Galaskiewicz J. Estimating point centrality using different network sampling techniques. Soc Netw. 1991;13(4):347–86.
  66. 66. Huisman M. Imputation of missing network data: some simple procedures. J Soc Struct. 2009;10(1):1–29.
  67. 67. Huisman M, Krause RW. Imputation of missing network data. In: Alhajj R, Rokne J, editors. Encyclopedia of social network analysis and mining. Springer, New York, NY; 2017. pp. 1-10. https://doi.org/10.1007/978-1-4614-7163-9_394-1
  68. 68. Smith JA, Moody J. Structural effects of network sampling coverage I: nodes missing at random. Soc Netw. 2013;35(4):652–68. pmid:24311893
  69. 69. Smith JA, Moody J, Morgan J. Network sampling coverage II: the effect of non-random missing data on network measurement. Soc Netw. 2017;48:78–99. pmid:27867254
  70. 70. Žnidaršič A, Ferligoj A, Doreian P. Stability of centrality measures in valued networks regarding different actor non-response treatments and macro-network structures. Net Sci. 2017;6(1):1–33.
  71. 71. Krause RW, Huisman M, Steglich C, Snijders TAB. Missing network data a comparison of different imputation methods. In: 2018 IEEE/ACM International Conference on Advances in Social Networks Analysis and Mining (ASONAM). IEEE; 2018. p. 159–63.
  72. 72. Krause RW, Huisman M, Steglich C, Snijders T. Missing data in cross-sectional networks – an extensive comparison of missing data treatment methods. Soc Netw. 2020;62:99–112.
  73. 73. Kossinets G. Effects of missing data in social networks. Soc Netw. 2006;28(3):247–68.
  74. 74. Smith J, Morgan J, Moody J. Network sampling coverage III: imputation of missing network data under different network and missing data conditions. Soc Netw. 2022;68:148–78.
  75. 75. Kendall M. Rank correlation methods, 4th edn. Griffin, London; 1970.
  76. 76. Saari DG, Merlin VR. The Copeland method. Econ Theory. 1996;8(1):51–76.
  77. 77. Shvydun S. Normative properties of multi-criteria choice procedures and their superpositions: I. Working paper WP7/2015/07 (Part 1). Moscow: HSE Publishing House. arXiv:1611.00524 [Preprint]. 2015 [cite 2024 May 30]. Available from: https://arxiv.org/abs/1611.00521