Ranking influential nodes in complex networks with community structure

Stephany Rajeh; Hocine Cherifi

doi:10.1371/journal.pone.0273610

Abstract

Quantifying a node’s importance is decisive for developing efficient strategies to curb or accelerate any spreading phenomena. Centrality measures are well-known methods used to quantify the influence of nodes by extracting information from the network’s structure. The pitfall of these measures is to pinpoint nodes located in the vicinity of each other, saturating their shared zone of influence. In this paper, we propose a ranking strategy exploiting the ubiquity of the community structure in real-world networks. The proposed community-aware ranking strategy naturally selects a set of distant spreaders with the most significant influence in the networks. One can use it with any centrality measure. We investigate its effectiveness using real-world and synthetic networks with controlled parameters in a Susceptible-Infected-Recovered (SIR) diffusion model scenario. Experimental results indicate the superiority of the proposed ranking strategy over all its counterparts agnostic about the community structure. Additionally, results show that it performs better in networks with a strong community structure and a high number of communities of heterogeneous sizes.

Citation: Rajeh S, Cherifi H (2022) Ranking influential nodes in complex networks with community structure. PLoS ONE 17(8): e0273610. https://doi.org/10.1371/journal.pone.0273610

Editor: José F. Vicent Francés, University of Alicante, SPAIN

Received: June 17, 2022; Accepted: August 12, 2022; Published: August 29, 2022

Copyright: © 2022 Rajeh, Cherifi. This is an open access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited.

Data Availability: All relevant data are within the article and its Supporting information files.

Funding: The authors received no specific funding for this work.

Competing interests: I have read the journal’s policy, and the authors of this manuscript have the following competing interests: Author HC serves on the editorial board of PLOS ONE. This does not alter our adherence to PLOS ONE policies on sharing data and materials.

Introduction

Many real-world systems, including transportation, social, technological, infrastructural, information, and biological systems, are complex. One can represent them using networks where nodes represent their constituent entities and links account for their interactions. Influential nodes in these systems play a critical role in the structure and dynamics of the network [1]. Identifying the most influential nodes in these networks is a major issue. Indeed, it allows conducting specific optimization tasks, such as controlling, minimizing, or maximizing a diffusion process. This issue is mainly related to centrality measures [2]. These measures extract diverse information from the network to quantify its importance. For instance, one can link a node’s capacity to infect others with its Degree and Coreness centrality [3]. Betweenness centrality allows for identifying genes related to heart attacks [4]. Other applications include hindering epidemic outbreaks [5], augmenting the effectiveness of marketing campaigns on social media [6], enhancing the resiliency of infrastructural networks [7], and many other [8–10].

Classically, one can divide centrality measures into local or global measures [2]. Local measures such as Degree and Maximum Neighborhood Component quantify the node’s importance based on its neighborhood. Global measures such as Betweenness and PageRank relate the node influence to its position in the entire network. One can also consider multidimensional measures simultaneously combining local and global information [11, 12]. More recent works exploit the network’s community structure to identify influential nodes [13–20]. They show that the community structure is a crucial factor in effectively quantifying the node’s influence [21, 22]. Centrality measures can also be time-dependent [23, 24].

Centrality measures can be signed (positive or negative) or unsigned (positive and negative). In the first case, one ranks nodes in descending order, and a fraction of the top nodes are selected to conduct a specific optimization task. In the second case, one can have a multitude of ranking schemes. One can use a combination of positive and negative ranks, take a fraction of both positions, or convert negative values to positive values and take the aggregate ranks. In either case, one can have two general ranking schemes, strong and weak. In the former, one selects the most critical nodes first. In the latter, one chooses the less important nodes [25]. Although centrality measures provide an effective way of ranking nodes, several challenges exist. The first challenge is that several centrality measures may underestimate the influence of specific nodes depending on the network’s structure. The second challenge is that many nodes with high centrality may be neighbors. Thus, targeting these nodes for diffusion or immunization is inefficient because one uses the resources locally, ignoring large parts of the network. The third challenge is which ranking schemes and/or combination criteria are ideal for a network with specific topological features.

To address these challenges, we propose a community-aware ranking method. Indeed, communities are pervasive in real-world networks [26–28]. A community is a densely connected and cohesive subgroup of nodes sharing few connections with nodes outside their group. The community structure of a network affects its underlying dynamics [29, 30]. Moreover, ongoing research emphasizes the benefits of the community structure as a basis for identifying influential nodes [13–20]. The proposed ranking strategy exploits this precious information. It is simple yet effective and applicable to all centrality measures. The most straightforward ranking strategy, given a centrality measure, targets the top nodes independently of the community structure if any. Instead, we propose to rank the nodes based on their importance in their communities. First, we select the most central nodes in each community. We order these nodes in decreasing order of their community size. Then we move to the next most central node in each community and adopt the same ordering strategy. We iterate this process until we reach the given budget of nodes to rank. This approach naturally selects distant nodes in each community.

To evaluate the proposed ranking strategy, we report a series of experiments on synthetic and real-world networks using a set of six classical centrality measures using the SIR epidemic model [31]. We categorize these centrality measures into three groups, namely neighborhood-based (Degree and Maximum Neighborhood Component), path-based (Betweenness and Closeness), and iterative refinement-based (Katz and PageRank). Experiments on synthetic networks investigate the impact of various network parameters on the proposed ranking strategy. Indeed, one can control the community structure strength, the community size distribution, and the degree distribution. Real-world networks include infrastructural, social, biological, citation, word, and collaboration networks with unknown community structures. Therefore, we uncover the communities using two community detection algorithms to assess the proposed strategy’s consistency linked to community structure variations. Results show that the community-aware ranking strategy is more effective than the classical ranking by descending order of the centrality measure. The main advantages of the proposed method are threefold:

It applies to all types of centrality measures in all types of networks (undirected/directed and unweighted/weighted).
It naturally selects distant nodes to expand any diffusion phenomena based on any given budget.
Its complexity depends on the centrality measure computed.

Proposed ranking strategy

Centrality measures aim at quantifying nodes’ influence. Nodes are usually ranked based on the descending order of their influence. Indeed, nodes with the highest centrality are supposed to be strategically located in the network. Thus, targeting them for any optimization task will yield desired outcomes. However, in real-world networks, these nodes may not be far apart, which can be detrimental to the effectiveness of dissemination strategies. Consider an immunization scenario in an epidemic process. Immunizing neighbors in priority, even if influential, may prevent the protection of vast areas of the network. To overcome this issue, we propose a simple ranking method considering the network’s community structure.

Algorithm

The proposed ranking strategy targets influential nodes spreading across communities in the network. It applies to any centrality measure. First, one computes the centrality of the nodes. Second, one targets top nodes community by community. Such a strategy prevents the concentration of influential nodes in the same network area. The targeted nodes are naturally more dispersed.

Algorithm 1 Community-aware ranking scheme

Input: Graph G(V, E), Centrality measure β, Sorted community set C, Budget B

Output: List of ranked nodes L

1: D ← ⌀ ▹ Compute the centrality of each node

2: for each i ∈ V do

3: D[i] ← β(i)

4: end for

5: for each c_{l,l∈{1,2,…,|C|}} ∈ C do ▹ Sorting the nodes inside their communities

6: for each i ∈ c_l do

7:

8: end for

9:

10: end for

11: while B ≠ 0 do ▹ Extract sorted list of nodes till budget is reached

12: for each and do

13: if then

14:

15: L.append(v)

16: B ← B − 1

17: end if

18: end for

19: end while

Doing so, it is more likely that any diffusion process spreads more uniformly in the network than in the case where targeted nodes by a centrality measure in a descending order ranking scheme are close to each other. Note that we assume the set of communities is sorted from the biggest to the smallest, with ties decided at random. Also, note that the maximum budget is the size of the network. The pseudocode is provided in Algorithm 1, and a Python version of the code is available on GitHub https://github.com/StephanyRajeh/CommunityAwareRankingScheme.

Toy example

Fig 1 illustrates the proposed ranking method on a toy example. The network contains 22 nodes and three communities in this example. Suppose the maximum budget is three nodes out of the whole network. We consider Degree and Betweenness centrality as measures of influence. Tables 1 and 2 in S1 Text report the centrality values and the corresponding ranks using the descending order and the proposed approach. Based on the descending order ranking scheme (Fig 1A) of the Degree centrality, we can see that the highest degree nodes (nodes 1, 4, and 5) belong to the same community C1. Similarly, the nodes with the highest Betweenness centrality (nodes 13, 14, and 15) are all located in the same community C2.

Download:

Fig 1. Illustrating the behavior of the descending order ranking scheme and the community-aware ranking scheme.

The nodes chosen are the top 3 nodes based on the Degree centrality (colored in red) and the Betweenness centrality (colored in blue).

https://doi.org/10.1371/journal.pone.0273610.g001

In contrast, the proposed community-aware ranking scheme (Fig 1B) selects the highest degree node in each community. Indeed, node 5 is picked from community C1, node 13 is picked from community C2, and node 19 is picked from community C3. Results with the Betweenness centrality are in the same vein. Instead of targeting the top three nodes in the same community C2, the proposed ranking approach selects nodes with the highest Betweenness centrality in each community. More precisely node 3 in C1, node 13 in C2, and node 21 in C3. Note that ranks of nodes with the same centrality value in a community are chosen randomly.

One of the main drawbacks of the classical descending order ranking scheme is ignoring the network’s community structure. From the diffusion perspective, if targeted nodes diffusing a piece of information or a virus are very close, the diffusion dies out before spreading across the other communities. On the contrary, the proposed ranking approach naturally selects the most influential nodes in their community. Indeed, the proposed ranking scheme favors nodes from all the dense parts of the network rather than specific communities.

Synthetic networks

We investigate synthetic networks using the LFR benchmark [32]. It allows generating modular networks with controlled power-law degree (γ) and community size (θ) distributions. In addition, one can also tune the community structure strength through the so-called mixing parameter (μ). Small values of μ indicate a strong community structure with few links between communities. Weak community structures correspond to high values of μ with a high fraction of connections between communities. Although one cannot tune the transitivity [33], the LFR benchmark assures generating networks with realistic features [34]. We perform a comparative evaluation of the community-aware ranking method with the classical descending order ranking method based on the SIR diffusion model. Simulation involves a set of synthetic networks with diverse values for the mixing parameter (μ), community size distribution (θ), and degree distribution (γ). Table 1 reports these parameters values.

Download:

Table 1. Synthetic networks’ parameters generated by the LFR model.

https://doi.org/10.1371/journal.pone.0273610.t001

Influence of the community structure strength

This experiment aims to investigate the influence of the community structure strength on the performance of the ranking strategies (the descending order ranking scheme and the proposed community-aware ranking scheme). The mixing parameter (μ) is tuned to cover a wide range of community structure strengths. It spans from very strong to very weak (μ = 0.05, 0.10, 0.20, 0.40, 0.70). Remember that a low value means few links between communities, indicating a strong community structure. In contrast, high value corresponds to networks with many links between communities, indicating a weak community structure.

Fig 2 shows the relative difference in the outbreak size (ΔR) as a function of the fraction of initially infected nodes of the six investigated centrality measures (Degree, Maximum Neighborhood Component, Betweenness, Closeness, Katz, PageRank) with a strong (μ = 0.05), medium (μ = 0.40), and weak (μ = 0.70) community structure strengths. The remaining parameters, including the community size (θ) and degree distribution (γ) exponents, are fixed at 2.7. The outbreak size (ΔR), represented by the red curve, is the difference between the number of nodes recovered after an initial set of nodes ranked based on the community-aware ranking scheme is infected and another initial set of nodes infected ranked based on the classical descending order ranking scheme. Thus, it represents a measure of performance of the community-aware ranking scheme. Positive values indicate that the proposed ranking scheme performs better (see S1 Text for details). The curves with all the values of μ are shown in S1 Fig.

Download:

Fig 2. Impact of the community structure strength (μ) in synthetic networks.

The figures represent the relative difference of the outbreak size (ΔR) as a function of the fraction of initially infected nodes. The red curve indicates the relative performance difference of the community-aware ranking strategy with the descending order ranking for the six centrality measures under test. The mixing parameter (μ) varies while the other parameters, including the community size distribution exponent (θ = 2.7) and the degree distribution exponent (γ = 2.7), are fixed.

https://doi.org/10.1371/journal.pone.0273610.g002

In networks with a strong community structure (μ = 0.05), the community-aware ranking scheme always outperforms the classical descending order ranking scheme for all the centrality measures under study. The gain reaches 24% for Katz centrality at a fraction of initially infected nodes (f_o) of 0.20, followed by 22% for Degree, MNC, and Closeness centrality. The performance of these measures is consistent from a fraction of initially infected nodes of 0.10 till 0.25, then they decline. Closeness centrality slightly declines, showing a ΔR of 14% at f_o = 0.50, followed by Katz centrality with 8%, then Degree and MNC obtaining a ΔR of 6%. Betweenness and PageRank are the less performing measures under the community-aware ranking scheme. The maximum gain for Betweenness is 12.5% at f_o = 0.12 and for PageRank is 16.5% at f_o = 0.9. After a peak, performance declines reaching a gain of 2% at f_o = 0.50.

In networks with a medium community structure (μ = 0.40), the community-aware ranking scheme of all the centrality measures still performs better than the classical descending order ranking scheme. When the fraction of initially infected nodes f_o is low (i.e., between 0.01 and 0.05), the gain for all the centrality measures is low, reaching a maximum of 1%. As the fraction of initially infected nodes increases, the performance of the community-aware ranking scheme also increases until it reaches a plateau or barely changes. For example, the relative difference in the outbreak size of Degree centrality increases from f_o equaling 0.10 to 0.25, going till ΔR = 6.5%, and then it hardly changes. MNC, Betweenness, Closeness, Katz, and PageRank show similar behavior with ΔR reaching a maximum between 5% and 10%.

In networks with a weak community structure (μ = 0.70), when the fraction of initially infected nodes is between 0.01 and 0.10, the relative difference in the outbreak size (ΔR) alternates between -1% and +1%. After that, it increases to a maximum of ΔR = 3% and shows a plateau. One can expect these results. Indeed, the frontier between weak community structure and no community structure is thin.

We fix the fraction of initially infected nodes at 0.15 for all the centrality measures in Fig 2 and plot the relative difference in the outbreak size (ΔR) as a function of the mixing parameter (μ) as shown in Fig 3B. As the community structure gets weaker (i.e., from μ = 0.05 to μ = 0.7), the performance of the community-aware ranking scheme starts declining. Moreover, one can differentiate between the centrality measures’ effectiveness. Closeness is the best-performing centrality measure, followed by Katz, Degree, and MNC. In contrast, Betweenness and PageRank perform poorly. However, all the measurements show a higher relative difference in the epidemic outbreak size than the classical descending order ranking scheme.

Download:

Fig 3. The relative difference of the outbreak size (ΔR) as a function of the mixing parameter (μ) when fraction of initially infected nodes (f_o) equals 0.15.

The color of the curve represents the centrality measures under study. (A) Synthetic networks with degree distribution γ = 2.7 and community size distribution θ = 2. (B) Synthetic networks with degree distribution γ = 2.7 and community size distribution θ = 2.7. (C) Synthetic networks with degree distribution γ = 2.7 and community size distribution θ = 3.

https://doi.org/10.1371/journal.pone.0273610.g003

These results show that the community-aware ranking scheme is more effective in networks with a strong community structure. Indeed, in a strong community structure, communities are so well-separated that one can consider them independent subnetworks with their topological characteristics. In turn, targeting the most influential nodes in each community leads to a higher spreading, ensuring that the diffusion reaches all communities. As the community structure gets weaker, the performance of the community-aware ranking scheme decreases. Since the community structure is not well defined, the network is barely distinguishable from the network with no community structure. However, even in the worst-case scenario, the community-aware ranking scheme still is more effective than the classical descending order ranking scheme.

Influence of the community size distribution

This investigation aims to analyze the impact of the community size distribution on the community-aware ranking scheme. One can tune the power-law community size distribution exponent (θ) in the networks generated by the LFR. In this study, we evaluate two values representing extreme cases. In the first case with θ = 2, few small communities coexist with large communities with a large variance in community sizes. In the second case, with θ = 3, more communities of equivalent sizes coexist, and the variance in the community sizes is minor. There are more communities in the second case than in the first case. Table 3 in S1 Text reports the number of communities of each generated network, along with the minimum and maximum size of the communities. The histograms of the distributions are given in S2 Fig. Note that we also perform tests with θ = 2.7. However, there were no significant differences compared to θ = 3.

Fig 4 shows the relative difference in the outbreak size (ΔR) as a function of the fraction of initially infected nodes for the Degree and Katz centrality. The community size distribution exponent θ equals 2 (panel A) and 3 (panel B). The other parameters are fixed, including the mixing parameter (μ = 0.05) and the degree distribution exponent (γ = 2.7). For the sake of brevity, since the remaining results of the centrality measures show similar behavior, they are given in S3 and S4 Figs with θ = 2 and θ = 3, respectively.

Download:

Fig 4. Impact of the community size distribution exponent (θ) in synthetic networks.

The figures represent the relative difference of the outbreak size (ΔR) as a function of the fraction of initially infected nodes. The red curve indicates the relative performance difference of the community-aware ranks of the Degree and Katz centrality measures compared to the descending order ranks. The community size distribution exponent (θ) varies while the other parameters, including the mixing parameter (μ = 0.05) and the degree distribution exponent (γ = 2.7), are fixed.

https://doi.org/10.1371/journal.pone.0273610.g004

When θ = 2 (Fig 4A), the networks contain a few small communities coexisting with much larger ones. The relative difference of the outbreak size (ΔR) increases as the fraction of initially infected nodes increases reaching a maximum of 11% for Degree centrality and 12% for Katz centrality for the community-aware ranking scheme. Then, ΔR barely varies when the fraction of initially infected nodes f_o ranges from 0.10 and 0.25. After that, it starts to gradually decrease, reaching 3.5% and 4% for both centralities, respectively, when f_o = 0.50.

When θ equals 3 (Fig 4B), there are many small communities of comparable sizes and a few large ones. The performance of the community-aware ranking scheme for Degree centrality increases, reaching a maximum of 24% gain in terms of ΔR. Then it gradually decreases until it reaches 5.1% gain when the fraction of initially infected nodes is 0.50. Katz centrality exhibits similar behavior. The relative outbreak size increases as the fraction of initially infected nodes increases, reaching a maximum gain of 24%. It decreases until it reaches a gain of 8% when the fraction of initially infected nodes equals 0.50.

Fig 3 reports the relative outbreak size (ΔR) for all the centrality measures in synthetic networks with a community size distribution exponent (θ) spanning from 2 (Panel A) to 3 (Panel C). The fraction of initially infected nodes (f_o) is fixed at 15%. When networks have a large difference in the sizes of the communities leading to fewer communities (θ = 2), the gain in ΔR of the community-aware ranking scheme ranges from 11% as a maximum at μ = 0.05. It decreases, reaching 0% when μ = 0.70. On the contrary, when the networks have many small communities with fewer larger ones leading to many communities, ΔR for Degree, MNC, Closeness, and Katz reach a gain of 23% and a gain of 13% and 12.7% for Betweenness and PageRank, respectively. As the community structure gets weaker, ΔR decreases to a minimum of 1% for PageRank centrality and a maximum of 2.3% for Closeness centrality.

Results indicate that when the network contains a few large communities, the community-aware ranking scheme is not as effective as in networks with many communities of smaller sizes. It is reasonable since when huge communities coexist with a few small communities, the large communities will make up most of the network. When one picks the top influential nodes from each community in the first iteration, the nodes picked in the second iteration inside the large communities are likely to be next to each other. Indeed, when there are substantial communities, there are few communities overall. Thus, infecting the most influential nodes in the same neighborhood is not as effective as covering many communities of comparable sizes with the community-aware ranking scheme.

Influence of the degree distribution

In this experiment, we investigate the effect of the degree distribution on the performance of the community-aware ranking scheme. The degree distribution exponent (γ) is tunable in the LFR model. Studies have shown that real-world networks are scale-free, with a degree distribution exponent in the range of 2 and 3 [35, 36]. Consequently, we test these two values. Note that we also set γ = 2.7, but there are no significant differences compared to γ = 3. When γ equals 2, the network’s structure resembles a hub-and-spoke network [37]. When γ equals 3, the network’s structure is more similar to a random network where more nodes would have comparable frequency of neighbors. Since the LFR model generates also networks with a community structure, the nodes inside the communities have comparable sizes while assuring the community structure is maintained [32].

Fig 5 shows the relative difference in the outbreak size (ΔR) as a function of the fraction of initially infected nodes for the Degree and Katz centrality. The degree distribution exponent γ equals 2 (panel A) and 3 (panel B). We fix all the other parameters, including the mixing parameter (μ = 0.05) and the community size distribution exponent (θ = 2.7). For the sake of brevity, since the remaining results of the centrality measures exhibit similar behavior, they are given in S5 and S6 Figs with γ = 2 and γ = 3, respectively.

Download:

Fig 5. Impact of the degree distribution exponent (γ) in synthetic networks.

The figures represent the relative difference of the outbreak size (ΔR) as a function of the fraction of initially infected nodes. The red curve indicates the relative performance difference of the community-aware ranks of the Degree and Katz centrality measures compared to the descending order ranks. The degree distribution exponent (γ) varies while the other parameters, including the mixing parameter (μ = 0.05) and the community size distribution exponent (θ = 2.7), are fixed.

https://doi.org/10.1371/journal.pone.0273610.g005

When γ = 2 (Fig 5A), generating networks with a hub-and-spoke structure, the relative difference of the outbreak size (ΔR) of both Degree centrality and Katz centrality under the community-aware ranking scheme escalates quickly from fraction of initially infected nodes (f_o) amounting to 0.01 till 0.10, reaching a maximum of 24%. ΔR stays in this range between 20% and 24% from f_o = 0.11 till f_o = 0.27 for Degree and f_o = 0.30 for Katz. After which ΔR starts to decrease reaching 6.5% for Degree and 8.5% for Katz at f_o = 0.50.

When γ = 3, the degree distribution of the communities of the generated networks is more random than average. One can observe that both Degree centrality and Katz centrality perform similarly according to the relative difference of the outbreak size (ΔR). Compared to the descending order ranking scheme, the gain of the community-aware ranking scheme reaches 19% and barely changes till f_o = 0.25. Then it starts to gradually decrease reaching ΔR = 5% at f_o = 0.50.

Fig 6 reports the differences in the relative outbreak size (ΔR) for all the centrality measures in networks with a degree distribution exponent (γ) spanning from 2 (Panel A) to 3 (Panel C). The fraction of initially infected nodes (f_o) equals 15% When networks are similar to a hub-and-spoke structure (Fig 6A) with a strong community structure (μ = 0.05), centrality measures under the community-aware ranking scheme always show a higher relative outbreak size difference (ΔR). However, one can consider two categories. The first, including Degree, MNC, Closeness, and Katz, exhibit gains ranging from 21% and 23.5%. The second involving Betweenness and PageRank obtains a gain of around 13%. As we shift to a more random-like structure (γ = 3) in Fig 6C, the categorization of the centrality measures observed at γ = 2 remains the same. However, for the first category, the gain decreases between 17% and 18%. The second category exhibits a gain of around 11%. As the community structure weakens, the difference in the outbreak size becomes less pronounced. Nevertheless, the community-aware ranking scheme always performs better than the descending order ranking scheme.

Download:

Fig 6. The relative difference of the outbreak size (ΔR) as a function of the mixing parameter (μ) when fraction of initially infected nodes (f_o) equals 0.15.

The color of the curve represents the centrality measures under study. (A) Synthetic networks with degree distribution γ = 2 and community size distribution θ = 2.7. (B) Synthetic networks with degree distribution γ = 2.7 and community size distribution θ = 2.7. (C) Synthetic networks with degree distribution γ = 3 and community size distribution θ = 2.7.

https://doi.org/10.1371/journal.pone.0273610.g006

Even though the differences are not as pronounced compared to the variation in the community size distribution, results show that when the communities of the generated networks are more random-like, the performance of the community-aware ranking scheme slightly decreases. Since more nodes have a comparable number of connections internally in a random-like structure, they may have similar centrality values. In turn, the community-aware ranking scheme may be prone to selecting nodes of the same influence inside their communities, saturating the diffusion spread. On the contrary, in a network with well-separated communities such as the hub-and-spoke structure, a community-aware ranking scheme can distinctively pick influential nodes in their communities that are naturally not close to each other due to the hub-and-spoke structure. It results in a higher diffusion to more isolated areas that the descending order ranking scheme cannot reach.

Real-world networks

We also investigate the community-aware ranking scheme on 33 real-world networks covering a wide range of domains (i.e., infrastructural, biological, social, collaboration, and ecological). Since their community structure is unknown, we uncover it using two community detection algorithms: Infomap [38] and Louvain [39]. It allows us to check the impact of the community structure variations on the consistency of the community-aware ranking scheme. Table 4 in S1 Text reports the values of the topological characteristics of the real-world networks.

Spreading power of the proposed method

Since the community structure strength is a significant feature influencing the performance of the proposed community-aware ranking strategy, we classify the networks into three categories. The categories cover networks with strong (μ ≤ 0.20), medium (0.20 < μ < 0.40), and weak (μ ≥ 0.40) community structures. We consider communities uncovered by Infomap as our reference case. For the sake of brevity, we report one network of each category for all the centrality measures under study in Fig 7. The remaining networks are provided in S7–S13 Figs.

Download:

Fig 7. Impact of the community structure strength (μ) in real-world networks.

The figures represent the relative difference of the outbreak size (ΔR) as a function of the fraction of initially infected nodes. The red curve indicates the relative performance difference of the community-aware ranking strategy with the descending order ranking for the six centrality measures under test. A strong, medium, and weak mixing parameter (μ) is derived based on the communities in real-world networks (U.S. Airports, Facebook Organizations, and Adolescent Health) identified by the Infomap community detection algorithm.

https://doi.org/10.1371/journal.pone.0273610.g007

The community-aware ranking scheme outperforms the classical descending order ranking scheme in networks with a strong community structure (μ≤ 0.20). The distinction lies in the gain in the relative difference of the outbreak size (ΔR). As depicted by U.S. Airports network (with μ = 0.08) in Fig 7, one can note the outperformance of Closeness centrality, with a difference in the outbreak size (ΔR) reaching a maximum of 21% when the fraction of initially infected nodes (f_o) amounts to 0.41, followed by Degree centrality with ΔR = 20% at f_o = 0.41 and MNC and Katz with ΔR = 20% at f_o = 0.40. Then comes PageRank with ΔR = 15% at f_o = 0.41 followed by Betweenness with ΔR = 14% at f_o = 0.41. In general, in all the networks, Closeness, Degree, MNC, and Katz show higher ΔR compared to Betweenness and PageRank. One can also note three typical behaviors for the performance of the community-aware ranking scheme in networks with a strong community structure. These behaviors are common to all the centrality measures within a given network. For brevity, we report the results of Degree centrality only. The first typical behavior is that ΔR increases as f_o increases. The remaining figures are provided in the supplementary material. It is illustrated by the Princeton network in Fig 8A on the left. Ego Facebook, Facebook Friends, and Facebook Politician Pages share similar behavior. The second typical behavior is that ΔR increases until it reaches a plateau or barely deviates as f_o increases. It is shown in the middle of Fig 8A for the Yeast Collins network. London Transport, Malaria Genes, NetSci, Board of Directors, and DNC Emails show similar behavior. Finally, in the third case, ΔR increases until it reach a specific value of f_o, then decreases gradually. It is demonstrated by the EU Airlines network in Fig 8A on the right. U.S. Airports, Madrid Train Bombings, Reptiles, 911 All Words, Marvel Partnerships, U.S. Power Grid, PGP, EuroRoad, and Internet Topology Cogentco share similar behavior.

Download:

Fig 8. Trends in the performance of the community-aware ranking scheme in real-world networks.

The figures represent the relative difference of the outbreak size (ΔR) as a function of the fraction of initially infected nodes. The red curve indicates the relative performance difference of the community-aware ranks of the Degree centrality compared to the descending order ranks. Communities are identified using Infomap. (A) Networks with a strong community structure strength. (B) Networks with a medium community structure strength. (C) Networks with a weak community structure strength.

https://doi.org/10.1371/journal.pone.0273610.g008

The community-aware ranking scheme still outperforms the classical descending order ranking scheme in networks with a medium community structure (0.20 <μ< 0.40). However, the gain is less pronounced compared to networks with a strong community structure. It is depicted by the Facebook Organizations network in Fig 7. One can see that Closeness and Katz centrality measures achieve the highest gain in ΔR amounting to 6% and 7% from a fraction of initially infected nodes amounting to 0.15 till 0.45. For Degree and MNC, the maximum gain is 4% and 4.5%, respectively. Then, Betweenness and PageRank show a gain of ΔR = 2.5% and ΔR = 3.5, respectively. One can also note that within this category, we observe three behaviors for the performance of the community-aware ranking scheme. These behaviors are similar to those in networks with a strong community structure but at a smaller magnitude. We have an increasing ΔR as f_o increases, depicted by the Human Protein network on the left of Fig 8B. Hamsterster and Blumenau Drug share similar behavior. In the second category, we have an increasing ΔR until it reaches a plateau or barely deviates as f_o increases. It is depicted by the Interactome Vidal network in the middle of Fig 8B. Facebook Organizations and Caltech share similar behavior. Finally, the third category shows a slight and gradual decrease directly from the start as f_o increases. It is illustrated by the Yeast Protein network in Fig 8B on the right. Retweets Copenhagen shows similar behavior.

The community-aware ranking scheme outperforms the classical descending order ranking scheme in networks with a weak community structure (μ ≥ 0.40). However, in some networks, the gain in ΔR can even be higher than in networks with a strong or medium community structure. Indeed, the maximum improvement in ΔR can reach up to 15% in the AstroPh network (see Fig 8C) with Degree, MNC, Closeness, and Katz centrality measures and up to 11% and 10% for PageRank and Betweenness centrality measures respectively. At the same time, it can be as low as 5% in Adolescent Health given in Fig 7 for Degree, MNC, Closeness, and Katz and as low as 4% for Betweenness and 3% for PageRank. That being said, in networks with a weak community structure there is one trend despite the difference in magnitude. Indeed, as seen from Fig 8C, all networks (AstroPh, DeezerEU, and Bible Nouns) have an increasing ΔR as f_o increases. The only difference is in the magnitude of ΔR from one network to another.

In summary, the community-aware ranking scheme outperforms the descending order ranking scheme in all real-world networks under study. The gain of the proposed ranking scheme is affected by the community structure strength, as observed in artificial networks with controlled community structure strength. The stronger the community structure, the higher the performance of the community-aware ranking scheme. Nevertheless, it is worth noting that the community-aware ranking also shows high performance in some real-world networks with a weak community structure.

The community-aware ranking scheme has a high performance in networks with strong community structure strength because it does not select nodes in one dense region when there are many well-separated dense areas. We visualize two networks with a strong community structure strength but with different topological structures, namely Yeast Collins and EU Airlines in Fig 9A and 9B, respectively. In these two networks, we pick and increase the size of the top 15% of nodes selected by the descending order ranking scheme and the community-aware ranking scheme. For brevity, we only show Degree centrality and Closeness centrality. Concerning the Yeast Collins network, for both Degree and Closeness centrality, one can directly point out how the descending order scheme selects most of the top nodes in large network communities mainly located at the bottom of the network. On the contrary, the community-aware ranking scheme selects nodes in every community, spreading across all the network regions. A similar interpretation goes for the EU Airlines network, another network with a strong community structure. Indeed, the descending order ranking scheme of Degree and Closeness centrality measures targets only the dark pink and green communities. In contrast, the community-aware ranking scheme does not miss a single community. It is the reason why the community-aware ranking strategy allows a higher diffusion.

Download:

Fig 9. Top nodes selected based on the Degree and Closeness centrality measures according to the descending order and community-aware ranking schemes.

Communities of the Yeast Collins (A), EU Airlines (B), and AstroPh (C) networks are identified by Infomap. The top selected nodes (depicted in bigger sizes) amount to 15% of the network’s size.

https://doi.org/10.1371/journal.pone.0273610.g009

We also investigate the case of networks with a weak community structure where the proposed community-aware ranking scheme performs well. Fig 9C visualizes the AstroPh network, a network with a weak community structure and high performance of the community-aware ranking scheme. Despite having loosely defined communities with a vast number of inter-community connections, the community-aware ranking scheme targets nodes at the core and in the periphery of the AstroPh network, either with Degree or Closeness centrality measures. While with the descending order ranking scheme, using Degree or Closeness centrality measures, nodes picked are mainly in the network’s core. Consequently, the community-aware ranking scheme can ignite a higher diffusion as it reaches regions that the descending order ranking scheme never targets.

It is worth mentioning that networks characterized by a weak community structure may exhibit different topologies [40, 41]. If it is core-periphery-like, such as in the AstroPh network, the community-aware ranking scheme covers all the regions in the network. If the network is very dense with no particular local structure, the community-aware ranking scheme might select nodes in the vicinity of each other. We suggest to use a measure that combines local and global influence of the nodes for better targeting influential nodes [42, 43]. Note that the distance between the nodes should also be addressed. Alternatively, one can also incorporate a minimum distance constraint between nodes in a community so that targeted nodes are scattered. There is room for improvement in networks with a weak community structure.

Influence of the community detection algorithm

In this experiment, we use the Louvain [39] community detection algorithm to extract the communities in real-world networks. Then, we perform the same comparative evaluation process using SIR simulations between the classical and the proposed ranking strategies based on the communities identified by Louvain instead of Infomap [38]. The aim is to investigate the impact of the variations in the community structure induced by the community detection algorithms on the performance of the ranking schemes.

Fig 10 illustrates the relative difference in the outbreak size (ΔR) of the community-aware ranking scheme for Degree and Closeness centrality measures. We comment on the results for three typical networks (Facebook Friends, Human Protein, and Bible Nouns). Facebook Friends and Human Protein belong respectively to the strong and medium community structure categories using Infomap or Louvain. Bible Nouns network is in the medium community structure category based on Louvain and the weak community structure using Infomap. We provide complementary results for all the networks and all the centrality measures in S14–S20 Figs.

Download:

Fig 10. Impact of the community detection algorithm in real-world networks with strong, medium, and weak community structure strengths.

The figures represent the relative difference of the outbreak size (ΔR) as a function of the fraction of initially infected nodes. The red curve indicates the relative performance difference of the community-aware ranks of the Degree and Closeness centrality measures compared to the descending order ranks. (A) Communities identified using Infomap. (B) Communities identified using Louvain.

https://doi.org/10.1371/journal.pone.0273610.g010

First, these results demonstrate that the community-aware ranking scheme is robust to the community structure variation induced by the community detection algorithm. Indeed, ΔR is positive whether Infomap or Louvain uncovers communities. This result is independent of the community structure strength. For example, Facebook Friends is a network with a strong community structure. For a fraction of initially infected nodes equal to 0.25, the gain of ΔR equals 15% for Degree centrality and 11% for Closeness centrality. It compares to an 8% increase for Degree centrality and 7% for Closeness centrality using Louvain with the same fraction of initially infected nodes. Consider the Human Protein network with a medium community structure strength. With a fraction of initially infected nodes equal to 0.25, the ΔR gain is 6% for Degree and Closeness centrality measures. In the same situation, using Louvain, the growth is lower. Indeed, ΔR equals 2% for Degree centrality and 3% for Closeness centrality. Finally, the community-aware ranking scheme still outperforms the classical descending order ranking scheme in the Bible Nouns network. For a fraction of initially infected nodes of 0.25, ΔR equals 6% for Degree centrality and 5.5% for Closeness centrality. Using Louvain with the same fraction of initially infected nodes reduces the gain in ΔR to 2.5% for Degree centrality and 2% for Closeness centrality.

Second, one can note that with Infomap, the gain in ΔR is relatively higher than Louvain. For instance, in Facebook Friends with communities identified using Infomap, the maximum ΔR reaches 17.5% at a fraction of initially infected nodes of 0.44 using Degree centrality and 13% using Closeness at a fraction of initially infected nodes of 0.31. In contrast, with the Louvain community detection algorithm, the maximum ΔR reaches 12.5% at a fraction of initially infected nodes of 0.39 using Degree centrality and 10% using Closeness at a fraction of initially infected nodes of 0.08. Thus, the difference in gain of ΔR is +5% for Degree centrality and +3% for Closeness centrality using Infomap. One observes similar results for Human Protein and Bible Nouns. The maximum gain in ΔR in Human Protein amounts to 6% using both Degree and Closeness centrality measures. Meanwhile, Louvain’s maximum gain in ΔR amounts to 3% for the two centrality measures. In Bible Nouns, with Infomap, the maximum improvement in ΔR amounts to 8.5% using both Degree and Closeness centrality measures. In opposition, using Louvain, the maximum gain in ΔR is 4.8% for Degree centrality and 3.8% for Closeness centrality.

To investigate why the performance of the community-aware ranking scheme decreases with Louvain compared to Infomap, we examine the community size distributions of the networks. Fig 10 gives the community size distributions associated with Infomap and Louvain of the networks provided in Fig 11. Indeed, they represent typical cases. The other distributions are given in S21–S23 Figs. Simultaneously, we compute the number of communities uncovered by each community detection algorithm and the minimum and maximum size of the communities for all the networks. Tables 5 and 6 in S1 Text report the results using Infomap and Louvain, respectively.

Download:

Fig 11. Histograms of the community size distribution.

Communities are identified in Facebook Friends, Human Protein, and Bible Nouns by the Infomap (A) and Louvain (B) community detection algorithms.

https://doi.org/10.1371/journal.pone.0273610.g011

The histograms of the community size distributions in Fig 11 show that they are generally more skewed to the right using Infomap. A high number of small communities coexists with very few large communities. In contrast, this distribution is more uniform for the Louvain community structure. We observe many communities with medium and large sizes. Moreover, Infomap uncovers a higher number of communities. For example, it reveals 21 communities in Facebook Friends, 99 in Human Protein, and 88 in Bible Nouns. It compares with 10 communities in Facebook Friends, 14 in Human Protein, and 17 in Bible Nouns identified by Louvain.

The question is, how do these two outcomes affect the performance of the proposed community-aware ranking scheme compared to the classical descending order ranking scheme. On the one hand, Louvain generally uncovers fewer communities, with many medium-sized and large communities making up most of the network. On the other hand, Infomap discovers many more communities with a high number of small communities coexisting with fewer larger ones. The proposed community-aware ranking strategy is more effective with Infomap. Naturally, with more communities, it selects at least one node in every community. The higher the number of communities, the higher its ability to choose distinct nodes. It is not necessarily true with fewer communities. Indeed, first, one selects a node in each community, then at the next iteration, nodes are targeted in the same communities. Thus, nodes inside the same communities are probably closer to each other than when there are many communities. Thus, the diffusion power weakens. Indeed, it does not reach distant regions in the network, and the diffusion stays confined in large communities. Note that these results complement those reported using the generated LFR networks with controlled community size distribution exponent regarding the number of communities. Indeed, when fewer communities exist, the community-aware ranking scheme is more susceptible to picking the next top node close to the former top-picked node inside these communities. Consequently, infecting the most influential nodes in fewer communities is not as efficient as having many communities spanning the whole network.

Conclusion

This paper presents a community-aware ranking scheme that one can use with any centrality measure. The proposed method is simple yet effective at selecting nodes according to their relative influence in a modular network. Unlike the popular descending order ranking scheme, which ranks the most influential nodes from high to low centrality values, it ranks nodes in a sequential order linked to the community size. Consequently, it selects nodes across all regions of the networks. In contrast with the descending order ranking scheme that can select nodes in a few communities that ignore large parts of the network, this strategy targets nodes more uniformly distributed. As a result, the proposed strategy warrants that the diffusion process does not die out locally and reaches distant regions of the network.

Extensive experiments have been conducted on synthetic and real-world networks using the SIR epidemic spreading model. To better understand the interplay between the community structure and the performance of the proposed strategy, we performed a series of experiments in synthetic networks controlling the community structure strength, the exponents of the community size, and degree distributions. Results show that the community-aware ranking scheme is more effective in networks with strong community structure strength. As it weakens, the performance decreases gradually. Nevertheless, the community-aware ranking scheme always performs better than the descending order ranking scheme, even in networks with a weak community structure. The community size distribution also affects the performance of the community-aware ranking scheme. Results show the strategy performs better in networks of many small communities instead of a few large communities. Indeed, the higher the number of communities, the more likely the targeted nodes are scattered across the network regions, igniting a higher epidemic outbreak. The influence of the degree distribution exponent is less pronounced. However, one can notice that the community-aware ranking scheme performs better in hub-and-spoke-like networks than in random-like ones.

The community-aware ranking scheme also outperforms the classical ranking strategy in a set of real-world networks of various domains. The findings are consistent with the synthetic networks’ experiments. Indeed, the community-aware ranking scheme performs better in networks with a strong community structure strength. The gain gradually decreases with the community structure strength. Note that in some networks with a weak community structure strength, the community-aware ranking scheme still creates a higher outbreak than its alternative. Indeed, the community-aware ranking scheme ranks the top nodes inside each community. Even in networks with a weak community structure, it can rank nodes in faraway regions, causing a higher outbreak. We also investigate the influence of the community detection algorithm on the performance of the community-aware ranking scheme. The comparisons involve Infomap and Louvain. Since the community structure uncovered by Louvain results in fewer communities and subsequently larger ones compared to Infomap, the community-aware ranking scheme performs better with the Infomap community structure. This result is coherent with synthetic networks’ community size distribution variation. Whatever the community detection algorithm, the community-aware ranking scheme consistently outperforms the descending order ranking strategy.

The main lesson from this study is to highlight the necessity of incorporating the community structure information in centrality measurements to better rank influential nodes. This work departs from previous community-aware solutions that combine a node’s local influence and global influence. Here, we show that whatever the notion of influence, the ranking strategy is a critical factor in the diffusion process. Whatever the centrality measure, the proposed ranking scheme is decisive in targeting the most influential nodes scattered across the network. This strategy overcomes the drawback frequent in centrality measures using the popular descending order ranking scheme in which the most influential nodes happen to be in the vicinity of each other. The proposed ranking scheme is adequate for igniting higher diffusion for marketing and awareness campaigns or combating diseases and unwanted viruses since it pinpoints influential nodes while assuring that all regions in the network are covered.

In future work, we plan to develop a more sophisticated community-aware ranking scheme to overcome the decrease in performance when there are few and large communities. Another research direction consists of adapting the strategy to networks with an overlapping community structure. Furthermore, one can consider community-aware ranking schemes for multilayer and temporal networks.

Supporting information

S1 Fig. Impact of the community structure strength (μ) in synthetic networks.

The figures represent the relative difference of the outbreak size (ΔR) as a function of the fraction of initially infected nodes. The red curve indicates the relative performance difference of the community-aware ranking strategy with the descending order ranking for the six centrality measures under test. The mixing parameter (μ) is varied while the other parameters, including the community size distribution exponent (θ = 2.7) and the degree distribution exponent (γ = 2.7), are fixed.

https://doi.org/10.1371/journal.pone.0273610.s001

(PNG)

S2 Fig. The histograms of the community size distribution for the synthetic networks.

The synthetic networks are generated with θ = 2 and θ = 3 from strong to weak community structure strengths while keeping other parameters fixed including the degree distribution exponent (γ = 2.7).

https://doi.org/10.1371/journal.pone.0273610.s002

(PNG)

S3 Fig. Impact of the community size distribution exponent (θ) in synthetic networks.

The figures represent the relative difference of the outbreak size (ΔR) as a function of the fraction of initially infected nodes. The red curve indicates the relative performance difference of the community-aware ranking strategy with the descending order ranking for the six centrality measures under test. The mixing parameter (μ) is varied while the other parameters, including the community size distribution exponent (θ = 2) and the degree distribution exponent (γ = 2.7), are fixed.

https://doi.org/10.1371/journal.pone.0273610.s003

(PNG)

S4 Fig. Impact of the community size distribution exponent (θ) in synthetic networks.

The figures represent the relative difference of the outbreak size (ΔR) as a function of the fraction of initially infected nodes. The red curve indicates the relative performance difference of the community-aware ranking strategy with the descending order ranking for the six centrality measures under test. The mixing parameter (μ) is varied while the other parameters, including the community size distribution exponent (θ = 3) and the degree distribution exponent (γ = 2.7), are fixed.

https://doi.org/10.1371/journal.pone.0273610.s004

(PNG)

S5 Fig. Impact of the degree distribution exponent (γ) in synthetic networks.

The figures represent the relative difference of the outbreak size (ΔR) as a function of the fraction of initially infected nodes. The red curve indicates the relative performance difference of the community-aware ranking strategy with the descending order ranking for the six centrality measures under test. The mixing parameter (μ) is varied while the other parameters, including the community size distribution exponent (θ = 2.7) and the degree distribution exponent (γ = 2), are fixed.

https://doi.org/10.1371/journal.pone.0273610.s005

(PNG)

S6 Fig. Impact of the degree distribution exponent (γ) in synthetic networks.

The figures represent the relative difference of the outbreak size (ΔR) as a function of the fraction of initially infected nodes. The red curve indicates the relative performance difference of the community-aware ranking strategy with the descending order ranking for the six centrality measures under test. The mixing parameter (μ) is varied while the other parameters, including the community size distribution exponent (θ = 2.7) and the degree distribution exponent (γ = 3), are fixed.

https://doi.org/10.1371/journal.pone.0273610.s006

(PNG)

S7 Fig. Impact of the community structure strength (μ) in real-world networks.

The figures represent the relative difference of the outbreak size (ΔR) as a function of the fraction of initially infected nodes. The red curve indicates the relative performance difference of the community-aware ranking strategy with the descending order ranking for the six centrality measures under test. The community structure is identified by the Infomap community detection algorithm.

https://doi.org/10.1371/journal.pone.0273610.s007

(PNG)

S8 Fig. Impact of the community structure strength (μ) in real-world networks.

The figures represent the relative difference of the outbreak size (ΔR) as a function of the fraction of initially infected nodes. The red curve indicates the relative performance difference of the community-aware ranking strategy with the descending order ranking for the six centrality measures under test. The community structure is identified by the Infomap community detection algorithm.

https://doi.org/10.1371/journal.pone.0273610.s008

(PNG)

S9 Fig. Impact of the community structure strength (μ) in real-world networks.

The figures represent the relative difference of the outbreak size (ΔR) as a function of the fraction of initially infected nodes. The red curve indicates the relative performance difference of the community-aware ranking strategy with the descending order ranking for the six centrality measures under test. The community structure is identified by the Infomap community detection algorithm.

https://doi.org/10.1371/journal.pone.0273610.s009

(PNG)

S10 Fig. Impact of the community structure strength (μ) in real-world networks.

The figures represent the relative difference of the outbreak size (ΔR) as a function of the fraction of initially infected nodes. The red curve indicates the relative performance difference of the community-aware ranking strategy with the descending order ranking for the six centrality measures under test. The community structure is identified by the Infomap community detection algorithm.

https://doi.org/10.1371/journal.pone.0273610.s010

(PNG)

S11 Fig. Impact of the community structure strength (μ) in real-world networks.

The figures represent the relative difference of the outbreak size (ΔR) as a function of the fraction of initially infected nodes. The red curve indicates the relative performance difference of the community-aware ranking strategy with the descending order ranking for the six centrality measures under test. The community structure is identified by the Infomap community detection algorithm.

https://doi.org/10.1371/journal.pone.0273610.s011

(PNG)

S12 Fig. Impact of the community structure strength (μ) in real-world networks.

The figures represent the relative difference of the outbreak size (ΔR) as a function of the fraction of initially infected nodes. The red curve indicates the relative performance difference of the community-aware ranking strategy with the descending order ranking for the six centrality measures under test. The community structure is identified by the Infomap community detection algorithm.

https://doi.org/10.1371/journal.pone.0273610.s012

(PNG)

S13 Fig. Impact of the community structure strength (μ) in real-world networks.

The figures represent the relative difference of the outbreak size (ΔR) as a function of the fraction of initially infected nodes. The red curve indicates the relative performance difference of the community-aware ranking strategy with the descending order ranking for the six centrality measures under test. The community structure is identified by the Infomap community detection algorithm.

https://doi.org/10.1371/journal.pone.0273610.s013

(PNG)

S14 Fig. The performance of the community-aware ranking scheme using the Louvain community detection algorithm.

The figures represent the relative difference of the outbreak size (ΔR) as a function of the fraction of initially infected nodes. The red curve indicates the relative performance difference of the community-aware ranking strategy with the descending order ranking for the six centrality measures under test.

https://doi.org/10.1371/journal.pone.0273610.s014

(PNG)

S15 Fig. The performance of the community-aware ranking scheme using the Louvain community detection algorithm.

The figures represent the relative difference of the outbreak size (ΔR) as a function of the fraction of initially infected nodes. The red curve indicates the relative performance difference of the community-aware ranking strategy with the descending order ranking for the six centrality measures under test.

https://doi.org/10.1371/journal.pone.0273610.s015

(PNG)

S16 Fig. The performance of the community-aware ranking scheme using the Louvain community detection algorithm.

The figures represent the relative difference of the outbreak size (ΔR) as a function of the fraction of initially infected nodes. The red curve indicates the relative performance difference of the community-aware ranking strategy with the descending order ranking for the six centrality measures under test.

https://doi.org/10.1371/journal.pone.0273610.s016

(PNG)

S17 Fig. The performance of the community-aware ranking scheme using the Louvain community detection algorithm.

The figures represent the relative difference of the outbreak size (ΔR) as a function of the fraction of initially infected nodes. The red curve indicates the relative performance difference of the community-aware ranking strategy with the descending order ranking for the six centrality measures under test.

https://doi.org/10.1371/journal.pone.0273610.s017

(PNG)

S18 Fig. The performance of the community-aware ranking scheme using the Louvain community detection algorithm.

The figures represent the relative difference of the outbreak size (ΔR) as a function of the fraction of initially infected nodes. The red curve indicates the relative performance difference of the community-aware ranking strategy with the descending order ranking for the six centrality measures under test.

https://doi.org/10.1371/journal.pone.0273610.s018

(PNG)

S19 Fig. The performance of the community-aware ranking scheme using the Louvain community detection algorithm.

The figures represent the relative difference of the outbreak size (ΔR) as a function of the fraction of initially infected nodes. The red curve indicates the relative performance difference of the community-aware ranking strategy with the descending order ranking for the six centrality measures under test.

https://doi.org/10.1371/journal.pone.0273610.s019

(PNG)

S20 Fig. The performance of the community-aware ranking scheme using the Louvain community detection algorithm.

The figures represent the relative difference of the outbreak size (ΔR) as a function of the fraction of initially infected nodes. The red curve indicates the relative performance difference of the community-aware ranking strategy with the descending order ranking for the six centrality measures under test.

https://doi.org/10.1371/journal.pone.0273610.s020

(PNG)

S21 Fig. The histograms of the community size distribution for the real-world networks.

Communities are identified by the Infomap (A) and Louvain (B) community detection algorithms.

https://doi.org/10.1371/journal.pone.0273610.s021

(PNG)

S22 Fig. The histograms of the community size distribution for the real-world networks.

Communities are identified by the Infomap (A) and Louvain (B) community detection algorithms.

https://doi.org/10.1371/journal.pone.0273610.s022

(PNG)

S23 Fig. The histograms of the community size distribution for the real-world networks.

Communities are identified by the Infomap (A) and Louvain (B) community detection algorithms.

https://doi.org/10.1371/journal.pone.0273610.s023

(PNG)

S1 Text. Supplementary material.

https://doi.org/10.1371/journal.pone.0273610.s024

(DOCX)

References

1. Stephenson K, Zelen M. Rethinking centrality: Methods and examples. Social networks. 1989;11(1):1–37.
- View Article
- Google Scholar
2. Lü L, Chen D, Ren XL, Zhang QM, Zhang YC, Zhou T. Vital nodes identification in complex networks. Physics Reports. 2016;650:1–63.
- View Article
- Google Scholar
3. De Arruda GF, Barbieri AL, Rodriguez PM, Rodrigues FA, Moreno Y, da Fontoura Costa L. Role of centrality for the identification of influential spreaders in complex networks. Physical Review E. 2014;90(3):032812. pmid:25314487
- View Article
- PubMed/NCBI
- Google Scholar
4. Csermely P, Korcsmáros T, Kiss HJ, London G, Nussinov R. Structure and dynamics of molecular networks: a novel paradigm of drug discovery: a comprehensive review. Pharmacology & therapeutics. 2013;138(3):333–408. pmid:23384594
- View Article
- PubMed/NCBI
- Google Scholar
5. Wang Z, Moreno Y, Boccaletti S, Perc M. Vaccination and epidemics in networked populations—an introduction; 2017.
6. Klein A, Ahlf H, Sharma V. Social activity and structural centrality in online social networks. Telematics and Informatics. 2015;32(2):321–332.
- View Article
- Google Scholar
7. Wandelt S, Sun X. Robustness estimation of infrastructure networks: On the usage of degree centrality. In: Proceedings of the 13th International Conference on Availability, Reliability and Security; 2018. p. 1–7.
8. Jalili M, Salehzadeh-Yazdi A, Gupta S, Wolkenhauer O, Yaghmaie M, Resendis-Antonio O, et al. Evolution of centrality measurements for the detection of essential proteins in biological networks. Frontiers in Physiology. 2016;7:375. pmid:27616995
- View Article
- PubMed/NCBI
- Google Scholar
9. Wang K, Quan W, Cheng N, Liu M, Liu Y, Chan HA. Betweenness centrality based software defined routing: Observation from practical Internet datasets. ACM Transactions on Internet Technology (TOIT). 2019;19(4):1–19.
- View Article
- Google Scholar
10. Das K, Samanta S, Pal M. Study on centrality measures in social networks: a survey. Social network analysis and mining. 2018;8(1):1–11.
- View Article
- Google Scholar
11. Ibnoulouafi A, El Haziti M, Cherifi H. M-centrality: identifying key nodes based on global position and local degree variation. J Stat Mech Theory Exp. 2018;2018(7):073407.
- View Article
- Google Scholar
12. Berahmand K, Bouyer A, Samadi N. A new local and multidimensional ranking measure to detect spreaders in social networks. Computing. 2019;101(11):1711–1733.
- View Article
- Google Scholar
13. Guimera R, Amaral LAN. Functional cartography of complex metabolic networks. nature. 2005;433(7028):895–900. pmid:15729348
- View Article
- PubMed/NCBI
- Google Scholar
14. Chakraborty D, Singh A, Cherifi H. Immunization strategies based on the overlapping nodes in networks with community structure. In: International conference on computational social networks. Springer, Cham; 2016. p. 62–73.
15. Gupta N, Singh A, Cherifi H. Centrality measures for networks with community structure. Physica A: Statistical Mechanics and its Applications. 2016;452:46–59.
- View Article
- Google Scholar
16. Tulu MM, Hou R, Younas T. Identifying influential nodes based on community structure to speed up the dissemination of information in complex network. IEEE Access. 2018;6:7390–7401.
- View Article
- Google Scholar
17. Kumar M, Singh A, Cherifi H. An efficient immunization strategy using overlapping nodes and its neighborhoods. In: Companion Proceedings of the The Web Conference 2018; 2018. p. 1269–1275.
18. Ghalmane Z, El Hassouni M, Cherifi C, Cherifi H. Centrality in modular networks. EPJ Data Science. 2019;8(1):15.
- View Article
- Google Scholar
19. Magelinski T, Bartulovic M, Carley KM. Measuring node contribution to community structure with modularity vitality. IEEE Transactions on Network Science and Engineering. 2021;8(1):707–723.
- View Article
- Google Scholar
20. Blöcker C, Nieves JC, Rosvall M. Map Equation Centrality: A Community-Aware Centrality Score Based on the Map Equation. arXiv preprint arXiv:220112590. 2022.
21. Rajeh S, Savonnet M, Leclercq E, Cherifi H. Comparing Community-Aware Centrality Measures in Online Social Networks. In: International Conference on Computational Data and Social Networks. Springer; 2021. p. 279–290.
22. Rajeh S, Savonnet M, Leclercq E, Cherifi H. Comparative evaluation of community-aware centrality measures. Quality & Quantity. 2022; p. 1–30.
- View Article
- Google Scholar
23. Holme P, Saramäki J. Temporal networks. Physics reports. 2012;519(3):97–125.
- View Article
- Google Scholar
24. Latapy M, Viard T, Magnien C. Stream graphs and link streams for the modeling of interactions over time. Social Network Analysis and Mining. 2018;8(1):1–29.
- View Article
- Google Scholar
25. Sun X, Gollnick V, Wandelt S. Robustness analysis metrics for worldwide airport network: A comprehensive study. Chinese Journal of Aeronautics. 2017;30(2):500–512.
- View Article
- Google Scholar
26. Lancichinetti A, Kivelä M, Saramäki J, Fortunato S. Characterizing the community structure of complex networks. PloS one. 2010;5(8):e11976. pmid:20711338
- View Article
- PubMed/NCBI
- Google Scholar
27. Newman ME, Girvan M. Finding and evaluating community structure in networks. Physical review E. 2004;69(2):026113. pmid:14995526
- View Article
- PubMed/NCBI
- Google Scholar
28. Cherifi H, Palla G, Szymanski BK, Lu X. On community structure in complex networks: challenges and opportunities. Applied Network Science. 2019;4(1):1–35.
- View Article
- Google Scholar
29. Arenas A, Diaz-Guilera A, Pérez-Vicente CJ. Synchronization reveals topological scales in complex networks. Physical review letters. 2006;96(11):114102. pmid:16605825
- View Article
- PubMed/NCBI
- Google Scholar
30. Cheng XQ, Shen HW. Uncovering the community structure associated with the diffusion dynamics on networks. J Stat Mech Theory Exp. 2010;2010(04):P04024.
- View Article
- Google Scholar
31. Anderson RM, May RM. Population biology of infectious diseases: Part I. Nature. 1979;280(5721):361–367. pmid:460412
- View Article
- PubMed/NCBI
- Google Scholar
32. Lancichinetti A, Fortunato S, Radicchi F. Benchmark graphs for testing community detection algorithms. Physical review E. 2008;78(4):046110. pmid:18999496
- View Article
- PubMed/NCBI
- Google Scholar
33. Orman K, Labatut V, Cherifi H. In: Menezes R, Evsukoff A, González MC, editors. An Empirical Study of the Relation between Community Structure and Transitivity. Springer Berlin Heidelberg; 2013. p. 99–110.
34. Orman GK, Labatut V, Cherifi H. Towards realistic artificial benchmark for community detection algorithms evaluation. International Journal of Web Based Communities. 2013;9(3):349–370.
- View Article
- Google Scholar
35. Albert R, Barabási AL. Statistical mechanics of complex networks. Reviews of modern physics. 2002;74(1):47.
- View Article
- Google Scholar
36. Boccaletti S, Latora V, Moreno Y, Chavez M, Hwang DU. Complex networks: Structure and dynamics. Physics reports. 2006;424(4-5):175–308.
- View Article
- Google Scholar
37. Tsiotas D. Detecting differences in the topology of scale-free networks grown under time-dynamic topological fitness. Scientific reports. 2020;10(1):1–16. pmid:32606368
- View Article
- PubMed/NCBI
- Google Scholar
38. Rosvall M, Bergstrom CT. Maps of random walks on complex networks reveal community structure. PNAS. 2008;105(4):1118–1123. pmid:18216267
- View Article
- PubMed/NCBI
- Google Scholar
39. Blondel VD, Guillaume JL, Lambiotte R, Lefebvre E. Fast unfolding of communities in large networks. J Stat Mech Theory Exp. 2008;2008(10):P10008.
- View Article
- Google Scholar
40. Liu W, Pellegrini M, Wang X. Detecting communities based on network topology. Scientific reports. 2014;4(1):1–7. pmid:25033828
- View Article
- PubMed/NCBI
- Google Scholar
41. Abbe E. Community detection and stochastic block models: recent developments. The Journal of Machine Learning Research. 2017;18(1):6446–6531.
- View Article
- Google Scholar
42. Bartolucci S, Caravelli F, Caccioli F, Vivo P. Emerging locality of network influence. arXiv preprint arXiv:200906307. 2020.
43. Bucur D. Top influencers can be identified universally by combining classical centralities. Scientific reports. 2020;10(1):1–14. pmid:33239723
- View Article
- PubMed/NCBI
- Google Scholar

[ref1] 1. Stephenson K, Zelen M. Rethinking centrality: Methods and examples. Social networks. 1989;11(1):1–37.
View Article
Google Scholar

[2] View Article

[3] Google Scholar

[ref2] 2. Lü L, Chen D, Ren XL, Zhang QM, Zhang YC, Zhou T. Vital nodes identification in complex networks. Physics Reports. 2016;650:1–63.
View Article
Google Scholar

[5] View Article

[6] Google Scholar

[ref3] 3. De Arruda GF, Barbieri AL, Rodriguez PM, Rodrigues FA, Moreno Y, da Fontoura Costa L. Role of centrality for the identification of influential spreaders in complex networks. Physical Review E. 2014;90(3):032812. pmid:25314487
View Article
PubMed/NCBI
Google Scholar

[8] View Article

[9] PubMed/NCBI

[10] Google Scholar

[ref4] 4. Csermely P, Korcsmáros T, Kiss HJ, London G, Nussinov R. Structure and dynamics of molecular networks: a novel paradigm of drug discovery: a comprehensive review. Pharmacology & therapeutics. 2013;138(3):333–408. pmid:23384594
View Article
PubMed/NCBI
Google Scholar

[12] View Article

[13] PubMed/NCBI

[14] Google Scholar

[ref5] 5. Wang Z, Moreno Y, Boccaletti S, Perc M. Vaccination and epidemics in networked populations—an introduction; 2017.

[ref6] 6. Klein A, Ahlf H, Sharma V. Social activity and structural centrality in online social networks. Telematics and Informatics. 2015;32(2):321–332.
View Article
Google Scholar

[17] View Article

[18] Google Scholar

[ref7] 7. Wandelt S, Sun X. Robustness estimation of infrastructure networks: On the usage of degree centrality. In: Proceedings of the 13th International Conference on Availability, Reliability and Security; 2018. p. 1–7.

[ref8] 8. Jalili M, Salehzadeh-Yazdi A, Gupta S, Wolkenhauer O, Yaghmaie M, Resendis-Antonio O, et al. Evolution of centrality measurements for the detection of essential proteins in biological networks. Frontiers in Physiology. 2016;7:375. pmid:27616995
View Article
PubMed/NCBI
Google Scholar

[21] View Article

[22] PubMed/NCBI

[23] Google Scholar

[ref9] 9. Wang K, Quan W, Cheng N, Liu M, Liu Y, Chan HA. Betweenness centrality based software defined routing: Observation from practical Internet datasets. ACM Transactions on Internet Technology (TOIT). 2019;19(4):1–19.
View Article
Google Scholar

[25] View Article

[26] Google Scholar

[ref10] 10. Das K, Samanta S, Pal M. Study on centrality measures in social networks: a survey. Social network analysis and mining. 2018;8(1):1–11.
View Article
Google Scholar

[28] View Article

[29] Google Scholar

[ref11] 11. Ibnoulouafi A, El Haziti M, Cherifi H. M-centrality: identifying key nodes based on global position and local degree variation. J Stat Mech Theory Exp. 2018;2018(7):073407.
View Article
Google Scholar

[31] View Article

[32] Google Scholar

[ref12] 12. Berahmand K, Bouyer A, Samadi N. A new local and multidimensional ranking measure to detect spreaders in social networks. Computing. 2019;101(11):1711–1733.
View Article
Google Scholar

[34] View Article

[35] Google Scholar

[ref13] 13. Guimera R, Amaral LAN. Functional cartography of complex metabolic networks. nature. 2005;433(7028):895–900. pmid:15729348
View Article
PubMed/NCBI
Google Scholar

[37] View Article

[38] PubMed/NCBI

[39] Google Scholar

[ref14] 14. Chakraborty D, Singh A, Cherifi H. Immunization strategies based on the overlapping nodes in networks with community structure. In: International conference on computational social networks. Springer, Cham; 2016. p. 62–73.

[ref15] 15. Gupta N, Singh A, Cherifi H. Centrality measures for networks with community structure. Physica A: Statistical Mechanics and its Applications. 2016;452:46–59.
View Article
Google Scholar

[42] View Article

[43] Google Scholar

[ref16] 16. Tulu MM, Hou R, Younas T. Identifying influential nodes based on community structure to speed up the dissemination of information in complex network. IEEE Access. 2018;6:7390–7401.
View Article
Google Scholar

[45] View Article

[46] Google Scholar

[ref17] 17. Kumar M, Singh A, Cherifi H. An efficient immunization strategy using overlapping nodes and its neighborhoods. In: Companion Proceedings of the The Web Conference 2018; 2018. p. 1269–1275.

[ref18] 18. Ghalmane Z, El Hassouni M, Cherifi C, Cherifi H. Centrality in modular networks. EPJ Data Science. 2019;8(1):15.
View Article
Google Scholar

[49] View Article

[50] Google Scholar

[ref19] 19. Magelinski T, Bartulovic M, Carley KM. Measuring node contribution to community structure with modularity vitality. IEEE Transactions on Network Science and Engineering. 2021;8(1):707–723.
View Article
Google Scholar

[52] View Article

[53] Google Scholar

[ref20] 20. Blöcker C, Nieves JC, Rosvall M. Map Equation Centrality: A Community-Aware Centrality Score Based on the Map Equation. arXiv preprint arXiv:220112590. 2022.

[ref21] 21. Rajeh S, Savonnet M, Leclercq E, Cherifi H. Comparing Community-Aware Centrality Measures in Online Social Networks. In: International Conference on Computational Data and Social Networks. Springer; 2021. p. 279–290.

[ref22] 22. Rajeh S, Savonnet M, Leclercq E, Cherifi H. Comparative evaluation of community-aware centrality measures. Quality & Quantity. 2022; p. 1–30.
View Article
Google Scholar

[57] View Article

[58] Google Scholar

[ref23] 23. Holme P, Saramäki J. Temporal networks. Physics reports. 2012;519(3):97–125.
View Article
Google Scholar

[60] View Article

[61] Google Scholar

[ref24] 24. Latapy M, Viard T, Magnien C. Stream graphs and link streams for the modeling of interactions over time. Social Network Analysis and Mining. 2018;8(1):1–29.
View Article
Google Scholar

[63] View Article

[64] Google Scholar

[ref25] 25. Sun X, Gollnick V, Wandelt S. Robustness analysis metrics for worldwide airport network: A comprehensive study. Chinese Journal of Aeronautics. 2017;30(2):500–512.
View Article
Google Scholar

[66] View Article

[67] Google Scholar

[ref26] 26. Lancichinetti A, Kivelä M, Saramäki J, Fortunato S. Characterizing the community structure of complex networks. PloS one. 2010;5(8):e11976. pmid:20711338
View Article
PubMed/NCBI
Google Scholar

[69] View Article

[70] PubMed/NCBI

[71] Google Scholar

[ref27] 27. Newman ME, Girvan M. Finding and evaluating community structure in networks. Physical review E. 2004;69(2):026113. pmid:14995526
View Article
PubMed/NCBI
Google Scholar

[73] View Article

[74] PubMed/NCBI

[75] Google Scholar

[ref28] 28. Cherifi H, Palla G, Szymanski BK, Lu X. On community structure in complex networks: challenges and opportunities. Applied Network Science. 2019;4(1):1–35.
View Article
Google Scholar

[77] View Article

[78] Google Scholar

[ref29] 29. Arenas A, Diaz-Guilera A, Pérez-Vicente CJ. Synchronization reveals topological scales in complex networks. Physical review letters. 2006;96(11):114102. pmid:16605825
View Article
PubMed/NCBI
Google Scholar

[80] View Article

[81] PubMed/NCBI

[82] Google Scholar

[ref30] 30. Cheng XQ, Shen HW. Uncovering the community structure associated with the diffusion dynamics on networks. J Stat Mech Theory Exp. 2010;2010(04):P04024.
View Article
Google Scholar

[84] View Article

[85] Google Scholar

[ref31] 31. Anderson RM, May RM. Population biology of infectious diseases: Part I. Nature. 1979;280(5721):361–367. pmid:460412
View Article
PubMed/NCBI
Google Scholar

[87] View Article

[88] PubMed/NCBI

[89] Google Scholar

[ref32] 32. Lancichinetti A, Fortunato S, Radicchi F. Benchmark graphs for testing community detection algorithms. Physical review E. 2008;78(4):046110. pmid:18999496
View Article
PubMed/NCBI
Google Scholar

[91] View Article

[92] PubMed/NCBI

[93] Google Scholar

[ref33] 33. Orman K, Labatut V, Cherifi H. In: Menezes R, Evsukoff A, González MC, editors. An Empirical Study of the Relation between Community Structure and Transitivity. Springer Berlin Heidelberg; 2013. p. 99–110.

[ref34] 34. Orman GK, Labatut V, Cherifi H. Towards realistic artificial benchmark for community detection algorithms evaluation. International Journal of Web Based Communities. 2013;9(3):349–370.
View Article
Google Scholar

[96] View Article

[97] Google Scholar

[ref35] 35. Albert R, Barabási AL. Statistical mechanics of complex networks. Reviews of modern physics. 2002;74(1):47.
View Article
Google Scholar

[99] View Article

[100] Google Scholar

[ref36] 36. Boccaletti S, Latora V, Moreno Y, Chavez M, Hwang DU. Complex networks: Structure and dynamics. Physics reports. 2006;424(4-5):175–308.
View Article
Google Scholar

[102] View Article

[103] Google Scholar

[ref37] 37. Tsiotas D. Detecting differences in the topology of scale-free networks grown under time-dynamic topological fitness. Scientific reports. 2020;10(1):1–16. pmid:32606368
View Article
PubMed/NCBI
Google Scholar

[105] View Article

[106] PubMed/NCBI

[107] Google Scholar

[ref38] 38. Rosvall M, Bergstrom CT. Maps of random walks on complex networks reveal community structure. PNAS. 2008;105(4):1118–1123. pmid:18216267
View Article
PubMed/NCBI
Google Scholar

[109] View Article

[110] PubMed/NCBI

[111] Google Scholar

[ref39] 39. Blondel VD, Guillaume JL, Lambiotte R, Lefebvre E. Fast unfolding of communities in large networks. J Stat Mech Theory Exp. 2008;2008(10):P10008.
View Article
Google Scholar

[113] View Article

[114] Google Scholar

[ref40] 40. Liu W, Pellegrini M, Wang X. Detecting communities based on network topology. Scientific reports. 2014;4(1):1–7. pmid:25033828
View Article
PubMed/NCBI
Google Scholar

[116] View Article

[117] PubMed/NCBI

[118] Google Scholar

[ref41] 41. Abbe E. Community detection and stochastic block models: recent developments. The Journal of Machine Learning Research. 2017;18(1):6446–6531.
View Article
Google Scholar

[120] View Article

[121] Google Scholar

[ref42] 42. Bartolucci S, Caravelli F, Caccioli F, Vivo P. Emerging locality of network influence. arXiv preprint arXiv:200906307. 2020.

[ref43] 43. Bucur D. Top influencers can be identified universally by combining classical centralities. Scientific reports. 2020;10(1):1–14. pmid:33239723
View Article
PubMed/NCBI
Google Scholar

[124] View Article

[125] PubMed/NCBI

[126] Google Scholar

Figures

Abstract

Introduction

Proposed ranking strategy

Algorithm

Toy example

Synthetic networks

Influence of the community structure strength

Influence of the community size distribution

Influence of the degree distribution

Real-world networks

Spreading power of the proposed method

Influence of the community detection algorithm

Conclusion

Supporting information

S1 Fig. Impact of the community structure strength (μ) in synthetic networks.

S2 Fig. The histograms of the community size distribution for the synthetic networks.

S3 Fig. Impact of the community size distribution exponent (θ) in synthetic networks.

S4 Fig. Impact of the community size distribution exponent (θ) in synthetic networks.

S5 Fig. Impact of the degree distribution exponent (γ) in synthetic networks.

S6 Fig. Impact of the degree distribution exponent (γ) in synthetic networks.

S7 Fig. Impact of the community structure strength (μ) in real-world networks.

S8 Fig. Impact of the community structure strength (μ) in real-world networks.

S9 Fig. Impact of the community structure strength (μ) in real-world networks.

S10 Fig. Impact of the community structure strength (μ) in real-world networks.

S11 Fig. Impact of the community structure strength (μ) in real-world networks.

S12 Fig. Impact of the community structure strength (μ) in real-world networks.

S13 Fig. Impact of the community structure strength (μ) in real-world networks.

S14 Fig. The performance of the community-aware ranking scheme using the Louvain community detection algorithm.

S15 Fig. The performance of the community-aware ranking scheme using the Louvain community detection algorithm.

S16 Fig. The performance of the community-aware ranking scheme using the Louvain community detection algorithm.

S17 Fig. The performance of the community-aware ranking scheme using the Louvain community detection algorithm.

S18 Fig. The performance of the community-aware ranking scheme using the Louvain community detection algorithm.

S19 Fig. The performance of the community-aware ranking scheme using the Louvain community detection algorithm.

S20 Fig. The performance of the community-aware ranking scheme using the Louvain community detection algorithm.

S21 Fig. The histograms of the community size distribution for the real-world networks.

S22 Fig. The histograms of the community size distribution for the real-world networks.

S23 Fig. The histograms of the community size distribution for the real-world networks.

S1 Text. Supplementary material.

References