Figures
Abstract
Backbone extraction simplifies complex networks while retaining essential features. It reduces complexity without losing critical structural information. However, selecting the most suitable method remains challenging due to the diverse behaviors of existing techniques. This study evaluates eight structural backbone extraction methods designed for weighted networks. These methods leverage network topology rather than statistical weight distributions. A dataset of 33 real-world networks is analyzed, covering diverse sizes, topologies, and domains. Key metrics, such as Jaccard similarity and Overlap Coefficient, reveal distinct method behaviors. A hierarchical relationship emerges among methods. Primary Linkage Analysis (PLAM) captures the most substantial edges, forming the simplest backbone. Minimum Spanning Tree (MSP), Ultrametric Backbone (UMB), and Metric Backbone (MB) build on this structure, progressively adding connectivity and detail. The Doubly Stochastic Filter excels at preserving weight and degree distributions, connectivity, and transitivity. By contrast, the H-Backbone prioritizes high-weight edges but disrupts connectivity. Metric Backbone and Planar Maximally Filtered Graph ensure complete node preservation and maintain high reachability. These insights advance the understanding of structural backbone extraction techniques for weighted networks. They benefit applications in fields like biology, social networks, and transportation. Practitioners can better achieve goals like network simplification for visualization or property preservation for analysis.
Citation: Yassin A, Cherifi H, Seba H, Togni O (2025) Exploring weighted network backbone extraction: A comparative analysis of structural techniques. PLoS One 20(5): e0322298. https://doi.org/10.1371/journal.pone.0322298
Editor: Ahmed Mohamed Aziz Ismail, Central Asian University, UZBEKISTAN
Received: May 6, 2024; Accepted: March 17, 2025; Published: May 20, 2025
Copyright: © 2025 Yassin et al. This is an open access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited.
Data Availability: Held or will be held in a public repository: https://github.com/aliyassin4/structural-backbone-methods-comparison.
Funding: This material is based upon work supported by the Agence Nationale de Recherche under grant ANR-20-CE23-0002.
Competing interests: NO authors have competing interests.
1 Introduction
In recent years, the study of complex systems has witnessed a paradigm shift with the emergence of network analysis as a powerful tool [1–4]. Networks, comprised of interconnected nodes and edges, provide a comprehensive framework for understanding the intricate interactions within these systems [5–18]. Incorporating numerical weights on edges has given rise to weighted networks, offering a nuanced perspective on the intensity of node interactions [19].
The computational efficiency of many network algorithms relies on the sparsity principle, assuming that networks exhibit a certain degree of sparsity. While setting a threshold on edge weights is a common approach to reducing network size, it often sacrifices the natural diversity of edge weight distribution. Researchers have developed techniques to extract network backbones to address this challenge, preserving essential characteristics while downsizing the overall structure.
Backbone extraction has proven to be a valuable tool across a wide range of domains and applications. For instance, in biological networks, such as protein interaction and metabolic networks, backbones help identify key functional interactions. They also reveal regulatory mechanisms critical for understanding cellular behavior and disease pathways [8, 20, 21]. In transportation networks, backbone techniques optimize route planning, connectivity analysis, providing insights into critical infrastructure and traffic flow [22–25].
Similarly, in finance, backbone extraction methods are used to analyze systemic risk by identifying the most significant channels of contagion between financial institutions [26–28]. In social networks, backbones uncover the core structure of relationships, enhancing the understanding of information diffusion, opinion dynamics, and community structures [29–32]. Additionally, in ecological networks, backbones highlight the energy flow within food webs and help identify keystone species critical for ecosystem stability [33, 34].
Recent advances demonstrate the adaptability of backbone extraction techniques to novel applications. For example, in misinformation studies, network backbones reveal the structural dynamics of misinformation spread across digital platforms, such as WhatsApp and Facebook, aiding in combating fake news [31, 35]. In global environmental networks, backbone approaches help track international collaborations and highlight central actors critical for environmental governance [36]. Furthermore, in neuroscience, backbone techniques elucidate brain connectivity and its relationship with cognitive performance and mental health [37, 38].
These techniques fall into two main categories: statistical and structural methods. Statistical techniques use hypothesis testing or empirical weight distribution to evaluate the significance of each edge. The Disparity filter, Polya Urn filter, Noise Corrected Filter, Marginal Likelihood filter, ECM filter, GloSS filter, and LANS filter represent diverse approaches within this category.
Hypothesis testing involves generating p-values by contrasting observed edge weights with a null model. The p-values quantify the deviation from the null hypothesis. In contrast, empirical weight distribution-based techniques assess edge significance by comparing observed weights against the overall weight distribution.
On the other hand, structural techniques extract a backbone with specific topological properties by operating on the network’s topology. These methods can be broadly categorized into two categories: methods that extract a single substructure and methods that assign scores to nodes or edges. The latter category offers greater flexibility by adjusting threshold levels.
Researchers have extensively compared statistical methods in previous studies, providing valuable insights into their strengths, limitations, and applicability across various network contexts . Furthermore, several studies have explored the comparative performance of statistical and structural methods, highlighting the distinctive characteristics of these approaches and their suitability for specific types of networks. Researchers have extensively compared statistical methods in previous studies. They provide valuable insights into their strengths, limitations, and applicability across various network contexts [39–41]. Specifically, A recent study [42] systematically compares seven statistical hypothesis-testing backbone extraction methods across diverse networks, including Disparity Filter (DF) [43], Polya Urn Filter (PF) [44], Marginal Likelihood Filter (MLF) [45], Noise Corrected (NC) [46], Enhanced Configuration Model Filter (ECM) [47], Global Statistical Significance Filter (GloSS) [48], and Locally Adaptive Network Sparsification Filter (LANS) [49]. The study evaluates extracted backbones based on similarity, edge-feature correlations, global properties, and edge and degree distributions. It highlights the unique characteristics of these methods, such as DF and LANS favoring high-weight edges, ECM assigning lower significance to high-degree edges, and the PU filter’s strong preservation of weight distribution. Notably, the study reveals a hierarchical relationship among several methods, showing that backbones extracted using methods like NC and MLF encapsulate those of preceding techniques. This work underscores the importance of choosing an appropriate backbone method to preserve specific network properties and provides practical guidance for applications in weighted network analysis.
On the other hand, several studies have explored the comparative performance of statistical and structural methods. They highlight the distinctive characteristics of these approaches and their suitability for specific types of networks.
A prior investigation [50] examines six backbone extraction techniques in the Southeast Asian intercity air transport network. It covers five structural approaches (global weight thresholding, k-core decomposition [51], minimum spanning tree [52], primary linkage analysis [53], and multiple linkage analysis [54]) and one statistical method (the disparity filter) [43]. The study evaluates the extracted backbones based on geographical and topological structures. It highlights the utility of each technique for different transport research applications. K-core decomposition identifies well-connected cores in multiplex networks. Primary linkage analysis reveals hub-and-spoke configurations and functional regions. The disparity filter uncovers hidden structures and offers a comprehensive view of large-scale networks’ topological and spatial information.
Gomes Ferreira et al. [55] propose a four-step principled methodology for comparing and selecting the most appropriate backbone extraction method. This methodology is tailored to a given phenomenon of interest. This analysis includes using two structural methods (Global Thresholding, and High Salience Skeleton [56]) and eight statistical methods (RECAST [57], Disparity [43], Polya Urn [44], Marginal Likelihood [45], Noise Corrected [46], Global Statistical Significance, [48], Tripartite Backbone Extraction [58], and Stochastic Degree Sequence Model filters [59]). They validate their approach in two case studies: online discussions on Instagram and coordinated behavior in WhatsApp groups. Results show that each method can produce very different backbones. This underlines that choosing an adequate method is of utmost importance to reveal valuable knowledge about the particular phenomenon under investigation.
We evaluate six backbone extraction techniques in the context of the American Elementary School contact network [60] . This analysis included two statistical methods (Disparity Filter [43], Locally Adaptive Network Sparsification (LANS) [49]), three structural methods (Doubly Stochastic [61], High Salience Skeleton [56], Metric Backbone [62]), and one hybrid method (Globally and Locally Adaptive Network Backbone (GLANB) [63]). Thre results reveale distinct influences of each technique on the network’s community structure. For example, Disparity and LANS methods results in multi-component backbones. While the Doubly Stochastic method preserves transitivity and maintains a balanced ratio of intra- and inter-community edges. In contrast, other techniques emphasizes intra-community links. The GLANB method stood out for its filtering precision and accurate representation of the network’s community structure.
Despite these contributions, to the best of our knowledge, no prior work has focused exclusively on a comprehensive comparison of structural backbone extraction methods. Our preliminary work [64] addressed this gap by evaluating eight structural backbone extraction techniques in the context of the World Air Transportation network. First, the study examines the similarity between the extracted backbones. This is done by computing the Jaccard similarity scores for pairs of extracted backbones. Results reveal that shortest-path-based methods produced similar backbones, whereas the Doubly Stochastic and H-Backbone methods exhibit distinct and uncorrelated behaviors.
Then we examine the relationship between backbone edge inclusion and edge weight. Our findings show that the H-Backbone method predominantly retains high-weight edges. While the High Salience Skeleton and Doubly Stochastic methods capture a broader range of weight scales.
To quantify information loss resulting from the filtering process, we compare basic topological properties of the extracted backbones including: edge, node, and weight fractions. The Doubly Stochastic and H-Backbone methods preserve a significantly higher proportion of edges compared to other techniques. However, methods like the H-Backbone, High Salience Skeleton, and Doubly Stochastic fail to retain all nodes in the network.
Finally, we evaluate the backbone connectivity and transitivity by analyzing metrics such as reachability, the number of components, the size of the largest connected component, and transitivity. Results highlight that the Primary Linkage Analysis and High Salience Skeleton methods disrupt network connectivity. While the Doubly Stochastic method demonstrates superior performance in preserving transitivity.
However, the scope of this work was limited to a single network and a narrow range of properties. By building on our preliminary work, this study makes the following key contributions:
- Analysis Across a Diverse Dataset: This study extends previous work by employing a significantly larger and more diverse dataset. The dataset encompasses networks from various domains and of varying sizes. The expansion enhances the generalization of the findings. Consequently, they become applicable to a wider range of real-world scenarios, including social, biological, and infrastructural networks.
- Similarity Analysis: We begin by analyzing the similarities between different filtering techniques using the Jaccard Similarity score [65] and the Overlap Coefficient [66]. This experiment provides researchers with insights into which techniques may yield comparable results and which are more distinct.
- Backbone Edge Characteristics: Using the Point Biserial correlation [67] we examine the relationships between backbone edge inclusion and local edge properties such as: weight, degree, weighted degree, betweenness, and weighted betweenness. This analysis helps in understanding the sensitivity of backbone extraction methods to network local properties.
- Comparison of Global Backbone Properties: Adopting a macro-level view, we compare global backbone properties across the filtering techniques. This includes evaluating edge proportions, nodes, weights, weight entropy, reachability, number of components, and transitivity. These metrics provide a comprehensive view of how each technique impacts the network structure, offering valuable insights into their effectiveness in preserving critical properties.
- Comparison of Backbone Distributions: Finally, we compare the weight and degree distributions of the extracted backbones against the original networks using the two-sample Kolmogorov-Smirnov test (KS test) [68]. This experiment highlights how well each method captures the original network’s diversity in connectivity and interaction strength.
Each of these contributions advances the field by addressing critical gaps, providing a comprehensive understanding of structural backbone filtering techniques, and equipping researchers and practitioners with the knowledge to make informed choices tailored to their objectives.
The paper is organized as follows: Sect 2 introduces the structural backbone extraction methods. Sect 3 provides an overview of the dataset used in the study. Sect 4 examines the similarities between techniques using the Jaccard Similarity score and Overlap Coefficient. Sect 5 investigates the relationship between backbone edges and local edge properties, such as weight, degree, and betweenness. Sect 6 evaluates the global topological properties of the extracted backbones. Sect 7 analyzes the weight and degree distributions of the backbones. The results are discussed in Sect 8, followed by the conclusions in Sect 9. Finally, the Supporting information details the data, materials and evaluation measures of this study.
2 Structural backbone extraction methods
This section introduces the structural backbone extraction techniques evaluated in this study. These methods were selected based on several criteria. First, we prioritize weighted methods, as there has been limited prior work comparing weighted methods in weighted networks. Second, we focus on edge-based methods, as they directly target the relationships between nodes. Lastly, we include techniques frequently used in previous surveys and those recognized as the most popular in the field. The selected methods can be categorized into two groups: single-structure extraction methods and score-based extraction methods.
For more detailed information about each method, including their descriptions, advantages, disadvantages, and computational complexities, please refer to Table 1. This summary provides a concise comparison of the methods and highlights the trade-offs associated with each technique.
2.1 Single-structure extraction methods
These methods extract a single substructure from the network without the ability to adjust or tune parameters.
Maximum Spanning Tree Filter (MSP) [52]:.
This method extracts a subgraph that connects all nodes without forming cycles, maximizing the total edge weight. It ensures that all nodes are included while focusing on edges that maximize the network’s cumulative weight.
Planar Maximally Filtered Graph (PMFG) [69]:.
This method operates by iteratively reconstructing the graph. It adds edges with the highest weight one at a time, ensuring the resulting graph remains planar throughout the process. This approach balances edge retention with structural simplicity.
Primary Linkage Analysis (PLAM) [70]:.
For each node, this method selects the edge with the highest weight, creating a backbone that retains the most influential connections based on edge weights.
h-Backbone Filter (HB) [71]:.
Inspired by the h-index, this method combines edge weights, connectivity, and betweenness centrality. The procedure involves the extraction of an h-strength network, where h is the largest natural number such that there are h links, each with a weight at least equal to h. Subsequently, the method extracts an h-bridge network similarly. Here, the bridge of an edge is determined by dividing its edge betweenness by the total number of nodes in the network. Finally, the h-backbone is obtained by merging the information from the h-strength and h-bridge networks.
Metric and Ultrametric Distance Backbone Filters (MB and UMB) [62]:.
Both methods extract subgraphs consisting of shortest paths but differ in their definitions of path length. The Metric filter defines it as the sum of edge distances, while the Ultrametric filter uses the maximum edge distance in the path, resulting in different extracted subgraphs.
2.2 Score-based extraction methods
These methods assign scores to edges or nodes based on topological properties and extract the backbone by applying thresholds or selecting a fraction of scores, allowing for tuning and customization.
Doubly Stochastic Filter (DS) [61]:.
This method transforms the adjacency matrix into a doubly stochastic matrix through iterative normalization of row and column sums. The normalized weights are consideres the new scores. Then edges are sorted by normalized weights in descending order and sequentially added to the backbone until all nodes form a single connected component. Note that the transformation may not always be feasible for some networks.
High Salience Skeleton Filter (HSS) [56]:.
This method computes edge salience based on the proportion of shortest-path trees. Initially, a shortest path tree is constructed for each node by merging all the shortest paths from that node to every other node in the network. The edge salience is then computed, representing the proportion of shortest-path trees in which a given edge is present. The authors observed that edge salience follows a bimodal distribution near the boundaries 0 and 1. Consequently, they retain only the edges with salience near 1.
3 Data and methods
This section outlines the data and methods for each experiment.
3.1 Data
The data used in the experiments consists of 33 real-world networks spanning diverse domains, including character, web, social, biological, infrastructural, and economic networks. This dataset was chosen to ensure a comprehensive evaluation of the backbone extraction techniques across a variety of network types, sizes, and topologies. The diversity in density, average clustering coefficient, and node count provides a robust foundation for assessing the generalizability and adaptability of the methods. In addition, many of these networks have been used in previous surveys [42], adding credibility and comparability to the results.
Fig 1 illustrates the dataset’s diversity using a scatter plot, where density and average clustering coefficient are represented on the axes, and marker sizes indicate the number of nodes in each network. This visualization highlights the variation in structural characteristics across the networks. We refer to the Supporting information for a detailed description of each network.
Marker sizes represent the number of nodes in each network.
Character networks:.
Includes networks such as the Les Misérables [72] and the Game of Thrones network [73] which represents the occurrences of the characters of these books.
Biological networks:.
Includes networks such as the Budapest Connectome [74, 75], Human Connectome [76], C. elegans [77], Messel Shale [78], and Florida Bay [79] networks, each representing different biological systems such as brain connectivity, genetic interactions, and ecological food webs.
Economic networks:.
Consists of the Faculty Hiring US [80] and Job Mobility (https://www.michelecoscia.com/?page_id=312) networks, which depict patterns of academic hiring and job transitions, respectively.
Infrastructural networks:.
Encompass networks like Worldwide Air Transportation [81, 82], Openflights [83], US Airports 500 [84], Paris Bus [85], Paris Rail [85], and GLSN 2015 [86], representing various infrastructural systems such as air transportation, public transportation, and international shipping routes.
Offline social networks:.
Comprise networks such as the Davis’ Southern Women Club[87], Karate Club [88], Madrid Train Bombing Terrorists [89], Windsurfers [90], Illinois High School Students [91], ANU Residence Hall [92], American High School [93], American Middle School [94], American Elementary School [94], French Primary School [95], French High School [95], Workplace [95], ACM Hypertext 2009 Scientific Conference [95], and Geriatric Ward of French Hospital [95] networks, depicting social interactions in different offline contexts including schools, workplaces, and events.
3.2 Methods
This subsection outlines the methods and procedures employed in our experiments.Note that the experiments are conducted on a machine equipped with an Intel Core i9-11950H. We used Python 3.10 and the Netbone package [98] (version 0.2.3) to extract and evaluate the backbones.
3.2.1 Experiment 1: Exploring technique similarities.
The primary objective of the initial experiment is to unveil similarities among various backbone filtering techniques, employing two metrics: the Jaccard similarity (J) and the Overlap Coefficient (Ov).
The first step involves extracting the backbones in each network n using different backbone extraction methods. For each pair of backbones (B1 and B2), the set of edges (E1 and E2) is obtained, and the Jaccard similarity coefficient () is computed. Subsequently, the results are aggregated by calculating the mean (
) and standard deviation (
) across all networks using the following formulas:
In the second step, the Overlap Coefficient is computed based on the perspective of each set ( and
). Similar to the first procedure, the results are aggregated, generating a heatmap for visual representation.
3.2.2 Experiment 2: Exploring the relation between the backbone edges and the edge properties.
This experiment explores the relationship between backbone edges and edge properties, such as weight, degree, and betweenness, and their weighted versions. Since not all the methods assign scores to the edges, an extension of the Pearson correlation coefficient [99] is used. The Point Biserial Correlation coefficient is often used when one of the variables is dichotomous (having only two categories, often coded as 0 and 1), which in our case is the presence and absence of an edge in the backbone.
The analysis aims to uncover how the statistical significance of edges relates to specific network attributes, providing insights into the behavior and nature of the selected backbone edges. By investigating these associations, we can determine whether certain edge properties are more likely to be retained or discarded during the filtering process, thus offering a nuanced understanding of the decision-making mechanisms employed by filtering techniques.
The procedure starts by extracting the backbones for each network n using the different backbone extraction methods. Then, we assign the presence scores PB for each edge: a value 1 to an edge if it is present in the backbone and 0 otherwise.
Subsequently, for each property Y, we compute the edge property Y for every edge in the original network. Following this, Point Biserial correlation coefficients are determined between the edge presence scores PB for each backbone B and each edge property Y.
Finally a violin is generated for each edge property Y to visualize the Point Biserial correlation coefficients distribution for each method across all networks.
3.2.3 Experiment 3: Exploring the backbones global properties.
This experiment comprehensively examines global characteristics in extracted backbones through various techniques. The objective is to discern these methods’ impact on networks’ topological attributes. Additionally, we endeavor to evaluate how the network’s size influences the efficacy of backbone extraction methods in terms of global topological properties.
The experimental procedure commences with extracting backbones for each network n using diverse backbone extraction methods. Subsequently, we compute the value of each topological property (Y), denoted as P(Y,B), for each backbone B. To contextualize these values, we normalize them by the corresponding values of the original network (O) using the equation:
To facilitate the visualization of results, we depict the normalized values in a violin plot for each backbone extraction method.
3.2.4 Experiment 4: Comparing backbones distributions.
In the previous experiment, we explore the overarching characteristics of backbones and revealed the attributes associated with each method of backbone extraction. In this iteration, we replicate the same methodology focusing on distributions, specifically those about weight and degree. The experimental process is initiated by extracting backbones for each network n using diverse extraction methods. Subsequently, we extract the weight and degree values of both the extracted backbones (Dw(B) and Dd(B)) and the original network (Dw(O) and Dd(O)). Due to differences in backbone sizes, the distribution sizes also vary. Consequently, we opt to employ the Kolmogorov-Smirnov test for comparison. The Kolmogorov-Smirnov statistic () and (
) are computed for each backbone distribution in comparison to the original network.
For each property (weight, degree), the distribution of KS values is visually presented in a violin plot for each method of backbone extraction.
Subsequently, the methods are ranked based on the KS statistic values in each network, assigning a rank of 1 to the method closest to the original network and 8 to the method farthest away. Finally, the results are depicted in a box plot. It’s important to note that the experiment used networks with a minimum of 1000 edges.
4 Experiment 1: Exploring technique similarities
This section reveals commonalities among backbone filtering techniques using two complementary measures: the Jaccard score and the Overlap Coefficient. The procedure initiates in each network by extracting the backbones using various backbone extraction methods. Subsequently, the edge sets of each pair of methods are used to calculate the Jaccard score and Overlap Coefficient in each network. The results are then aggregated, and the mean and standard deviation are computed across all networks for each pair. This comprehensive approach provides insights into the similarities and overlaps between backbone extraction methods across diverse networks.
4.1 Jaccard similarity score
Fig 2 displays heatmaps illustrating the mean and standard deviation of Jaccard similarity scores between sets of edges for various backbone filtering techniques. The analysis reveals significant correlations among specific pairs of techniques, such as the strong correlations between Primary Linkage Analysis and Minimum Spanning Tree Filters (PLAM-MSP) and Ultrametric Backbone and Minimum Spanning Tree Filters (UMB-MSP).
The H-Backbone and Doubly Stochastic Filters stand out for their lack of correlation with other techniques, indicating distinct behavior. The low standard deviation in the heatmaps adds credibility to the observed correlations and behaviors, highlighting the consistency and reliability of the findings. Most other techniques show network-specific behaviors without consistent patterns across all networks. The study emphasizes the diverse behavior of backbone filtering techniques, with each method responding uniquely to network characteristics.
4.2 Overlap coefficient
Fig 3 presents heatmaps depicting the mean and standard deviation of the Overlap Coefficient between sets of edges for various pairs of backbones. Key observations from the analysis include:
As the overlap coefficient is calculated relative to each method within the pairs, the resulting heatmap is asymmetrical.
Total Overlap Among Methods:.
The Primary Linkage Analysis Backbone is included in the Minimum Spanning Tree, the Minimum Spanning Tree Backbone is included in the Ultrametric Backbone, and the Ultrametric Backbone is included in the Metric Backbone. In hierarchical terms, the relationships can be summarized as follows:
High Overlap Above 90%:.
High overlap is observed, with, on average, 90% and 97% of the High Salience Skeleton edges overlapping with the Planar Maximally Filtered Graph and Metric Backbones, respectively. On average, 93% of the Primary Linkage Analysis Backbones overlap with the Planar Maximally Filtered Graph Backbone.
Overlap Above 80%:.
Significant overlap is observed, with, on average, 84% and 86% of the H-Backbone edges overlapping with the Planar Maximally Filtered Graph and Metric Backbones, respectively. On average, 85% of the Ultrametric Backbone edges overlap with the Planar Maximally Filtered Graph Backbone. On average, 83% and 88% of the High Salience Skeleton edges overlap with the edges of the Minimum Spanning Tree and Ultrametric Backbones.
Unique Backbones with Doubly Stochastic Filter:.
The Doubly Stochastic filter yields unique backbones with low overlap percentages with other methods. The overlap ranges from 8% with the H-Backbone to 45% with the Metric Backbone. In contrast, all methods show a fair overlap, with approximately 55% of their edges overlapping with the Doubly Stochastic Backbones.
These findings underscore the distinctive characteristics and hierarchical relationships of various backbone filtering techniques. The Doubly Stochastic filter generates unique backbones with limited overlap with other methods.
5 Experiment 2: Exploring relationships between backbone edges and edge properties
In this experiment, we investigate the relationships between backbone edges and edge properties to understand whether specific properties persist or disappear during filtering. We calculate the Point Biserial correlation between backbone edges and each network property to analyze these associations, including weight, degree, weighted degree, betweenness, and weighted betweenness. The findings are then presented in violin plots, which comprehensively visualize each method’s distribution of correlation values.
5.1 Backbone edges and edge weights
Fig 4 displays the results of the Point Biserial correlation between backbone edges and edge weights. Analyzing the averages for each method, represented by the red points, reveals that all averages are less than 0.6. This suggests a lack of a strong correlation between the backbone edges and edge weights across all methods.
The average correlation is highlighted with a red point for each method.
However, when comparing the methods to each other, two extremes emerge. The Doubly Stochastic method exhibits correlation values highly concentrated at the bottom of the violin plot, with the lowest values and the average being less than 0.2. This indicates that the Doubly Stochastic method is the least sensitive among all methods to edge weights.
In contrast, the H-Backbone correlation values are highly concentrated at the top of the violin plot, with values among the highest and an average approaching 0.6. This implies that the H-Backbone method is more sensitive to edge weights than other methods, suggesting that edges with higher weights are more likely to be included in the backbone.
Between these two extremes, the other methods show no consistent behavior across all networks. The correlation values span between 0 and 0.8, indicating a nuanced relationship influenced by other properties that affect the association between backbone edges and edge weights.
5.2 Backbone edges and other edge properties
The Degree of an edge connecting node i with node j is defined as the product of the degrees of the connected nodes, represented as [100]. High-degree edges often link network hubs, a common feature in scale-free networks. Examining the relationship between edge backbones and edge degree provides insights into how backbone filtering techniques handle hub-connected edges.
Edge betweenness is a measure that quantifies the shortest paths traversing an edge in a network [101]. An edge with a high betweenness score is pivotal in connecting different network parts. Removing such edges can profoundly impact communication efficiency between various nodes by disrupting the shortest link paths.
Both properties introduce a weighted version by considering edge weights. Fig 5 displays the results of the Point Biserial correlation between backbone edges and edge degrees, betweenness, and the weighted versions of these two properties. In the top two panels, we observe that the average correlation is less than 0.25 in all methods for both edge degrees and betweenness. This indicates no strong relation between backbone edges and edge degree or edge betweenness. It suggests that these properties do not significantly influence the edge selection process, regardless of the filtering technique. Similar results are observed for the weighted versions of degree and betweenness in the bottom two panels of Fig 5.
Red points indicate each method’s average point biserial correlation, with edge properties including weight, degree, weighted degree, betweenness, and weighted betweenness under examination.
6 Experiment 3: Exploring the global properties of the backbones
This section delves into the impact of various methods on shaping the global network features of the extracted backbones. The procedure begins with the extraction of the backbone for each network. Subsequently, the properties of the extracted backbones are computed and normalized. The outcomes are then presented in a violin plot for each backbone extraction method.
6.1 Edge fraction
The primary goal of backbone extraction methods is to diminish the size of a network. As a result, larger networks allow the filtering of more edges than smaller networks. To scrutinize the behavior of these backbone extraction methods. Fig 6 illustrates the edge fraction of backbones obtained through diverse filtering techniques across all networks. This visual representation provides insights into how the methods perform regarding edge reduction across all networks.
Average edge fractions are highlighted with red points for each method.
The edge fraction distribution of H-backbones is prominently concentrated towards the bottom, with an average of 0.1. This observation underscores the efficacy of the H-backbone method in removing approximately 90% of the edges while preserving around 10%, with a few outliers.
In comparison, the High Salience Skeleton, Primary Linkage Analysis Method, and Minimum Spanning Tree exhibit a similar behavior but with a higher average edge fraction ranging between 0.1 and 0.2. Conversely, the plots of the other methods are more horizontally distributed, indicating a lack of consistent behavior across all networks. Some networks exhibit edge preservation of less than 10%, while others retain more than 50
Notably, the Doubly Stochastic method does not display extreme edge fractions exceeding 0.9 or falling below 0.1, suggesting a more balanced and moderate edge preservation across the networks it processes.
6.2 Node fraction
Preserving network nodes throughout the filtering process poses a considerable challenge for backbone filtering techniques. An effective technique should filter network edges without isolating nodes, ensuring the integrity of the network. It should retain the highest proportion of nodes within the network after the edge filtering process.
Fig 7 depicts the node fraction of backbones obtained through various filtering techniques across all networks. Notably, the Metric Backbone, Ultrametric Backbone, Minimum Spanning Tree, Planar Maximally Filtered Graph Method, and Primary Linkage Analysis filters consistently maintain all nodes across networks of different sizes.
Average node fractions are highlighted with red points for each method.
However, the Doubly Stochastic filter is less conservative in preserving the original network’s nodes, with values sometimes falling below 1, indicating node isolation in certain networks. Conversely, the High Salience Skeleton and H-Backbone filters tend to isolate many nodes, with average node fraction values around 0.6 and 0.4, respectively. Interestingly, node fraction values for the High Salience Skeleton are concentrated at the top, indicating a slightly conservative behavior. In contrast, in the H-Backbone, they are concentrated at the bottom, suggesting a more isolating behavior.
6.3 Weight fraction
Backbone filtering techniques collectively strive to maintain the diversity of edge weights while filtering network edges. However, our previous experiments have revealed that certain methods show a stronger tendency to preserve high edge weights. This leads us to compare the fractions of retained edge weights by these methods.
Fig 8 illustrates the weight fraction of backbones obtained through various filtering techniques across all networks. Notably, only the H-Backbone and High Salience Skeleton preserve a very low weight fraction, typically less than 10%
Average weight fractions are highlighted with red points for each method.
Conversely, the average weight fraction is around 0.5 for the Doubly Stochastic filter, with weight fractions spanning around the average and a few outliers. The Planar Maximally Filtered Graph Method presents a stick-shaped plot. It indicates a network-dependent behavior that preserves a high weight fraction in some networks while maintaining a minimum weight fraction of 0.2 in others. Lastly, the Metric Backbone boasts the highest average weight fraction, around 0.65. It consistently preserves a high weight fraction, exceeding 0.8 in many networks, with some outliers preserving less than 0.5.
6.4 Weight entropy
All backbone filtering techniques share the common objective of maintaining edge weight diversity while filtering network edges. This motivates us to compare the fraction of retained edge weights by these methods to assess their success in preserving these weight scales.
Fig 9 illustrates the normalized weight entropy of backbones obtained through various filtering techniques across all networks. The H-Backbone and High Salience Skeleton exhibit an average normalized weight entropy of around 0.6 with a high standard deviation. In some networks, the weight entropy is fully preserved, as indicated by a normalized entropy of 1, while in other networks, it decreases to a minimum of approximately 20%.
Average normalized weight entropy is highlighted with red points for each method.
Conversely, all other methods display higher average normalized weight entropy with low standard deviation. This indicates the effectiveness of these methods in preserving the original network’s weight entropy. The Metric Backbone and Doubly Stochastic backbones are particularly noteworthy, which exhibit very high normalized weight entropy, approximately 0.9. This underscores their ability to maintain the weight entropy across various networks.
6.5 Reachability
Reachability measures the connectivity between any pair of nodes within a network. Assessing the reachability of extracted backbones allows us to understand how interconnected these backbones are.
Fig 10 illustrates the reachability of backbones obtained through various filtering techniques across all networks. The Metric Backbone, Planar Maximally Filtered Graph Method, Ultrametric Backbone, and Minimum Spanning Tree Backbones consistently exhibit high reachability across all networks. The Doubly Stochastic backbones also demonstrate high reachability values in most networks, with only a few outliers. These methods showcase their ability to extract connected backbones with high reachability between nodes.
Average reachability is highlighted with red points for each method.
In contrast, the other methods show lower reachability values. The H-Backbone plot resembles a stick, with values between 0 and 1, indicating its network dependence. The High Salience Skeleton and Primary Linkage Analysis Method plots are highly concentrated at the bottom, suggesting low reachability values. Their average reachability is approximately 0.2, indicating a tendency to extract disconnected backbones in most networks.
6.6 Number of components
Low reachability values indicate a disconnected network, but they do not provide insight into the specific fragmentation pattern. To better understand the connectedness of the extracted backbones, we investigate the number of components within them.
Fig 11 illustrates the number of components in backbones obtained through various filtering techniques across all networks. Remarkably, the Planar Maximally Filtered Graph Method, Metric Backbone, Ultrametric Backbone, and Minimum Spanning Tree filters consistently exhibit the behavior of extracting a single component.
Average reachability is highlighted with red points for each method. Note, that the y-axis is in log scale.
Conversely, the other methods extract backbones with multiple components. Specifically, the Primary Linkage Analysis filter always retains multiple components. This observation aligns with the low reachability values observed in the previous experiment for these methods, except for the Doubly Stochastic filter. Although this method has many components, it also exhibits high reachability values. This discrepancy can be explained by large connected components in the Doubly Stochastic filter, unlike other methods where the extracted components are smaller.
6.7 Transitivity
Transitivity quantifies the likelihood that two neighboring nodes are also connected, offering insights into the interconnectedness among local clusters or triangles within the extracted backbones.
Fig 12 illustrates the transitivity of backbones obtained through various filtering techniques across all networks. The Primary Linkage Analysis, Minimum Spanning Tree, and High Salience Skeleton backbones consistently exhibit a transitivity equal to zero, indicating the systematic breaking of all cycles in the network through the filtration process. The Ultrametric Backbone plot shows normalized transitivity concentrated at the bottom, with an average of around 0.2. This method also tends to break many cycles in the network, with some outliers where transitivity values are high and occasionally preserved.
Average normalized transitivity is highlighted with red points for each method.
For the Metric Backbone, Planar Maximally Filtered Graph Method, and H-Backbones, there is no consistent behavior across all networks, indicating network dependency. In some networks, cycles are broken, while in others, cycles are less disrupted and more preserved. Notably, the Planar Maximally Filtered Graph Method and H-Backbones sometimes increase transitivity, possibly explained by breaking more network triads instead of cycles.
Finally, the Doubly Stochastic normalized transitivity values are concentrated at the top, with an average of approximately 0.8. This method demonstrates superiority over others in preserving the transitivity of the original network, indicating its efficacy in maintaining local cluster interconnections during the filtering process.
7 Experiment 4: Comparing backbone distributions
In the preceding experiment, we delved into the overarching characteristics of backbones and pinpointed attributes linked to each extraction method. We employ a parallel methodology, concentrating on distributions, specifically in weight and degree. The procedure initiates by extracting the backbones for each network utilizing distinct backbone extraction methods. Subsequently, we calculate the Kolmogorov-Smirnov statistic for each backbone distribution compared to the original network. Following this, we depict the KS statistic for each method across all networks in a violin plot. Finally, within each network, methods are ordered based on the KS statistic values, with the ranking reflecting the degree of alignment of each method with the original network. Ultimately, the outcomes are presented in a box plot.
7.1 Weight distribution
The comparison of the weight distribution of the filtered network backbone is crucial in evaluating the effectiveness of the filtering technique in retaining different weight scales. The weight scales in a network reflect the connections and relationships between the nodes, and a suitable filtering technique should preserve different scales of these weights, thus preserving the original weight distribution. This comparison allows us to identify which filtering methods best maintain the original network weight distribution.
In Fig 13, the violin plots illustrate the KS-statistic values for each backbone extraction method across all networks. Two distinct extreme behaviors are evident. On the extreme left, represented by the Doubly Stochastic (DS), there is a notable concentration of values between 0 and 0.2, with a few outliers creating an upward tail. This signifies the superiority of this method in accurately capturing the original weight distribution within the extracted backbones. Conversely, on the extreme right, depicted by the H-Backbone (HB), there is a high concentration of values between 0.8 and 1, with outliers causing a downward tail in the plot. This indicates that this method consistently falls short of capturing the original weight distribution. Between these extremes, the remaining methods consistently exhibit uniform values across all scales. This characteristic is depicted in the violin plot, resembling sticks. This suggests a network dependency, wherein these methods effectively capture the original weight distribution in specific networks but face challenges in others.
The KS-static represents the distance between the backbone and the original weight distribution.
Thus far, we have identified the most effective and least effective methods for capturing the original weight distribution. To rank the remaining methods, we analyze the boxplots illustrating the rankings of backbone extraction methods based on the Kolmogorov-Smirnov statistic across all networks, as depicted in Fig 14. Clearly, the Doubly Stochastic (DS)and H-Backbone (HB) occupy the first and last ranks, respectively. The Planar Maximally Filtered Graph Method (PMFG) is positioned between them, securing ranks 2 and 3, followed by the Metric backbone (MB), falling between ranks 3 and 4. The Ultrametric Backbone (UMB) attains an average rank of 4, while the Primary Linkage Analysis (PLAM) and Minimum Spanning Tree (MSP) methods rank between (4 and 5) and (5 and 6), respectively.
7.2 Degree distribution
Evaluating the degree distribution of the filtered network backbone is pivotal in assessing the filtering technique’s effectiveness in preserving diverse degrees of connectivity. Node degrees signify the connections and influence of nodes in a network, and an effective filtering method should retain various degrees, thereby conserving the original degree distribution. This comparative analysis enables the identification of filtering methods that most effectively uphold the inherent diversity of node degrees in the original network.
In Fig 15, the violin plots depict the KS-statistic values for each backbone extraction method across all networks, revealing two distinct extreme behaviors. On the extreme left, represented by the Doubly Stochastic (DS), the values are well-distributed between 0 and 1. This suggests a network dependency, where these methods adeptly capture the original degree distribution in specific networks. Conversely, on the extreme right, illustrated by the Primary Linkage Analysis Method (PLAM), there is a high concentration of values between 0.8 and 1, with outliers causing a downward tail in the plot. This indicates that this method consistently falls short of capturing the original degree distribution. Between these extremes, the values of the remaining methods are slightly concentrated at the top with longer tails. This signifies that these methods effectively capture the original degree distribution in specific networks but face challenges in others.
The KS-static represents the distance between the backbone and the original degree distribution.
Up to this point, we have identified the most and least effective methods for capturing the original degree distribution. To rank the remaining methods, we examine the boxplots illustrating the rankings of backbone extraction methods based on the Kolmogorov-Smirnov statistic across all networks, as depicted in Fig 16. The Doubly Stochastic (DS) and Primary Linkage Analysis (PLAM) methods occupy the first and last ranks, respectively. The Metric Backbone (MB) is positioned between them with an average rank of 2, followed by the Planar Maximally Filtered Graph Method (PMFG), falling between ranks 2 and 3. The Ultrametric Backbone (UMB) attains an average rank of 3, while the H-Backbone (HB) and Minimum Spanning Tree (MSP) methods hold average ranks of 4 and 5, respectively.
8 Discussion
This study compares eight structural backbone edge filtering techniques across a dataset of 33 networks with various sizes and topological properties from different domains. The primary objectives are to evaluate the similarity between filtering techniques and identify properties associated with each backbone extraction method.
In the initial experiment, the investigation into the similarity between backbone filtering techniques reveals a hierarchical relationship among backbone methods (). The Primary Linkage Analysis Backbone PLAM captures the most influential edges at the level of each node. The Minimum Spanning Tree Backbone MSP encompasses edges forming a maximum spanning tree, including those from PLAM. The Ultrametric Backbone UMB focuses on maximum-distance paths, incorporating edges from MSP. The Metric Backbone MB emphasizes shortest paths based on the sum of edge distances, including edges from UMB. These relationships imply that each method adds complexity or specificity to the selected edges, with PLAM focusing on influential edges, MSP emphasizing overall network connectivity, UMB prioritizing paths with maximum distances, and MB concentrating on paths with minimum total edge distances. This relation stems from their reliance on similar global approaches, such as shortest paths and connectivity. In contrast, the Doubly Stochastic Filter exhibits distinct behavior. It prioritizes local adjacency relationships through iterative normalization. This leads to minimal overlap with hierarchical methods (e.g., 8% with H-Backbone, 45% with MB). This contrast shows two complementary approaches. Shortest-path-based methods preserve global network structure, while the Doubly Stochastic Filter emphasizes localized relationships.
Substantial overlap between the High Salience Skeleton with Planar Maximally Filtered Graph and Metric Backbones indicates that the high salience skeleton backbone is nearly planar, with only a few edges breaking this planar property. Since the Metric backbone is unaffected by the reduction of the original graph, no shortest path is impacted. The High Salience Skeleton filter preserves over 90% of these shortest paths and removes only a few due to threshold settings.
The pronounced overlap between Primary Linkage Analysis and Planar Maximally Filtered Graph is expected, as both methods prioritize high-weight edges. However, the former operates at the node level, while the latter considers the entire graph. Similarly, the H-Backbone filters overlap with the Planar Maximally Filtered Graph and Metric Backbones.
In the second experiment, the focus shifts to the relationships between backbone edges and local edge properties. Examining edge weight, edge degree, edge betweenness, and their weighted versions reveals inconsistent relations between backbone edges and edge weights across networks. However, it identifies that high-weight edges are more likely present in the H-Backbone and less likely in the Doubly Stochastic and High Salience Skeleton Backbones. Other methods display inconsistent behaviors, indicating network-dependent relations affected by other network properties. No correlation is found between backbone edges and other edge properties across all networks, suggesting these properties do not influence the filtering process.
The third experiment involves extracting backbones and analyzing their global topological properties, such as the fraction of preserved edges, nodes, weights, weight entropy, reachability, number of components, and transitivity. Results, presented in Table 2, highlight consistent preservative behaviors in green, network-dependent behaviors in blue, and consistent non-preservative behaviors in red.
For example, the Minimum Spanning Tree filter significantly reduces graph density through heavy filtering while preserving other properties and breaking cycles. The Primary Linkage Analysis filter exhibits similar heavy filtering behavior but disrupts network connectivity during filtration.
The H-Backbone filter consistently demonstrates heavy filtering behavior but fails to preserve network nodes, with other properties depending on the specific network.
The High Salience Skeleton filter consistently exhibits heavy filtering behavior but fails to preserve network connectivity and transitivity, while node preservation and weight entropy depend on the specific network.
The Metric Backbone and Planar Maximally Filtered Graph filters consistently preserve all nodes in the backbone, maintaining their connectivity and weight entropy. The density of filtration and transitivity are network-dependent. The Ultrametric Backbone filter behaves similarly but consistently breaks cycles and disturbs network transitivity.
The Doubly Stochastic filter consistently preserves weight entropy, network connectivity, and transitivity. However, graph density and node preservation are network-dependent.
Furthermore, we investigate the influence of the network categories on the backbone properties. To do so, for each property under investigation, we plot a heatmap with the backbone methods on the x-axis and the network categories on the y-axis. The figures are provided in the Supporting information. Results show that the network type does not influence the performance and properties of the filtering techniques.
Finally, examining the weight and degree distributions of the extracted backbones reveals that the Doubly Stochastic filter better captures the weight distribution of the original network. In contrast, the H-Backbone filter exhibits the most considerable deviation. Regarding degree distribution, the Doubly Stochastic and Metric Backbone filters better preserve it than other methods, with the Primary Linkage Analysis and Minimum Spanning Tree filter’s backbones deviating the most.
Overall, these comprehensive experiments provide valuable insights into the behavior and characteristics of different structural backbone filtering techniques, shedding light on their strengths, weaknesses, and unique features in diverse network contexts.
9 Conclusion
In conclusion, this comprehensive study has systematically compared eight structural backbone edge filtering techniques across a diverse set of 33 networks spanning various sizes, topological properties, and domains. The goal was to assist users in selecting the most suitable technique for extracting a network backbone, a process crucial for expediting analysis and enhancing visualization in diverse applications.
The overlap analysis uncovered distinctive behavior in the Doubly Stochastic Filter, and hierarchical relationships among the backbones () highlighted the incremental complexity added by each subsequent method. The second experiment revealed significant overlaps between certain filtering techniques, such as the High Salience Skeleton with Planar Maximally Filtered Graph and Metric Backbones, indicating planarity and specific edge preservation characteristics.
The correlation analysis identifies a tendency for high-weight edges to be more prevalent in the H-Backbone and less likely in the Doubly Stochastic and High Salience Skeleton Backbones. Conversely, other filtering methods exhibit inconsistent behaviors, indicating that other network properties influence these relationships and are, thus, network-dependent.
The properties analysis conducted in this study revealed three distinctive behavioral types for each topological property: consistent, inconsistent, and network-dependent behaviors, as succinctly summarized in Table 2. Notable observations include the Doubly Stochastic filter as the singular method effectively preserving network transitivity. Conversely, the Minimum Spanning Tree, High Salience Skeleton, H-Backbone, and Primary Linkage Analysis filters significantly reduced network size. Among the filters, the Minimum Spanning Tree, Doubly Stochastic, Metric, Ultrametric Backbone, and Planar Maximally Filtered Graph demonstrated efficacy in preserving network connectivity and weight entropy.
Distribution analysis brought to light the unique proficiency of the Doubly Stochastic filter, setting it apart from all other methods, particularly in capturing the original weight and degree distributions. The Metric Backbone filter closely followed suit, showcasing similar capabilities in weight distribution.
These findings provide valuable insights for users who select an appropriate backbone extraction method based on individual method properties. It is worth noting that future research endeavors will expand on this investigation, encompassing additional backbone extraction methods and properties further to enhance our understanding of network backbone extraction techniques. The systematic comparison presented here lays a solid foundation for informed decision-making in choosing the most suitable method for a given analytical or visualization task in diverse network applications.
Supporting information
S1 File. Provides a detailed description of the Dataset.
https://doi.org/10.1371/journal.pone.0322298.s001
(PDF)
S2 File. Provides a detailed description for the Network Properties.
https://doi.org/10.1371/journal.pone.0322298.s002
(PDF)
S3 File. Provides a detailed description for the Evaluation Measures.
https://doi.org/10.1371/journal.pone.0322298.s003
(PDF)
S4 File. Provides detailed results of the influence of network categories on the backbone properties.
https://doi.org/10.1371/journal.pone.0322298.s004
(PDF)
References
- 1. Newman MEJ. Analysis of weighted networks. Phys Rev E. 2004;70(5):056131.
- 2. Albert R, Barabási AL. Statistical mechanics of complex networks. Rev Mod Phys. 2002;74(1):47–97.
- 3. Newman MEJ. The structure and function of complex networks. SIAM Rev. 2003;45(2):167–256.
- 4.
Newman M. Networks. Oxford University Press; 2010. Available from: https://academic.oup.com/book/27303
- 5.
Brugnoli E, Cinelli M, Zollo F, Quattrociocchi W, Scala A. Lexical convergence and collective identities on Facebook; 2020. Available from: http://arxiv.org/abs/1903.11452
- 6. Grinberg N, Joseph K, Friedland L, Swire-Thompson B, Lazer D. Fake news on Twitter during the 2016 U.S. presidential election. Science. 2019;363(6425):374–8.
- 7. Güell O, Sagués F, Serrano MÁ. Detecting the significant flux backbone of Escherichia coli metabolism. FEBS Lett. 2017;591(10):1437–51. pmid:28391640
- 8. Serrano MÁ, Boguñá M, Sagués F. Uncovering the hidden geometry behind metabolic networks. Molecular BioSystems. 2012;8(3):843–50. pmid:22228307
- 9. Zádor Z, Zhu Z, Smith M, Gorgoni S. A Weighted and Normalized Gould-Fernandez brokerage measure. PLOS ONE. 2022;17(9):e0274475.
- 10. Zappitelli J, Smith E, Padgett K, Bilec MM, Babbitt CW, Khanna V. Quantifying energy and greenhouse gas emissions embodied in global primary plastic trade network. ACS Sustain Chem Eng. 2021;9(44):14927–36.
- 11. Tilly S, Livan G. Macroeconomic forecasting with statistically validated knowledge graphs. Expert Systems with Applications. 2021;186:115765.
- 12. Li M-X, Palchykov V, Jiang Z-Q, Kaski K, Kertész J, Miccichè S, et al. Statistically validated mobile communication networks: the evolution of motifs in European and Chinese data. New J Phys. 2014;16(8):083038.
- 13. Keller-Ressel M, Nargang S. The hyperbolic geometry of financial networks. Scientific Reports. 2021;11(1):4732.
- 14.
Ho ATY. Interconnectedness through the lens of consumer credit markets. In: De Paula Á, Tamer E, Voia MC, editors. Advances in econometrics. Emerald Publishing Limited; 2020. p. 315–33. Available from: https://www.emerald.com/insight/content/doi/10.1108/S0731-905320200000042015/full/html
- 15.
Alanis-Lobato G, Andrade-Navarro MA. A reliable and unbiased human protein network with the disparity filter; 2017. Available from: https://www.biorxiv.org/content/10.1101/207761v1
- 16. Cantini L, Medico E, Fortunato S, Caselle M. Detection of gene communities in multi-networks reveals cancer drivers. Sci Rep. 2015;5(1):17386. pmid:26639632
- 17. Gonzalez-Astudillo J, Cattai T, Bassignana G, Corsi M-C, De Vico Fallani F. Network-based brain-computer interfaces: principles and applications. J Neural Eng. 2021;18(1):011001. pmid:33147577
- 18. Frassineti L, Parente A, Manfredi C. Multiparametric EEG analysis of brain network dynamics during neonatal seizures. J Neurosci Meth. 2021;348:109003. pmid:33249182
- 19. Barrat A, Barthélemy M, Pastor-Satorras R, Vespignani A. The architecture of complex weighted networks. Proc Natl Acad Sci. 2004;101(11):3747–52. pmid:15007165
- 20. Zádor Z, Zhu Z, Smith M, Gorgoni S. A weighted and normalized Gould-Fernandez brokerage measure. PLOS ONE. 2022;17(9):e0274475. pmid:36107935
- 21.
Güell O. Detection of evolution and adaptation fingerprints in metabolic networks. In: A network-based approach to cell metabolism. Cham: Springer International Publishing; 2017. p. 101–13. Available from: http://link.springer.com/10.1007/978-3-319-64000-6_5
- 22. Teixeira F, Derudder B. SKYNET: An R package for generating air passenger networks for urban studies. Urban Stud. 2019;56(14):3030–44.
- 23. Charyyev B, Gunes MH. Complex network of United States migration. Computat Soc Netw. 2019;6(1):1.
- 24.
Charyyev B, Solmaz M, Gunes MH. Dynamic network of United States air transportation at multiple levels. Complex Netw XI. 2020;282–93. http://link.springer.com/10.1007/978-3-030-40943-2_24
- 25.
Rashid NU, Chan KT. A review on transportation network based on complex network approach; 2023. Available from: https://www.semanticscholar.org/paper/A-Review-on-Transportation-Network-based-on-Complex-Rashid-Chan/07447503357a93843b2c03c7ca05d8154a0d11f2
- 26. Keller-Ressel M, Nargang S. The hyperbolic geometry of financial networks. Sci Rep. 2021;11(1):4732. pmid:33637827
- 27. Musciotto F, Piilo J, Mantegna RN. High-frequency trading and networked markets. Proc Natl Acad Sci U S A. 2021;118(26):e2015573118. pmid:34172575
- 28. Liebig J, Rao A. Fast extraction of the backbone of projected bipartite networks to aid community detection. EPL. 2016;113(2):28003.
- 29.
Del Vicario M, Zhang Q, Bessi A, Caldarelli G, Zollo F. Structural patterns of the occupy movement on Facebook. In: Cherifi H, Gaito S, Quattrociocchi W, Sala A, editors. Complex networks & their applications V. Studies in computational intelligence. Cham: Springer International Publishing; 2017. p. 595–606.
- 30. Yang HN, Xu XJ, Liang H, Wang X. A comparative study of online communities and popularity of BBS in four Chinese universities. PLOS ONE. 2020;15(6):e0234469.
- 31. Nobre GP, Ferreira CHG, Almeida JM. A hierarchical network-oriented analysis of user participation in misinformation spread on WhatsApp. Inform Process Manag. 2022;59(1):102757.
- 32.
Ferreira CHG, Mellia M, Almeida JM. Modeling and analyzing collective behavior captured by many-to-many networks. Brazilian Symposium on Multimedia and the Web; 2023.
- 33. Bellingeri M, Bodini A. Food web’s backbones and energy delivery in ecosystems. Oikos. 2016;125(4):586–94.
- 34.
Compson ZG, Monk WA, Hayden B, Bush A, O’Malley Z, Hajibabaei M, et al. Network-based biomonitoring: Exploring freshwater food webs with stable isotope analysis and DNA metabarcoding. Front Ecol Evol. 2019;7. https://doi.org/10.3389/fevo.2019.00395
- 35. Rauchfleisch A, Siegen D, Vogler D. How COVID-19 displaced climate change: Mediated climate change activism and issue attention in the Swiss media and online sphere. Environ Commun. 2023;17(3):313–21.
- 36.
Carattini S, Fankhauser S, Gao J, Gennaioli C, Panzarasa P. What does network analysis teach us about international environmental cooperation?; 2021. Available from: http://arxiv.org/abs/2106.08883
- 37. Frassineti L, Parente A, Manfredi C. Multiparametric EEG analysis of brain network dynamics during neonatal seizures. J Neurosci Meth. 2021;348:109003. pmid:33249182
- 38. Zangrossi A, Zanzotto G, Lorenzoni F, Indelicato G, Cannas Aghedu F, Cermelli P, et al. Resting-state functional brain connectivity predicts cognitive performance: An exploratory study on a time-based prospective memory task. Behav Brain Res. 2021;402:113130. pmid:33444694
- 39. Yassin A, Cherifi H, Seba H, Togni O. Air transport network: A comparison of statistical backbone filtering techniques. Complex Netw Appl XI. 2023;1078:551–64.
- 40.
Yassin A, Cherifi H, Seba H, Togni O. Exploring Statistical Backbone Filtering Techniques in the Air Transportation Network. In: 2022 IEEE Workshop on Complexity in Engineering (COMPENG). Florence, Italy: IEEE; 2022. p. 1–8. Available from: https://ieeexplore.ieee.org/document/9905432/
- 41.
Neal ZP, Domagalski R, Sagan B. Comparing models for extracting the backbone of bipartite projections. arXiv; 2021.
- 42. Yassin A, Cherifi H, Seba H, Togni O. Backbone extraction through statistical edge filtering: A comparative study. PLOS ONE. 2025;20(1):e0316141. pmid:39752450
- 43. Serrano MA, Boguna M, Vespignani A. Extracting the multiscale backbone of complex weighted networks. Proc Natl Acad Sci. 2009;106:6483–8.
- 44. Marcaccioli R, Livan G. A Pólya urn approach to information filtering in complex networks. Nat Commun. 2019;10:745. pmid:30765706
- 45. Dianati N. Unwinding the hairball graph: pruning algorithms for weighted complex networks. Physical Review E. 2016;93:012304. pmid:26871089
- 46.
Coscia M, Neffke FMH. Network backboning with noisy data. IEEE; 2017. p. 425–36.
- 47.
Gemmetto V, Cardillo A, Garlaschelli D. Irreducible network backbones: Unbiased graph filtering via maximum entropy; 2017. Available from: https://arxiv.org/abs/1706.00230
- 48. Radicchi F, Ramasco JJ, Fortunato S. Information filtering in complex weighted networks. Phys Rev E. 2011;83:046101.
- 49. Foti NJ, Hughes JM, Rockmore DN. Nonparametric sparsification of complex multiscale networks. PLoS ONE. 2011;6:e16431. pmid:21346815
- 50. Dai L, Derudder B, Liu X. Transport network backbone extraction: A comparison of techniques. J Transp Geogr. 2018;69:271–81.
- 51.
Alvarez-Hamelin J, Dall’ Asta L, Barrat A, Vespignani A. Large scale networks fingerprinting and visualization using the k-core decomposition. In: Weiss Y, Schölkopf B, Platt J, editors. Advances in neural information processing systems. Vol. 18. MIT Press; 2005.
- 52. Kruskal JB Jr. On the shortest spanning subtree of a graph and the traveling salesman problem. Proc Am Math Soc. 1956;7:48–50.
- 53.
Nystuen JD, Dacey MF. A graph theory interpretation of nodal regions. Papers of the Regional Science Association. Vol. 7. Springer; 1961. p. 29–42.
- 54. Rushton G, Haggett P, Cliff A, Frey A. Locational analysis in human geography. Geograph Rev. 1980;70:112.
- 55. Gomes Ferreira CH, Murai F, Silva APC, Trevisan M, Vassio L, Drago I, et al. On network backbone extraction for modeling online collective behavior. PLOS ONE. 2022;17(9):1–36. pmid:36107952
- 56. Grady D, Thiemann C, Brockmann D. Robust classification of salient links in complex networks. Nat Commun. 2012;3:864. pmid:22643891
- 57. Vaz de Melo POS, Viana AC, Fiore M, Jaffrès-Runser K, Le Mouël F, Loureiro AAF, et al. RECAST: Telling apart social and random relationships in dynamic networks. Perform Eval. 2015;87:19–36.
- 58.
Gomes Ferreira CH, Murai F, Couto da Silva AP, de Almeida JM, Trevisan M, Vassio L, et al. Unveiling community dynamics on instagram political network. Proceedings of the 12th ACM Conference on Web Science; 2020. p. 231–40.
- 59. Neal ZP, Domagalski R, Sagan B. Comparing alternatives to the fixed degree sequence model for extracting the backbone of bipartite projections. Sci Rep. 2021;11(1):23929. pmid:34907253
- 60.
Yassin A, Cherifi H, Seba H, Togni O. A modular network exploration of backbone extraction techniques. In: Cherifi H, Rocha LM, Cherifi C, Donduran M, editors. Complex networks & their applications XII. Cham: Springer Nature Switzerland; 2024. p. 296–308.
- 61. Slater PB. A two-stage algorithm for extracting the multiscale backbone of complex weighted networks. Proc Natl Acad Sci U S A. 2009;106(26):E66. pmid:19549841
- 62. Simas T, Correia RB, Rocha LM. The distance backbone of complex networks. J Complex Netw. 2021;9:cnab021. pmid:38348382
- 63. Zhang X, Zhang Z, Zhao H, Wang Q, Zhu J. Extracting the globally and locally adaptive backbone of complex networks. PLOS ONE. 2014;9(6):e100428.
- 64. Yassin A, Cherifi H, Seba H, Togni O. Air transportation network backbone extraction: A comparative analysis of structural filtering techniques. Computat Data Soc Netw. 2024;14479:337–48. https://link.springer.com/10.1007/978-981-97-0669-3_31
- 65. Jaccard P. The distribution of the flora in the alpine zone. New Phytol. 1912;11(2):37–50.
- 66. Vijaymeena MK, Kavitha K. A survey on similarity measures in text mining. Mach Learn Appl: Int J. 2016;3:19–28.
- 67. MacCallum RC, Zhang S, Preacher KJ, Rucker DD. On the practice of dichotomization of quantitative variables. Psychol Meth. 2002;7(1):19–40. pmid:11928888
- 68. Hodges JL. The significance probability of the smirnov two-sample test. Ark Mat. 1958;3(5):469–86.
- 69. Tumminello M, Aste T, Di Matteo T, Mantegna RN. A tool for filtering information in complex systems. Proc Natl Acad Sci U S A. 2005;102(30):10421–6. pmid:16027373
- 70.
Nystuen J, Dacey M. A graph theory interpretation of nodal regions. Papers of the Regional Science Association; 2005. p. 7. https://doi.org/10.1111/j.1435-5597.1961.tb01769.x
- 71. Zhang RJ, Stanley HE, Ye FY. Extracting h-backbone as a core structure in weighted networks. Sci Rep. 2018;8(1):1–7.
- 72.
Knuth D. The Stanford GraphBase. A platform for combinatorial computing; 1993.
- 73. Beveridge A, Shan J. Network of thrones. Math Horizons. 2016;23:18.
- 74. Szalkai B, Kerepesi C, Varga B, Grolmusz V. The Budapest reference connectome server v2.0. Neurosci Lett. 2015;595:60–2. pmid:25862487
- 75. Szalkai B, Kerepesi C, Varga B, Grolmusz V. Parameterizable consensus connectomes from the Human Connectome Project: The Budapest reference connectome server v3.0. Cogn Neurodyn. 2017;11:113–6. pmid:28174617
- 76. Hagmann P, Cammoun L, Gigandet X, Meuli R, Honey CJ, Wedeen VJ, et al. Mapping the structural core of human cerebral cortex. PLoS Biol. 2008;6:e159. pmid:18597554
- 77. Cook SJ, Jarrell TA, Brittin CA, Wang Y, Bloniarz AE, Yakovlev MA, et al. Whole-animal connectomes of both Caenorhabditis elegans sexes. Nature. 2019;571(7763):63–71. pmid:31270481
- 78. Dunne J, Labandeira C, Williams R. Highly resolved early Eocene food webs show development of modern trophic structure after the end-Cretaceous extinction. Proc Biol Sciences/The Royal Society. 2014;281:20133280. pmid:24648225
- 79.
Ulanowicz R, Heymans J, Egnotovich M. Network analysis of trophic dynamics in South Florida ecosystems, FY 99: The Graminoid Ecosystem. FY97: The Florida Bay Ecosystem; 2000.
- 80. Wapman K, Clauset A, Larremore D. Quantifying hierarchy and dynamics in US faculty hiring and retention. Nature. 2022;610:1–8. pmid:36131023
- 81. Alves L, Aleta A, Rodrigues F, Moreno Y, Amaral L. Centrality anomalies in complex networks as a result of model over-simplification. New J Phys. 2020;22:013043.
- 82. Diop IM, Cherifi C, Diallo C, Cherifi H. Revealing the component structure of the world air transportation network. Appl Netw Sci. 2021;6(1):1–50. pmid:34841043
- 83.
Opsahl T. Why Anchorage is not (that) important: Binary ties and Sample selection; 2011. Available from: https://wp.me/poFcY-Vw
- 84. Colizza V, Pastor-Satorras R, Vespignani A. Reaction-diffusion processes and metapopulation models in heterogeneous network. Nat Phys. 2007;3.
- 85. Kujala R, Weckström C, Darst R, Mladenović M, Saramäki J. A collection of public transport network data sets for 25 cities. Sci Data. 2018;5:180089.
- 86. Xu M, Pan Q, Xia H, Masuda N. Estimating international trade status of countries from global liner shipping networks. R Soc Open Sci. 2020;7:200386. pmid:33204446
- 87. Clark TD, Davis A, Gardner BB, Gardner MR, Warner WL. Deep south: A social anthropological study of caste and class. J South History. 1942;8.
- 88.
Shore KA. Complex networks: Principles, methods and applications. Contemp Phys. 2018. https://doi.org/10.1080/00107514.2018.1450296
- 89. Hayes B. Connecting the dots: Can the tools of graph theory and social-network studies unravel the next big plot? Am Sci. 2006;94:400
- 90. Freeman L, Freeman S, Michaelson A. On human social intelligence. Journal of Social and Biological Systems. 1988;11:415–25.
- 91.
Coleman JS. Introduction to mathematical Sociology; 1964. Available from: https://www.abebooks.com/Introduction-Mathematical-Sociology-COLEMAN-James-S/189127582/bd
- 92. Freeman LC, Webster CM, Kirke DM. Exploring social structure using dynamic three-dimensional color images. Soc Netw. 1998;20:109–18.
- 93. Stehlé J, Voirin N, Barrat A, Cattuto C, Colizza V, Isella L, et al. Simulation of an SEIR infectious disease model on the dynamic contact network of conference attendees. BMC Med. 2011;9:87. pmid:21771290
- 94. Toth D, Leecaster M, Pettey W, Gundlapalli A, Gao H, Rainey J, et al. The role of heterogeneity in contact timing and duration in network models of influenza spread in schools. J R Soc Interface. 2015;12(108):20150279. pmid:26063821
- 95.
SocioPatterns: Data-driven social dynamics and human activity. Available from: http://www.sociopatterns.org
- 96. Opsahl T. Triadic closure in two-mode networks: Redefining the global and local clustering coefficients. Soc Netw. 2013;35:159–67.
- 97. Fu X, Yu S, Benson AR. Modelling and analysis of tagging networks in Stack Exchange communities. J Complex Netw. 2020;8.
- 98. Yassin A, Haidar A, Cherifi H, Seba H, Togni O. An evaluation tool for backbone extraction techniques in weighted complex networks. Sci Rep. 2023;13(1):17000. pmid:37813946
- 99.
Benesty J, Chen J, Huang Y, Cohen I. Pearson correlation coefficient. Berlin, Heidelberg: Springer Berlin Heidelberg; 2009. p. 1–4.
- 100. Oehlers M, Fabian B. Graph metrics for network robustness—A survey. Mathematics. 2021;9:895.
- 101. Brandes U. On variants of shortest-path betweenness centrality and their generic computation. Soc Netw. 2008;30(2):136–45.