Skip to main content
Advertisement
Browse Subject Areas
?

Click through the PLOS taxonomy to find articles in your field.

For more information about PLOS Subject Areas, click here.

  • Loading metrics

A weighted network analysis framework for the hourglass effect—And its application in the C. elegans connectome

  • Ishaan Batta,

    Roles Conceptualization, Data curation, Formal analysis, Investigation, Methodology, Validation, Writing – original draft, Writing – review & editing

    Affiliation School of Electrical and Computer Engineering, Georgia Institute of Technology, Atlanta, Georgia, United States of America

  • Qihang Yao,

    Roles Data curation, Formal analysis, Investigation, Methodology, Validation, Visualization, Writing – review & editing

    Affiliation School of Computer Science, Georgia Institute of Technology, Atlanta, Georgia, United States of America

  • Kaeser M. Sabrin,

    Roles Methodology, Software

    Affiliation School of Computer Science, Georgia Institute of Technology, Atlanta, Georgia, United States of America

  • Constantine Dovrolis

    Roles Conceptualization, Funding acquisition, Investigation, Methodology, Project administration, Resources, Supervision, Validation, Writing – review & editing

    constantine@gatech.edu

    Affiliation School of Computer Science, Georgia Institute of Technology, Atlanta, Georgia, United States of America

Abstract

Understanding hierarchy and modularity in natural as well as technological networks is of utmost importance. A major aspect of such analysis involves identifying the nodes that are crucial to the overall processing structure of the network. More recently, the approach of hourglass analysis has been developed for the purpose of quantitatively analyzing whether only a few intermediate nodes mediate the information processing between a large number of inputs and outputs of a network. We develop a new framework for hourglass analysis that takes network weights into account while identifying the core nodes and the extent of hourglass effect in a given weighted network. We use this framework to study the structural connectome of the C. elegans and identify intermediate neurons that form the core of sensori-motor pathways in the organism. Our results show that the neurons forming the core of the connectome show significant differences across the male and hermaphrodite sexes, with most core nodes in the male concentrated in sex-organs while they are located in the head for the hermaphrodite. Our work demonstrates that taking weights into account for network analysis framework leads to emergence of different network patterns in terms of identification of core nodes and hourglass structure in the network, which otherwise would be missed by unweighted approaches.

Introduction

Many networks in both technological and biological systems tend to follow a structure where a large number of inputs and outputs are connected by pathways through a small number of intermediate nodes in the network [13]. In such a hierarchical organization of a network, these intermediate nodes are typically of high importance as they form either the bottleneck or the core of the overall information flow in the network. This pattern of hierarchy has been observed in many networks in biological as well as techno-social systems [2, 46]. In the case of various biological networks, modularity and hierarchy are well known to help organisms function ceaselessly with better robustness despite external disturbances to the input and output modules [7, 8]. In addition to this robustness for survival, modular and hierarchical networks are also helpful in evolvability [9], as certain peripheral (input/output) modules of less criticality in the information-flow of the network can be modified more effectively in response to prolonged external environment changes spanning across multiple generations of the organism [7, 10, 11].

In such scenarios, it is important to examine the set of nodes which are critical to the structure or functioning of the network. Many measures have been used in network science to analyze centrality structure in networks [1215]. More recently frameworks characterizing the bow-tie structure have been developed to study networks that are organized in a hierarchically modular fashion and facilitate distributed information processing [1618]. In the same line, [19] developed the hourglass analysis framework to study hierarchical dependency networks, especially for networks with a relatively higher number of inputs and outputs mediated through a much smaller set of intermediate modules. The hourglass effect has been observed in networks from various domains in biology including metabolism [16, 20, 21], neuronal structure for visual-cognitive tasks [22]. The hourglass framework identifies a set of core nodes (known as the τ-core) in a source-target dependency network based on the path centrality metric [19], returning a small set of critical nodes through which most input to output pathways pass. Additionally, [19] also developed the H-score metric to quantify the extent to which a given network shows the hourglass effect.

Among the neuronal networks in biology, the connectome of the Caenorhabditis elegans (C. elegans) has been of particular interest. It is the only fully-mapped structural connectome among all organisms with a nervous system, enabling the study of structural properties of neuronal networks, many of which could be generic to systems as complex as the human brain [23]. Recent work has pointed out that the C. elegans nervous system can be viewed more as a distributed information processing system [24]. The aim to identify critical neurons in the connectome has thus focused on creating measures that take an integrated picture from the whole network while computing the critical neurons and analyzing the properties of the network as a whole [25]. Had performed the hourglass analysis on the C. elegans connectome by considering it as an information processing system with an environmental input received by the input nodes (sensory neurons), processed by an intermediate network of nodes (inter-neurons) to generate an appropriate output response through output nodes (motor neurons). The hourglass analysis framework was extended to handle feedback loops in the given network doing away with the assumption of the network representing the information processing system being a directed acyclic graph (DAG) [25]. However, all the studies analyzing the hourglass properties have been done using the unweighted hermaphrodite connectome of the C. elegans published by [23]. Recently [26], published a new connectome of the C. elegans with datasets for both male and hermaphrodite sexes along with the weights for each connection based on the number and size of neuronal synapses between a pair of neurons as seen in electron microscopy (EM) series.

In this paper, we firstly present a new framework to study the hourglass effect in weighted networks in the context of the C. elegans connectome. The importance of taking weights in a network into account is well known [27, 28]. For the first purpose of the weighted hourglass analysis, we develop a method named multi-edge transformation (MET) which involves performing hourglass analysis on a weight-based transformation of the original network. An important concept in identifying the “waist” or core nodes in the hourglass analysis is the path centrality metric [19, 25], which was earlier defined only for unweighted networks. The MET method presented in this paper redefines the path centrality of a node by taking into account the weights of edges along the source-target (sensory-motor) paths that traverse through the node. Thus MET assigns importance to nodes in the network based on not merely the number of source-target paths passing through them but also the extent to which these paths contain highly weighted edges. Our results on the C. elegans connectome data show that the weighted version (MET) of the hourglass analysis identifies a more specific yet not fully overlapping set of core nodes (neurons) as compared to the unweighted version (UNW). We also show through randomization experiments that the hourglass effect exhibited by the weighted networks at hand is not due to chance, as the empirical network had a significantly higher H-score compared to the corresponding networks generated through the randomization procedure.

Secondly, we also present a comparison between the male and hermaphrodite connectomes in terms of their hourglass properties. By using both the unweighted and weighted versions of the hourglass analysis on the male and hermaphrodite connectomes, we show that the neurons that form the core of the network structure differ significantly between the male and hermaphrodite, both in location and function. Almost all of the neurons that form the core of hermaphrodite were located in the head and are involved in functions like information integration, behavioral response, locomotion, and commanding. However, a large portion of the core neurons of male connectomes are in the pre-anal ganglion and involved in various functions related to mating behavior. While [26] had also observed differences between the male and hermaphrodite connectomes in terms of dedicated sex neurons in the male for certain reproductive functions, our work shows that the sex-specific neurons in the male are actually at the core of network structure, and these interneurons are highly intervened in a lot of sensorimotor pathways.

Taking together, our work presents a way to analyze a given weighted network for a small set of core nodes through which most of the input-output pathways pass, making them essential for most sensorimotor activity in the specific case of neuronal connectomes. The application of this framework on the C. elegans connectome shows that there are significant differences in the way the male and hermaphrodite connectomes are organized, especially in terms of the set of neurons at the core of end-to-end neuronal pathways. This difference pattern between the sexes persists irrespective of whether the connectome is considered as a weighted or an unweighted network. Our work opens a new possibility of how modern network analysis frameworks can be used to understand subtle structural differences that may play a part in defining the differences in behavioral characteristics of organisms.

Materials and methods

Datasets/Connectomes

Datasets with neuronal networks of the C. elegans for both male and hermaphrodite sexes from [29] were used in this study, referred to as maleCook and hermCook in the paper. Directed networks were built from the dataset with all chemical synapse connections between pairs of neurons represented as directed edges, and the number of synapses in each connection was used as the weight of each edge. While the dataset also provides information about the size of the chemical synapses, it is yet unclear how to interpret the influences that the size of a chemical synapse has in the neuronal signal transmission, thus we interpreted only the number of synapses between neurons as edge weight, and built our following models based on this interpretation. Only non-pharyngeal neurons were used to create the networks with each neuron categorized into sensory (S), intermediate (I), and motor (M) neurons. The number of neurons in each category for both male and hermaphrodite connectomes is detailed in Table 1. Hereafter, the term connectome would refer to the sub-network consisting of only S, I, and M neurons, unless mentioned otherwise.

thumbnail
Table 1. Number of nodes (neurons) and edges (connections) in the networks generated by the three connectomes.

https://doi.org/10.1371/journal.pone.0249846.t001

For comparison purposes, we also include a previously widely-used unweighted connectome from [23](termed as hermVarshney). The details of number of neurons and edges in the networks generated by these connectomes can be found in Table 1. hermCook consists of considerably larger number of edges than hermVarshney, and the weights of the edges unique to hermCook were found to be significantly weaker than the weights of all the edges in hermVarshney (p-value <10−76), as shown in Fig 1.

thumbnail
Fig 1. Distribution of edge weights for the edges unique in hermCook and edges common to both networks.

Vertical dashed lines show μint (green) and μuniq (red), which are the average weights (count of synapses) of edges common to both hermCook and hermVarshney, and unique to hermCook, respectively. The additional edges in the hermCook connectome are significantly weaker (p-value <10−76) than the edges that are common between both the connectomes.

https://doi.org/10.1371/journal.pone.0249846.g001

Preprocessing

There are no nodes that play a dual role (out of S, I, M) as per the provided data in [29]. Four neurons in the male connectome have been mentioned in the cell list but not present in the connectome, which include two male-specific neurons ‘CA01’ (inter) and ‘CP00’ and two sex-shared neurons ‘BAGR’ (sensory), ‘ADAR’ (inter). Neurons near the pharynx were previously found to form a self-contained, autonomously acting nervous system [26]. Therefore, we include only non-pharyngal neurons in the hourglass analysis. After these processes, one neuron in the male connectome, ‘URYDL’ (sensory) was found to be not connected to other remaining neurons. We exclude it in further analysis.

The details of number of edges for all connectomes after feedback edge removal can be found in Table 1. Table 2 shows the proportion of the 9 possible directed edge directions between S, I and M neurons. The feedback edges, which include IS, MS and MI connections, were removed while doing the analysis. This was done to ensure efficient source-target path computation for the hourglass analysis [25].

thumbnail
Table 2. Proportion of connections between sensory (S), inter (I) and motor (M) neurons for all connectomes.

https://doi.org/10.1371/journal.pone.0249846.t002

The data provided in [29] also includes a separate section on sex-specific neurons. The male connectome has 84 sex-specific neurons leaving out muscles, categorized into sensory or inter neurons while the hermaphrodite connectome has only 8 sex-specific neurons (HSNL, HSNR, VC01, VC02, VC03, VC04, VC05, VC06), all of which are motor neurons.

Hourglass analysis on unweighted networks

The hourglass analysis as described in [19] is a framework to analyze a network in terms of nodes that are responsible for most of the information flow between the source (or input) and target (or output) nodes of a network. If only a few nodes are a part of majority of the paths connecting the source and target nodes, then the network is said to exhibit hourglass property, and these nodes can be said to constitute the core of the network, similar to the waist of an hourglass. The following subsections detail out the mathematical formulation of this concept by defining the relevant metrics and methods to compute them.

Generating source-target paths.

If the C. elegans connectome is thought of as an information processing system to be analyzed in terms of the hourglass property, it can be done in terms of the number of source-target paths (S-T paths) i.e. paths connecting a source node to a target node in the network. Nodes through which a larger proportion of S-T paths pass can be considered as nodes of importance for the information processing system. Among all S-T paths present in a network, usually only a subset of the paths would be used in the information flow, which requires domain-specific knowledge to determine. In terms of C. elegans neuronal network, previous work [25] proposed a series of path constraints based on consideration of routing efficiency (top shortest paths) or diffusion efficiency (paths with limited lengths).

In this study, we used the set of S-T paths that are at most 4 hops in length. We restrict the set of paths in terms of maximum length rather than selecting the shortest ones because there is no biological evidence so far supporting the neuronal network’s ability to route information through the shortest paths—which requires each neuron to have complete information about the connectome. Moreover, previous work on the C. elegans connectome evaluated a number of sensory-to-motor neuron path selection schemes—without noticing major differences [25]. We rely on this path selection scheme for brevity as it seems more plausible for neuronal communication. We use the notation P4 to denote this set of paths. Hereafter, the term S-T paths will be used to refer to the set of paths from source to target nodes that belong to the P4 paths.

τ-core and path centrality.

There are numerous metrics for examining the extent to which a node plays a role in information transfer in the network. While metrics like degree and eigenvector centrality are based on local properties, centrality metrics that are informed from the network in its entirety are also used for paradigms involving information flow. In the context of the C. elegans neuronal connectome, a relevant metric should account for the fact that the network function involves transfer of neuronal signals between various parts of the nervous system. We used the path centrality metric in this study for formulating the centrality of a node in the network. With the set of paths identified, the next step is to find the core of the network. This is equivalent to finding τ-core, defined as the smallest set of nodes covering a fraction τ (the path-coverage threshold) of the entire set of paths. Unfortunately, finding τ-core is an NP-Complete problem [19]. Nevertheless, by defining the concept of the path centrality we could build a greedy heuristic approach to compute the approximately optimal solution.

The path centrality P(v) of a node v, also known as stress of a node [30], is defined as the number of S-T paths that pass through v. In the context of C. elegans connectome, the path centrality represents the number of sensory-motor neuronal pathways in which a particular neuron participates, thus capturing how important a neuron is to the overall neuronal transmissions in the network.

We could find an approximate solution for computing the τ-core by iteratively adding nodes with the highest path centrality in the network into the core set, starting initially with an empty set. In each step i after a node vi is picked, we compute all the paths in the P4 that could be traversed by node vi, and removed all the nodes and edges in this set of paths. Then, we recalculate the path centrality of each node before adding the next node. This is repeated until at least a fraction τ of S-T paths have been removed (covered). The entire process of obtaining the τ-core using this iterative removal of nodes with highest path centrality is delineated more formally in Algorithm 1.

Algorithm 1. Computation of τ-core using greedy heuristic

1: procedure ComputeCore(τ, PST)

2:  τ ← path-coverage threshold

3:  PST ← set of S-T paths

4:  np ← 0 (number of paths passing through core nodes)

5:  nt ← |PST| (total number of S-T paths)

6:   C ← ∅ (set of core nodes initiated with empty set)

7:  while np < τ nt do

8:   PC ← [P(v) ∀ vV, vC]   ⊳ Path centralities of nodes which are not in the core set

9:   bestNodeargmaxv PC   ⊳ Select node with the highest path centrality

10:   CC ∪ {bestNode}   ⊳ Add the node to set of core nodes

11:   Pbest ← {ppPST, bestNodep}   ⊳ Create a set of paths passing through bestNode

12:   npnp + |Pbest|

13:   PSTPST\Pbest ⊳ Remove all paths passing through the newly added core node

14:  end

15:   return C

Hourglass score.

While the computation for the τ-core of a network provides a set of nodes through which a large fraction (τ) of S-T paths pass, to evaluate the hourglass property of a network we still need a reference with which the size of the core can be compared. This is because while the τ-core can be computed for a very large τ close to 1, there is no gaurantee whether the size of this τ-core is small enough to say that the network exhibits an hourglass effect when compared to a network that definitely does not. To resolve this, the H-score (or the Hourglass Score) has been used as a metric involving a comparison between the size of τ-core of the original network G with a derived network Gf known as the flat dependency network. Gf is constructed from G in such a manner that Gf has no hourglass effect while it captures the source-target relationships of G [19]. The procedure is as follows:

  • Nodes The source and the target nodes are the same in Gf as they are in G. However, Gf does not have any intermediate nodes.
  • Edges The edges for Gf are constructed by replacing each S-T path connecting each pair of source node s and target node t in G with an edge from s to t in Gf. Since this can lead to multiple edges between source node s and target node t as there can be multiple S-T paths from s to t, these edges can be replaced with a single edge of weight w, where w is the number of S-T paths between s and t in G.

It should be noted that the Gf is constructed in a manner such that the dependencies between source and target nodes are maintained along with the number of S-T paths between each pair of source-target nodes. However, it does not have any intermediate nodes (or set of nodes) through which most of the S-T paths would pass. In other words, Gf cannot exhibit hourglass property by design. Also, τ-core of Gf cannot be larger than the τ-core of G. This is because the τ-core of Gf is made up of only source and target nodes while the τ-core of G can have intermediate nodes replacing multiple source or target nodes in the core set. Therefore we can say that, (1) where C(τ) and Cf(τ) are the sizes of the τ-core of G and Gf respectively. S and T represent the set of source and target nodes respectively. The H-score is defined using this premise to measure the extent to which the network G shows the hourglass effect. The H-score H(τ) of a given network and threshold τ is defined as: (2)

It can be inferred that 0 ≤ H(τ) ≤ 1. The H-score is closer to one if the size of the τ-core of the original network G is much smaller than that of Gf, thus implying that G shows stronger hourglass effect because of a smaller core set compared to Gf.

Randomization experiment—Edge shuffling.

The H-score metric captures the extent to which the hourglass effect is present in a given network. However, we cannot rule out the possibility of the hourglass effect arising due to random artifacts, while not being a property of the topology of the particular network. To check this on unweighted networks, we performed a randomization procedure as in [25] for testing the statistical significance of the hourglass effect. This procedure shuffles the network edges while preserving total number of nodes and edges, in-degree of all nodes and the partial ordering of the nodes in the network [25].

Hourglass analysis on weighted networks

The aforementioned hourglass analysis pipeline could be further extended by utilizing the edge weight information to provide a more accurate estimation of the hourglass property of a network. The core idea is that we could assign a weight to each path for estimating the relative importance of each path in the information flow in a network, followed by the computation of path centrality and τ-core based on it. The following subsections detail the formulation of weighted hourglass analysis.

Path weighting with multi-edge transformation.

For each path in the candidate paths identified through path selection (P4 in this case), we assign a path weight based on the weight of edges in involved in the given path. Mathematically, we define the path weight as (3) where wp is the weight of a path p, e1, …en are the edges in p and denotes their corresponding weights. f can be any function that accepts a randomly-large set of values and returns a single value. Different definitions of the edge weights may expect different definitions of f. For example, in networks where edge weight represents the flow through the edge, f could be defined as taking the minimum among edges weights, so that the weight of the path represents the maximal possible flow along the path. If the weight of the edge represents the efficiency of communication between nodes, f could be taking the inverse of the sum of the inverse of the inputs, so that the weight of path represents the communication efficiency between source and target.

In the case of C. elegans neuronal network, the weight of an edge represents the number of synapses in each connection. Therefore, we introduce the multi-edge transformation (MET) model, in which each edge in the network with weight w is replaced of w parallel edges of weight 1. As a result, each path p identified in the original hourglass analysis will be counted wp times in the weighted hourglass analysis, with wp being the number of unique combinations of edges that could form the path. Mathematically, we have (4)

Besides intuitive biological interpretation, the MET model has several advantages mathematically: 1) It can reduce to the original hourglass analysis when the weight of edges in the network is equal to 1; 2) The weight of any edges in a path is important for determining the weight of the path.

Weighted path centrality and τ-core.

If we interpret the path weight as the importance of a path, we could further modify the definition of core in hourglass analysis to be the minimum set of nodes that play a major role in the information flow in the neuronal network. In other words, we define τ-core to be the smallest set of nodes that cover paths that explain more than τ ratio of total path weights in the network.

Again we can use the weighted path centrality to build a heuristic algorithm to compute the τ-core. For a network with PST as the set of S-T paths, we define the weighted path centrality of a node v as (5)

To visualize this concept, Fig 2 shows an example of a network and Table 3 values shows the path centrality computed for each of the nodes in this network, for both unweighted hourglass analysis and weighted hourglass analysis.

thumbnail
Fig 2. A sample weighted network with 10 nodes.

Source, intermediate, and target nodes are represented in green, blue, and red colors respectively. It should be noted that node d has the highest path centrality as per the unweighted hourglass model since most of the paths pass through it. However, if the weights are considered and the multi-edge transformation (MET) model is used, then nodes f and g would be the nodes with the highest path centrality and forming the core.

https://doi.org/10.1371/journal.pone.0249846.g002

There may be nodes in the network which would not be a part of the core in an otherwise unweighted network with the same set of nodes and edges, but could still be responsible for most of the source to target pathways when the network is imagined in terms of the number of synaptic connections between neurons. Thus transforming the graph based on the weights i.e. number of synapses between neurons essentially captures this notion.

Randomization experiment—Edge weight permutation.

To test the statistical significance of H-score against randomized networks [25], had done a randomization analysis in their work which involved rearranging the network edges. For weighted networks, however, we could reduce the variance brought by randomization by using edge weight permutation. This can be done by shuffling the weight of the edges while keeping the structural topology of the network intact. This randomization allows us to more precisely evaluate the hourglass effects of the empirical network caused only by the weights instead of the network topology. We compared the empirical network with 500 such randomized networks generated by edge-weight permutation to check the statistical significance of the hourglass effect in the networks.

Results

Hourglass effect in weighted and unweighted networks

We apply the hourglass analysis according to the framework mentioned in subsection Hourglass analysis on unweighted networks using the routing scheme P4 for generating source-target paths as described in subsection Generating source-target paths. Let P denote the set of paths generated by this routing scheme, the main aim of the analysis is to compute the smallest set of nodes (neurons) that cover a fraction τ of all the source-target (sensorimotor) paths in P. The path centrality (or coverage) metric is defined in Hourglass analysis on unweighted networks subsection to quantify the importance of a given node (or set of nodes) in the S-T information flow pathways in the network. To measure the extent to which a network exhibits the hourglass effect in its structure, the H-score is computed as detailed in subsection Hourglass analysis on unweighted networks. This subsection deals with comparison between the weighted (MET method) and unweighted (UNW) versions of the hourglass analysis for the C. elegans connectome, while the comparison between the hermaphrodite and male sexes is done in the subsequent subsection (Sex-wise comparison of the hourglass effect).

Fig 3 summarizes the results for the hourglass analysis using the unweighted (UNW) framework on hermCook and hermVarshney connectomes, as well as the weighted hourglass framework (MET) on the hermCook connectome. As detailed below, the weighted version differs from the unweighted version in terms of various properties in the hourglass analysis.

thumbnail
Fig 3. Comparison of hourglass effects in hermVarshney, hermCook with or without MET transformation.

UNW: Unweighted version of hourglass analysis. MET: Mutli-edge transformation method of hourglass analysis for weighted networks. As described in subsections Hourglass analysis on unweighted networks and Hourglass analysis on weighted networks, hourglass analysis was performed to compute (a) the set of core nodes (neurons) in the τ-core at τ = 0.9 along with their path coverage as fraction of S-T paths that pass through the node, (b) Cumulative path coverage with the addition of each node in decreasing order of path coverage, and (c) the H-score metric at various values of τ to quantify the extent to which the network shows the hourglass effect.

https://doi.org/10.1371/journal.pone.0249846.g003

In both weighted and unweighted cases, the networks do show the hourglass effect with a high H-score (Fig 3(c)). It can be noted in Fig 3(c) that with additional weaker edges, the H-score using UNW method for hermCook is much lower than that for hermVarshney, while the H-score for MET framework is much higher than that of the UNW framework on hermCook. Even though the UNW analysis on hermCook seems to indicate a much weaker hourglass effect, taking weights into account in the MET framework uncovers a much stronger hourglass structure in the same connectome (hermCook).

To identify the neurons that form the important nodes in the hourglass structure of the network, the τ-core was computed as a set of iteratively selected nodes in the network that have the highest path coverage (see subsection τ-core and path centrality and Algorithm 1). The nodes in the τ-core can be considered to form the ‘waist’ of the hourglass structure in the network. Comparing the τ-core for the UNW and MET cases (Fig 3(a)), the MET method results in a more specific set of core nodes than the UNW case for hermCook, while still retaining the set of essential high-centrality nodes. Moreover, the nodes corresponding to the pair of neurons RIAR and AVAL/R show much higher hourglass effect in the case of weighted analysis than in the case of unweighted analysis. The UNW method for hermCook identifies a much broader set of nodes compared the the hermVarshney connectome, which could be attributed to the additional edges in the hermCook connectome with weaker weights, thus weakening the hourglass structure. However, as is observable in the cumulative path coverage plots as well (Fig 3(b)), the weighted version of the analysis (MET) identifies a more specific set of nodes with much higher path coverage. In the case of the weighted hourglass analysis using MET method, the curve has steeper slope than in other cases, implying that certain nodes have much higher path centrality in the weighted analysis and less nodes are needed to cover the same fraction τ of all the weighted S-T paths.

Thus, when taking the weights into account, a stronger hourglass structure in the network can be unravelled, consisting of a smaller yet not fully overlapping set of core nodes compared to unweighted analysis. The MET method has the highest H-score for larger values of τ (0.5 < τ < 1) despite the lowest values for UNW method on the same hermCook dataset. This further indicates that a framework considering the connectome as a weighted network reveals very different information about the hourglass properties of the network structure.

Sex-wise comparison of the hourglass effect

To study the differences between the hourglass properties of the hermaphrodite (hermCook) and male maleCook connectomes, the results from weighted hourglass analysis using the MET method were analyzed in terms of their core neurons (i.e., nodes in the τ-core at τ = 0.9), cumulative path coverage and the H-score metrics. The connectomes maleCook and hermCook are visualized in Fig 4 with node-size representing the path centrality of the nodes as computed using the multi-edge transformation (MET) method. Fig 5 summarizes these results for comparison between the hermaphrodite and male connectomes. It can be noticed in Fig 5(a) that the set of nodes in the τ-core have four common nodes between the two connectomes, corresponding to the neurons RIAL/R and AVAL/R which are ring and ventral chord inter-neurons located in the head, respectively. However, the hermaphrodite connectome has four unique nodes in its τ-core corresponding to the neurons AVEL/R, AIBR and RIMR all of which are inter-neurons located in the lateral ganglia in the head, while the male connectome has five unique nodes corresponding to the neurons PVZ, PVV, PDB, PVX and PHCR. These neurons are mainly located in the pre-anal ganglion and involved in various functions related to mating including posture control, vulva detection and copulation [31]. These results are detailed in Table 4.

thumbnail
Fig 4. A visualization of (a) hermCook and (b) maleCook connectomes.

Each node represents a neuron in the connectome, with its color representing the type of neuron (red: sensory neuron, green: inter-neuron, blue: motor neuron), and its transparency representing the path centrality calculated using the MET method. The vertical position of nodes are calculated based on the location metric in [25], which measures a node’s relative position in the feedforward network. Horizontally, nodes are placed in the manner that nodes with higher path centrality are closer to the center. Each edge represents the collection of chemical synapses from one neuron to another, and the transparency indicates the count of chemical synapses in collection. The nodes forming the τ-core (at τ = 0.9) as described in subsection Hourglass analysis on weighted networks are labeled with their names.

https://doi.org/10.1371/journal.pone.0249846.g004

thumbnail
Fig 5. Sex-wise comparison of the hourglass effects on maleCook, hermCook connectomes.

The multi-edge transformation (MET) was used as described in subsection Hourglass analysis on weighted networks to compute (a) the set of core nodes (neurons) in the τ-core at τ = 0.9 and their path coverage as fraction of S-T paths passing through the node, (b) Cumulative path coverage with the addition of each node in decreasing order of path coverage, and (c) the H-score metric to quantify the extent to which the network shows the hourglass effect.

https://doi.org/10.1371/journal.pone.0249846.g005

thumbnail
Table 4. Neurons in the τ-core at τ = 0.9 in the weighted hourglass analysis on hermCook and maleCook connectomes.

https://doi.org/10.1371/journal.pone.0249846.t004

Comparison with random networks

To check whether or not the hourglass effect is due to random effects, a comparison with randomized networks was done. The weighted connectomes were randomized by shuffling the edge weights, while keeping the set of edges the same to ensure that the structure of the network is maintained. Weighted hourglass analysis using the MET method was done on a set of 500 such networks and H-scores were computed for each of them at the same path-coverage threshold as used for the original networks (τ = 0.9). Such a comparison between the H-score of the original connectomes with the corresponding set of randomized connectomes for each of the sexes can tell whether the hourglass effect observed is because of random effects simply due to the network structure or because of a more specific pattern of weights and S-T paths that leads the network to exhibit the hourglass property. It should be noted that for unweighted networks, the randomization procedure carried out by [25] suffices as a check, but it is not the minimal randomization in the case of weighted networks.

The difference between H-score for the 500 random networks and the real network is statistically significant (with p-value <10−2) for both the male and hermaphrodite connectomes when using the MET method as well as the UNW method. Fig 6 shows the distribution of the H-score curves of randomized networks and original network.

thumbnail
Fig 6. Hourglass scores of the empirical network against networks generated with edge weight permutation.

For testing the results of the MET method in both (a) hermCook and (b) maleCook connectomes, only the edge weights were shuffled keeping the edge structure intact to generate 500 randomized networks in each case. Hourglass analysis on in all the above cases reveals significantly lower (p-value <10−2) H-score than the corresponding original network.

https://doi.org/10.1371/journal.pone.0249846.g006

Discussion

In this work, we perform a comparison between the male and hermaphrodite structural connectomes of the C. elegans by developing a framework for studying the hourglass effect in weighted networks. Until recently, the C. elegans connectome was available only as an unweighted network for hermaphrodite. But with the recent advances [26], the connectome has been mapped for both the sexes along with edge weights based on the number and size of the synapses as seen with electron microscopy (EM). Our results show that there is a significant difference between the hourglass properties in the male and hermaphrodite connectomes, both in terms of the set of nodes that form the core of the network as well as the extent to which the network shows the hourglass effect.

Moreover, the hourglass analysis framework for weighted networks is able to narrow down the core nodes to a much smaller number of nodes in the network, which could mainly be because of taking the edge-weights into consideration in the computation of the centrality of the source-target pathways in the network. We had used five different networks created from two datasets. The widely used dataset by [23] only has the unweighted connectome for the hermaphrodite, while the remaining four networks were the weighted/unweighted networks from the male/hermaphrodite connectomes published by [26]. Among these connectomes, the variation within the same sex based on the dataset and choice of analysis (MET vs UNW) can be observed in Fig 3. In terms of the core nodes, the multi-edge transformation (MET) method for hourglass analysis results in a much smaller set of core nodes compared to the unweighted version (UNW) for both the sexes (Fig 5(a)). Additionally, the cumulative path coverage and H-score plots (Fig 3(b) and 3(c)) also indicate a similar trend that in the weighted paradigm of the analysis, the hourglass effect is more visible in the weighted connectomes as compared to the unweighted case. This could primarily be because of the enhancement of the centrality of certain neuronal pathways due to their predominant use in the organism as compared to others, thus leading to a higher number and size of synaptic connections between the participating neurons. This trend of significantly higher connectivity (and synaptic connections) of certain neurons and their pathways has been observed previously [32]. It can be further observed that the hourglass metrics (cumulative path coverage and H-score) are higher for the unweighted analysis (UNW) on the older dataset by [23] compared to the same analysis on the unweighted connectomes by [26] (See Fig 3(b) and 3(c)). The additional set of edges provided in the connectomes by [26] are significantly weaker compared to the older connectome by [23] (Fig 1), which could explain the emergence of a weaker hourglass effect when using the unweighted analysis (UNW). However, the same metrics under the weighted analysis (MET) are higher for the dataset by [26] indicating that the weighted framework is able to take into account the existence of numerous weaker connections which could otherwise change the result in case of the unweighted version of the analysis.

The comparison between the male and hermaphrodite connectomes reveals interesting patterns about how the neuronal networks in both the sexes are organized. While the hermaphrodite shows a stronger hourglass effect than the male, they have the same core size containing 9 neurons each (Fig 5(a)). However, in the case of the male connectome, the sex-specific neurons form the majority of the core nodes. This is in line with the observations from previous studies that the pruning of synapses, that occurs at sexual maturation, affects a lot of sensorimotor pathways leading to a sexually dimorphic neuronal connectivity in the organism [33]. Moreover, previous work has found that the posterior nervous system in the males gets enlarged to support mating behavior, adding about 30% more neurons with parallel pathways and complex connectivity, matching that of the whole nervous system of the hermaphrodite [34]. Our results additionally show that apart from just the addition of the sex-specific subnetwork, the sexual dimorphism in the C. elegans also results in significantly different set of neurons forming the core of the sensorimotor information processing pathways in the connectome.

Through this work, we highlight a method to analyze a given weighted network for its hourglass property using an algorithmic approach to quantify its extent. The application of the framework to the weighted C. elegans connectomes for both the sexes reveals differences in the composition of the neurons that form the core of the hourglass structure in both the networks. Additionally, we show that when taking edge weights into account using the multi-edge transformation (MET) hourglass framework, different properties of the network emerge in terms of the core nodes as well the scale of the hourglass effect.

Various methods have been used previously for identifying critical neurons in the C. elegans connectome, such as graph perturbation techniques [35] and controllability frameworks [36]. The proposed framework in our paper is applicable only to directed networks with a given set of paths from source nodes to target nodes. The network may include cycles and the nodes may be of the same or different type—the hourglass method is applicable as long as there is a given set of directed paths from source to target nodes. A more extensive comparison between the hourglass framework and other related methods, such as core-periphery or rich-club approaches [3740] can be found in [19].

More broadly, there are many approaches to identify important nodes (or rank them) in general networks [18, 39, 41]. The hourglass analysis framework is fundamentally different from approaches that are degree-based (such as k-core decomposition), that only consider shortest paths (such as betweenness centrality), or that consider all possible paths (potentially weighted based on their length, such as Katz centrality). A comparative study of existing and extended frameworks along with an interpretation of the outcomes should be done in the near future to enhance the understanding of the nature of optimization followed by naturally occurring neuronal networks.

Acknowledgments

The authors are thankful to Prof. Scott Emmons from Albert Einstein College of Medicine, Bronx, NY, USA for help with clarifications on the C. elegans connectome dataset.

References

  1. 1. Ravasz E, Somera AL, Mongru DA, Oltvai ZN, Barabási AL. Hierarchical organization of modularity in metabolic networks. science. 2002;297(5586):1551–1555. pmid:12202830
  2. 2. Ravasz E, Barabási AL. Hierarchical organization in complex networks. Physical review E. 2003;67(2):026112. pmid:12636753
  3. 3. Sales-Pardo M, Guimera R, Moreira AA, Amaral LAN. Extracting the hierarchical organization of complex systems. Proceedings of the National Academy of Sciences. 2007;104(39):15224–15229. pmid:17881571
  4. 4. Meunier D, Lambiotte R, Bullmore ET. Modular and hierarchically modular organization of brain networks. Frontiers in neuroscience. 2010;4:200. pmid:21151783
  5. 5. Gupte M, Shankar P, Li J, Muthukrishnan S, Iftode L. Finding hierarchy in directed online social networks. In: Proceedings of the 20th international conference on World wide web; 2011. p. 557–566.
  6. 6. Yerra BM, Levinson DM. The emergence of hierarchy in transportation networks. The Annals of Regional Science. 2005;39(3):541–553.
  7. 7. Kirsten H, Hogeweg P. Evolution of networks for body plan patterning; interplay of modularity, robustness and evolvability. PLoS Comput Biol. 2011;7(10):e1002208.
  8. 8. Kitano H. Biological robustness. Nature Reviews Genetics. 2004;5(11):826–837. pmid:15520792
  9. 9. Dovrolis C, Streelman JT. Evolvable network architectures: What can we learn from biology? ACM SIGCOMM Computer Communication Review. 2010;40(2):72–77.
  10. 10. Lorenz DM, Jeng A, Deem MW. The emergence of modularity in biological systems. Physics of life reviews. 2011;8(2):129–160. pmid:21353651
  11. 11. Verd B, Monk NA, Jaeger J. Modularity, criticality, and evolvability of a developmental gene regulatory network. Elife. 2019;8:e42832. pmid:31169494
  12. 12. Junker BH, Koschützki D, Schreiber F. Exploration of biological network centralities with CentiBiN. BMC bioinformatics. 2006;7(1):219. pmid:16630347
  13. 13. Crucitti P, Latora V, Porta S. Centrality in networks of urban streets. Chaos: an interdisciplinary journal of nonlinear science. 2006;16(1):015113.
  14. 14. Jeong H, Mason SP, Barabási AL, Oltvai ZN. Lethality and centrality in protein networks. Nature. 2001;411(6833):41–42. pmid:11333967
  15. 15. Koschützki D, Schreiber F. Comparison of centralities for biological networks. In: German Conference on Bioinformatics 2004, GCB 2004. Gesellschaft für Informatik eV; 2004.
  16. 16. Zhao J, Tao L, Yu H, Luo J, Cao Z, Li Y. Bow-tie topological features of metabolic networks and the functional significance. Chinese Science Bulletin. 2007;52(8):1036–1045.
  17. 17. Friedlander T, Mayo AE, Tlusty T, Alon U. Evolution of bow-tie architectures in biology. PLoS Comput Biol. 2015;11(3):e1004055. pmid:25798588
  18. 18. Mattie H, Engø-Monsen K, Ling R, Onnela JP. Understanding tie strength in social networks using a local “bow tie” framework. Scientific reports. 2018;8(1):1–9. pmid:29921970
  19. 19. Sabrin KM, Dovrolis C. The hourglass effect in hierarchical dependency networks. Network Science. 2017;5(4):490–528.
  20. 20. Zhao J, Yu H, Luo JH, Cao ZW, Li YX. Hierarchical modularity of nested bow-ties in metabolic networks. BMC bioinformatics. 2006;7(1):1–16. pmid:16916470
  21. 21. Tanaka R, Csete M, Doyle J. Highly optimised global organisation of metabolic networks. IEE Proceedings-Systems Biology. 2005;152(4):179–184. pmid:16986258
  22. 22. Quiroga RQ, Reddy L, Kreiman G, Koch C, Fried I. Invariant visual representation by single neurons in the human brain. Nature. 2005;435(7045):1102–1107. pmid:15973409
  23. 23. Varshney LR, Chen BL, Paniagua E, Hall DH, Chklovskii DB. Structural properties of the Caenorhabditis elegans neuronal network. PLoS computational biology. 2011;7:2. pmid:21304930
  24. 24. Kaplan HS, Nichols AL, Zimmer M. Sensorimotor integration in Caenorhabditis elegans: a reappraisal towards dynamic and distributed computations. Philosophical Transactions of the Royal Society B: Biological Sciences. 2018;373(1758):20170371. pmid:30201836
  25. 25. Sabrin KM, Wei Y, van den Heuvel M, Dovrolis C. The hourglass organization of the Caenorhabditis elegans connectome. PLoS computational biology. 2020;16(2):e1007526. pmid:32027645
  26. 26. Cook SJ, Crouse CM, Yemini E, Hall DH, Emmons SW, Hobert O. The connectome of the Caenorhabditis elegans pharynx. BioRxiv. 2019; p. 868513.
  27. 27. Opsahl T, Agneessens F, Skvoretz J. Node centrality in weighted networks: Generalizing degree and shortest paths. Social networks. 2010;32(3):245–251.
  28. 28. Farkas I, Ábel D, Palla G, Vicsek T. Weighted network modules. New Journal of Physics. 2007;9(6):180.
  29. 29. Cook SJ, Jarrell TA, Brittin CA, Wang Y, Bloniarz AE, Yakovlev MA, et al. Whole-animal connectomes of both Caenorhabditis elegans sexes. Nature. 2019;571(7763):p.63. pmid:31270481
  30. 30. Ishakian V, Erdös D, Terzi E, Bestavros A. A framework for the evaluation and management of network centrality. In: Proceedings of the 2012 SIAM International Conference on Data Mining. SIAM; 2012. p. 427–438.
  31. 31. Susoy V, Hung W, Witvliet D, Whitener JE, Wu M, Graham BJ, et al. Natural sensory context drives diverse brain-wide activity during C. elegans mating. bioRxiv. 2020;.
  32. 32. Liu H, Kim J, Shlizerman E. Functional connectomics from neural dynamics: probabilistic graphical models for neuronal network of Caenorhabditis elegans. Philosophical Transactions of the Royal Society B: Biological Sciences. 2018;373(1758):20170377. pmid:30201841
  33. 33. DiLoreto EM, Chute CD, Bryce S, Srinivasan J. Novel Technological Advances in Functional Connectomics in C. elegans. Journal of developmental biology. 2019;7(2):8. pmid:31018525
  34. 34. Jarrell TA, Wang Y, Bloniarz AE, Brittin CA, Xu M, Thomson JN, et al. The connectome of a decision-making neural network. science. 2012;337(6093):437–444. pmid:22837521
  35. 35. Kim S, Kim H, Kralik JD, Jeong J. Vulnerability-based critical neurons, synapses, and pathways in the Caenorhabditis elegans connectome. PLoS computational biology. 2016;12(8):e1005084.
  36. 36. Yan G, Vértes PE, Towlson EK, Chew YL, Walker DS, Schafer WR, et al. Network control principles predict neuron function in the Caenorhabditis elegans connectome. Nature. 2017;550(7677):519–523. pmid:29045391
  37. 37. Borgatti SP, Everett MG. Models of core/periphery structures. Social networks. 2000;21(4):375–395.
  38. 38. Csermely P, London A, Wu LY, Uzzi B. Structure and dynamics of core/periphery networks. Journal of Complex Networks. 2013;1(2):93–123.
  39. 39. Rombach MP, Porter MA, Fowler JH, Mucha PJ. Core-periphery structure in networks. SIAM Journal on Applied mathematics. 2014;74(1):167–190.
  40. 40. Holme P. Core-periphery organization of complex networks. Physical Review E. 2005;72(4):046111. pmid:16383472
  41. 41. Maesa DDF, Marino A, Ricci L. The bow tie structure of the bitcoin users graph. Applied Network Science. 2019;4(1):1–22.