Figures
Abstract
Background
Several studies have mentioned network modularity—that a network can easily be decomposed into subgraphs that are densely connected within and weakly connected between each other—as a factor affecting metabolic robustness. In this paper we measure the relation between network modularity and several aspects of robustness directly in a model system of metabolism.
Methodology/Principal Findings
By using a model for generating chemical reaction systems where one can tune the network modularity, we find that robustness increases with modularity for changes in the concentrations of metabolites, whereas it decreases with changes in the expression of enzymes. The same modularity scaling is true for the speed of relaxation after the perturbations.
Citation: Holme P (2011) Metabolic Robustness and Network Modularity: A Model Study. PLoS ONE 6(2): e16605. https://doi.org/10.1371/journal.pone.0016605
Editor: Matej Oresic, Governmental Technical Research Centre of Finland, Finland
Received: October 10, 2010; Accepted: December 22, 2010; Published: February 2, 2011
Copyright: © 2011 Petter Holme. This is an open-access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited.
Funding: This work was supported by the Swedish Foundation for Strategic Research, the Swedish Research Council, and the WCU program through the National Research Foundation (NRF) of the Republic of Korea funded by the Ministry of Education, Science and Technology (MEST) (R31-2008-000-10029-0). The funders had no role in study design, data collection and analysis, decision to publish, or preparation of the manuscript.
Competing interests: The author has declared that no competing interests exist.
Introduction
Graph theoretical methods are useful to study the large-scale organization of biological systems [1]. One such system is the metabolism—the set of chemical reactions needed to sustain the normal, healthy state of an organism. We call a graph derived from a metabolic reaction system a metabolic network. One of the main findings from statistical studies of metabolic networks is that the metabolism has larger network modularity [2], [3] —the tendency for a network to be divisible into subgraphs that are densely connected within, and sparsely connected between each other—than expected [4]. However, metabolic networks are far from perfectly modular—no matter how the network modules are defined, there will be plenty of connections between them [4]–[8]. The network modules are often interpreted as biological modules—functionally independent subunits [9]. This interpretation is a natural consequence of interpreting edges as functional couplings of relatively equal strength. Despite the lack of comprehensive experimental evidence, metabolism is assumed to be robust to e.g. changes in concentration of metabolites [10]. Modularity is often thought to contribute to the robustness of various biological systems [11]–[13]. But if this is true for metabolism too, that modularity contributes to both functionality and robustness, then how come there are so many cross-modular couplings? One explanation could be that these couplings are inevitable—the laws of physics give no way of avoiding intermodular connections. Another explanation could be that the intermodular edges actually stabilize the system so that the organization we observe is a compromise where adding functionality increases modularity and adding robustness decreases modularity. Such a role of modularity relates to the concept of distributed robustness [14]—if a module fails, many other modules can collectively compensate for this loss, there need not be any replacement module. In terms of metabolic networks, this means that there will be many connections between the modules and thus that the network modularity will be comparatively low. In this paper we investigate the role of network modularity in large chemical reaction systems as directly as possible—by measuring the system's response to different types of perturbations in a model with tunable network modularity.
Our simulations start by generating a chemical reaction system. This generative algorithm is stochastic and by tuning the input parameters, we can control the expected network modularity (Fig. 1) [15]. Then we generate a random distribution of metabolites and relax the system to equilibrium (using mass-action kinetics with an implicit enzymatic control). From this state, we apply a certain type of perturbation to the system and let it relax to a new equilibrium. To quantify robustness, we measure how close the two equilibria are to each other. We also measure the relaxation time, i.e. how fast the system can respond to the perturbation (and for that reason, we do not employ faster calculations of the equilibrium state [16], [17]). In Fig. 2 we show an example of these steps. As the reaction system is generated by a stochastic method we repeat the procedure above to obtain averages. For each value of the input parameters, we measure average values over 500 realizations of all steps above of both the network modularity and the quantities characterizing robustness. From these data points we derive trends in the modularity-dependence of different aspects of robustness.
In A we see how the two substrates and one product (circles) of a reaction
(triangle) gets reduced to a substance graph. An arrow going into a reaction
marks the substrate, an arrow going out marks the product. Panel B
illustrates a reaction system obtained with the method of the manuscript.
The parameter values for this reaction-system example are
,
,
,
and
. The algorithm proceeds by assembling reactions and
metabolites in disjoint clusters (the three larger clusters of distinct
colors). Then we add a fraction of metabolites and reactions that can
connect to any parts of the system. The larger this fraction of global
reactions is, the lower is the network modularity of the projected
network.
The figure illustrates a reaction system at equilibrium visualized by its reaction graph A, getting perturbed by redistributing the mass of (in this case two) metabolites B and how the system relaxes to another equilibrium (c,d). The concentration is illustrated by the size of the circles (the total mass, not the concentration is conserved, so the total areas of the circles are not the same in the different panels). The change in concentration is indicated by color. A metabolite unaffected by the perturbation is colored black.
Results
Robustness as a function of network modularity
Robustness is a broad concept that hardly can be condensed into one measure, even for a system as specific as metabolism. In general, robustness can be defined as a system's ability to remain unchanged when perturbed. One can imagine several types of perturbations. We investigate two rather different classes—changes in concentrations of metabolites and changes in the reaction system (new reactions replacing old) by genetic control. We refer to the first case as metabolic perturbations and the second as genetic perturbations. We will also distinguish between: if the perturbations are localized to one module, or if they can appear anywhere in the network. In total we consider four classes of perturbations—they can be either localized or global, and metabolic or genetic.
The main robustness measure, defined in the Methods section, is basically the relative change in the
concentration of a metabolite averaged over a set of metabolites. We consider
two such sets, either the whole set of metabolites, which gives the
system-wide robustness
, or the metabolites that are perturbed giving the
focal robustness
.
In Fig. 3, we plot the
average values of our robustness measures as functions of the average network
modularity . The robustness to global metabolic perturbations
increases while the robustness to perturbations within a module remains fairly
constant (Fig. 3A). If one
looks only at the metabolites that were originally perturbed (Fig. 3A), the situation is
different—these metabolites are more affected by sudden shifts in the
concentrations the more modular the system is. This seems logical—if the
modularity is lower, the coupling to the rest of the network is stronger, so
there are more metabolites to influence the relaxation and to absorb the
perturbation. The fact that the system is more robust to global, compared to
localized, perturbations can be explained in a similar way—a localized
perturbation gives a larger impact on a restricted subsystem and this subsystem
cannot absorb that large impact as much as the whole system would. But why does
the system-wide robustness increase with modularity? One scenario is that
metabolic perturbations are better absorbed in a distributed fashion. With
global perturbations and high modularity each module handles its internal
perturbations and, if this fails, flows between the modules are too weak for the
perturbation to spread.
Panels A and B show data for the robustness against metabolic
perturbations. A displays robustness of the system as a whole; B shows
the robustness measured over the perturbed metabolites only. Panels C
and D show the corresponding plots for robustness against genetic
perturbations. Circles represent perturbations made in one module;
crosses indicate data for perturbations made in different modules. The
data is averaged over more than 500 runs (network realizations). The
errorbars in the average are
smaller than the symbol size.
For the genetic perturbations all curves are decreasing, meaning that modularity
makes the system less robust. These perturbations virtually add new reactions
and delete old. Even if the perturbations are designed not to affect the average
structure of the system (keeping e.g. the system size
and the modularity
constant), they
obviously affect
more than the
metabolic perturbations (cf. Fig.
3A and Fig. 3C).
A network module can presumably not handle a genetic perturbation as efficient
as a metabolic perturbation. Another factor for the decreasing
-curve could be that the interface between the modules
can change from the perturbations and that the interfaces get more influential
with increasing modularity. As seen in Fig. 3D, the localized perturbations
influence the directly affected metabolites (the ones that are involved in
reactions changed by the genetic perturbations) less strongly than the global
perturbations. From the changes at the interfaces, we can understand that
localized perturbations affect the rest of the system to a greater deal here
than compared with metabolic perturbations.
is larger for the
local compared with global genetic perturbations meaning that for metabolites
within a single module rewired by genetic perturbations the changes will be
larger than if the perturbations are more distributed.
Relaxation time as a function of network modularity
In Fig. 4, we show the
relaxation time as a function of
modularity. A small
value means that
the system reaches its new equilibrium fast. This dynamic response is different
for the two types of perturbations—the system reaches its new state faster
with higher modularity for the metabolic perturbations, but slower with the
genetic ones. The decreasing
curves for
metabolic perturbations is in line with the above mentioned scenario that if
modules handle the perturbations independently, then the more modular the system
is the better (in this case faster) is the recovery. That, for genetic
perturbations, robustness increases with modularity is something we interpret as
an effect of the changed couplings across at the boundary. The stronger the
modularity is, the slower is the flow between the modules and the longer does
the system need to find a new equilibrium.
Panel A displays the corresponding data for perturbations in the concentrations of metabolites. Panel B shows the relaxation time for genetic perturbations within one module (circles) or the whole system (crosses). The data represents averages over more than 500 runs (the same runs as in Fig. 3).
Discussion
We have, in a model framework, directly measured the effects of network modularity on the robustness of chemical reaction systems. The main conclusion is that modularity does affect robustness but not in a unique way. Modularity is thus, it seems, not a general principle for either strengthening or weakening robustness, not even in such a specific system as metabolism. When relating robustness and modularity, one needs to specify what kind of perturbation we measure robustness against. For sudden changes icn concentration levels, in our model, more modular reaction systems are more robust and converge to an equilibrium state faster than less modular systems. If, on the other hand, the genetic control is altered—so that other enzymes are expressed—then modularity decreases robustness. In an evolutionary perspective, this essentially means that we need more detailed studies. Real metabolic networks are more modular (in the network-modularity sense) than random networks, but still far from, say, a system engineered by humans [18]. One scenario is that robustness is key driving force in evolution of metabolic-network structure and that this weakly modular structure above comes from trade-offs between robustness-increasing and robustness-decreasing changes in modularity. However, functionality and chemical constraint probably also play a major role in this evolution. Note that if one considers smaller feedback loops as modules, rather than network clusters, evolution is by necessity modular in the sense that adding the production of a new substance often needs the addition of its degradation (this is because many substances cannot penetrate the cell membrane and would be toxic if accumulated). The conclusion that modularity does not affect robustness in a single direction has further implications for synthetic biology that often, at least theoretically, strives to design functionality from combination of modules [19], [20]—our study hints the such an approach would not give robustness for free.
For the future, we anticipate more studies cataloguing the principles of robustness, and the effects of modularity. We believe model studies like the present are the best theoretical way to proceed. An alternative is to compare the modularity of different organisms [21] to find changes in the modularity over the course of evolution, but in such an approach it would be hard to tease apart fundamental physical constraints from evolutionary pressure. It would of course also be interesting to experimentally compare the response of different organisms, or cell types, with metabolism of different network modularity to perturbations. Further into the future, we hope for experimental methods to measure the dynamics of the entire chemical composition of cells.
Methods
Notations and mathematical framework
We consider a reaction system of metabolites
and
reactions
. A reaction
is characterized
by its substrates
, their
multiplicities
, its products
and their multiplicities
, and a reaction
coefficient
. Consider, for example, the reaction
2H
+ O
2 H
O. Then we have
,
is H
,
is O
,
,
,
is H
O and
. From a reaction system one can derive a graph
, where
(
in this case) is the set of vertices of the graph and
is the set of edges. One can define several types of
metabolic graphs. In this work we focus on substance graphs (claimed to encode
more functional information about the graphs than other simple-graph
representations [5], [15]), where the vertices are substances and there is an
(undirected) edge between two vertices if they are either substrates or products
of the same reaction (edges between a vertex to itself is not allowed). In the
example above, the reaction will contribute with three
edges—
,
and
—to the
substance graph (see Fig.
1A).
Network modularity
We will shortly discuss how network modularity is calculated. For a more
comprehensive review, see Refs. [2], [3]. Let the vertex set be
partitioned into groups and let denote the
fraction of edges between group
and
. The modularity of this partition is defined as
(1)where the sum is over all the vertex
groups. The term
is the expectation
value of
in a random graph. The measure for graph modularity that
we use is
—
maximized over all
partitions (by a heuristics proposed in Ref. [3]). Comparing
of graphs with different sizes and degree distributions
is not completely straightforward. Even for networks generated by one particular
model (that one would from construction expect to have the same modularity)
can vary with the network size [22]. Fortunately, for this
work, such changes are monotonous. This means that we can use
to detect changes in robustness in response to changes
in modularity even though the particular functional forms of the curves of
robustness vs.
are hard to
interpret.
Model reaction systems with tunable network modularity
In this section, we will sketch the model of reaction systems with tunable
network modularity. The model we use treats atoms of the molecular species
explicitly. The set of all atoms is divided into groups (or
proto-modules) of equal size
.
reactions are added to the system such that they obey
mass conservation (for all atom species, the number of individuals is the same
for substrates and products).
reactions are
added between molecules consisting of atoms from the same group. The remaining
reactions are added between molecules of any atomic
composition. For low
-values, relatively
few reactions will connect different groups and therefore the derived network
modularity will be low. If
is close to one,
the derived graphs will be more modular. The molecules are constructed by
randomly combining atoms of a group. Reactions are generated by randomly
splitting and recombining molecules. If the mass conservation is broken, or the
reaction already exists in the data set, then the molecule construction is
repeated. If no reaction fulfilling mass conservation has been found after
iterations, then this is done by defining new molecules.
With a larger value of
, the substance
graphs will thus be both denser and have fewer metabolites
(
is, perhaps a little unusually, an output of the model,
whereas
is a control parameter).
There are a number of other technicalities, like how the molecules are constructed from the atoms etc., that are explained in detail in Ref. [15]. We also modify the algorithm of Ref. [15] when it comes to inter-group reactions. In Ref. [15] these always act as sources and sinks (so that there is never a flow between modules); here all inter-group reactions are bridges between the modules (so that these reactions have at least one substrate in one group and one product in the other).
In this work we use the parameter values ,
,
and
(the values of the other parameters, related to the
details in Ref. [15] are the same as in that paper).
Reaction kinetics
To simulate the biochemical dynamics, we use simple mass-action kinetics. This
approach is, technically speaking, assuming all enzymatic effects can be encoded
into the reaction coefficients and the reaction system itself. The main reason
for this simplification is that, when speaking about network modularity, enzymes
are usually only included implicitly (via the active reactions), so to relate
the robustness to network modularity we need a kinetic description of the same
level of description. Given a reaction system generated by the scheme above we
assign a rate constant to each reaction
drawn from a normal distribution
(the sign of
defines the
direction of the reaction) and initial concentration
of a substance
in
(setting negative concentrations to zero). From this
starting point, we use the kinetic equation
(2)where the sum is over all reactions
with
as a product,
where
is
's
multiplicity in the reaction
. To simulate the
metabolic flux we also add source and sink terms to Eq. 2 for some metabolites.
We let all the metabolites that are not substrates of any reaction be sinks
(otherwise their mass would just accumulate) and all metabolites that are not a
product of any reaction to be sources. In practice there will always be both
sources and sinks in the generated reaction systems. (If the reaction systems
would be generated in some other way one would need to put in sources and sinks
explicitly.) We model the outflux by letting the sink-metabolites flow out of
the system with a rate proportional to
times the
concentration of the metabolite. In our simulations we use
. We keep the inflow rate the same as the outflow rate so
that the total mass is conserved. The inflow is distributed to the inflow
metabolites in proportion to
, a random variable
for each inflow metabolite drawn from a
distribution when
the reaction system is generated.
From the above setup, we run the system is until it converges (which it always
does for the dynamic systems in question). We integrate the system with the
Euler method (with time step until the time
when
(3)
We use in this simulations. Higher precision in
or
does not change
the outcome significantly. In this paper we use the parameter values
,
,
,
,
,
and
.
Genetic perturbations
Since we exclude genetic control and explicit enzymes in our reaction-system
kinetics, we have to model the genetic perturbations indirectly. This is on the
other hand quite straightforward. We replace randomly chosen
reactions following the same rules as when the reaction system was first
constructed. For local perturbations, the reactions are chosen from one randomly
selected cluster (identified by the cluster-detection algorithm above). A
reaction is associated to the module to which a majority of its metabolites are
categorized (if there is a tie, we select a cluster randomly). In this process,
new metabolites will inevitably be generated and others possibly deleted. To
conserve mass in case the number of metabolites changes, we split the mass of
the deleted metabolites equally among the new. We also go over the system and
update the sources and sinks in the same way as when the reaction system was
constructed.
Metabolic perturbations
Analogously to the genetic perturbations, we also require the metabolic
perturbations to conserve the total mass. We control the magnitude of the
perturbation by a parameter by requiring that
(4)where
is the total mass
of metabolite
before the perturbation and
is the total mass after, and
is a set of metabolites. In practice the masses have a
right-skewed, heavy tailed distribution (as observed in real systems [23]). This
means that if we just continue adding metabolites randomly until the condition
Eq. (4) is fulfilled, and
is not very small
(we use
), we will have to perturb a rather large fraction of the
metabolites. To get around this problem, consider a set
of metabolite pairs. For the local perturbations, we
choose a cluster (as detected by the algorithm above) at random as
and add pairs of metabolites picked at random to
until the condition is met or all there are no
metabolites left in the cluster. For the global perturbations we let
and split the metabolites into two sets
and
where the total
mass of any metabolite in
is larger than any
metabolite in
and
is as small as
possible such that the total mass of
is larger than
. In our simulations
always has more
elements than
. Then we add pairs where one metabolite is randomly
selected from
and one is randomly selected from
until Eq. (4) is true.
Robustness measures
Any measure of robustness should increase the more similar the system is before
and after a perturbation. For biological functionality, it could be just as
important to keep the concentrations of rare metabolites steady as those of the
most abundant ones. Let be the
concentration of metabolite
before the
perturbation and
be the
concentration after. A natural choice would be to take the average over the
metabolites of the change
rescaled by the
typical concentration of
as a measure of
unrobustness (and thus its reciprocal value as a measure of robustness). As
“typical concentration” one choice is the average. In practice, the
metabolites that are very close to zero in concentration can give a rather large
signal due just to numerical errors. To suppress such numerical noise, we rather
use the quadratic mean, which decreases the expression's sensitivity to
fluctuations in the denominator in the frequent situation that the
concentrations are close to zero, thus putting a lower weight on the more
uncertain terms. Our robustness measure thus becomes
(5)where
is a set of
metabolites and
denotes the
absolute value of a number or the number of elements of a set. We consider two
versions of this measure, one averaged over the whole set of metabolites, which
we call system-wide perturbations
, and one averaged
over the metabolites directly affected by the perturbations (the metabolites
participating in a reaction catalyzed by a perturbed enzyme in the case of
genetic perturbations or, trivially, the perturbed metabolites of a metabolic
perturbation), which we refer to as focal robustness
.
Author Contributions
Conceived and designed the experiments: PH. Performed the experiments: PH. Analyzed the data: PH. Contributed reagents/materials/analysis tools: PH. Wrote the paper: PH.
References
- 1. Barabási AL (2004) Network biology: Understanding the cell's functional organization. Nature Reviews Genetics 4: 101–114.
- 2. Fortunato S (2010) Community detection in graphs. Physics Reports 486: 75–174.
- 3. Newman MEJ (2006) Modularity and community structure in networks. Proc Natl Acad Sci USA 103: 8577–8582.
- 4. Huss M, Holme P (2007) Currency and commodity metabolites: Their identification and relation to the modularity of metabolic networks. IET Systems Biology 1: 280–285.
- 5. Holme P, Huss M (2010) Substance graphs are optimal simple-graph representations of metabolism. Chinese Science Bulletin 55: 3161–3168.
- 6. Zhao J, Yu H, Luo JH, Cao ZW, Li YX (2006) Hierarchical modularity of nested bow-ties in metabolic networks. BMC Bioinformatics 7: 386.
- 7. Ravasz E, Somera AL, Mongru DA, Oltvai ZN, Barabási AL (2002) Hierarchical organization of modularity in metabolic networks. Science 297: 1553–1555.
- 8. Zhao J, Ding GH, Tao L, Yu H, Yu ZH, et al. (2007) Modular co-evolution of metabolic networks. BMC Bioinformatics 8: 311.
- 9. Alon U (2003) Biological networks: The tinkerer as an engineer. Science 301: 1866–1867.
- 10. Smart AG, Amaral LAN, Ottino JM (2008) Cascading failure and robustness in metabolic networks. Proc Natl Acad Sci USA 105: 13223–13228.
- 11.
Eum S, Arakawa S, Murata M (2007) Toward bio-inspired network robustness - step 1. modularity. In:
Bio-Inspired Models of Network, Information and Computing Systems
2. New York: Association of Computing Machinery. pp. 84–87.
- 12. Viana MP, Tanck E, Beletti ME, Costa LF (2009) Modularity and robustness of bone networks. Mol Biosyst 5: 255–261.
- 13. Hinze A, Adami C (2008) Evolution of complex modular biological networks. PLoS Comput Biol 4: e23.
- 14.
Wagner A (2005) Robustness and Evolvability in Living Systems. Princeton, NJ: Princeton University Press.
- 15. Holme P (2009) Model validation of simple-graph representations of metabolism. J Roy Soc Interface 40: 1027–1034.
- 16. Fell D (1992) Metabolic control analysis: a survey of its theoretical and experimental development. Biochem J 1: 313–330.
- 17. Schuster S, Dandekar T, Fell D (1999) Detection of elementary ux modes in biochemical networks: A promising tool for pathway analysis and metabolic engineering. Trends Biotech 17: 53–60.
- 18. Snel B, Huynen MA (2004) Quantifying modularity in the evolution of biomolecular systems. Genome Research 14: 391–397.
- 19. Andrianantorando E, Basu S, Karig DK, Weiss R (2006) Synthetic biology: new engineering rules for an emerging discipline. Molecular systems biology 2: 2006.0028.
- 20. Keasling JD (2010) Manufacturing molecules through metabolic engineering. Science 330: 1355–1358.
- 21.
Li X, Erten S, Bebek G, Koyuturk M, Li J (2009) Comparative analysis of modularity in biological
systems. 2009 Ohio Collaborative Conference on
Bioinformatics. pp. 104–109.
- 22. Guimerà R, Sales-Pardo M, Nunes Amaral LA (2004) Modularity from uctuations in random graphs and complex networks. Phys Rev E 70: 025101.
- 23.
Kaneko K (2006) Life: An introduction to complex systems biology. Berlin: Springer.