Skip to main content
Advertisement
Browse Subject Areas
?

Click through the PLOS taxonomy to find articles in your field.

For more information about PLOS Subject Areas, click here.

  • Loading metrics

The partition representation of enzymatic reaction networks and its application for searching bi-stable reaction systems

  • Takashi Naka

    Roles Conceptualization, Data curation, Formal analysis, Funding acquisition, Investigation, Methodology, Project administration, Resources, Software, Supervision, Validation, Visualization, Writing – original draft, Writing – review & editing

    naka@is.kyusan-u.ac.jp

    Affiliation Faculty of Science and Engineering, Kyushu Sangyo University, Fukuoka, Japan

Abstract

The signal transduction system, which is known as a regulatory mechanism for biochemical reaction systems in the cell, has been the subject of intensive research in recent years, and its design methods have become necessary from the viewpoint of synthetic biology. We proposed the partition representation of enzymatic reaction networks consisting of post-translational modification reactions such as phosphorylation, which is an important basic component of signal transduction systems, and attempted to find enzymatic reaction networks with bistability to demonstrate the effectiveness of the proposed representation method. The partition modifiers can be naturally introduced into the partition representation of enzymatic reaction networks when applied to search. By randomly applying the partition modifiers as appropriate, we searched for bistable and resettable enzymatic reaction networks consisting of four post-translational modification reactions. The proposed search algorithm worked well and we were able to find various bistable enzymatic reaction networks, including a typical bistable enzymatic reaction network with positive auto-feedbacks and mutually negative regulations. Since the search algorithm is divided into an evaluation function specific to the characteristics of the enzymatic reaction network to be searched and an independent algorithm part, it may be applied to search for dynamic properties such as biochemical adaptation, the ability to reset the biochemical state after responding to a stimulus, by replacing the evaluation function with one for other characteristics.

Introduction

The intracellular signal transduction system functions as a mechanism to control cell proliferation, apoptosis, differentiation, and homeostasis, and its malfunction is thought to be one of the substantial causes of cancer formation for cells. The regulatory mechanism of signal transduction systems is closely related to the mechanism of action of drugs, such as anticancer drugs, and has therefore been the subject of extensive research in recent years. In addition, from the viewpoint of synthetic biology, which has been the focus of much research in recent years, design methods for signal transduction systems have become necessary. However, because biochemical reactions are nonlinear, it is difficult to establish theoretical analysis methods, and signal transduction systems in particular have not been systematically analyzed because their interactions are more complex than those of intracellular metabolic systems, which they control. The current situation is that the parameters are fixed for each system to be analyzed, and the analysis is performed by computer simulation.

The MAPK cascade, a representative signal transduction system, which relays cell growth factor (EGF) from the cell membrane to the cell nucleus, and its abnormality is thought to be the cause of cell canceration, and much knowledge has been obtained [16]. A major component of the signal transduction system represented by the MAPK cascade is the cyclic reaction system, which activates and deactivates enzymes by phosphorylation and dephosphorylation. The cyclic reaction system is a combination of two post-translational modification reactions. Therefore, we have been conducting a comprehensive analysis of the stability of regulatory networks that consist of activation and inactivation cyclic reaction systems of enzymes and their mutual controls [7]. There, the control relationship between the cyclic reaction systems is represented by a control matrix. The control matrix, which is an adjacency matrix, has 1 as its i-th row and j-th column components when the activating enzyme of cyclic reaction system j catalyzes the activation of cycle reaction system i, and -1 when it catalyzes the deactivation.

In a similar study, Kuwahara et al. [8] exhaustively analyzed the influence of the control structure on the stochastic properties for regulatory networks of three to five nodes with only one feedback control. They also mentioned the control structures in which bistability emerges. Ramakrishnan et al. [9], Shah et al. [10], and Siegal-Gaskins et al. [11] have conducted exhaustive analyses of the ultrasensitive properties and bistability of signaling systems. Ultrasensitivity was reported as a response characteristic of the MAPK cascade, a typical signaling system mentioned above [1]. In a system with ultrasensitivity, the steady-state enzyme concentration of the output chemical species with respect to the enzyme concentration of the input chemical species increases rapidly after a certain threshold of the input. Bistability is when this ultrasensitivity is more strongly expressed, and hysteresis appears in the change of output to input. Thus, in a region with input, two steady state values of the output appear. In particular, it is said to be resettable if it has the property of being able to mutually transition between two steady states by varying the total concentration of the input chemical species within the range of protein concentrations expected in the cell [12].

Ma et al. [13] and Yao et al. [14] analyzed the biochemical adaptation and robustness properties using a similar control network of three nodes. Biochemical adaptation is the property that when the concentration of an input chemical species is increased in a steady-state system, the response of the output chemical species transiently increases and then decreases to a value close to the original steady-state value [13]. Robustness is the property that the various response properties of a system are less sensitive to the concentration of other chemical species in the system [12].

Adler et al. [15] focused on fold-change detection, which is a dynamic property in which the temporal pattern of the system’s output in response to a transient input change in the cell depends on the ratio rather than the difference of the input changes, demonstrating the effectiveness of comprehensive analysis using regulatory networks. In these studies, the enzymatic reaction mechanism is mainly based on the Michaelis-Menten approximation or a simplified linear equation in order to reduce the computational complexity. However, it has been reported that approximations can lead to inaccuracies in the bistability and dynamics of the system [16, 17]. In order to accurately analyze the properties of a system, it would be better to describe the system using only the law of mass action, but this would increase the number of parameters of the system and make the originally huge search space even larger, making exhaustive analysis difficult to carry out. Therefore, there is a need to develop a method to search for enzymatic reaction networks with useful properties such as ultrasensitivity, bistability, biochemical adaptation, robustness, and fold-change detection as mentioned above [1820].

In this study, we focus on the enzymatic reaction network, which is formulated as a regulatory network in which post-translational modification reactions are the elements and they mutually regulate each other. Post-translational modification reactions, such as phosphorylation, are smaller units than the cyclic reaction systems. Instead of the control matrix described above, we propose to use a partition of the set as its representation, which is considered to be more suitable for search. In addition, we report the results of our search for a bistable and resettable enzymatic reaction network with the aim of demonstrating its effectiveness. Resettable is the property of being able to transition between two steady states by changing the total concentration of the input chemical species within the range of expected protein concentrations in the cell [12].

Materials and methods

Partition representations for enzymatic reaction networks

A post-translational modification reaction with Michaelis-Menten-type enzymatic reaction as the reaction mechanism is considered as the basic component, considering to construct the enzymatic reaction network composed of N components. In this regard, reactions in which multiple enzymes bind to a single substrate simultaneously are not considered for simplicity in this formulation. The Michaelis-Menten-type enzymatic reaction is a series of chemical reactions in which an enzyme (E) binds to a substrate (S) to form a temporary enzyme-substrate complex (C), the substrate is transformed into a product (P) on the complex, and then decomposes into the product (P) and the enzyme (E), as shown in Eq (1), where the reaction rate constants are denoted by a, d, and k. (1) The substrates, enzymes, and products of N Michaelis-Menten-type enzymatic reactions are denoted by Si, Ei, and Pi, respectively, and the set is denoted by M. That is, M = {S1, E1, P1, S2, E2, P2, …, SN, EN, PN}.

Take one partition P of the set M. A partition is a family of mutually disjoint subsets of a set, and the union of those subsets is the original set. By identifying the elements of the partition, i.e., the chemical species that belong to one subset of the original set M, an enzymatic reaction network is constructed from multiple post-translational modification reactions. Fig 1 shows an example of a basic enzymatic reaction network consisting of post-translational modification reactions. The primary building blocks of post-translational modification reactions are numbered consecutively; the substrate, enzyme, and product of the i-th post-translational modification reaction are labeled Si, Ei, and Pi, respectively. If the chemical species to be equated is the substrate Si and the product Pi, the names of the species to be equated are listed in the rectangle. The enzyme Ei is represented by the small circle between the red arrows, and if there is a chemical species that is identical to the enzyme, it is connected to that chemical species by a dotted line. It may be easier to understand intuitively if you think of the small circle as representing the enzyme-substrate complex.

thumbnail
Fig 1. Basic enzymatic reaction networks consisting of post-translational modification reactions.

Examples of enzymatic reaction networks composed of post-translational modification reactions as the primary building blocks; a: product of enzymatic reaction 1 catalyzes enzymatic reaction 2, b: two-step reaction, c: branching reaction, d: confluence reaction, e: autocatalysis, f: association-dissociation reaction, g: dimer formation separation reaction. The primary building blocks of post-translational modification reactions are numbered consecutively; the substrate, enzyme, and product of the i-th post-translational modification reaction are labeled Si, Ei, and Pi, respectively. If the chemical species to be equated is the substrate Si and the product Pi, the names of the species to be equated are listed in the rectangle. The enzyme Ei is represented by the small circle between the red arrows, and if there is a chemical species that identifies with the enzyme, it is connected to that species by a dotted line.

https://doi.org/10.1371/journal.pone.0263111.g001

Fig 1A–1D are examples of an enzymatic reaction network consisting of N = 2, i.e., two post-translational modification reactions, where the original set of partitions is M = {S1, E1, P1, S2, E2, P2}. Fig 1A shows the enzymatic reaction network represented by the partition {{S1}, {E1}, {P1, E2}, {S2}, {P2}} of M, where the product of enzymatic reaction 1 functions as the enzyme in enzymatic reaction 2. The identification of Pi and Ej corresponds to the fact that the product of enzymatic reaction i functions as the enzyme of enzymatic reaction j. This is a typical mode of enzymatic reaction chain in which the activated enzyme catalyzes other enzymatic reactions as seen in signal transduction systems. Fig 1B shows the enzymatic reaction network represented by the partition {{S1}, {E1}, {P1, S2}, {E2}, {P2}} of M, where the products of enzymatic reaction 1 are the substrates of enzymatic reaction 2. The identification of Pi and Sj is a representation of the two-step enzymatic reaction from Si to Pj. Fig 1C and 1D show the enzymatic reaction networks represented by the partitions, {{S1, S2}, {E1}, {P1}, {E2}, {P2}} and {{S1}, {E1}, {S2}, {E2}, {P1, P2}} of M, respectively. The identification of Si and Sj, and Pi and Pj, respectively, is a representation of the branching into and confluence from two enzymatic reactions.

Fig 1E to g are examples of enzymatic reaction networks consisting of N = 1, i.e., one post-translational modification reaction, where the original set of partitions is M = {S1, E1, P1}. Fig 1E shows the enzymatic reaction network represented by this partition {{S1}, {E1, P1}} of M, which is a representation of an autocatalytic reaction. It corresponds to the case where i = j = 1 in Fig 1A. Fig 1F shows the enzymatic reaction network represented by the partition {{S1, P1}, {E1}} of M, showing the association and dissociation reaction between the identical substrate and product S1 = P1 and the enzyme E1. The identification of Si and Pi is a representation of the association and dissociation reaction with the enzyme. Fig 1G shows the enzymatic reaction network represented by the partition {{S1, P1, E1}} of M, where the identified substrate and product and the enzyme S1 = P1 = E1 associate and dissociate themselves, corresponding to the dimer formation separation reaction. The identification of Si, Pi, and Ei is an expression of the dimer formation-separation reaction.

Fig 2 shows examples of more complex enzymatic reaction networks and their partition representations. Fig 2A shows an example of an enzymatic reaction network consisting of N = 4, i.e., four post-translational modification reactions, where the original set of the partition is M = {S1, E1, P1, S2, E2, P2, S3, E3, P3, S4, E4, P4} and the partitions is {{S1, P2}, {P1, S2, E1, E4}, {S3, P4}, {P3, S4, E2, E3}}. This is an example of an enzymatic reaction network consisting of two cyclic reaction systems with auto-activating and mutually inhibitory feedbacks, which is an typical of enzymatic reaction network with bistable properties. The MAPK cascade, one of the representative signaling systems, is shown in Fig 2B. The MAPK cascade consists of a cascade of three kinases, MAPKKK, MAPKKK, and MAPK. The activated enzyme activates the phosphotransferase in the next step, thereby transmitting signals within the cell. An example of an enzymatic reaction network consisting of 10 post-translational modification reactions, with M = {S1, E1, P1, S2, E2, P2, …, S10, E10, P10} can be represented by the partition {{P2, S1}, {P1, S2, E3, E4}, {E1}, {E2}, {P6, S3}, {P3, P5, S4, S6}, {P4, S5, E7, E8}, {P10, S7}, {P7, P9, S8, S10}, {P8, S9}, {E5}, {E6}, {E9}, {E10}}, where {P2, S1} and {P1, S2, E3, E4} are the inactive and active MAPKKKs, respectively. Also, {P6, S3}, {P3, P5, S4, S6}, and {P4, S5, E7, E8} are inactive, one phosphorylated, and two phosphorylated MAPKKs, respectively. {P10, S7}, {P7, P9, S8, S10}, and {P8, S9} are inactive, one-phosphorylated, and two-phosphorylated MAPKs, respectively.

thumbnail
Fig 2. Complex enzymatic reaction networks consisting of post-translational modification reactions.

The way to represent the figure is the same as in Fig 1; a: auto-activating mutual inhibitiory network, b: MAPK cascade.

https://doi.org/10.1371/journal.pone.0263111.g002

The advantage of the partition representation is that every partition corresponds to an enzymatic reaction network, and thus there is a one-to-one relationship between the partition and the enzymatic reaction network. The number of different enzymatic reaction networks consisting of N post-translational modification reactions is the same as the total number of different partitions of the set with number of elements 3N, a number whose value is known as the Bell number B3N, expressed in the recurrence equation as in Eq (2), where B0 = B1 = 1. (2) For example, the total number of enzymatic reaction networks consisting of four post-translational modification reactions, as shown in Fig 2A, is B12 = 4213597.

In the space where the partition is an element, we can introduce the distance D (A, B) between the partition A = {a1, a2, …, am} and the partition B = {b1, b2, …, bn} of the set M as shown in Eq (3). Here, ai and bj are subsets of the set M. (3) The symbol # is a function that returns the number of elements in a set. Therefore, # (aibjaibj) is the number of elements that are not common to the sets ai and bj. For example, let the source set of the partition be M = {x1, x2, x3, x4, x5, x6}, and consider three partitions, A = {a1, a2, a3}, B = {b1, b2}, and C = {c1, c2}, where a1 = {x1, x2, x3}, a2 = {x4, x5}, a3 = {x6}, b1 = {x1, x2}, b2 = {x3, x4, x5, x6}, c1 = {x1, x2, x3}, c2 = {x4, x5, x6}. Then d (A, b1) = mini s (ai, b1) = s (a1, b1) = 1, d (A, b2) = mini s (ai, b2) = s (a2, b2) = 2, so maxj d (A, bj) = d (A, b2) = 2, and similarly maxj d (B, aj) = 3. Therefore, D (A, B) = max (2, 3) = 3. Similarly, we can see that D (B, C) = 1 and D (A, C) = 2. In other words, A and B are the farthest, B and C are the closest, and A and C are in the middle distance. To check this in an enzymatic reaction network, for example, let the original set of chemical species be M = {S1, P2, E2, P1, E1, S2}. In other words, x1 = S1, x2 = P2, x3 = E2, x4 = P1, x5 = E1, x6 = S2. Then partition A becomes {{S1, P2, E2}, {P1, E1}, {S2}}, B becomes {{S1, P2}, {E2, P1, E1, S2}}, and C becomes {{S1, P2, E2}, {P1, E1, S2}}, corresponding to a, b, and c of the enzymatic reaction network shown in Fig 3.

thumbnail
Fig 3. Enzymatic reaction networks specified by divisions and distances between the divisions.

The way to represent the figure is the same as in Fig 1. The enzymatic reaction networks represented by the partitions; A: {{S1, P2, E2}, {P1, E1}, {S2}}, B: {{S1, P2}, {E2, P1, E1, S2}}, C: {{S1, P2, E2}, {P1, E1, S2}}.

https://doi.org/10.1371/journal.pone.0263111.g003

It can be seen that the similarity of the enzyme reaction network correlates with this distance. Furthermore, this distance between partitions satisfies the triangle inequality and can be used to visualize the distribution of partitions.

Derivation of differential equation systems and conservation laws from partitions

From the partition, we can derive a system of differential equations and conservation laws that describe the behavior of the corresponding enzymatic reaction network as follows. If the number of post-translational modification reactions constituting the enzymatic reaction network is N, the total number of elements in the set from which the partitioning is made is 3N, since three elements, that is, a substrate, an enzyme, and a product, correspond to one post-translational modification reaction. Therefore, if we denote the elements by xi, the source set of the partition can be expressed as M = {x1, x2, , x3N}.

Let one of the partitions of M be A = {a1, a2, , am}, and let sk be the set of subscript i of xi which is an element of the kth element ak of the partitions A. For this partition A, a system of differential equations can be generated by the following procedure.

  1. Generate 4N reaction rate equations for substrate Si, enzyme Ei, enzyme-substrate complex Ci, and product Pi in each post-translational modification reaction, as shown in Eq (4). Note that x3N+1, x3N+2, , x4N correspond to the enzyme-substrate complex Ci. (4)
  2. For each element ak in the partition, generate a chemical species yk that identifies the all elements of ak, and replace the original reaction rate equations for the elements of ak with the following Eq (5), while all elements of ak appearing on the right side of Eq (5) are replaced by yk. The sk is the subscript set of the elements of ak.
(5)

For the derivation of Eq (4), if only the law of mass action is used as the reaction equation for the post-translational modification reaction, and the variable name is expressed as x[chemical species], the reaction equation derived is the following for the post-translational modification reaction i. (6) For example, if the original set of partitions is M = {S1, E1, P1, S2, E2, P2}, and the partition is A = {{S1}, {E1}, {P1, E2}, {S2}, {P2}} as seen in Fig 1A, then by applying the law of mass action, the above Eq (4) becomes the following Eq (7), where the variable name is represented by x[chemical species]. (7) The equations corresponding to Eq (5), obtained by equating P1 and E2 according to division A, are Eq (8). In particular, when some chemical species are considered identical, they are listed in brackets. (8) For the conservation law, we first generate a set of lists Q = {q1, …, qi, …, q2N} of chemical species whose total concentration is conserved for each post-translational modification reaction. The reason why the total number is 2N is that for each post-translational modification reaction, there is a conservation law for the enzyme and a conservation law for the substrate. Next, the following procedure is applied sequentially to each element ak of the partition A, in turn.

  1. Concatenate all the list of elements in Q that contain elements in common with ak, and let rk be the list.
  2. Remove the elements of ak from the list rk and add the chemical species yk, which is identical to the elements of ak.
  3. Add rk to the element of Q that does not contain an element in common with ak, and make it Q again.

It is important to note that the above procedure concatenates, rather than sums, the elements of Q that contain elements in common with ak. This is because the overlap is intrinsic when the list to be combined contains the common enzyme-substrate complex Ci.

If only the law of mass action is used as the reaction equation for the post-translational modification reaction, and the variable name is represented by x[chemical species], the initial elements of the set of conservation laws Q to be derived are the following for the post-translational modification reaction i. (9) For example, for the enzymatic reaction networks in Fig 1A, the partition is A = {{S1}, {E1}, {P1, E2}, {S2}, {P2}}. Then, if the variable names are denoted by x[chemical species] as in the example above, the original list of conservation laws becomes {{x[S1], x[C1], x[P1]}, {x[E1], x[C1]}, {x[S2], x[C2], x[P2]}, {x[E2], x[C2]}}, and by equating P1 and E2, we get Q = {{x[S1], x[C1], x[P1, E2], x[C2]}, {x[E1], x[C1]}, {x[S2], x[C2], x[P2]}}.

Resettabe bistability

A search for bistable and resettable enzymatic reaction networks was attempted to demonstrate the effectiveness of the partition representation of enzymatic reaction networks. Resettable is the property of being able to mutually transition between two steady states by varying the total concentration of the input chemical species within the range of expected protein concentrations in the cell [12]. A typical resettable bistable enzymatic reaction network is shown in Fig 2A.

The relationship between bistability and resettability when the input chemical species is E1 and the output chemical species is P2 is shown in Fig 4. For example, in the enzymatic reaction network of Fig 2A, the input chemical species E1 corresponds to the small circle labeled 1, and the output chemical species P2 corresponds to the rectangle in the lower left corner that equates S1 and P2. The horizontal axis is the logarithm of the input species concentration with a base of 2, and the vertical axis is the total output species concentration normalized to between 0 and 1. The range of protein concentrations expected in the cell is 2−5 to 25 in m mol/m3.

thumbnail
Fig 4. Relationship between bistability and resettability.

The horizontal axis is the total concentration of the enzyme including the input chemical species E1 and the vertical axis is the relative concentration of the output chemical species P2 to the total concentration; a: resettable bistable, b: non-resettable bistable, but the transition from the upper stable state to the lower can occur, c: non-resettable bistable. The solid line corresponds to the stable equilibrium point, i.e., the steady state value, and the dashed line corresponds to the unstable equilibrium point, The solid blue lines at both ends depict the monostable state.

https://doi.org/10.1371/journal.pone.0263111.g004

The curve represents the concentration of the output species at equilibrium, where the solid line corresponds to the stable equilibrium point, or steady state value, and the dashed line corresponds to the unstable equilibrium point.

Fig 4A shows the resettable bistable aspect. The solid blue curves at both ends of the graph correspond to a single steady state, which are monostable states. The red curve in the center has two steady states, corresponding to bistable states. As the total concentration of the input species is increased from the smaller one, the black arrow on the right makes a discontinuous jump from the upper steady state to the blue monostable state on the right. When the total input species concentration is reduced from that state, the system jumps to the steady state below at the black arrow on the left side. Thus, it is a resettable bistable because it can mutually transition between two steady states.

Fig 4B is an example where the curve in Fig 4A extends to the right and the region of monostability on the right exceeds the range of protein concentrations expected in the cell. In this case, the black arrows show that it is possible to jump from the upper steady state to the lower steady state, but it is not possible to jump from the lower steady state to the upper steady state, even if the total input chemical species concentration is increased within the range of expected protein concentrations in the cell. This is an example of a bistable that is not resettable.

Fig 4C shows an example where the graph extends further to the left. In such a case, if the initial state of the system is above the red dotted line in the center, it shifts to the upper steady state, and if it is below, it shifts to the lower steady state, and the state cannot be changed even if the total concentration of the input chemical species is changed within the range of the expected protein concentration in the cell. This is another example of bistability that is not resettable.

Exploration of enzymatic reaction networks in the partition representation space

For the partition representation of the enzymatic reaction network, an algorithm to search for the enzymatic reaction network with the desired response characteristics can be implemented in a natural way by sequentially applying the following two partition modifiers.

  • Separate partition modifier: Randomly selects an element from a partition and randomly separates it into two elements. For example, by separating the second element {P1, E1, S2} of the partition {{S1, P2, E2}, {P1, E1, S2}} corresponding to the enzymatic reaction network C shown in Fig 3 into two elements {P1, E1} and {S2}, we obtain the partition {{S1, P2, E2}, {P1, E1}, {S2}} corresponding to the enzymatic reaction network A is obtained.
  • Join partition modifier: Combines two randomly selected elements. For example, in the example of Fig 3 above, by combining the second and third elements of the partition corresponding to the enzymatic reaction network A, the partition corresponding to the enzymatic reaction network C is obtained.

First, we set up an evaluation function ϕ(P,ν) that quantifies the desired response properties of the dynamics or steady state of the enzymatic reaction network generated from the partition. ϕ(P,ν) is a function of the partition P, with the parameter ν, which is a pair of reaction rate constants of each post-translational modification reaction and the total concentration of enzymes that constitute the enzymatic reaction network corresponding to the partition.

In the following algorithm, we use the function Ф(P) to perform a random search on the parameter value ν for a given partition P. The function Ф(P) quantifies the evaluation value at λ randomly chosen parameter values from the set of candidate values Λ of the parameter ν by ϕ(P,ν) and returns the pair of the largest evaluation value and the parameter ν at that time as the value.

The main loop of the algorithm is as follows.

  1. Determine one partition P at random.
  2. If the value of the evaluation function Ф(P) is less than ρ or the number of iterations is less than γ, repeat P = ψ(P, P, σ).
  3. Return P as the value.

The partition search function ψ(P0, P, σ) is a recursive function whose arguments are the initial partition P0, the partition P, and the partition depth σ described below. ψ(P0, P, σ) returns the partition whose evaluation function is greater than or equal to Ф(P0).

  1. If σ is zero, return P0 as the value.
  2. Generate P’ from P with the join partition modifier.
  3. If Ф(P’) > Ф(P0), return P’ as the value.
  4. Generate P” from P with the separate partition modifier.
  5. If Ф(P”) > Ф(P0), return P” as the value.
  6. let P = ψ(P0, P’, σ-1).
  7. If Ф(P) > Ф(P0), return P as the value.
  8. let P = ψ(P0, P”, σ-1).
  9. If Ф(P) > Ф(P0), return P as the value.
  10. If none of the above conditions are satisfied, return P0 as the value.

We apply this random search algorithm to the aforementioned search for enzymatic reaction networks with resettable bistability with the aim of demonstrating the effectiveness of the partition representation of enzymaric reaction networks. In other words, we design an evaluation function ϕ(P,ν) for resettable bistability and use it in this algorithm.

Results

Evaluation function for resettable bistability

Bistability is the property of having two steady state values for a given parameter value ν, as described above. Furthermore, resettable is the property that allows the transition between these two steady states by changing the value of the total concentration of the enzyme containing the input chemical species. Here, we create an evaluation function ϕ(P,ν) that gives a high evaluation value for a typical resettable bistable aspect as shown in Fig 4A, i.e., an enzymatic reaction network that is monostable at the edge of the range taken by the total concentration of input chemical species and bistable in the central part.

ϕ(P,ν) is a function of the partition P, with the reaction rate constants of each post-translational modification reaction and the total concentration of enzymes comprising the enzymatic reaction network corresponding to the partition as parameters ν. The variations in the total concentration of enzymes containing the input chemical species were set to 21 discrete values of Ω = 2−5, 2−4.5, …, 25. These values, as well as the reaction rate constants to be set as ν, are set to include approximately the values reported for the kinases that make up the MAPK cascade, which is known to be a typical cellular signaling system.

The following procedure corresponds to the aforementioned evaluation function ϕ(P,ν). P and ν are given as arguments.

  1. For a given parameter value ν, only the total concentration of the enzyme containing the input chemical species is varied sequentially and discretely for 21 values of Ω = 2−5, 2−4.5, …, 25, the total concentration of each enzyme is randomly assigned as the initial value so as to satisfy the conservation law of each enzyme. The steady state values were obtained by numerically solving the generated differential equations and making convergence judgments as appropriate. The details are described in the next section. Then 21 pairs of steady state values of the output chemical species are obtained. Here, the steady-state value of the output enzyme species is normalized to its total concentration, and is a value between 0 and 1.
  2. This is repeated θ times to obtain θ pairs of normalized steady state values for each of the 21 different total concentrations of the input chemical species, where only data for which all 21 values are converged are used. Therefore, the number of them is at most θ. Fixing one from 21 different total concentrations of the input chemical species means that we have at most θ steady state values. When all these values are the same, it means monostable, and when they are divided into two different values, it means bistable.
  3. The degree of bistability can be quantitatively evaluated by the variance of several steady-state values obtained from different initial states. This value takes a maximum value of 1/4 in a perfect bistable state, i.e., when half of the values are 0 and the rest are 1, and a minimum value of 0 in a monostable state, i.e., when all the values are the same. It also takes an intermediate value in the mixed states. By multiplying the value of the variance by four, we can normalize the number representing the degree of bistability from 0 to 1.
  4. Up to this point, we obtain 21 normalized variances V = {V1, V2,…,V21}, which indicate the degree of bistability for each of 21 values of the total concentration of the enzyme containing the input chemical species: 2−5, 2−4.5, …, 25.
  5. Return the value of the function eVL for this V, eVL(V), as the evaluation value.

The function eVL is the function defined in Eq (10). (10) Resettable bistability is the property of being monostable at the edge of the range taken by the total concentration of the input chemical species and bistable in the middle part, as explained in Fig 4. The eVL evaluates the degree of this property. Its input V is a vector of the above 21 quadruple variance values and Vi denotes the each element. The value of the function eVL was normalized by a maximum value so that it would be between 0 and 1. The α is the coefficient for this purpose, which is the reciprocal of the maximum value 0.604762 that the summation equation on the right side takes when V = {0, 0, 0, 0, 0, 0, 1, 1, 1, 1, 1, 1, 1, 1, 1, 0, 0, 0, 0, 0, 0}. The 3D shape of the function wVL that makes up the function eVL is shown in Fig 5. Its input v is the quadruple variance value Vi and l is the position i of Vi in the vector V. The function wVL is designed to be maximal for perfect monostability (v = 0) at the edge of the region, and also to be maximal for perfect bistability (v = 1) at the center of the region, and continuous intermediate evaluation values in the middle of the region.

thumbnail
Fig 5. Function wVL for evaluation of resettable bistability.

Input v is the quadruple variance value Vi and l is the position i of Vi in the vector V. The wVL is designed to take a maximum value for perfect monostability (v = 0) at the edge of the region, a maximum value for perfect bistability (v = 1) in the center, and continuous intermediate evaluation values in the middle of the region.

https://doi.org/10.1371/journal.pone.0263111.g005

Hyperparameters of the search algorithm

The following is a summary of the hyperparameters included in the algorithm for searching for resettable bistable enzymatic reaction networks, which is an application of the partition representation, and the values used in the actual search. First, the hyperparameters that are independent of the characteristics of the enzymatic reaction networks to be explored include the following.

  • Number of post-translational modification reactions N: 4
  • Input and output chemical species: input is E1 and output is P2
  • Partitioning depth (depth of recursive search in binary tree) σ: 4
  • Maximum number of loops to change the division γ: 150
  • Value of the evaluation function to be judged as convergence ρ: 0.8
  • A set of candidate values of a parameter to try randomly Λ: {2−5, 2−4.5, …, 25}
  • Number of pairs of parameter values to try randomly λ: 5

As mentioned earlier, the enzymatic reaction network shown in Fig 2B is known as a typical resettable bistable enzymatic reaction network [7], and the number of post-translational modification reactions that make up the network is 4, so the value of N was set to 4. The input and output chemical species are set to be E1 and P2, respectively, which belongs to the different post-translational modification reactions, because the enzymatic reaction network explored would be severely limited if the input and output chemical species were those of the same post-translational modification reaction.

We did not know whether the values we chose for the depth of recursive search σ and the maximum number of loops γ were optimal or not. After trying several combinations, we chose the one that would allow us to explore a single enzyme reaction network in at least a few days. The set of values Λ used for the total concentration of enzymes and the rate constants of the enzymatic reactions was set to include approximately the values in mmol/m3-s system reported for the phosphatases comprising the MAPK cascade [2125]. For λ, we tried several other values, but did not find much difference.

Hyperparameters specific to the search target, i.e., the search for bistable enzyme reaction networks, include the following. In general, these will alter for different search targets.

  • A series of total concentrations of the input chemical species Ω: {2−5, 2−4.5, …, 25}
  • Number of steady-state values with random initial values θ: 5

We used the same variation of Λ as described above for Ω. We tried several other values, but did not find much difference for θ.

To obtain the above steady state values, the generated differential equations were solved numerically many times while updating the initial values, and the convergence was judged. This part has the following hyperparameters, which are common when steady state values are needed in the search process.

  • Calculation time for one step τ: 10 s
  • Maximum number of step repetitions κ: 360
  • Convergence decision error ε: 10−3

Wolfram Mathematica’s NDSolve function was used to solve the differential equations numerically. The protein concentration after τ seconds was calculated by NDSolve, and was judged to be converged when all of the relative error of the change, i.e., the ratio of the change to the total concentration of the protein, was less than ε. If there is no convergence, the protein concentration after another τ seconds is calculated and the same decision is made. If it does not converge after repeating this process κ times, it returns the result that it did not converge. The product of τ and κ, 3600 seconds, or 1 hour, was taken as the critical time to reach steady state. This time was taken as a reference for the response time of the signaling system in the cell. The values of the calculation time τ for one step and the convergence decision error ε were adopted after trial and error.

Enzymatic reaction networks found by the search algorithm

The search was conducted on a machine with a 3.4GHz i7-3770 processor CPU, 16GB of memory, and a Windows 10 operating system. The search program was implemented in Wolfram Mathematica 12.0. The bistable enzymatic reaction networks found in the search at the values of the hyperparameters described above are shown in Fig 6A–6H. Each of them uses a different seed of random numbers for the initial setting. The maximum CPU time required to discover a single enzymatic reaction network was approximately 30 hours. For example, the enzymatic reaction network in Fig 6A consists of three chemical species: the input is the chemical species that identifies E1, E2, P1, P4, S1, and S4, and the output is the chemical species that identifies P2 and S3. The rest is a chemical species that identify E3, E4, P3, and S2. In particular, the input chemical species is in equilibrium with its dimer. Fig 6G is identical to the typical bistable enzymatic reaction network shown in Fig 2A. The reaction rate constants for each enzymatic reaction network, the total concentration of each chemical species, and the derived differential equation systems are shown in S1S4 Tables provided as the supporting information, respectively.

thumbnail
Fig 6. Bistable enzymatic reaction networks found in the search.

The enzymatic reaction networks a—h use the different seeds of random numbers. The way to represent the figure is the same as in Fig 1.

https://doi.org/10.1371/journal.pone.0263111.g006

Fig 7 shows the bistable aspect of each enzymatic reaction network, where a—h corresponds to a—h in Fig 6. In order to plot the bistable aspect, the Monte Carlo method was used to find the steady state values of the output chemical species for each value of the total concentration of the enzyme containing the input chemical species. That is, when numerically solving the system of differential equations derived from the partition, the initial values were randomly allocated to the chemical species included in each conservation law. We can see that all the obtained enzymatic reaction networks exhibit resettable bistability. Resettable is the property of being able to go back and forth between two bistable steady state values by changing the value of the input.

thumbnail
Fig 7. Bistable aspects of enzymatic reaction networks found in the search.

The enzymatic reactoin networks a—h correspond to Fig 6. The horizontal axis is the total concentration of the enzyme including the input chemical species, E1, in mmol/m3, and the vertical axis is the relative concentration of the output chemical species, P2, to the total concentration.

https://doi.org/10.1371/journal.pone.0263111.g007

Fig 8 shows an aspect of the search process. Fig 8A shows the evolution of the evaluation value as the search progresses. The horizontal axis is the number of times the structure or parameter values of the enzymatic reaction network were changed. The vertical axis is the evaluation value, with the maximum being 1. The legends a—h corresponds to a—h in Fig 6. Since the evaluation value reaches 0.6 in all cases in the first few steps, and almost all graphs overlap up to that point, the range of the vertical axis is set to 0.55 or higher so that there is less overlap thereafter. The search is limited to 150 times, and the search is terminated when the evaluation value exceeds 0.8. It can be seen that the aspect of convergence varies depending on the seeds of random numbers used at initial settings. Fig 6B shows movements within the partition space, based on the distance between the partitions defined by Eq (3). Wolfram Mathematica’s Graph function was used for drawing. The distance between all partitions was measured by the introduced distance function and the values were used in the EdgeWeight option. The GrpahLayout option specifies SpringEmbedding, which is arranged so that the total energy of the mutually coupled partitions is minimized by a spring of force equivalent to the distance between the partitions. The legends a—h corresponds to a—h in Fig 6. The red circles denote the final partitions in each search. We can see that the search is moving through the entire partition space.

thumbnail
Fig 8. Aspects of the search process.

The enzymatic reaction networks a—h correspond to Fig 6. A: Aspect of convergence. Horizontal axis is the number of iterations of the search. The vertical axis is the evaluation value. Convergence occurs when the evaluation value is greater than 0.8 or the number of iterations exceeds 150. B: Distribution of partitions in the search process based on the distance between partitions. The search proceeds in the direction of the arrow, and the red circles are the final points.

https://doi.org/10.1371/journal.pone.0263111.g008

Discussion

We have shown that an enzymatic reaction network composed of post-translational modification reactions can be represented by a partition of the set whose elements are the enzymes, substrates, and products. By identifying elements of each subset in the partition as the same chemical species, an enzymatic reaction network can be constructed. In particular, by equating the substrate and product within the same post-translational modification reaction, we can also describe the association and dissociation reaction between enzyme and substrate (product). However, it does not represent the binding of multiple proteins as found in scaffold proteins and as seen in membrane receptor proteins. As an extension of the partition representation, enzyme-substrate complexes may be added as elements of the original set, but an algorithm for deriving the system of differential equations and conservation laws from the partition needs to be devised.

Furthermore, the partition representation of enzymatic reaction networks was applied to the search for bistable networks. The two partition modifiers introduced into the partition representation worked well and various bistable enzymatic reaction networks were obtained. One of the enzymatic reaction networks obtained in the search were the typical bistable enzymatic reaction network in which two cyclic reaction systems with positive auto-regulatory feedbacks mutually negatively regulate each other.

Regarding the performance of the search algorithm, it was confirmed that the search was completed within an acceptable processing time. It is especially noteworthy that the processing time required for the search was greatly reduced compared to an exhaustive search. In addition, the visualization of the search path using the distance between the partitions introduced in this study was able to show the aspect of the search.

It is not difficult to increase the granularity of the primary building blocks from post-translational modification reactions to cyclic reaction systems. Instead of the original set of elements being the substrate, enzyme, and product of the post-translational modification reaction, we can have four types of enzymes: the active enzyme, the inactive enzyme, the activating enzyme that catalyzes the reaction that turns the inactive enzyme into the active enzyme, and vice versa. We plan to try it in the future.

For the hyperparameter of the search algorithm, which is the search depth, and the parameter values for evaluating the structure of an enzymatic reaction network, which are the reaction rate constant and the number of times the total concentration of each chemical species is randomly tested, we set the search depth to 4 and the number of random tests to 5, but we did not systematically test other values. It is possible that adjusting these values can speed up the process. Further speed-up can be achieved by using GPUs or by using tQCCM [16], an improved form of the Michaelis-Menten approximation, as a formulation of the reaction mechanism instead of the law of mass action.

Although the bistability targeted in this study is a steady-state property, the search algorithm is divided into an evaluation function specific to the characteristics of the enzymatic reaction network to be explored and an independent algorithmic part, so that it may be possible to apply it to the exploration of dynamic properties such as biochemical adaptation, by replacing the evaluation function. We plan to try it in the future. The number of post-translational modification reactions, which is the primary building block, was set to be 4, but even when the number was set to be 3, a bistable enzymatic reaction networks could be found. If the mechanism for automatic increase or decrease in the number of post-translational modification reactions can be included in the search algorithm, it will be possible to find a more optimal enzyme reaction network.

Supporting information

S1 Fig. An example of the transitions in search for enzymatic reaction networks.

Transitions of the enzymatic reaction networks in the process of searching for the discovered enzymatic reaction network Fig 6G. The top left is the initial enzymatic reaction network and transitions to the right, then moves to the second line and transitions from left to right, and the third line transitions from left to right as well. The bottom right is the final enzymatic reaction network Fig 6G.

https://doi.org/10.1371/journal.pone.0263111.s001

(EPS)

S1 Table. Reaction rate constant values for the enzymatic reaction networks found.

Each column is the value of the respective reaction rate constant. Each line corresponds to a—h in Fig 6.

https://doi.org/10.1371/journal.pone.0263111.s002

(XLSX)

S2 Table. The total concentration of enzymes in each enzymatic reaction network found.

Each line corresponds to a—h in Fig 6. The second column is a list of sets of chemical species corresponding to the conservation laws. The third column is their total concentration in the same order as the second column. However, for the set containing E1, the input chemical species, we have moved it in the range of 2-5 to 25, instead of the values in the table.

https://doi.org/10.1371/journal.pone.0263111.s003

(XLSX)

S3 Table. Differential equation system for each enzymatic reaction network found (a—d).

The first column corresponds to a—d in Fig 6. The second column is the name of the variable to be differentiated on the left side of the differential equation and the third column is the right side of the differential equation. The variable name is represented by x[chemical species name]. In particular, when some chemical species are considered identical, they are listed in brackets.

https://doi.org/10.1371/journal.pone.0263111.s004

(XLSX)

S4 Table. Differential equation system for each enzymatic reaction network found (e–h).

The first column corresponds to e–h in Fig 6. The second column is the name of the variable to be differentiated on the left side of the differential equation and the third column is the right side of the differential equation. The variable name is represented by x[chemical species name]. In particular, when some chemical species are considered identical, they are listed in brackets.

https://doi.org/10.1371/journal.pone.0263111.s005

(XLSX)

References

  1. 1. Ferrell JE Jr. How regulated protein translocation can produce switch-like responses. Trends Biochem Sci. 1998;23:461–5. pmid:9868363
  2. 2. Jeschke M, Baumgartner S, Legewie S. Determinants of cell-to-cell variability in protein kinase signaling. PLoS Comput Biol. 2013;9(12):e1003357. Epub 2013/12/18. PCOMPBIOL-D-13-00825 [pii]. pmid:24339758; PubMed Central PMCID: PMC3854479.
  3. 3. Kholodenko BN. Cell-signalling dynamics in time and space. Nat Rev Mol Cell Biol. 2006;7(3):165–76. pmid:16482094.
  4. 4. Mai Z, Liu H. Random parameter sampling of a generic three-tier MAPK cascade model reveals major factors affecting its versatile dynamics. PLoS One. 2013;8(1):e54441. Epub 2013/02/01. PONE-D-12-24261 [pii]. pmid:23365667; PubMed Central PMCID: PMC3554771.
  5. 5. Qiao L, Nachbar RB, Kevrekidis IG, Shvartsman SY. Bistability and oscillations in the Huang-Ferrell model of MAPK signaling. PLoS Comput Biol. 2007;3(9):1819–26. Epub 2007/10/03. doi: 07-PLCB-RA-0294 [pii] pmid:17907797; PubMed Central PMCID: PMC1994985.
  6. 6. Volinsky N, Kholodenko BN. Complexity of receptor tyrosine kinase signal processing. Cold Spring Harb Perspect Biol. 2013;5(8):a009043. Epub 2013/08/03. a009043 [pii] 5/8/a009043 [pii]. pmid:23906711.
  7. 7. Sueyoshi C, Naka T. Stability Analysis for the Cellular Signaling Systems Composed of Two Phosphorylation-Dephosphorylation Cyclic Reactions. Computational Molecular Bioscience. 2017;7:33–45.
  8. 8. Kuwahara H, Gao X. Stochastic effects as a force to increase the complexity of signaling networks. Sci Rep. 2013;3:2297. Epub 2013/07/31. srep02297 [pii]. pmid:23892365; PubMed Central PMCID: PMC3725509.
  9. 9. Ramakrishnan N, Bhalla US. Memory switches in chemical reaction space. PLoS Comput Biol. 2008;4(7):e1000122. Epub 2008/07/19. pmid:18636099; PubMed Central PMCID: PMC2440819.
  10. 10. Shah NA, Sarkar CA. Robust network topologies for generating switch-like cellular responses. PLoS Comput Biol. 2011;7(6):e1002085. Epub 2011/07/07. 10-PLCB-RA-2608 [pii]. pmid:21731481; PubMed Central PMCID: PMC3121696.
  11. 11. Siegal-Gaskins D, Mejia-Guerra MK, Smith GD, Grotewold E. Emergence of switch-like behavior in a large family of simple biochemical networks. PLoS Comput Biol. 2011;7(5):e1002039. Epub 2011/05/19. PCOMPBIOL-D-10-00298 [pii]. pmid:21589886; PubMed Central PMCID: PMC3093349.
  12. 12. Gedeon T, Cummins B, Harker S, Mischaikow K. Identifying robust hysteresis in networks. PLoS Comput Biol. 2018;14(4):e1006121. Epub 2018/04/24. pmid:29684007; PubMed Central PMCID: PMC5933818.
  13. 13. Ma W, Trusina A, El-Samad H, Lim WA, Tang C. Defining network topologies that can achieve biochemical adaptation. Cell. 2009;138(4):760–73. Epub 2009/08/26. S0092-8674(09)00712-0 [pii]. pmid:19703401; PubMed Central PMCID: PMC3068210.
  14. 14. Yao G, Tan C, West M, Nevins JR, You L. Origin of bistability underlying mammalian cell cycle entry. Molecular systems biology [electronic resource]. 2011;7:485. Epub 2011/04/29. msb201119 [pii]. pmid:21525871; PubMed Central PMCID: PMC3101952.
  15. 15. Adler M, Szekely P, Mayo A, Alon U. Optimal Regulatory Circuit Topologies for Fold-Change Detection. Cell Syst. 2017;4(2):171–81 e8. Epub 2017/01/17. doi: S2405-4712(16)30419-7 [pii] pmid:28089543.
  16. 16. Kim JK, Tyson JJ. Misuse of the Michaelis-Menten rate law for protein interaction networks and its remedy. PLoS Comput Biol. 2020;16(10):e1008258. Epub 2020/10/23. pmid:33090989; PubMed Central PMCID: PMC7581366.
  17. 17. Naka T. Validity of the Michaelis-Menten Approximation for the Stability Analysis in Regulatory Reaction Networks. BIOSTEC 2020 (13th International Joint Conference on Biomedical Engineering Systems and Technologies); February; Valletta, Malta2020. p. 176–82.
  18. 18. Cho DY, Cho KH, Zhang BT. Identification of biochemical networks by S-tree based genetic programming. Bioinformatics. 2006;22(13):1631–40. pmid:16585066.
  19. 19. Lok L, Brent R. Automatic generation of cellular reaction networks with Moleculizer 1.0. Nat Biotechnol. 2005;23(1):131–6. pmid:15637632.
  20. 20. Gabel M, Hohl T, Imle A, Fackler OT, Graw F. FAMoS: A Flexible and dynamic Algorithm for Model Selection to analyse complex systems dynamics. PLoS Comput Biol. 2019;15(8):e1007230. Epub 2019/08/17. pmid:31419221; PubMed Central PMCID: PMC6697322.
  21. 21. Brightman FA, Fell DA. Differential feedback regulation of the MAPK cascade underlies the quantitative differences in EGF and NGF signalling in PC12 cells. FEBS Lett. 2000;482:169–74. pmid:11024454
  22. 22. Hatakeyama M, Kimura S, Naka T, Kawasaki T, Yumoto N, Ichikawa M, et al. A computational model on the modulation of mitogen-activated protein kinase (MAPK) and Akt pathways in heregulin-induced ErbB signalling. Biochem J. 2003;373(Pt 2):451–63. pmid:12691603.
  23. 23. Huang CF, Ferrell JE Jr. Ultrasensitivity in the mitogen-activated protein kinase cascade. Proc Natl Acad Sci USA. 1996;93:10078–83. pmid:8816754
  24. 24. Levchenko A, Bruck J, Sternberg PW. Scaffold proteins may biphasically affect the levels of mitogen-activated protein kinase signaling and reduce its threshold properties. Proc Natl Acad Sci USA. 2000;97(11):5818–23. pmid:10823939
  25. 25. Schoeberl B, Eichler-Jonsson C, Gilles ED, Muller G. Computational modeling of the dynamics of the MAP kinase cascade activated by surface and internalized EGF receptors. Nat Biotechnol. 2002;20:370–5. pmid:11923843