Metabolic adaptations to complex perturbations, like the response to pharmacological treatments in multifactorial diseases such as cancer, can be described through measurements of part of the fluxes and concentrations at the systemic level and individual transporter and enzyme activities at the molecular level. In the framework of Metabolic Control Analysis (MCA), ensembles of linear constraints can be built integrating these measurements at both systemic and molecular levels, which are expressed as relative differences or changes produced in the metabolic adaptation. Here, combining MCA with Linear Programming, an efficient computational strategy is developed to infer additional non-measured changes at the molecular level that are required to satisfy these constraints. An application of this strategy is illustrated by using a set of fluxes, concentrations, and differentially expressed genes that characterize the response to cyclin-dependent kinases 4 and 6 inhibition in colon cancer cells. Decreases and increases in transporter and enzyme individual activities required to reprogram the measured changes in fluxes and concentrations are compared with down-regulated and up-regulated metabolic genes to unveil those that are key molecular drivers of the metabolic response.
Deciphering the essential events in the reprogramming of metabolic networks subjected to complex perturbations, including the response to pharmacological treatments in multifactorial diseases like cancer, is crucial for the design of efficient therapies. Yet, tools to infer the molecular drivers sustaining such metabolic responses remain elusive for large metabolic networks. Here we develop an efficient computational strategy that integrates measured changes at systemic and molecular levels and combines metabolic control analysis with linear programming tools to infer key molecular drivers sustaining the metabolic adaptations to complex perturbations, such as an antitumoral drug therapy. The collective behavior is approximated using linear expressions where the adaptation of systemic concentrations and fluxes to a perturbation is described as a function of the molecular reprogramming of transport and enzyme activities. Starting from measured changes in fluxes and concentrations, we identify changes in the reprogramming of transporter and enzyme activities that are required to orchestrate the metabolic adaptation of colon cancer cells to a cell cycle inhibitor.
Citation: de Atauri P, Tarrado-Castellarnau M, Tarragó-Celada J, Foguet C, Karakitsou E, Centelles JJ, et al. (2021) Integrating systemic and molecular levels to infer key drivers sustaining metabolic adaptations. PLoS Comput Biol 17(7): e1009234. https://doi.org/10.1371/journal.pcbi.1009234
Editor: Kiran Raosaheb Patil, University of Cambridge, UNITED KINGDOM
Received: January 5, 2021; Accepted: July 1, 2021; Published: July 23, 2021
Copyright: © 2021 de Atauri et al. This is an open access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited.
Data Availability: All relevant data are within the manuscript and its Supporting Information files. Two Mathematica notebooks, two pdf with instructions for use and excel files for the generation of results in Figs 2, 4 and 5 are freely available on Zenodo at link: http://dx.doi.org/10.5281/zenodo.5081161.
Funding: MC was supported by Agència de Gestió d’Ajuts Universitaris i de Recerca (AGAUR; Generalitat de Catalunya) (2017SGR1033), Instituto de Salud Carlos III (Centro de Investigación Biomédica en Red de Enfermedades Hepáticas y Digestivas - CIBEREHD-CB17/04/00023, Instituto Nacional de Bioinformática - PT17/0009/0018), Ministerio de Economía y Competitividad (SAF2017-89673-R) and Ministerio de Ciencia e Innovación (PID2020-115051RB-I00) (Co-funded by the European Regional Development Fund - “Una manera de hacer Europa”). JTC acknowledges the support from the Ministerio de Educación y Formación Profesional (FPU14-05992). MC acknowledges the support received through the prize “ICREA Academia” for excellence in research, funded by ICREA foundation – Generalitat de Catalunya. The funders had no role in study design, data collection, and analysis, decision to publish, or preparation of the manuscript.
Competing interests: The authors have declared that no competing interests exist.
Metabolism is a structured network of metabolites connected by transporters and enzyme-catalyzed reactions. The onset of multifactorial diseases like cancer and their response to pharmacological treatments are associated with complex metabolic adaptations [1,2]. Such metabolic adaptations are responses to large perturbations and often involve metabolic reprogramming driven by multiple changes in transporter and enzyme activities. At the systemic level, variables such as metabolite concentrations or reaction fluxes (i.e., transport and reaction rates) depend on the system’s collective behavior and are measurable using various experimental methods . In particular, complete estimations of the distribution of reaction fluxes throughout a metabolic network can be achieved with metabolic flux analysis supported by stable isotope-resolved metabolomics (SIRM) techniques [4,5].
These systemic variables depend on variations at the molecular level, such as individual transporter and enzyme activities. Given a perturbation, mathematical models are used to describe the adaptation of systemic concentrations and fluxes as a function of reprogramming at the molecular level. There are multiple modeling approaches for integrating information at the systemic and molecular levels. On the one hand, when there is a lack of detailed information at the molecular level, the dependencies between systemic reaction fluxes can be explored by stoichiometric models . These models rely on reaction stoichiometry constraints to find viable steady-state intracellular flux distributions. These are the constraints used in the integration of SIRM data to obtain quantitative estimations of flux distributions, although limited to small or medium-sized metabolic networks. Alternatively, stoichiometric models can be applied at the genome-scale coupled to various optimization methods and the integration of multiple layers omics data [7–9]. On the other hand, when there is enough information at the molecular level, the dependencies of concentrations, fluxes, and individual activities, among others, can be explored with kinetic models [9–11]. By integrating kinetic rate laws for reaction and transport processes in systems of time-dependent ordinary differential equations, kinetic models explicitly describe reaction fluxes as a function of metabolite concentrations and individual activities, enabling dynamic simulations of systemic concentrations and fluxes. There are different kinetic modeling frameworks, each providing advantages and limitations . Unfortunately, they are often limited by the availability of kinetic data and by the fact that the cell environment is far from the ideal conditions that are assumed by most kinetic models [13,14]. Several frameworks have been developed in this context of uncertainty, including approximate rate laws, such as (log)-linear or power-law based on linear Taylor’s approximation . These strategies are valid in the proximity of a reference steady-state and usually are associated with Metabolic Control Analysis (MCA) [15–19] or the closely related Biochemical Systems Theory (BST) [20,21]. They provide the advantage of simplified formulations and are frequently used in different computational methodologies based on optimization [22–25] and sampling [17,26–28]. The ultimate objective of all these methodologies is the extraction (i.e., inference, prediction, identification) of new information from sets of observations and assumptions, which constitute groups of constraints that must be satisfied.
Both MCA and BST can be equivalently applied using sensitivity coefficients to quantify the variations at systemic and molecular levels in response to system perturbations. Sensitivity coefficients are implicitly associated with kinetic models and most frequently explicitly derived from them. MCA has been defined using formulations that are slightly different [15,16,18,19,28], although equivalent, to those used in BST [21,29], and where the dependencies of sensitivity coefficients at the systemic and molecular levels are explicitly described. At the systemic level, sensitivity coefficients can formally be divided into control and response coefficients. They describe variations in metabolite concentrations and reaction fluxes in response to perturbations in transporter and enzyme individual activities (control coefficients) or, in general, to any other parameter p (response coefficients). Analogously, at the molecular level, elasticities are described as variations in transporter and enzyme activities in response to perturbations in metabolite concentrations (metabolite elasticities) or, in general, to any other perturbation (parameter elasticities). In Table 1, formal definitions of these sensitivity coefficients and the dependencies among them are provided for a metabolic network with n internal metabolites (i = 1,…,n), and m transport and transformation processes (j = k = 1,…,m), where each xi describes a metabolite concentration, each Ji the systemic reaction flux (rate) through a particular process, and each vk the transport or enzyme activity of a particular process.
Each sensitivity is a dimensionless coefficient, which measures the fractional change in some variable A per fractional change in some parameter B around a steady-state (xio,Jko = vko). In control and response coefficients, the variations in systemic concentrations and fluxes are the result of the collective adaptation of the entire system after a transient period of adjustment, which is indicated using total derivatives . In contrast, regarding elasticities, the changes in transport or enzyme activities happen as isolated individual processes. A positive sign indicates that variations in A and B magnitudes follow the same direction, both decreasing or both increasing. A negative sign indicates that variations in A and B magnitudes follow opposite directions, one increasing and the other decreasing.
Starting from the definition of concentration and flux response coefficient (), (1) (2) with a direct application of the chain rule, Eqs (1) and (2) can be expanded to express response coefficients as a function of special elasticities and control coefficients [30–33]: (3) (4)
Each control coefficient describes the variations in a concentration or flux to the perturbation of one particular activity. Altogether, the complete set of control coefficients provides a full description of the potential behavior when a unique activity is perturbed. In response to a perturbation, each response coefficient in Eq (3) and (4) is a function of all control coefficients weighted by parameter elasticities (). Each response coefficient describes the overall variation in a concentration or reaction flux in response to a perturbation in some parameter p that can affect one or multiple activities simultaneously, i.e., including any perturbation leading to a complex metabolic response.
A variety of optimization methods have exploited the dependencies among steady-state concentrations, fluxes, and system parameters such as enzyme levels or variations of them. They can take advantage of the particular formulation of the rate laws used in time-dependent differential equations, such as (log)-linear in MCA [34–36] and S-system and Generalized Mass Action (GMA) in BST [22–25,37]. At steady-state, mass balances provide sufficient constraints to account for all dependencies. The potential behavior is fixed through variables, such as metabolite elasticities for MCA, or their equivalent kinetic orders with BST. In the (log)-linear formulation [34,35,38], reformulated for mixed-integer linear programming (MILP), logarithmic deviations of the metabolite concentrations and enzyme levels take the role of decision variables, together with binary decision variables, in mass-balance derived linear constraints. The objective was to determine which enzymes should be present at different levels, the extent of such changes, and the accompanying modifications in the regulatory structure that optimize metabolic outputs, such as metabolite production [34,35]. Also, looking for steady-state optimizations in the BST framework, mass-balance derived constraints have been applied using the S-system and GMA power-law formulations [22–25]. In GMA, a power-law for each separate reaction is used to describe each metabolite’s mass balance. In contrast, in S-system, the mass balance for each metabolite is represented by two competing power-law functions, one resulting from the aggregation of the separate power-laws for synthesis and the other from the aggregation of the separate power-laws for consumption . Fixing rate constants and kinetic orders, the advantage of S-system representation is that in logarithmic coordinates the steady-state mass balances are linear equations, enabling the use of linear optimization techniques following a linear programming (LP) / MILP form [23,37,39–43]. GMA does not allow for a linear reformulation. However, efficient optimization tasks have also been performed using alternative optimization methods taking advantage of the structural regularity of the GMA representation, such as those relying on geometric programming techniques [23,24,44,45]. Another alternative optimization strategy that takes advantage of GMA is to apply outer approximation algorithms that decompose the target problem into master MILPs and slave nonlinear programming problems [22,46,47].
The formulation of optimization strategies using linear constraints enables to solve them using LP efficiently while guaranteeing convergence to an optimum solution point (minimum or maximum) . In the methodology proposed in this paper, by combining MCA and LP, required decreases and increases previously unknown are extracted from linear constraints involving continuous domains in the form of bounded (closed) intervals measuring the differences in reaction fluxes, metabolite concentrations, and individual activities comparing the initial and final states during the adaptation to a metabolic perturbation. Subject to a set of predetermined experimental values in the form of some initially restricted domains, the feasible range of values for all domains (previously restricted or not) is determined by successively minimizing and then maximizing the value for each reaction. Those domains reduced to only negative or positive values will identify required decreases or increases to be satisfied, respectively. Among them, changes required at the molecular level in individual activities will identify molecular drivers required for the metabolic adaptation.
As proof of concept, we use two case studies based on previously published experimental data. First, a glycolysis-case study covering the upper glycolysis and the oxidative branch of the pentose phosphate pathway (PPP) [48,49]. Second, a more complex cancer-case study expanded to all central carbon metabolism, associated with a set of experimental measurements obtained in cultured human colon cancer cells (HCT116) following the inhibition of cyclin-dependent kinases 4 and 6 (CDK4/6) . A complete kinetic model reconstructed from experimental data supports the first case study. This model is built by assuming a full and ideal description of the system behavior, and it is used as a “toy” model to illustrate the proposed methodology. Under more realistic experimental conditions, the second study is built in a context of uncertainty, with partial knowledge and associated with a complex metabolic adaptation to a large perturbation. This study provides an example of the adaptation to simultaneous and coordinated changes in multiple activities, supported by a SIRM-based description of the altered flux distribution, together with the measurements of some metabolite concentrations and the analysis of the differential gene expression. Gene expression analyses are commonly applied to identify metabolic drivers, and therefore potential vulnerabilities to be exploited as targets in drug therapies. Although changes in gene expression should be related to changes in individual activities of their encoded products, alternative “moonlighting” roles cannot be discarded . Also, although individual activities can be directly associated with enzyme concentrations measurable by specific activities, it is worth noting that they are also dependent on other measurable events, such as inactivation/activation by phosphorylation/dephosphorylation. Evidence supporting the role as a key molecular driver of an altered metabolic gene could be provided by the correspondence in the changes in transcript levels and the changes in the predicted activity of the encoded product, the latter being imposed by the changes observed at the systemic level. With the cancer-case study, we illustrate the application of the proposed methodology identifying such metabolic drivers by comparing changes in gene expression and changes required in the transporter and enzyme activities identified by the combination of MCA and LP.
Response to a large perturbation
Response coefficients and special elasticities measure the response to infinitesimal (or small) perturbations around a particular steady-state. However, metabolic adaptations in response to the large perturbations will lead to complex changes in the qualitative steady-state leading to two separate states, before and after the perturbation. The control coefficients are redistributed during the adaptation from one state to the other. Taking a known flux control coefficient, assuming that variations in enzyme activity are small enough for this control coefficient to be constant, approximate predictions can be made about the change in the flux value by applying the following expression [16,21,52]: (5) which corresponds to the integration of the definition of flux control coefficients in Table 1. Predictions for larger perturbations in enzyme activities can be made, although with an increasingly approximate value. Analogously, for perturbations involving several transporters and enzymes for fluxes and concentrations we expand the above expression to the following equations: (6) (7) which are directly associated with Eqs (3) and (4), respectively, but constraining control coefficients with differences in reaction fluxes, metabolite concentrations, and individual activities. Thus, scaling by Δ log p, the two expressions are rewritten as: (8) (9) where Δ log xi, Δ log Jj , and Δ log vk are differences between the measurements before and after perturbation. The above expressions for a large perturbation are an approximation of the description of response coefficients as a function of control coefficients and special elasticities in Eqs (3) and (4), being equivalent for an infinitesimal perturbation. Accordingly, response coefficients and special elasticities could be approximated for large perturbations as: (10)
Inference by bound contraction
Given a mathematical model that constrains the values of variables, such as concentrations, fluxes, and individual activities, this model must allow for the description of the different experimental situations that may potentially occur. Once the model is defined, it can be applied by adjusting the variables to reproduce particular conditions. In MCA, in the context of Eqs (3) and (4), or Eqs (6) and (7) for larger changes, the first level of “model description” can be provided by using control coefficients with fixed values. Once the model is established, a second level of “model application” can be done by adjusting the fluxes, concentrations, and individual activities. These variables can be taken as continuous domains in the form of bounded (closed) intervals measuring the differences in reaction fluxes, metabolite concentrations, and individual activities that are comparing the values before and after the adaptation to a metabolic perturbation. Because any combination of values for the variables must satisfy all model constraints, the restriction of the range of possible values for a part of the variables can, in turn, serve to infer new information in the form of additional restrictions on the values of any of the model variables. Starting with Eqs (6) and (7), an optimization-based procedure for bound contraction is reformulated following LP, where each variable is successively minimized and then maximized. In this reformulation, differences in fluxes, concentrations, and individual activities can be taken as decision variables in linear constraints, with control coefficients taken as the constant coefficients, fixed according to the available information. Therefore, given,
- a complete description of the potential behavior, in the form of known control coefficients,
- a metabolic response (adaptation), expressed using continuous domains for all differences in fluxes, concentrations, and individual activities; with generic lower and upper limits for all of them, and subject to additionally restricted lower and upper bounds for some of them based on experimental measurements,
and then, redistributing the terms in Eqs (6) and (7), we can define linear systems of equations with the following equalities, (11) where, control coefficients are defined as constant parameters, and differences in fluxes, concentrations, and individual activities are defined as variables, for which initial lower (lb) and upper bounds (ub) of the initial domains are set in the form of inequalities . Taking all variables as decision variables, a series of LP problems can be solved. Each problem is solvable if all the model constraints and initial domains are satisfied, in other words, if all initial constraints are compatible. If the problem is solvable, each initial domain should contain at least one value satisfying all inequalities and constraints in Eq (11). Otherwise, the complete problem formulation must be discarded. The following maximization / minimization LP formulation is solved for each variable, one at a time: (12) where any logarithm base would lead to the same results. The analysis is repeated for each decision variable (Δ log Jj, Δ log xi, Δ log vk ); therefore, a total of 2n+4m LP analyses are performed to solve the complete problem. Each application of LP returns a combination of values for the complete set of decision variables that satisfies at the same time the set of constraints and inequalities in the initial domains, including the lower or upper bound of the minimized or maximized decision variable. Starting with an initial domain, taking the minimum and maximum solutions, this LP formulation provides a reduced final domain for each decision variable. Therefore, applying LP twice per each sensitivity coefficient will provide final lower and upper inequalities satisfying all conditions .
Previously, in the context of GMA-based applications of the outer approximation algorithm to analyze stress responses in yeast, a bound-contraction strategy was systematically applied [22,46,47]. The objective was to identify parameter regions in enzyme levels containing admissible solutions, and therefore changes, that were compatible with the considered physiological constraints. Our use of LP is equivalent to that done in the framework of these stoichiometric models (Flux Variability Analysis) , where the mass balance around each metabolite is a system of linear constraints involving reaction fluxes. Subject to predetermined experimental values for a few fluxes, the feasible range of flux values is determined by minimizing and then maximizing the value for each reaction [53–58]. As the problem formulation is based on linearization, our objective can be qualitative but not quantitative. Accordingly, the objective was to obtain collections of negative and positive signs, looking to capture the trends of the changes, decreases (negative) or increases (positive), and whether these trends are significant. Fixed-sign final domains that are required to explain all the initial constraints will be identified from domains that have only negative or only positive values. Although each final domain identifies a range of values that is required, it does not imply that, in turn, this domain is sufficient alone to constraint the initial domains. Therefore, in the context of the metabolic adaptations, signs will identify changes that are required, although not necessarily sufficient, to sustain all the initial constraints.
Flow chart of the proposed analysis
Starting with the control coefficients calculated in Fig 1, Fig 2 illustrates the application of the formulation presented above using the model of the glycolysis-case study as a toy model. In this figure a flow chart is provided:
- Setting a model description. Each problem formulation implies a model description in the form of a complete set of constant control coefficients. For the estimation of control coefficients, different related matrix formulations have been developed in the context of MCA [31,32,59–62], which are closely related to those applied in the context of BST . Accordingly, the complete set of constant control coefficients is usually presented as a matrix, where each element is a control coefficient. Setting the values of all metabolite elasticities and the steady-state ratios of dependent fluxes and concentrations, the system’s potential behavior is fixed and described in the form of known control coefficients. These matrix methods are formulations that imply all summation and connectivity dependencies in Table 1, together with the stoichiometric dependencies of fluxes and concentrations of species involved in moiety conserved cycles. The dependencies among control coefficients in the panel A of Fig 2 were a consequence of these flux and concentration stoichiometric dependencies for the glycolysis-case in the panel C of Fig 1. A detailed description of this case-study and a detailed explanation of the calculation of control coefficients is provided in Material and Methods and Fig 1.
- Setting initial domains. The variables are continuous domains measuring the differences in reaction fluxes, metabolite concentrations, and individual activities. First, common lower and upper bounds are set for the domains of all variables to be enclosed between a minimum lower bound and a maximum upper bound, assuming that differences outside this enclosure are not accepted. Although this enclosure is arbitrarily set, it can be adapted depending on the observed changes and contributes to reducing the space of solutions. In both case studies (initial domains in panel C in Fig 2 for glycolysis-case), common lower and upper bounds were set to be -3 and +3, which was consistent with the magnitude of expected changes. Second, the domains for differences in concentrations, fluxes, and individual activities measured experimentally are additionally restricted using the measured confidence intervals as bounds. These additional restrictions are a fundamental part of the problem formulation because the objective is to see the effect of these additional restrictions on all other variables, to see if new previously unknown fixed-sign domains finally appear. As an example, for the glycolysis-case the observed metabolic adaptation, expressed using additionally restricted domains (initial domains in red, panel C Fig 2), could be: “an increase in the flux through the first step (J1) has been observed, although neither the concentrations of most of the intermediaries (x1, x2, x3, x4), nor the individual activity through one of the branches (v5), have changed”. As for the model description, each problem formulation implies a complete set of initial domains, with some of them additionally restricted.
- Solving final domains to identify negative and positive signs. Once the problem has been formulated, the maximization / minimization LP formulation in Eq (12) is solved for each variable, one at a time. In the analyzed example (final domains in panel C in Fig 2), the problem was solvable; therefore, all initial constraints were compatible. The system was constrained enough to reduce the domain for most of the variables, even restricting some of them, fluxes and individual activities, to have fixed-sign domains with only positive values. As a measure of this bound contraction, a percentage gain metric has been added for each variable to quantify the percentage of reduction in the size of their final domains with respect to their initial domains. Looking at the molecular level, the required change in individual activities identifies four key molecular drivers (v1, v2, v3, v4) that are required, although not sufficient, to explain the whole set of constraints for the observed metabolic adaptation.
(A) Network scheme. (B) Rate laws and parameters. (C) System of ordinary differential equations (ODEs) and stoichiometric dependencies of fluxes and concentrations. (D) Calculation of control coefficients.
(A) Model description in the form of fixed control coefficients. The values correspond to the glycolysis-case. Inside the brown square are the dependencies among control coefficients. (B) Maximization / minimization LP formulation for bound contraction in Eq (12). (C) Columns in Table: 1) variable (reaction flux, metabolite concentration or individual activity); 2) initial domain, described using inequality notation, with additionally (experimentally) restricted initial domains in red; 3) final domain, described using inequality notation; 4) % gain, comparing initial and final domains for each variable; and 5) sign, fixed positive signs (values can be only positive) or fixed negative signs (values can be only negative). (C) All logarithms are to base two. See Material and Methods for a supplementary description of the model and abbreviations.
Starting with calculated control coefficients and initial values, a Mathematica notebook is provided to solve the final domains and identify negative and positive signs (see Calculations in Material and Methods).
Constraints propagate in all directions among systemic and molecular levels. However, as shown in previous works, the analyses of alternative topological designs under MCA  and BST  applying sampling techniques highlight the relevance of the structural constraints on the possible values of sensitivity coefficients. The control coefficients provide a complete description of the potential collective behavior, which implies not only summation and connectivity theorems but also all stoichiometric flux and concentration dependencies. As highlighted in panel C Fig 2 for the glycolysis-case (see labels a, b, and c), by setting the potential behavior with known control coefficients, all these dependencies for control coefficients in panel A are also implicit in the linear formulation for the differences in fluxes and concentrations.
The first case study was used to illustrate the application of the proposed analysis using a toy model. In contrast, the second case study is an example of a more complex metabolic response to a large perturbation. A detailed description of this second case-study is provided in the Material and Methods section and S1 and S2 Tables.
Adding constraints among individual activities to the problem formulation
In addition to a larger number of metabolites and processes, the problem formulation can require additional constraints. In the formulation in Eq (12) alone, it is implicit that each individual activity corresponds to an independent variable. However, as happens in the cancer-case study, some activities can be interdependent, such as the two activities (R13 and R14) catalyzed by transketolase (TKT), or can be assumed to behave as a coordinated block. Thus, additional constraints were added in the problem formulation to include dependencies among transporter/enzyme activities. On the one hand, a constraint was added for the two activities for TKT: (13)
On the other hand, in the initial part of glycolysis, the consecutive activities for glucose (Glc) transport (Glc transp) (R01) and the reactions catalyzed by hexokinase (HK) (R02), phosphofructokinase (PFK) (R04), and enolase (ENO) (R08) were assumed to be coordinately regulated (Δ log v01 = Δ log v02 = Δ log v04 = Δ log v08): (14)
Setting a model description: coupling the problem formulation with uncertainty
In the glycolysis-case study, following a local perspective, a unique matrix of control coefficients was derived from a detailed kinetic model built around a well-defined steady-state. Therefore, by solving a unique set of linear constraints. In contrast, the cancer-case study was done under realistic experimental conditions, involving a more complex metabolic response to CDK4/6 inhibition, with two different steady-states, before and after CDK4/6 inhibition. On the one hand, a unique matrix of known and constant control coefficients cannot adequately be applied because the values change during the adaptation to the large perturbation. On the other hand, there is limited availability of data to estimate metabolite elasticities and the ratios among stoichiometrically dependent fluxes and concentrations. Moving from the local analysis, both limitations can be tackled simultaneously by applying a more global analysis, solving ensembles of the complete problem, each one associated with a matrix of control coefficients generated by random sampling methods, therefore covering a wider parameter space as described by Kent et al. . In the context of the (MCA-based) (log)-linear formulation, control coefficients and response coefficients are derived under uncertainty by sampling techniques (ORACLE) [17,28,66,67], integrating data such as flux distributions and displacements of the reactions from equilibrium. When details about the enzyme’s rate expressions are not available, elasticity values can be randomly generated. Also, stability analyses based on the analysis of Jacobian matrices can be derived from random sampling of elasticities (Structural kinetic modeling) [26,27,68]. We adapted these tools to our objective and data available in the cancer-case study. An ensemble of 100 problems was formulated, each associated with a complete set of control coefficients estimated by direct sampling of metabolite elasticities. Like the glycolysis-case study, metabolite elasticities are a function of the steady-state, transport or kinetic mechanisms, and regulatory states. Although we did not use a complete kinetic model accounting precisely for elasticities, together with measured fluxes, the sampling of elasticities was constrained by different assumptions, including enzyme saturation, displacement from equilibrium, and literature-based data regarding moiety conservations, inhibitions, and activations.
See Material and Methods for a detailed explanation of the calculation of control coefficients coupled with the sampling of metabolite elasticities.
Setting initial domains
The model used to characterize the metabolic response to CDK4/6 inhibition covered the central carbon metabolism, with a subset of the transport and reaction processes conforming a core network (Fig 3) that is wrapped in a simplified network of boundary processes. As discussed above, first, common lower and upper bounds were set to be -3 and +3. Second, the initial domains for differences in all the reaction fluxes and some of the metabolite concentrations were additionally restricted according to experimental observations. Also, part of the initial domains for differences in individual activities were additionally restricted to force the adaptation of the lightly modeled boundaries to the CDK4/6 inhibition. Thus, the individual activities (Δ log vk) for the simplified-boundary processes, that are a link of the core network with the whole cellular metabolic environment, were set to have the same tightly restricted initial domains as the corresponding reaction fluxes .
Solving final domains to identify negative and positive signs
Fig 4 illustrates the application of the developed strategy using the cancer-case study. Once initial values were set, the analysis was repeated over the ensemble of 100 formulated problems. Among these analyses, 16 were not compatible with all initial domains and constraints and were discarded. For each problem, we identified changes at the systemic and molecular levels with fixed-sign final domains and, therefore, associated with decreases (negative) or increases (positive) required to explain the observed differences. They included 14 individual activities, seven concentrations, and six reaction fluxes. The analysis of the unions and intersections in Fig 5 provides complementary information to that provided by the signs. Although the applied analysis has a qualitative value, such unions and intersections provide a numerical summary to assess the magnitude of these changes, which was significant in all the cases, and also (as the signs) tell us about the dependence on the sampling. A coincidence of the lower and upper bounds of the unions and the intersections will correspond to a no dependence on the sampling of metabolite elasticities.
Cancer-case study. Each column with a number in the top is equivalent to the last column of the table in the panel C (sign) for the glycolysis-case study in Fig 2, for each of the first 20 solved problems of the ensemble of 100 formulated problems. Percentages of negative (% -) and positive (% +) signs refer to the solved problems. The average % gain for all variables and the 100 formulated problems was 29%. In orange, signs dependent on the constraint in Eq (14) that assumes a coordinated regulation changing in parallel the individual activities of Glc transp, HK, PFK, and ENO. See Fig 5 for a numerical summary of the final domains. See Material and Methods for a supplementary description of the network and abbreviations.
Cancer-case study. Interval unions and interval intersections of the ensemble of final domains for each variable in Fig 4. All logarithms are to base two.
The degree of identifiability of the signs depends on the degrees of freedom for each variable, and therefore, on the available information. A detailed local description of the variations in all fluxes and concentrations around a process will impose variations in the associated individual activity. As shown in Fig 4, signs identified for some of the variables were repeatedly negative or positive signs, thus independent of the sampled metabolite elasticities. The signs largely depended first on structural constraints [63,64], although a part of them was also dependent on metabolite elasticities. Among others, metabolite elasticities are a function of reversibility levels, which were fixed values in all the formulated problems (see Material and Methods for the cancer-case description). For example, the required increase in fumarate (Fum) concentration and glutaminase (GLS) activity disappear when the reactions R25 (reversible hydration of Fum in malate (Mal)) and R35 (transport of glutamine (Gln)), respectively, are switched to be far from equilibrium (ρ = 0.9 to ρ = 0). As another example of the dependence on metabolite elasticities, the constraints associated with the sampled elasticities can be enough to require a decrease in the individual activity of HK, as shown in the 3rd model formulation (Fig 4), while for the rest of the model formulations the additional constraints in Eq(14) connecting the changes in the activity of HK with those for Glc transp, PFK and ENO are needed.
For those variables that were more occasionally associated with repeated negative or positive signs or associated with a mixture of signs, such as glutamate (Glu) transport (Glu-transp), the presence of signs was more dependent on the uncertainty associated with the sampling of metabolite elasticities. Although an important level of uncertainty was permitted, the assumptions related to levels of saturation, moiety conservations, inhibitions, and activations, altogether affected the numerical value of the bounds of final domains in almost all variables. This can be shown in Fig 5 by comparing the interval unions and interval intersections of the final domains for each variable. Further availability of mechanistic data should constrain the metabolic elasticities, where the maximum restriction is achieved with a complete kinetic model, as for the glycolysis-case study.
Supporting the role as metabolic drivers of differentially expressed genes
In our previous work , we demonstrated that the inhibition of CDK4 and CDK6 resulted in the perturbation of fundamental regulators of the metabolic activity. Thus, in response to CDK4/6 inhibition, in combination with MYC’s upregulation and the activation of the PI3K/Akt-mTOR signaling axis, the hypoxia-inducible factor 1 (HIF1) was strongly downregulated. A detailed screening was performed among the differentially expressed genes detected in an Affymetrix GeneChip Human Genome U133 Plus 2.0 Array. A subset of them was experimentally verified by qRT-PCR (Fig 3). Also, our experimental observations supported previous reports sustaining that part of these differentially expressed genes were HIF1-dependent genes [69–71]. The effect on gene expression of hypoxia and CDK4/6 inhibition was the opposite for several genes , therefore suggesting that the effects of CDK4/6 inhibition were in part driven by HIF1 downregulation. In particular, among the selected genes, SLC2A3, HK2, PFKFB4, ENO2, PDK1, and PDK3 were upregulated in response to hypoxia and downregulated in response to CDK4/6 inhibition. Based on these evidences we assumed a coordinated regulation by the transcriptional regulator HIF1 of glycolytic SLC2A3, HK2, PFKFB4, and ENO2 genes, and therefore the coordinated regulation of the individual activities of their encoded products (Glc transp, HK, PFK, and ENO, respectively) in Eq (14).
For illustrative purposes, those genes coding for products known to directly affect individual activities of the core part of the model are listed in Table 2. The table provides the percentages of identified changes in the activities affected by their encoded products.
The percentages of negative and positive signs were used to assess if the computational analysis qualitatively supports a possible role as molecular drivers of the metabolic adaptations for the measured changes in gene expression. There was, in general, a good correspondence that supported this role for most of these analyzed genes. In Table 2 (last column), we have classified these changes as “supported” or “supported, but sampling dependent” and “not supported, but sampling dependent”. The role as key drivers of the metabolic adaptation for IDH2 and GLS1 was supported by the high percentages of signs indicating that the changes predicted in metabolic activities are going in the same direction as the changes expected from the measurements in gene expression. For the rest, the lower percentages do not allow for any conclusion, rather than this will depend on the sampling of metabolite elasticities.
Assuming the coordinated regulation of glycolytic SLC2A3, HK2, PFKFB4, and ENO2 by the constraints in Eq (14), the required decreases in their affected activities followed the measured reduction in HIF1. This assumption strongly reduced the space of solutions, facilitating the appearance of signs supporting the requirement of such transcriptional co-regulation, as measured experimentally. However, the signs (in orange, Fig 4) disappeared when the constraints assuming the coordinated regulation were not included.
Although some negative signs were predicted for the HIF1-dependent mitochondrial pyruvate dehydrogenase complex (PDH) activity, the downregulation of PDK1/PDK3 should drive an increase in PDH activity. This discrepancy is helpful as it indicates that additional information for the post-translationally regulated PDH activity is required to explain the observed behavior.
Finally, the required increase in the activity of GLS, supported by western blot analysis, followed the expected up-regulation of MYC-dependent GLS1, since MYC upregulates GLS activity by suppressing the expression of miR-23a and miR-23b, which target the GLS1 transcripts . Indeed, in our published analysis, the combined inhibition of GLS activity and CDK4/6 was experimentally validated as a promising synergetic combination for the efficient and selective killing of cancer cells.
Exploring the dependencies of metabolic variations measured in concentrations, fluxes, and transporter and enzyme activities can be done using kinetic models that accurately simulate the system behavior. Undeniably, coupling kinetic models with optimization methods provides great advantages in metabolic modeling. However, even if these models are based on approximated rate laws, they require detailed knowledge of enzyme kinetics inaccessible in vivo. To overcome this limitation, sampling strategies can be applied in a context of partial knowledge, although with the disadvantage of having a large space of solutions to be explored. Coupled with sampling strategies, our proposed method exploits the advantages of both approaches. First, a limited ensemble of control-coefficient matrices is generated by sampling metabolite elasticities, which together describe the partial knowledge of the system behavior. This is then used to formulate problems based on linear constraints. Finally, the dependencies of the reaction fluxes, concentrations, and individual activities are exhaustively explored by linear optimization methods in light of these linear constraints.
With the goal of identifying molecular drivers of the changes observed in metabolic adaptations to perturbations, the applied analysis is based on a combination of the MCA description of response coefficients as a function of special elasticities and control coefficients [30–33], and the subsequent reformulation as part of a linear optimization-based strategy for bound contraction in the context of large changes. This reformulation takes advantage of linearization around a reference steady-state and therefore cannot be applied for large changes with a quantitative aim. However, it can be used to capture the trends of the changes, increases (positive) or decreases (negative). Accordingly, the objective was to obtain a collection of positive and negative signs, whose repetition will allow us to assess the significance of these changes when the problem is coupled with uncertainty. First, the glycolysis-case study allowed us to illustrate this method using a unique matrix of control coefficients derived from an unambiguously reconstructed system around a steady-state. Second, the study of the effects of CDK4/6 inhibition, coupled with uncertainty, was performed in more realistic conditions, i.e., for a more complex perturbation and with partial knowledge, solving an ensemble of problems derived from sampling techniques. The interpretation of the solutions for this ensemble of problems must be made from a more global perspective . Although each solved problem satisfies the constraints associated with our knowledge of the metabolic system, the ensemble describes different possible behaviors as a measure of uncertainty. By analyzing the set of resulting final domains for each variable, the requirement for negative or positive signs in just a few cases is of null relevance. In contrast, repeated signs in the entire set are very relevant. For some variables, the required changes were largely independent of the sampling of metabolite elasticities and dependent on the applied observations and assumptions. The set of 100 ensemble formulated problems was sufficient to provide a qualitative assessment of the significance of the changes. However, this number should be adapted depending on the size of the analyzed problem, and a more robust test with statistical value could be addressed in future work.
In summary, in the context of the well-known central carbon metabolism, therefore appropriate to illustrate the proposed methodology, this analysis was applied to support the role as metabolic drivers of genes differentially expressed. Our procedure successfully enabled the inference of required changes, although not necessarily sufficient, to sustain the whole set of constraints and inequalities associated with the mixture of observations and assumptions used to characterize a metabolic adaptation.
Material and methods
This case study was mostly based on a published kinetic model covering the upper glycolysis , which was derived from experiments performed on mice muscle extracts. This published model describes a linear pathway at a given steady-state. In our glycolysis-case study, one ramification and one moiety conservation were added by including the first step of the oxidative PPP, which was taken from another published kinetic model in rat liver cells . For all reactions were available kinetic laws, parameters, as well as steady-state concentrations and fluxes.
G6P, x1, glucose-6-phosphate; F6P, x2, fructose-6-phosphate; FbP, x3, fructose-1,6-diphosphate; NADP (NADP+) and NADPH, x4 and x5, oxidized and reduced forms of nicotinamide adenine dinucleotide phosphate; HK, R1, hexokinase; GPI, R2, glucose-phosphate isomerase; PFK, R3, phosphofructokinase; ALD, R4, aldolase; G6PD, R5, glucose-6-phosphate dehydrogenase; and NADPase, R6, irreversible process accounting for all processes oxidizing NADPH.
The network analyzed in this glycolysis-case study includes five system-dependent metabolites and six enzyme-catalyzed reactions, with one moiety conservation. Fig 1 provides a complete scheme of the associated kinetic model, including also the associated system of ODEs, rate laws for all the enzyme-catalyzed reactions, parameter values, and stoichiometric dependencies of fluxes and concentrations.
Model description using control coefficients.
A unique set of control coefficients was derived from the detailed kinetic model, therefore permitting a unique problem formulation. Control coefficients were estimated by following this procedure: 1) steady-state concentrations and fluxes are calculated by setting the initial conditions and solving the system of ODEs; 2) metabolite elasticities are calculated using the steady-state concentrations and fluxes; 3) stoichiometric flux and concentration dependencies are derived and used in the form of flux and concentration ratios, together with metabolite elasticities, to calculate control coefficients by applying the matrix method developed by Cascante et al. [31,32] (panel D in Fig 1).
A second study spanning all central carbon metabolism was also analyzed, based on our previous work covering the effects of CDK4/6 inhibition in the HCT116 colon cancer cell line . The study characterized the metabolic reprogramming, both at the systemic and molecular levels, in response to CDK4/6 inhibition. To this aim, at the molecular level, downregulations and upregulations for gene levels were measured using transcriptome microarrays and qRT-PCR. At the systemic level, differences on the levels for all fluxes and some metabolites (alanine, aspartate, citrate, Glu, Mal, NADPH, pyruvate, and α-ketoglutarate) were measured for control cells (before perturbation) and CDK4/6-inhibited cells. A stoichiometric model for the central carbon metabolism was constructed and solved by applying SIRM techniques to estimate all flux distributions throughout the metabolic network, including forward and reverse reaction rates when required. For this, direct extracellular measurements, such as oxygen consumption, and consumption and production rates for Glc, lactate, and all amino acids, as well as protein synthesis rates, were combined with 13C isotopologue (mass isotopomer) enrichments measured in several metabolites. Such enrichments emerge from the propagation of 13C from labeled Glc and Gln to metabolites through the network and are informative of the underlying flux distribution. The metabolites analyzed for label propagation included lactate and amino acids from incubation media, glycogen, ribose from RNA, palmitate, and several other internal metabolites.
Metabolites: AcoA.c/AcoA.m, x01/x02, cytosolic/mitochondrial acetyl-CoA; ADP/ATP, x03/x09, adenosine di/triphosphate; αKG, x04, α-ketoglutarate; Ala, x05, alanine; Asn, x07, asparagine; Asp, x08, aspartate; Cit, x11, citrate; CoA.c/CoA.m, x12/x13, cytosolic/mitochondrial coenzyme A; Cys, x14, cysteine; FBP, x18, fructose 1,6-bisphosphate; Fum, x19, fumarate; Gln, x23, glutamine; Glu, x24, glutamate; Ile, x27, isoleucine; Leu, x29, leucine; Mal, x31, malate; Met, x32, methionine; NADH.c/NADH.m, x33/x34, cytosolic/mitochondrial reduced form of nicotinamide adenine dinucleotide; NADP/NADPH, x35/x36, oxidized/reduced form of nicotinamide adenine dinucleotide phosphate; NAD.c/NAD.m, x37/x38, cytosolic/mitochondrial oxidized form of nicotinamide adenine dinucleotide; OAA.c/OAA.m, x39/x40, cytoplasmic/mitochondrial oxaloacetate; P5C, x41, Δ1-pyrroline-5-carboxylate; PEP, x43, phosphoenol pyruvate; Phe, x44, phenylalanine; Pro, x45, proline; Pyr, x46, pyruvate; Ser, x48, serine; Suc, x49, succinate; and Val, x53, valine. Transport and reactions processes: Glc-transp, R01, glucose transport; HK, R02, hexokinase; PFK, R04, phosphofructokinase; PGM/ENO, R08, phosphoglycerate mutase / enolase pool; PK, R09, pyruvate kinase; oxid. PPP, R12, oxidative part of pentose-phosphate pathway; TKT, R13 and R14, transketolase; TA, R15, transaldolase; PDH, R19, pyruvate dehydrogenase complex; ACO/IDH, R21, aconitase / isocitrate dehydrogenase pool; αKGDH/SCS, R22, α-ketoglutarate dehydrogenase / succinyl-CoA synthetase pool; SDH/CII, R23, succinate dehydrogenase / oxidative phosphorylation from complex II of respiratory chain; FH, R25, fumarate hydratase; MDH.m, R26, malate dehydrogenase (mitochondrial); PC, R27, pyruvate carboxylase; ACLY, R28, citrate lyase; ME, R30, malic enzyme; ALT, R31, alanine transaminase; ASP, R33/R34, mitochondrial/cytosolic aspartate transaminase; Gln-transp, R35, glutamine transport; GLS, R36, glutaminase; GDH, R37, glutamate dehydrogenase; Glu-transp, R38, glutamate transport; His-transp, R44, histidine transport; Trp-transp, R52, tryptophan transport; and PYCR, R60, pyrroline-5-carboxylate reductase.
The network analyzed in this cancer-case study includes 53 system-dependent metabolites (S1 Table) and 76 transport processes and enzyme-catalyzed reactions (S2 Table), with moiety conservations involving six pairs of metabolites (ACoA.c/CoA.c; ACoA.m/CoA.m; ATP/ADP; NAD.c/NADH.c; NAD.m/NADH.m; and NADP / NADPH). The associated stoichiometric model did not include rate laws, but rather just the reaction stoichiometry. A subset of the transport and reaction processes (R01 –R38) conforms a core network, including all processes with a significant flux. This core network includes the enzyme-catalyzed reactions for glycolysis, glutaminolysis, PPP, and TCA cycle, together with the measured uptake of Glc, Gln, and Ser, the release of Ala and Glu, and the fueling of mitochondrial respiration by NADH and succinate for ATP production. Fig 3 provides a scheme of the core network. The rest of the reactions (R39 –R76) are associated with simplified pools of reactions accounting for boundary processes, such as the release, uptake, and oxidation of the rest of amino acids, fatty acid synthesis, and glycogen synthesis. Among these simplified processes are protein synthesis, which was included to balance the exchange and utilization of amino acids, reactions for ATP and NADPH utilization, and the recycling of mitochondrial acetyl-CoA. All of them were included to have appropriate balances of productions and consumptions.
Model description using control coefficients.
In a context of uncertainty, multiple problem formulations were solved, each one associated with a complete set of control coefficients. To estimate each matrix of control coefficients, all metabolite elasticities were simultaneously generated by applying random sampling. The sampling of metabolite elasticities was done satisfying different constraints, including restricted domains for them.
A first restriction that can be applied refers to the “positive” role of substrates and activators and the “negative” role of products and inhibitors:
- ; M is a substrate or activator of v.
- ; M is a product or inhibitor of v.
where the lower or upper bound, respectively for positive or negative elasticities, is set to a value of zero and corresponded to a situation of total saturation by M. However, the values of the metabolite elasticities also depend on other constraints. For a particular transport [74,75] or enzyme-catalyzed reaction , characteristics such as the particular mechanism, the level of saturation, and the displacement of the reaction with respect to the chemical equilibrium determine the domains of possible values of the metabolite elasticities for this process [27,66–68,76–78]. Thus, taking a general reversible reaction: (15) where v corresponds to the net rate of the reversible reaction, vf the forward reaction rate and vr the reverse reaction rate, the values of metabolite elasticities for vf and vr can be additionally restricted. The elasticity for a mass-action rate law is equal to the order of reaction of the reactants, while the elasticities derived for more complex rate laws, such as those associated with mechanisms for mediated transport and enzyme-catalyzed reactions, can range from zero to different values depending on the mechanism and the curve of saturation. For example, for reactions following cooperativity, which follow a non-hyperbolic curve of saturation, the limit is the Hill coefficient (), such as for the reaction catalyzed by PFK in the glycolysis-case model (Fig 1). For transport and enzyme-catalyzed reactions following a hyperbolic Michaelis-Menten type curve of saturation, all metabolite elasticities of the forward and reverse reaction rates range between zero and one [27,66,68]: (16) where 0 means total saturation and S and P refer to a substrate and a product, respectively, in reactions with one or more substrates and products. The elasticities for the overall reaction rate v will be a function of these elasticities for the forward and reverse reaction rates and the levels of displacement with respect to the equilibrium.
Assuming, for simplicity, that all modeled transport and reaction-steps follow a Michaelis-Menten type saturation, with metabolite elasticities for all forward and backward reaction rates ranging between 0 and +1 or -1 (Eq (16)), the limits for the elasticities of the net reaction rates can be described as: (17) where N depends on the level of displacement from equilibrium. At steady-state, the net, forward and reverse reaction rates can be expressed as a function of the disequilibrium ratio (ρ) , (18) where, (19) and the following equation can describe the metabolite elasticity of the net rate: (20) where v can be positive or negative, R is a reactant (substrate or product), in reactions with one or more substrates and products, Keq is the equilibrium constant, Γ is the mass-action ratio, and ρ is the disequilibrium ratio. Given three extreme situations:
- ρ = 0, the reaction is irreversible (v = vf and vr = 0) and
- ρ = 1, the reaction is in equilibrium (vf = vr) and (indeterminate)
- ρ = ∞, the reaction is irreversible (vf = 0 and v = −vr) and
Taking 0<ρ<1, and therefore for v>0, by substituting in Eq (20) and according to the equality in Eq (21) for S () and and according to the equality in Eq (22) for P (), the following equations are derived [15,79], respectively: (23) (24) which permitted in our calculations the indirect calculation of from the sampled values for , respectively. The first and second right-hand terms in Eqs (23) and (24) correspond to the so-called regulatory saturation term and thermodynamic or mass action term, respectively [79,80]. It should be noted that the application of Eq (24) implies that even for reactions far from equilibrium (ρ = 0), we will account for products’ effect, consistent with the fact that in multienzyme systems the products’ concentrations are rarely zero [15,81].
Accordingly, for each problem formulation, metabolite elasticities with respect to substrates and products were sampled for the forward flux (Eq (16)), and then used to calculate the elasticities for the net reactions applying Eq (23) for substrates and Eq (24) for products. This required an estimation of the disequilibrium ratios (S2 Table). Most of the reactions associated with transport and simplified pools of reactions accounting for boundary processes were assumed to be far from equilibrium. For some other processes, disequilibrium ratios known to be closer to equilibrium in biological conditions were set to ρ = 0.9, except for some cases for which ρ were corrected to lower values using as an approximation SIRM-estimated reverse to forward rates (Eq (19)). The upper level of 0.9 for ρ was selected as a compromise to avoid unrealistic values of control coefficients. In addition, although kinetic details were unknown, some constraints and bounded domains for metabolite elasticities were added from literature. These included constraints associated with known substrate/product competitive inhibitions [82–85]: ATP/ADP for PK, PC and ACLY; PEP/Pyr for PK; CoA.m/ACoA,m for PDH; Suc/Fum for SDH; αKG/Pyr and Glu/Ala for ALT; αKG/OAA and Glu/Asp for AST; Glu/αKG for GDH; and P5C/Pro and NADP/NADPH for PYCR. Then, an additional constraint was considered that must be satisfied for each of these competitive inhibitions involving a substrate and a product : (25)
Also, elasticities accounting for other inhibitions and activations were considered, which are affecting the activity of the following enzymes: PFK inhibited by ATP and Cit, and activated by ADP ; PK inhibited by Ala, Cys, Met, Phe, Val, Leu, Ile and Pro, and activated by FBP and Ser ; PC inhibited by Glu ; PC activated by ACoA.m ; SDH inhibited by Mal ; SDH inhibited by OAA.m ; GLS activated by ADP ; and GDH inhibited by ATP (GTP), and activated by ADP and Leu . For them, values were sampled into ranges between zero and one: (26) where A and I refer to an activator and an inhibitor, respectively. These constraints and bounded domains were imposed during the sampling of metabolite elasticities.
Following the applied matrix formulation [31,32], control coefficients are not only a function of metabolite elasticities, but also a function of the ratios between dependent fluxes and concentrations. The steady-state values for the flux ratios were approximated using average values for the reaction fluxes for control cells (before perturbation) and for CDK4/6 inhibited cells (S2 Table), which presented close flux distributions. For the moiety conservations involving six pairs of metabolites, the ratios for concentrations were set from literature as: 1) ACoA.c/CoA.c = 0.019 (cytosolic acetyl-CoA and CoA) and ACoA.m/CoA.m = 0.33 (mitochondrial acetyl-CoA and CoA) ; 2) ATP/ADP = 8, NAD.c/NADH.c = 120 and NAD.m/NADH.m = 6 ; and 3) NADP/NADPH = 0.52 .
Only matrices with all control coefficients within a -3.5 to +3.5 range were selected. As for the differences in concentrations, fluxes, and individual activities, values outside this enclosure were not accepted. Although it is arbitrarily set, this enclosure was added to avoid models with unrealistic sensitivities. In our particular application, to select 100 matrices of control coefficients, a total of 4686 were generated.
The S2 Table contains the list of reactions, stoichiometry, reaction fluxes, and disequilibrium ratios.
The initial domains for differences in all the reaction fluxes and some of the metabolite concentrations were restricted according to experimental observations comparing measurements at the states before the perturbation () and after the perturbation ().: (27)
A logarithm base of two was used, and therefore the differences are expressed as log2-fold changes. Lower and upper bounds were estimated from the lower and upper bounds of confidence intervals (fluxes) and mean ± standard deviation (concentrations) of the measured values after perturbation. Other factors, such as variations in volume, could lead to a proportional bias of all fluxes, concentrations, and activities in one state with respect to the other. In the original study, quantities for fluxes and concentrations were expressed as quantities per cell. Assuming a uniform distribution of the metabolic species in the cells, a normalization or correcting factor σ was applied to compare quantities per cell volume. The S3 Table contains the list of initial domains with additionally reduced bounds based on the measured differences.
All calculations were done using “Wolfram Mathematica 11/12” (www.wolfram.com). In particular, the function “LinearProgramming” was used to apply LP.
A Mathematica notebook, “DomainSolver.nb”, is provided to solve the final domains and identify negative and positive signs starting from calculated control coefficients and initial values. This makes the procedure fully reproducible, permitting the generation of the final domains used in Fig 2 (glycolysis-case) and Figs 4 and 5 (cancer-case). The Mathematica notebook contains an interactive script that requires to open two (glycolysis-case) or three (cancer-case) excel files (xlsx) with: 1) one matrix (glycolysis-case) or an ensemble of control-coefficient matrices (cancer-case); 2) list of initial domains (glycolysis-case and cancer-case); and 3) additional constraints for individual activities (only cancer-case, Eqs (13) and (14)). Also, a second Mathematica notebook, “ControlSolver.nb”, is provided to generate the control coefficients by random sampling of metabolite elasticities for the cancer-case study. This notebook contains another interactive script that requires to open one (cancer-case) excel file (xlsx) with a description of the network structure, flux values, disequilibrium ratios, substrate/product competitive inhibitions (Substrate competitions), inhibitors, activators, and moiety conservations. Two documents are provided with the instructions for use. These files are freely available on Zenodo at link http://dx.doi.org/10.5281/zenodo.5081161.
S2 Table. List of reactions, stoichiometry, reaction fluxes, and disequilibrium ratios.
- 1. Intlekofer AM, Finley LWS. Metabolic signatures of cancer cells and stem cells. Nat Metab. 2019;1(2):177–88. pmid:31245788
- 2. Tarrado-Castellarnau M, de Atauri P, Cascante M. Oncogenic regulation of tumor metabolic reprogramming. Oncotarget. 2016;7(38):62726–53. pmid:28040803
- 3. Rosato A, Tenori L, Cascante M, De Atauri Carulla PR, Martins Dos Santos VAP, Saccenti E. From correlation to causation: analysis of metabolomics data using systems biology approaches. Metabolomics. 2018;14(4):37. pmid:29503602
- 4. Balcells C, Foguet C, Tarragó-Celada J, de Atauri P, Marín S, Cascante M. Tracing metabolic fluxes using mass spectrometry: Stable isotope-resolved metabolomics in health and disease. Trends in Analytical Chemistry. 2019;120:115371.
- 5. Niedenfuhr S, Wiechert W, Noh K. How to measure metabolic fluxes: a taxonomic guide for (13)C fluxomics. Curr Opin Biotechnol. 2015;34:82–90. pmid:25531408
- 6. Orth JD, Thiele I, Palsson BO. What is flux balance analysis? Nat Biotechnol. 2010;28(3):245–8. pmid:20212490
- 7. Maranas CD, Zomorrodi AR. Optimization methods in metabolic networks. Hoboken, NJ: Wiley; 2016.
- 8. Vijayakumar S, Conway M, Lio P, Angione C. Seeing the wood for the trees: a forest of methods for optimization and omic-network integration in metabolic modelling. Brief Bioinform. 2018;19(6):1218–35. pmid:28575143
- 9. Karakitsou E, Foguet C, de Atauri P, Kultima K, Khoonsari PE, Martins dos Santos VAP, et al. Metabolomics in systems medicine: an overview of methods and applications. Curr Opin Syst Biol. 2019;15:91–9.
- 10. Costa RS, Hartmann A, Vinga S. Kinetic modeling of cell metabolism for microbial production. J Biotechnol. 2016;219:126–41. pmid:26724578
- 11. Saa PA, Nielsen LK. Formulation, construction and analysis of kinetic models of metabolism: A review of modelling frameworks. Biotechnol Adv. 2017;35(8):981–1003. pmid:28916392
- 12. Strutz J, Martin J, Greene J, Broadbelt L, Tyo K. Metabolic kinetic modeling provides insight into complex biological questions, but hurdles remain. Curr Opin Biotechnol. 2019;59:24–30. pmid:30851632
- 13. Savageau MA. Critique of the enzymologist’s test tube. In: Bittar EE, editor. Fundamentals of Medical Cell Biology. 3A. Greenwich, CT: JAI Press; 1992. pp. 45–108.
- 14. Teusink B, Passarge J, Reijenga CA, Esgalhado E, van der Weijden CC, Schepper M, et al. Can yeast glycolysis be understood in terms of in vitro kinetics of the constituent enzymes? Testing biochemistry. Eur J Biochem. 2000;267(17):5313–29. pmid:10951190
- 15. Cornish-Bowden A. Fundamentals of enzyme kinetics. 4th ed. Weinheim: Wiley-Blackwell; 2012.
- 16. Fell DA. Understanding the Control of Metabolism. London: Portland Press; 1996. https://doi.org/10.1016/s0026-0495(96)90067-0 pmid:8769367
- 17. Miskovic L, Tokic M, Savoglidis G, Hatzimanikatis V. Control Theory Concepts for Modeling Uncertainty in Enzyme Kinetics of Biochemical Networks. Ind Eng Chem Res. 2019;58(30):13544–54.
- 18. Heinrich R, Schuster S. The regulation of cellular systems. New York: Chapman & Hall; 1996.
- 19. Sauro H. Systems Biology: An Introduction to Metabolic Control Analysis. Ambrosius Publishing; 2019.
- 20. Voit EO. Computational analysis of biochemical systems: a practical guide for biochemists and molecular biologists. Cambridge: Cambridge University Press; 2000.
- 21. Savageau MA, Voit EO, Irvine DH. Biochemical systems theory and metabolic control theory: 2. the role of summation and connectivity relationships. Mathematical Biosciences. 1987;86(2):147–69.
- 22. Sorribas A, Pozo C, Vilaprinyo E, Guillen-Gosalbez G, Jimenez L, Alves R. Optimization and evolution in metabolic pathways: global optimization techniques in Generalized Mass Action models. J Biotechnol. 2010;149(3):141–53. pmid:20152867
- 23. Vera J, González-Alcón C, Marín-Sanguino A, Torres N. Optimization of biochemical systems through mathematical programming: Methods and applications. Computers & Operations Research. 2010;37(8):1427–38.
- 24. Xu G. Steady-state optimization of biochemical systems through geometric programming. Eur J Oper Res. 2013;225(1):12–20.
- 25. Xu G, Li Y. Steady-state optimization of biochemical systems by bi-level programming. Comput Chem Eng. 2017;106:286–96.
- 26. Childs D, Grimbs S, Selbig J. Refined elasticity sampling for Monte Carlo-based identification of stabilizing network patterns. Bioinformatics. 2015;31(12):i214–20. pmid:26072485
- 27. Grimbs S, Selbig J, Bulik S, Holzhutter HG, Steuer R. The stability and robustness of metabolic states: identifying stabilizing sites in metabolic networks. Mol Syst Biol. 2007;3:146. pmid:18004279
- 28. Miskovic L, Hatzimanikatis V. Production of biofuels and biochemicals: in need of an ORACLE. Trends Biotechnol. 2010;28(8):391–7. pmid:20646768
- 29. Curto R, Sorribas A, Cascante M. Comparative characterization of the fermentation pathway of Saccharomyces cerevisiae using biochemical systems theory and metabolic control analysis: model definition and nomenclature. Math Biosci. 1995;130(1):25–50. pmid:7579901
- 30. Kholodenko BN. How do external parameters control fluxes and concentrations of metabolites? An additional relationship in the theory of metabolic control. FEBS Lett. 1988;232(2):383–6. pmid:3378629
- 31. Cascante M, Franco R, Canela EI. Use of implicit methods from general sensitivity theory to develop a systematic approach to metabolic control. I. Unbranched pathways. Math Biosci. 1989;94(2):271–88. pmid:2520171
- 32. Cascante M, Franco R, Canela EI. Use of implicit methods from general sensitivity theory to develop a systematic approach to metabolic control. II. Complex systems. Math Biosci. 1989;94(2):289–309. pmid:2520172
- 33. Hofmeyr JH, Cornish-Bowden A. Quantitative assessment of regulation in metabolic systems. Eur J Biochem. 1991;200(1):223–36. pmid:1879427
- 34. Hatzimanikatis V, Emmerling M, Sauer U, Bailey JE. Application of mathematical tools for metabolic design of microbial ethanol production. Biotechnol Bioeng. 1998;58(2–3):154–61. pmid:10191385
- 35. Hatzimanikatis V, Floudas CA, Bailey JE. Analysis and design of metabolic reaction networks via mixed-integer linear optimization. AIChE J. 1996;42(5):1277–92.
- 36. Hatzimanikatis V, Bailey JE. Effects of spatiotemporal variations on metabolic control: approximate analysis using (log)linear kinetic models. Biotechnol Bioeng. 1997;54(2):91–104. pmid:18634077
- 37. Voit EO. Optimization in integrated biochemical systems. Biotechnol Bioeng. 1992;40(5):572–82. pmid:18601153
- 38. Hatzimanikatis V, Bailey JE. MCA has more to say. J Theor Biol. 1996;182(3):233–42. pmid:8944154
- 39. Hatzimanikatis V, Floudas CA, Bailey JE. Optimization of regulatory architectures in metabolic reaction networks. Biotechnol Bioeng. 1996;52(4):485–500. pmid:18629921
- 40. Torres NV, Voit EO, Glez-Alcón C, Rodríguez F. An indirect optimization method for biochemical systems: Description of method and application to the maximization of the rate of ethanol, glycerol, and carbohydrate production in Saccharomyces cerevisiae. Biotechnol Bioeng. 1997;55(5):758–72. pmid:18636586
- 41. Vera J, de Atauri P, Cascante M, Torres NV. Multicriteria optimization of biochemical systems by linear programming: application to production of ethanol by Saccharomyces cerevisiae. Biotechnol Bioeng. 2003;83(3):335–43. pmid:12783489
- 42. Xu G. Bi-objective optimization of biochemical systems by linear programming. Appl Math Comput. 2012;218(14):7562–72.
- 43. Alvarez-Vasquez F, Canovas M, Iborra JL, Torres NV. Modeling, optimization and experimental assessment of continuous L-(-)-carnitine production by Escherichia coli cultures. Biotechnol Bioeng. 2002;80(7):794–805. pmid:12402325
- 44. Marin-Sanguino A, Voit EO, Gonzalez-Alcon C, Torres NV. Optimization of biotechnological systems through geometric programming. Theor Biol Med Model. 2007;4:38. pmid:17897440
- 45. Xu G, Wang L. An Improved Geometric Programming Approach for Optimization of Biochemical Systems. J Appl Math. 2014;2014:1–10.
- 46. Guillen-Gosalbez G, Sorribas A. Identifying quantitative operation principles in metabolic pathways: a systematic method for searching feasible enzyme activity patterns leading to cellular adaptive responses. BMC Bioinf. 2009;10:386. pmid:19930714
- 47. Pozo C, Guillen-Gosalbez G, Sorribas A, Jimenez L. Identifying the preferred subset of enzymatic profiles in nonlinear kinetic metabolic models via multiobjective global optimization and Pareto filters. PLoS One. 2012;7(9):e43487. pmid:23028457
- 48. Puigjaner J, Raïs B, Burgos M, Comín B, Ovadi J, Cascante M. Comparison of control analysis data using different approaches: modelling and experiments with muscle extract. FEBS Lett. 1997;418(1–2):47–52. pmid:9414093
- 49. Sabate L, Franco R, Canela EI, Centelles JJ, Cascante M. A model of the pentose phosphate pathway in rat liver cells. Mol Cell Biochem. 1995;142(1):9–17. pmid:7753046
- 50. Tarrado-Castellarnau M, de Atauri P, Tarrago-Celada J, Perarnau J, Yuneva M, Thomson TM, et al. De novo MYC addiction as an adaptive response of cancer cells to CDK4/6 inhibition. Mol Syst Biol. 2017;13(10):940. pmid:28978620
- 51. Sriram G, Martinez JA, McCabe ER, Liao JC, Dipple KM. Single-gene disorders: what role could moonlighting enzymes play? Am J Hum Genet. 2005;76(6):911–24. pmid:15877277
- 52. Ortega F, Cascante M, Acerenza L. Kinetic properties required for sustained or paradoxical control of metabolic fluxes under large changes in enzyme activities. J Theor Biol. 2008;252(3):569–73. pmid:18045618
- 53. Mahadevan R, Schilling CH. The effects of alternate optimal solutions in constraint-based genome-scale metabolic models. Metab Eng. 2003;5(4):264–76. pmid:14642354
- 54. Burgard AP, Vaidyaraman S, Maranas CD. Minimal reaction sets for Escherichia coli metabolism under different growth requirements and uptake environments. Biotechnol Prog. 2001;17(5):791–7. pmid:11587566
- 55. Llaneras F, Pico J. An interval approach for dealing with flux distributions and elementary modes activity patterns. J Theor Biol. 2007;246(2):290–308. pmid:17292923
- 56. Gudmundsson S, Thiele I. Computationally efficient flux variability analysis. BMC Bioinf. 2010;11:489. pmid:20920235
- 57. Chowdhury A, Zomorrodi AR, Maranas CD. k-OptForce: integrating kinetics with flux balance analysis for strain design. PLoS Comput Biol. 2014;10(2):e1003487. pmid:24586136
- 58. Chan SHJ, Wang L, Dash S, Maranas CD, Wren J. Accelerating flux balance calculations in genome-scale metabolic models by localizing the application of loopless constraints. Bioinformatics. 2018;34(24):4248–55. pmid:29868725
- 59. Fell DA, Sauro HM. Metabolic control and its analysis. Additional relationships between elasticities and control coefficients. Eur J Biochem. 1985;148(3):555–61. pmid:3996393
- 60. Westerhoff HV, Kell DB. Matrix method for determining steps most rate-limiting to metabolic fluxes in biotechnological processes. Biotechnol Bioeng. 1987;30(1):101–7. pmid:18576589
- 61. Reder C. Metabolic control theory: a structural approach. J Theor Biol. 1988;135(2):175–201. pmid:3267767
- 62. Small JR, Fell DA. The matrix method of metabolic control analysis: its validity for complex pathway structures. J Theor Biol. 1989;136(2):181–97. pmid:2779266
- 63. Klipp E, Liebermeister W, Wierling C. Inferring dynamic properties of biochemical reaction networks from structural knowledge. Genome Inform. 2004;15(1):125–37. pmid:15712116
- 64. Alves R, Savageau MA. Extending the method of mathematically controlled comparison to include numerical comparisons. Bioinformatics. 2000;16(9):786–98. pmid:11108701
- 65. Kent E, Neumann S, Kummer U, Mendes P. What can we learn from global sensitivity analysis of biochemical systems? PLoS One. 2013;8(11):e79244. pmid:24244458
- 66. Wang L, Birol I, Hatzimanikatis V. Metabolic control analysis under uncertainty: framework development and case studies. Biophys J. 2004;87(6):3750–63. pmid:15465856
- 67. Chakrabarti A, Miskovic L, Soh KC, Hatzimanikatis V. Towards kinetic modeling of genome-scale metabolic networks without sacrificing stoichiometric, thermodynamic and physiological constraints. Biotechnol J. 2013;8(9):1043–57. pmid:23868566
- 68. Steuer R, Gross T, Selbig J, Blasius B. Structural kinetic modeling of metabolic networks. Proc Natl Acad Sci U S A. 2006;103(32):11868–73. pmid:16880395
- 69. Manalo DJ, Rowan A, Lavoie T, Natarajan L, Kelly BD, Ye SQ, et al. Transcriptional regulation of vascular endothelial cell responses to hypoxia by HIF-1. Blood. 2005;105(2):659–69. pmid:15374877
- 70. Elvidge GP, Glenny L, Appelhoff RJ, Ratcliffe PJ, Ragoussis J, Gleadle JM. Concordant regulation of gene expression by hypoxia and 2-oxoglutarate-dependent dioxygenase inhibition: the role of HIF-1alpha, HIF-2alpha, and other pathways. J Biol Chem. 2006;281(22):15215–26. pmid:16565084
- 71. Benita Y, Kikuchi H, Smith AD, Zhang MQ, Chung DC, Xavier RJ. An integrative genomics approach identifies Hypoxia Inducible Factor-1 (HIF-1)-target genes that form the core response to hypoxia. Nucleic Acids Res. 2009;37(14):4587–602. pmid:19491311
- 72. Chesney J, Clark J, Klarer AC, Imbert-Fernandez Y, Lane AN, Telang S. Fructose-2,6-bisphosphate synthesis by 6-phosphofructo-2-kinase/fructose-2,6-bisphosphatase 4 (PFKFB4) is required for the glycolytic response to hypoxia and tumor growth. Oncotarget. 2014;5(16):6670–86. pmid:25115398
- 73. Gao P, Tchernyshyov I, Chang TC, Lee YS, Kita K, Ochi T, et al. c-Myc suppression of miR-23a/b enhances mitochondrial glutaminase expression and glutamine metabolism. Nature. 2009;458(7239):762–5. pmid:19219026
- 74. Friedman MH. Principles and Models of Biological Transport. 2nd ed. New York: Springer; 2008.
- 75. Stein WD, Litman T. Channels, carriers, and pumps: an introduction to membrane transport. 2nd ed. London: Elsevier/Academic Press; 2015.
- 76. Acerenza L. Metabolic Control Design. J theor Biol. 1993;165(1):63–85. pmid:8264249
- 77. Liebermeister W, Uhlendorf J, Klipp E. Modular rate laws for enzymatic reactions: thermodynamics, elasticities and implementation. Bioinformatics. 2010;26(12):1528–34. pmid:20385728
- 78. Saa P, Nielsen LK. A general framework for thermodynamically consistent parameterization and efficient sampling of enzymatic reactions. PLoS Comput Biol. 2015;11(4):e1004195. pmid:25874556
- 79. Rohwer JM, Hofmeyr JH. Kinetic and thermodynamic aspects of enzyme control and regulation. J Phys Chem B. 2010;114(49):16280–9. pmid:21028763
- 80. Hofmeyr JH. Metabolic regulation: a control analytic perspective. J Bioenerg Biomembr. 1995;27(5):479–90. pmid:8718453
- 81. Hofmeyr JH, Cornish-Bowden A. The reversible Hill equation: how to incorporate cooperative enzymes into metabolic models. CABIOS. 1997;13(4):377–85. pmid:9283752
- 82. Barman TE. Enzyme handbook. Berlin: Springer; 1969.
- 83. Barman TE. Enzyme handbook: supplement. New York: Springer-Verlag; 1974. pmid:4838757
- 84. Zollner H. Handbook of enzyme inhibitors. 3rd rev. and enl. ed. Weinheim: Wiley-VCH; 1999.
- 85. Schomburg I, Chang A, Placzek S, Sohngen C, Rother M, Lang M, et al. BRENDA in 2013: integrated reactions, kinetic data, enzyme function data, improved disease classification: new options and contents in BRENDA. Nucleic Acids Res. 2013;41(Database issue):D764–72. pmid:23203881
- 86. Kemp RG, Gunasekera D. Evolution of the allosteric ligand sites of mammalian phosphofructo-1-kinase. Biochemistry. 2002;41(30):9426–30. pmid:12135364
- 87. Mazurek S. Pyruvate kinase type M2: a key regulator of the metabolic budget system in tumor cells. Int J Biochem Cell Biol. 2011;43(7):969–80. pmid:20156581
- 88. Zeczycki TN, Maurice MS, Attwood PV. Inhibitors of Pyruvate Carboxylase. Open Enzym Inhib J. 2010;3:8–26. pmid:22180764
- 89. Adina-Zada A, Zeczycki TN, Attwood PV. Regulation of the structure and activity of pyruvate carboxylase by acetyl CoA. Arch Biochem Biophys. 2012;519(2):118–30. pmid:22120519
- 90. Gutman M, Silman N. The steady state activity of succinate dehydrogenase in the presence of opposing effectors. I. The effect of L malate and CoQH2 on the enzymic activity. Mol Cell Biochem. 1975;7(1):51–8. pmid:1134499
- 91. Gutman M, Silman N. The steady state activity of succinate dehydrogenase in the presence of opposing effectors. II. Reductive activation of succinate dehydrogenase in presence of oxaloacetate. Mol Cell Biochem. 1975;7(3):177–85. pmid:239334
- 92. Masola B, Ngubane NP. The activity of phosphate-dependent glutaminase from the rat small intestine is modulated by ADP and is dependent on integrity of mitochondria. Arch Biochem Biophys. 2010;504(2):197–203. pmid:20831857
- 93. Li M, Li C, Allen A, Stanley CA, Smith TJ. The structure and allosteric regulation of glutamate dehydrogenase. Neurochem Int. 2011;59(4):445–55. pmid:21070828
- 94. Gumaa KA, McLean P, Greenbaum AL. Calculation of the intracellular distribution of Acetyl CoA and CoA, based on the use of citrate synthase as an equilibrium enzyme. FEBS Letters. 1973;29(2):193–6. pmid:4737025
- 95. Milo R, Jorgensen P, Moran U, Weber G, Springer M. BioNumbers—the database of key numbers in molecular and cell biology. Nucleic Acids Res. 2010;38(Database issue):D750–3. pmid:19854939