## Figures

## Abstract

Traditional (genome-scale) metabolic models of cellular growth involve an approximate biomass “reaction”, which specifies biomass composition in terms of precursor metabolites (such as amino acids and nucleotides). On the one hand, biomass composition is often not known exactly and may vary drastically between conditions and strains. On the other hand, the predictions of computational models crucially depend on biomass. Also elementary flux modes (EFMs), which generate the flux cone, depend on the biomass reaction. To better understand cellular phenotypes across growth conditions, we introduce and analyze new classes of elementary vectors for comprehensive (next-generation) metabolic models, involving explicit synthesis reactions for all macromolecules. Elementary growth modes (EGMs) are given by stoichiometry and generate the growth cone. Unlike EFMs, they are not support-minimal, in general, but cannot be decomposed “without cancellations”. In models with additional (capacity) constraints, elementary growth vectors (EGVs) generate a growth polyhedron and depend also on growth rate. However, EGMs/EGVs do not depend on the biomass composition. In fact, they cover all possible biomass compositions and can be seen as unbiased versions of elementary flux modes/vectors (EFMs/EFVs) used in traditional models. To relate the new concepts to other branches of theory, we consider autocatalytic sets of reactions. Further, we illustrate our results in a small model of a self-fabricating cell, involving glucose and ammonium uptake, amino acid and lipid synthesis, and the expression of all enzymes and the ribosome itself. In particular, we study the variation of biomass composition as a function of growth rate. In agreement with experimental data, low nitrogen uptake correlates with high carbon (lipid) storage.

## Author summary

Next-generation, genome-scale metabolic models allow to study the reallocation of cellular resources upon changing environmental conditions, by not only modeling flux distributions, but also expression profiles of the catalyzing proteome. In particular, they do no longer assume a fixed biomass composition. Methods to identify *optimal* solutions in such comprehensive models exist, however, an unbiased understanding of all *feasible* allocations is missing so far. Here we develop new concepts, called elementary growth modes and vectors, that provide a generalized definition of minimal pathways, thereby extending classical elementary flux modes (used in traditional models with a fixed biomass composition). The new concepts provide an understanding of all possible flux distributions *and* of all possible biomass compositions. In other words, elementary growth modes and vectors are the unique functional units in any comprehensive model of cellular growth. As an example, we show that lipid accumulation upon nitrogen starvation is a consequence of resource allocation and does not require active regulation. Our work puts current approaches on a theoretical basis and allows to seamlessly transfer existing workflows (e.g. for the design of cell factories) to next-generation metabolic models.

**Citation: **Müller S, Széliová D, Zanghellini J (2022) Elementary vectors and autocatalytic sets for resource allocation in next-generation models of cellular growth. PLoS Comput Biol 18(2):
e1009843.
https://doi.org/10.1371/journal.pcbi.1009843

**Editor: **Christoph Kaleta,
Christian Albrechts Universitat zu Kiel, GERMANY

**Received: **February 24, 2021; **Accepted: **January 18, 2022; **Published: ** February 1, 2022

**Copyright: ** © 2022 Müller et al. This is an open access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited.

**Data Availability: **All relevant data are within the manuscript and its Supporting information files.

**Funding: **This study was funded by the Austrian Science Fund (P33218) to SM and by the Austrian Centre of Industrial Biotechnology (acib - Next Generation Bioproduction) to DS and JZ. The funders had no role in study design, data collection and analysis, decision to publish, or preparation of the manuscript.

**Competing interests: ** The authors have declared that no competing interests exist.

## 1 Introduction

A major characteristics of life is self-fabrication, involving self-maintenance and self-replication. Cellular self-fabrication requires the acquisition and transformation of nutrients, not only to maintain the cell, but also to replicate, that is, to grow. During one cycle, a cell needs to duplicate all its building blocks and to cover the related costs. In constant environments, this process is balanced and leads to exponential growth. Indeed, exponential growth means that all cellular components are synthesized in proportion to their abundance. Clearly, metabolic activity depends on growth rate since higher growth rates imply higher synthesis rates which in turn require more ribosomes that produce the additional enzymes (and the ribosomes themselves). Thus, growth can be seen as a process that allocates cellular resources, limited by environmental and physico-chemical constraints.

Arguably, some of the most successful approaches to study (microbial) growth processes are rooted in constraint-based modeling [1]. At their heart sits a (genome-scale) metabolic model, which captures all possible (cell-specific) biochemical transformations in an annotated and mathematically structured form. The resulting stoichiometric matrix, coupled with environmental and physico-chemical constraints, is already sufficient to study (fundamental aspects of) growth. Various biased and unbiased methods have been developed to predict (steady-state) metabolic phenotypes [2]. However, first-generation genome-scale metabolic models do not involve the expression of proteins and hence do not account for the individual enzyme costs of metabolic fluxes. These models rather use a fixed biomass composition that represents the average costs of growth. Recent efforts focus on the development of next-generation genome-scale metabolic models that also account for the enzyme demands of individual reactions. As one prominent example, we mention resource balance analysis (RBA) [3]. Although specific approaches differ in their degree of mechanistic detail and mathematical formulation, the central concept in the analysis of next-generation models is the (optimal) allocation of resources. However, we currently lack a theoretical understanding of all feasible allocations in next-generation metabolic models.

In traditional, first-generation models, elementary flux modes and vectors (EFMs [4–6] and EFVs [7, 8]) allow an unbiased study of metabolic pathways, in particular, an interpretation of any *feasible* flux distribution in terms of unique functional units. On the other hand, flux balance analysis (FBA) enables the computationally efficient identification of *optimal* fluxes [9]. In this respect, RBA can be viewed as a generalization of FBA that also accounts for enzyme costs. It represents the corresponding tool to identify optimal fluxes (and corresponding optimal allocations) in comprehensive, next-generation models. However, the analogues of EFMs and EFVs have not been identified, yet. For kinetic models with one enzyme constraint, optimal solutions have been characterized as EFMs [10, 11], and “elementary growth states” have been introduced for kinetic models of cellular self-fabrication [12], but for constraint-based models, the corresponding theoretical concepts are still missing.

In this work, we introduce elementary growth modes and vectors (EGMs and EGVs) that are given by stoichiometry and irreversibility (EGMs) as well as by additional constraints and growth rate (EGVs). In fact, we develop a mathematical theory that enables an unbiased characterization of all feasible flux distributions in constraint-based models of cellular growth. In analogy to EFMs and EFVs, EGMs and EGVs can be interpreted as unique functional units of self-fabrication. Even more importantly, they provide an unbiased understanding of all possible biomass compositions. In a small example of a self-fabricating cell, we highlight that the experimentally observed lipid accumulation upon nitrogen starvation is a general feature of a comprehensive, next-generation metabolic model and not the result of active regulation.

Finally, we relate EGMs and EGVs to other branches of theory. Obviously, purely stoichiometric models do not reflect implications from autocatalysis (namely, that all catalysts of active reactions need to be synthesized) and kinetics (namely, that all species involved in active reactions need to be present with nonzero concentrations). We observe that additional (capacity) constraints often ensure that EGVs are autocatalytic.

## 2 Results

In this section, we introduce and analyze *elementary growth modes* and *vectors* for computational models of cellular growth. Further, we relate our approach to the theory of autocatalytic sets. In order to motivate the new concepts and to illustrate our theoretical results, we use a running example (a small model of a self-fabricating cell).

In Section 3 (Methods), we provide the relevant mathematical background (elementary vectors in polyhedral geometry), and in Section 4 (Discussion), we compare our theory with the constraint-based study of traditional models and the analysis of (semi-)kinetic models [12].

To begin with, we summarize the mathematical notation used throughout this work.

**Notation**. We denote the positive real numbers by and the nonnegative real numbers by . Let *I* be an index set; often *I* = {1, …, *n*}, and stands for . For , we write *x* > 0 if , *x* ≥ 0 if , and we denote its *support* by supp(*x*) = {*i* ∈ *I*∣*x*_{i} ≠ 0}. Recall that a nonzero vector is *support-minimal* (in *X*) if, for all nonzero *x*′ ∈ *X*, supp(*x*′) ⊆ supp(*x*) implies supp(*x*′) = supp(*x*). For , we define its *sign vector* sign(*x*) ∈ {−, 0, +}^{I} by applying the sign function component-wise, that is, sign(*x*)_{i} = sign(*x*_{i}) for *i* ∈ *I*. The relations 0 < − and 0 < + on {−, 0, +} induce a partial order on {−, 0, +}^{I}: for *X*, *Y* ∈ {−, 0, +}^{I}, we write *X* ≤ *Y* if the inequality holds component-wise. For , we denote the component-wise product by , that is, (*x* ∘ *y*)_{i} = *x*_{i} *y*_{i}. For and subindex set , we write for the corresponding subvector.

### 2.1 Growth model

The dynamic model of cellular growth studied in this work is given by the mass balance equation (1a) and, additionally, by the (dry) mass constraint (1b)

Here, *x* and *ω* are the vectors of concentrations and molar masses (of the molecular species), respectively, *v* is the vector of reaction rates, *N* is the stoichiometric matrix which captures the net effect of the chemical reactions (and other processes), and *μ* is the growth rate. We summarize the fundamental objects and quantities in Table 1 and provide a minimal derivation of the set of Eqs (1) in Section A in S1 Text. For alternative derivations, see e.g. [13] or [12].

In the following, we also consider (dimensionless) mass fractions . Obviously, .

By Eq (1), growth rate is given by (2)

By distinguishing between exchange and internal reactions, Eq (2) can be rewritten as
(3)
since *ω*^{T} *N*_{int} = 0, by mass conservation. In biological terms, growth rate is determined by the exchange reactions with the environment (the uptake of nutrients and the excretion of waste products).

We highlight two observations, proven in Section A in S1 Text:

*Conservation laws*. In a model of cellular growth, there cannot be any conservation laws. In mathematical terms, .*Dependent concentrations*. Still, there can be dependent concentrations. That is, possibly ker*N*^{T}≠ {0}.

In order to motivate the new concepts in this work and to illustrate our results, we will use a running example.

### 2.2 Example

Consider the small model of a self-fabricating cell shown in Fig 1a. The cell takes up glucose (G) and ammonium (N) via the catalytic reactions respectively, and forms amino acids (AA) and lipids,

**(a)** The cell has two types of importers, taking up glucose (G) and ammonium (N) from the environment, and three types of metabolic enzymes, synthesizing amino acids (AA) from glucose and ammonium, forming lipid droplets (LD) from glucose, and transferring lipids (L) to the membrane. Amino acids are used by the ribosome (R) to synthesize importers (IN, IG), metabolic enzymes (EAA, ELD, EL), and the ribosome itself. **(b)** The resulting stoichiometric matrix and the corresponding flux vector. (Thereby *v* is used for import and enzymatic reactions and *w* for synthesis reactions.) In traditional stoichiometric models, only the metabolic part of the stochiometric matrix (shaded in blue) is considered (together with a “biomass reaction”); see also Section 4.

In fact, lipids are stored in the form of lipid droplets (LD) and transferred to the membrane (L).

Amino acids are the essential building blocks for the importers (*I* = IG, IN), the enzymes (*E* = EAA, ELD, EL), and the ribosome R,

Thereby, stoichiometric coefficients are denoted by *n*_{i} with *i* ∈ {L, I, E, R}. The resulting stoichiometric matrix and the corresponding flux vector are displayed in Fig 1b, and parameter values are given in Table 2.

In the next subsection, we introduce the basic “building blocks” of any possible flux distribution in a (purely stoichiometric) model of cellular growth.

### 2.3 Elementary growth modes

At steady state, Eq (1a) implies (4)

Further, in a given setting, some reactions may have a given direction, as determined by thermodynamics. That is, (5) where denotes the set of irreversible reactions.

The inequalities *Nv* ≥ 0 and (for the fluxes *v*) specify a polyhedral cone which suggests the following definition.

**Definition 1**. *Growth modes* (GMs) for the dynamic growth model (1) are elements of the *growth cone*

A GM *v* ∈ *C*_{g} has an *associated* growth rate *μ*(*v*) = *ω*^{T} *Nv* ≥ 0 and, if *μ*(*v*) > 0, an *associated* concentration vector .

*Elementary* growth modes (EGMs) are conformally non-decomposable GMs.

At this point, we refer the reader to Subsection 3.1 for an introduction to elementary vectors in polyhedral geometry (in particular, for the definitions of polyhedral cones and conformal non-decomposability). Still, we also try to provide an intuitive understanding of Definition 1:

- GMs are fluxes given by stoichiometry and irreversibility; in particular, they do not depend on concentrations or growth rate. Still, GMs have an associated growth rate and an associated concentration vector, as given by Eqs (2) and (4).
- A growth cone
*C*_{g}is a general polyhedral cone. In contrast, a flux cone , arising also in traditional models of cellular growth, is a special cone lying in a subspace (the nullspace of the stoichiometric matrix). For visualizations of a special and a (more) general polyhedral cone, see Subsection 3.1. - Whereas the elementary vectors of a flux cone are support-minimal, the elementary vectors of a growth cone are conformally non-decomposable. Thereby, a conformally non-decomposable vector cannot be written as a sum of other vectors
*without cancellations*[14]. - In applications, the growth cone (as the flux cone) is often pointed (due to irreversibility).

GMs are scalable, but the associated concentrations are scale invariant.

**Proposition 2**. *For a GM v* ∈ *C*_{g} *with associated concentration x*(*v*) *and* λ > 0, *it holds that x*(λ*v*) = *x*(*v*).

Further, we can characterize the case of zero growth rate.

**Proposition 3**. *For a GM v* ∈ *C*_{g}, *μ*(*v*) = 0 *is equivalent to Nv* = 0.

*Proof*. Obviously, *Nv* = 0 implies *μ*(*v*) = *ω*^{T} *Nv* = 0. Conversely, *μ*(*v*) = *ω*^{T} *Nv* = 0 with and implies *Nv* = 0.

GMs *v* with *μ*(*v*) = 0 are flux modes (FMs), that is, elements of the flux cone

EGMs *v* with *μ*(*v*) = 0 are elementary flux modes (EFMs), that is, support-minimal elements of *C*_{f}. They are not support-minimal elements of *C*_{g}, in general. (Again recall that *N* is the stoichiometric matrix of a comprehensive model of cellular growth, and hence *C*_{f} and its EFMs are different from the corresponding objects in traditional models).

Finally, we apply the general theory of elementary vectors and state the main result of this subsection.

**Theorem 4**. *Every nonzero GM is a conformal sum of EGMs*.

*Proof*. By Theorem 11 in Subsection 3.1. The growth cone *C*_{g} is a general polyhedral cone, and its elementary vectors are the conformally non-decomposable vectors, that is, the EGMs.

In fact, EGMs with nonzero growth rate can be scaled to have the same growth rate as the given GM.

**Corollary 5**. *Let v be a nonzero GM with associated growth rate μ*(*v*) ≕ *μ*. *Then, there exist (possibly empty) finite sets E*_{0} *and E*_{μ} *of EGMs with associated growth rates 0 and μ* > 0, *respectively, such that* λ_{e} ≥ 0, *and* . *Moreover, if μ* > 0, *then* .

*Proof*. By Theorem 4, there exist (possibly empty) finite sets *E*_{0} and *E*_{>} of EGMs (with associated growth rates 0 and >0, respectively) such that

In particular, . If *μ* > 0, that is, *E*_{>} ≠ ∅, then
where with *μ*(*e*′) = *μ* and with .

Finally, if *μ* > 0, then .

In Corollary 5, we actually fix growth rate which turns the growth cone into a polyhedron. Hence, the result is also an instance of Theorem 12 in Subsection 3.1.

#### Example (EGMs).

Recall the small model of a self-fabricating cell given in Fig 1 and note that all reactions are assumed to be irreversible, . As it turns out, there are 11 EGMs (up to scaling), corresponding to the 11 molecular species, that is, with *i* ∈ Mol = {G, N, AA, LD, L, IG, IN, EAA, ELD, EL, R}. Explicitly,

Thereby, we introduced the factors *μ*/*ω*_{i} to obtain correct units (mol g^{−1} h^{−1}) and equal growth rate *μ* for all EGMs. We note the following points:

- Every EGM “produces” exactly one molecular species, as indicated by its name. (For every EGM, there is exactly one molecular species with nonzero associated concentration.) Formally,
*Ne*^{i}= (*μ*/*ω*_{i})*u*^{i}, where*u*^{i}is the*i*th unit vector in . That is,*e*^{G}produces G (glucose),*e*^{N}produces N (ammonium), …, and*e*^{R}produces R (ribosome).

This special situation arises from the fact that the stoichiometric matrix*N*is square. In general, there may be more than one EGM for the exclusive production of some molecular species or, conversely, no EGM for exclusive production (just joint production with another species). - EGMs
*e*^{G}and*e*^{N}are support-minimal, but all other EGMs are not. - For the associated growth rates, we obtain
*μ*(*e*^{i}) =*ω*^{T}*Ne*^{i}=*μ*/*ω*_{i}⋅*ω*^{T}*u*^{i}=*μ*/*ω*_{i}⋅*ω*_{i}=*μ*. (This is just the consequence of introducing the factors above.) For the associated concentrations, we obtain*x*(*e*^{i}) =*Ne*^{i}/*μ*(*e*^{i}) = (*μ*/*ω*_{i})*u*^{i}/*μ*=*u*^{i}/*ω*_{i}. - There are no EGMs with zero growth rate, that is, there are no EFMs.

To give an example of a conformal sum, we consider the GM
with support in the import/enzymatic reactions *r*_{IG}, *r*_{IN}, *r*_{EAA} and in the synthesis reactions *s*_{IG}, *s*_{IN}. In particular, *v*′ is a GM without lipid synthesis. Now, let *μ*(*v*′) = *μ*. By Theorem 4 (and Corollary 5), *v*′ is a conformal (and convex) sum of EGVs,
with λ_{G}, λ_{N}, λ_{AA} ≥ 0, λ_{IG}, λ_{IN}> 0 and λ_{G} + λ_{N} + λ_{AA} + λ_{IG} + λ_{IN} = 1. Note that *v*′ produces IG, IN (since λ_{IG}, λ_{IN}> 0), however, it produces G, N, AA if and only if also λ_{G}, λ_{N}, λ_{AA} > 0.

Since GMs are given by stoichiometry (and irreversibility), they do not reflect constraints implied by autocatalysis and kinetics.

#### Example (autocatalysis and kinetics).

Obviously, the GM *v*′ above involves catalytic reactions. In particular, it involves *r*_{EAA} (amino acid synthesis) as well as *s*_{IG}, *s*_{IN} (the expression of the importers). However, it is not catalytically closed in the sense that reactions *r*_{EAA} and *s*_{IG}, *s*_{IN} carry fluxes, but the corresponding catalysts, the enzyme EAA and the ribosome R, are not expressed. Now, EGV *s*^{EAA} involves *e*_{EAA} (the expression of EAA), and analogously *e*^{R} involves *s*_{R}. Hence, we form a convex sum
with λ_{EAA}, λ_{R} > 0, to obtain a GM that is catalytically closed.

Further, the EGM *e*^{G} is kinetically consistent in the sense that it involves the reaction *r*_{IG} and hence the species G *and* it has a corresponding nonzero associated concentration *x*_{G}(*e*^{G}) > 0, as implied by kinetics. By the discussion above, the GM *v*′ is kinetically consistent if and only if λ_{G}, λ_{N}, λ_{AA} > 0. In this case, all species involved in active reactions have nonzero associated concentrations.

In the next subsection, we elaborate on the concepts of catalytic closure and kinetic consistency.

### 2.4 Implications from autocatalysis and kinetics

Cellular growth is autocatalytic in the sense that the cell fabricates itself (thereby exchanging substrates/products with the environment). One needs to distinguish this notion of “network autocatalysis” from “autocatalytic subnetworks” [15, 16].

Consider the overall reaction corresponding to (the flux through) a subnetwork. If a molecular species appears on both the educt and product sides, in particular, with a larger stoichiometric coefficient on the product side than on the educt side, then it is *formally autocatalytic* (cf. [17]). In fact, there are several competing notions of autocatalytic species and subnetworks (cf. [15–17]).

In this work, we consider network autocatalysis. Before we state possible definitions, we distinguish two modeling approaches.

**Detailed models**(without individual catalytic reactions):

In this approach, catalysis occurs on the level of (small) subnetworks. In particular, individual reactions are not catalytic. For example, a simple catalytic mechanism (involving enzyme E, substrate S, and product P) is given by E + S ↔ ES ↔ EP ↔ E + P.**Coarse-grained models**(with individual catalytic reactions):

In this approach, catalysis occurs on the level of individual reactions. For example, the catalytic mechanism above is written as E + S ↔ E + P or . Due to coarse-graining, catalysis cannot be identified from the stoichiometric matrix. Hence, for every catalytic reaction, the corresponding catalyst is specified explicitly.

For **detailed models**, one may call a growth mode autocatalytic if it contains an autocatalytic species or subnetwork and it is catalytically closed. Formal definitions and their comparison are beyond the scope of this work. For **coarse-grained models** (like the small model of a self-fabricating given in Fig 1), we give a formal definition of network autocatalysis.

**Definition 6**. For a coarse-grained model, let Cat ⊆ Rxn be the set of catalytic reactions. A GM *v* ∈ *C*_{g} is *basically catalytic* (BC) if there is a catalytic reaction *r* ∈ supp (*v*) ∩ Cat. Further, a GM *v* ∈ *C*_{g} is *catalytically closed* (CC) if, for every catalytic reaction *r* ∈ supp (*v*) ∩ Cat, it holds that (*Nv*)_{s} > 0 for the corresponding catalyst *s* ∈ Mol. Finally, a GM *v* ∈ *C*_{g} is *autocatalytic* (AC) if it is BC and CC.

A subset of reactions is *autocatalytic* (AC) if there exists an autocatalytic GM *v* ∈ *C*_{g} with *S* = supp(*v*). A nonempty subset of reactions is *minimally* autocatalytic (MAC) if it is AC and inclusion-minimal.

In the literature, a closure condition is also crucial in the definitions of “reflexive autocatalysis” [18–20] and “chemical organizations” [21–23].

We note the following points:

- AC is implied by two conditions: BC guarantees that there is at least one active catalytic reaction, and CC ensures that all active catalysts are produced.

For an illustration, recall the motivating paragraph “Example (autocatalysis and kinetics)” just before this subsection. In the running example, all reactions are catalytic, and hence all GMs are BC. Whereas the GM*v*′ is not CC (the active enzyme EAA and the ribosome R are not produced), the GM*v*″ is CC and hence AC. Its support supp(*v*″) = {*r*_{IG},*r*_{IN},*r*_{EAA};*s*_{IG},*s*_{IN},*s*_{EAA},*s*_{R}} is an AC subset of reactions; in fact, it is the only MAC subset of reactions. - Catalytic closure is defined for fluxes, but it refers to concentrations via
*x*(*v*)_{s}= (*Nv*)_{s}/*μ*(*v*). In particular, (*Nv*)_{s}> 0 implies*x*(*v*)_{s}> 0.

In addition to autocatalysis, also kinetics implies constraints on growth modes. In particular, a growth mode is kinetically consistent if all species involved in active reactions (not necessarily all species in the model) have nonzero associated concentrations.

**Definition 7**. A GM *v* ∈ *C*_{g} is *kinetically consistent* if, for every *r* ∈ supp (*v*) ⊆ Rxn and *s* ∈ Mol, *N*_{sr} ≠ 0 implies (*Nv*)_{s} > 0.

We note the following points:

- Like catalytic closure, kinetic consistency is given by stoichiometry.
- Interestingly, kinetic consistency implies formal autocatalysis for all (not necessarily catalytic) species involved.

In the next subsection, we consider additional (capacity) constraints which often ensure catalytic closure.

### 2.5 Constraint-based models

For many systems, kinetic models are not yet available, and constraint-based models are used. Steady-state reaction rates (fluxes) *v* are considered as independent variables, that is, the non-linear dependence of the kinetics on the concentrations *x* is neglected.

Most importantly, catalytic processes imply linear capacity constraints for *x* and *v*. Additional constraints can be formulated for processes that are not catalytic (in the given model), e.g. lower bounds for concentrations or fluxes. In compact form, linear constraints can be can be written as
with , , and .

Altogether, *constraint-based* growth models involve steady-state, irreversibility, (dry) mass, and additional linear constraints,
(6)

#### Example (additional constraints).

Recall the small model of a self-fabricating cell given in Fig 1. In addition to steady-state, irreversibility, and (dry) mass constraints, we consider capacity constraints for all catalysts (importers, enzymes, and the ribosome) and membrane constraints.

Let *k*_{cat} be the turnover number of the importers IG, IN and enzymes EAA, ELD, EL and *k*_{el} be the elongation rate of the ribosome R. The resulting capacity constraints are given by
(7a)
for *I* = IG, IN and *E* = EAA, ELD, EL and
(7b)

The cell membrane area is formed by lipids L and importers IG and IN, leading to the (equality) constraint
(7c)
where *A*_{L} and *A*_{I} denote the areas of lipids and importers, respectively, *r* denotes the surface-to-volume ratio, *ρ* denotes cell density, and *N*_{A} is Avogadro’s number.

Additionally, we require that a minimum fraction *α* of the surface area is formed by lipids, leading to the (inequality) constraint
(7d)

For the derivation of the membrane constraints, see Section B in S1 Text. As stated above, the additional constraints (7) can be summarized as *Ax* + *Bv* ≥ *b*.

All constraints are based on realistic data. The parameter values for the small model of a self-fabricating cell are given in Table 2.

In the final subsection, we introduce the basic “building blocks” of any possible flux distribution in a constraint-based model of cellular growth.

### 2.6 Elementary growth vectors

For given *μ* > 0 and flux vector *v*, the concentration vector *x* = *Nv*/*μ* is not an independent variable. In particular, we have *Nv* = *μx* ≥ 0 and *ω*^{T} *Nv* = *ω*^{T}(*μx*) = *μ*. Hence, the constraint-based growth model (6) is equivalent to constraints just in terms of *v*,

The inequalities above (for the fluxes *v*) specify a polyhedron which suggests the following definition.

**Definition 8**. Let *μ* > 0. *Growth vectors* (GVs) for the constraint-based growth model (6) are elements of the *growth polyhedron*

A GV *v* ∈ *P*_{g}(*μ*) has an *associated* concentration vector .

*Elementary* growth vectors (EGVs) are convex-conformally non-decomposable GVs and conformally non-decomposable elements of the recession cone

For the definitions of polyhedra, convex-conformal non-decomposability, and the recession cone, we refer the reader to Subsection 3.1. Still, we also try to provide an intuitive understanding of Definition 8:

- GVs are fluxes given by stoichiometry, irreversibility, additional constraints,
*and*growth rate. (If growth rate was not fixed, then the problem would become nonlinear, and the resulting set would not be a polyhedron.) In particular, GVs do not depend on concentrations. Still, GVs have an associated concentration vector. - Every polyhedron can be written as the sum of a polytope and the recession cone. Moreover, every polyhedron has a unique set of conformal generators, and these are the convex-conformally non-decomposable vectors (generating a polytope) and the conformally non-decomposable elements of the recession cone. Thereby, a convex-conformally non-decomposable vector cannot be written as a convex sum of other vectors without cancellations [14].
- The recession cone of the growth polyhedron is a subset of the flux cone. In applications, the recession cone (as a subset of the flux cone) is often pointed (due to irreversibility).

EGMs and EGVs can be computed directly, by algorithms for vertex enumeration (such as lrs [29] or EFMlrs [30]), or indirectly (after transforming a system of inequalities to a system of equalities), by algorithms for EFM enumeration (such as efmtool [31]).

Again, we apply the general theory of elementary vectors and state the main result of this subsection.

**Theorem 9**. *Let v be a GV for growth rate μ* > 0. *Then, there exist finite sets E*_{0} ⊆ *R and E*_{μ} ⊆ *P*_{g}(*μ*) *of EGVs such that* λ_{e} ≥ 0, *and* . *Moreover*, .

*Proof*. By Theorem 12 in Subsection 3.1. The growth polyhedron *P*_{g}(*μ*) is a general polyhedron, and its elementary vectors are the convex-conformally non-decomposable vectors and the conformally non-decomposable vectors of its recession cone.

Note that every GV is also a GM, and hence every nonzero GV is also a conformal sum of EGMs.

#### Example (EGVs).

Recall the small model of a self-fabricating cell given in Fig 1 and consider the additional constraints (7). For convenience, the essential constraints are summarized in Table A in S1 Text. We compute EGVs as functions of growth rate, and we observe two distinct regimes. From zero to a critical growth rate *μ*_{crit} = 1.21 h^{−1}, we find 24 EGVs. From *μ*_{crit} to the maximum growth rate *μ*_{max} = 1.27 h^{−1}, we find 10 EGMs. Eight of these EGVs exist for all growth rates, 16 only in regime L (0 < *μ* < *μ*_{crit}) and two only in regime H (*μ*_{crit} < *μ* < *μ*_{max}). At *μ*_{crit}, two sets of eight EGVs (existing in regime L) merge into two EGVs (in regime H). At *μ*_{max}, all EGVs (existing in regime H) merge. EGVs and their characteristic properties are summarized in Table 3.

Totally, there are 26 EGVs, and every EGV has full support. This is a consequence of the additional (capacity and membrane) constraints which ensure that all reactions are active. As a consequence, every EGV is autocatalytic, and the set of all reactions is the only autocatalytic set of reactions.

In contrast to EFMs, EGVs are not determined by their supports, but they are *characterized* by their sets of *inactive* inequality constraints. For every EGV, two constraints are inactive, and all other constraints are active. Further, for every EGV, the associated concentrations of lipids L and macromolecules {IG, IN, EAA, ELD, EL, R} are nonzero. However, at most one of the associated concentrations of the metabolites {G, LD, N, AA} is nonzero. That is, no EGV is kinetically consistent.

In the example, the cell takes up only glucose and ammonium (and does not excrete any species). Hence, Eq (3) takes the form *μ* = *ω*_{G} *v*_{IG} + *ω*_{N} *v*_{IN}, and we introduce the relative contributions of the import fluxes to growth rate,
with *γ*_{IG} + *γ*_{IN} = 1. The relative contribution of ammonium uptake to growth as a function of growth rate is visualized in Fig 2.

Note that *γ*_{IG} = 1 − *γ*_{IN}. Colors correspond to classes of EGVs described in the main text.

As it turns out, EGVs can be grouped into six classes regarding their uptake/storage behavior, namely into classes

Thereby, EGVs in classes 1a, 2a, and 3a exist for all growth rates, whereas EGVs in classes 1b, 2b, and 3b exist either in regime L or H.

Fig 2 highlights the uptake behaviors of the classes of EGVs. In classes 3a and 3b (turquoise), we observe an (almost) constant ammonium uptake of *γ*_{IN} = 0.091 (independently of growth rate). This corresponds to equal uptake rates of glucose and ammonium. In classes 1a (blue, dotted) and 1b (blue, dashed), ammonium uptake rate is smaller than glucose uptake rate, whereas in classes 2a (red, dotted) and 2b (red, dashed), it is larger. However, all EGVs approach balanced uptake for high growth rates. Classes 1b and 2b (dashed, existing in regime L) reach balanced uptake at *μ*_{crit}, whereas classes 1a and 2a (dotted, existing for all growth rates) reach it at *μ*_{max}.

For classes with unbalanced uptake behavior, there is an asymmetry between ammonium and glucose uptake at *μ* → 0. For smaller ammonium uptake, *γ*_{IN} → 0, whereas for larger ammonium uptake *γ*_{IN} → 0.87 (instead of *γ*_{IN} → 1). This asymmetry is related to the membrane equality constraint (7c).

Next, we analyze the correlation between ammonium uptake *γ*_{IN} and the mass fraction *s*_{C} = *ω*_{G} *x*_{G} + *ω*_{LD} *x*_{LD} + *ω*_{L} *x*_{L}, corresponding to carbon storage in glucose and, most importantly, in lipids. As expected, when ammonium is limiting, then glucose is accumulated or rerouted towards lipid synthesis (instead of amino acid synthesis). Ammonium uptake *γ*_{IN} and carbon storage *s*_{C} as a function of growth rate are visualized in Fig 3.

Panels correspond to classes of EGVs described in the main text. Ammonium uptake in blue/red/turquoise (see also Fig 2). Carbon storage in black (dashed and solid: *x*_{L} > 0; dashed: *x*_{G} = 0, *x*_{LD} > 0 or *x*_{G} > 0, *x*_{LD} = 0; solid: *x*_{G} = *x*_{LD} = 0).

In particular, Fig 3 highlights the uptake/storage behaviors of the six classes of EGVs defined above. The left panels correspond to EGVs in classes 1a and 1b. For low growth rate, they display low ammonium uptake and high carbon storage. The middle panels correspond to classes 2a and 2b, where EGVs display high ammonium uptake and low carbon storage. In fact, ammonium is accumulated. Finally, the right panels correspond to classes 3a and 3b. EGVs display balanced uptake behavior and low carbon storage.

In S1 Text, we provide details about the individual EGVs. For each of the 26 EGVs, we display the fluxes (of the 11 reactions) and the associated mass fractions (of the 11 species), as a function of growth rate. See Figs A and B in S1 Text.

Conversely, for each of the 11 species, we show their mass fractions in the 26 EGVs, and analogously, for each of the 11 reactions, we show their fluxes in the 26 EGVs. See Fig 4 in the main text and Fig C in S1 Text.

### 2.7 Biological interpretation

We used our theoretical concepts to study the storage behavior in the small model of a self-fabricating cell given in Fig 1. In particular, we identified 26 EGVs in three classes regarding their storage behaviors (cf. Fig 3):

- carbon accumulates in the form of glucose or lipids;
- nitrogen accumulates in the form of ammonium;
- ammonium and glucose are stored together in the form of proteins.

We observed that carbon accumulation (class 1) goes hand in hand with low ammonium uptake. In fact, a common strategy to increase cellular carbon content in biotechnological applications is to limit the source of nitrogen [32–40]. Restricting nitrogen limits the synthesis of amino acids and proteins, so the remaining carbon source can be rerouted towards lipid and carbohydrate synthesis, which does not require nitrogen. Remarkably, this effect is a feature of a constraint-based metabolic model and does not require active regulation. Experimentally, this was also observed for an oleaginous fungus, where nitrogen limitation leads to triglyceride accumulation without an up-regulation of the triglyceride synthesis genes, which suggests that it results from a reallocation of resources [41]. Experimentally, nitrogen limitation sometimes leads to the accumulation of fat [32–40] or starch with [36, 38] or without [42] a simultaneous lipid accumulation. All of these possibilities are covered by EGVs and agree with our observations. Of course, which of these behaviors is realized is the result of cellular regulation and hence outside the scope of our analysis.

Further, nitrogen accumulation (class 2) takes place when ammonium uptake exceeds glucose import. In fact, most of the ammonium accumulates since protein synthesis is limited by insufficient glucose import. Still, carbohydrate concentrations remain at normal levels. Such phenotypes have been observed in species such as diatoms, foraminifers, and fungi [43].

The last behavior (class 3) occurs when ammonium and glucose uptake are balanced. The individual EGVs correspond to situations where a certain protein’s abundance is larger than required by the flux level of the catalyzed reaction [44, 45]. This mirrors the fact that enzyme expression is not necessarily indicative of flux levels.

At high growth rate, storage becomes exceedingly expensive for a cell since most resources are required to support growth [44]. Consistently, all EGVs show decreasing storage with increasing growth rate. At maximum growth rate, all EGVs (that exist for high growth rate) merge. Uptake is balanced, and there is no storage (except lipids in the membrane).

Alternatively, the 26 EGVs can be grouped into three classes regarding their ribosome content, namely into classes

Fig 5 highlights the ribosome mass fraction *ω*_{R} *x*_{R} (as a function of growth rate) of the three classes.

Panels correspond to classes of EGVs described in the main text.

In class II, the ribosome content is linear as a function of growth rate. This behavior has been observed experimentally (however, with an offset at zero growth rate) [46] and explained theoretically (thereby assuming growth rate-independent “housekeeping” proteins) [47].

In class III, we observe an (almost) constant (and high) ribosome mass fraction of *ω*_{R} *x*_{R} = 0.9 (independently of growth rate). EGVs in this class may describe exactly those ribosomes that are required for the production of growth rate-independent “housekeeping” proteins (and explain the offset at zero growth rate).

In class I, the ribosome content is non-linear at high growth rates. EGVs in this class have nonzero associated metabolite concentrations and may explain a potential non-linear behavior.

Altogether, the experimentally observed ribosome mass fraction [46] can be understood as a convex, conformal sum of EGVs from the three classes.

## 3 Methods

### 3.1 Elementary vectors in polyhedral geometry

For the objects of polyhedral geometry (subspaces, cones, polyhedra), there is no *unique* minimal set of generators, in general. However, *elementary vectors* (EVs) form unique sets of *conformal* generators [14, Section 3.4]. For linear subspaces and s-cones (arising from linear subspaces and nonnegativity constraints), elementary vectors are the support-minimal (SM) vectors; for general polyhedral cones, they are the conformally non-decomposable (cND) vectors; and for general polyhedra, they are the convex-conformally non-decomposable (ccND) vectors plus the cND vectors of the recession cone [14].

Below, we summarize basic definitions and results for s-cones, general polyhedral cones, and polyhedra.

#### S-cones.

Given a linear subspace and a subindex set , an *s-cone* (special cone, subspace cone) is given by . Note that a linear subspace is an s-cone, *S* = *C*(*S*, ∅).

A vector is *elementary* if it is SM. (For linear subspaces, the definition of elementary vectors (EVs) as SM vectors was given in [48]).

The following result is fundamental. See [14, Theorem 3] based on [48, Theorem 1].

**Theorem 10**. *Let* *be an s-cone. Every nonzero vector* *is a conformal sum of EVs. That is, there exists a finite set E of EVs such that*

*The set E can be chosen such that* |*E*| ≤ dim(*S*) *and* |*E*| ≤ |supp(*x*)|.

**Example**. Consider a minimal metabolic network, involving one internal molecular species *X*, where substrate is taken up, and two products are excreted, that is, (*S*)→*X*, *X* → (*P*_{1}), *X* → (*P*_{2}), leading to the stoichiometric matrix

The flux cone is an s-cone, with *S* = ker *N* and (since all reactions are assumed to be irreversible). The cone is generated by two (support-minimal) EVs, *e*^{2} = (1, 1, 0)^{T} and *e*^{3} = (1, 0, 1)^{T}. The cone is indicated in Fig 6.

#### General polyhedral cones.

Let *C* be a polyhedral cone, that is, for some . A nonzero vector *x* ∈ *C* is *conformally non-decomposable* (cND) if, for all nonzero *x*^{1}, *x*^{2} ∈ *C* with sign(*x*^{1}), sign(*x*^{2}) ≤ sign(*x*), the decomposition *x* = *x*^{1} + *x*^{2} implies *x*^{1} = λ*x*^{2} with λ > 0. A vector *x* ∈ *C* is *elementary* if it is cND.

By defining EVs as cND vectors (instead of SM vectors), Theorem 10 can be extended to general polyhedral cones. See [14, Theorem 8].

**Theorem 11**. *Let* *be a polyhedral cone. Every nonzero vector x* ∈ *C is a conformal sum of EVs. That is, there exists a finite set E of EVs such that*

*The set E can be chosen such that* |*E*| ≤ dim(*C*) *and* |*E*| ≤ |supp(*x*)| + |supp(*Ax*)|.

**Example**. Consider a minimal growth model, involving three internal molecular species, a precursor *P*, a metabolic “enzyme” *E*, and the ribosome *R*, where the syntheses of *P*, *E*, and *R* are catalyzed, that is, , , , leading to the stoichiometric matrix

The growth cone is a general polyhedral cone, that is, with where I_{3} is the identity matrix. The cone is generated by three (conformally non-decomposable) EVs, *e*^{1} = (1, 0, 0)^{T}, *e*^{2} = (1, 1, 0)^{T}, and *e*^{3} = (1, 0, 1)^{T}. (Note that only *e*^{1} is SM.) The cone is indicated in Fig 7.

#### Polyhedra.

Let *P* be a polyhedron, that is, for some and . A vector *x* ∈ *P* is *convex-conformally non-decomposable* (ccND) if for all *x*^{1}, *x*^{2} ∈ *P* with sign(*x*^{1}), sign(*x*^{2}) ≤ sign(*x*) and 0 < λ < 1, the decomposition *x* = λ*x*^{1} + (1 − λ)*x*^{2} implies *x*^{1} = *x*^{2}.

Let be the *recession cone* of *P*. A vector *e* ∈ *P* ∪ *R* is *elementary* (an EV of *P*) if *e* ∈ *P* is ccND or *e* ∈ *R* is cND.

Ultimately, Theorem 10 can be extended to general polyhedral cones. See [14, Theorem 13].

**Theorem 12**. *Let P* = {*x* ∣ *Ax* ≥ *b*} *be a polyhedron and R* = {*x* ∣ *Ax* ≥ 0} *its recession cone. Every vector x* ∈ *P is a conformal sum of EVs. That is, there exist finite sets E*_{0} ⊆ *R and E*_{1} ⊆ *P of EVs such that* λ_{e} ≥ 0, *and* . (*Hence*, |*E*_{1}| ≥ 1).

*The set E* = *E*_{0} ∪ *E*_{1} *can be chosen such that* |*E* |≤ dim(*P*) + 1 *and* |*E*| ≤ |supp(*x*)| + |supp(*Ax*)| + 1.

### 3.2 Software

Simulations were performed in Python 3.7.9 using the package efmtool 0.2.0 [31]. Fig 1 was created with BioRender.com. The remaining figures were created with R 4.0.2. All code is available on GitHub at https://github.com/diana-sz/EGMs.

## 4 Discussion

In traditional models of cellular growth, elementary flux modes and vectors (EFMs [4, 5] and EFVs [7, 8]) allow the unbiased characterization of all feasible metabolic phenotypes in terms of unique functional units, namely EFMs (in flux cones) and EFVs (in flux polyhedra). More specifically, a certain cellular function can only be performed if there is a corresponding EFM or EFV. However, EFMs and EFVs depend on a fixed biomass composition, which is specified as a biomass “reaction” in addition to the actual metabolic reactions. As a consequence, EFMs and EFVs are not able to describe cellular resource reallocations, in particular, upon changes of environmental conditions [49–51].

In more comprehensive models, the biomass reaction is replaced by explicit synthesis reactions for all (model-specific) macromolecules. In such next-generation models (as used in RBA [3]), biomass composition is no longer fixed, but results from the expression of all proteins (and the synthesis of other macromolecules). In fact, the concentrations *x* = *Nv*/*μ* of all molecular species are determined by the stoichiometric matrix *N*, the fluxes *v*, and growth rate *μ*, cf. Eq (4).

In this work, we developed a mathematical theory that allows the unbiased characterization not only of all feasible metabolic phenotypes, but also of all feasible biomass compositions in comprehensive, next-generation models. In analogy to EFMs and EFVs, we introduced elementary growth modes and vectors (EGMs and EGVs) which can be interpreted as unique functional units of self-fabrication.

Since EGMs are the basic “building blocks” of any possible flux distribution in *purely stoichiometric* models of cellular growth, they are typically not “self-fabricating” themselves. In particular, they do not reflect implications from autocatalysis, namely that all catalysts of active reactions are synthesized. Still, additional capacity constraints of the form *v*_{E} ≤ *k*_{cat} *x*_{E} ensure nonzero enzyme concentrations *x*_{E} for nonzero fluxes *v*_{E}. As a consequence, EGVs in *constraint-based* models are often autocatalytic, as in our running example.

### Comparison with traditional models

It is elucidating to explicitly compare our theory for comprehensive growth models with the analysis of traditional models. In the running example, we take the metabolic part of the stoichiometric matrix (shaded in blue in Fig 1) and add a fixed biomass composition as the last column; see Fig 8 for the resulting stoichiometric matrix. In fact, we choose the biomass composition at maximum growth rate (computed for the comprehensive growth model), cf. Fig 4. Instead of the concentrations of the individual enzymes and the ribosome (which are not part of the traditional model), we use their amino-acid content. The total amino acid concentration is given by

The matrix consists of the metabolic part (shaded in blue, cf. Fig 1) and the biomass composition (last column). Scaling with the concentration is required to obtain dimensionless entries.

Additionally, we consider enzyme capacity constraints as given in (7a) using the enzyme concentrations at maximum growth. We obtain one EFM and one EFV in the constraint-based model. In other words, the traditional model is feasible, but does not provide further information. This is in stark contrast to the comprehensive model, where we identified 11 EGMs and 10 (or 24) EGVs in the constraint-based model (depending on growth rate), cf. Subsections 2.3 and 2.6. Most importantly, our EGM/EGV analysis provided a characterization of all feasible biomass compositions. In biological terms, we showed that the experimentally observed carbon accumulation upon nitrogen starvation is predicted by our small model of a self-fabricating cell.

### Comparison with (semi-)kinetic models

Recently, de Groot *et al*. [12] analyzed *(semi-)kinetic* models of cellular growth, and one of the authors of this work started a study of constraint-based and (semi-)kinetic models; for a preprint, see [52]. Here, we compare the theoretical concepts introduced in [12] and in this work by means of an example, namely, the minimal growth model shown in Fig 9a. The following discussion is based on a complete mathematical analysis given in Section D in S1 Text.

**(a)** The minimal growth model with alternative pathways is given by 4 molecular species {AA, E1, E2, R} (circles) and 5 reactions (rectangles) labeled by the corresponding fluxes *v* = (*v*_{1}, *v*_{2}; *w*_{1}, *w*_{2}, *w*_{R})^{T}. The “enzymes” E1, E2 catalyze the formation of amino acids AA, and the ribosome R catalyzes the synthesis of the enzymes and the ribosome itself. **(b)** After scaling () and projection to the (scaled) synthesis fluxes , the growth cone becomes a growth polytope (a 3-dimensional simplex), generated by the (scaled and projected) EGMs *e*^{AA}, *e*^{E1}, *e*^{E2}, and *e*^{R}. Every EGM produces exactly one species as indicated by its name. **(c)** After projection, the MAC subsets of reactions are *m*_{1} = {*w*_{1}, *w*_{R}} and *m*_{2} = {*w*_{2}, *w*_{R}} (indicated by the corresponding fluxes). MAC sets represent minimal pathways: GMs with support *m*_{1} (in red) lie in the two-dimensional simplex generated by the EGMs *e*^{AA}, *e*^{E1}, and *e*^{R}. Analogously for GMs with support *m*_{2} (in blue). EGMs (in gray), lying on the axes, are not AC. **(d)** Enzyme capacity imposes additional *inequality* constraints, and the growth polytope gets restricted (depending on growth rate and *k*^{cat} values). Here, the growth polytope is generated by 6 (scaled and projected) EGVs. Thereby, only EGVs *e*^{1} and *e*^{2} have nonzero ribosome flux and hence are AC. **(e)** Alternatively, in (semi-)kinetic models, enzyme kinetics imposes additional *equality* constraints, and the growth polytope gets restricted (depending on growth rate and metabolite concentrations). Here, the growth polytope is generated by the (scaled and projected) EGVs *ε*^{1} and *ε*^{2} (and hence is 1-dimensional). The EGVs have nonzero ribosome flux and hence are AC. For a complete mathematical analysis of the minimal growth model, see Section D in S1 Text. An interactive version of panel (e) is available online at https://diana-sz.shinyapps.io/EGMs_EGVs/.

In the example, a cell is able to grow on alternative substrates. Both “enzymes” E1 and E2 can form the amino acids AA used by the ribosome R to synthesize the enzymes and the ribosome itself. Recall that the growth cone is the set of all stoichiometrically feasible fluxes that allow growth (*Nv* ≥ 0 instead of *Nv* = 0 in the classical setting). After scaling and projection to the synthesis fluxes (*w*_{1}, *w*_{2}, *w*_{R})^{T}, the growth cone becomes a growth polytope (a 3-dimensional simplex) generated by the (scaled and projected) EGMs *e*^{AA}, *e*^{E1}, *e*^{E2}, and *e*^{R}, see Fig 9a. This is in analogy to the classical setting, where EFMs generate the flux cone.

However, EGMs are not autocatalytic (AC). In the example, each EGM produces exactly one molecular species, but not all catalysts of active reactions are synthesized. For instance, the EGM *e*^{E1} produces E1 (in reaction *w*_{1}), but the ribosome R that catalyzes *w*_{1} is not produced. In technical terms, *e*^{E1} is basically catalytic (BC), since it contains an active catalytic reaction, but not catalytically closed (CC). Only if we combine *e*^{E1} with *e*^{R} (producing the ribosome in reaction *w*_{R}), the resulting flux becomes CC and hence AC. In fact, every (positive) combination of *e*^{E1}, *e*^{R} and *e*^{AA} is AC, and its support *m*_{1} = {*w*_{1}, *w*_{R}} is a minimally autocatalytic (MAC) set of reactions. Such fluxes represent minimal pathways and lie on a 2-dimensional facet of the growth polytope, see Fig 9c.

Additional enzyme capacity constraints restrict the growth polyhedron. In the example, the growth polytope is generated by 6 (scaled and projected) EGVs, see Fig 9(d). All EGVs fulfill the enzyme capacity constraints and hence are CC w.r.t. the enzymatic reactions. However, only EGVs *e*^{1} and *e*^{2} have nonzero ribosome flux and hence are fully AC. Still, every biologically meaningful flux (fulfilling the constraints) is a (nonnegative) combination of EGVs. Again, this is in analogy to the classical setting, where EFVs generate the flux polyhedron [8], determined by a fixed biomass composition, cf. Fig 8. However, in comprehensive models such as in this example, every element of the growth polytope corresponds to a different biomass composition, which allows to study the re-allocation of cellular resources.

In (semi-)kinetic models, enzyme kinetics (for fixed metabolite concentrations) further restricts the growth polytope. In the example, the EGVs *ε*^{1} and *ε*^{2} generate the growth polytope (a 1-dimensional simplex), see Fig 9(e). Given that enzyme kinetics and enzyme capacity constraints use the same *k*^{cat} values, the growth polytope for the kinetic model lies inside the polytope for the constraint-based model. Both EGVs have nonzero ribosome flux and hence are AC.

In the light of this discussion, we find that “elementary growth states” as introduced in de Groot *et al*. [12] are (projections of) EGVs in (semi-)kinetic models. Equivalence classes of their supports (called “elementary growth modes” in [12]) are MAC sets of reactions. In *our* definition, EGMs are the elementary vectors of the growth cone (in the sense of polyhedral geometry). They only depend on stoichiometry (but not on kinetics) and are defined for arbitrary models (including detailed synthesis reactions for macromolecules).

### Elementary vectors and computation

Whereas EFMs are *support-minimal* vectors of the flux cone (a special cone), already EFVs are *conformally non-decomposable* (and not support-minimal) vectors of a flux polyhedron. Also EGMs of the growth cone (a general polyhedral cone) and EGVs of a growth polyhedron are conformally non-decomposable vectors. Thus, conformal non-decomposability (and not support-minimality) is the key feature that defines elementary vectors in computational models of cellular growth, see also [8].

In terms of computation, EGM and EGV analysis (just like EFM and EFV analysis) suffer from a combinatorial explosion in the number of elementary vectors with the size of the network. As with EFMs and EFVs, one may focus on the enumeration of subsets of EGMs and EGVs or consider extra constraints [53–57] such as thermodynamic feasibility [58–61].

## Supporting information

### S1 Text.

Supporting information in four sections: **A. The dynamic model of cellular growth**. We derive the dynamic model of cellular growth. **B. Example: membrane constraints**. For the running example (the small model of a self-fabricating cell), we derive the membrane constraints. **C. Example: figures and tables**. For the running example, we provide supplementary figures and tables. **D. Minimal growth model with alternative pathways**. We consider a minimal growth model with alternative pathways and provide a complete mathematical analysis of both a constraint-based and a (semi-)kinetic model.

https://doi.org/10.1371/journal.pcbi.1009843.s001

(PDF)

## Acknowledgments

We thank Christoph Flamm, Daan de Groot, Bob Planqué, and Ralf Steuer for helpful discussions (on autocatalytic sets, elementary vectors, and constraint-based models, respectively) and three anonymous referees for helpful comments.

## References

- 1.
Klamt S, Hädicke O, Kamp Av. Stoichiometric and Constraint-Based Analysis of Biochemical Reaction Networks. In: Benner P, Findeisen R, Flockerzi D, Reichl U, Sundmacher K, editors. Large-Scale Networks in Engineering and Life Sciences. Modeling and Simulation in Science, Engineering and Technology. Springer International Publishing; 2014. p. 263–316.
- 2.
Lewis NE, Nagarajan H, Palsson BØ. Constraining the metabolic genotype-phenotype relationship using a phylogeny of
*in silico*methods. Nature Reviews Microbiology. 2012;10(4):291–305. pmid:22367118 - 3. Goelzer A, Fromion V. Bacterial growth rate reflects a bottleneck in resource allocation. Biochimica et Biophysica Acta (BBA)—General Subjects. 2011;1810(10):978–988. pmid:21689729
- 4. Schuster S, Hilgetag C. On elementary flux modes in biochemical reaction systems at steady state. J Biol Syst. 1994;2:165–182.
- 5. Schuster S, Hilgetag C, Woods JH, Fell DA. Reaction routes in biochemical reaction systems: algebraic properties, validated calculation procedure and example from nucleotide metabolism. J Math Biol. 2002;45(2):153–181. pmid:12181603
- 6. Zanghellini J, Ruckerbauer DE, Hanscho M, Jungreuthmayer C. Elementary flux modes in a nutshell: Properties, calculation and applications. Biotechnology Journal. 2013;8(9):1009–1016. pmid:23788432
- 7. Urbanczik R. Enumerating constrained elementary flux vectors of metabolic networks. IET Syst Biol. 2007;1(5):274–279. pmid:17907675
- 8. Klamt S, Regensburger G, Gerstl MP, Jungreuthmayer C, Schuster S, Mahadevan R, et al. From elementary flux modes to elementary flux vectors: Metabolic pathway analysis with arbitrary linear flux constraints. PLoS Comput Biol. 2017;13(4):e1005409. pmid:28406903
- 9. Edwards JS, Covert M, Palsson B. Metabolic modelling of microbes: the flux-balance approach. Environmental Microbiology. 2002;4(3):133–140. pmid:12000313
- 10. Müller S, Regensburger G, Steuer R. Enzyme allocation problems in kinetic metabolic networks: Optimal solutions are elementary flux modes. J Theoret Biol. 2014;347:182–190. pmid:24295962
- 11. Wortel MT, Peters H, Hulshof J, Teusink B, Bruggeman FJ. Metabolic states with maximal specific rate carry flux through an elementary flux mode. FEBS Journal. 2014;281(6):1547–1555. pmid:24460934
- 12. de Groot DH, Hulshof J, Teusink B, Bruggeman FJ, Planqué R. Elementary Growth Modes provide a molecular description of cellular self-fabrication. PLoS Comput Biol. 2020;16(1):e1007559. pmid:31986156
- 13. de Jong H, Casagranda S, Giordano N, Cinquemani E, Ropers D, Geiselmann J, et al. Mathematical modelling of microbes: metabolism, gene expression and growth. Journal of The Royal Society Interface. 2017;14(136):20170502.
- 14. Müller S, Regensburger G. Elementary vectors and conformal sums in polyhedral geometry and their relevance for metabolic pathway analysis. Front Genet. 2016;7(90):1–11. pmid:27252734
- 15. Barenholz U, Davidi D, Reznik E, Bar-On Y, Antonovsky N, Noor E, et al. Design principles of autocatalytic cycles constrain enzyme kinetics and force low substrate saturation at flux branch points. eLife. 2017;6. pmid:28169831
- 16. Blokhuis A, Lacoste D, Nghe P. Universal motifs and the diversity of autocatalytic systems. Proceedings of the National Academy of Sciences. 2020;117(41):25230–25236. pmid:32989134
- 17. Andersen JL, Flamm C, Merkle D, Stadler PF. Defining Autocatalysis in Chemical Reaction Networks. J Syst Chem. 2020;8:121–133.
- 18. Kauffman SA. Autocatalytic sets of proteins. Journal of Theoretical Biology. 1986;119(1):1–24.
- 19. Steel M. The emergence of a self-catalysing structure in abstract origin-of-life models. Applied Mathematics Letters. 2000;13(3):91–95.
- 20. Hordijk W, Steel M. Detecting autocatalytic, self-sustaining sets in chemical reaction systems. Journal of Theoretical Biology. 2004;227(4):451–461. pmid:15038982
- 21. Fontana W, Buss LW. “The arrival of the fittest”: Toward a theory of biological organization. Bulletin of Mathematical Biology. 1994;56(1):1–64.
- 22. Kaleta C, Centler F, Dittrich P. Analyzing Molecular Reaction Networks: From Pathways to Chemical Organizations. Molecular Biotechnology. 2006;34(2):117–124. pmid:17172657
- 23. Dittrich P, di Fenizio PS. Chemical Organisation Theory. Bulletin of Mathematical Biology. 2007;69(4):1199–1231. pmid:17415616
- 24.
Alberts B, Johnson A, Lewis J, Raff M, Roberts K, Walter P. The lipid bilayer. Garland Science; 2002.
- 25. Bar-Even A, Noor E, Savir Y, Liebermeister W, Davidi D, Tawfik DS, et al. The Moderately Efficient Enzyme: Evolutionary and Physicochemical Trends Shaping Enzyme Parameters. Biochemistry. 2011;50(21):4402–4410. pmid:21506553
- 26. Harris LK, Theriot JA. Surface Area to Volume Ratio: A Natural Variable for Bacterial Morphogenesis. Trends in Microbiology. 2018;26(10):815–832. pmid:29843923
- 27. Milo R, Jorgensen P, Moran U, Weber G, Springer M. BioNumbers—the database of key numbers in molecular and cell biology. Nucleic Acids Research. 2009;38(suppl_1):D750–D753. pmid:19854939
- 28. Caspi R, Billington R, Fulcher CA, Keseler IM, Kothari A, Krummenacker M, et al. The MetaCyc database of metabolic pathways and enzymes. Nucleic acids research. 2018;46(D1):D633–D639. pmid:29059334
- 29. Avis D, Fukuda K. A Pivoting Algorithm for Convex Hulls and Vertex Enumeration of Arrangements and Polyhedra. Discret Comput Geom. 1992;8:295–313.
- 30. Buchner BA, Zanghellini J. EFMlrs: a Python package for elementary flux mode enumeration via lexicographic reverse search. BMC bioinformatics. 2021;22(1):1–21. pmid:34758748
- 31. Terzer M, Stelling J. Large-scale computation of elementary flux modes with bit pattern trees. Bioinformatics. 2008;24(19):2229–2235. pmid:18676417
- 32. Calvey CH, Su YK, Willis LB, McGee M, Jeffries TW. Nitrogen limitation, oxygen limitation, and lipid accumulation in Lipomyces starkeyi. Bioresource technology. 2016;200:780–788. pmid:26580895
- 33. Zhang H, Wu C, Wu Q, Dai J, Song Y. Metabolic flux analysis of lipid biosynthesis in the yeast Yarrowia lipolytica using 13C-labled glucose and gas chromatography-mass spectrometry. PloS one. 2016;11(7):e0159187. pmid:27454589
- 34. Kolouchová I, Mat’átková O, Sigler K, Masák J, Řezanka T. Lipid accumulation by oleaginous and non-oleaginous yeast strains in nitrogen and phosphate limitation. Folia microbiologica. 2016;61(5):431–438. pmid:26931336
- 35. Griffiths MJ, van Hille RP, Harrison ST. The effect of nitrogen limitation on lipid productivity and cell composition in Chlorella vulgaris. Applied microbiology and biotechnology. 2014;98(5):2345–2356. pmid:24413971
- 36. Pancha I, Chokshi K, George B, Ghosh T, Paliwal C, Maurya R, et al. Nitrogen stress triggered biochemical and morphological changes in the microalgae Scenedesmus sp. CCNM 1077. Bioresource technology. 2014;156:146–154. pmid:24495540
- 37. Jiang Y, Yoshida T, Quigg A. Photosynthetic performance, lipid production and biomass composition in response to nitrogen limitation in marine microalgae. Plant Physiology and Biochemistry. 2012;54:70–77. pmid:22387274
- 38. Pruvost J, Van Vooren G, Le Gouic B, Couzinet-Mossion A, Legrand J. Systematic investigation of biomass and lipid productivity by microalgae in photobioreactors for biodiesel application. Bioresource technology. 2011;102(1):150–158. pmid:20675127
- 39. Xiong W, Liu L, Wu C, Yang C, Wu Q. 13C-Tracer and Gas Chromatography-Mass Spectrometry Analyses Reveal Metabolic Flux Distribution in the Oleaginous Microalga Chlorella protothecoides. Plant Physiology. 2010;154(2):1001–1011. pmid:20720172
- 40. Hsieh CH, Wu WT. Cultivation of microalgae for oil production with a cultivation strategy of urea limitation. Bioresource technology. 2009;100(17):3921–3926. pmid:19362823
- 41. Lu H, Chen H, Tang X, Yang Q, Zhang H, Chen YQ, et al. Time-resolved multi-omics analysis reveals the role of nutrient stress-induced resource reallocation for TAG accumulation in oleaginous fungus Mortierella alpina. Biotechnology for biofuels. 2020;13(1):1–17. pmid:32625246
- 42. Pleissner D, Eriksen NT. Effects of phosphorous, nitrogen, and carbon limitation on biomass composition in batch and continuous flow cultures of the heterotrophic dinoflagellate Crypthecodinium cohnii. Biotechnology and bioengineering. 2012;109(8):2005–2016. pmid:22354618
- 43. Kamp A, Høgslund S, Risgaard-Petersen N, Stief P. Nitrate storage and dissimilatory nitrate reduction by eukaryotic microbes. Frontiers in microbiology. 2015;6:1492. pmid:26734001
- 44. O’Brien EJ, Utrilla J, Palsson BO. Quantification and classification of E. coli proteome utilization and unused protein costs across environments. PLoS computational biology. 2016;12(6):e1004998. pmid:27351952
- 45. Yang L, Yurkovich JT, Lloyd CJ, Ebrahim A, Saunders MA, Palsson BO. Principles of proteome allocation are revealed using proteomic data and genome-scale models. Scientific reports. 2016;6(1):1–8. pmid:27857205
- 46. Scott M, Gunderson CW, Mateescu EM, Zhang Z, Hwa T. Interdependence of cell growth and gene expression: origins and consequences. Science. 2010;330(6007):1099–1102. pmid:21097934
- 47. Scott M, Klumpp S, Mateescu EM, Hwa T. Emergence of robust growth laws from optimal regulation of ribosome synthesis. Mol Syst Biol. 2014;10:747. pmid:25149558
- 48.
Rockafellar RT. The elementary vectors of a subspace of
*R*^{N}. In: Combinatorial Mathematics and its Applications (Proc. Conf., Univ. North Carolina, Chapel Hill, N.C., 1967). Chapel Hill, N.C.: Univ. North Carolina Press; 1969. p. 104–127. - 49. Széliová D, Ruckerbauer DE, Galleguillos SN, Petersen LB, Natter K, Hanscho M, et al. What CHO is made of: Variations in the biomass composition of Chinese hamster ovary cell lines. Metabolic engineering. 2020;61:288–300. pmid:32619503
- 50.
Tomàs-Gamisans M, Ferrer P, Albiol J. Fine-tuning the
*P. pastoris*iMT1026 genome-scale metabolic model for improved prediction of growth on methanol or glycerol as sole carbon sources. Microbial biotechnology. 2018;11(1):224–237. pmid:29160039 - 51.
Nookaew I, Jewett MC, Meechai A, Thammarongtham C, Laoteng K, Cheevadhanarak S, et al. The genome-scale metabolic model iIN800 of
*Saccharomyces cerevisiae*and its validation: a scaffold to query lipid metabolism. BMC systems biology. 2008;2(1):1–15. pmid:18687109 - 52. Müller S. Elementary growth modes/vectors and minimal autocatalytic sets for kinetic/constraint-based models of cellular growth. Biorxiv. 2021;.
- 53. De Figueiredo LF, Podhorski A, Rubio A, Kaleta C, Beasley JE, Schuster S, et al. Computing the shortest elementary flux modes in genome-scale metabolic networks. Bioinformatics. 2009;25(23):3158–3165. pmid:19793869
- 54.
Kaleta C, Figueiredo LFd, Behre J, Schuster S. EFMEvolver: Computing elementary flux modes in genome-scale metabolic networks. In: German conference on bioinformatics 2009. Gesellschaft für Informatik eV; 2009. p. 179–189.
- 55. Jungreuthmayer C, Ruckerbauer DE, Zanghellini J. regEfmtool: Speeding up elementary flux mode calculation using transcriptional regulatory rules in the form of three-state logic. Biosystems. 2013;113(1):37–39. pmid:23664840
- 56. Pey J, Planes FJ. Direct calculation of elementary flux modes satisfying several biological constraints in genome-scale metabolic networks. Bioinformatics. 2014;30(15):2197–2203. pmid:24728852
- 57. Jungreuthmayer C, Ruckerbauer DE, Gerstl MP, Hanscho M, Zanghellini J. Avoiding the enumeration of infeasible elementary flux modes by including transcriptional regulatory rules in the enumeration process saves computational costs. PloS one. 2015;10(6):e0129840. pmid:26091045
- 58. Jol SJ, Kümmel A, Terzer M, Stelling J, Heinemann M. System-level insights into yeast metabolism by thermodynamic analysis of elementary flux modes. PLoS computational biology. 2012;8(3):e1002415. pmid:22416224
- 59. Gerstl MP, Ruckerbauer DE, Mattanovich D, Jungreuthmayer C, Zanghellini J. Metabolomics integrated elementary flux mode analysis in large metabolic networks. Scientific reports. 2015;5(1):1–8. pmid:25754258
- 60. Gerstl MP, Jungreuthmayer C, Zanghellini J. tEFMA: computing thermodynamically feasible elementary flux modes in metabolic networks. Bioinformatics. 2015;31(13):2232–2234. pmid:25701571
- 61. Gerstl MP, Jungreuthmayer C, Müller S, Zanghellini J. Which sets of elementary flux modes form thermodynamically feasible flux distributions? The FEBS Journal. 2016;283(9):1782–1794. pmid:26940826